ID PP224_ARATH Reviewed; 694 AA.
AC Q9LTV8; Q56Y62;
DT 16-DEC-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2000, sequence version 1.
DT 03-AUG-2022, entry version 107.
DE RecName: Full=Pentatricopeptide repeat-containing protein At3g12770;
GN Name=PCMP-H43; OrderedLocusNames=At3g12770; ORFNames=MBK21.13;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10819329; DOI=10.1093/dnares/7.2.131;
RA Sato S., Nakamura Y., Kaneko T., Katoh T., Asamizu E., Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 3. I. Sequence
RT features of the regions of 4,504,864 bp covered by sixty P1 and TAC
RT clones.";
RL DNA Res. 7:131-135(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB024033; BAB02421.1; -; Genomic_DNA.
DR EMBL; CP002686; AEE75244.1; -; Genomic_DNA.
DR EMBL; AK221461; BAD94552.1; -; mRNA.
DR RefSeq; NP_187883.2; NM_112113.4.
DR AlphaFoldDB; Q9LTV8; -.
DR SMR; Q9LTV8; -.
DR IntAct; Q9LTV8; 1.
DR STRING; 3702.AT3G12770.1; -.
DR PaxDb; Q9LTV8; -.
DR PRIDE; Q9LTV8; -.
DR ProteomicsDB; 249173; -.
DR EnsemblPlants; AT3G12770.1; AT3G12770.1; AT3G12770.
DR GeneID; 820459; -.
DR Gramene; AT3G12770.1; AT3G12770.1; AT3G12770.
DR KEGG; ath:AT3G12770; -.
DR Araport; AT3G12770; -.
DR TAIR; locus:2087735; AT3G12770.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_8_1; -.
DR InParanoid; Q9LTV8; -.
DR OMA; NCHAATK; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9LTV8; -.
DR PRO; PR:Q9LTV8; -.
DR Proteomes; UP000006548; Chromosome 3.
DR ExpressionAtlas; Q9LTV8; baseline and differential.
DR Genevisible; Q9LTV8; AT.
DR GO; GO:0005739; C:mitochondrion; IEA:GOC.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0080156; P:mitochondrial mRNA modification; IMP:TAIR.
DR Gene3D; 1.25.40.10; -; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF13041; PPR_2; 4.
DR SUPFAM; SSF48452; SSF48452; 1.
DR TIGRFAMs; TIGR00756; PPR; 4.
DR PROSITE; PS51375; PPR; 13.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..694
FT /note="Pentatricopeptide repeat-containing protein
FT At3g12770"
FT /id="PRO_0000356083"
FT REPEAT 52..82
FT /note="PPR 1"
FT REPEAT 83..117
FT /note="PPR 2"
FT REPEAT 118..152
FT /note="PPR 3"
FT REPEAT 153..183
FT /note="PPR 4"
FT REPEAT 186..220
FT /note="PPR 5"
FT REPEAT 221..255
FT /note="PPR 6"
FT REPEAT 256..286
FT /note="PPR 7"
FT REPEAT 287..321
FT /note="PPR 8"
FT REPEAT 322..356
FT /note="PPR 9"
FT REPEAT 357..387
FT /note="PPR 10"
FT REPEAT 388..422
FT /note="PPR 11"
FT REPEAT 423..457
FT /note="PPR 12"
FT REPEAT 458..488
FT /note="PPR 13"
FT REGION 493..568
FT /note="Type E motif"
FT REGION 569..599
FT /note="Type E(+) motif"
FT REGION 600..694
FT /note="Type DYW motif"
FT CONFLICT 212
FT /note="Q -> H (in Ref. 3; BAD94552)"
FT /evidence="ECO:0000305"
FT CONFLICT 449
FT /note="R -> L (in Ref. 3; BAD94552)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 694 AA; 77936 MW; BA30FE1D78AF163D CRC64;
MSEASCLASP LLYTNSGIHS DSFYASLIDS ATHKAQLKQI HARLLVLGLQ FSGFLITKLI
HASSSFGDIT FARQVFDDLP RPQIFPWNAI IRGYSRNNHF QDALLMYSNM QLARVSPDSF
TFPHLLKACS GLSHLQMGRF VHAQVFRLGF DADVFVQNGL IALYAKCRRL GSARTVFEGL
PLPERTIVSW TAIVSAYAQN GEPMEALEIF SQMRKMDVKP DWVALVSVLN AFTCLQDLKQ
GRSIHASVVK MGLEIEPDLL ISLNTMYAKC GQVATAKILF DKMKSPNLIL WNAMISGYAK
NGYAREAIDM FHEMINKDVR PDTISITSAI SACAQVGSLE QARSMYEYVG RSDYRDDVFI
SSALIDMFAK CGSVEGARLV FDRTLDRDVV VWSAMIVGYG LHGRAREAIS LYRAMERGGV
HPNDVTFLGL LMACNHSGMV REGWWFFNRM ADHKINPQQQ HYACVIDLLG RAGHLDQAYE
VIKCMPVQPG VTVWGALLSA CKKHRHVELG EYAAQQLFSI DPSNTGHYVQ LSNLYAAARL
WDRVAEVRVR MKEKGLNKDV GCSWVEVRGR LEAFRVGDKS HPRYEEIERQ VEWIESRLKE
GGFVANKDAS LHDLNDEEAE ETLCSHSERI AIAYGLISTP QGTPLRITKN LRACVNCHAA
TKLISKLVDR EIVVRDTNRF HHFKDGVCSC GDYW