PP330_ARATH
ID PP330_ARATH Reviewed; 595 AA.
AC A8MQA3; Q9SUA7;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 10-FEB-2009, sequence version 2.
DT 03-AUG-2022, entry version 87.
DE RecName: Full=Pentatricopeptide repeat-containing protein At4g21065;
GN Name=PCMP-H28; OrderedLocusNames=At4g21065; ORFNames=T13K14.230;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC STRAIN=cv. Columbia;
RX PubMed=14993207; DOI=10.1101/gr.1515604;
RA Castelli V., Aury J.-M., Jaillon O., Wincker P., Clepet C., Menard M.,
RA Cruaud C., Quetier F., Scarpelli C., Schaechter V., Temple G., Caboche M.,
RA Weissenbach J., Salanoubat M.;
RT "Whole genome sequence comparisons and 'full-length' cDNA sequences: a
RT combined approach to evaluate and improve Arabidopsis genome annotation.";
RL Genome Res. 14:406-413(2004).
RN [4]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=A8MQA3-1; Sequence=Displayed;
CC Name=2;
CC IsoId=A8MQA3-2; Sequence=VSP_036304;
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BX826462; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC Sequence=BX827021; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC Sequence=CAB45902.1; Type=Erroneous gene model prediction; Note=The predicted gene has been split into 2 genes: At4g21065 and At4g21070.; Evidence={ECO:0000305};
CC Sequence=CAB79107.1; Type=Erroneous gene model prediction; Note=The predicted gene has been split into 2 genes: At4g21065 and At4g21070.; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL080282; CAB45902.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161554; CAB79107.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE84394.1; -; Genomic_DNA.
DR EMBL; CP002687; AEE84395.1; -; Genomic_DNA.
DR EMBL; BX826462; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BX827021; -; NOT_ANNOTATED_CDS; mRNA.
DR PIR; A85240; A85240.
DR PIR; T10649; T10649.
DR RefSeq; NP_001078414.1; NM_001084945.2. [A8MQA3-1]
DR RefSeq; NP_001078415.1; NM_001084946.1. [A8MQA3-2]
DR AlphaFoldDB; A8MQA3; -.
DR SMR; A8MQA3; -.
DR iPTMnet; A8MQA3; -.
DR PaxDb; A8MQA3; -.
DR PRIDE; A8MQA3; -.
DR ProteomicsDB; 249230; -. [A8MQA3-1]
DR EnsemblPlants; AT4G21065.1; AT4G21065.1; AT4G21065. [A8MQA3-1]
DR EnsemblPlants; AT4G21065.2; AT4G21065.2; AT4G21065. [A8MQA3-2]
DR GeneID; 5008150; -.
DR Gramene; AT4G21065.1; AT4G21065.1; AT4G21065. [A8MQA3-1]
DR Gramene; AT4G21065.2; AT4G21065.2; AT4G21065. [A8MQA3-2]
DR KEGG; ath:AT4G21065; -.
DR Araport; AT4G21065; -.
DR TAIR; locus:4010713895; AT4G21065.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_2_1; -.
DR OMA; EPKHCGD; -.
DR PhylomeDB; A8MQA3; -.
DR PRO; PR:A8MQA3; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; A8MQA3; baseline and differential.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF13041; PPR_2; 2.
DR TIGRFAMs; TIGR00756; PPR; 5.
DR PROSITE; PS51375; PPR; 10.
PE 2: Evidence at transcript level;
KW Alternative splicing; Reference proteome; Repeat.
FT CHAIN 1..595
FT /note="Pentatricopeptide repeat-containing protein
FT At4g21065"
FT /id="PRO_0000363447"
FT REPEAT 84..118
FT /note="PPR 1"
FT REPEAT 120..154
FT /note="PPR 2"
FT REPEAT 155..185
FT /note="PPR 3"
FT REPEAT 186..220
FT /note="PPR 4"
FT REPEAT 221..255
FT /note="PPR 5"
FT REPEAT 256..290
FT /note="PPR 6"
FT REPEAT 291..317
FT /note="PPR 7"
FT REPEAT 323..353
FT /note="PPR 8"
FT REPEAT 359..389
FT /note="PPR 9"
FT REGION 394..469
FT /note="Type E motif"
FT REGION 470..500
FT /note="Type E(+) motif"
FT REGION 501..595
FT /note="Type DYW motif"
FT VAR_SEQ 1..133
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14993207"
FT /id="VSP_036304"
SQ SEQUENCE 595 AA; 67101 MW; 951053074B1384CF CRC64;
MSPFSETSVL LLPMVEKCIN LLQTYGVSSI TKLRQIHAFS IRHGVSISDA ELGKHLIFYL
VSLPSPPPMS YAHKVFSKIE KPINVFIWNT LIRGYAEIGN SISAFSLYRE MRVSGLVEPD
THTYPFLIKA VTTMADVRLG ETIHSVVIRS GFGSLIYVQN SLLHLYANCG DVASAYKVFD
KMPEKDLVAW NSVINGFAEN GKPEEALALY TEMNSKGIKP DGFTIVSLLS ACAKIGALTL
GKRVHVYMIK VGLTRNLHSS NVLLDLYARC GRVEEAKTLF DEMVDKNSVS WTSLIVGLAV
NGFGKEAIEL FKYMESTEGL LPCEITFVGI LYACSHCGMV KEGFEYFRRM REEYKIEPRI
EHFGCMVDLL ARAGQVKKAY EYIKSMPMQP NVVIWRTLLG ACTVHGDSDL AEFARIQILQ
LEPNHSGDYV LLSNMYASEQ RWSDVQKIRK QMLRDGVKKV PGHSLVEVGN RVHEFLMGDK
SHPQSDAIYA KLKEMTGRLR SEGYVPQISN VYVDVEEEEK ENAVVYHSEK IAIAFMLIST
PERSPITVVK NLRVCADCHL AIKLVSKVYN REIVVRDRSR FHHFKNGSCS CQDYW