PP307_ARATH
ID PP307_ARATH Reviewed; 1064 AA.
AC Q9SVP7;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 10-FEB-2009, sequence version 2.
DT 03-AUG-2022, entry version 115.
DE RecName: Full=Pentatricopeptide repeat-containing protein At4g13650;
GN Name=PCMP-H42; OrderedLocusNames=At4g13650; ORFNames=F18A5.40;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 894-1064.
RC STRAIN=cv. Columbia;
RX PubMed=14993207; DOI=10.1101/gr.1515604;
RA Castelli V., Aury J.-M., Jaillon O., Wincker P., Clepet C., Menard M.,
RA Cruaud C., Quetier F., Scarpelli C., Schaechter V., Temple G., Caboche M.,
RA Weissenbach J., Salanoubat M.;
RT "Whole genome sequence comparisons and 'full-length' cDNA sequences: a
RT combined approach to evaluate and improve Arabidopsis genome annotation.";
RL Genome Res. 14:406-413(2004).
RN [4]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAB36829.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB78407.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL035528; CAB36829.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161537; CAB78407.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE83309.1; -; Genomic_DNA.
DR EMBL; BX827423; -; NOT_ANNOTATED_CDS; mRNA.
DR PIR; T05234; T05234.
DR RefSeq; NP_193101.2; NM_117439.3.
DR AlphaFoldDB; Q9SVP7; -.
DR SMR; Q9SVP7; -.
DR STRING; 3702.AT4G13650.1; -.
DR MetOSite; Q9SVP7; -.
DR PaxDb; Q9SVP7; -.
DR PRIDE; Q9SVP7; -.
DR ProteomicsDB; 248979; -.
DR EnsemblPlants; AT4G13650.1; AT4G13650.1; AT4G13650.
DR GeneID; 826999; -.
DR Gramene; AT4G13650.1; AT4G13650.1; AT4G13650.
DR KEGG; ath:AT4G13650; -.
DR Araport; AT4G13650; -.
DR TAIR; locus:2119440; AT4G13650.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_0_1; -.
DR InParanoid; Q9SVP7; -.
DR OMA; TIHAFFV; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9SVP7; -.
DR PRO; PR:Q9SVP7; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q9SVP7; baseline and differential.
DR Genevisible; Q9SVP7; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 6.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 7.
DR Pfam; PF13041; PPR_2; 3.
DR Pfam; PF13812; PPR_3; 1.
DR TIGRFAMs; TIGR00756; PPR; 5.
DR PROSITE; PS51375; PPR; 21.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..1064
FT /note="Pentatricopeptide repeat-containing protein
FT At4g13650"
FT /id="PRO_0000363425"
FT REPEAT 83..118
FT /note="PPR 1"
FT REPEAT 119..149
FT /note="PPR 2"
FT REPEAT 150..184
FT /note="PPR 3"
FT REPEAT 185..220
FT /note="PPR 4"
FT REPEAT 221..251
FT /note="PPR 5"
FT REPEAT 252..286
FT /note="PPR 6"
FT REPEAT 287..321
FT /note="PPR 7"
FT REPEAT 322..352
FT /note="PPR 8"
FT REPEAT 353..387
FT /note="PPR 9"
FT REPEAT 388..422
FT /note="PPR 10"
FT REPEAT 423..453
FT /note="PPR 11"
FT REPEAT 454..488
FT /note="PPR 12"
FT REPEAT 489..523
FT /note="PPR 13"
FT REPEAT 524..554
FT /note="PPR 14"
FT REPEAT 555..589
FT /note="PPR 15"
FT REPEAT 590..624
FT /note="PPR 16"
FT REPEAT 625..655
FT /note="PPR 17"
FT REPEAT 656..690
FT /note="PPR 18"
FT REPEAT 691..725
FT /note="PPR 19"
FT REPEAT 726..756
FT /note="PPR 20"
FT REPEAT 757..791
FT /note="PPR 21"
FT REPEAT 792..827
FT /note="PPR 22"
FT REPEAT 828..858
FT /note="PPR 23"
FT REGION 863..938
FT /note="Type E motif"
FT REGION 939..969
FT /note="Type E(+) motif"
FT REGION 970..1064
FT /note="Type DYW motif"
FT CONFLICT 1015
FT /note="I -> M (in Ref. 3)"
FT /evidence="ECO:0000305"
FT CONFLICT 1025
FT /note="N -> D (in Ref. 3)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1064 AA; 119740 MW; 87E47DF4376C9243 CRC64;
MNKYIWLVRL WHSKEEPMFL RSVSSSFIFI HGVPRKLKTR TVFPTLCGTR RASFAAISVY
ISEDESFQEK RIDSVENRGI RPNHQTLKWL LEGCLKTNGS LDEGRKLHSQ ILKLGLDSNG
CLSEKLFDFY LFKGDLYGAF KVFDEMPERT IFTWNKMIKE LASRNLIGEV FGLFVRMVSE
NVTPNEGTFS GVLEACRGGS VAFDVVEQIH ARILYQGLRD STVVCNPLID LYSRNGFVDL
ARRVFDGLRL KDHSSWVAMI SGLSKNECEA EAIRLFCDMY VLGIMPTPYA FSSVLSACKK
IESLEIGEQL HGLVLKLGFS SDTYVCNALV SLYFHLGNLI SAEHIFSNMS QRDAVTYNTL
INGLSQCGYG EKAMELFKRM HLDGLEPDSN TLASLVVACS ADGTLFRGQQ LHAYTTKLGF
ASNNKIEGAL LNLYAKCADI ETALDYFLET EVENVVLWNV MLVAYGLLDD LRNSFRIFRQ
MQIEEIVPNQ YTYPSILKTC IRLGDLELGE QIHSQIIKTN FQLNAYVCSV LIDMYAKLGK
LDTAWDILIR FAGKDVVSWT TMIAGYTQYN FDDKALTTFR QMLDRGIRSD EVGLTNAVSA
CAGLQALKEG QQIHAQACVS GFSSDLPFQN ALVTLYSRCG KIEESYLAFE QTEAGDNIAW
NALVSGFQQS GNNEEALRVF VRMNREGIDN NNFTFGSAVK AASETANMKQ GKQVHAVITK
TGYDSETEVC NALISMYAKC GSISDAEKQF LEVSTKNEVS WNAIINAYSK HGFGSEALDS
FDQMIHSNVR PNHVTLVGVL SACSHIGLVD KGIAYFESMN SEYGLSPKPE HYVCVVDMLT
RAGLLSRAKE FIQEMPIKPD ALVWRTLLSA CVVHKNMEIG EFAAHHLLEL EPEDSATYVL
LSNLYAVSKK WDARDLTRQK MKEKGVKKEP GQSWIEVKNS IHSFYVGDQN HPLADEIHEY
FQDLTKRASE IGYVQDCFSL LNELQHEQKD PIIFIHSEKL AISFGLLSLP ATVPINVMKN
LRVCNDCHAW IKFVSKVSNR EIIVRDAYRF HHFEGGACSC KDYW