PP339_ARATH
ID PP339_ARATH Reviewed; 514 AA.
AC Q9SZ20;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 10-FEB-2009, sequence version 2.
DT 25-MAY-2022, entry version 112.
DE RecName: Full=Pentatricopeptide repeat-containing protein At4g26800;
GN OrderedLocusNames=At4g26800; ORFNames=F10M23.140;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC STRAIN=cv. Columbia;
RX PubMed=14993207; DOI=10.1101/gr.1515604;
RA Castelli V., Aury J.-M., Jaillon O., Wincker P., Clepet C., Menard M.,
RA Cruaud C., Quetier F., Scarpelli C., Schaechter V., Temple G., Caboche M.,
RA Weissenbach J., Salanoubat M.;
RT "Whole genome sequence comparisons and 'full-length' cDNA sequences: a
RT combined approach to evaluate and improve Arabidopsis genome annotation.";
RL Genome Res. 14:406-413(2004).
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q9SZ20-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9SZ20-2; Sequence=VSP_036305, VSP_036306;
CC -!- MISCELLANEOUS: [Isoform 1]: May be due to an intron retention.
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BX838540; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC Sequence=CAB36526.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC Sequence=CAB79535.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL035440; CAB36526.1; ALT_INIT; Genomic_DNA.
DR EMBL; AL161565; CAB79535.1; ALT_INIT; Genomic_DNA.
DR EMBL; CP002687; AEE85254.1; -; Genomic_DNA.
DR EMBL; CP002687; ANM67332.1; -; Genomic_DNA.
DR EMBL; BX838540; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BX828174; -; NOT_ANNOTATED_CDS; mRNA.
DR PIR; T04803; T04803.
DR RefSeq; NP_001329165.1; NM_001341831.1.
DR RefSeq; NP_001329166.1; NM_001341830.1. [Q9SZ20-1]
DR RefSeq; NP_194410.2; NM_118814.3. [Q9SZ20-2]
DR AlphaFoldDB; Q9SZ20; -.
DR SMR; Q9SZ20; -.
DR PaxDb; Q9SZ20; -.
DR PRIDE; Q9SZ20; -.
DR ProteomicsDB; 249239; -. [Q9SZ20-1]
DR EnsemblPlants; AT4G26800.1; AT4G26800.1; AT4G26800. [Q9SZ20-2]
DR EnsemblPlants; AT4G26800.2; AT4G26800.2; AT4G26800. [Q9SZ20-1]
DR GeneID; 828787; -.
DR Gramene; AT4G26800.1; AT4G26800.1; AT4G26800. [Q9SZ20-2]
DR Gramene; AT4G26800.2; AT4G26800.2; AT4G26800. [Q9SZ20-1]
DR KEGG; ath:AT4G26800; -.
DR Araport; AT4G26800; -.
DR TAIR; locus:2116292; AT4G26800.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_49_0_1; -.
DR InParanoid; Q9SZ20; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9SZ20; -.
DR PRO; PR:Q9SZ20; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q9SZ20; baseline and differential.
DR Genevisible; Q9SZ20; AT.
DR Gene3D; 1.25.40.10; -; 5.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF01535; PPR; 1.
DR Pfam; PF12854; PPR_1; 1.
DR Pfam; PF13041; PPR_2; 4.
DR TIGRFAMs; TIGR00756; PPR; 10.
DR PROSITE; PS51375; PPR; 12.
PE 2: Evidence at transcript level;
KW Alternative splicing; Reference proteome; Repeat.
FT CHAIN 1..514
FT /note="Pentatricopeptide repeat-containing protein
FT At4g26800"
FT /id="PRO_0000363456"
FT REPEAT 122..156
FT /note="PPR 1"
FT REPEAT 157..191
FT /note="PPR 2"
FT REPEAT 192..226
FT /note="PPR 3"
FT REPEAT 227..261
FT /note="PPR 4"
FT REPEAT 262..296
FT /note="PPR 5"
FT REPEAT 297..331
FT /note="PPR 6"
FT REPEAT 332..366
FT /note="PPR 7"
FT REPEAT 367..401
FT /note="PPR 8"
FT REPEAT 402..436
FT /note="PPR 9"
FT REPEAT 437..471
FT /note="PPR 10"
FT REPEAT 472..510
FT /note="PPR 11"
FT VAR_SEQ 1..145
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14993207"
FT /id="VSP_036305"
FT VAR_SEQ 146..147
FT /note="LG -> ML (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14993207"
FT /id="VSP_036306"
SQ SEQUENCE 514 AA; 57841 MW; F4892DCA91ECE25A CRC64;
MRWSIATAIA STAKGFLHLH HHFLKNSNPG IVLSPSLRFR FWVRAFSGTT IDYREVLRSG
LHNIKFDDAF HLFVLMAYSY PLPSIVEFNK VLTAIAKMQM YDVVINLWKR IENAEGIEIS
PDLYTCNILV NCFCRCFQPS SALSYLGKMM KLGIEPDIVT ASSLVNGFCL SNSIKDAVYV
AGQMEKMGIK RDVVVDTILI DTLCKNRLVV PALEVLKRMK DRGISPNVVT YSSLITGLCK
SGRLADAERR LHEMDSKKIN PNVITFSALI DAYAKRGKLS KVDSVYKMMI QMSIDPNVFT
YSSLIYGLCM HNRVDEAIKM LDLMISKGCT PNVVTYSTLA NGFFKSSRVD DGIKLLDDMP
QRGVAANTVS CNTLIKGYFQ AGKIDLALGV FGYMTSNGLI PNIRSYNIVL AGLFANGEVE
KALSRFEHMQ KTRNDLDIIT YTIMIHGMCK ACMVKEAYDL FYKLKFKRVE PDFKAYTIMI
AELNRAGMRT EADALNRFYQ KHVRQNESAP AEVS
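
The two VAR_SEQ features above (VSP_036305, VSP_036306) describe how isoform 2 is derived from the displayed sequence: residues 1..145 are missing and residues 146..147 change from LG to ML. The following is a minimal sketch of that derivation, not part of the UniProt record. The names `apply_var_seq`, `Edit`, and `PREFIX` are hypothetical, and only the first 180 residues of the SQ block are copied in, since both edits fall inside that range.

```python
from typing import List, Tuple

# One VAR_SEQ edit: 1-based inclusive start and end, plus the replacement
# residues (an empty string when the feature note says "Missing").
Edit = Tuple[int, int, str]


def apply_var_seq(canonical: str, edits: List[Edit]) -> str:
    """Apply VAR_SEQ-style edits to a canonical sequence (hypothetical helper)."""
    # Drop the spaces and line breaks used for layout in the SQ block.
    residues = "".join(canonical.split())
    # Work from the C-terminal end backwards so that the coordinates of
    # edits closer to the N-terminus remain valid after each change.
    for start, end, replacement in sorted(edits, reverse=True):
        if not 1 <= start <= end <= len(residues):
            raise ValueError(f"range {start}..{end} is outside the sequence")
        residues = residues[: start - 1] + replacement + residues[end:]
    return residues


# Residues 1..180 of the displayed sequence, copied from the SQ block.
PREFIX = "".join("""
MRWSIATAIA STAKGFLHLH HHFLKNSNPG IVLSPSLRFR FWVRAFSGTT IDYREVLRSG
LHNIKFDDAF HLFVLMAYSY PLPSIVEFNK VLTAIAKMQM YDVVINLWKR IENAEGIEIS
PDLYTCNILV NCFCRCFQPS SALSYLGKMM KLGIEPDIVT ASSLVNGFCL SNSIKDAVYV
""".split())

# Edits taken from the FT VAR_SEQ lines of this entry.
ISOFORM_2_EDITS: List[Edit] = [
    (1, 145, ""),      # VSP_036305: Missing (in isoform 2)
    (146, 147, "ML"),  # VSP_036306: LG -> ML (in isoform 2)
]

isoform2_prefix = apply_var_seq(PREFIX, ISOFORM_2_EDITS)
print(isoform2_prefix)
```

Applied to the full 514-residue sequence, the same two edits would give an isoform of 514 - 145 = 369 residues, because the second edit replaces two residues with two residues. Sorting the edits in descending order is a design choice that avoids recomputing coordinates after the N-terminal deletion.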