PP154_ARATH
ID PP154_ARATH Reviewed; 849 AA.
AC Q9XIL5; Q0WM50; Q93ZT4;
DT 16-DEC-2008, integrated into UniProtKB/Swiss-Prot.
DT 16-DEC-2008, sequence version 3.
DT 25-MAY-2022, entry version 123.
DE RecName: Full=Pentatricopeptide repeat-containing protein At2g15820, chloroplastic;
DE AltName: Full=Protein ORGANELLE TRANSCRIPT PROCESSING 51 {ECO:0000303|PubMed:18557832};
DE Short=AtOTP51 {ECO:0000303|PubMed:18557832};
DE Flags: Precursor;
GN Name=OTP51 {ECO:0000303|PubMed:18557832}; OrderedLocusNames=At2g15820;
GN ORFNames=F19G14.18;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617197; DOI=10.1038/45471;
RA Lin X., Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town C.D.,
RA Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V.,
RA Buell C.R., Ketchum K.A., Lee J.J., Ronning C.M., Koo H.L., Moffat K.S.,
RA Cronin L.A., Shen M., Pai G., Van Aken S., Umayam L., Tallon L.J.,
RA Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., Goodman H.M.,
RA Somerville C.R., Copenhaver G.P., Preuss D., Nierman W.C., White O.,
RA Eisen J.A., Salzberg S.L., Fraser C.M., Venter J.C.;
RT "Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.";
RL Nature 402:761-768(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 294-849.
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 294-849.
RC STRAIN=cv. Columbia;
RA Cheuk R.F., Chen H., Kim C.J., Shinn P., Ecker J.R.;
RT "Arabidopsis ORF clones.";
RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 355-849.
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
RN [7]
RP FUNCTION, AND DISRUPTION PHENOTYPE.
RX PubMed=18557832; DOI=10.1111/j.1365-313x.2008.03581.x;
RA de Longevialle A.F., Hendrickson L., Taylor N.L., Delannoy E., Lurin C.,
RA Badger M., Millar A.H., Small I.;
RT "The pentatricopeptide repeat gene OTP51 with two LAGLIDADG motifs is
RT required for the cis-splicing of plastid ycf3 intron 2 in Arabidopsis
RT thaliana.";
RL Plant J. 56:157-168(2008).
CC -!- FUNCTION: Promotes the splicing of group II introns in chloroplasts.
CC Required for the splicing of intron 2 of plastid ycf3 transcripts, a
CC factor required for the assembly of photosystem I (PSI). Involved in
CC the splicing of several other group-IIa introns. May be involved in the
CC splicing of precursor forms of trnL, trnG, trnI, and trnA. Required for
CC the assembly of PSI and PSII. {ECO:0000269|PubMed:18557832}.
CC -!- SUBCELLULAR LOCATION: Plastid, chloroplast {ECO:0000255}.
CC -!- DISRUPTION PHENOTYPE: Can grow only in vitro on sucrose-containing
CC medium under low light conditions. Mutant plants show a pale-green
CC phenotype, very slow growth and delayed development.
CC {ECO:0000269|PubMed:18557832}.
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAD41982.2; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=AAL07123.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=AAW80859.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC006438; AAD41982.2; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002685; AEC06439.1; -; Genomic_DNA.
DR EMBL; AY056274; AAL07123.1; ALT_INIT; mRNA.
DR EMBL; BT020586; AAW80859.1; ALT_INIT; mRNA.
DR EMBL; AK229981; BAF01806.1; -; mRNA.
DR RefSeq; NP_565382.4; NM_127144.7.
DR AlphaFoldDB; Q9XIL5; -.
DR SMR; Q9XIL5; -.
DR STRING; 3702.AT2G15820.1; -.
DR PaxDb; Q9XIL5; -.
DR PRIDE; Q9XIL5; -.
DR ProteomicsDB; 249419; -.
DR EnsemblPlants; AT2G15820.1; AT2G15820.1; AT2G15820.
DR GeneID; 816078; -.
DR Gramene; AT2G15820.1; AT2G15820.1; AT2G15820.
DR KEGG; ath:AT2G15820; -.
DR Araport; AT2G15820; -.
DR TAIR; locus:2044541; AT2G15820.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_017335_0_0_1; -.
DR InParanoid; Q9XIL5; -.
DR OMA; NAQRKWI; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9XIL5; -.
DR PRO; PR:Q9XIL5; -.
DR Proteomes; UP000006548; Chromosome 2.
DR ExpressionAtlas; Q9XIL5; baseline and differential.
DR Genevisible; Q9XIL5; AT.
DR GO; GO:0009507; C:chloroplast; ISM:TAIR.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0010239; P:chloroplast mRNA processing; IMP:TAIR.
DR GO; GO:0000373; P:Group II intron splicing; IMP:TAIR.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IBA:GO_Central.
DR GO; GO:0048564; P:photosystem I assembly; IMP:TAIR.
DR GO; GO:0006388; P:tRNA splicing, via endonucleolytic cleavage and ligation; IMP:TAIR.
DR Gene3D; 1.25.40.10; -; 3.
DR Gene3D; 3.10.28.10; -; 2.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR004860; LAGLIDADG_2.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF03161; LAGLIDADG_2; 1.
DR Pfam; PF01535; PPR; 3.
DR SUPFAM; SSF55608; SSF55608; 1.
DR TIGRFAMs; TIGR00756; PPR; 2.
DR PROSITE; PS51375; PPR; 7.
PE 2: Evidence at transcript level;
KW Chloroplast; mRNA processing; mRNA splicing; Plastid; Reference proteome;
KW Repeat; Transit peptide.
FT TRANSIT 1..68
FT /note="Chloroplast"
FT /evidence="ECO:0000255"
FT CHAIN 69..849
FT /note="Pentatricopeptide repeat-containing protein
FT At2g15820, chloroplastic"
FT /id="PRO_0000356014"
FT REPEAT 237..271
FT /note="PPR 1"
FT REPEAT 272..310
FT /note="PPR 2"
FT REPEAT 312..352
FT /note="PPR 3"
FT REPEAT 355..389
FT /note="PPR 4"
FT REPEAT 390..424
FT /note="PPR 5"
FT REPEAT 425..455
FT /note="PPR 6"
FT REPEAT 460..494
FT /note="PPR 7"
FT REPEAT 495..525
FT /note="PPR 8"
FT REPEAT 529..563
FT /note="PPR 9"
FT REPEAT 565..599
FT /note="PPR 10"
FT CONFLICT 455
FT /note="H -> Q (in Ref. 3; AAL07123)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 849 AA; 97396 MW; 2F31D46DE6DAB9E2 CRC64;
MTKSNGHNAT MIVTGACDFS SSFSLASSSS STVSVTTFNI SSLSSNPNII NSSSTLFRSL
SFSLIRHRSS YSRRSLRRLS IHTVHGNKTQ FFSHSSTRTP PLFTANSTAQ RSGTFVEHLT
GITESEEGIS EANGFGDVES ARNDIRNVAT RRIETEFEVR ELEELPEEWR RSKLAWLCKE
VPTHKAVTLV RLLNAQKKWV RQEDATYISV HCMRIRENET GFRVYRWMTQ QNWYRFDFGL
TTKLAEYLGK ERKFTKCREV FDDVLNQGRV PSESTFHILV VAYLSSLSVE GCLEEACSVY
NRMIQLGGYK PRLSLHNSLF RALVSKQGGI LNDQLKQAEF IFHNVVTTGL EVQKDIYSGL
IWLHSCQDEV DIGRINSLRE EMKKAGFQES KEVVVSLLRA YAKEGGVEEV ERTWLELLDL
DCGIPSQAFV YKIEAYSKVG DFAKAMEIFR EMEKHIGGAT MSGYHKIIEV LCKVQQVELV
ETLMKEFEES GKKPLLPSFI EIAKMYFDLG LHEKLEMAFV QCLEKCQPSQ PIYNIYLDSL
TKIGNLEKAG DVFNEMKNNG TINVSARSCN SLLKGYLDCG KQVQAERIYD LMRMKKYEIE
PPLMEKLDYI LSLKKKEVKK RPFSMKLSKD QREVLVGLLL GGLQIESDKE KKSHMIKFEF
RENSQAHLVL KQNIHDQFRE WLHPLSNFQE DIIPFEFYSV PHSYFGFYAE HYWPKGQPEI
PKLIHRWLSP HSLAYWYMYS GVKTSSGDII LRLKGSLEGV EKVVKALQAK SMECRVKKKG
KVFWIGLQGT NSALFWKLIE PHVLENLKEH LKPASESLDN VKEAEEQSIN FKSNSDHSDD
CVNSEAHFY