PPR49_ARATH
ID PPR49_ARATH Reviewed; 860 AA.
AC Q8GYP6; Q9LME0;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 25-MAY-2022, entry version 106.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g18900;
GN OrderedLocusNames=At1g18900; ORFNames=F14D16.2;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=11910074; DOI=10.1126/science.1071006;
RA Seki M., Narusaka M., Kamiya A., Ishida J., Satou M., Sakurai T.,
RA Nakajima M., Enju A., Akiyama K., Oono Y., Muramatsu M., Hayashizaki Y.,
RA Kawai J., Carninci P., Itoh M., Ishii Y., Arakawa T., Shibata K.,
RA Shinagawa A., Shinozaki K.;
RT "Functional annotation of a full-length Arabidopsis cDNA collection.";
RL Science 296:141-145(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=1;
CC Comment=A number of isoforms are produced. According to EST
CC sequences.;
CC Name=1;
CC IsoId=Q8GYP6-1; Sequence=Displayed;
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAF79278.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC068602; AAF79278.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE29778.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE29779.1; -; Genomic_DNA.
DR EMBL; CP002684; ANM60908.1; -; Genomic_DNA.
DR EMBL; AK117464; BAC42129.1; -; mRNA.
DR EMBL; BT005012; AAO50545.1; -; mRNA.
DR RefSeq; NP_001319039.1; NM_001332378.1. [Q8GYP6-1]
DR RefSeq; NP_173324.1; NM_101747.3. [Q8GYP6-1]
DR RefSeq; NP_973860.1; NM_202131.1. [Q8GYP6-1]
DR AlphaFoldDB; Q8GYP6; -.
DR SMR; Q8GYP6; -.
DR STRING; 3702.AT1G18900.3; -.
DR iPTMnet; Q8GYP6; -.
DR PaxDb; Q8GYP6; -.
DR PRIDE; Q8GYP6; -.
DR EnsemblPlants; AT1G18900.1; AT1G18900.1; AT1G18900. [Q8GYP6-1]
DR EnsemblPlants; AT1G18900.2; AT1G18900.2; AT1G18900. [Q8GYP6-1]
DR EnsemblPlants; AT1G18900.4; AT1G18900.4; AT1G18900. [Q8GYP6-1]
DR GeneID; 838471; -.
DR Gramene; AT1G18900.1; AT1G18900.1; AT1G18900. [Q8GYP6-1]
DR Gramene; AT1G18900.2; AT1G18900.2; AT1G18900. [Q8GYP6-1]
DR Gramene; AT1G18900.4; AT1G18900.4; AT1G18900. [Q8GYP6-1]
DR KEGG; ath:AT1G18900; -.
DR Araport; AT1G18900; -.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_015575_1_0_1; -.
DR InParanoid; Q8GYP6; -.
DR OMA; WILQERN; -.
DR PhylomeDB; Q8GYP6; -.
DR PRO; PR:Q8GYP6; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q8GYP6; baseline and differential.
DR Genevisible; Q8GYP6; AT.
DR Gene3D; 1.25.40.10; -; 3.
DR Gene3D; 3.30.1370.110; -; 1.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR002625; Smr_dom.
DR InterPro; IPR036063; Smr_dom_sf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF12854; PPR_1; 1.
DR Pfam; PF13041; PPR_2; 3.
DR SMART; SM00463; SMR; 1.
DR SUPFAM; SSF160443; SSF160443; 1.
DR SUPFAM; SSF48452; SSF48452; 1.
DR TIGRFAMs; TIGR00756; PPR; 8.
DR PROSITE; PS51375; PPR; 9.
DR PROSITE; PS50828; SMR; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Reference proteome; Repeat.
FT CHAIN 1..860
FT /note="Pentatricopeptide repeat-containing protein
FT At1g18900"
FT /id="PRO_0000342790"
FT REPEAT 363..397
FT /note="PPR 1"
FT REPEAT 398..432
FT /note="PPR 2"
FT REPEAT 433..467
FT /note="PPR 3"
FT REPEAT 468..502
FT /note="PPR 4"
FT REPEAT 503..537
FT /note="PPR 5"
FT REPEAT 538..572
FT /note="PPR 6"
FT REPEAT 573..607
FT /note="PPR 7"
FT REPEAT 608..642
FT /note="PPR 8"
FT DOMAIN 760..843
FT /note="Smr"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00321"
SQ SEQUENCE 860 AA; 95214 MW; 0FBF05A1D031A260 CRC64;
MIRAKHISNL SSTARSFFLN GSRTSVTDGN SCVYSDDENC VSKRQQLRKE AGQTEKRPSS
ILPKPSVVGC ILPGEVTKPV VPKKVDDFGR PSLLPQHVSS SPALPLKSHS VNYASTVVRE
EVEGKASSEP IGDQIFKAGI VAVNFLSDLS NCKIPSYDGG SDAFGLPKSC MVDPTRPISS
VKSSNVKAIR REHFAKIYPR SAAKESSVGT TRNPSSNFRG AKEAERTGFV KGFRQVSNSV
VGKSLPTTNN TYGKRTSVLQ RPHIDSNRFV PSGFSNSSVE MMKGPSGTAL TSRQYCNSGH
IVENVSSVLR RFRWGPAAEE ALQNLGLRID AYQANQVLKQ MNDYGNALGF FYWLKRQPGF
KHDGHTYTTM VGNLGRAKQF GAINKLLDEM VRDGCQPNTV TYNRLIHSYG RANYLNEAMN
VFNQMQEAGC KPDRVTYCTL IDIHAKAGFL DIAMDMYQRM QAGGLSPDTF TYSVIINCLG
KAGHLPAAHK LFCEMVDQGC TPNLVTYNIM MDLHAKARNY QNALKLYRDM QNAGFEPDKV
TYSIVMEVLG HCGYLEEAEA VFTEMQQKNW IPDEPVYGLL VDLWGKAGNV EKAWQWYQAM
LHAGLRPNVP TCNSLLSTFL RVNKIAEAYE LLQNMLALGL RPSLQTYTLL LSCCTDGRSK
LDMGFCGQLM ASTGHPAHMF LLKMPAAGPD GENVRNHANN FLDLMHSEDR ESKRGLVDAV
VDFLHKSGQK EEAGSVWEVA AQKNVFPDAL REKSCSYWLI NLHVMSEGTA VTALSRTLAW
FRKQMLASGT CPSRIDIVTG WGRRSRVTGT SMVRQAVEEL LNIFGSPFFT ESGNSGCFVG
SGEPLNRWLL QSHVERMHLL