ID PPR68_ARATH Reviewed; 606 AA.
AC Q9C6T2; Q0WVV3;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 120.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g31920;
GN Name=PCMP-H11; OrderedLocusNames=At1g31920; ORFNames=F5M6.8;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-527.
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC079041; AAG50713.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE31416.1; -; Genomic_DNA.
DR EMBL; AK226634; BAE98745.1; -; mRNA.
DR PIR; D86443; D86443.
DR RefSeq; NP_174474.1; NM_102928.5.
DR AlphaFoldDB; Q9C6T2; -.
DR SMR; Q9C6T2; -.
DR STRING; 3702.AT1G31920.1; -.
DR PaxDb; Q9C6T2; -.
DR PRIDE; Q9C6T2; -.
DR ProteomicsDB; 236653; -.
DR EnsemblPlants; AT1G31920.1; AT1G31920.1; AT1G31920.
DR GeneID; 840082; -.
DR Gramene; AT1G31920.1; AT1G31920.1; AT1G31920.
DR KEGG; ath:AT1G31920; -.
DR Araport; AT1G31920; -.
DR TAIR; locus:2034456; AT1G31920.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_2_1; -.
DR InParanoid; Q9C6T2; -.
DR OMA; PRADDIY; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9C6T2; -.
DR PRO; PR:Q9C6T2; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9C6T2; differential.
DR Genevisible; Q9C6T2; AT.
DR GO; GO:0003729; F:mRNA binding; IDA:TAIR.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 2.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF13041; PPR_2; 2.
DR TIGRFAMs; TIGR00756; PPR; 4.
DR PROSITE; PS51375; PPR; 8.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..606
FT /note="Pentatricopeptide repeat-containing protein
FT At1g31920"
FT /id="PRO_0000342809"
FT REPEAT 96..130
FT /note="PPR 1"
FT REPEAT 131..165
FT /note="PPR 2"
FT REPEAT 166..200
FT /note="PPR 3"
FT REPEAT 201..227
FT /note="PPR 4"
FT REPEAT 233..263
FT /note="PPR 5"
FT REPEAT 268..298
FT /note="PPR 6"
FT REPEAT 299..333
FT /note="PPR 7"
FT REPEAT 334..368
FT /note="PPR 8"
FT REPEAT 370..404
FT /note="PPR 9"
FT REGION 405..480
FT /note="Type E motif"
FT REGION 481..511
FT /note="Type E(+) motif"
FT REGION 512..606
FT /note="Type DYW motif"
SQ SEQUENCE 606 AA; 68596 MW; 3461654764DB64C9 CRC64;
MIKAPILQSL LASRDDLTHN PEVNNFGGKE QECLYLLKRC HNIDEFKQVH ARFIKLSLFY
SSSFSASSVL AKCAHSGWEN SMNYAASIFR GIDDPCTFDF NTMIRGYVNV MSFEEALCFY
NEMMQRGNEP DNFTYPCLLK ACTRLKSIRE GKQIHGQVFK LGLEADVFVQ NSLINMYGRC
GEMELSSAVF EKLESKTAAS WSSMVSARAG MGMWSECLLL FRGMCSETNL KAEESGMVSA
LLACANTGAL NLGMSIHGFL LRNISELNII VQTSLVDMYV KCGCLDKALH IFQKMEKRNN
LTYSAMISGL ALHGEGESAL RMFSKMIKEG LEPDHVVYVS VLNACSHSGL VKEGRRVFAE
MLKEGKVEPT AEHYGCLVDL LGRAGLLEEA LETIQSIPIE KNDVIWRTFL SQCRVRQNIE
LGQIAAQELL KLSSHNPGDY LLISNLYSQG QMWDDVARTR TEIAIKGLKQ TPGFSIVELK
GKTHRFVSQD RSHPKCKEIY KMLHQMEWQL KFEGYSPDLT QILLNVDEEE KKERLKGHSQ
KVAIAFGLLY TPPGSIIKIA RNLRMCSDCH TYTKKISMIY EREIVVRDRN RFHLFKGGTC
SCKDYW