PPR52_ARATH
ID PPR52_ARATH Reviewed; 894 AA.
AC Q9FXH1; Q0WPD8; Q9M4P6;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 119.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g19720;
DE AltName: Full=Protein DYW7;
GN Name=DYW7; Synonyms=PCMP-H7; OrderedLocusNames=At1g19720;
GN ORFNames=F14P1.33, F6F9.22;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 159-500.
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 489-894, AND GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [5]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAF01011.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC007797; AAG12555.1; -; Genomic_DNA.
DR EMBL; AC024609; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CP002684; AEE29891.1; -; Genomic_DNA.
DR EMBL; AK229137; BAF01011.1; ALT_INIT; mRNA.
DR EMBL; AJ006040; CAA06829.1; -; mRNA.
DR PIR; C86330; C86330.
DR PIR; T52647; T52647.
DR RefSeq; NP_173402.2; NM_101828.2.
DR AlphaFoldDB; Q9FXH1; -.
DR SMR; Q9FXH1; -.
DR STRING; 3702.AT1G19720.1; -.
DR PaxDb; Q9FXH1; -.
DR PRIDE; Q9FXH1; -.
DR ProteomicsDB; 226313; -.
DR EnsemblPlants; AT1G19720.1; AT1G19720.1; AT1G19720.
DR GeneID; 838561; -.
DR Gramene; AT1G19720.1; AT1G19720.1; AT1G19720.
DR KEGG; ath:AT1G19720; -.
DR Araport; AT1G19720; -.
DR TAIR; locus:2013079; AT1G19720.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_1_1; -.
DR InParanoid; Q9FXH1; -.
DR OMA; RCLHHFK; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9FXH1; -.
DR PRO; PR:Q9FXH1; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9FXH1; baseline and differential.
DR GO; GO:0009507; C:chloroplast; HDA:TAIR.
DR GO; GO:0005737; C:cytoplasm; HDA:TAIR.
DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; HDA:TAIR.
DR GO; GO:0003729; F:mRNA binding; IDA:TAIR.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IBA:GO_Central.
DR Gene3D; 1.25.40.10; -; 5.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 3.
DR Pfam; PF13041; PPR_2; 5.
DR TIGRFAMs; TIGR00756; PPR; 9.
DR PROSITE; PS51375; PPR; 14.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..894
FT /note="Pentatricopeptide repeat-containing protein
FT At1g19720"
FT /id="PRO_0000342793"
FT REPEAT 80..110
FT /note="PPR 1"
FT REPEAT 114..144
FT /note="PPR 2"
FT REPEAT 145..179
FT /note="PPR 3"
FT REPEAT 180..214
FT /note="PPR 4"
FT REPEAT 215..245
FT /note="PPR 5"
FT REPEAT 246..280
FT /note="PPR 6"
FT REPEAT 281..315
FT /note="PPR 7"
FT REPEAT 316..350
FT /note="PPR 8"
FT REPEAT 351..385
FT /note="PPR 9"
FT REPEAT 386..416
FT /note="PPR 10"
FT REPEAT 417..451
FT /note="PPR 11"
FT REPEAT 452..486
FT /note="PPR 12"
FT REPEAT 488..522
FT /note="PPR 13"
FT REPEAT 523..557
FT /note="PPR 14"
FT REPEAT 558..588
FT /note="PPR 15"
FT REPEAT 589..623
FT /note="PPR 16"
FT REPEAT 624..659
FT /note="PPR 17"
FT REPEAT 660..694
FT /note="PPR 18"
FT REGION 695..770
FT /note="Type E motif"
FT REGION 771..801
FT /note="Type E(+) motif"
FT REGION 803..894
FT /note="Type DYW motif"
FT CONFLICT 489
FT /note="T -> P (in Ref. 4; CAA06829)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 894 AA; 100815 MW; 529FD2BFD65AB447 CRC64;
MEKLFVPSFP KTFLNYQTPA KVENSPELHP KSRKKNLSFT KKKEPNIIPD EQFDYLCRNG
SLLEAEKALD SLFQQGSKVK RSTYLKLLES CIDSGSIHLG RILHARFGLF TEPDVFVETK
LLSMYAKCGC IADARKVFDS MRERNLFTWS AMIGAYSREN RWREVAKLFR LMMKDGVLPD
DFLFPKILQG CANCGDVEAG KVIHSVVIKL GMSSCLRVSN SILAVYAKCG ELDFATKFFR
RMRERDVIAW NSVLLAYCQN GKHEEAVELV KEMEKEGISP GLVTWNILIG GYNQLGKCDA
AMDLMQKMET FGITADVFTW TAMISGLIHN GMRYQALDMF RKMFLAGVVP NAVTIMSAVS
ACSCLKVINQ GSEVHSIAVK MGFIDDVLVG NSLVDMYSKC GKLEDARKVF DSVKNKDVYT
WNSMITGYCQ AGYCGKAYEL FTRMQDANLR PNIITWNTMI SGYIKNGDEG EAMDLFQRME
KDGKVQRNTA TWNLIIAGYI QNGKKDEALE LFRKMQFSRF MPNSVTILSL LPACANLLGA
KMVREIHGCV LRRNLDAIHA VKNALTDTYA KSGDIEYSRT IFLGMETKDI ITWNSLIGGY
VLHGSYGPAL ALFNQMKTQG ITPNRGTLSS IILAHGLMGN VDEGKKVFYS IANDYHIIPA
LEHCSAMVYL YGRANRLEEA LQFIQEMNIQ SETPIWESFL TGCRIHGDID MAIHAAENLF
SLEPENTATE SIVSQIYALG AKLGRSLEGN KPRRDNLLKK PLGQSWIEVR NLIHTFTTGD
QSKLCTDVLY PLVEKMSRLD NRSDQYNGEL WIEEEGREET CGIHSEKFAM AFGLISSSGA
SKTTIRILKN LRMCRDCHDT AKYVSKRYGC DILLEDTRCL HHFKNGDCSC KDYW