PPR53_ARATH
ID PPR53_ARATH Reviewed; 760 AA.
AC Q9LNU6;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-2008, sequence version 2.
DT 25-MAY-2022, entry version 114.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g20230;
GN Name=PCMP-H21; OrderedLocusNames=At1g20230; ORFNames=T20H2.1;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAF79892.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC022472; AAF79892.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE29953.1; -; Genomic_DNA.
DR PIR; A86336; A86336.
DR RefSeq; NP_173449.1; NM_101875.2.
DR AlphaFoldDB; Q9LNU6; -.
DR SMR; Q9LNU6; -.
DR PaxDb; Q9LNU6; -.
DR PRIDE; Q9LNU6; -.
DR ProteomicsDB; 234836; -.
DR EnsemblPlants; AT1G20230.1; AT1G20230.1; AT1G20230.
DR GeneID; 838612; -.
DR Gramene; AT1G20230.1; AT1G20230.1; AT1G20230.
DR KEGG; ath:AT1G20230; -.
DR Araport; AT1G20230; -.
DR TAIR; locus:2198546; AT1G20230.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_8_1; -.
DR InParanoid; Q9LNU6; -.
DR OMA; MGGYAMH; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9LNU6; -.
DR PRO; PR:Q9LNU6; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9LNU6; baseline and differential.
DR Genevisible; Q9LNU6; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 5.
DR Pfam; PF13041; PPR_2; 2.
DR Pfam; PF13812; PPR_3; 1.
DR TIGRFAMs; TIGR00756; PPR; 9.
DR PROSITE; PS51375; PPR; 12.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..760
FT /note="Pentatricopeptide repeat-containing protein
FT At1g20230"
FT /id="PRO_0000342794"
FT REPEAT 49..79
FT /note="PPR 1"
FT REPEAT 80..114
FT /note="PPR 2"
FT REPEAT 115..149
FT /note="PPR 3"
FT REPEAT 150..180
FT /note="PPR 4"
FT REPEAT 181..215
FT /note="PPR 5"
FT REPEAT 216..250
FT /note="PPR 6"
FT REPEAT 251..285
FT /note="PPR 7"
FT REPEAT 286..316
FT /note="PPR 8"
FT REPEAT 317..351
FT /note="PPR 9"
FT REPEAT 352..386
FT /note="PPR 10"
FT REPEAT 387..421
FT /note="PPR 11"
FT REPEAT 422..452
FT /note="PPR 12"
FT REPEAT 453..487
FT /note="PPR 13"
FT REPEAT 488..523
FT /note="PPR 14"
FT REPEAT 524..554
FT /note="PPR 15"
FT REGION 559..634
FT /note="Type E motif"
FT REGION 635..665
FT /note="Type E(+) motif"
FT REGION 666..760
FT /note="Type DYW motif"
SQ SEQUENCE 760 AA; 84864 MW; 1F9FD6129E4FC2FF CRC64;
MTKQVLPLIE KIPQSIVGFL ESSSYHWSSS LSKTTQAHAR ILKSGAQNDG YISAKLIASY
SNYNCFNDAD LVLQSIPDPT IYSFSSLIYA LTKAKLFTQS IGVFSRMFSH GLIPDSHVLP
NLFKVCAELS AFKVGKQIHC VSCVSGLDMD AFVQGSMFHM YMRCGRMGDA RKVFDRMSDK
DVVTCSALLC AYARKGCLEE VVRILSEMES SGIEANIVSW NGILSGFNRS GYHKEAVVMF
QKIHHLGFCP DQVTVSSVLP SVGDSEMLNM GRLIHGYVIK QGLLKDKCVI SAMIDMYGKS
GHVYGIISLF NQFEMMEAGV CNAYITGLSR NGLVDKALEM FELFKEQTME LNVVSWTSII
AGCAQNGKDI EALELFREMQ VAGVKPNHVT IPSMLPACGN IAALGHGRST HGFAVRVHLL
DNVHVGSALI DMYAKCGRIN LSQIVFNMMP TKNLVCWNSL MNGFSMHGKA KEVMSIFESL
MRTRLKPDFI SFTSLLSACG QVGLTDEGWK YFKMMSEEYG IKPRLEHYSC MVNLLGRAGK
LQEAYDLIKE MPFEPDSCVW GALLNSCRLQ NNVDLAEIAA EKLFHLEPEN PGTYVLLSNI
YAAKGMWTEV DSIRNKMESL GLKKNPGCSW IQVKNRVYTL LAGDKSHPQI DQITEKMDEI
SKEMRKSGHR PNLDFALHDV EEQEQEQMLW GHSEKLAVVF GLLNTPDGTP LQVIKNLRIC
GDCHAVIKFI SSYAGREIFI RDTNRFHHFK DGICSCGDFW