ID PP122_ARATH Reviewed; 643 AA.
AC Q9CA54; Q570H1;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 25-MAY-2022, entry version 114.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g74630;
GN Name=PCMP-H71; OrderedLocusNames=At1g74630; ORFNames=F1M20.31;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 9-643.
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC011765; AAG52363.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE35616.1; -; Genomic_DNA.
DR EMBL; AK220733; BAD93880.1; -; mRNA.
DR EMBL; AK220737; BAD93890.1; -; mRNA.
DR PIR; D96775; D96775.
DR RefSeq; NP_177601.1; NM_106121.2.
DR AlphaFoldDB; Q9CA54; -.
DR SMR; Q9CA54; -.
DR PaxDb; Q9CA54; -.
DR ProteomicsDB; 249403; -.
DR EnsemblPlants; AT1G74630.1; AT1G74630.1; AT1G74630.
DR GeneID; 843802; -.
DR Gramene; AT1G74630.1; AT1G74630.1; AT1G74630.
DR KEGG; ath:AT1G74630; -.
DR Araport; AT1G74630; -.
DR TAIR; locus:2019160; AT1G74630.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_8_1; -.
DR InParanoid; Q9CA54; -.
DR OMA; VGFAHNG; -.
DR PhylomeDB; Q9CA54; -.
DR PRO; PR:Q9CA54; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9CA54; baseline and differential.
DR Genevisible; Q9CA54; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 7.
DR Pfam; PF13041; PPR_2; 1.
DR TIGRFAMs; TIGR00756; PPR; 7.
DR PROSITE; PS51375; PPR; 13.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..643
FT /note="Pentatricopeptide repeat-containing protein
FT At1g74630"
FT /id="PRO_0000342863"
FT REPEAT 69..103
FT /note="PPR 1"
FT REPEAT 105..139
FT /note="PPR 2"
FT REPEAT 140..170
FT /note="PPR 3"
FT REPEAT 171..201
FT /note="PPR 4"
FT REPEAT 202..236
FT /note="PPR 5"
FT REPEAT 237..267
FT /note="PPR 6"
FT REPEAT 268..302
FT /note="PPR 7"
FT REPEAT 303..333
FT /note="PPR 8"
FT REPEAT 335..369
FT /note="PPR 9"
FT REPEAT 370..400
FT /note="PPR 10"
FT REPEAT 406..436
FT /note="PPR 11"
FT REGION 441..516
FT /note="Type E motif"
FT REGION 517..547
FT /note="Type E(+) motif"
FT REGION 549..643
FT /note="Type DYW motif"
SQ SEQUENCE 643 AA; 72226 MW; CA615D3FF6020424 CRC64;
MTIAIHHCLS LLNSCKNLRA LTQIHGLFIK YGVDTDSYFT GKLILHCAIS ISDALPYARR
LLLCFPEPDA FMFNTLVRGY SESDEPHNSV AVFVEMMRKG FVFPDSFSFA FVIKAVENFR
SLRTGFQMHC QALKHGLESH LFVGTTLIGM YGGCGCVEFA RKVFDEMHQP NLVAWNAVIT
ACFRGNDVAG AREIFDKMLV RNHTSWNVML AGYIKAGELE SAKRIFSEMP HRDDVSWSTM
IVGIAHNGSF NESFLYFREL QRAGMSPNEV SLTGVLSACS QSGSFEFGKI LHGFVEKAGY
SWIVSVNNAL IDMYSRCGNV PMARLVFEGM QEKRCIVSWT SMIAGLAMHG QGEEAVRLFN
EMTAYGVTPD GISFISLLHA CSHAGLIEEG EDYFSEMKRV YHIEPEIEHY GCMVDLYGRS
GKLQKAYDFI CQMPIPPTAI VWRTLLGACS SHGNIELAEQ VKQRLNELDP NNSGDLVLLS
NAYATAGKWK DVASIRKSMI VQRIKKTTAW SLVEVGKTMY KFTAGEKKKG IDIEAHEKLK
EIILRLKDEA GYTPEVASAL YDVEEEEKED QVSKHSEKLA LAFALARLSK GANIRIVKNL
RICRDCHAVM KLTSKVYGVE ILVRDRNRFH SFKDGSCSCR DYW