PP114_ARATH
ID PP114_ARATH Reviewed; 745 AA.
AC Q9C9H9; Q0WL09;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 114.
DE RecName: Full=Pentatricopeptide repeat-containing protein At1g71420;
GN Name=PCMP-H70; OrderedLocusNames=At1g71420; ORFNames=F26A9.20;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-727.
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC016163; AAG51830.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE35199.1; -; Genomic_DNA.
DR EMBL; AK230400; BAF02198.1; -; mRNA.
DR RefSeq; NP_177298.1; NM_105811.3.
DR AlphaFoldDB; Q9C9H9; -.
DR SMR; Q9C9H9; -.
DR PaxDb; Q9C9H9; -.
DR PRIDE; Q9C9H9; -.
DR ProteomicsDB; 249062; -.
DR EnsemblPlants; AT1G71420.1; AT1G71420.1; AT1G71420.
DR GeneID; 843483; -.
DR Gramene; AT1G71420.1; AT1G71420.1; AT1G71420.
DR KEGG; ath:AT1G71420; -.
DR Araport; AT1G71420; -.
DR TAIR; locus:2825364; AT1G71420.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_1_1; -.
DR InParanoid; Q9C9H9; -.
DR OMA; MPMQPDY; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9C9H9; -.
DR PRO; PR:Q9C9H9; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9C9H9; baseline and differential.
DR Genevisible; Q9C9H9; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 5.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 5.
DR Pfam; PF13041; PPR_2; 1.
DR Pfam; PF13812; PPR_3; 1.
DR TIGRFAMs; TIGR00756; PPR; 4.
DR PROSITE; PS51375; PPR; 12.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..745
FT /note="Pentatricopeptide repeat-containing protein
FT At1g71420"
FT /id="PRO_0000342855"
FT REPEAT 58..88
FT /note="PPR 1"
FT REPEAT 95..125
FT /note="PPR 2"
FT REPEAT 126..160
FT /note="PPR 3"
FT REPEAT 191..224
FT /note="PPR 4"
FT REPEAT 225..259
FT /note="PPR 5"
FT REPEAT 260..296
FT /note="PPR 6"
FT REPEAT 301..332
FT /note="PPR 7"
FT REPEAT 334..367
FT /note="PPR 8"
FT REPEAT 368..402
FT /note="PPR 9"
FT REPEAT 403..437
FT /note="PPR 10"
FT REPEAT 438..464
FT /note="PPR 11"
FT REPEAT 466..496
FT /note="PPR 12"
FT REPEAT 502..532
FT /note="PPR 13"
FT REGION 537..613
FT /note="Type E motif"
FT REGION 614..644
FT /note="Type E(+) motif"
FT REGION 645..745
FT /note="Type DYW motif"
SQ SEQUENCE 745 AA; 84022 MW; 93F172F835FCF17D CRC64;
MITSLSQISF GTLRRFGSSV LPSALKREFV EGLRTLVRSG DIRRAVSLFY SAPVELQSQQ
AYAALFQACA EQRNLLDGIN LHHHMLSHPY CYSQNVILAN FLINMYAKCG NILYARQVFD
TMPERNVVSW TALITGYVQA GNEQEGFCLF SSMLSHCFPN EFTLSSVLTS CRYEPGKQVH
GLALKLGLHC SIYVANAVIS MYGRCHDGAA AYEAWTVFEA IKFKNLVTWN SMIAAFQCCN
LGKKAIGVFM RMHSDGVGFD RATLLNICSS LYKSSDLVPN EVSKCCLQLH SLTVKSGLVT
QTEVATALIK VYSEMLEDYT DCYKLFMEMS HCRDIVAWNG IITAFAVYDP ERAIHLFGQL
RQEKLSPDWY TFSSVLKACA GLVTARHALS IHAQVIKGGF LADTVLNNSL IHAYAKCGSL
DLCMRVFDDM DSRDVVSWNS MLKAYSLHGQ VDSILPVFQK MDINPDSATF IALLSACSHA
GRVEEGLRIF RSMFEKPETL PQLNHYACVI DMLSRAERFA EAEEVIKQMP MDPDAVVWIA
LLGSCRKHGN TRLGKLAADK LKELVEPTNS MSYIQMSNIY NAEGSFNEAN LSIKEMETWR
VRKEPDLSWT EIGNKVHEFA SGGRHRPDKE AVYRELKRLI SWLKEMGYVP EMRSASQDIE
DEEQEEDNLL HHSEKLALAF AVMEGRKSSD CGVNLIQIMK NTRICIDCHN FMKLASKLLG
KEILMRDSNR FHHFKDSSCS CNDYW