PP348_ARATH
ID PP348_ARATH Reviewed; 823 AA.
AC O81767;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 10-FEB-2009, sequence version 2.
DT 03-AUG-2022, entry version 119.
DE RecName: Full=Pentatricopeptide repeat-containing protein At4g33990;
DE AltName: Full=Protein EMBRYO DEFECTIVE 2758;
GN Name=EMB2758; Synonyms=PCMP-H20; OrderedLocusNames=At4g33990;
GN ORFNames=F17I5.180;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA19881.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB80116.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL031032; CAA19881.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161584; CAB80116.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE86306.1; -; Genomic_DNA.
DR PIR; T05227; T05227.
DR RefSeq; NP_567948.1; NM_119561.2.
DR AlphaFoldDB; O81767; -.
DR SMR; O81767; -.
DR PaxDb; O81767; -.
DR PRIDE; O81767; -.
DR ProteomicsDB; 248996; -.
DR EnsemblPlants; AT4G33990.1; AT4G33990.1; AT4G33990.
DR GeneID; 829546; -.
DR Gramene; AT4G33990.1; AT4G33990.1; AT4G33990.
DR KEGG; ath:AT4G33990; -.
DR Araport; AT4G33990; -.
DR TAIR; locus:2118964; AT4G33990.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_1_1; -.
DR InParanoid; O81767; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; O81767; -.
DR PRO; PR:O81767; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; O81767; baseline and differential.
DR Genevisible; O81767; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 6.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 7.
DR Pfam; PF13041; PPR_2; 1.
DR TIGRFAMs; TIGR00756; PPR; 6.
DR PROSITE; PS51375; PPR; 13.
PE 3: Inferred from homology;
KW Reference proteome; Repeat.
FT CHAIN 1..823
FT /note="Pentatricopeptide repeat-containing protein
FT At4g33990"
FT /id="PRO_0000363465"
FT REPEAT 85..115
FT /note="PPR 1"
FT REPEAT 116..151
FT /note="PPR 2"
FT REPEAT 152..183
FT /note="PPR 3"
FT REPEAT 184..214
FT /note="PPR 4"
FT REPEAT 215..249
FT /note="PPR 5"
FT REPEAT 252..280
FT /note="PPR 6"
FT REPEAT 281..311
FT /note="PPR 7"
FT REPEAT 312..346
FT /note="PPR 8"
FT REPEAT 347..381
FT /note="PPR 9"
FT REPEAT 383..413
FT /note="PPR 10"
FT REPEAT 414..448
FT /note="PPR 11"
FT REPEAT 450..484
FT /note="PPR 12"
FT REPEAT 485..515
FT /note="PPR 13"
FT REPEAT 516..550
FT /note="PPR 14"
FT REPEAT 551..581
FT /note="PPR 15"
FT REPEAT 587..617
FT /note="PPR 16"
FT REGION 622..697
FT /note="Type E motif"
FT REGION 698..728
FT /note="Type E(+) motif"
FT REGION 729..823
FT /note="Type DYW motif"
SQ SEQUENCE 823 AA; 92414 MW; E46C086D20902413 CRC64;
MKFGTFSLPR QIPTCKGGRF TRVLQSIGSV IREFSASANA LQDCWKNGNE SKEIDDVHTL
FRYCTNLQSA KCLHARLVVS KQIQNVCISA KLVNLYCYLG NVALARHTFD HIQNRDVYAW
NLMISGYGRA GNSSEVIRCF SLFMLSSGLT PDYRTFPSVL KACRTVIDGN KIHCLALKFG
FMWDVYVAAS LIHLYSRYKA VGNARILFDE MPVRDMGSWN AMISGYCQSG NAKEALTLSN
GLRAMDSVTV VSLLSACTEA GDFNRGVTIH SYSIKHGLES ELFVSNKLID LYAEFGRLRD
CQKVFDRMYV RDLISWNSII KAYELNEQPL RAISLFQEMR LSRIQPDCLT LISLASILSQ
LGDIRACRSV QGFTLRKGWF LEDITIGNAV VVMYAKLGLV DSARAVFNWL PNTDVISWNT
IISGYAQNGF ASEAIEMYNI MEEEGEIAAN QGTWVSVLPA CSQAGALRQG MKLHGRLLKN
GLYLDVFVVT SLADMYGKCG RLEDALSLFY QIPRVNSVPW NTLIACHGFH GHGEKAVMLF
KEMLDEGVKP DHITFVTLLS ACSHSGLVDE GQWCFEMMQT DYGITPSLKH YGCMVDMYGR
AGQLETALKF IKSMSLQPDA SIWGALLSAC RVHGNVDLGK IASEHLFEVE PEHVGYHVLL
SNMYASAGKW EGVDEIRSIA HGKGLRKTPG WSSMEVDNKV EVFYTGNQTH PMYEEMYREL
TALQAKLKMI GYVPDHRFVL QDVEDDEKEH ILMSHSERLA IAFALIATPA KTTIRIFKNL
RVCGDCHSVT KFISKITERE IIVRDSNRFH HFKNGVCSCG DYW