PP347_ARATH
ID PP347_ARATH Reviewed; 990 AA.
AC Q9SMZ2;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-2000, sequence version 1.
DT 25-MAY-2022, entry version 113.
DE RecName: Full=Pentatricopeptide repeat-containing protein At4g33170;
GN Name=PCMP-H53; OrderedLocusNames=At4g33170; ORFNames=F4I10.100;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP GENE FAMILY.
RX PubMed=10809006; DOI=10.1023/a:1006352315928;
RA Aubourg S., Boudet N., Kreis M., Lecharny A.;
RT "In Arabidopsis thaliana, 1% of the genome codes for a novel protein family
RT unique to plants.";
RL Plant Mol. Biol. 42:603-613(2000).
RN [4]
RP GENE FAMILY.
RX PubMed=15269332; DOI=10.1105/tpc.104.022236;
RA Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C.,
RA Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M.,
RA Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B.,
RA Taconnat L., Small I.;
RT "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins
RT reveals their essential role in organelle biogenesis.";
RL Plant Cell 16:2089-2103(2004).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Pentatricopeptide repeat proteins;
CC URL="https://ppr.plantenergy.uwa.edu.au";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL035525; CAB36791.1; -; Genomic_DNA.
DR EMBL; AL161583; CAB80034.1; -; Genomic_DNA.
DR EMBL; CP002687; AEE86186.1; -; Genomic_DNA.
DR PIR; T05197; T05197.
DR RefSeq; NP_195043.1; NM_119471.2.
DR AlphaFoldDB; Q9SMZ2; -.
DR SMR; Q9SMZ2; -.
DR STRING; 3702.AT4G33170.1; -.
DR iPTMnet; Q9SMZ2; -.
DR PaxDb; Q9SMZ2; -.
DR PRIDE; Q9SMZ2; -.
DR ProteomicsDB; 248995; -.
DR EnsemblPlants; AT4G33170.1; AT4G33170.1; AT4G33170.
DR GeneID; 829454; -.
DR Gramene; AT4G33170.1; AT4G33170.1; AT4G33170.
DR KEGG; ath:AT4G33170; -.
DR Araport; AT4G33170; -.
DR TAIR; locus:2125899; AT4G33170.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_0_1; -.
DR InParanoid; Q9SMZ2; -.
DR OMA; TYHQMRL; -.
DR OrthoDB; 1344243at2759; -.
DR PhylomeDB; Q9SMZ2; -.
DR PRO; PR:Q9SMZ2; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q9SMZ2; baseline and differential.
DR Genevisible; Q9SMZ2; AT.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 1.25.40.10; -; 6.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR002885; Pentatricopeptide_repeat.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF01535; PPR; 6.
DR Pfam; PF13041; PPR_2; 3.
DR TIGRFAMs; TIGR00756; PPR; 5.
DR PROSITE; PS51375; PPR; 15.
PE 3: Inferred from homology;
KW Reference proteome; Repeat.
FT CHAIN 1..990
FT /note="Pentatricopeptide repeat-containing protein
FT At4g33170"
FT /id="PRO_0000363464"
FT REPEAT 73..107
FT /note="PPR 1"
FT REPEAT 109..139
FT /note="PPR 2"
FT REPEAT 144..178
FT /note="PPR 3"
FT REPEAT 179..209
FT /note="PPR 4"
FT REPEAT 210..244
FT /note="PPR 5"
FT REPEAT 279..313
FT /note="PPR 6"
FT REPEAT 314..348
FT /note="PPR 7"
FT REPEAT 349..379
FT /note="PPR 8"
FT REPEAT 380..414
FT /note="PPR 9"
FT REPEAT 415..450
FT /note="PPR 10"
FT REPEAT 451..477
FT /note="PPR 11"
FT REPEAT 481..515
FT /note="PPR 12"
FT REPEAT 516..550
FT /note="PPR 13"
FT REPEAT 551..581
FT /note="PPR 14"
FT REPEAT 582..616
FT /note="PPR 15"
FT REPEAT 617..651
FT /note="PPR 16"
FT REPEAT 652..682
FT /note="PPR 17"
FT REPEAT 683..717
FT /note="PPR 18"
FT REPEAT 718..753
FT /note="PPR 19"
FT REPEAT 754..788
FT /note="PPR 20"
FT REGION 789..864
FT /note="Type E motif"
FT REGION 865..895
FT /note="Type E(+) motif"
FT REGION 896..990
FT /note="Type DYW motif"
SQ SEQUENCE 990 AA; 110815 MW; 6280769E3CBBD850 CRC64;
MRSTSKAIPF SFHTSLIVQC LRPLRFTSAA SPSSSSSSSS QWFGFLRNAI TSSDLMLGKC
THARILTFEE NPERFLINNL ISMYSKCGSL TYARRVFDKM PDRDLVSWNS ILAAYAQSSE
CVVENIQQAF LLFRILRQDV VYTSRMTLSP MLKLCLHSGY VWASESFHGY ACKIGLDGDE
FVAGALVNIY LKFGKVKEGK VLFEEMPYRD VVLWNLMLKA YLEMGFKEEA IDLSSAFHSS
GLNPNEITLR LLARISGDDS DAGQVKSFAN GNDASSVSEI IFRNKGLSEY LHSGQYSALL
KCFADMVESD VECDQVTFIL MLATAVKVDS LALGQQVHCM ALKLGLDLML TVSNSLINMY
CKLRKFGFAR TVFDNMSERD LISWNSVIAG IAQNGLEVEA VCLFMQLLRC GLKPDQYTMT
SVLKAASSLP EGLSLSKQVH VHAIKINNVS DSFVSTALID AYSRNRCMKE AEILFERHNF
DLVAWNAMMA GYTQSHDGHK TLKLFALMHK QGERSDDFTL ATVFKTCGFL FAINQGKQVH
AYAIKSGYDL DLWVSSGILD MYVKCGDMSA AQFAFDSIPV PDDVAWTTMI SGCIENGEEE
RAFHVFSQMR LMGVLPDEFT IATLAKASSC LTALEQGRQI HANALKLNCT NDPFVGTSLV
DMYAKCGSID DAYCLFKRIE MMNITAWNAM LVGLAQHGEG KETLQLFKQM KSLGIKPDKV
TFIGVLSACS HSGLVSEAYK HMRSMHGDYG IKPEIEHYSC LADALGRAGL VKQAENLIES
MSMEASASMY RTLLAACRVQ GDTETGKRVA TKLLELEPLD SSAYVLLSNM YAAASKWDEM
KLARTMMKGH KVKKDPGFSW IEVKNKIHIF VVDDRSNRQT ELIYRKVKDM IRDIKQEGYV
PETDFTLVDV EEEEKERALY YHSEKLAVAF GLLSTPPSTP IRVIKNLRVC GDCHNAMKYI
AKVYNREIVL RDANRFHRFK DGICSCGDYW