CP33_ARATH
ID CP33_ARATH Reviewed; 329 AA.
AC Q39061; Q39062; Q8W4N7;
DT 07-JAN-2015, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1996, sequence version 1.
DT 25-MAY-2022, entry version 163.
DE RecName: Full=RNA-binding protein CP33, chloroplastic {ECO:0000305};
DE AltName: Full=Protein PIGMENT DEFECTIVE 322;
DE Flags: Precursor;
GN Name=CP33 {ECO:0000303|PubMed:7894017}; Synonyms=PDE322 {ECO:0000305};
GN OrderedLocusNames=At3g52380 {ECO:0000312|Araport:AT3G52380};
GN ORFNames=F22O6_240 {ECO:0000312|EMBL:CAB43448.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RX PubMed=7894017; DOI=10.1007/bf00019319;
RA Ohta M., Sugita M., Sugiura M.;
RT "Three types of nuclear genes encoding chloroplast RNA-binding proteins
RT (cp29, cp31 and cp33) are present in Arabidopsis thaliana: presence of cp31
RT in chloroplasts and its homologue in nuclei/cytoplasms.";
RL Plant Mol. Biol. 27:529-539(1995).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130713; DOI=10.1038/35048706;
RA Salanoubat M., Lemcke K., Rieger M., Ansorge W., Unseld M., Fartmann B.,
RA Valle G., Bloecker H., Perez-Alonso M., Obermaier B., Delseny M.,
RA Boutry M., Grivell L.A., Mache R., Puigdomenech P., De Simone V.,
RA Choisne N., Artiguenave F., Robert C., Brottier P., Wincker P.,
RA Cattolico L., Weissenbach J., Saurin W., Quetier F., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Benes V., Wurmbach E., Drzonek H.,
RA Erfle H., Jordan N., Bangert S., Wiedelmann R., Kranz H., Voss H.,
RA Holland R., Brandt P., Nyakatura G., Vezzi A., D'Angelo M., Pallavicini A.,
RA Toppo S., Simionati B., Conrad A., Hornischer K., Kauer G., Loehnert T.-H.,
RA Nordsiek G., Reichelt J., Scharfe M., Schoen O., Bargues M., Terol J.,
RA Climent J., Navarro P., Collado C., Perez-Perez A., Ottenwaelder B.,
RA Duchemin D., Cooke R., Laudie M., Berger-Llauro C., Purnelle B., Masuy D.,
RA de Haan M., Maarse A.C., Alcaraz J.-P., Cottet A., Casacuberta E.,
RA Monfort A., Argiriou A., Flores M., Liguori R., Vitale D., Mannhaupt G.,
RA Haase D., Schoof H., Rudd S., Zaccaria P., Mewes H.-W., Mayer K.F.X.,
RA Kaul S., Town C.D., Koo H.L., Tallon L.J., Jenkins J., Rooney T., Rizzo M.,
RA Walts A., Utterback T., Fujii C.Y., Shea T.P., Creasy T.H., Haas B.,
RA Maiti R., Wu D., Peterson J., Van Aken S., Pai G., Militscher J.,
RA Sellers P., Gill J.E., Feldblyum T.V., Preuss D., Lin X., Nierman W.C.,
RA Salzberg S.L., White O., Venter J.C., Fraser C.M., Kaneko T., Nakamura Y.,
RA Sato S., Kato T., Asamizu E., Sasamoto S., Kimura T., Idesawa K.,
RA Kawashima K., Kishida Y., Kiyokawa C., Kohara M., Matsumoto M., Matsuno A.,
RA Muraki A., Nakayama S., Nakazaki N., Shinpo S., Takeuchi C., Wada T.,
RA Watanabe A., Yamada M., Yasuda M., Tabata S.;
RT "Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana.";
RL Nature 408:820-822(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RA Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B.,
RA Feldmann K.A.;
RT "Full-length cDNA from Arabidopsis thaliana.";
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Could be involved in splicing and/or processing of
CC chloroplast RNAs. {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Plastid, chloroplast {ECO:0000255}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D31714; BAA06522.1; -; Genomic_DNA.
DR EMBL; D31715; BAA06523.1; -; mRNA.
DR EMBL; AL050300; CAB43448.1; -; Genomic_DNA.
DR EMBL; CP002686; AEE78940.1; -; Genomic_DNA.
DR EMBL; AY039607; AAK62662.1; -; mRNA.
DR EMBL; AY062455; AAL32533.1; -; mRNA.
DR EMBL; AY078022; AAL77723.1; -; mRNA.
DR EMBL; AY085279; AAM62511.1; -; mRNA.
DR PIR; S53494; S53494.
DR RefSeq; NP_190806.1; NM_115098.3.
DR AlphaFoldDB; Q39061; -.
DR SMR; Q39061; -.
DR STRING; 3702.AT3G52380.1; -.
DR PaxDb; Q39061; -.
DR PRIDE; Q39061; -.
DR ProteomicsDB; 224377; -.
DR EnsemblPlants; AT3G52380.1; AT3G52380.1; AT3G52380.
DR GeneID; 824403; -.
DR Gramene; AT3G52380.1; AT3G52380.1; AT3G52380.
DR KEGG; ath:AT3G52380; -.
DR Araport; AT3G52380; -.
DR TAIR; locus:2079874; AT3G52380.
DR eggNOG; KOG0118; Eukaryota.
DR HOGENOM; CLU_012062_15_1_1; -.
DR InParanoid; Q39061; -.
DR OMA; DEMHLSE; -.
DR OrthoDB; 1202220at2759; -.
DR PhylomeDB; Q39061; -.
DR PRO; PR:Q39061; -.
DR Proteomes; UP000006548; Chromosome 3.
DR ExpressionAtlas; Q39061; baseline and differential.
DR Genevisible; Q39061; AT.
DR GO; GO:0009507; C:chloroplast; IDA:TAIR.
DR GO; GO:0009570; C:chloroplast stroma; IDA:TAIR.
DR GO; GO:0005829; C:cytosol; HDA:TAIR.
DR GO; GO:0009579; C:thylakoid; HDA:TAIR.
DR GO; GO:0003729; F:mRNA binding; IDA:TAIR.
DR GO; GO:0003723; F:RNA binding; IDA:TAIR.
DR GO; GO:0031425; P:chloroplast RNA processing; IMP:TAIR.
DR GO; GO:1901259; P:chloroplast rRNA processing; IMP:TAIR.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR Gene3D; 3.30.70.330; -; 2.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR Pfam; PF00076; RRM_1; 2.
DR SMART; SM00360; RRM; 2.
DR SUPFAM; SSF54928; SSF54928; 1.
DR PROSITE; PS50102; RRM; 2.
PE 2: Evidence at transcript level;
KW Chloroplast; mRNA processing; Plastid; Reference proteome; Repeat;
KW Ribonucleoprotein; RNA-binding; Transit peptide.
FT TRANSIT 1..69
FT /note="Chloroplast"
FT /evidence="ECO:0000255"
FT CHAIN 70..329
FT /note="RNA-binding protein CP33, chloroplastic"
FT /evidence="ECO:0000255"
FT /id="PRO_0000431492"
FT DOMAIN 116..194
FT /note="RRM 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT DOMAIN 219..297
FT /note="RRM 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT REGION 77..117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 296..329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..104
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 116
FT /note="G -> W (in Ref. 1; BAA06523)"
FT /evidence="ECO:0000305"
FT CONFLICT 172
FT /note="E -> G (in Ref. 4; AAL32533)"
FT /evidence="ECO:0000305"
FT CONFLICT 229
FT /note="N -> D (in Ref. 4; AAL32533)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 329 AA; 35744 MW; 96E34C7E1B8B1DBD CRC64;
MSSAYCSSAV AVSAAATASS AATFNPLLSS HSNSQLFYRF TPKSFKLVAN CPNPLILHSN
IRRHRFFCAA ETEASSADDE IQASVEEEEE VEEEGDEGEE EVEEEKQTTQ ASGEEGRLYV
GNLPYTITSS ELSQIFGEAG TVVDVQIVYD KVTDRSRGFG FVTMGSIEEA KEAMQMFNSS
QIGGRTVKVN FPEVPRGGEN EVMRTKIRDN NRSYVDSPHK VYAGNLGWNL TSQGLKDAFG
DQPGVLGAKV IYERNTGRSR GFGFISFESA ENVQSALATM NGVEVEGRAL RLNLASEREK
PTVSPPSVEE GETEEASLES NEVLSNVSA