RGGC_ARATH
ID RGGC_ARATH Reviewed; 357 AA.
AC Q9LVT8; A8MQD7; A8MRX4; Q8LDQ7;
DT 30-NOV-2016, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2000, sequence version 1.
DT 25-MAY-2022, entry version 134.
DE RecName: Full=RGG repeats nuclear RNA binding protein C {ECO:0000305};
GN Name=RGGC {ECO:0000305};
GN OrderedLocusNames=At5g47210 {ECO:0000312|Araport:AT5G47210};
GN ORFNames=MQL5.6 {ECO:0000312|EMBL:BAA97154.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10718197; DOI=10.1093/dnares/7.1.31;
RA Sato S., Nakamura Y., Kaneko T., Katoh T., Asamizu E., Kotani H.,
RA Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 5. X. Sequence
RT features of the regions of 3,076,755 bp covered by sixty P1 and TAC
RT clones.";
RL DNA Res. 7:31-63(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC STRAIN=cv. Columbia; TISSUE=Root;
RX PubMed=19423640; DOI=10.1093/dnares/dsp009;
RA Iida K., Fukami-Kobayashi K., Toyoda A., Sakaki Y., Kobayashi M., Seki M.,
RA Shinozaki K.;
RT "Analysis of multiple occurrences of alternative splicing events in
RT Arabidopsis thaliana using novel sequenced full-length cDNAs.";
RL DNA Res. 16:155-164(2009).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RA Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B.,
RA Feldmann K.A.;
RT "Full-length cDNA from Arabidopsis thaliana.";
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19376835; DOI=10.1104/pp.109.138677;
RA Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A.,
RA Grossmann J., Gruissem W., Baginsky S.;
RT "Large-scale Arabidopsis phosphoproteome profiling reveals novel
RT chloroplast kinase substrates and phosphorylation networks.";
RL Plant Physiol. 150:889-903(2009).
RN [8]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, CLEAVAGE OF INITIATOR
RP METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS SPECTROMETRY
RP [LARGE SCALE ANALYSIS].
RX PubMed=22223895; DOI=10.1074/mcp.m111.015131;
RA Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., Meinnel T.,
RA Giglione C.;
RT "Comparative large-scale characterisation of plant vs. mammal proteins
RT reveals similar and idiosyncratic N-alpha acetylation features.";
RL Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012).
CC -!- FUNCTION: Binds RNA. {ECO:0000250|UniProtKB:Q9SQ56}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q9SQ56}.
CC Cytoplasm, perinuclear region {ECO:0000250|UniProtKB:O23523}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q9LVT8-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9LVT8-2; Sequence=VSP_058645, VSP_058648;
CC Name=3;
CC IsoId=Q9LVT8-3; Sequence=VSP_058646, VSP_058647;
CC -!- SIMILARITY: Belongs to the RGGA protein family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB018117; BAA97154.1; -; Genomic_DNA.
DR EMBL; CP002688; AED95484.1; -; Genomic_DNA.
DR EMBL; CP002688; AED95485.1; -; Genomic_DNA.
DR EMBL; CP002688; AED95486.1; -; Genomic_DNA.
DR EMBL; AY140040; AAM98181.1; -; mRNA.
DR EMBL; BT010371; AAQ56814.1; -; mRNA.
DR EMBL; AK316963; BAH19662.1; -; mRNA.
DR EMBL; AK226441; BAE98584.1; -; mRNA.
DR EMBL; AY085859; AAM63072.1; -; mRNA.
DR RefSeq; NP_001078723.1; NM_001085254.1. [Q9LVT8-2]
DR RefSeq; NP_001078724.1; NM_001085255.1. [Q9LVT8-3]
DR RefSeq; NP_199532.1; NM_124092.4. [Q9LVT8-1]
DR AlphaFoldDB; Q9LVT8; -.
DR STRING; 3702.AT5G47210.1; -.
DR iPTMnet; Q9LVT8; -.
DR MetOSite; Q9LVT8; -.
DR PaxDb; Q9LVT8; -.
DR PRIDE; Q9LVT8; -.
DR ProteomicsDB; 236908; -. [Q9LVT8-1]
DR EnsemblPlants; AT5G47210.1; AT5G47210.1; AT5G47210. [Q9LVT8-1]
DR EnsemblPlants; AT5G47210.2; AT5G47210.2; AT5G47210. [Q9LVT8-2]
DR EnsemblPlants; AT5G47210.3; AT5G47210.3; AT5G47210. [Q9LVT8-3]
DR GeneID; 834767; -.
DR Gramene; AT5G47210.1; AT5G47210.1; AT5G47210. [Q9LVT8-1]
DR Gramene; AT5G47210.2; AT5G47210.2; AT5G47210. [Q9LVT8-2]
DR Gramene; AT5G47210.3; AT5G47210.3; AT5G47210. [Q9LVT8-3]
DR KEGG; ath:AT5G47210; -.
DR Araport; AT5G47210; -.
DR TAIR; locus:2171504; AT5G47210.
DR eggNOG; KOG2945; Eukaryota.
DR HOGENOM; CLU_033492_0_0_1; -.
DR InParanoid; Q9LVT8; -.
DR OMA; EEANGYQ; -.
DR PhylomeDB; Q9LVT8; -.
DR PRO; PR:Q9LVT8; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; Q9LVT8; baseline and differential.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; HDA:TAIR.
DR GO; GO:0009536; C:plastid; HDA:TAIR.
DR GO; GO:0003729; F:mRNA binding; IDA:TAIR.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR InterPro; IPR039764; HABP4/SERBP1.
DR InterPro; IPR006861; HABP4_PAIRBP1-bd.
DR InterPro; IPR019084; Stm1-like_N.
DR PANTHER; PTHR12299; PTHR12299; 1.
DR Pfam; PF04774; HABP4_PAI-RBP1; 1.
DR Pfam; PF09598; Stm1_N; 1.
DR SMART; SM01233; HABP4_PAI-RBP1; 1.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Cytoplasm; Nucleus; Phosphoprotein;
KW Reference proteome; RNA-binding.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0007744|PubMed:22223895"
FT CHAIN 2..357
FT /note="RGG repeats nuclear RNA binding protein C"
FT /id="PRO_0000438318"
FT DOMAIN 239..299
FT /note="FF"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01013"
FT REGION 25..232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 308..357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 82..96
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 145..159
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..232
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 2
FT /note="N-acetylalanine"
FT /evidence="ECO:0007744|PubMed:22223895"
FT MOD_RES 355
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:O23523"
FT VAR_SEQ 247..305
FT /note="LQATKVEERKVDTKVFESMQQLSNKKNTDEEIFIKLGSDKEKRKDATEKAKK
FT SLSINEF -> PAEEEVAVVVVKEETKGMQKKLQLRRLETQLSSLRWASKDPWSFSLAI
FT SVFRFSLVEFC (in isoform 2)"
FT /id="VSP_058645"
FT VAR_SEQ 299..301
FT /note="SLS -> VLH (in isoform 3)"
FT /id="VSP_058646"
FT VAR_SEQ 302..357
FT /note="Missing (in isoform 3)"
FT /id="VSP_058647"
FT VAR_SEQ 306..357
FT /note="Missing (in isoform 2)"
FT /id="VSP_058648"
FT CONFLICT 128
FT /note="R -> L (in Ref. 6; AAM63072)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 357 AA; 37999 MW; 44170422CB1FEC1D CRC64;
MASLNPFDLL GDDAEDPSQL AVALSQKVEK AAAAVQPPKA AKFPTKPAPP SQAVRESRNA
PQGGRGGTGG RGGFSRGRGN GGYNRDNRNN DAPGNENGFS GGYRRPSEDA DGASRGGSVG
GYRVGGGREG PRRGGVANGE SGDVERPPRN YDRHSRTGHG TGMKRNGGGR GNWGTTEDDI
PPTSEEPTTE VEKSPVAEKQ GGEDETPEAK KELTAEEKAQ KEAEEAEARE MTLEEYEKIL
EEKKKALQAT KVEERKVDTK VFESMQQLSN KKNTDEEIFI KLGSDKEKRK DATEKAKKSL
SINEFLKPAD GKRYNGRGGG SRGRGGRGGR GEGGNQRYAK EAAAPAIGDT AQFPSLG
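The ALTERNATIVE PRODUCTS comment and the VAR_SEQ features above describe isoforms 2 and 3 only as edits against the displayed sequence. The following is a minimal Python sketch of how those edits could be applied to rebuild the isoform sequences. The names `CANONICAL`, `ISOFORM_EDITS`, and `apply_var_seq` are illustrative assumptions and are not part of the entry or of any UniProt tool; coordinates are taken to be 1-based and inclusive, as in the FT lines.

```python
# Canonical (isoform 1) sequence, copied from the SQ block of this entry.
CANONICAL = "".join("""
MASLNPFDLL GDDAEDPSQL AVALSQKVEK AAAAVQPPKA AKFPTKPAPP SQAVRESRNA
PQGGRGGTGG RGGFSRGRGN GGYNRDNRNN DAPGNENGFS GGYRRPSEDA DGASRGGSVG
GYRVGGGREG PRRGGVANGE SGDVERPPRN YDRHSRTGHG TGMKRNGGGR GNWGTTEDDI
PPTSEEPTTE VEKSPVAEKQ GGEDETPEAK KELTAEEKAQ KEAEEAEARE MTLEEYEKIL
EEKKKALQAT KVEERKVDTK VFESMQQLSN KKNTDEEIFI KLGSDKEKRK DATEKAKKSL
SINEFLKPAD GKRYNGRGGG SRGRGGRGGR GEGGNQRYAK EAAAPAIGDT AQFPSLG
""".split())

# VAR_SEQ features as (start, end, expected_original, replacement).
# Coordinates are 1-based and inclusive; "" stands for "Missing";
# expected_original is None when the FT line does not spell it out.
ISOFORM_EDITS = {
    "Q9LVT8-2": [
        (247, 305,
         "LQATKVEERKVDTKVFESMQQLSNKKNTDEEIFIKLGSDKEKRKDATEKAKKSLSINEF",
         "PAEEEVAVVVVKEETKGMQKKLQLRRLETQLSSLRWASKDPWSFSLAISVFRFSLVEFC"),  # VSP_058645
        (306, 357, None, ""),  # VSP_058648
    ],
    "Q9LVT8-3": [
        (299, 301, "SLS", "VLH"),  # VSP_058646
        (302, 357, None, ""),      # VSP_058647
    ],
}


def apply_var_seq(canonical, edits):
    """Return the isoform sequence obtained by applying VAR_SEQ edits.

    Edits are applied from the highest start position to the lowest so that
    earlier coordinates stay valid after each substitution or deletion.
    """
    seq = canonical
    for start, end, expected, replacement in sorted(
        edits, key=lambda e: e[0], reverse=True
    ):
        # Check the stated original residues against the canonical sequence.
        if expected is not None and canonical[start - 1:end] != expected:
            raise ValueError(f"original residues at {start}..{end} do not match")
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


if __name__ == "__main__":
    for iso_id, edits in ISOFORM_EDITS.items():
        isoform = apply_var_seq(CANONICAL, edits)
        print(iso_id, len(isoform), "AA")
```

Under these assumptions the sketch yields 305 residues for Q9LVT8-2 and 301 residues for Q9LVT8-3; the authoritative isoform sequences are those distributed by UniProt and the RefSeq records cross-referenced above.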