位置:首页 > 蛋白库 > RGGC_ARATH
RGGC_ARATH
ID   RGGC_ARATH              Reviewed;         357 AA.
AC   Q9LVT8; A8MQD7; A8MRX4; Q8LDQ7;
DT   30-NOV-2016, integrated into UniProtKB/Swiss-Prot.
DT   01-OCT-2000, sequence version 1.
DT   25-MAY-2022, entry version 134.
DE   RecName: Full=RGG repeats nuclear RNA binding protein C {ECO:0000305};
GN   Name=RGGC {ECO:0000305};
GN   OrderedLocusNames=At5g47210 {ECO:0000312|Araport:AT5G47210};
GN   ORFNames=MQL5.6 {ECO:0000312|EMBL:BAA97154.1};
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia;
RX   PubMed=10718197; DOI=10.1093/dnares/7.1.31;
RA   Sato S., Nakamura Y., Kaneko T., Katoh T., Asamizu E., Kotani H.,
RA   Tabata S.;
RT   "Structural analysis of Arabidopsis thaliana chromosome 5. X. Sequence
RT   features of the regions of 3,076,755 bp covered by sixty P1 and TAC
RT   clones.";
RL   DNA Res. 7:31-63(2000).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia;
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC   STRAIN=cv. Columbia;
RX   PubMed=14593172; DOI=10.1126/science.1088305;
RA   Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA   Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA   Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA   Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA   Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA   Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA   Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA   Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA   Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA   Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA   Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA   Ecker J.R.;
RT   "Empirical analysis of transcriptional activity in the Arabidopsis
RT   genome.";
RL   Science 302:842-846(2003).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC   STRAIN=cv. Columbia; TISSUE=Root;
RX   PubMed=19423640; DOI=10.1093/dnares/dsp009;
RA   Iida K., Fukami-Kobayashi K., Toyoda A., Sakaki Y., Kobayashi M., Seki M.,
RA   Shinozaki K.;
RT   "Analysis of multiple occurrences of alternative splicing events in
RT   Arabidopsis thaliana using novel sequenced full-length cDNAs.";
RL   DNA Res. 16:155-164(2009).
RN   [5]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC   STRAIN=cv. Columbia;
RA   Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA   Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA   Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA   Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA   Shinozaki K.;
RT   "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL   Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN   [6]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RA   Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B.,
RA   Feldmann K.A.;
RT   "Full-length cDNA from Arabidopsis thaliana.";
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [7]
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX   PubMed=19376835; DOI=10.1104/pp.109.138677;
RA   Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A.,
RA   Grossmann J., Gruissem W., Baginsky S.;
RT   "Large-scale Arabidopsis phosphoproteome profiling reveals novel
RT   chloroplast kinase substrates and phosphorylation networks.";
RL   Plant Physiol. 150:889-903(2009).
RN   [8]
RP   ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, CLEAVAGE OF INITIATOR
RP   METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS SPECTROMETRY
RP   [LARGE SCALE ANALYSIS].
RX   PubMed=22223895; DOI=10.1074/mcp.m111.015131;
RA   Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., Meinnel T.,
RA   Giglione C.;
RT   "Comparative large-scale characterisation of plant vs. mammal proteins
RT   reveals similar and idiosyncratic N-alpha acetylation features.";
RL   Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012).
CC   -!- FUNCTION: Binds RNA. {ECO:0000250|UniProtKB:Q9SQ56}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q9SQ56}.
CC       Cytoplasm, perinuclear region {ECO:0000250|UniProtKB:O23523}.
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=3;
CC       Name=1;
CC         IsoId=Q9LVT8-1; Sequence=Displayed;
CC       Name=2;
CC         IsoId=Q9LVT8-2; Sequence=VSP_058645, VSP_058648;
CC       Name=3;
CC         IsoId=Q9LVT8-3; Sequence=VSP_058646, VSP_058647;
CC   -!- SIMILARITY: Belongs to the RGGA protein family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AB018117; BAA97154.1; -; Genomic_DNA.
DR   EMBL; CP002688; AED95484.1; -; Genomic_DNA.
DR   EMBL; CP002688; AED95485.1; -; Genomic_DNA.
DR   EMBL; CP002688; AED95486.1; -; Genomic_DNA.
DR   EMBL; AY140040; AAM98181.1; -; mRNA.
DR   EMBL; BT010371; AAQ56814.1; -; mRNA.
DR   EMBL; AK316963; BAH19662.1; -; mRNA.
DR   EMBL; AK226441; BAE98584.1; -; mRNA.
DR   EMBL; AY085859; AAM63072.1; -; mRNA.
DR   RefSeq; NP_001078723.1; NM_001085254.1. [Q9LVT8-2]
DR   RefSeq; NP_001078724.1; NM_001085255.1. [Q9LVT8-3]
DR   RefSeq; NP_199532.1; NM_124092.4. [Q9LVT8-1]
DR   AlphaFoldDB; Q9LVT8; -.
DR   STRING; 3702.AT5G47210.1; -.
DR   iPTMnet; Q9LVT8; -.
DR   MetOSite; Q9LVT8; -.
DR   PaxDb; Q9LVT8; -.
DR   PRIDE; Q9LVT8; -.
DR   ProteomicsDB; 236908; -. [Q9LVT8-1]
DR   EnsemblPlants; AT5G47210.1; AT5G47210.1; AT5G47210. [Q9LVT8-1]
DR   EnsemblPlants; AT5G47210.2; AT5G47210.2; AT5G47210. [Q9LVT8-2]
DR   EnsemblPlants; AT5G47210.3; AT5G47210.3; AT5G47210. [Q9LVT8-3]
DR   GeneID; 834767; -.
DR   Gramene; AT5G47210.1; AT5G47210.1; AT5G47210. [Q9LVT8-1]
DR   Gramene; AT5G47210.2; AT5G47210.2; AT5G47210. [Q9LVT8-2]
DR   Gramene; AT5G47210.3; AT5G47210.3; AT5G47210. [Q9LVT8-3]
DR   KEGG; ath:AT5G47210; -.
DR   Araport; AT5G47210; -.
DR   TAIR; locus:2171504; AT5G47210.
DR   eggNOG; KOG2945; Eukaryota.
DR   HOGENOM; CLU_033492_0_0_1; -.
DR   InParanoid; Q9LVT8; -.
DR   OMA; EEANGYQ; -.
DR   PhylomeDB; Q9LVT8; -.
DR   PRO; PR:Q9LVT8; -.
DR   Proteomes; UP000006548; Chromosome 5.
DR   ExpressionAtlas; Q9LVT8; baseline and differential.
DR   GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR   GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR   GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0005886; C:plasma membrane; HDA:TAIR.
DR   GO; GO:0009536; C:plastid; HDA:TAIR.
DR   GO; GO:0003729; F:mRNA binding; IDA:TAIR.
DR   GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR   InterPro; IPR039764; HABP4/SERBP1.
DR   InterPro; IPR006861; HABP4_PAIRBP1-bd.
DR   InterPro; IPR019084; Stm1-like_N.
DR   PANTHER; PTHR12299; PTHR12299; 1.
DR   Pfam; PF04774; HABP4_PAI-RBP1; 1.
DR   Pfam; PF09598; Stm1_N; 1.
DR   SMART; SM01233; HABP4_PAI-RBP1; 1.
PE   1: Evidence at protein level;
KW   Acetylation; Alternative splicing; Cytoplasm; Nucleus; Phosphoprotein;
KW   Reference proteome; RNA-binding.
FT   INIT_MET        1
FT                   /note="Removed"
FT                   /evidence="ECO:0007744|PubMed:22223895"
FT   CHAIN           2..357
FT                   /note="RGG repeats nuclear RNA binding protein C"
FT                   /id="PRO_0000438318"
FT   DOMAIN          239..299
FT                   /note="FF"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU01013"
FT   REGION          25..232
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          308..357
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        82..96
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        145..159
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        191..232
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOD_RES         2
FT                   /note="N-acetylalanine"
FT                   /evidence="ECO:0007744|PubMed:22223895"
FT   MOD_RES         355
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:O23523"
FT   VAR_SEQ         247..305
FT                   /note="LQATKVEERKVDTKVFESMQQLSNKKNTDEEIFIKLGSDKEKRKDATEKAKK
FT                   SLSINEF -> PAEEEVAVVVVKEETKGMQKKLQLRRLETQLSSLRWASKDPWSFSLAI
FT                   SVFRFSLVEFC (in isoform 2)"
FT                   /id="VSP_058645"
FT   VAR_SEQ         299..301
FT                   /note="SLS -> VLH (in isoform 3)"
FT                   /id="VSP_058646"
FT   VAR_SEQ         302..357
FT                   /note="Missing (in isoform 3)"
FT                   /id="VSP_058647"
FT   VAR_SEQ         306..357
FT                   /note="Missing (in isoform 2)"
FT                   /id="VSP_058648"
FT   CONFLICT        128
FT                   /note="R -> L (in Ref. 6; AAM63072)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   357 AA;  37999 MW;  44170422CB1FEC1D CRC64;
     MASLNPFDLL GDDAEDPSQL AVALSQKVEK AAAAVQPPKA AKFPTKPAPP SQAVRESRNA
     PQGGRGGTGG RGGFSRGRGN GGYNRDNRNN DAPGNENGFS GGYRRPSEDA DGASRGGSVG
     GYRVGGGREG PRRGGVANGE SGDVERPPRN YDRHSRTGHG TGMKRNGGGR GNWGTTEDDI
     PPTSEEPTTE VEKSPVAEKQ GGEDETPEAK KELTAEEKAQ KEAEEAEARE MTLEEYEKIL
     EEKKKALQAT KVEERKVDTK VFESMQQLSN KKNTDEEIFI KLGSDKEKRK DATEKAKKSL
     SINEFLKPAD GKRYNGRGGG SRGRGGRGGR GEGGNQRYAK EAAAPAIGDT AQFPSLG
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024