SMN_BOVIN
ID   SMN_BOVIN               Reviewed;         287 AA.
AC   O18870; O46481; O62700; Q9TSJ9;
DT   15-JUL-1998, integrated into UniProtKB/Swiss-Prot.
DT   30-MAY-2000, sequence version 2.
DT   03-AUG-2022, entry version 137.
DE   RecName: Full=Survival motor neuron protein;
GN   Name=SMN1; Synonyms=SMN;
OS   Bos taurus (Bovine).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Bovinae; Bos.
OX   NCBI_TaxID=9913;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RX   PubMed=9925919; DOI=10.1159/000015162;
RA   Pietrowski D., Goldammer T., Meinert S., Schwerin M., Forster M.;
RT   "Description and physical localization of the bovine survival of motor
RT   neuron gene (SMN).";
RL   Cytogenet. Cell Genet. 83:39-42(1998).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [MRNA] OF 35-123.
RC   STRAIN=Brown Swiss;
RA   Rieder S., Joerg H., Neuenschwander S., Meijerink E., Stranzinger G.;
RT   "A bovine sequence homologous to human and murine survival motor neuron
RT   gene (SMN).";
RL   Submitted (OCT-1997) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 90-171.
RA   Nonneman D., Shibuya H., Kappes S., Steffen D., Johnson G.S.;
RT   "Bovine survival motor neuron gene.";
RL   Submitted (JUL-1997) to the EMBL/GenBank/DDBJ databases.
RN   [4]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 99-141.
RX   PubMed=9800343;
RA   Eggen A., Masabanda J., Pfister-Genskow M., Fries R., Bishop M.D.;
RT   "The bovine survival of motor neuron gene (SMN) maps to bovine chromosome
RT   20q14.";
RL   Anim. Genet. 29:408-409(1998).
CC   -!- FUNCTION: The SMN complex catalyzes the assembly of small nuclear
CC       ribonucleoproteins (snRNPs), the building blocks of the spliceosome,
CC       and thereby plays an important role in the splicing of cellular pre-
CC       mRNAs. Most spliceosomal snRNPs contain a common set of Sm proteins
CC       SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE, SNRPF and SNRPG that assemble in
CC       a heptameric protein ring on the Sm site of the small nuclear RNA to
CC       form the core snRNP (Sm core). In the cytosol, the Sm proteins SNRPD1,
CC       SNRPD2, SNRPE, SNRPF and SNRPG are trapped in an inactive 6S pICln-Sm
CC       complex by the chaperone CLNS1A that controls the assembly of the core
CC       snRNP. To assemble core snRNPs, the SMN complex accepts the trapped 5Sm
CC       proteins from CLNS1A forming an intermediate. Binding of snRNA inside
CC       5Sm ultimately triggers eviction of the SMN complex, thereby allowing
CC       binding of SNRPD3 and SNRPB to complete assembly of the core snRNP.
CC       Within the SMN complex, SMN1 acts as a structural backbone and together
CC       with GEMIN2 it gathers the Sm complex subunits. Ensures the correct
CC       splicing of U12 intron-containing genes that may be important for
CC       normal motor and proprioceptive neuron development. Also required for
CC       resolving RNA-DNA hybrids created by RNA polymerase II, that form R-
CC       loop in transcription terminal regions, an important step in proper
CC       transcription termination. May also play a role in the metabolism of
CC       small nucleolar ribonucleoprotein (snoRNPs).
CC       {ECO:0000250|UniProtKB:Q16637}.
CC   -!- SUBUNIT: Homooligomer; may form higher order homooligomers in the dimer
CC       to octamer range. Part of the core SMN complex that contains SMN1,
CC       GEMIN2/SIP1, DDX20/GEMIN3, GEMIN4, GEMIN5, GEMIN6, GEMIN7, GEMIN8 and
CC       STRAP/UNRIP. Part of the SMN-Sm complex that contains SMN1,
CC       GEMIN2/SIP1, DDX20/GEMIN3, GEMIN4, GEMIN5, GEMIN6, GEMIN7, GEMIN8,
CC       STRAP/UNRIP and the Sm proteins SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE,
CC       SNRPF and SNRPG. Component of an import snRNP complex composed of
CC       KPNB1, RNUT1, SMN1 and ZNF259. Interacts with DDX20, FBL, NOLA1, RNUT1,
CC       SYNCRIP and with several spliceosomal snRNP core Sm proteins, including
CC       SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE and ILF3. Interacts with GEMIN2;
CC       the interaction is direct. Interacts with GEMIN3; the interaction is
CC       direct. Interacts with GEMIN8; the interaction is direct. Interacts
CC       with SNRPB; the interaction is direct. Interacts (via Tudor domain)
CC       with SNRPD1 (via C-terminus); the interaction is direct. Interacts with
CC       SNRPD2; the interaction is direct. Interacts (via Tudor domain) with
CC       SNRPD3 (via C-terminus); the interaction is direct. Interacts with
CC       SNRPE; the interaction is direct. Interacts with OSTF1, LSM10, LSM11
CC       and RPP20/POP7. Interacts (via C-terminal region) with ZPR1 (via C-
CC       terminal region). Interacts (via Tudor domain) with COIL. Interacts
CC       with SETX; recruits SETX to POLR2A. Interacts with POLR2A (via the C-
CC       terminal domain (CTD)). Interacts with PRMT5. Interacts with XRN2.
CC       Interacts (via C-terminus) with FMR1 (via C-terminus); the interaction
CC       is direct and occurs in a RNA-independent manner. Interacts (via Tudor
CC       domain) with SF3B2 ('Arg-508'-methylated form). Interacts with
CC       WRAP53/TCAB1. Interacts (via Tudor domain) with ELAVL4 in an RNA-
CC       independent manner; the interaction is required for localization of
CC       ELAVL4 to RNA granules. Interacts with FRG1.
CC       {ECO:0000250|UniProtKB:Q16637}.
CC   -!- SUBCELLULAR LOCATION: Nucleus, gem {ECO:0000250|UniProtKB:Q16637}.
CC       Nucleus, Cajal body {ECO:0000250|UniProtKB:Q16637}. Cytoplasm
CC       {ECO:0000250|UniProtKB:Q16637}. Cytoplasmic granule
CC       {ECO:0000250|UniProtKB:Q16637}. Perikaryon
CC       {ECO:0000250|UniProtKB:Q16637}. Cell projection, neuron projection
CC       {ECO:0000250|UniProtKB:Q16637}. Cell projection, axon
CC       {ECO:0000250|UniProtKB:P97801}. Cytoplasm, myofibril, sarcomere, Z line
CC       {ECO:0000250|UniProtKB:P97801}. Note=Colocalizes with actin and at the
CC       Z-line of skeletal muscle (By similarity). Under stress conditions
CC       colocalizes with RPP20/POP7 in punctate cytoplasmic granules.
CC       Colocalized and redistributed with ZPR1 from the cytoplasm to nuclear
CC       gems (Gemini of coiled bodies) and Cajal bodies. Colocalizes with FMR1
CC       in cytoplasmic granules in the soma and neurite cell processes (By
CC       similarity). {ECO:0000250|UniProtKB:P97801,
CC       ECO:0000250|UniProtKB:Q16637}.
CC   -!- DOMAIN: The Tudor domain mediates association with dimethylarginines,
CC       which are common in snRNP proteins. {ECO:0000250|UniProtKB:Q16637}.
CC   -!- SIMILARITY: Belongs to the SMN family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AF035322; AAC17995.1; -; Genomic_DNA.
DR   EMBL; AF035323; AAC63439.1; -; mRNA.
DR   EMBL; AF026810; AAB80943.1; -; mRNA.
DR   EMBL; AF016590; AAC04667.1; -; Genomic_DNA.
DR   EMBL; AF034259; AAD01979.1; -; Genomic_DNA.
DR   RefSeq; NP_783632.1; NM_175701.1.
DR   AlphaFoldDB; O18870; -.
DR   SMR; O18870; -.
DR   STRING; 9913.ENSBTAP00000007547; -.
DR   iPTMnet; O18870; -.
DR   PaxDb; O18870; -.
DR   PRIDE; O18870; -.
DR   Ensembl; ENSBTAT00000007547; ENSBTAP00000007547; ENSBTAG00000005743.
DR   GeneID; 281492; -.
DR   KEGG; bta:281492; -.
DR   CTD; 6607; -.
DR   VEuPathDB; HostDB:ENSBTAG00000005743; -.
DR   eggNOG; KOG4327; Eukaryota.
DR   GeneTree; ENSGT00940000153352; -.
DR   HOGENOM; CLU_077852_0_0_1; -.
DR   InParanoid; O18870; -.
DR   OrthoDB; 1316275at2759; -.
DR   TreeFam; TF318390; -.
DR   Reactome; R-BTA-191859; snRNP Assembly.
DR   Proteomes; UP000009136; Chromosome 20.
DR   Bgee; ENSBTAG00000005743; Expressed in oocyte and 106 other tissues.
DR   ExpressionAtlas; O18870; baseline and differential.
DR   GO; GO:0030424; C:axon; IEA:UniProtKB-SubCell.
DR   GO; GO:0015030; C:Cajal body; ISS:UniProtKB.
DR   GO; GO:0005737; C:cytoplasm; ISS:UniProtKB.
DR   GO; GO:0036464; C:cytoplasmic ribonucleoprotein granule; ISS:UniProtKB.
DR   GO; GO:0005829; C:cytosol; ISS:UniProtKB.
DR   GO; GO:0097504; C:Gemini of coiled bodies; ISS:UniProtKB.
DR   GO; GO:0043005; C:neuron projection; ISS:UniProtKB.
DR   GO; GO:0005654; C:nucleoplasm; ISS:UniProtKB.
DR   GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR   GO; GO:0043204; C:perikaryon; ISS:UniProtKB.
DR   GO; GO:0032797; C:SMN complex; ISS:UniProtKB.
DR   GO; GO:0034719; C:SMN-Sm protein complex; ISS:UniProtKB.
DR   GO; GO:0030018; C:Z disc; IEA:UniProtKB-SubCell.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0006353; P:DNA-templated transcription, termination; ISS:UniProtKB.
DR   GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR   GO; GO:0000387; P:spliceosomal snRNP assembly; ISS:UniProtKB.
DR   CDD; cd04508; TUDOR; 1.
DR   InterPro; IPR010304; SMN_Tudor.
DR   InterPro; IPR002999; Tudor.
DR   Pfam; PF06003; SMN; 1.
DR   SMART; SM00333; TUDOR; 1.
DR   PROSITE; PS50304; TUDOR; 1.
PE   2: Evidence at transcript level;
KW   Cell projection; Cytoplasm; Isopeptide bond; mRNA processing;
KW   mRNA splicing; Neurogenesis; Nucleus; Phosphoprotein; Reference proteome;
KW   RNA-binding; Ubl conjugation.
FT   CHAIN           1..287
FT                   /note="Survival motor neuron protein"
FT                   /id="PRO_0000218901"
FT   DOMAIN          86..146
FT                   /note="Tudor"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00211"
FT   REGION          1..28
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          9..40
FT                   /note="P1 (binding site for GEMIN2)"
FT                   /evidence="ECO:0000250"
FT   REGION          51..86
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          92..204
FT                   /note="Required for interaction with RPP20/POP7"
FT                   /evidence="ECO:0000250"
FT   REGION          149..221
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          234..261
FT                   /note="P2 (binding site for SM B)"
FT                   /evidence="ECO:0000250"
FT   REGION          273..287
FT                   /note="Required for interaction with SYNCRIP"
FT                   /evidence="ECO:0000250"
FT   COMPBIAS        149..183
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOD_RES         21
FT                   /note="Phosphothreonine"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   MOD_RES         24
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   MOD_RES         27
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   MOD_RES         65
FT                   /note="Phosphothreonine"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   MOD_RES         80
FT                   /note="Phosphothreonine; by PKA"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   CROSSLNK        47
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   CROSSLNK        204
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q16637"
FT   CONFLICT        61
FT                   /note="K -> E (in Ref. 2; AAB80943)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        64
FT                   /note="G -> A (in Ref. 2; AAB80943)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   287 AA;  31327 MW;  B1B7EFFCE2682A78 CRC64;
     MGGGGGGFPE PEDSVLFRRG TGESDDSDVW DDTALIKAYD KAVASFKHAL KNGDISEASE
     KPKGTPKRKS AKNKSQRKNT TSPSKQWKVG DNCCAIWSED GCIYPATIAS IDFKRETCVV
     VYTGYGNREE QNLSDLLSPT SEVANIEQNA QENENESQIS TDESENSSRS PLNKPNNIRS
     RAAPWNSFLP PPPHMPRSGL GPGKSGLNFS GPPPPPPPPP HFLSRWLPPF PAGPPMIPPP
     PPICPDSLDD ADALGSMLIS WYMSGYHTGY YMGFKQSQKE GRYSHFN
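
The feature table above gives 1-based residue positions, so its annotations can be checked against the sequence block. The following is a minimal sketch in Python, not part of the UniProt record: the helper names (`parse_sequence`, `residue_at`, `check_sites`) are hypothetical, and the expected residues are only those implied by the MOD_RES, CROSSLNK and CONFLICT notes (phosphothreonine implies T, phosphoserine implies S, a lysine isopeptide implies K, and "K -> E" / "G -> A" imply K and G in the reference sequence).

```python
import re

# Sequence block copied verbatim from the SQ section of this entry.
SQ_BLOCK = """
     MGGGGGGFPE PEDSVLFRRG TGESDDSDVW DDTALIKAYD KAVASFKHAL KNGDISEASE
     KPKGTPKRKS AKNKSQRKNT TSPSKQWKVG DNCCAIWSED GCIYPATIAS IDFKRETCVV
     VYTGYGNREE QNLSDLLSPT SEVANIEQNA QENENESQIS TDESENSSRS PLNKPNNIRS
     RAAPWNSFLP PPPHMPRSGL GPGKSGLNFS GPPPPPPPPP HFLSRWLPPF PAGPPMIPPP
     PPICPDSLDD ADALGSMLIS WYMSGYHTGY YMGFKQSQKE GRYSHFN
"""

# Residues implied by the FT lines (1-based position -> one-letter code).
EXPECTED_SITES = {
    21: "T",   # MOD_RES Phosphothreonine
    24: "S",   # MOD_RES Phosphoserine
    27: "S",   # MOD_RES Phosphoserine
    65: "T",   # MOD_RES Phosphothreonine
    80: "T",   # MOD_RES Phosphothreonine; by PKA
    47: "K",   # CROSSLNK Glycyl lysine isopeptide
    204: "K",  # CROSSLNK Glycyl lysine isopeptide
    61: "K",   # CONFLICT K -> E (in Ref. 2)
    64: "G",   # CONFLICT G -> A (in Ref. 2)
}


def parse_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for flat-file layout."""
    return re.sub(r"\s+", "", block)


def residue_at(seq: str, pos: int) -> str:
    """Return the residue at a 1-based position, as used in FT lines."""
    return seq[pos - 1]


def check_sites(seq: str, expected: dict) -> dict:
    """Map each position to True if the sequence carries the expected residue."""
    return {pos: residue_at(seq, pos) == aa for pos, aa in expected.items()}


sequence = parse_sequence(SQ_BLOCK)
# FT DOMAIN 86..146 is inclusive at both ends, hence the slice [85:146].
tudor_domain = sequence[86 - 1:146]

print(len(sequence))
print(check_sites(sequence, EXPECTED_SITES))
print(len(tudor_domain))
```

The length check compares the parsed sequence with the "287 AA" stated on the ID and SQ lines. The sketch does not recompute the molecular weight or the CRC64 checksum.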
 
 