ID   SIM1_MOUSE              Reviewed;         765 AA.
AC   Q61045; O70284; P70183;
DT   01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT   15-JUL-1999, sequence version 3.
DT   03-AUG-2022, entry version 177.
DE   RecName: Full=Single-minded homolog 1;
DE            Short=mSIM1;
GN   Name=Sim1;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RC   STRAIN=129/Sv, and Swiss Webster;
RC   TISSUE=Embryonic brain, and Embryonic stem cell;
RX   PubMed=8812055; DOI=10.1006/mcne.1996.0001;
RA   Fan C.-M., Kuwana E., Bulfone A., Fletcher C.F., Copeland N.G.,
RA   Jenkins N.A., Crews S., Martinez S., Puelles L., Rubenstein J.L.,
RA   Tessier-Lavigne M.;
RT   "Expression patterns of two murine homologs of Drosophila single-minded
RT   suggest possible roles in embryonic patterning and in the pathogenesis of
RT   Down syndrome.";
RL   Mol. Cell. Neurosci. 7:1-16(1996).
RN   [2]
RP   ERRATUM OF PUBMED:8812055.
RX   PubMed=8875433; DOI=10.1006/mcne.1996.0037;
RA   Fan C.-M., Kuwana E., Bulfone A., Fletcher C.F., Copeland N.G.,
RA   Jenkins N.A., Crews S., Martinez S., Puelles L., Rubenstein J.L.,
RA   Tessier-Lavigne M.;
RL   Mol. Cell. Neurosci. 7:519-519(1996).
RN   [3]
RP   SEQUENCE REVISION TO C-TERMINUS.
RX   PubMed=9199934; DOI=10.1101/gr.7.6.615;
RA   Chrast R., Scott H.S., Chen H., Kudoh J., Rossier C., Minoshima S.,
RA   Wang Y., Shimizu N., Antonarakis S.E.;
RT   "Cloning of two human homologs of the Drosophila single-minded gene SIM1 on
RT   chromosome 6q and SIM2 on 21q within the Down syndrome chromosomal
RT   region.";
RL   Genome Res. 7:615-624(1997).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RC   STRAIN=C57BL/6J;
RX   PubMed=8927054; DOI=10.1128/mcb.16.10.5865;
RA   Ema M., Morita M., Ikawa S., Tanaka M., Matsuda Y., Gotoh O., Saijoh Y.,
RA   Fujii H., Hamada H., Kikuchi Y., Fujii-Kuriyama Y.;
RT   "Two new members of the murine Sim gene family are transcriptional
RT   repressors and show different expression patterns during mouse
RT   embryogenesis.";
RL   Mol. Cell. Biol. 16:5865-5875(1996).
RN   [5]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC   STRAIN=129/Sv;
RA   Hosoya T.;
RT   "Mouse single-minded1 (mSim1) gene.";
RL   Submitted (APR-1998) to the EMBL/GenBank/DDBJ databases.
RN   [6]
RP   SUBUNIT.
RX   PubMed=9020169; DOI=10.1074/jbc.272.7.4451;
RA   Probst M.R., Fan C.-M., Tessier-Lavigne M., Hankinson O.;
RT   "Two murine homologs of the Drosophila single-minded protein that interact
RT   with the mouse aryl hydrocarbon receptor nuclear translocator protein.";
RL   J. Biol. Chem. 272:4451-4457(1997).
RN   [7]
RP   INTERACTION WITH ARNT AND ARNT2.
RX   PubMed=27782878; DOI=10.7554/elife.18790;
RA   Wu D., Su X., Potluri N., Kim Y., Rastinejad F.;
RT   "NPAS1-ARNT and NPAS3-ARNT crystal structures implicate the bHLH-PAS family
RT   as multi-ligand binding transcription factors.";
RL   Elife 5:0-0(2016).
CC   -!- FUNCTION: Transcriptional factor that may have pleiotropic effects
CC       during embryogenesis and in the adult.
CC   -!- SUBUNIT: Efficient DNA binding requires dimerization with another bHLH
CC       protein. Heterodimer; forms a heterodimer with ARNT, ARNT2.
CC       {ECO:0000269|PubMed:27782878, ECO:0000269|PubMed:9020169}.
CC   -!- INTERACTION:
CC       Q61045; P53762: Arnt; NbExp=4; IntAct=EBI-78890, EBI-78852;
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00632,
CC       ECO:0000255|PROSITE-ProRule:PRU00981}.
CC   -!- TISSUE SPECIFICITY: Detected in lung, skeletal muscle and kidney.
CC       During fetal development it is found in the CNS, developing kidney,
CC       mesodermal and endodermal tissues, including developing somites,
CC       mesonephric duct, and foregut.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAA91201.1; Type=Erroneous termination; Note=Truncated C-terminus.; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; U40575; AAA91201.1; ALT_SEQ; mRNA.
DR   EMBL; AF038857; AAC05481.1; -; Genomic_DNA.
DR   EMBL; AF038853; AAC05481.1; JOINED; Genomic_DNA.
DR   EMBL; AF038854; AAC05481.1; JOINED; Genomic_DNA.
DR   EMBL; AF038856; AAC05481.1; JOINED; Genomic_DNA.
DR   EMBL; AF044913; AAC05481.1; JOINED; Genomic_DNA.
DR   EMBL; AF038855; AAC05481.1; JOINED; Genomic_DNA.
DR   EMBL; D79209; BAA11467.1; -; mRNA.
DR   EMBL; AB013491; BAA28270.1; -; Genomic_DNA.
DR   CCDS; CCDS48556.1; -.
DR   RefSeq; NP_035506.2; NM_011376.3.
DR   RefSeq; XP_006512690.1; XM_006512627.3.
DR   AlphaFoldDB; Q61045; -.
DR   SMR; Q61045; -.
DR   BioGRID; 203254; 2.
DR   CORUM; Q61045; -.
DR   IntAct; Q61045; 3.
DR   MINT; Q61045; -.
DR   STRING; 10090.ENSMUSP00000020071; -.
DR   iPTMnet; Q61045; -.
DR   PhosphoSitePlus; Q61045; -.
DR   PaxDb; Q61045; -.
DR   PRIDE; Q61045; -.
DR   ProteomicsDB; 257250; -.
DR   Antibodypedia; 899; 162 antibodies from 25 providers.
DR   DNASU; 20464; -.
DR   Ensembl; ENSMUST00000020071; ENSMUSP00000020071; ENSMUSG00000019913.
DR   GeneID; 20464; -.
DR   KEGG; mmu:20464; -.
DR   UCSC; uc007fal.1; mouse.
DR   CTD; 6492; -.
DR   MGI; MGI:98306; Sim1.
DR   VEuPathDB; HostDB:ENSMUSG00000019913; -.
DR   eggNOG; KOG3559; Eukaryota.
DR   GeneTree; ENSGT00940000156143; -.
DR   HOGENOM; CLU_010044_4_0_1; -.
DR   InParanoid; Q61045; -.
DR   OMA; ANTSPCE; -.
DR   OrthoDB; 231698at2759; -.
DR   PhylomeDB; Q61045; -.
DR   TreeFam; TF317772; -.
DR   BioGRID-ORCS; 20464; 7 hits in 74 CRISPR screens.
DR   ChiTaRS; Sim1; mouse.
DR   PRO; PR:Q61045; -.
DR   Proteomes; UP000000589; Chromosome 10.
DR   RNAct; Q61045; protein.
DR   Bgee; ENSMUSG00000019913; Expressed in diencephalon lateral wall and 114 other tissues.
DR   ExpressionAtlas; Q61045; baseline and differential.
DR   Genevisible; Q61045; MM.
DR   GO; GO:0005634; C:nucleus; IDA:MGI.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0046982; F:protein heterodimerization activity; IDA:UniProtKB.
DR   GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   GO; GO:0006355; P:regulation of transcription, DNA-templated; IDA:MGI.
DR   GO; GO:0001657; P:ureteric bud development; IEP:UniProtKB.
DR   CDD; cd00130; PAS; 2.
DR   Gene3D; 4.10.280.10; -; 1.
DR   InterPro; IPR011598; bHLH_dom.
DR   InterPro; IPR036638; HLH_DNA-bd_sf.
DR   InterPro; IPR001610; PAC.
DR   InterPro; IPR000014; PAS.
DR   InterPro; IPR035965; PAS-like_dom_sf.
DR   InterPro; IPR013767; PAS_fold.
DR   InterPro; IPR013655; PAS_fold_3.
DR   InterPro; IPR010578; SIM_C.
DR   Pfam; PF00010; HLH; 1.
DR   Pfam; PF00989; PAS; 1.
DR   Pfam; PF08447; PAS_3; 1.
DR   Pfam; PF06621; SIM_C; 1.
DR   SMART; SM00353; HLH; 1.
DR   SMART; SM00086; PAC; 1.
DR   SMART; SM00091; PAS; 2.
DR   SUPFAM; SSF47459; SSF47459; 1.
DR   SUPFAM; SSF55785; SSF55785; 2.
DR   PROSITE; PS50888; BHLH; 1.
DR   PROSITE; PS50112; PAS; 2.
DR   PROSITE; PS51302; SIM_C; 1.
PE   1: Evidence at protein level;
KW   Developmental protein; Differentiation; DNA-binding; Neurogenesis; Nucleus;
KW   Reference proteome; Repeat; Transcription; Transcription regulation.
FT   CHAIN           1..765
FT                   /note="Single-minded homolog 1"
FT                   /id="PRO_0000127440"
FT   DOMAIN          1..53
FT                   /note="bHLH"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00981"
FT   DOMAIN          77..147
FT                   /note="PAS 1"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT   DOMAIN          218..288
FT                   /note="PAS 2"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT   DOMAIN          292..335
FT                   /note="PAC"
FT   DOMAIN          336..765
FT                   /note="Single-minded C-terminal"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00632"
FT   REGION          352..428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          527..560
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOTIF           368..387
FT                   /note="Nuclear localization signal"
FT                   /evidence="ECO:0000250"
FT   COMPBIAS        352..394
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        535..560
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CONFLICT        133
FT                   /note="H -> L (in Ref. 1; AAA91201/AAC05481)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        176
FT                   /note="Missing (in Ref. 1; AAA91201)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        322
FT                   /note="P -> R (in Ref. 1; AAA91201/AAC05481)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        480
FT                   /note="A -> P (in Ref. 1; AAA91201)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        537
FT                   /note="D -> S (in Ref. 1; AAA91201)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        698
FT                   /note="F -> K (in Ref. 1; AAA91201)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   765 AA;  85541 MW;  B1A7F7DA8578CD17 CRC64;
     MKEKSKNAAR TRREKENSEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRVVFPEGL
     GEAWGHTSRT SPLDNVGREL GSHLLQTLDG FIFVVAPDGK IMYISETASV HLGLSQVELT
     GNSIYEYIHP ADHDEMTAVL TAHQPYHSHF VQEYEIERSF FLRMKCVLAK RNAGLTCGGY
     KVIHCSGYLK IRQYSLDMSP FDGCYQNVGL VAVGHSLPPS AVTEIKLHSN MFMFRASLDM
     KLIFLDSRVA ELTGYEPQDL IEKTLYHHVH GCDTFHLRCA HHLLLVKGQV TTKYYRFLAK
     QGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDTEYKGL QLSLDQISAS KPTFSYTSSS
     TPTISDNRKG AKSRLSSSKS KSRTSPYPQY SGFHTERSES DHDSQWGGSP LTDTASPQLL
     DPERPGSQHE LSCAYRQFPD RSSLCYGFAL DHSRLVEDRH FHTQACEGGR CEAGRYFLGA
     PPTGRDPWWG SRAALPLTKA SPESREAYEN SMPHITSIHR IHGRGHWDED SVVSSPDPGS
     ASESGDRYRT EQYQNSPHEP SKIETLIRAT QQMIKEEENR LQLRKAPPDQ LASINGAGKK
     HSLCFANYQQ PPPTGEVCHS SALASTSPCD HIQQREGKML SPHENDYDNS PTALSRISSP
     SSDRITKSSL ILAKDYLHSD MSPHQTAGDH PAISPNCFGS HRQYFDKHAY TLTGYALEHL
     YDSETIRNYS LGCNGSHFDV TSHLRMQPDP AQGHKGTSVI ITNGS
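
The FT (feature table) lines above can be read programmatically. What follows is a minimal sketch, not an official parser: it assumes each feature header is a key followed by `start..end` or a single position, and that each `/note` qualifier fits on one line, which holds for this entry but not for every UniProtKB record. The names `FT_EXCERPT` and `parse_features` are hypothetical, introduced here only for illustration; the excerpt copies the DOMAIN features from this entry with the evidence qualifiers left out.

```python
import re

# DOMAIN features copied from the FT lines of this entry
# (evidence qualifiers omitted for brevity).
FT_EXCERPT = '''\
FT   DOMAIN          1..53
FT                   /note="bHLH"
FT   DOMAIN          77..147
FT                   /note="PAS 1"
FT   DOMAIN          218..288
FT                   /note="PAS 2"
FT   DOMAIN          292..335
FT                   /note="PAC"
FT   DOMAIN          336..765
FT                   /note="Single-minded C-terminal"
'''


def parse_features(text):
    """Return a list of dicts with keys: type, start, end, note.

    Simplification (an assumption of this sketch): qualifiers other than
    single-line /note are ignored.
    """
    features = []
    for line in text.splitlines():
        if not line.startswith("FT"):
            continue
        body = line[2:].strip()
        # Feature header: KEY followed by "start..end" or one position.
        header = re.match(r"^([A-Z_]+)\s+(\d+)(?:\.\.(\d+))?$", body)
        if header:
            start = int(header.group(2))
            end = int(header.group(3)) if header.group(3) else start
            features.append(
                {"type": header.group(1), "start": start, "end": end, "note": None}
            )
            continue
        # Attach a /note qualifier to the most recent feature.
        note = re.match(r'^/note="(.*)"$', body)
        if note and features:
            features[-1]["note"] = note.group(1)
    return features


domains = [f for f in parse_features(FT_EXCERPT) if f["type"] == "DOMAIN"]
for d in domains:
    # Coordinates are 1-based and inclusive, so length = end - start + 1.
    print(d["note"], d["start"], d["end"], d["end"] - d["start"] + 1)
```

Because the coordinates are 1-based and inclusive, the C-terminal domain at 336..765 ends at the last residue of the 765 AA chain.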
 
 