SIM2_MOUSE
ID SIM2_MOUSE Reviewed; 657 AA.
AC Q61079; O35391; Q61046; Q61904;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1997, sequence version 1.
DT 03-AUG-2022, entry version 179.
DE RecName: Full=Single-minded homolog 2;
DE AltName: Full=SIM transcription factor;
DE Short=mSIM;
GN Name=Sim2;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC TISSUE=Fetal kidney;
RX PubMed=8661115; DOI=10.1006/geno.1996.0333;
RA Moffett P., Dayo M., Reece M., McCormack M.K., Pelletier J.;
RT "Characterization of msim, a murine homologue of the Drosophila sim
RT transcription factor.";
RL Genomics 35:144-155(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=C57BL/6J;
RX PubMed=8561800; DOI=10.1006/bbrc.1996.0104;
RA Ema M., Suzuki M., Morita M., Hirose K., Sogawa K., Matsuda Y., Gotoh O.,
RA Saijoh Y., Fujii H., Hamada H., Fujii-Kuriyama Y.;
RT "cDNA cloning of a murine homologue of Drosophila single-minded, its mRNA
RT expression in mouse development, and chromosome localization.";
RL Biochem. Biophys. Res. Commun. 218:588-594(1996).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RC STRAIN=129/Sv, and Swiss Webster; TISSUE=Embryonic brain;
RX PubMed=8812055; DOI=10.1006/mcne.1996.0001;
RA Fan C.-M., Kuwana E., Bulfone A., Fletcher C.F., Copeland N.G.,
RA Jenkins N.A., Crews S., Martinez S., Puelles L., Rubenstein J.L.,
RA Tessier-Lavigne M.;
RT "Expression patterns of two murine homologs of Drosophila single-minded
RT suggest possible roles in embryonic patterning and in the pathogenesis of
RT Down syndrome.";
RL Mol. Cell. Neurosci. 7:1-16(1996).
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=ICR X Swiss Webster; TISSUE=Embryo;
RX PubMed=8661114; DOI=10.1006/geno.1996.0332;
RA Yamaki A., Noda S., Kudoh J., Shindoh N., Maeda H., Minoshima S.,
RA Kawasaki K., Shimizu Y., Shimizu N.;
RT "The mammalian single-minded (SIM) gene: mouse cDNA structure and
RT diencephalic expression indicate a candidate gene for Down syndrome.";
RL Genomics 35:136-143(1996).
RN [5]
RP SUBUNIT.
RX PubMed=9020169; DOI=10.1074/jbc.272.7.4451;
RA Probst M.R., Fan C.-M., Tessier-Lavigne M., Hankinson O.;
RT "Two murine homologs of the Drosophila single-minded protein that interact
RT with the mouse aryl hydrocarbon receptor nuclear translocator protein.";
RL J. Biol. Chem. 272:4451-4457(1997).
CC -!- FUNCTION: Transcription factor that may act as a master regulator of
CC CNS development in cooperation with Arnt. It may have pleiotropic effects
CC in the tissues in which it is expressed during development.
CC -!- SUBUNIT: Efficient DNA binding requires dimerization with another bHLH
CC protein. Heterodimer of SIM2 and ARNT. {ECO:0000269|PubMed:9020169}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00632,
CC ECO:0000255|PROSITE-ProRule:PRU00981}.
CC -!- TISSUE SPECIFICITY: Transcripts were detected at high levels in kidney,
CC followed by skeletal muscle and lung. Low levels were found in testis,
CC brain and heart. In early fetal development, it is found in the CNS,
CC developing kidney, tongue epithelium and cartilage primordia.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA91202.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U42554; AAB19098.1; -; mRNA.
DR EMBL; D63383; BAA09700.1; -; mRNA.
DR EMBL; U40576; AAA91202.1; ALT_FRAME; mRNA.
DR EMBL; AF023873; AAB84099.1; -; Genomic_DNA.
DR EMBL; AF023864; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023865; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023869; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023871; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023870; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023868; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023867; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023866; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; AF023872; AAB84099.1; JOINED; Genomic_DNA.
DR EMBL; D64135; BAA11013.1; -; mRNA.
DR CCDS; CCDS37406.1; -.
DR RefSeq; NP_035507.2; NM_011377.2.
DR AlphaFoldDB; Q61079; -.
DR SMR; Q61079; -.
DR BioGRID; 203255; 1.
DR CORUM; Q61079; -.
DR IntAct; Q61079; 2.
DR STRING; 10090.ENSMUSP00000072043; -.
DR iPTMnet; Q61079; -.
DR PhosphoSitePlus; Q61079; -.
DR PaxDb; Q61079; -.
DR PRIDE; Q61079; -.
DR ProteomicsDB; 261236; -.
DR Antibodypedia; 23108; 176 antibodies from 30 providers.
DR DNASU; 20465; -.
DR Ensembl; ENSMUST00000072182; ENSMUSP00000072043; ENSMUSG00000062713.
DR GeneID; 20465; -.
DR KEGG; mmu:20465; -.
DR UCSC; uc008aae.1; mouse.
DR CTD; 6493; -.
DR MGI; MGI:98307; Sim2.
DR VEuPathDB; HostDB:ENSMUSG00000062713; -.
DR eggNOG; KOG3559; Eukaryota.
DR GeneTree; ENSGT00940000159985; -.
DR HOGENOM; CLU_010044_4_1_1; -.
DR InParanoid; Q61079; -.
DR OMA; SECQWHY; -.
DR OrthoDB; 231698at2759; -.
DR PhylomeDB; Q61079; -.
DR TreeFam; TF317772; -.
DR BioGRID-ORCS; 20465; 3 hits in 74 CRISPR screens.
DR ChiTaRS; Sim2; mouse.
DR PRO; PR:Q61079; -.
DR Proteomes; UP000000589; Chromosome 16.
DR RNAct; Q61079; protein.
DR Bgee; ENSMUSG00000062713; Expressed in esophagus and 94 other tissues.
DR ExpressionAtlas; Q61079; baseline and differential.
DR Genevisible; Q61079; MM.
DR GO; GO:0016604; C:nuclear body; ISO:MGI.
DR GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR GO; GO:0005634; C:nucleus; IDA:MGI.
DR GO; GO:0003677; F:DNA binding; IDA:MGI.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046982; F:protein heterodimerization activity; IPI:UniProtKB.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0009880; P:embryonic pattern specification; IMP:MGI.
DR GO; GO:0030324; P:lung development; IMP:MGI.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IGI:MGI.
DR GO; GO:0045892; P:negative regulation of transcription, DNA-templated; IDA:MGI.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; -; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; SSF47459; 1.
DR SUPFAM; SSF55785; SSF55785; 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 1: Evidence at protein level;
KW Developmental protein; Differentiation; DNA-binding; Neurogenesis; Nucleus;
KW Reference proteome; Repeat; Transcription; Transcription regulation.
FT CHAIN 1..657
FT /note="Single-minded homolog 2"
FT /id="PRO_0000127442"
FT DOMAIN 1..53
FT /note="bHLH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00981"
FT DOMAIN 77..147
FT /note="PAS 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 218..288
FT /note="PAC"
FT DOMAIN 218..288
FT /note="PAS 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 336..657
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00632"
FT REGION 354..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 612..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 367..386
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000250"
FT COMPBIAS 354..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 263
FT /note="K -> R (in Ref. 2; BAA09700 and 3; AAB84099)"
FT /evidence="ECO:0000305"
FT CONFLICT 336
FT /note="E -> G (in Ref. 3; AAA91202)"
FT /evidence="ECO:0000305"
FT CONFLICT 501
FT /note="S -> T (in Ref. 3; AAA91202)"
FT /evidence="ECO:0000305"
FT CONFLICT 512
FT /note="S -> P (in Ref. 3; AAB84099)"
FT /evidence="ECO:0000305"
FT CONFLICT 541
FT /note="P -> R (in Ref. 3; AAA91202)"
FT /evidence="ECO:0000305"
FT CONFLICT 561..585
FT /note="APRQASRDAARLALARAPPECCAPP -> VLARRPGRARCMWES (in
FT Ref. 3; AAA91202)"
FT /evidence="ECO:0000305"
FT CONFLICT 590..591
FT /note="QA -> HG (in Ref. 3; AAA91202)"
FT /evidence="ECO:0000305"
FT CONFLICT 638
FT /note="A -> R (in Ref. 2; BAA09700 and 3; AAB84099)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 657 AA; 72513 MW; C7904CD24C0ABBAF CRC64;
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRAVFPEGL
GDAWGQPSRT GPLDSVAKEL GSHLLQTLDG FVFVVASDGK IMYISETASV HLGLSQVELT
GNSIYEYIHP SDHDEMTAVL TAHPPLHHHL LQEYEIERSF FLRMKCVLAK RNAGLTCSGY
KVIHCSGYLK IRQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLHSN MFMFRASLDL
KLIFLDSRVT ELTGYEPQDL IEKTLYHHVH GCDTFHLRYA HHLLLVKGQV TTKYYRLLSK
LGGWVWVQSY ATVVHNSRSS RPHCIVSVNY VLTDVEYKEL QLSLDQVSTS KSQESWRTTL
STSQETRKSA KPKNTKMKTK LRTNPYPPQQ YSSFQMDKLE CSQVGNWRTS PPTNAVAPPE
QQLHSEASDL LYGPPYSLPF SYHYGHFPLD SHVFSSKKPG LPAKFGQPQG SPCEVARFFL
STLPASSECQ WHCANSLVPS SSSPAKNLSE PSPVNAARHG LVPNYEAPSA AARRFCEDPA
PPSFPSCGHY REEPALGPAK APRQASRDAA RLALARAPPE CCAPPAPEPQ APAQLPFVLL
NYHRVLARRG PLGSAAPGAP EAAGSLRPRH PGPVAASAPG APRPHYLGAS VIITNGR