SIM1A_DANRE
ID SIM1A_DANRE Reviewed; 745 AA.
AC Q98SJ5;
DT 01-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 114.
DE RecName: Full=Single-minded homolog 1-A;
GN Name=sim1a; Synonyms=sim1;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], AND DEVELOPMENTAL STAGE.
RC TISSUE=Embryo;
RX PubMed=11493543; DOI=10.1242/dev.128.12.2233;
RA Serluca F.C., Fishman M.C.;
RT "Pre-pattern in the pronephric kidney field of zebrafish.";
RL Development 128:2233-2241(2001).
RN [2]
RP FUNCTION, DEVELOPMENTAL STAGE, AND TISSUE SPECIFICITY.
RX PubMed=16691572; DOI=10.1002/dvdy.20848;
RA Eaton J.L., Glasgow E.;
RT "The zebrafish bHLH PAS transcriptional regulator, single-minded 1 (sim1),
RT is required for isotocin cell development.";
RL Dev. Dyn. 235:2071-2082(2006).
RN [3]
RP FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=18330923; DOI=10.1002/dvdy.21503;
RA Eaton J.L., Holmqvist B., Glasgow E.;
RT "Ontogeny of vasotocin-expressing cells in zebrafish: selective requirement
RT for the transcriptional regulators orthopedia and single-minded 1 in the
RT preoptic area.";
RL Dev. Dyn. 237:995-1005(2008).
CC -!- FUNCTION: Transcription factor that may have pleiotropic effects
CC during embryogenesis and in the adult. {ECO:0000269|PubMed:16691572,
CC ECO:0000269|PubMed:18330923}.
CC -!- SUBUNIT: Efficient DNA binding requires dimerization with another bHLH
CC protein. Heterodimer of sim1a and arnt (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00632,
CC ECO:0000255|PROSITE-ProRule:PRU00981}.
CC -!- TISSUE SPECIFICITY: Expressed in embryonic forebrain at the
CC eleven-somite stage. Detected in brain throughout embryonic development.
CC {ECO:0000269|PubMed:16691572, ECO:0000269|PubMed:18330923}.
CC -!- DEVELOPMENTAL STAGE: First detected at the two-somite stage. Detected
CC in intermediate mesoderm. {ECO:0000269|PubMed:11493543,
CC ECO:0000269|PubMed:16691572}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY028626; AAK27261.1; -; mRNA.
DR AlphaFoldDB; Q98SJ5; -.
DR SMR; Q98SJ5; -.
DR STRING; 7955.ENSDARP00000033085; -.
DR PaxDb; Q98SJ5; -.
DR PRIDE; Q98SJ5; -.
DR ZFIN; ZDB-GENE-020829-1; sim1a.
DR eggNOG; KOG3559; Eukaryota.
DR InParanoid; Q98SJ5; -.
DR PhylomeDB; Q98SJ5; -.
DR PRO; PR:Q98SJ5; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0007411; P:axon guidance; IMP:ZFIN.
DR GO; GO:0021536; P:diencephalon development; IMP:ZFIN.
DR GO; GO:0071542; P:dopaminergic neuron differentiation; IMP:ZFIN.
DR GO; GO:0021979; P:hypothalamus cell differentiation; IMP:ZFIN.
DR GO; GO:0035776; P:pronephric proximal tubule development; IMP:ZFIN.
DR GO; GO:0048793; P:pronephros development; IMP:ZFIN.
DR GO; GO:0072020; P:proximal straight tubule development; IMP:ZFIN.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; -; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; SSF47459; 1.
DR SUPFAM; SSF55785; SSF55785; 2.
DR TIGRFAMs; TIGR00229; sensory_box; 1.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; Differentiation; DNA-binding; Neurogenesis; Nucleus;
KW Reference proteome; Repeat; Transcription; Transcription regulation.
FT CHAIN 1..745
FT /note="Single-minded homolog 1-A"
FT /id="PRO_0000343424"
FT DOMAIN 1..53
FT /note="bHLH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00981"
FT DOMAIN 77..147
FT /note="PAS 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 218..288
FT /note="PAS 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 336..745
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00632"
FT REGION 350..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 529..563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 368..387
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000250"
FT COMPBIAS 350..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..413
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 745 AA; 82734 MW; 8D93E633D1C8715C CRC64;
MKEKSKNAGR TRREKENSEF YELAKLLPLP SAITSQSDKA SIIRLTTSYL KMRIVFPEGL
GESWGHVSRT TSLENVGREL GSHLLQTLDG FIFVVAPDGK ILYISETASV HLGLSQEELT
GNSIYEYIHP ADHDEMTAVL TAHQPYHSHF VHEYEMERSF FLRMKCVLAK ANAGLTCGGY
KVIHCSGYLK IRQYSLDMSP FDGCYQNVGL VAVGHSLPPS AVTEIKLHSN MFMFRASLDM
KLIFLDSRVA ELTGYEPQDL IEKTLYHHVH SCDTFHLRCA HHLLLVKGQV TTKYYRFLAK
QGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDTEYKGL QLSLDQAAST KPSFTYNSPS
NPVTENRRVG KSRVSRTKTK TRLSPYSQYP GFPTDRSESD QDSPWGGSPL TDSASPQLLE
QCEGIESSCV YRQFSDPRSL CYGLPLTEDH HTSNELYSHP HSESCERGCC KAGRYFLGTP
QPGREAWWGA ARSVLPLPKS SPENGDSFEG VSPHIASIHS LQVRGHWDED SAVSSAPDGG
SASDSGDRFR ADQCRSSPQE PSKIETLIRA TQQMIKEEES RLQLRKLPTD VPLEPTNSLA
KSFHSTDFPQ SAMQSVVCRG PAQVISPAPS PVPLSRLSSP LPDRLSKGKD FLQNELSSSQ
LPLTGTCAVS PTPALYSLHP RQYLEKHAAY SLTSYALEHL YEADTFRGYS LGCSGSSHYD
MTTHLRKAEQ APGHKGTSVI ITNGS
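
The FT DOMAIN lines in this entry can be read programmatically. The following is a minimal Python sketch, not part of the UniProt entry, that pulls the domain coordinates and notes out of a short excerpt of the feature table. The names RECORD and parse_domains are hypothetical, and the parser assumes the single-space layout shown in this listing rather than the fixed-column layout of the official flat file.

```python
import re

# Excerpt copied from the feature table of this entry (DOMAIN features only).
RECORD = """\
ID SIM1A_DANRE Reviewed; 745 AA.
AC Q98SJ5;
FT DOMAIN 1..53
FT /note="bHLH"
FT DOMAIN 77..147
FT /note="PAS 1"
FT DOMAIN 218..288
FT /note="PAS 2"
FT DOMAIN 336..745
FT /note="Single-minded C-terminal"
"""


def parse_domains(text):
    """Return a list of (start, end, note) tuples, one per FT DOMAIN feature."""
    domains = []
    pending = None  # coordinates of a DOMAIN line still waiting for its /note
    for line in text.splitlines():
        if not line.startswith("FT"):
            pending = None
            continue
        body = line[2:].strip()
        key = re.match(r"DOMAIN\s+(\d+)\.\.(\d+)$", body)
        if key:
            pending = (int(key.group(1)), int(key.group(2)))
            continue
        note = re.match(r'/note="(.*)"$', body)
        if note and pending is not None:
            domains.append((pending[0], pending[1], note.group(1)))
            pending = None
        elif not body.startswith("/"):
            # A different feature key (e.g. REGION) ends the pending DOMAIN.
            pending = None
    return domains


if __name__ == "__main__":
    for start, end, note in parse_domains(RECORD):
        # Coordinates are 1-based and inclusive, so length is end - start + 1.
        print(f"{note}: residues {start}..{end} ({end - start + 1} aa)")
```

Because the coordinates are 1-based and inclusive, the same tuples can be used to slice the sequence with `seq[start - 1:end]` once the spaces in the SQ block have been removed.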