SEM2_CAEEL
ID SEM2_CAEEL Reviewed; 404 AA.
AC Q8T3B9;
DT 02-NOV-2016, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2002, sequence version 1.
DT 03-AUG-2022, entry version 127.
DE RecName: Full=Transcription factor sem-2 {ECO:0000305|PubMed:21307099};
DE AltName: Full=Sex muscle abnormal protein 2 {ECO:0000312|WormBase:C32E12.5};
GN Name=sem-2 {ECO:0000312|WormBase:C32E12.5};
GN ORFNames=C32E12.5 {ECO:0000312|WormBase:C32E12.5};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239 {ECO:0000312|Proteomes:UP000001940};
RN [1] {ECO:0000312|Proteomes:UP000001940}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000312|Proteomes:UP000001940};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2] {ECO:0000305}
RP FUNCTION, SUBCELLULAR LOCATION, DEVELOPMENTAL STAGE, AND DISRUPTION
RP PHENOTYPE.
RX PubMed=21307099; DOI=10.1242/dev.062240;
RA Tian C., Shi H., Colledge C., Stern M., Waterston R., Liu J.;
RT "The C. elegans SoxC protein SEM-2 opposes differentiation factors to
RT promote a proliferative blast cell fate in the postembryonic mesoderm.";
RL Development 138:1033-1043(2011).
RN [3] {ECO:0000305}
RP FUNCTION, AND DEVELOPMENTAL STAGE.
RX PubMed=26153233; DOI=10.1242/dev.125740;
RA Vidal B., Santella A., Serrano-Saiz E., Bao Z., Chuang C.F., Hobert O.;
RT "C. elegans SoxB genes are dispensable for embryonic neurogenesis but
RT required for terminal differentiation of specific neuron types.";
RL Development 142:2464-2477(2015).
CC -!- FUNCTION: Probable transcription factor required for embryogenesis,
CC vulval development and cell fate specification of the postembryonic
CC mesoderm (also known as the M lineage) (PubMed:21307099). Specifically,
CC required for the specification of sex myoblast cells and their
CC development into the muscles that are necessary for egg-laying
CC (PubMed:21307099). In addition, may be involved in RME GABAergic motor
CC neuron progenitor cell fate specification (PubMed:26153233).
CC {ECO:0000269|PubMed:21307099, ECO:0000269|PubMed:26153233}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:21307099}.
CC -!- DEVELOPMENTAL STAGE: First expressed in blastomeres in early
CC gastrulating embryos (PubMed:21307099, PubMed:26153233). Expression
CC continues throughout embryonic development with expression in neuronal
CC and non-neuronal progenitors (PubMed:21307099, PubMed:26153233). During
CC larval development, it is expressed in cells including vulval,
CC hypodermal and intestinal cells, and is only expressed in RMH motor
CC neurons (PubMed:21307099, PubMed:26153233). First expressed in the
CC progenitor sex myoblasts at the 16-M cell stage of mesoderm development
CC in hermaphrodite larvae and thereafter in descendants
CC (PubMed:21307099). {ECO:0000269|PubMed:21307099,
CC ECO:0000269|PubMed:26153233}.
CC -!- DISRUPTION PHENOTYPE: Embryonic lethal whereby embryos arrest at the
CC three-fold stage of embryogenesis. Embryos initially develop normally
CC until the late comma stage where a delay in the elongation process
CC culminates in severe morphological defects. RNAi-mediated knockdown
CC results in 98% egg-laying defective mutants with muscles lacking the
CC egg-laying muscle specific protein egl-15. In addition, mutants also
CC exhibit a multiple vulvae phenotype with animals having two or three
CC vulvae. Double knockdown with let-381, sys-1, hlh-1 or fozi-1 results
CC in no sex myoblast production. {ECO:0000269|PubMed:21307099}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284601; CCD61581.1; -; Genomic_DNA.
DR RefSeq; NP_740846.1; NM_170859.3.
DR AlphaFoldDB; Q8T3B9; -.
DR SMR; Q8T3B9; -.
DR IntAct; Q8T3B9; 22.
DR STRING; 6239.C32E12.5.1; -.
DR EPD; Q8T3B9; -.
DR PaxDb; Q8T3B9; -.
DR PeptideAtlas; Q8T3B9; -.
DR EnsemblMetazoa; C32E12.5.1; C32E12.5.1; WBGene00004771.
DR EnsemblMetazoa; C32E12.5.2; C32E12.5.2; WBGene00004771.
DR EnsemblMetazoa; C32E12.5.3; C32E12.5.3; WBGene00004771.
DR GeneID; 172162; -.
DR KEGG; cel:CELE_C32E12.5; -.
DR UCSC; C32E12.5.1; c. elegans.
DR CTD; 172162; -.
DR WormBase; C32E12.5; CE30497; WBGene00004771; sem-2.
DR eggNOG; KOG0527; Eukaryota.
DR HOGENOM; CLU_681941_0_0_1; -.
DR InParanoid; Q8T3B9; -.
DR OMA; EFGHAPL; -.
DR OrthoDB; 1242009at2759; -.
DR Reactome; R-CEL-3769402; Deactivation of the beta-catenin transactivating complex.
DR SignaLink; Q8T3B9; -.
DR PRO; PR:Q8T3B9; -.
DR Proteomes; UP000001940; Chromosome I.
DR Bgee; WBGene00004771; Expressed in pharyngeal muscle cell (C elegans) and 16 other tissues.
DR GO; GO:0005634; C:nucleus; IDA:WormBase.
DR GO; GO:0003677; F:DNA binding; TAS:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR GO; GO:0007501; P:mesodermal cell fate specification; IDA:UniProtKB.
DR GO; GO:0048626; P:myoblast fate specification; IMP:UniProtKB.
DR GO; GO:0040019; P:positive regulation of embryonic development; IMP:UniProtKB.
DR GO; GO:0048337; P:positive regulation of mesodermal cell fate specification; IMP:UniProtKB.
DR GO; GO:1901046; P:positive regulation of oviposition; IMP:UniProtKB.
DR GO; GO:0048582; P:positive regulation of post-embryonic development; IMP:UniProtKB.
DR GO; GO:0040026; P:positive regulation of vulval development; IMP:UniProtKB.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IBA:GO_Central.
DR Gene3D; 1.10.30.10; -; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; SSF47095; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Nucleus; Reference proteome;
KW Transcription.
FT CHAIN 1..404
FT /note="Transcription factor sem-2"
FT /evidence="ECO:0000305"
FT /id="PRO_0000437965"
FT DNA_BIND 93..161
FT /note="HMG box"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00267"
FT REGION 158..218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 321..359
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 327..359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 404 AA; 44625 MW; 010536C86F28C5AA CRC64;
MDLQKPPNFM LDCGMAPHMM PPINWAAAAI AVASSTSGAT NATSSNSVAT SQQLQHHPYG
TAAGGYKHAQ QAIPKSVTPY SDATNCKKSS NHIKRPMNAF MVWSQMERRK ICEHQPDMHN
AEISKQLGSR WRSLTDEEKA PFVAEAERLR VCHMQEYPDY KYKPRKKPKK NPDGTLQQPA
QPQAPQQQQA PPRGASPQAR QRKRPNTDQQ SETQQFQNFK SVKVEQDWMG NAHMSHAQKM
PFHPSYPSPS EFGHAPLTPE SGFYDDYFTQ QHHQQHFASQ HHNSAGSPLR MTNLGMDMGM
PPQMMGHNSG FGAGNHPFYL HTSPPSVDQD DMRSLSSGSS GYADCSASEQ STSSPNSAGV
VTMATAATTT THLDDLEQIC PTVTTGELVN YPWSDALGID INFS