SOMI1_CAEEL
ID SOMI1_CAEEL Reviewed; 582 AA.
AC A0A486WWJ9; A0A486WWS8; G5EBR5; Q8I4H1;
DT 10-FEB-2021, integrated into UniProtKB/Swiss-Prot.
DT 05-JUN-2019, sequence version 1.
DT 03-AUG-2022, entry version 10.
DE RecName: Full=Zinc finger protein somi-1 {ECO:0000305};
DE AltName: Full=Suppressor of overexpressed micro-RNA protein 1 {ECO:0000312|WormBase:M04G12.4c};
GN Name=somi-1 {ECO:0000312|WormBase:M04G12.4c};
GN ORFNames=M04G12.4 {ECO:0000312|WormBase:M04G12.4c};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239 {ECO:0000312|Proteomes:UP000001940};
RN [1] {ECO:0000312|Proteomes:UP000001940}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000312|Proteomes:UP000001940};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2] {ECO:0000305}
RP FUNCTION, INTERACTION WITH SWSN-9, SUBCELLULAR LOCATION, TISSUE
RP SPECIFICITY, DEVELOPMENTAL STAGE, AND MUTAGENESIS OF 44-GLN--LEU-582;
RP 175-GLN--VAL-582 AND 333-ARG--VAL-582.
RX PubMed=21979920; DOI=10.1101/gad.17153811;
RA Hayes G.D., Riedel C.G., Ruvkun G.;
RT "The Caenorhabditis elegans SOMI-1 zinc finger protein and SWI/SNF promote
RT regulation of development by the mir-84 microRNA.";
RL Genes Dev. 25:2079-2092(2011).
CC -!- FUNCTION: DNA-binding protein which binds to the promoters of let-60,
CC lin-14 and lin-28, possibly to regulate genes involved in hypodermal
CC and vulval development (PubMed:21979920). Together with miRNAs mir-84
CC and let-7 may direct terminal differentiation of the seam cells, exit
CC from the molting cycle, and vulva formation (PubMed:21979920). Does not
CC regulate the expression of mir-84 (PubMed:21979920). May promote
CC hypodermal differentiation in association with swsn-9, a component of
CC SWI/SNF chromatin remodeling complexes (PubMed:21979920).
CC {ECO:0000269|PubMed:21979920}.
CC -!- SUBUNIT: May interact with swsn-9; the interaction promotes hypodermal
CC differentiation. {ECO:0000269|PubMed:21979920}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:21979920}.
CC Note=Localizes to nuclear foci in embryos and larvae (PubMed:21979920).
CC Localizes to DNA (PubMed:21979920). Partially co-localizes with swsn-9
CC in hypodermal nuclei (PubMed:21979920). {ECO:0000269|PubMed:21979920}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=c {ECO:0000312|WormBase:M04G12.4c};
CC IsoId=A0A486WWJ9-1; Sequence=Displayed;
CC Name=a {ECO:0000312|WormBase:M04G12.4a};
CC IsoId=A0A486WWJ9-2; Sequence=VSP_060872;
CC Name=b {ECO:0000312|WormBase:M04G12.4b};
CC IsoId=A0A486WWJ9-3; Sequence=VSP_060871, VSP_060872;
CC Name=d {ECO:0000312|WormBase:M04G12.4d};
CC IsoId=A0A486WWJ9-4; Sequence=VSP_060871;
CC -!- TISSUE SPECIFICITY: Expressed in hypodermal seam cells, the somatic
CC gonad and vulval precursor cells, body wall muscle and head neurons.
CC {ECO:0000269|PubMed:21979920}.
CC -!- DEVELOPMENTAL STAGE: Expressed from embryogenesis to adulthood
CC (PubMed:21979920). First expressed in comma stage embryos
CC (PubMed:21979920). Highly expressed in L4 larvae and adults
CC (PubMed:21979920). {ECO:0000269|PubMed:21979920}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284605; CAB03211.3; -; Genomic_DNA.
DR EMBL; BX284605; CAD56592.2; -; Genomic_DNA.
DR EMBL; BX284605; VGM69540.1; -; Genomic_DNA.
DR EMBL; BX284605; VGM69580.1; -; Genomic_DNA.
DR PIR; T23722; T23722.
DR RefSeq; NP_506320.3; NM_073919.5. [A0A486WWJ9-2]
DR RefSeq; NP_872161.2; NM_182361.4.
DR AlphaFoldDB; A0A486WWJ9; -.
DR IntAct; A0A486WWJ9; 3.
DR STRING; 6239.M04G12.4a; -.
DR EnsemblMetazoa; M04G12.4a.1; M04G12.4a.1; WBGene00010868. [A0A486WWJ9-2]
DR EnsemblMetazoa; M04G12.4b.1; M04G12.4b.1; WBGene00010868. [A0A486WWJ9-3]
DR EnsemblMetazoa; M04G12.4c.1; M04G12.4c.1; WBGene00010868. [A0A486WWJ9-1]
DR EnsemblMetazoa; M04G12.4d.1; M04G12.4d.1; WBGene00010868. [A0A486WWJ9-4]
DR GeneID; 179819; -.
DR KEGG; cel:CELE_M04G12.4; -.
DR UCSC; M04G12.4b.1; c. elegans.
DR CTD; 179819; -.
DR WormBase; M04G12.4a; CE36495; WBGene00010868; somi-1. [A0A486WWJ9-2]
DR WormBase; M04G12.4b; CE36496; WBGene00010868; somi-1. [A0A486WWJ9-3]
DR WormBase; M04G12.4c; CE53045; WBGene00010868; somi-1. [A0A486WWJ9-1]
DR WormBase; M04G12.4d; CE53125; WBGene00010868; somi-1. [A0A486WWJ9-4]
DR eggNOG; ENOG502SA6V; Eukaryota.
DR HOGENOM; CLU_471120_0_0_1; -.
DR OMA; EDAYICD; -.
DR OrthoDB; 798321at2759; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00010868; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR ExpressionAtlas; A0A486WWJ9; baseline and differential.
DR GO; GO:0005634; C:nucleus; IDA:WormBase.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
PE 1: Evidence at protein level;
KW Alternative splicing; Differentiation; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..582
FT /note="Zinc finger protein somi-1"
FT /id="PRO_0000451822"
FT ZN_FING 454..477
FT /note="C2H2-type; Degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 179..251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 352..377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 513..582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 186..247
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 537..569
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..29
FT /note="Missing (in isoform b and isoform d)"
FT /evidence="ECO:0000305"
FT /id="VSP_060871"
FT VAR_SEQ 483..487
FT /note="KSLKE -> KK (in isoform a and isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_060872"
FT MUTAGEN 44..582
FT /note="Missing: In mg431; suppresses the eversion of the
FT vulva (the Evl phenotype) in an miR-84 overexpressing
FT background."
FT /evidence="ECO:0000269|PubMed:21979920"
FT MUTAGEN 175..582
FT /note="Missing: In mg415; adults have a shorter body
FT length. Suppresses the eversion of the vulva (the Evl
FT phenotype) and precocious alae formation in an miR-84
FT overexpressing background."
FT /evidence="ECO:0000269|PubMed:21979920"
FT MUTAGEN 333..582
FT /note="Missing: In mg431; suppresses the eversion of the
FT vulva (the Evl phenotype) in an miR-84 overexpressing
FT background."
FT /evidence="ECO:0000269|PubMed:21979920"
SQ SEQUENCE 582 AA; 63536 MW; B5B85C2AC0099102 CRC64;
MLKPPIITSN DNNNTKVAEN LNDLNNKGKM SGQQIESFSP WHAQTSSSAV TGTSELFGST
YAMLSDHSVY PEQWSGKQLS QSVLFEQPQI QPLVGNSYDP PVRFDPPYAY RATATGYMPT
VPGLSTNSSP YYPRTSGYAA GQQFYAPSLS GVPNTQQLIL AAQVAQASNV QQQLQQQVLR
PEPLRPATQK STNGVHRSTS NSSAETLRNN SVSAATVSPS DDNSLNSPAL TSSGSAGSGT
PPLGIDLNNT DLESGDEERV MCMACRGVYP SRRSLTGHIG RNEKCREIIG RNYLDALAQG
VNPPIPGTDA AIKSGAITTG ADGMSPVCPF CDRFISHYKG NIRRHINQCR KSAEPMKRHR
VEAHEKQSPK KKVKKEQNEM YQHEYNDHDS SSMSGGMMNS PKISPPSSSF YGANSSDLCS
PGEYSNSAYE PYPTPMLENT ERTSTETAVL QDAYICEDCD FVTVYKGNMK RHLNTCHPQP
EFKSLKEWDQ KLEGMRASNL GISGDRLQER LAAHKANSSR GRKPRKKKEN NTEESESIDF
KNILNSETGA LLESLASSSS SMGGYSNGNN FQPPPPPPPM LL