ID HM22_CAEEL Reviewed; 346 AA.
AC P41936; A1EHR3; A1EHR4; Q19908;
DT 01-NOV-1995, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1995, sequence version 1.
DT 03-AUG-2022, entry version 144.
DE RecName: Full=Homeobox protein ceh-22;
GN Name=ceh-22; ORFNames=F29F11.5;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA], AND DEVELOPMENTAL STAGE.
RC STRAIN=Bristol N2;
RX PubMed=7925019; DOI=10.1242/dev.120.8.2175;
RA Okkema P.G., Fire A.;
RT "The Caenorhabditis elegans NK-2 class homeoprotein CEH-22 is involved in
RT combinatorial activation of gene expression in pharyngeal muscle.";
RL Development 120:2175-2186(1994).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [3]
RP FUNCTION (ISOFORM B), DEVELOPMENTAL STAGE, DOMAIN, AND DISRUPTION
RP PHENOTYPE.
RX PubMed=24346701; DOI=10.1242/dev.090746;
RA Shibata Y., Sawa H., Nishiwaki K.;
RT "HTZ-1/H2A.z and MYS-1/MYST HAT act redundantly to maintain cell fates in
RT somatic gonadal cells through repression of ceh-22 in C. elegans.";
RL Development 141:209-218(2014).
CC -!- FUNCTION: Involved in combinatorial activation of gene expression in
CC pharyngeal muscle. Specifically binds a site necessary for activity of
CC the B subelement of the myo-2 enhancer.
CC -!- FUNCTION: [Isoform b]: Regulates distal tip cell fate.
CC {ECO:0000269|PubMed:24346701}.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=a {ECO:0000312|WormBase:F29F11.5a};
CC IsoId=P41936-1; Sequence=Displayed;
CC Name=b {ECO:0000312|WormBase:F29F11.5b};
CC IsoId=P41936-2; Sequence=VSP_057963;
CC Name=c {ECO:0000312|WormBase:F29F11.5c};
CC IsoId=P41936-3; Sequence=VSP_057962;
CC -!- DEVELOPMENTAL STAGE: First expressed prior to myogenic differentiation;
CC expression continues throughout embryonic and larval development and is
CC most abundant in embryos. Levels decrease throughout development, and a
CC low level is found in adults (PubMed:7925019). Expressed in distal tip
CC cells (DTC) until the L4 larval stage (PubMed:24346701).
CC {ECO:0000269|PubMed:24346701, ECO:0000269|PubMed:7925019}.
CC -!- DOMAIN: The homeobox domain is required for the induction of distal tip
CC cell fate. {ECO:0000269|PubMed:24346701}.
CC -!- DISRUPTION PHENOTYPE: RNAi-mediated knockdown in a bet-1 mutant
CC background prevents the formation of extra distal tip cells (DTC)
CC during gonad development. {ECO:0000269|PubMed:24346701}.
CC -!- SIMILARITY: Belongs to the NK-2 homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U10080; AAA20840.1; -; mRNA.
DR EMBL; U10081; AAA20841.1; -; Genomic_DNA.
DR EMBL; Z73974; CAA98272.1; -; Genomic_DNA.
DR EMBL; BX284605; CAL90891.1; -; Genomic_DNA.
DR EMBL; BX284605; CAL90892.1; -; Genomic_DNA.
DR PIR; T21552; T21552.
DR RefSeq; NP_001076742.1; NM_001083273.2.
DR RefSeq; NP_001076743.1; NM_001083274.1. [P41936-3]
DR RefSeq; NP_001076744.1; NM_001083275.1. [P41936-2]
DR AlphaFoldDB; P41936; -.
DR SMR; P41936; -.
DR BioGRID; 44514; 8.
DR IntAct; P41936; 8.
DR STRING; 6239.F29F11.5a; -.
DR EPD; P41936; -.
DR PaxDb; P41936; -.
DR PeptideAtlas; P41936; -.
DR EnsemblMetazoa; F29F11.5a.1; F29F11.5a.1; WBGene00000445.
DR EnsemblMetazoa; F29F11.5b.1; F29F11.5b.1; WBGene00000445. [P41936-3]
DR EnsemblMetazoa; F29F11.5c.1; F29F11.5c.1; WBGene00000445. [P41936-2]
DR GeneID; 179485; -.
DR UCSC; F29F11.5a; c. elegans.
DR CTD; 179485; -.
DR WormBase; F29F11.5a; CE05771; WBGene00000445; ceh-22.
DR WormBase; F29F11.5b; CE40611; WBGene00000445; ceh-22. [P41936-3]
DR WormBase; F29F11.5c; CE40612; WBGene00000445; ceh-22. [P41936-2]
DR eggNOG; KOG0842; Eukaryota.
DR GeneTree; ENSGT00940000166323; -.
DR HOGENOM; CLU_802231_0_0_1; -.
DR InParanoid; P41936; -.
DR SignaLink; P41936; -.
DR PRO; PR:P41936; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00000445; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR ExpressionAtlas; P41936; baseline and differential.
DR GO; GO:0005634; C:nucleus; IDA:WormBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0043565; F:sequence-specific DNA binding; IDA:WormBase.
DR GO; GO:0007568; P:aging; IMP:UniProtKB.
DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR GO; GO:0001708; P:cell fate specification; IDA:UniProtKB.
DR GO; GO:0035262; P:gonad morphogenesis; IMP:UniProtKB.
DR GO; GO:0043282; P:pharyngeal muscle development; IMP:WormBase.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IMP:WormBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW Nucleus; Reference proteome.
FT CHAIN 1..346
FT /note="Homeobox protein ceh-22"
FT /id="PRO_0000048992"
FT DNA_BIND 189..248
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..68
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 135..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..158
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..180
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..131
FT /note="Missing (in isoform c)"
FT /evidence="ECO:0000305"
FT /id="VSP_057962"
FT VAR_SEQ 1..71
FT /note="MFNVSALRAATPSIASVSSVASPSEQHGLSTSVGVGVNDTTSRTGDGGAASS
FT ASSASAAPQQQSQSALHNK -> MQTYAFSR (in isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_057963"
FT CONFLICT 8
FT /note="R -> A (in Ref. 2; CAA98272)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 346 AA; 37511 MW; 01806C7D2B396A20 CRC64;
MFNVSALRAA TPSIASVSSV ASPSEQHGLS TSVGVGVNDT TSRTGDGGAA SSASSASAAP
QQQSQSALHN KLEAKWDTLL PTDTNLQCST WPDSIPLLAG YSATPTFSFD PCTYGSYDPS
AYFASNGIAG SMYTLPDQFP RSENDMLDNS NTSNGNKSDK DGIKLEDEDE ILEDEENDEE
DDGTGKRKKR KRRVLFTKAQ TYELERRFRS QKYLSAPERE ALAMQIRLTP TQVKIWFQNH
RYKTKKSHTD KPINAALLTT MPNAFSSQST AASFPTRAMP IPMLVRDSSA RSSDISSTSP
YTVAFGSANS GYLPTPSAYL PATSGYFSNG PSAASSYMTN TQWWPS