HM38_CAEEL
ID HM38_CAEEL Reviewed; 641 AA.
AC Q19720; Q95QJ5;
DT 27-APR-2001, integrated into UniProtKB/Swiss-Prot.
DT 27-MAY-2002, sequence version 2.
DT 03-AUG-2022, entry version 152.
DE RecName: Full=Homeobox protein ceh-38;
GN Name=ceh-38; ORFNames=F22D3.1;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2]
RP TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE.
RX PubMed=9661672; DOI=10.1016/s0378-1119(98)00137-1;
RA Cassata G., Kagoshima H., Pretot R.F., Aspoeck G., Niklaus G.,
RA Buerglin T.R.;
RT "Rapid expression screening of Caenorhabditis elegans homeobox open reading
RT frames using a two-step polymerase chain reaction promoter-gfp reporter
RT construction technique.";
RL Gene 212:127-135(1998).
CC -!- FUNCTION: Probable DNA-binding regulatory protein involved in cell-fate
CC specification. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC ECO:0000255|PROSITE-ProRule:PRU00374}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=b;
CC IsoId=Q19720-1; Sequence=Displayed;
CC Name=a;
CC IsoId=Q19720-2; Sequence=VSP_002313;
CC -!- TISSUE SPECIFICITY: Expressed in the embryo. After gastrulation,
CC expressed in almost all cells. During larval and adult stages,
CC expressed in the dorsal and ventral nerve cord, head and tail neurons,
CC pharynx, gut and head. {ECO:0000269|PubMed:9661672}.
CC -!- DEVELOPMENTAL STAGE: Expression starts during embryogenesis and
CC continues into adulthood. {ECO:0000269|PubMed:9661672}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO080140; CCD61546.1; -; Genomic_DNA.
DR EMBL; FO080140; CCD61547.1; -; Genomic_DNA.
DR RefSeq; NP_741017.1; NM_171016.3.
DR RefSeq; NP_741018.1; NM_171852.1. [Q19720-2]
DR AlphaFoldDB; Q19720; -.
DR SMR; Q19720; -.
DR BioGRID; 39474; 3.
DR IntAct; Q19720; 2.
DR STRING; 6239.F22D3.1b; -.
DR iPTMnet; Q19720; -.
DR EPD; Q19720; -.
DR PaxDb; Q19720; -.
DR PeptideAtlas; Q19720; -.
DR PRIDE; Q19720; -.
DR EnsemblMetazoa; F22D3.1a.1; F22D3.1a.1; WBGene00000459. [Q19720-2]
DR EnsemblMetazoa; F22D3.1a.2; F22D3.1a.2; WBGene00000459. [Q19720-2]
DR EnsemblMetazoa; F22D3.1b.1; F22D3.1b.1; WBGene00000459. [Q19720-1]
DR EnsemblMetazoa; F22D3.1b.2; F22D3.1b.2; WBGene00000459. [Q19720-1]
DR GeneID; 174136; -.
DR UCSC; F22D3.1b; c. elegans. [Q19720-1]
DR CTD; 174136; -.
DR WormBase; F22D3.1a; CE27137; WBGene00000459; ceh-38. [Q19720-2]
DR WormBase; F22D3.1b; CE29772; WBGene00000459; ceh-38. [Q19720-1]
DR eggNOG; KOG2252; Eukaryota.
DR GeneTree; ENSGT00950000183103; -.
DR InParanoid; Q19720; -.
DR OMA; RIYSTQD; -.
DR PRO; PR:Q19720; -.
DR Proteomes; UP000001940; Chromosome II.
DR Bgee; WBGene00000459; Expressed in pharyngeal muscle cell (C elegans) and 4 other tissues.
DR ExpressionAtlas; Q19720; baseline and differential.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.260.40; -; 1.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR Pfam; PF02376; CUT; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR SUPFAM; SSF47413; SSF47413; 1.
DR PROSITE; PS51042; CUT; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..641
FT /note="Homeobox protein ceh-38"
FT /id="PRO_0000202409"
FT DNA_BIND 308..394
FT /note="CUT"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374"
FT DNA_BIND 427..486
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 129..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 398..428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 485..508
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 552..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..38
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..79
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..157
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..211
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 400..414
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 569..605
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 612..632
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 254..263
FT /note="NSRKQKKPLG -> S (in isoform a)"
FT /evidence="ECO:0000305"
FT /id="VSP_002313"
SQ SEQUENCE 641 AA; 70816 MW; 9A57D865682D184C CRC64;
MESSRTAATS TNGTEKSRRR NTDYLQIDPS STFINNTGRG FAEELPENFL DTISPHPITP
SASTSSATSA TEEPATSSAP QLASLAPMSM SSEQPSSSFS SASLLSSSYE TIKNEPEFSG
STAGLLSPLH VDSRRRESHD FNTSPYIKEE EDLDGSHLLM GGIRPDTPTN DRSTDLGSIS
SLLNEDHHTN TIGQSPSPRS TFGSDPTPMI QRQLIKNEDG VSPGSMGFSK NHQGYQKPRN
GDRMEYEKAP YQRNSRKQKK PLGLLNQALS SVISTPTISS SNIPTPPSAH IAQPRRIYST
QDSNDPLNAE IGDDIYIDTK DLCKRIAFEL KNHSIPQAIF AERILCRSQG TLSDLLRNPK
PWNKLKSGRE TFRRMYNWVA QPLATRLAIL DMKTEDVNRA SGMSPPTPAQ NVRTHRRSTS
DHDGPVSKRP RLVFTDIQKR TLQAIFKETQ RPSREMQQTI AEHLRLDLST VANFFMNARR
RSRLGGNIDE PTPFQQVKNI SPPPVGDTSD ALLNGDDHVP LLNTVMAEMY KEGAIATSNH
SAEQREMIER GFGVSIPGPS HSGELLNGDS HEDDEELDEL NDSELAYEED VEIGDEEEED
EEQANGDILP TPKVEELEEK TVIKEEAPDD GEYGATKLAA N