ID HM21_CAEEL Reviewed; 495 AA.
AC Q22811; Q8WQE1;
DT 27-APR-2001, integrated into UniProtKB/Swiss-Prot.
DT 06-JUN-2002, sequence version 3.
DT 03-AUG-2022, entry version 150.
DE RecName: Full=Homeobox protein ceh-21;
GN Name=ceh-21; ORFNames=T26C11.6;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=Bristol N2;
RX PubMed=11902672;
RA Buerglin T.R., Cassata G.;
RT "Loss and gain of domains during evolution of cut superclass homeobox
RT genes.";
RL Int. J. Dev. Biol. 46:115-123(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [3]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 313-495.
RC STRAIN=Bristol N2;
RX PubMed=9593691; DOI=10.1074/jbc.273.22.13552;
RA Lannoy V.J., Buerglin T.R., Rousseau G.G., Lemaigre F.P.;
RT "Isoforms of hepatocyte nuclear factor-6 differ in DNA-binding properties,
RT contain a bifunctional homeodomain, and define the new ONECUT class of
RT homeodomain proteins.";
RL J. Biol. Chem. 273:13552-13562(1998).
CC -!- FUNCTION: Probable DNA-binding regulatory protein involved in cell-fate
CC specification. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC ECO:0000255|PROSITE-ProRule:PRU00374}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ427855; CAD20808.1; -; mRNA.
DR EMBL; FO080541; CCD64528.1; -; Genomic_DNA.
DR EMBL; AF023470; AAB86814.1; -; mRNA.
DR PIR; T28912; T28912.
DR PIR; T42240; T42240.
DR RefSeq; NP_508341.2; NM_075940.5.
DR AlphaFoldDB; Q22811; -.
DR SMR; Q22811; -.
DR BioGRID; 45451; 2.
DR IntAct; Q22811; 2.
DR STRING; 6239.T26C11.6; -.
DR EPD; Q22811; -.
DR PaxDb; Q22811; -.
DR EnsemblMetazoa; T26C11.6.1; T26C11.6.1; WBGene00000444.
DR UCSC; T26C11.6.2; c. elegans.
DR WormBase; T26C11.6; CE29823; WBGene00000444; ceh-21.
DR eggNOG; KOG2252; Eukaryota.
DR GeneTree; ENSGT00950000183103; -.
DR HOGENOM; CLU_551232_0_0_1; -.
DR InParanoid; Q22811; -.
DR PRO; PR:Q22811; -.
DR Proteomes; UP000001940; Chromosome X.
DR Bgee; WBGene00000444; Expressed in embryo and 4 other tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.260.40; -; 1.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR Pfam; PF02376; CUT; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR SUPFAM; SSF47413; SSF47413; 1.
DR PROSITE; PS51042; CUT; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW DNA-binding; Homeobox; Nucleus; Reference proteome; Transcription;
KW Transcription regulation.
FT CHAIN 1..495
FT /note="Homeobox protein ceh-21"
FT /id="PRO_0000202408"
FT DNA_BIND 284..370
FT /note="CUT"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374"
FT DNA_BIND 389..449
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 89..267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 450..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 105..135
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..162
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..239
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 495 AA; 54823 MW; C4FFA0984D0DAA08 CRC64;
MSQQFQASSG TGSASLREFK TEHEDLREDL PYSTLRTLFG ITLDKDASQA LNIALLLYGH
NYPQQVVPPE RNYAELDAQL ESVVLEDHTA ESTMEPGVSA TVTEQLEEKS DKSSDGDGTS
KRLTRSLKSV ENETEEDHEE KEDEAPQSSR RESTRLKRKL LESQKTVQTT GNSSRASSKS
QEKEVPGTKS QCAPKIRTTP EQSKAATKRQ SSTTVRASST CGSSVSSTST VSSPDYTAKK
GRATETPKLE ELAPKKQSSA TPKPGGEVCV WDGVQIGDLS AQMNAQIGDD EELDTVDIAR
RILSELKERC IPQTALAEKI LARSQGTLSD LLRMPKPWSV MKNGRATFQR MSNWLGLDPD
VRRALCFLPK EDVARITGLD EPTPAKRKKT VKVIRLTFTE TQLKSLQKSF QQNHRPTREM
RQKLSATLEL DFSTVGNFFM NSRRRLRIDQ QISRSSRSTG NGADTEDELD EEDVVVENVI
ADATDASNQP GPSHL