CUT_CAEEL
ID CUT_CAEEL Reviewed; 1273 AA.
AC Q9BL02;
DT 10-JAN-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 143.
DE RecName: Full=Homeobox protein cut-like ceh-44;
DE Short=Homeobox protein 44;
GN Name=ceh-44; ORFNames=Y54F10AM.4;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- FUNCTION: Probable DNA-binding regulatory protein involved in cell-fate
CC specification. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC ECO:0000255|PROSITE-ProRule:PRU00374}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=a;
CC IsoId=Q9BL02-1; Sequence=Displayed;
CC Name=b;
CC IsoId=Q8IA98-1; Sequence=External;
CC Name=c;
CC IsoId=Q8IA98-2; Sequence=External;
CC -!- MISCELLANEOUS: Asn-1149 may participate in regulating DNA-binding
CC activity by promoting homo- and heterodimerization.
CC -!- SIMILARITY: Belongs to the CUT homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO081806; CCD73822.1; -; Genomic_DNA.
DR RefSeq; NP_497576.1; NM_065175.5. [Q9BL02-1]
DR AlphaFoldDB; Q9BL02; -.
DR SMR; Q9BL02; -.
DR BioGRID; 40620; 3.
DR STRING; 6239.Y54F10AM.4a; -.
DR EPD; Q9BL02; -.
DR PaxDb; Q9BL02; -.
DR PeptideAtlas; Q9BL02; -.
DR EnsemblMetazoa; Y54F10AM.4a.1; Y54F10AM.4a.1; WBGene00000464. [Q9BL02-1]
DR GeneID; 175372; -.
DR UCSC; Y54F10AM.4c; c. elegans. [Q9BL02-1]
DR CTD; 175372; -.
DR WormBase; Y54F10AM.4a; CE27288; WBGene00000464; ceh-44. [Q9BL02-1]
DR eggNOG; KOG0963; Eukaryota.
DR eggNOG; KOG2252; Eukaryota.
DR GeneTree; ENSGT00940000172657; -.
DR HOGENOM; CLU_003980_0_0_1; -.
DR InParanoid; Q9BL02; -.
DR OrthoDB; 181575at2759; -.
DR PhylomeDB; Q9BL02; -.
DR Proteomes; UP000001940; Chromosome III.
DR Bgee; WBGene00000464; Expressed in embryo and 4 other tissues.
DR ExpressionAtlas; Q9BL02; baseline and differential.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.260.40; -; 3.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR Pfam; PF02376; CUT; 3.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 3.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR SUPFAM; SSF47413; SSF47413; 3.
DR PROSITE; PS51042; CUT; 3.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Alternative splicing; Coiled coil; DNA-binding; Homeobox; Nucleus;
KW Reference proteome; Repeat; Transcription; Transcription regulation.
FT CHAIN 1..1273
FT /note="Homeobox protein cut-like ceh-44"
FT /id="PRO_0000202391"
FT DNA_BIND 591..681
FT /note="CUT 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374"
FT DNA_BIND 832..919
FT /note="CUT 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374"
FT DNA_BIND 978..1065
FT /note="CUT 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374"
FT DNA_BIND 1103..1162
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1069..1100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 101..407
FT /evidence="ECO:0000255"
FT COILED 440..468
FT /evidence="ECO:0000255"
FT COMPBIAS 1081..1100
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1273 AA; 143526 MW; AB4F43B8C12B258D CRC64;
MEIVSRAWES VDWDRIQTRV EAEVTALGQR QDDSEIRKTR LVEESNAYRG RTNKDSRKVA
IPLIKAFQSE FDGLLARSTA AENALIDICK SIVSLPDPKS LLKGAEAWKN DAEKTQKAVE
EREELKRQLI KVNNELEDLR GKDVKVRKLK DKLAKLESEQ DIFIENAVNE VEKKAEQELN
DRLTELIAEK EKMKEQNEIL EKNMDSLESK NKDIQRKLEI AKQTVEQKDG LENEQLSIAM
KDLADAKHKI VFLEERVSQL ENEAEKVNES KKAGNIEDIA ALGSVLVQKD DVIQQLTNDI
KRHEASHVEE LAKWKLAVSA VEKKNKTLIG ELNELKNQLE SRNDYEAIKN ELRLLREIEF
GDSAEANAES IERLGETVET LDRLLAEKNR RLQNENASLR VANDGFKGDE VMKAIVSGSH
SRVVETVGKR VGAEEANSYR QKNTDSELIE KIQEAKRNKA VCELKFEDPT INVLTYLKNQ
KAKEAGKRDA PTPILAAPVT PKHVTKLGTH TITTTALPPR TQTAETTQSI LQRLSNGSNK
HLLNEDLKLS TVLNLKRFSG NPAKPLEAKT SEEQKAELET IEKMQKRIQV NVQALNGHPL
NTTEIASHCK RLMIAYNIGQ RLFAKHVMNQ VVKSQGSLSE LLSKPRHWNK LTDKGREAFR
RIYGWISDDE AINLLCSLSP RRVWPADQNI EHPKAETLLD TSDPMEFKEE PVIRYDVTPK
VEPVIEKIKS PVESPCSSQA GASSLRASRW RHDDISKEKI LSILQTELKK IEEETTESKV
VVPVKPTATG GNRRFSSNST YESSSLTGKS RPSTVELILK QRISTGLQPL TQAQYDAYTV
LDTDFLVKQI KEFLTMNSIS QRQFGEYILG LSQGSVSDLL ARPKTWAQLT QKGREPFIRM
QLFMDDVEAS EENDEKQPKI TICDEDSDLA KTLATLLNAV HREPSEPKTS VKLEPLSEID
VIMQVPSASK PSSVVKSIDE SSGEEILDTF EIVYQVKGIL EENGISPRVF GDEYLHCTSS
MCADLMIRTK SFENSKASEK LMYTRMKTFL SDPIAIPLLV EKEESKETVK AKIESVPAPR
EAPRPVKRKH SSDTDDYDLN TKKPIQRTVI TDYQKDTLRF VFVNEQHPSN ELCEQISLKL
DMSLRTVQNW FHNHRTRSKA REKEGKVYSD ALPNGTAVKS LTWKDDLQKM LDEAPAITSQ
WAPDYQNRAG SVKSSTSADS PTNNNYSSPI FSFDKASTTS TVKKPSSTGK LDNLVARMIR
LAEGREAAAA KAS