HM19_CAEEL
ID HM19_CAEEL Reviewed; 199 AA.
AC P26797; Q19644;
DT 01-AUG-1992, integrated into UniProtKB/Swiss-Prot.
DT 06-JUN-2002, sequence version 2.
DT 03-AUG-2022, entry version 157.
DE RecName: Full=Homeobox protein ceh-19;
GN Name=ceh-19; Synonyms=ceh16; ORFNames=F20D12.6;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORM A).
RX PubMed=1352400; DOI=10.1093/nar/20.12.2967;
RA Naito M., Kohara Y., Kurosawa Y.;
RT "Identification of a homeobox-containing gene located between lin-45 and
RT unc-24 on chromosome IV in the nematode Caenorhabditis elegans.";
RL Nucleic Acids Res. 20:2967-2969(1992).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=b;
CC IsoId=P26797-1; Sequence=Displayed;
CC Name=a;
CC IsoId=P26797-2; Sequence=VSP_011803, VSP_011804;
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA77838.1; Type=Miscellaneous discrepancy; Note=Intron retention.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z11794; CAA77837.1; -; Genomic_DNA.
DR EMBL; Z11795; CAA77838.1; ALT_SEQ; mRNA.
DR EMBL; FO080909; CCD67721.1; -; Genomic_DNA.
DR EMBL; FO080909; CCD67722.1; -; Genomic_DNA.
DR PIR; S26301; S26301.
DR PIR; T16113; T16113.
DR RefSeq; NP_001023141.1; NM_001027970.2. [P26797-2]
DR RefSeq; NP_001023142.1; NM_001027971.5. [P26797-1]
DR AlphaFoldDB; P26797; -.
DR SMR; P26797; -.
DR BioGRID; 42704; 2.
DR IntAct; P26797; 2.
DR STRING; 6239.F20D12.6b; -.
DR PaxDb; P26797; -.
DR EnsemblMetazoa; F20D12.6a.1; F20D12.6a.1; WBGene00000442. [P26797-2]
DR EnsemblMetazoa; F20D12.6b.1; F20D12.6b.1; WBGene00000442. [P26797-1]
DR GeneID; 177590; -.
DR KEGG; cel:CELE_F20D12.6; -.
DR UCSC; F20D12.6a; c. elegans. [P26797-1]
DR CTD; 177590; -.
DR WormBase; F20D12.6a; CE31687; WBGene00000442; ceh-19. [P26797-2]
DR WormBase; F20D12.6b; CE04436; WBGene00000442; ceh-19. [P26797-1]
DR eggNOG; KOG0488; Eukaryota.
DR GeneTree; ENSGT00940000170984; -.
DR HOGENOM; CLU_118743_0_0_1; -.
DR InParanoid; P26797; -.
DR OMA; IRQMCKE; -.
DR OrthoDB; 1360298at2759; -.
DR PhylomeDB; P26797; -.
DR SignaLink; P26797; -.
DR PRO; PR:P26797; -.
DR Proteomes; UP000001940; Chromosome IV.
DR Bgee; WBGene00000442; Expressed in larva and 3 other tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW Nucleus; Reference proteome.
FT CHAIN 1..199
FT /note="Homeobox protein ceh-19"
FT /id="PRO_0000048990"
FT DNA_BIND 94..153
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..42
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..77
FT /note="Missing (in isoform a)"
FT /evidence="ECO:0000303|PubMed:1352400"
FT /id="VSP_011803"
FT VAR_SEQ 78..80
FT /note="VSA -> MYS (in isoform a)"
FT /evidence="ECO:0000303|PubMed:1352400"
FT /id="VSP_011804"
SQ SEQUENCE 199 AA; 22890 MW; 770769B1BEA358A0 CRC64;
MAFNIESLLE KKSNPVEEGN DFEEENDSEK NGEEDEEEEE KNVIDGWTNM ATSQLAMFAI
ANDLRTPTLV ELQMLLGVSA RKHDYKRSRK SVCERKPRQA YSARQLDRLE TEFQTDKYLS
VNKRIQLSQT LNLTETQIKT WFQNRRTKWK KQLTSSIRQM VKDAPTSTSV GVPFQSLLTP
PTPPTTLACH VNSLFACEQ