HMIN_BOMMO
ID HMIN_BOMMO Reviewed; 476 AA.
AC P27610;
DT 01-AUG-1992, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1992, sequence version 1.
DT 03-AUG-2022, entry version 118.
DE RecName: Full=Homeobox protein invected;
GN Name=INV;
OS Bombyx mori (Silk moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC Bombycidae; Bombycinae; Bombyx.
OX NCBI_TaxID=7091;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=Kinshu X Showa; TISSUE=Middle silk gland;
RX PubMed=1346065; DOI=10.1073/pnas.89.1.167;
RA Hui C.-C., Matsuno K., Ueno K., Suzuki Y.;
RT "Molecular characterization and silk gland expression of Bombyx engrailed
RT and invected genes.";
RL Proc. Natl. Acad. Sci. U.S.A. 89:167-171(1992).
CC -!- FUNCTION: This protein might be involved in the compartmentalization of
CC the silk gland.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- TISSUE SPECIFICITY: Expressed in the middle silk gland but not in the
CC posterior silk gland during the fourth molt/fifth intermolt period.
CC -!- SIMILARITY: Belongs to the engrailed homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M64336; AAA67313.1; -; mRNA.
DR PIR; B41792; B41792.
DR RefSeq; NP_001037454.1; NM_001043989.1.
DR AlphaFoldDB; P27610; -.
DR SMR; P27610; -.
DR STRING; 7091.BGIBMGA009643-TA; -.
DR GeneID; 693024; -.
DR KEGG; bmor:693024; -.
DR CTD; 36239; -.
DR eggNOG; KOG0493; Eukaryota.
DR HOGENOM; CLU_040714_0_0_1; -.
DR InParanoid; P27610; -.
DR OrthoDB; 1575614at2759; -.
DR Proteomes; UP000005204; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR GO; GO:0007399; P:nervous system development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR019549; Homeobox-engrailed_C-terminal.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000747; Homeobox_engrailed.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR019737; Homoebox-engrailed_CS.
DR InterPro; IPR000047; HTH_motif.
DR Pfam; PF10525; Engrail_1_C_sig; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00026; ENGRAILED.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00033; ENGRAILED; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Homeobox; Nucleus; Reference proteome.
FT CHAIN 1..476
FT /note="Homeobox protein invected"
FT /id="PRO_0000196080"
FT DNA_BIND 372..431
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 273..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 347..381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 295..312
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 351..373
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 476 AA; 53585 MW; CCD219F4667EEC02 CRC64;
MAAVSAHMQD IKIQDQSDDD PYSPNTRDTT SPECHDDEKS EDISIRSSSF SIHNVLRRSG
TTAALTMSFR RKSSWRIPNF DDRNTESVSP VVEVNEREIS VDDGNSCCSD DTVLSVGNEA
PVSNYEEKAS QNTHQELTSF KHIQTHLSAI SQLSQNMNVA QPLLLRPSPI NPNPIMFLNQ
PLLFQSPILS QDLKGMPNRQ TANVISPTFG LNFGMRLKAN HETRTRSDEN RYSKPEESRD
YINQNCLKFS IDNILKADFG RRITDPLHKR KVKTRYEAKP APAKDTAAFA PKLDEARVPD
IKTPDKAGAI DLSKDDSGSN SGSTSGATSG DSPMVWPAWV YCTRYSDRPS SGRSPRTRRP
KKPPGDTASN DEKRPRTAFS GPQLARLKHE FAENRYLTER RRQSLAAELG LAEAQIKIWF
QNKRAKIKKA SGQRNPLALQ LMAQGLYNHS TVPLTKEEEE LEMKARERER ELKNRC