HMIN_DROME
ID HMIN_DROME Reviewed; 576 AA.
AC P05527; A4UZD7; Q0E9C1; Q5U0Z5; Q8T3Z5; Q9V600;
DT 01-NOV-1988, integrated into UniProtKB/Swiss-Prot.
DT 25-OCT-2004, sequence version 2.
DT 03-AUG-2022, entry version 182.
DE RecName: Full=Homeobox protein invected;
GN Name=inv; ORFNames=CG17835;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, SUBCELLULAR LOCATION, AND TISSUE
RP SPECIFICITY.
RX PubMed=2892756; DOI=10.1101/gad.1.1.19;
RA Coleman K.G., Poole S.J., Weir M.P., Soeller W.C., Kornberg T.;
RT "The invected gene of Drosophila: sequence analysis and expression studies
RT reveal a close kinship to the engrailed gene.";
RL Genes Dev. 1:19-28(1987).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley; TISSUE=Testis;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley; TISSUE=Embryo;
RA Stapleton M., Carlson J.W., Chavez C., Frise E., George R.A., Pacleb J.M.,
RA Park S., Wan K.H., Yu C., Celniker S.E.;
RL Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=9165116; DOI=10.1242/dev.124.9.1675;
RA Bhat K.M., Schedl P.;
RT "Requirement for engrailed and invected genes reveals novel regulatory
RT interactions between engrailed/invected, patched, gooseberry and wingless
RT during Drosophila neurogenesis.";
RL Development 124:1675-1688(1997).
CC -!- FUNCTION: Engrailed (en) and invected (inv) are functionally redundant
CC transcription factors in neuronal precursor cell NB5-3 specification.
CC Inv is unable to substitute for en in other regulatory processes such
CC as maintaining gsb expression in the neuroectoderm after stage 10 of
CC embryogenesis. Maintenance of gsb expression in row 5 of the
CC neuroectoderm involves an as yet unidentified short range signaling
CC molecule. {ECO:0000269|PubMed:2892756, ECO:0000269|PubMed:9165116}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC ECO:0000269|PubMed:2892756}.
CC -!- TISSUE SPECIFICITY: Expressed in row 6/7 of the embryonic
CC neuroectoderm. {ECO:0000269|PubMed:2892756,
CC ECO:0000269|PubMed:9165116}.
CC -!- SIMILARITY: Belongs to the engrailed homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X05273; CAA28885.1; -; mRNA.
DR EMBL; AE013599; AAF58640.3; -; Genomic_DNA.
DR EMBL; AE013599; AAM68707.3; -; Genomic_DNA.
DR EMBL; AE013599; AAM68708.3; -; Genomic_DNA.
DR EMBL; AY089423; AAL90161.1; -; mRNA.
DR EMBL; BT016097; AAV36982.1; -; mRNA.
DR PIR; A26628; A26628.
DR RefSeq; NP_523699.3; NM_078975.4.
DR RefSeq; NP_725056.2; NM_165838.3.
DR RefSeq; NP_725057.2; NM_165839.3.
DR AlphaFoldDB; P05527; -.
DR SMR; P05527; -.
DR BioGRID; 62027; 21.
DR IntAct; P05527; 9.
DR STRING; 7227.FBpp0087173; -.
DR PaxDb; P05527; -.
DR DNASU; 36239; -.
DR EnsemblMetazoa; FBtr0088068; FBpp0087174; FBgn0001269.
DR EnsemblMetazoa; FBtr0088069; FBpp0087175; FBgn0001269.
DR EnsemblMetazoa; FBtr0345673; FBpp0311724; FBgn0001269.
DR GeneID; 36239; -.
DR KEGG; dme:Dmel_CG17835; -.
DR CTD; 36239; -.
DR FlyBase; FBgn0001269; inv.
DR VEuPathDB; VectorBase:FBgn0001269; -.
DR eggNOG; KOG0489; Eukaryota.
DR GeneTree; ENSGT00940000167868; -.
DR HOGENOM; CLU_034034_0_0_1; -.
DR InParanoid; P05527; -.
DR OMA; GSFQEEF; -.
DR OrthoDB; 858478at2759; -.
DR PhylomeDB; P05527; -.
DR SignaLink; P05527; -.
DR BioGRID-ORCS; 36239; 1 hit in 3 CRISPR screens.
DR GenomeRNAi; 36239; -.
DR PRO; PR:P05527; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0001269; Expressed in cleaving embryo and 34 other tissues.
DR Genevisible; P05527; DM.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0007386; P:compartment pattern specification; TAS:FlyBase.
DR GO; GO:0050832; P:defense response to fungus; IMP:FlyBase.
DR GO; GO:0007474; P:imaginal disc-derived wing vein specification; IMP:FlyBase.
DR GO; GO:0007400; P:neuroblast fate determination; IGI:FlyBase.
DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0048100; P:wing disc anterior/posterior pattern formation; TAS:FlyBase.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR019549; Homeobox-engrailed_C-terminal.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000747; Homeobox_engrailed.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR019737; Homoebox-engrailed_CS.
DR InterPro; IPR000047; HTH_motif.
DR Pfam; PF10525; Engrail_1_C_sig; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00026; ENGRAILED.
DR PRINTS; PR00024; HOMEOBOX.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00033; ENGRAILED; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..576
FT /note="Homeobox protein invected"
FT /id="PRO_0000196081"
FT DNA_BIND 471..530
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..68
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 80..102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 305..344
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 364..410
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 426..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 305..329
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 364..385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 39
FT /note="G -> V (in Ref. 4; AAL90161)"
FT /evidence="ECO:0000305"
FT CONFLICT 45
FT /note="C -> V (in Ref. 4; AAL90161)"
FT /evidence="ECO:0000305"
FT CONFLICT 81
FT /note="V -> A (in Ref. 4; AAL90161)"
FT /evidence="ECO:0000305"
FT CONFLICT 159
FT /note="L -> M (in Ref. 1; CAA28885)"
FT /evidence="ECO:0000305"
FT CONFLICT 321
FT /note="N -> T (in Ref. 1; CAA28885 and 4; AAL90161)"
FT /evidence="ECO:0000305"
FT CONFLICT 570
FT /note="A -> R (in Ref. 1; CAA28885)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 576 AA; 60863 MW; 67E93DA41A606149 CRC64;
MSTLASTRPP PLKLTIPSLE EAEDHAQERR AGGGGQEVGK MHPDCLPLPL VQPGNSPQVR
EEEEDEQTEC EEQLNIEDEE VEEEHDLDLE DPASCCSENS VLSVGQEQSE AAQAALSAQA
QARQRLLISQ IYRPSAFSST ATTVLPPSEG PPFSPEDLLQ LPPSTGTFQE EFLRKSQLYA
EELMKQQMHL MAAARVNALT AAAAGKQLQM AMAAAAVATV PSGQDALAQL TATALGLGPG
GAVHPHQQLL LQRDQVHHHH HMQNHLNNNE NLHERALKFS IDNILKADFG SRLPKIGALS
GNIGGGSVSG SSTGSSKNSG NTNGNRSPLK APKKSGKPLN LAQSNAAANS SLSFSSSLAN
ICSNSNDSNS TATSSSTTNT SGAPVDLVKS PPPAAGAGAT GASGKSGEDS GTPIVWPAWV
YCTRYSDRPS SGRSPRARKP KKPATSSSAA GGGGGGVEKG EAADGGGVPE DKRPRTAFSG
TQLARLKHEF NENRYLTEKR RQQLSGELGL NEAQIKIWFQ NKRAKLKKSS GTKNPLALQL
MAQGLYNHST IPLTREEEEL QELQEAASAA AAKEPC