位置:首页 > 蛋白库 > HMIN_DROME
HMIN_DROME
ID   HMIN_DROME              Reviewed;         576 AA.
AC   P05527; A4UZD7; Q0E9C1; Q5U0Z5; Q8T3Z5; Q9V600;
DT   01-NOV-1988, integrated into UniProtKB/Swiss-Prot.
DT   25-OCT-2004, sequence version 2.
DT   03-AUG-2022, entry version 182.
DE   RecName: Full=Homeobox protein invected;
GN   Name=inv; ORFNames=CG17835;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, SUBCELLULAR LOCATION, AND TISSUE
RP   SPECIFICITY.
RX   PubMed=2892756; DOI=10.1101/gad.1.1.19;
RA   Coleman K.G., Poole S.J., Weir M.P., Soeller W.C., Kornberg T.;
RT   "The invected gene of Drosophila: sequence analysis and expression studies
RT   reveal a close kinship to the engrailed gene.";
RL   Genes Dev. 1:19-28(1987).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA   An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA   Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA   Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA   Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA   Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA   Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA   Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA   Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [3]
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley;
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   STRAIN=Berkeley; TISSUE=Testis;
RX   PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA   Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA   Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA   Celniker S.E.;
RT   "A Drosophila full-length cDNA resource.";
RL   Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN   [5]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   STRAIN=Berkeley; TISSUE=Embryo;
RA   Stapleton M., Carlson J.W., Chavez C., Frise E., George R.A., Pacleb J.M.,
RA   Park S., Wan K.H., Yu C., Celniker S.E.;
RL   Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases.
RN   [6]
RP   FUNCTION, AND TISSUE SPECIFICITY.
RX   PubMed=9165116; DOI=10.1242/dev.124.9.1675;
RA   Bhat K.M., Schedl P.;
RT   "Requirement for engrailed and invected genes reveals novel regulatory
RT   interactions between engrailed/invected, patched, gooseberry and wingless
RT   during Drosophila neurogenesis.";
RL   Development 124:1675-1688(1997).
CC   -!- FUNCTION: Engrailed (en) and invected (inv) are functionally redundant
CC       transcription factors in neuronal precursor cell NB5-3 specification.
CC       Inv is unable to substitute for en in other regulatory processes such
CC       as maintaining gsb expression in the neuroectoderm after stage 10 of
CC       embryogenesis. Maintenance of gsb expression in row 5 of the
CC       neuroectoderm involves an as yet unidentified short range signaling
CC       molecule. {ECO:0000269|PubMed:2892756, ECO:0000269|PubMed:9165116}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC       ECO:0000269|PubMed:2892756}.
CC   -!- TISSUE SPECIFICITY: Expressed in row 6/7 of the embryonic
CC       neuroectoderm. {ECO:0000269|PubMed:2892756,
CC       ECO:0000269|PubMed:9165116}.
CC   -!- SIMILARITY: Belongs to the engrailed homeobox family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X05273; CAA28885.1; -; mRNA.
DR   EMBL; AE013599; AAF58640.3; -; Genomic_DNA.
DR   EMBL; AE013599; AAM68707.3; -; Genomic_DNA.
DR   EMBL; AE013599; AAM68708.3; -; Genomic_DNA.
DR   EMBL; AY089423; AAL90161.1; -; mRNA.
DR   EMBL; BT016097; AAV36982.1; -; mRNA.
DR   PIR; A26628; A26628.
DR   RefSeq; NP_523699.3; NM_078975.4.
DR   RefSeq; NP_725056.2; NM_165838.3.
DR   RefSeq; NP_725057.2; NM_165839.3.
DR   AlphaFoldDB; P05527; -.
DR   SMR; P05527; -.
DR   BioGRID; 62027; 21.
DR   IntAct; P05527; 9.
DR   STRING; 7227.FBpp0087173; -.
DR   PaxDb; P05527; -.
DR   DNASU; 36239; -.
DR   EnsemblMetazoa; FBtr0088068; FBpp0087174; FBgn0001269.
DR   EnsemblMetazoa; FBtr0088069; FBpp0087175; FBgn0001269.
DR   EnsemblMetazoa; FBtr0345673; FBpp0311724; FBgn0001269.
DR   GeneID; 36239; -.
DR   KEGG; dme:Dmel_CG17835; -.
DR   CTD; 36239; -.
DR   FlyBase; FBgn0001269; inv.
DR   VEuPathDB; VectorBase:FBgn0001269; -.
DR   eggNOG; KOG0489; Eukaryota.
DR   GeneTree; ENSGT00940000167868; -.
DR   HOGENOM; CLU_034034_0_0_1; -.
DR   InParanoid; P05527; -.
DR   OMA; GSFQEEF; -.
DR   OrthoDB; 858478at2759; -.
DR   PhylomeDB; P05527; -.
DR   SignaLink; P05527; -.
DR   BioGRID-ORCS; 36239; 1 hit in 3 CRISPR screens.
DR   GenomeRNAi; 36239; -.
DR   PRO; PR:P05527; -.
DR   Proteomes; UP000000803; Chromosome 2R.
DR   Bgee; FBgn0001269; Expressed in cleaving embryo and 34 other tissues.
DR   Genevisible; P05527; DM.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0007386; P:compartment pattern specification; TAS:FlyBase.
DR   GO; GO:0050832; P:defense response to fungus; IMP:FlyBase.
DR   GO; GO:0007474; P:imaginal disc-derived wing vein specification; IMP:FlyBase.
DR   GO; GO:0007400; P:neuroblast fate determination; IGI:FlyBase.
DR   GO; GO:0030182; P:neuron differentiation; IBA:GO_Central.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   GO; GO:0048100; P:wing disc anterior/posterior pattern formation; TAS:FlyBase.
DR   CDD; cd00086; homeodomain; 1.
DR   InterPro; IPR019549; Homeobox-engrailed_C-terminal.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR000747; Homeobox_engrailed.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   InterPro; IPR019737; Homoebox-engrailed_CS.
DR   InterPro; IPR000047; HTH_motif.
DR   Pfam; PF10525; Engrail_1_C_sig; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00026; ENGRAILED.
DR   PRINTS; PR00024; HOMEOBOX.
DR   PRINTS; PR00031; HTHREPRESSR.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; SSF46689; 1.
DR   PROSITE; PS00033; ENGRAILED; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   2: Evidence at transcript level;
KW   Developmental protein; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW   Transcription; Transcription regulation.
FT   CHAIN           1..576
FT                   /note="Homeobox protein invected"
FT                   /id="PRO_0000196081"
FT   DNA_BIND        471..530
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT   REGION          1..68
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          80..102
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          305..344
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          364..410
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          426..476
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        305..329
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        364..385
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CONFLICT        39
FT                   /note="G -> V (in Ref. 4; AAL90161)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        45
FT                   /note="C -> V (in Ref. 4; AAL90161)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        81
FT                   /note="V -> A (in Ref. 4; AAL90161)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        159
FT                   /note="L -> M (in Ref. 1; CAA28885)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        321
FT                   /note="N -> T (in Ref. 1; CAA28885 and 4; AAL90161)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        570
FT                   /note="A -> R (in Ref. 1; CAA28885)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   576 AA;  60863 MW;  67E93DA41A606149 CRC64;
     MSTLASTRPP PLKLTIPSLE EAEDHAQERR AGGGGQEVGK MHPDCLPLPL VQPGNSPQVR
     EEEEDEQTEC EEQLNIEDEE VEEEHDLDLE DPASCCSENS VLSVGQEQSE AAQAALSAQA
     QARQRLLISQ IYRPSAFSST ATTVLPPSEG PPFSPEDLLQ LPPSTGTFQE EFLRKSQLYA
     EELMKQQMHL MAAARVNALT AAAAGKQLQM AMAAAAVATV PSGQDALAQL TATALGLGPG
     GAVHPHQQLL LQRDQVHHHH HMQNHLNNNE NLHERALKFS IDNILKADFG SRLPKIGALS
     GNIGGGSVSG SSTGSSKNSG NTNGNRSPLK APKKSGKPLN LAQSNAAANS SLSFSSSLAN
     ICSNSNDSNS TATSSSTTNT SGAPVDLVKS PPPAAGAGAT GASGKSGEDS GTPIVWPAWV
     YCTRYSDRPS SGRSPRARKP KKPATSSSAA GGGGGGVEKG EAADGGGVPE DKRPRTAFSG
     TQLARLKHEF NENRYLTEKR RQQLSGELGL NEAQIKIWFQ NKRAKLKKSS GTKNPLALQL
     MAQGLYNHST IPLTREEEEL QELQEAASAA AAKEPC
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024