ID ZFH1_DROME Reviewed; 1054 AA.
AC P28166; Q59DT3; Q6NP51; Q8MSQ8; Q9VA39; Q9VA40;
DT 01-OCT-1994, integrated into UniProtKB/Swiss-Prot.
DT 15-MAR-2004, sequence version 2.
DT 03-AUG-2022, entry version 193.
DE RecName: Full=Zinc finger protein 1;
DE AltName: Full=Zinc finger homeodomain protein 1;
GN Name=zfh1; Synonyms=zfh-1; ORFNames=CG1322;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=1680376; DOI=10.1016/0925-4773(91)90048-b;
RA Fortini M.E., Lai Z., Rubin G.M.;
RT "The Drosophila zfh-1 and zfh-2 genes encode novel proteins containing both
RT zinc-finger and homeodomain motifs.";
RL Mech. Dev. 34:113-122(1991).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM A), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 30-1054 (ISOFORM B).
RC STRAIN=Berkeley; TISSUE=Embryo;
RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.W., Champe M.,
RA Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., George R.A.,
RA Gonzalez M., Guarin H., Kronmiller B., Li P.W., Liao G., Miranda A.,
RA Mungall C.J., Nunoo J., Pacleb J.M., Paragas V., Park S., Patel S.,
RA Phouanenavong S., Wan K.H., Yu C., Lewis S.E., Rubin G.M., Celniker S.E.;
RL Submitted (DEC-2003) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 540-1054.
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [6]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-582; SER-586 AND SER-934, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Involved in the development of the embryonic central nervous
CC system, embryonic mesoderm and adult musculature.
CC {ECO:0000269|PubMed:1680376}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=B;
CC IsoId=P28166-1; Sequence=Displayed;
CC Name=A;
CC IsoId=P28166-2; Sequence=VSP_009670, VSP_009671;
CC -!- TISSUE SPECIFICITY: Mesoderm and mesodermally-derived structures in the
CC embryo including the dorsal vessel, support cells of the gonads, and
CC segment-specific arrays of adult muscle precursor. Also identified in
CC motor neurons of developing CNS. {ECO:0000269|PubMed:1680376}.
CC -!- SIMILARITY: Belongs to the delta-EF1/ZFH-1 C2H2-type zinc-finger
CC family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAM50023.1; Type=Miscellaneous discrepancy; Note=Contaminating sequence.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M63449; AAA29050.1; -; Genomic_DNA.
DR EMBL; AE014297; AAF57083.1; -; Genomic_DNA.
DR EMBL; AE014297; AAF57084.1; -; Genomic_DNA.
DR EMBL; BT003277; AAO25034.1; -; mRNA.
DR EMBL; BT011080; AAR82746.1; -; mRNA.
DR EMBL; AY118654; AAM50023.1; ALT_SEQ; mRNA.
DR PIR; S33641; S33641.
DR RefSeq; NP_476850.1; NM_057502.5. [P28166-1]
DR RefSeq; NP_733402.1; NM_170523.3. [P28166-2]
DR AlphaFoldDB; P28166; -.
DR SMR; P28166; -.
DR BioGRID; 68501; 27.
DR ELM; P28166; -.
DR IntAct; P28166; 13.
DR MINT; P28166; -.
DR STRING; 7227.FBpp0085063; -.
DR iPTMnet; P28166; -.
DR PaxDb; P28166; -.
DR DNASU; 43650; -.
DR EnsemblMetazoa; FBtr0085701; FBpp0085063; FBgn0004606. [P28166-1]
DR EnsemblMetazoa; FBtr0085702; FBpp0085064; FBgn0004606. [P28166-2]
DR GeneID; 43650; -.
DR KEGG; dme:Dmel_CG1322; -.
DR CTD; 43650; -.
DR FlyBase; FBgn0004606; zfh1.
DR VEuPathDB; VectorBase:FBgn0004606; -.
DR eggNOG; KOG3623; Eukaryota.
DR GeneTree; ENSGT00870000136508; -.
DR HOGENOM; CLU_010698_0_0_1; -.
DR InParanoid; P28166; -.
DR OMA; IATPFSC; -.
DR PhylomeDB; P28166; -.
DR SignaLink; P28166; -.
DR BioGRID-ORCS; 43650; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 43650; -.
DR PRO; PR:P28166; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0004606; Expressed in embryonic/larval hemocyte (Drosophila) and 71 other tissues.
DR Genevisible; P28166; DM.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IDA:FlyBase.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IDA:FlyBase.
DR GO; GO:0048856; P:anatomical structure development; IBA:GO_Central.
DR GO; GO:0007413; P:axonal fasciculation; IMP:FlyBase.
DR GO; GO:0061321; P:garland nephrocyte differentiation; IMP:FlyBase.
DR GO; GO:0008354; P:germ cell migration; IMP:FlyBase.
DR GO; GO:0007516; P:hemocyte development; IMP:FlyBase.
DR GO; GO:0048542; P:lymph gland development; IMP:FlyBase.
DR GO; GO:0007498; P:mesoderm development; IGI:FlyBase.
DR GO; GO:0008045; P:motor neuron axon guidance; IMP:FlyBase.
DR GO; GO:0046716; P:muscle cell cellular homeostasis; IMP:FlyBase.
DR GO; GO:0051148; P:negative regulation of muscle cell differentiation; IMP:FlyBase.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IDA:FlyBase.
DR GO; GO:0007399; P:nervous system development; IEP:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0048103; P:somatic stem cell division; IMP:FlyBase.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00096; zf-C2H2; 3.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00355; ZnF_C2H2; 9.
DR SUPFAM; SSF46689; SSF46689; 1.
DR SUPFAM; SSF57667; SSF57667; 4.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 6.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 9.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Homeobox; Metal-binding; Nucleus;
KW Phosphoprotein; Reference proteome; Repeat; Zinc; Zinc-finger.
FT CHAIN 1..1054
FT /note="Zinc finger protein 1"
FT /id="PRO_0000047241"
FT ZN_FING 74..97
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 289..311
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 324..346
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 355..377
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 383..407
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 628..651
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT DNA_BIND 699..758
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT ZN_FING 967..989
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 995..1017
FT /note="C2H2-type 8"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1023..1044
FT /note="C2H2-type 9"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 42..65
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 99..197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 424..463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 521..623
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 663..722
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..815
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 848..871
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 910..958
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 42..61
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..174
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..565
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..588
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 589..603
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..623
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..690
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 691..705
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 706..720
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 922..937
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 582
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 586
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 934
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 1..307
FT /note="Missing (in isoform A)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_009670"
FT VAR_SEQ 308..324
FT /note="EQLHSPCGPAAVSNVSQ -> MSAAACLLSSSTSSFEK (in isoform
FT A)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_009671"
FT CONFLICT 78
FT /note="Q -> K (in Ref. 4; AAR82746)"
FT /evidence="ECO:0000305"
FT CONFLICT 147
FT /note="S -> T (in Ref. 4; AAR82746)"
FT /evidence="ECO:0000305"
FT CONFLICT 239
FT /note="Q -> QMQQQQQ (in Ref. 1; AAA29050)"
FT /evidence="ECO:0000305"
FT CONFLICT 625
FT /note="A -> S (in Ref. 1; AAA29050, 4; AAR82746 and 5;
FT AAM50023)"
FT /evidence="ECO:0000305"
FT CONFLICT 954
FT /note="A -> V (in Ref. 1; AAA29050)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1054 AA; 116598 MW; 5189AB2214AB5B8B CRC64;
MLSCLAPSSS RFGQEDTIIQ QSMPSTSPFA MQFPSLASTL LHHNQSPKHS NPGSSGIQDA
HPNQPGAAAD AFLVKCTQCH KRFPEYQSLS EHIASEHPHD KLNCGAAQPE SDAEDEQSNM
SGSSRRYAKS PLASNNNSST ANANNNSTSS QSMNNNSELA KNHNSANKMS PMCSPGSLTP
GDLFAQLQHP PPQLPPHLHA QFMAAAASLA MQSARTASSP SQQQQQQLQQ QQQLQQQQQH
QMAMQQLLPP QLPGSNSSVG SNSAYDLDLS APRSTSSPGS TTGDLSGAYP CMQCTASFAS
REQLEQHEQL HSPCGPAAVS NVSQTCRICH KAFANVYRLQ RHMISHDESA LLRKFKCKEC
DKAFKFKHHL KEHVRIHSGE KPFGCDNCGK RFSHSGSFSS HMTSKKCISM GLKLNNNRAL
LKRLEKSPGS ASSASRRSPS DHGKGKLPEQ PSLPGLPHPM SYFASDAQVQ GGSAAPAPFP
PFHPNYMNAA LLAFPHNFMA AAAGLDPRVH PYSIQRLLQL SAAGQQQREE EREEQQKQQQ
HDEEETPDEP KLVMDIEEPE TKEMAPTPEA TEAATPIKRE ESREASPDPE SYRSSSQAIK
QEQEPLNVAE ERQTPVEEHA PVEHAADLRC SRCSKQFNHP TELVQHEKVL CGLIKEELEQ
HFQQQQATSF ALASASEEDE EDEEMDVEEE PRQESGERKV RVRTAINEEQ QQQLKQHYSL
NARPSRDEFR MIAARLQLDP RVVQVWFQNN RSRERKMQSF QNNQAAGAAP PMPIDSQASL
TREDQPLDLS VKRDPLTPKS ESSPPYIAPP SGEALNPEAI NLSRKFSTSA SMSPASISPS
SAAALYFGAA PPPSPPNSQL DSTPRSGQAF PGLPPYMLPM SLPMEALFKM RPGGDFASNH
ALMNSIKLPD YRGTSLSPGG SEKRSWRDDD SRISHEDEFG AGVLMPPKPR RGKAETHGHA
GDPDLPYVCD QCDKAFAKQS SLARHKYEHS GQRPYQCIEC PKAFKHKHHL TEHKRLHSGE
KPFQCSKCLK RFSHSGSYSQ HMNHRYSYCK PYRE
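The ALTERNATIVE PRODUCTS block and the two VAR_SEQ features above describe isoform A (P28166-2) as edits to the displayed isoform B sequence: residues 1..307 are missing and residues 308..324 are replaced. A minimal Python sketch of how such features could be applied is shown here. The function names `apply_var_seq` and `isoform_length` and the `ISOFORM_A` list are illustrative assumptions, not part of any UniProt tool; only the coordinates and the replacement peptide are taken from the FT lines.

```python
# Illustrative sketch: apply VAR_SEQ features to a canonical sequence.
# All names here are hypothetical helpers, not a UniProt API.

def apply_var_seq(canonical, features):
    """Return the isoform sequence after applying VAR_SEQ edits.

    features: iterable of (start, end, replacement) using 1-based,
    inclusive coordinates on the canonical sequence. An empty
    replacement string stands for "Missing".
    """
    residues = canonical
    # Apply edits from the C-terminal end first so that the coordinates
    # of features closer to the N-terminus remain valid.
    for start, end, replacement in sorted(features, reverse=True):
        residues = residues[:start - 1] + replacement + residues[end:]
    return residues


def isoform_length(canonical_length, features):
    """Compute the isoform length from feature coordinates alone."""
    length = canonical_length
    for start, end, replacement in features:
        length += len(replacement) - (end - start + 1)
    return length


# VAR_SEQ features for isoform A, copied from the FT lines of this entry.
ISOFORM_A = [
    (1, 307, ""),                     # VSP_009670: Missing
    (308, 324, "MSAAACLLSSSTSSFEK"),  # VSP_009671: EQLHSPCGPAAVSNVSQ replaced
]

if __name__ == "__main__":
    # 1054 is the canonical length given on the SQ line.
    print(isoform_length(1054, ISOFORM_A))
```

To obtain the isoform A sequence itself, pass the 1054-residue sequence from the SQ block (with spaces and line breaks removed) to `apply_var_seq` together with `ISOFORM_A`. Under these two features alone, the length arithmetic is 1054 - 307 + (17 - 17).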