EWG_DROME
ID EWG_DROME Reviewed; 733 AA.
AC Q24312; A4V3R8; Q59E73; Q59E74; Q59E75; Q59E76; Q59E77; Q8SXJ1; Q9NF76;
AC Q9W5G8; Q9W5G9;
DT 15-NOV-2002, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2001, sequence version 2.
DT 03-AUG-2022, entry version 170.
DE RecName: Full=DNA-binding protein Ewg;
DE AltName: Full=Protein erect wing;
GN Name=ewg; ORFNames=CG3114;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000312|EMBL:AAL90347.1};
RN [1] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A), FUNCTION, SUBCELLULAR LOCATION,
RP TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE.
RC STRAIN=Canton-S; TISSUE=Head;
RX PubMed=8388540; DOI=10.1128/mcb.13.6.3641-3649.1993;
RA DeSimone S.M., White K.;
RT "The Drosophila erect wing gene, which is important for both neuronal and
RT muscle development, encodes a protein which is similar to the sea urchin
RT P3A2 DNA binding protein.";
RL Mol. Cell. Biol. 13:3641-3649(1993).
RN [2] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, AND TISSUE
RP SPECIFICITY.
RC TISSUE=Head;
RX PubMed=10330140; DOI=10.1128/mcb.19.6.3998;
RA Koushika S.P., Soller M., DeSimone S.M., Daub D.M., White K.;
RT "Differential and inefficient splicing of a broadly expressed Drosophila
RT erect wing transcript results in tissue-specific enrichment of the vital
RT EWG protein isoform.";
RL Mol. Cell. Biol. 19:3998-4007(1999).
RN [3] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [4]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [5] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Oregon-R;
RX PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA Glover D.M.;
RT "From sequence to chromosome: the tip of the X chromosome of D.
RT melanogaster.";
RL Science 287:2220-2222(2000).
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
CC -!- FUNCTION: May function as a positive regulator of transcription in
CC developing and differentiated neurons, regulating common aspects of
CC neuronal differentiation and maintenance. Requirement in the CNS may be
CC higher than in the peripheral system. Vital for development of the
CC indirect flight muscles. {ECO:0000269|PubMed:8388540,
CC ECO:0000303|PubMed:8388540}.
CC -!- SUBUNIT: Homodimer. Binds DNA as a dimer (By similarity).
CC {ECO:0000250|UniProtKB:Q16656}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:8388540}. Note=Not in
CC nucleolus.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=7;
CC Comment=Additional isoforms seem to exist.;
CC Name=A; Synonyms=C;
CC IsoId=Q24312-1; Sequence=Displayed;
CC Name=B;
CC IsoId=Q24312-4; Sequence=VSP_003603;
CC Name=D;
CC IsoId=Q24312-5; Sequence=VSP_026518, VSP_026519;
CC Name=E;
CC IsoId=Q24312-6; Sequence=VSP_026520;
CC Name=F;
CC IsoId=Q24312-7; Sequence=VSP_026515;
CC Name=G;
CC IsoId=Q24312-8; Sequence=VSP_026515, VSP_026520;
CC Name=H;
CC IsoId=Q24312-9; Sequence=VSP_026516, VSP_026517;
CC -!- TISSUE SPECIFICITY: Isoform A is highly expressed in possibly all
CC embryonic neurons and is enriched in adult heads. Other isoforms show
CC similar expression at a much lower level. Transient expression in
CC migrating myoblasts. {ECO:0000269|PubMed:10330140,
CC ECO:0000269|PubMed:8388540}.
CC -!- DEVELOPMENTAL STAGE: Expressed throughout development, beginning at
CC embryonic stage 12 when levels steadily increase and then drop
CC dramatically at third-instar larvae. Levels increase in 24 hour pupae
CC and remain until adulthood. {ECO:0000269|PubMed:8388540}.
CC -!- SIMILARITY: Belongs to the NRF1/Ewg family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAL90347.1; Type=Miscellaneous discrepancy; Note=Unusual initiator. The initiator methionine is coded by a non-canonical CTG leucine codon.; Evidence={ECO:0000305};
CC Sequence=CAB43325.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L11345; AAA28478.1; -; mRNA.
DR EMBL; AF135590; AAD34460.1; -; Genomic_DNA.
DR EMBL; AE014298; AAF45491.5; -; Genomic_DNA.
DR EMBL; AE014298; AAF45492.5; -; Genomic_DNA.
DR EMBL; AE014298; AAX52466.1; -; Genomic_DNA.
DR EMBL; AE014298; AAX52467.1; -; Genomic_DNA.
DR EMBL; AE014298; AAX52468.1; -; Genomic_DNA.
DR EMBL; AE014298; AAX52469.1; -; Genomic_DNA.
DR EMBL; AE014298; AAX52470.1; -; Genomic_DNA.
DR EMBL; AE014298; AAX52471.1; -; Genomic_DNA.
DR EMBL; AL050231; CAB43325.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AY089609; AAL90347.1; ALT_SEQ; mRNA.
DR PIR; A48128; A48128.
DR RefSeq; NP_001014707.1; NM_001014707.2. [Q24312-8]
DR RefSeq; NP_001014708.1; NM_001014708.2. [Q24312-7]
DR RefSeq; NP_001014709.1; NM_001014709.2. [Q24312-6]
DR RefSeq; NP_001014710.1; NM_001014710.2. [Q24312-5]
DR RefSeq; NP_001014711.1; NM_001014711.2. [Q24312-1]
DR RefSeq; NP_001014712.1; NM_001014712.2. [Q24312-9]
DR RefSeq; NP_476892.4; NM_057544.4. [Q24312-4]
DR RefSeq; NP_726660.3; NM_166836.2. [Q24312-1]
DR AlphaFoldDB; Q24312; -.
DR BioGRID; 57548; 66.
DR IntAct; Q24312; 5.
DR STRING; 7227.FBpp0300529; -.
DR PaxDb; Q24312; -.
DR EnsemblMetazoa; FBtr0089441; FBpp0088456; FBgn0005427. [Q24312-4]
DR EnsemblMetazoa; FBtr0089442; FBpp0088457; FBgn0005427. [Q24312-1]
DR EnsemblMetazoa; FBtr0100577; FBpp0100032; FBgn0005427. [Q24312-1]
DR EnsemblMetazoa; FBtr0100578; FBpp0100033; FBgn0005427. [Q24312-5]
DR EnsemblMetazoa; FBtr0100579; FBpp0100034; FBgn0005427. [Q24312-6]
DR EnsemblMetazoa; FBtr0100580; FBpp0100035; FBgn0005427. [Q24312-7]
DR EnsemblMetazoa; FBtr0100581; FBpp0100036; FBgn0005427. [Q24312-8]
DR EnsemblMetazoa; FBtr0100582; FBpp0100037; FBgn0005427. [Q24312-9]
DR GeneID; 30975; -.
DR KEGG; dme:Dmel_CG3114; -.
DR CTD; 30975; -.
DR FlyBase; FBgn0005427; ewg.
DR VEuPathDB; VectorBase:FBgn0005427; -.
DR eggNOG; ENOG502QTK1; Eukaryota.
DR GeneTree; ENSGT00390000006835; -.
DR HOGENOM; CLU_018156_2_0_1; -.
DR InParanoid; Q24312; -.
DR SignaLink; Q24312; -.
DR BioGRID-ORCS; 30975; 0 hits in 1 CRISPR screen.
DR ChiTaRS; ewg; fly.
DR GenomeRNAi; 30975; -.
DR PRO; PR:Q24312; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0005427; Expressed in brain and 35 other tissues.
DR ExpressionAtlas; Q24312; baseline and differential.
DR Genevisible; Q24312; DM.
DR GO; GO:0005634; C:nucleus; IDA:UniProtKB.
DR GO; GO:0003677; F:DNA binding; NAS:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0007527; P:adult somatic muscle development; IGI:FlyBase.
DR GO; GO:0007417; P:central nervous system development; IMP:UniProtKB.
DR GO; GO:0007560; P:imaginal disc morphogenesis; IMP:UniProtKB.
DR GO; GO:0007517; P:muscle organ development; IMP:UniProtKB.
DR GO; GO:0045886; P:negative regulation of synaptic assembly at neuromuscular junction; IMP:FlyBase.
DR GO; GO:0090263; P:positive regulation of canonical Wnt signaling pathway; IGI:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IMP:FlyBase.
DR InterPro; IPR039142; NRF1/Ewg.
DR InterPro; IPR019525; Nrf1_NLS/DNA-bd_dimer.
DR PANTHER; PTHR20338; PTHR20338; 1.
DR Pfam; PF10491; Nrf1_DNA-bind; 1.
PE 2: Evidence at transcript level;
KW Activator; Alternative splicing; DNA-binding; Nucleus; Phosphoprotein;
KW Reference proteome; Transcription; Transcription regulation.
FT CHAIN 1..733
FT /note="DNA-binding protein Ewg"
FT /id="PRO_0000100213"
FT DNA_BIND 167..379
FT /evidence="ECO:0000250"
FT REGION 13..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 69..136
FT /note="Dimerization"
FT /evidence="ECO:0000250"
FT REGION 375..663
FT /note="Required for transcriptional activation"
FT /evidence="ECO:0000250"
FT MOTIF 146..174
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000250"
FT MOD_RES 97
FT /note="Phosphoserine; by CK2"
FT /evidence="ECO:0000250"
FT MOD_RES 110
FT /note="Phosphoserine; by CK2"
FT /evidence="ECO:0000250"
FT VAR_SEQ 387..540
FT /note="Missing (in isoform F and isoform G)"
FT /evidence="ECO:0000305"
FT /id="VSP_026515"
FT VAR_SEQ 387..427
FT /note="QPQQVNVVKINSAGTVITTHTAQSNTPAPTIIQSTNNQHVT -> LIVIELL
FT VISIIICIRHKLKKHTHITPRKRSSHRTPMVPYR (in isoform H)"
FT /evidence="ECO:0000305"
FT /id="VSP_026516"
FT VAR_SEQ 428..733
FT /note="Missing (in isoform H)"
FT /evidence="ECO:0000305"
FT /id="VSP_026517"
FT VAR_SEQ 541..581
FT /note="YTTQTVLSQNADGTVSLIQVDPNNPIITLPDGTTAQVQGVA -> LIVIELL
FT VISIIICIRHKLKKHTHITPRKRSSHRTPMVPYR (in isoform D)"
FT /evidence="ECO:0000305"
FT /id="VSP_026518"
FT VAR_SEQ 582..733
FT /note="Missing (in isoform D)"
FT /evidence="ECO:0000305"
FT /id="VSP_026519"
FT VAR_SEQ 669..733
FT /note="VENGDQLETITMSPGMHQMMIQGGPGQEPQLVQVVSLKDATLLSKAMEAINS
FT GNVKSEDTIIMEQ -> IGKLENFRGPLDLWRMATSWRPSPCRLECTR (in
FT isoform E and isoform G)"
FT /evidence="ECO:0000305"
FT /id="VSP_026520"
FT VAR_SEQ 670..733
FT /note="ENGDQLETITMSPGMHQMMIQGGPGQEPQLVQVVSLKDATLLSKAMEAINSG
FT NVKSEDTIIMEQ -> DTSINRSTSTAASSSSSLGNGGVQCYSLISAGSSVLSRRSGGV
FT IGHQVTPGERYYLATTTSSSLGLNNNNNCTLANNNNSVKMPIVLATPSIPQVANKRSTG
FT KRSTGVSGGDDVMVQKRGRPNSSSRSKSQLNANENAHSSGTSSSIRCVDSASLTNVQLQ
FT LPAIKLEHLG (in isoform B)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_003603"
FT CONFLICT 172..173
FT /note="KL -> NV (in Ref. 1; AAA28478 and 2; AAD34460)"
FT /evidence="ECO:0000305"
FT CONFLICT Q24312-4:777
FT /note="V -> A (in Ref. 6; AAL90347)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 733 AA; 77763 MW; A1CA6CD7AB3177A3 CRC64;
MATTSYRLVV APAGSQRSST GNVVVTTTSS GSHSSNGANG GTGGTSAGSS TLGSGLNVTT
ITATSGGQLQ SAGNTSQSNG TTYKIEMLEE DIQSLGSDDD DEDLISSDGS LYEGDLGSMP
VNDDVAHQLA AAGPVGVAAA AAIASSKKRK RPHCFETNPS VRKRQQNRLL RKLRAIIYEF
TGRVGKQAVV LVATPGKPNT SYKVFGAKPL EDVLRNLKNI VMDELDNALA QQAPPPPQDD
PSLFELPGLV IDGIPTPVEK MTQAQLRAFI PLMLKYSTGR GKPGWGREST RPPWWPKELP
WANVRMDARS EDDKQKISWT HALRKIVINC YKYHGREDLL PTFADDEDKV NALISQSGDE
DEDMELSNPP TIHTVTTMTP PTGNSNQPQQ VNVVKINSAG TVITTHTAQS NTPAPTIIQS
TNNQHVTTTA TLPASTKIEI CQAPAQNQQH HQHHQTHLPN AVHIQPVAGG QPQTIQLTTA
SGTATATAVQ TTAAAVSAAQ AHAHSQSQAH SQSSANQTVT AQQIANAQVC IEPITLSDVD
YTTQTVLSQN ADGTVSLIQV DPNNPIITLP DGTTAQVQGV ATLHQGEGGA TIQTVQSLTD
VNGHENMTVD LTETQDGQIY ITTEDGQGYP VSVSNVISVP VSMYQSVMAN VQQIQTNSDG
TVCLAPMQVE NGDQLETITM SPGMHQMMIQ GGPGQEPQLV QVVSLKDATL LSKAMEAINS
GNVKSEDTII MEQ