位置:首页 > 蛋白库 > EWG_DROME
EWG_DROME
ID   EWG_DROME               Reviewed;         733 AA.
AC   Q24312; A4V3R8; Q59E73; Q59E74; Q59E75; Q59E76; Q59E77; Q8SXJ1; Q9NF76;
AC   Q9W5G8; Q9W5G9;
DT   15-NOV-2002, integrated into UniProtKB/Swiss-Prot.
DT   01-OCT-2001, sequence version 2.
DT   03-AUG-2022, entry version 170.
DE   RecName: Full=DNA-binding protein Ewg;
DE   AltName: Full=Protein erect wing;
GN   Name=ewg; ORFNames=CG3114;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227 {ECO:0000312|EMBL:AAL90347.1};
RN   [1] {ECO:0000305}
RP   NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A), FUNCTION, SUBCELLULAR LOCATION,
RP   TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE.
RC   STRAIN=Canton-S; TISSUE=Head;
RX   PubMed=8388540; DOI=10.1128/mcb.13.6.3641-3649.1993;
RA   DeSimone S.M., White K.;
RT   "The Drosophila erect wing gene, which is important for both neuronal and
RT   muscle development, encodes a protein which is similar to the sea urchin
RT   P3A2 DNA binding protein.";
RL   Mol. Cell. Biol. 13:3641-3649(1993).
RN   [2] {ECO:0000305}
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, AND TISSUE
RP   SPECIFICITY.
RC   TISSUE=Head;
RX   PubMed=10330140; DOI=10.1128/mcb.19.6.3998;
RA   Koushika S.P., Soller M., DeSimone S.M., Daub D.M., White K.;
RT   "Differential and inefficient splicing of a broadly expressed Drosophila
RT   erect wing transcript results in tissue-specific enrichment of the vital
RT   EWG protein isoform.";
RL   Mol. Cell. Biol. 19:3998-4007(1999).
RN   [3] {ECO:0000305}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA   An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA   Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA   Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA   Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA   Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA   Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA   Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA   Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [4]
RP   GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC   STRAIN=Berkeley;
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [5] {ECO:0000305}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Oregon-R;
RX   PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA   Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA   Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA   Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA   Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA   Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA   Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA   Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA   McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA   Glover D.M.;
RT   "From sequence to chromosome: the tip of the X chromosome of D.
RT   melanogaster.";
RL   Science 287:2220-2222(2000).
RN   [6]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
RC   STRAIN=Berkeley; TISSUE=Embryo;
RX   PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA   Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA   Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA   Celniker S.E.;
RT   "A Drosophila full-length cDNA resource.";
RL   Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
CC   -!- FUNCTION: May function as a positive regulator of transcription in
CC       developing and differentiated neurons, regulating common aspects of
CC       neuronal differentiation and maintenance. Requirement in the CNS may be
CC       higher than in the peripheral system. Vital for development of the
CC       indirect flight muscles. {ECO:0000269|PubMed:8388540,
CC       ECO:0000303|PubMed:8388540}.
CC   -!- SUBUNIT: Homodimer. Binds DNA as a dimer (By similarity).
CC       {ECO:0000250|UniProtKB:Q16656}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:8388540}. Note=Not in
CC       nucleolus.
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=7;
CC         Comment=Additional isoforms seem to exist.;
CC       Name=A; Synonyms=C;
CC         IsoId=Q24312-1; Sequence=Displayed;
CC       Name=B;
CC         IsoId=Q24312-4; Sequence=VSP_003603;
CC       Name=D;
CC         IsoId=Q24312-5; Sequence=VSP_026518, VSP_026519;
CC       Name=E;
CC         IsoId=Q24312-6; Sequence=VSP_026520;
CC       Name=F;
CC         IsoId=Q24312-7; Sequence=VSP_026515;
CC       Name=G;
CC         IsoId=Q24312-8; Sequence=VSP_026515, VSP_026520;
CC       Name=H;
CC         IsoId=Q24312-9; Sequence=VSP_026516, VSP_026517;
CC   -!- TISSUE SPECIFICITY: Isoform A is highly expressed in possibly all
CC       embryonic neurons and is enriched in adult heads. Other isoforms show
CC       similar expression at a much lower level. Transient expression in
CC       migrating myoblasts. {ECO:0000269|PubMed:10330140,
CC       ECO:0000269|PubMed:8388540}.
CC   -!- DEVELOPMENTAL STAGE: Expressed throughout development, beginning at
CC       embryonic stage 12 when levels steadily increase and then drop
CC       dramatically at third-instar larvae. Levels increase in 24 hour pupae
CC       and remain until adulthood. {ECO:0000269|PubMed:8388540}.
CC   -!- SIMILARITY: Belongs to the NRF1/Ewg family. {ECO:0000305}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAL90347.1; Type=Miscellaneous discrepancy; Note=Unusual initiator. The initiator methionine is coded by a non-canonical CTG leucine codon.; Evidence={ECO:0000305};
CC       Sequence=CAB43325.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; L11345; AAA28478.1; -; mRNA.
DR   EMBL; AF135590; AAD34460.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAF45491.5; -; Genomic_DNA.
DR   EMBL; AE014298; AAF45492.5; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52466.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52467.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52468.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52469.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52470.1; -; Genomic_DNA.
DR   EMBL; AE014298; AAX52471.1; -; Genomic_DNA.
DR   EMBL; AL050231; CAB43325.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; AY089609; AAL90347.1; ALT_SEQ; mRNA.
DR   PIR; A48128; A48128.
DR   RefSeq; NP_001014707.1; NM_001014707.2. [Q24312-8]
DR   RefSeq; NP_001014708.1; NM_001014708.2. [Q24312-7]
DR   RefSeq; NP_001014709.1; NM_001014709.2. [Q24312-6]
DR   RefSeq; NP_001014710.1; NM_001014710.2. [Q24312-5]
DR   RefSeq; NP_001014711.1; NM_001014711.2. [Q24312-1]
DR   RefSeq; NP_001014712.1; NM_001014712.2. [Q24312-9]
DR   RefSeq; NP_476892.4; NM_057544.4. [Q24312-4]
DR   RefSeq; NP_726660.3; NM_166836.2. [Q24312-1]
DR   AlphaFoldDB; Q24312; -.
DR   BioGRID; 57548; 66.
DR   IntAct; Q24312; 5.
DR   STRING; 7227.FBpp0300529; -.
DR   PaxDb; Q24312; -.
DR   EnsemblMetazoa; FBtr0089441; FBpp0088456; FBgn0005427. [Q24312-4]
DR   EnsemblMetazoa; FBtr0089442; FBpp0088457; FBgn0005427. [Q24312-1]
DR   EnsemblMetazoa; FBtr0100577; FBpp0100032; FBgn0005427. [Q24312-1]
DR   EnsemblMetazoa; FBtr0100578; FBpp0100033; FBgn0005427. [Q24312-5]
DR   EnsemblMetazoa; FBtr0100579; FBpp0100034; FBgn0005427. [Q24312-6]
DR   EnsemblMetazoa; FBtr0100580; FBpp0100035; FBgn0005427. [Q24312-7]
DR   EnsemblMetazoa; FBtr0100581; FBpp0100036; FBgn0005427. [Q24312-8]
DR   EnsemblMetazoa; FBtr0100582; FBpp0100037; FBgn0005427. [Q24312-9]
DR   GeneID; 30975; -.
DR   KEGG; dme:Dmel_CG3114; -.
DR   CTD; 30975; -.
DR   FlyBase; FBgn0005427; ewg.
DR   VEuPathDB; VectorBase:FBgn0005427; -.
DR   eggNOG; ENOG502QTK1; Eukaryota.
DR   GeneTree; ENSGT00390000006835; -.
DR   HOGENOM; CLU_018156_2_0_1; -.
DR   InParanoid; Q24312; -.
DR   SignaLink; Q24312; -.
DR   BioGRID-ORCS; 30975; 0 hits in 1 CRISPR screen.
DR   ChiTaRS; ewg; fly.
DR   GenomeRNAi; 30975; -.
DR   PRO; PR:Q24312; -.
DR   Proteomes; UP000000803; Chromosome X.
DR   Bgee; FBgn0005427; Expressed in brain and 35 other tissues.
DR   ExpressionAtlas; Q24312; baseline and differential.
DR   Genevisible; Q24312; DM.
DR   GO; GO:0005634; C:nucleus; IDA:UniProtKB.
DR   GO; GO:0003677; F:DNA binding; NAS:UniProtKB.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0007527; P:adult somatic muscle development; IGI:FlyBase.
DR   GO; GO:0007417; P:central nervous system development; IMP:UniProtKB.
DR   GO; GO:0007560; P:imaginal disc morphogenesis; IMP:UniProtKB.
DR   GO; GO:0007517; P:muscle organ development; IMP:UniProtKB.
DR   GO; GO:0045886; P:negative regulation of synaptic assembly at neuromuscular junction; IMP:FlyBase.
DR   GO; GO:0090263; P:positive regulation of canonical Wnt signaling pathway; IGI:FlyBase.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IMP:FlyBase.
DR   InterPro; IPR039142; NRF1/Ewg.
DR   InterPro; IPR019525; Nrf1_NLS/DNA-bd_dimer.
DR   PANTHER; PTHR20338; PTHR20338; 1.
DR   Pfam; PF10491; Nrf1_DNA-bind; 1.
PE   2: Evidence at transcript level;
KW   Activator; Alternative splicing; DNA-binding; Nucleus; Phosphoprotein;
KW   Reference proteome; Transcription; Transcription regulation.
FT   CHAIN           1..733
FT                   /note="DNA-binding protein Ewg"
FT                   /id="PRO_0000100213"
FT   DNA_BIND        167..379
FT                   /evidence="ECO:0000250"
FT   REGION          13..51
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          69..136
FT                   /note="Dimerization"
FT                   /evidence="ECO:0000250"
FT   REGION          375..663
FT                   /note="Required for transcriptional activation"
FT                   /evidence="ECO:0000250"
FT   MOTIF           146..174
FT                   /note="Nuclear localization signal"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         97
FT                   /note="Phosphoserine; by CK2"
FT                   /evidence="ECO:0000250"
FT   MOD_RES         110
FT                   /note="Phosphoserine; by CK2"
FT                   /evidence="ECO:0000250"
FT   VAR_SEQ         387..540
FT                   /note="Missing (in isoform F and isoform G)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026515"
FT   VAR_SEQ         387..427
FT                   /note="QPQQVNVVKINSAGTVITTHTAQSNTPAPTIIQSTNNQHVT -> LIVIELL
FT                   VISIIICIRHKLKKHTHITPRKRSSHRTPMVPYR (in isoform H)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026516"
FT   VAR_SEQ         428..733
FT                   /note="Missing (in isoform H)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026517"
FT   VAR_SEQ         541..581
FT                   /note="YTTQTVLSQNADGTVSLIQVDPNNPIITLPDGTTAQVQGVA -> LIVIELL
FT                   VISIIICIRHKLKKHTHITPRKRSSHRTPMVPYR (in isoform D)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026518"
FT   VAR_SEQ         582..733
FT                   /note="Missing (in isoform D)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026519"
FT   VAR_SEQ         669..733
FT                   /note="VENGDQLETITMSPGMHQMMIQGGPGQEPQLVQVVSLKDATLLSKAMEAINS
FT                   GNVKSEDTIIMEQ -> IGKLENFRGPLDLWRMATSWRPSPCRLECTR (in
FT                   isoform E and isoform G)"
FT                   /evidence="ECO:0000305"
FT                   /id="VSP_026520"
FT   VAR_SEQ         670..733
FT                   /note="ENGDQLETITMSPGMHQMMIQGGPGQEPQLVQVVSLKDATLLSKAMEAINSG
FT                   NVKSEDTIIMEQ -> DTSINRSTSTAASSSSSLGNGGVQCYSLISAGSSVLSRRSGGV
FT                   IGHQVTPGERYYLATTTSSSLGLNNNNNCTLANNNNSVKMPIVLATPSIPQVANKRSTG
FT                   KRSTGVSGGDDVMVQKRGRPNSSSRSKSQLNANENAHSSGTSSSIRCVDSASLTNVQLQ
FT                   LPAIKLEHLG (in isoform B)"
FT                   /evidence="ECO:0000303|PubMed:12537569"
FT                   /id="VSP_003603"
FT   CONFLICT        172..173
FT                   /note="KL -> NV (in Ref. 1; AAA28478 and 2; AAD34460)"
FT                   /evidence="ECO:0000305"
FT   CONFLICT        Q24312-4:777
FT                   /note="V -> A (in Ref. 6; AAL90347)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   733 AA;  77763 MW;  A1CA6CD7AB3177A3 CRC64;
     MATTSYRLVV APAGSQRSST GNVVVTTTSS GSHSSNGANG GTGGTSAGSS TLGSGLNVTT
     ITATSGGQLQ SAGNTSQSNG TTYKIEMLEE DIQSLGSDDD DEDLISSDGS LYEGDLGSMP
     VNDDVAHQLA AAGPVGVAAA AAIASSKKRK RPHCFETNPS VRKRQQNRLL RKLRAIIYEF
     TGRVGKQAVV LVATPGKPNT SYKVFGAKPL EDVLRNLKNI VMDELDNALA QQAPPPPQDD
     PSLFELPGLV IDGIPTPVEK MTQAQLRAFI PLMLKYSTGR GKPGWGREST RPPWWPKELP
     WANVRMDARS EDDKQKISWT HALRKIVINC YKYHGREDLL PTFADDEDKV NALISQSGDE
     DEDMELSNPP TIHTVTTMTP PTGNSNQPQQ VNVVKINSAG TVITTHTAQS NTPAPTIIQS
     TNNQHVTTTA TLPASTKIEI CQAPAQNQQH HQHHQTHLPN AVHIQPVAGG QPQTIQLTTA
     SGTATATAVQ TTAAAVSAAQ AHAHSQSQAH SQSSANQTVT AQQIANAQVC IEPITLSDVD
     YTTQTVLSQN ADGTVSLIQV DPNNPIITLP DGTTAQVQGV ATLHQGEGGA TIQTVQSLTD
     VNGHENMTVD LTETQDGQIY ITTEDGQGYP VSVSNVISVP VSMYQSVMAN VQQIQTNSDG
     TVCLAPMQVE NGDQLETITM SPGMHQMMIQ GGPGQEPQLV QVVSLKDATL LSKAMEAINS
     GNVKSEDTII MEQ
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024