AWH_DROME
ID AWH_DROME Reviewed; 275 AA.
AC Q8IRC7; O18547; Q1LYZ6; Q8SZ10; Q9VZM0;
DT 12-APR-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 146.
DE RecName: Full=LIM/homeobox protein Awh;
DE AltName: Full=Protein arrowhead;
GN Name=Awh {ECO:0000312|EMBL:AAN11572.1}; ORFNames=CG1072;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1] {ECO:0000305, ECO:0000312|EMBL:AAB71337.1}
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A), FUNCTION, TISSUE SPECIFICITY,
RP DEVELOPMENTAL STAGE, AND MUTAGENESIS OF CYS-57; LEU-88 AND VAL-117.
RC STRAIN=Oregon-R {ECO:0000312|EMBL:AAB71337.1};
RC TISSUE=Embryo {ECO:0000269|PubMed:9331336};
RX PubMed=9331336; DOI=10.1006/dbio.1997.8659;
RA Curtiss J., Heilig J.S.;
RT "Arrowhead encodes a LIM homeodomain protein that distinguishes subsets of
RT Drosophila imaginal cells.";
RL Dev. Biol. 190:129-141(1997).
RN [2] {ECO:0000312|EMBL:AAN11572.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000269|PubMed:10731132};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3] {ECO:0000312|EMBL:AAN11572.1}
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000305, ECO:0000312|EMBL:AAL48819.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
RC STRAIN=Berkeley {ECO:0000312|EMBL:AAL48819.1};
RC TISSUE=Embryo {ECO:0000269|PubMed:12537569};
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
RC STRAIN=Berkeley;
RA Stapleton M., Carlson J.W., Frise E., Kapadia B., Park S., Wan K.H., Yu C.,
RA Celniker S.E.;
RL Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-126, AND IDENTIFICATION BY
RP MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Probable transcription factor. Required for the establishment
CC of a subset of imaginal tissues: the abdominal histoblasts and the
CC salivary gland imaginal rings. {ECO:0000269|PubMed:9331336,
CC ECO:0000303|PubMed:9331336}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=B {ECO:0000303|PubMed:10731132};
CC IsoId=Q8IRC7-1; Sequence=Displayed;
CC Name=A {ECO:0000269|PubMed:9331336};
CC IsoId=Q8IRC7-2; Sequence=VSP_051714, VSP_051715;
CC -!- TISSUE SPECIFICITY: First detected in neuroblasts in stage 9 embryos.
CC Expressed in all 10 abdominal segments and in the labial segment during
CC early embryogenesis. Expressed in the stage 14 developing epithelium.
CC By embryonic stage 16, expression is refined to the abdominal
CC histoblasts and salivary gland imaginal ring cells. Expressed in both
CC larval and imaginal cells between the salivary gland and the salivary
CC gland imaginal ring, in late third instar larvae. Also expressed in
CC specific areas of the larval wing, leg and eye-antennal disks.
CC {ECO:0000269|PubMed:9331336}.
CC -!- DEVELOPMENTAL STAGE: Expressed in all stages of zygotic development,
CC with highest levels of expression during embryonic and early pupal
CC stages, and lower levels in larval and adult stages.
CC {ECO:0000269|PubMed:9331336}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U82539; AAB71337.1; -; mRNA.
DR EMBL; AE014296; AAF47800.2; -; Genomic_DNA.
DR EMBL; AE014296; AAN11572.1; -; Genomic_DNA.
DR EMBL; AY071197; AAL48819.1; -; mRNA.
DR EMBL; BT025230; ABF17921.1; -; mRNA.
DR RefSeq; NP_523907.2; NM_079183.3. [Q8IRC7-2]
DR RefSeq; NP_728906.1; NM_168042.2. [Q8IRC7-1]
DR AlphaFoldDB; Q8IRC7; -.
DR SMR; Q8IRC7; -.
DR BioGRID; 63939; 9.
DR IntAct; Q8IRC7; 17.
DR STRING; 7227.FBpp0073014; -.
DR iPTMnet; Q8IRC7; -.
DR PaxDb; Q8IRC7; -.
DR DNASU; 38451; -.
DR EnsemblMetazoa; FBtr0073155; FBpp0073013; FBgn0013751. [Q8IRC7-2]
DR EnsemblMetazoa; FBtr0073156; FBpp0073014; FBgn0013751. [Q8IRC7-1]
DR GeneID; 38451; -.
DR KEGG; dme:Dmel_CG1072; -.
DR UCSC; CG1072-RA; d. melanogaster. [Q8IRC7-1]
DR CTD; 38451; -.
DR FlyBase; FBgn0013751; Awh.
DR VEuPathDB; VectorBase:FBgn0013751; -.
DR eggNOG; KOG0490; Eukaryota.
DR GeneTree; ENSGT00940000172226; -.
DR HOGENOM; CLU_027802_7_0_1; -.
DR InParanoid; Q8IRC7; -.
DR OMA; DFGRHIN; -.
DR PhylomeDB; Q8IRC7; -.
DR BioGRID-ORCS; 38451; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 38451; -.
DR PRO; PR:Q8IRC7; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0013751; Expressed in atrium (Drosophila) and 47 other tissues.
DR ExpressionAtlas; Q8IRC7; baseline and differential.
DR Genevisible; Q8IRC7; DM.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; ISS:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048749; P:compound eye development; IMP:FlyBase.
DR GO; GO:0007444; P:imaginal disc development; IMP:UniProtKB.
DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central.
DR GO; GO:0010468; P:regulation of gene expression; IMP:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:UniProtKB.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001781; Znf_LIM.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00412; LIM; 2.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00132; LIM; 2.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00478; LIM_DOMAIN_1; 2.
DR PROSITE; PS50023; LIM_DOMAIN_2; 2.
PE 1: Evidence at protein level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW LIM domain; Metal-binding; Nucleus; Phosphoprotein; Reference proteome;
KW Repeat; Transcription; Transcription regulation; Zinc.
FT CHAIN 1..275
FT /note="LIM/homeobox protein Awh"
FT /id="PRO_0000075705"
FT DOMAIN 6..67
FT /note="LIM zinc-binding 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00125"
FT DOMAIN 68..129
FT /note="LIM zinc-binding 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00125"
FT DNA_BIND 148..207
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 253..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 126
FT /note="Phosphothreonine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 214
FT /note="I -> K (in isoform A)"
FT /evidence="ECO:0000303|PubMed:9331336"
FT /id="VSP_051714"
FT VAR_SEQ 215..275
FT /note="Missing (in isoform A)"
FT /evidence="ECO:0000303|PubMed:9331336"
FT /id="VSP_051715"
FT MUTAGEN 57
FT /note="C->Y: In allele 11; may cause loss of zinc binding."
FT /evidence="ECO:0000269|PubMed:9331336"
FT MUTAGEN 88
FT /note="L->T: In allele 13; may alter structure of LIM
FT domain."
FT /evidence="ECO:0000269|PubMed:9331336"
FT MUTAGEN 117
FT /note="V->E: In allele 17; may alter structure of LIM
FT domain."
FT /evidence="ECO:0000269|PubMed:9331336"
FT CONFLICT 72
FT /note="C -> S (in Ref. 1; AAB71337)"
FT /evidence="ECO:0000305"
FT CONFLICT 175
FT /note="G -> D (in Ref. 4; AAL48819)"
FT /evidence="ECO:0000305"
FT CONFLICT 193
FT /note="Q -> E (in Ref. 1; AAB71337)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 275 AA; 31032 MW; 643EBC0E9B519E56 CRC64;
MKTELRSCAA CGEPISDRFF LEVGGCSWHA HCLRCCMCMC PLDRQQSCFI RERQVYCKAD
YSKNFGAKCS KCCRGISASD WVRRARELVF HLACFACDQC GRQLSTGEQF ALMDDRVLCK
AHYLETVEGG TTSSDEGCDG DGYHKSKTKR VRTTFTEEQL QVLQANFQID SNPDGQDLER
IASVTGLSKR VTQVWFQNSR ARQKKHIHAG KNKIREPEGS SFARHINLQL TYSFQNNAQN
PMHLNGSKAG LYPTHESSMD ELSQDSSVHC MPSEV