ARA_DROME
ID ARA_DROME Reviewed; 717 AA.
AC Q24248; Q9VTZ9;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 14-AUG-2001, sequence version 2.
DT 03-AUG-2022, entry version 175.
DE RecName: Full=Homeobox protein araucan;
GN Name=ara; ORFNames=CG10571;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC TISSUE=Larva;
RX PubMed=8620542; DOI=10.1016/s0092-8674(00)81085-5;
RA Gomez-Skarmeta J.-L., del Corral R.D., de la Calle-Mustienes E.,
RA Ferres-Marco D., Modolell J.;
RT "Araucan and caupolican, two members of the novel iroquois complex, encode
RT homeoproteins that control proneural and vein-forming genes.";
RL Cell 85:95-105(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-336, AND IDENTIFICATION BY
RP MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Controls proneural and vein forming genes. Positive
CC transcriptional controler of AC-SC (achaete-scute). May act as an
CC activator that interacts with the transcriptional complex assembled on
CC the AC and SC promoters and participates in transcription initiation.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- MISCELLANEOUS: 'Araucan' is named after the Araucanian American-Indian
CC tribe, also called Mohawks, who shaved all but a medial stripe of hairs
CC on the head.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X95179; CAA64486.1; -; mRNA.
DR EMBL; AE014296; AAF49896.1; -; Genomic_DNA.
DR RefSeq; NP_524045.2; NM_079321.3.
DR AlphaFoldDB; Q24248; -.
DR SMR; Q24248; -.
DR BioGRID; 64788; 19.
DR DIP; DIP-18459N; -.
DR IntAct; Q24248; 7.
DR STRING; 7227.FBpp0075640; -.
DR iPTMnet; Q24248; -.
DR PaxDb; Q24248; -.
DR EnsemblMetazoa; FBtr0075908; FBpp0075640; FBgn0015904.
DR GeneID; 39439; -.
DR KEGG; dme:Dmel_CG10571; -.
DR CTD; 39439; -.
DR FlyBase; FBgn0015904; ara.
DR VEuPathDB; VectorBase:FBgn0015904; -.
DR eggNOG; KOG0773; Eukaryota.
DR GeneTree; ENSGT00940000165426; -.
DR InParanoid; Q24248; -.
DR OrthoDB; 814237at2759; -.
DR PhylomeDB; Q24248; -.
DR SignaLink; Q24248; -.
DR BioGRID-ORCS; 39439; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 39439; -.
DR PRO; PR:Q24248; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0015904; Expressed in embryonic epidermis (Drosophila) and 35 other tissues.
DR ExpressionAtlas; Q24248; baseline and differential.
DR Genevisible; Q24248; DM.
DR GO; GO:0005737; C:cytoplasm; HDA:FlyBase.
DR GO; GO:0005634; C:nucleus; ISS:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IDA:FlyBase.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048468; P:cell development; IBA:GO_Central.
DR GO; GO:0001745; P:compound eye morphogenesis; IMP:FlyBase.
DR GO; GO:0048813; P:dendrite morphogenesis; IMP:FlyBase.
DR GO; GO:0045317; P:equator specification; TAS:FlyBase.
DR GO; GO:0007476; P:imaginal disc-derived wing morphogenesis; IMP:FlyBase.
DR GO; GO:0007474; P:imaginal disc-derived wing vein specification; IGI:FlyBase.
DR GO; GO:0042693; P:muscle cell fate commitment; IGI:FlyBase.
DR GO; GO:0045926; P:negative regulation of growth; IMP:FlyBase.
DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central.
DR GO; GO:0035310; P:notum cell fate specification; IMP:FlyBase.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IMP:FlyBase.
DR GO; GO:0045893; P:positive regulation of transcription, DNA-templated; NAS:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 1: Evidence at protein level;
KW Activator; Developmental protein; DNA-binding; Homeobox; Nucleus;
KW Phosphoprotein; Reference proteome; Transcription;
KW Transcription regulation.
FT CHAIN 1..717
FT /note="Homeobox protein araucan"
FT /id="PRO_0000048817"
FT DNA_BIND 255..317
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 46..80
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 94..130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 317..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 395..418
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 478..516
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 549..615
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 675..717
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..67
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 317..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 487..514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 684..704
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 336
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT CONFLICT 130
FT /note="Missing (in Ref. 1; CAA64486)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 717 AA; 75422 MW; BFF09BCB7EF7C711 CRC64;
MAAYTQFGYG GFPSASQLLP PSVQTTEDAS ANVNVNVNEA LVMTNAPAMS PTGGQDCQGS
QPSGGAGGDA SSGALSPNAL SQNSNAATVV GAGGGSSAGG GGPADLATGG SLDGNGVGTT
PTAGGAGGGG SCCENGRPIM TDPVSGQTVC SCQYDSARLA LSSYSRLPAA SVGVYGTPYP
STDQNPYQSI GVDSSAFYSP LSNPYGLKDT GAGPEMGAWT SAGLQPTTGY YSYDPMSAYG
GLLVSNSSYG ASYDLAARRK NATRESTATL KAWLNEHKKN PYPTKGEKIM LAIITKMTLT
QVSTWFANAR RRLKKENKMT WEPKNRTDDD DDALVSDDEK DKEDLEPSKG SQGSVSLAKD
ETKEEEDAID EDQKCLGQAN ILRAGFGYPS AGSGSGGYPG GGGSSSGHPG GYHPYHHQHP
AYYQAGQQGG MLPFHGENSK LQTDLGDPKN QLGRDCGVPI PATKPKIWSL ADTVGCKTPP
PAYMGHQSMP LQQQQQQQQQ QQQAQHQYPP SEAGRDQQLF NGAAAPYLRP HTTAYGGFLG
ATTQQLHTTN NSIPYSNMPP QQQQPQQQQQ QLQQGGTIHT TGSSSGPIIP LQFHNRHPQQ
QQQLQQQSQS TASQRAMGFL EAQPDTPPQT PPNMKVLSGA LSLLPTATQV PMTATCRSSN
AFGFPASGYP MNFSARLGEY SPRDDYSSGN SSSSSSSSPQ LQRNEAMFKP LFKKFTN