CAUP_DROME
ID CAUP_DROME Reviewed; 693 AA.
AC P54269; Q5BIH8; Q5U1A6; Q8MR03; Q9VU00;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 14-AUG-2001, sequence version 2.
DT 03-AUG-2022, entry version 170.
DE RecName: Full=Homeobox protein caupolican;
GN Name=caup; ORFNames=CG10605;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], AND FUNCTION.
RC TISSUE=Larva;
RX PubMed=8620542; DOI=10.1016/s0092-8674(00)81085-5;
RA Gomez-Skarmeta J.-L., del Corral R.D., de la Calle-Mustienes E.,
RA Ferres-Marco D., Modolell J.;
RT "Araucan and caupolican, two members of the novel iroquois complex, encode
RT homeoproteins that control proneural and vein-forming genes.";
RL Cell 85:95-105(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley; TISSUE=Embryo;
RA Stapleton M., Carlson J.W., Chavez C., Frise E., George R.A., Pacleb J.M.,
RA Park S., Wan K.H., Yu C., Rubin G.M., Celniker S.E.;
RL Submitted (JUN-2006) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 369-693.
RC STRAIN=Berkeley; TISSUE=Larva, and Pupae;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
CC -!- FUNCTION: Controls proneural and vein forming genes. Positive
CC transcriptional controler of ac-sc (achaete-scute). May act as an
CC activator that interacts with the transcriptional complex assembled on
CC the ac and sc promoters and participates in transcription initiation.
CC {ECO:0000269|PubMed:8620542}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- MISCELLANEOUS: 'Caupolican' is named after the Araucanian American-
CC Indian tribe, also called mohawks, who shaved all but a medial stripe
CC of hairs on the head.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X95178; CAA64485.1; -; mRNA.
DR EMBL; AE014296; AAF49895.1; -; Genomic_DNA.
DR EMBL; BT015986; AAV36871.1; -; mRNA.
DR EMBL; BT021246; AAX33394.1; -; mRNA.
DR EMBL; AY122206; AAM52718.1; -; mRNA.
DR RefSeq; NP_524046.2; NM_079322.3.
DR AlphaFoldDB; P54269; -.
DR SMR; P54269; -.
DR BioGRID; 64789; 15.
DR DIP; DIP-18539N; -.
DR IntAct; P54269; 14.
DR STRING; 7227.FBpp0075641; -.
DR PaxDb; P54269; -.
DR DNASU; 39440; -.
DR EnsemblMetazoa; FBtr0075909; FBpp0075641; FBgn0015919.
DR GeneID; 39440; -.
DR KEGG; dme:Dmel_CG10605; -.
DR CTD; 39440; -.
DR FlyBase; FBgn0015919; caup.
DR VEuPathDB; VectorBase:FBgn0015919; -.
DR eggNOG; KOG0773; Eukaryota.
DR GeneTree; ENSGT00940000165426; -.
DR HOGENOM; CLU_019586_0_0_1; -.
DR InParanoid; P54269; -.
DR OMA; HPMHAAY; -.
DR OrthoDB; 814237at2759; -.
DR PhylomeDB; P54269; -.
DR SignaLink; P54269; -.
DR BioGRID-ORCS; 39440; 0 hits in 3 CRISPR screens.
DR ChiTaRS; caup; fly.
DR GenomeRNAi; 39440; -.
DR PRO; PR:P54269; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0015919; Expressed in wing disc and 32 other tissues.
DR Genevisible; P54269; DM.
DR GO; GO:0005634; C:nucleus; ISS:FlyBase.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; ISS:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISS:FlyBase.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048468; P:cell development; IBA:GO_Central.
DR GO; GO:0001745; P:compound eye morphogenesis; IMP:FlyBase.
DR GO; GO:0045317; P:equator specification; TAS:FlyBase.
DR GO; GO:0007474; P:imaginal disc-derived wing vein specification; IGI:FlyBase.
DR GO; GO:0042693; P:muscle cell fate commitment; IGI:FlyBase.
DR GO; GO:0045926; P:negative regulation of growth; IMP:FlyBase.
DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; ISS:FlyBase.
DR GO; GO:0007346; P:regulation of mitotic cell cycle; IGI:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR003893; Iroquois_homeo.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00548; IRO; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Activator; Developmental protein; DNA-binding; Homeobox; Nucleus;
KW Reference proteome; Transcription; Transcription regulation.
FT CHAIN 1..693
FT /note="Homeobox protein caupolican"
FT /id="PRO_0000048845"
FT DNA_BIND 226..288
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 20..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 288..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 387..453
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 480..538
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 561..627
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 648..693
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 20..74
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..302
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 387..422
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 483..515
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 603..627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..669
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 106
FT /note="C -> R (in Ref. 1; CAA64485)"
FT /evidence="ECO:0000305"
FT CONFLICT 316
FT /note="G -> A (in Ref. 1; CAA64485)"
FT /evidence="ECO:0000305"
FT CONFLICT 518
FT /note="H -> N (in Ref. 4; AAV36871)"
FT /evidence="ECO:0000305"
FT CONFLICT 678
FT /note="G -> A (in Ref. 1; CAA64485)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 693 AA; 73668 MW; FBEB1616493F7EC9 CRC64;
MAAYAQFGYA GYPTANQLTT ANTDSQSGHG GGSPLSGTNE ASLSPSGGST ATGLTAGPLS
PGAVSQSSHH AGHKGLSTSP AEDVVGGDVP VGLSSAAQDL PSRGSCCENG RPIITDPVSG
QTVCSCQYDP ARLAIGGYSR MALPSGGVGV GVYGGPYPSN EQNPYPSIGV DNSAFYAPLS
NPYGIKDTSP STEMSAWTSA SLQSTTGYYS YDPTLAAYGY GPNYDLAARR KNATRESTAT
LKAWLSEHKK NPYPTKGEKI MLAIITKMTL TQVSTWFANA RRRLKKENKM TWEPKNKTED
DDDGMMSDDE KEKDAGDGGK LSTEAFDPGN QLIKSELGKA EKEVDSSGDQ KLDLDREPHN
LVAMRGLAPY ATPPGAHPMH AAYSSYAQSH NTHTHPHPQQ MQHHQQQQQQ QQNQQQLQHH
QMDQPYYHPG GYGQEESGEF AAQKNPLSRD CGIPVPASKP KIWSVADTAA CKTPPPTAAY
LGQNFYPPSS ADQQLPHQPL QQHQQQQLQQ LQQQQQHHHH PHHHHPHHSM ELGSPLSMMS
SYAGGSPYSR IPTAYTEAMG MHLPSSSSSS SSTGKLPPTH IHPAPQRVGF PEIQPDTPPQ
TPPTMKLNSS GGSSSSSGSS HSSSMHSVTP VTVASMVNIL YSNTDSGYGH GHSHGHGHGH
GHGLGHGHGL GHGHGHMGVT SNAYLTEGGR SGS