GSC_DROME
ID GSC_DROME Reviewed; 415 AA.
AC P54366; Q9VPR9;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 16-MAR-2016, sequence version 2.
DT 03-AUG-2022, entry version 173.
DE RecName: Full=Homeobox protein goosecoid;
GN Name=Gsc; ORFNames=CG2851;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC TISSUE=Embryo;
RX PubMed=8625850; DOI=10.1242/dev.122.5.1641;
RA Goriely A., Stella M., Coffinier C., Kessler D., Mailhos C., Dessain S.,
RA Desplan C.;
RT "A functional homologue of goosecoid in Drosophila.";
RL Development 122:1641-1650(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA].
RX PubMed=8670808; DOI=10.1002/j.1460-2075.1996.tb00670.x;
RA Hahn M., Jackle H.;
RT "Drosophila goosecoid participates in neural development but not in body
RT axis formation.";
RL EMBO J. 15:3077-3084(1996).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
CC -!- FUNCTION: Appears to regulate regional development of specific tissues.
CC Can rescue axis polarity in UV-radiated Xenopus embryos.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- TISSUE SPECIFICITY: In early embryo development, expression confined to
CC two regions; a horseshoe-like pattern across the dorsal side which is
CC destined to form the brain hemispheres and a second domain which
CC invaginates inside the stomodeum and which, is fated to form the
CC foregut, ring gland and stomatogastric nervous system (SNS).
CC -!- SIMILARITY: Belongs to the paired homeobox family. Bicoid subfamily.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAB17948.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC Sequence=CAA64699.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X95420; CAA64699.1; ALT_INIT; mRNA.
DR EMBL; U52968; AAB17948.1; ALT_INIT; mRNA.
DR EMBL; AE014134; AAF51473.2; -; Genomic_DNA.
DR PIR; S70617; S70617.
DR RefSeq; NP_476949.2; NM_057601.3.
DR AlphaFoldDB; P54366; -.
DR SMR; P54366; -.
DR BioGRID; 59496; 20.
DR IntAct; P54366; 1.
DR STRING; 7227.FBpp0113060; -.
DR PaxDb; P54366; -.
DR DNASU; 33240; -.
DR EnsemblMetazoa; FBtr0343220; FBpp0309909; FBgn0010323.
DR GeneID; 33240; -.
DR KEGG; dme:Dmel_CG2851; -.
DR CTD; 145258; -.
DR FlyBase; FBgn0010323; Gsc.
DR VEuPathDB; VectorBase:FBgn0010323; -.
DR eggNOG; KOG0490; Eukaryota.
DR InParanoid; P54366; -.
DR OrthoDB; 1388739at2759; -.
DR SignaLink; P54366; -.
DR BioGRID-ORCS; 33240; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 33240; -.
DR PRO; PR:P54366; -.
DR Proteomes; UP000000803; Chromosome 2L.
DR Bgee; FBgn0010323; Expressed in stomodeum and 13 other tissues.
DR ExpressionAtlas; P54366; baseline and differential.
DR Genevisible; P54366; DM.
DR GO; GO:0005634; C:nucleus; IC:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IDA:FlyBase.
DR GO; GO:0046982; F:protein heterodimerization activity; IPI:FlyBase.
DR GO; GO:0042803; F:protein homodimerization activity; IPI:FlyBase.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0045892; P:negative regulation of transcription, DNA-templated; NAS:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Homeobox; Nucleus; Reference proteome.
FT CHAIN 1..415
FT /note="Homeobox protein goosecoid"
FT /id="PRO_0000048893"
FT DNA_BIND 282..341
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..37
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 66..87
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 114..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 238..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 336..383
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 114..170
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..284
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 336..353
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 354..383
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 415 AA; 44506 MW; 9646621754668657 CRC64;
MVETNSPPAG YTLKRSPSDL GEQQQPPRQI SRSPGNTAAY HLTTAMLLNS QQCGYLGQRL
QSVLQQQHAQ HQQSQSQTPS SDDGSQSGVT ILEEERRGGA AAASLFTIDS ILGSRQQGGG
TAPSQGSHIS SNGNQNGLTS NGISLGLKRS GAESPASPNS NSSSSAAASP IRPQRVPAML
QHPGLHLGHL AAAAASGFAA SPSDFLVAYP NFYPNYMHAA AVAHVAAAQM QAHVSGAAAG
LSGHGHHPHH PHGHPHHPHL GAHHHGQHHL SHLGHGPPPK RKRRHRTIFT EEQLEQLEAT
FDKTHYPDVV LREQLALKVD LKEERVEVWF KNRRAKWRKQ KREEQERLRK LQEEQCGSTT
NGTTNSSSGT TSSTGNGSLT VKCPGSDHYS AQLVHIKSDA NGYSDADESS DLEVA