GSBN_DROME
ID GSBN_DROME Reviewed; 449 AA.
AC P09083; Q9W0W5;
DT 01-NOV-1988, integrated into UniProtKB/Swiss-Prot.
DT 27-JAN-2003, sequence version 2.
DT 03-AUG-2022, entry version 192.
DE RecName: Full=Protein gooseberry-neuro;
DE AltName: Full=BSH4;
DE AltName: Full=Protein gooseberry proximal;
GN Name=gsb-n; Synonyms=Gsb-p, GSBA; ORFNames=CG2692;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA], FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=3123319; DOI=10.1101/gad.1.10.1247;
RA Baumgartner S., Bopp D., Burri M., Noll M.;
RT "Structure of two genes at the gooseberry locus related to the paired gene
RT and their spatial expression during Drosophila embryogenesis.";
RL Genes Dev. 1:1247-1267(1987).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 10-146 AND 162-241.
RX PubMed=2877747; DOI=10.1016/0092-8674(86)90818-4;
RA Bopp D., Burri M., Baumgartner S., Frigerio G., Noll M.;
RT "Conservation of a large protein domain in the segmentation gene paired and
RT in functionally related genes of Drosophila.";
RL Cell 47:1033-1040(1986).
RN [6]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-165, AND IDENTIFICATION BY
RP MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Expressed in a segmentally repeating pattern to define the
CC polarity of embryonic segments. {ECO:0000269|PubMed:3123319}.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- TISSUE SPECIFICITY: Expressed in a single-segment repeat in the
CC neurectoderm during germ-band extension and later in single neurons
CC during neuronal differentiation. {ECO:0000269|PubMed:3123319}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE013599; AAF47314.1; -; Genomic_DNA.
DR EMBL; AY071593; AAL49215.1; -; mRNA.
DR EMBL; M14941; AAA28834.1; -; mRNA.
DR EMBL; M14943; AAA28835.1; -; mRNA.
DR PIR; A26332; A26332.
DR PIR; B43698; B43698.
DR RefSeq; NP_523862.1; NM_079138.2.
DR AlphaFoldDB; P09083; -.
DR SMR; P09083; -.
DR BioGRID; 63572; 5.
DR DIP; DIP-23099N; -.
DR IntAct; P09083; 2.
DR STRING; 7227.FBpp0072363; -.
DR iPTMnet; P09083; -.
DR PaxDb; P09083; -.
DR DNASU; 38004; -.
DR EnsemblMetazoa; FBtr0072461; FBpp0072363; FBgn0001147.
DR GeneID; 38004; -.
DR KEGG; dme:Dmel_CG2692; -.
DR CTD; 38004; -.
DR FlyBase; FBgn0001147; gsb-n.
DR VEuPathDB; VectorBase:FBgn0001147; -.
DR eggNOG; KOG0849; Eukaryota.
DR GeneTree; ENSGT00940000168362; -.
DR HOGENOM; CLU_019281_2_0_1; -.
DR InParanoid; P09083; -.
DR OMA; GMTDSSH; -.
DR OrthoDB; 1126858at2759; -.
DR PhylomeDB; P09083; -.
DR Reactome; R-DME-3214847; HATs acetylate histones.
DR SignaLink; P09083; -.
DR BioGRID-ORCS; 38004; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 38004; -.
DR PRO; PR:P09083; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0001147; Expressed in atrium (Drosophila) and 13 other tissues.
DR Genevisible; P09083; DM.
DR GO; GO:0005634; C:nucleus; IDA:UniProtKB.
DR GO; GO:0003677; F:DNA binding; NAS:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IDA:FlyBase.
DR GO; GO:0048856; P:anatomical structure development; IBA:GO_Central.
DR GO; GO:0022008; P:neurogenesis; IGI:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0007367; P:segment polarity determination; IGI:FlyBase.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd00131; PAX; 1.
DR Gene3D; 1.10.10.10; -; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR043182; PAIRED_DNA-bd_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636; PTHR45636; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00292; PAX; 1.
DR PRINTS; PR00027; PAIREDBOX.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; SSF46689; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00034; PAIRED_1; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 1: Evidence at protein level;
KW Developmental protein; DNA-binding; Homeobox; Nucleus; Paired box;
KW Phosphoprotein; Reference proteome; Segmentation polarity protein;
KW Transcription; Transcription regulation.
FT CHAIN 1..449
FT /note="Protein gooseberry-neuro"
FT /id="PRO_0000050169"
FT DNA_BIND 20..143
FT /note="Paired"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00381"
FT DNA_BIND 182..241
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 23..79
FT /note="PAI subdomain"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00381"
FT REGION 98..143
FT /note="RED subdomain"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00381"
FT REGION 127..187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 294..321
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..187
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 165
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT CONFLICT 10..11
FT /note="RP -> AA (in Ref. 5; AAA28835)"
FT /evidence="ECO:0000305"
FT CONFLICT 199
FT /note="R -> G (in Ref. 1 and 5; AAA28834)"
FT /evidence="ECO:0000305"
FT CONFLICT 314
FT /note="A -> R (in Ref. 1)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 449 AA; 48187 MW; 885C618306E900F8 CRC64;
MDMSSANSLR PLFAGYPFQG QGRVNQLGGV FINGRPLPNH IRLKIVEMAA SGVRPCVISR
QLRVSHGCVS KILNRYQETG SIRPGVIGGS KPKVTSPEIE TRIDELRKEN PSIFSWEIRE
KLIKEGFADP PSTSSISRLL RGSDRGSEDG RKDYTINGIL GGRDSDISDT ESEPGIPLKR
KQRRSRTTFT AEQLEALERA FSRTQYPDVY TREELAQTTA LTEARIQVWF SNRRARLRKH
SGGSNSGLSP MNSGSSNVGV GVGLSGATAP LGYGPLGVGS MAGYSPAPGT TATGAGMNDG
VHHAAHAPSS HHSAATAAAA AHHHTQMGGY DLVQSAAQHG FPGGFAQPGH FGSQNYYHQD
YSKLTIDDFS KLTADSVSKI SPSLHLSDNY SKLEAPSNWS QAAYHAAANY NAHVAQHQLN
DYAAAAAHGN PASAYSHPLP TQGQAKYWS