SGS4_DROME
ID SGS4_DROME Reviewed; 297 AA.
AC Q00725; O76917; Q24438; Q24504; Q24513; Q24514; Q24515; Q24516; Q24517;
AC Q7JMJ3; Q7JQ97; Q7JQ98; Q7JQ99; Q7JQA0; Q7JQA1; Q7JQA2; Q9W4T2;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-1996, sequence version 1.
DT 03-AUG-2022, entry version 107.
DE RecName: Full=Salivary glue protein Sgs-4;
DE Flags: Precursor;
GN Name=Sgs4; Synonyms=Sgs-4; ORFNames=CG12181;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=Karsnas, Oregon-R, and Samarkand-pk1;
RX PubMed=1562607; DOI=10.1016/0167-4781(92)90444-5;
RA Furia M., Digilio F.A., Artiaco D., Favia G., Polito L.C.;
RT "Molecular characterization of a Drosophila melanogaster variant strain
RT defective in the Sgs-4 gene dosage compensation.";
RL Biochim. Biophys. Acta 1130:314-316(1992).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=Oregon-R(2/10), and Samarkand-pSW9;
RA Siegmund T., Korge G.;
RT "Identification of cis-acting elements for dosage compensation in
RT Drosophila melanogaster by statistical analysis.";
RL Submitted (MAR-1995) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Oregon-R;
RX PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA Glover D.M.;
RT "From sequence to chromosome: the tip of the X chromosome of D.
RT melanogaster.";
RL Science 287:2220-2222(2000).
RN [6]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-86 AND 121-190.
RX PubMed=6817924; DOI=10.1016/0092-8674(82)90467-6;
RA Muskavitch M.A.T., Hogness D.S.;
RT "An expandable gene that encodes a Drosophila glue protein is not expressed
RT in variants lacking remote upstream sequences.";
RL Cell 29:1041-1051(1982).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Salivary gland.
CC -!- POLYMORPHISM: The sequence shown is that from strain Oregon-R. The
CC number of 7-residue repeats varies between strains: strain
CC Oregon-R(2/10) has 11 more copies, strain Karsnas has 8 more copies,
CC strain Samarkand-pk1 has 1 fewer copy, and strains Samarkand-pSW9
CC and Berkeley have 2 fewer copies.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAF45860.2; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X61943; CAA43949.1; -; Genomic_DNA.
DR EMBL; X61942; CAA43948.1; -; Genomic_DNA.
DR EMBL; X61944; CAA43950.1; -; Genomic_DNA.
DR EMBL; Z48721; CAA88613.1; -; Genomic_DNA.
DR EMBL; Z48722; CAA88614.1; -; Genomic_DNA.
DR EMBL; AE014298; AAF45860.2; ALT_INIT; Genomic_DNA.
DR EMBL; AL024484; CAA19673.1; -; Genomic_DNA.
DR EMBL; J01129; AAA28886.1; -; Genomic_DNA.
DR EMBL; J01130; AAA28887.1; -; Genomic_DNA.
DR EMBL; J01133; AAA28888.1; -; Genomic_DNA.
DR EMBL; J01134; AAA28889.1; -; Genomic_DNA.
DR EMBL; J01135; AAA28890.1; -; Genomic_DNA.
DR EMBL; J01136; AAA28891.1; -; Genomic_DNA.
DR PIR; S21085; S21085.
DR PIR; S29893; S29893.
DR RefSeq; NP_476717.4; NM_057369.4.
DR AlphaFoldDB; Q00725; -.
DR STRING; 7227.FBpp0070500; -.
DR EnsemblMetazoa; FBtr0346736; FBpp0312344; FBgn0003374.
DR GeneID; 31304; -.
DR KEGG; dme:Dmel_CG12181; -.
DR CTD; 31304; -.
DR FlyBase; FBgn0003374; Sgs4.
DR VEuPathDB; VectorBase:FBgn0003374; -.
DR GeneTree; ENSGT01050000245874; -.
DR HOGENOM; CLU_970667_0_0_1; -.
DR BioGRID-ORCS; 31304; 0 hits in 1 CRISPR screen.
DR GenomeRNAi; 31304; -.
DR PRO; PR:Q00725; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0003374; Expressed in saliva-secreting gland and 14 other tissues.
DR Genevisible; Q00725; DM.
DR GO; GO:0062130; C:adhesive extracellular matrix; IDA:FlyBase.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0007594; P:puparial adhesion; IEP:FlyBase.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat; Secreted; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT CHAIN 22..297
FT /note="Salivary glue protein Sgs-4"
FT /id="PRO_0000022332"
FT REPEAT 26..32
FT /note="1"
FT REPEAT 33..39
FT /note="2"
FT REPEAT 40..46
FT /note="3"
FT REPEAT 47..53
FT /note="4"
FT REPEAT 54..60
FT /note="5"
FT REPEAT 61..67
FT /note="6"
FT REPEAT 68..74
FT /note="7"
FT REPEAT 75..81
FT /note="8"
FT REPEAT 82..88
FT /note="9"
FT REPEAT 89..95
FT /note="10"
FT REPEAT 96..102
FT /note="11"
FT REPEAT 103..109
FT /note="12"
FT REPEAT 110..116
FT /note="13"
FT REPEAT 117..123
FT /note="14"
FT REPEAT 124..130
FT /note="15"
FT REPEAT 131..137
FT /note="16"
FT REPEAT 138..144
FT /note="17"
FT REPEAT 145..151
FT /note="18"
FT REPEAT 152..158
FT /note="19"
FT REPEAT 159..165
FT /note="20"
FT REPEAT 166..172
FT /note="21"
FT REPEAT 173..179
FT /note="22; approximate"
FT REGION 26..179
FT /note="22 X 7 AA approximate tandem repeats of T-[ETK]-
FT [PT]-P-[RKT]-C-[ERK]"
FT REGION 26..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 141..218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 243..297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..60
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..76
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 182..206
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 251..265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VARIANT 62
FT /note="T -> A (in strain: Samarkand-pk1)"
FT VARIANT 106
FT /note="P -> A (in strain: Berkeley)"
FT VARIANT 119..125
FT /note="Missing (in strain: Samarkand-pk1)"
FT VARIANT 159
FT /note="T -> A (in strain: Berkeley)"
FT VARIANT 177
FT /note="C -> S (in strain: Berkeley)"
FT VARIANT 204..208
FT /note="HHHNR -> LILPT (in strain: Karsnas)"
FT VARIANT 209..297
FT /note="Missing (in strain: Karsnas)"
FT VARIANT 247
FT /note="P -> S (in strain: Berkeley and Oregon-R(2/10))"
FT VARIANT 249
FT /note="P -> S (in strain: Berkeley, Oregon-R and Oregon-
FT R(2/10))"
FT VARIANT 252
FT /note="S -> SCKPA (in strain: Berkeley, Oregon-R and
FT Oregon-R(2/10))"
FT VARIANT 252
FT /note="S -> SRKPS (in strain: Samarkand-pSW9)"
FT VARIANT 267
FT /note="T -> A (in strain: Berkeley, Oregon-R and Oregon-
FT R(2/10))"
FT VARIANT 275
FT /note="A -> V (in strain: Oregon-R)"
FT CONFLICT 186..190
FT /note="KRHRT -> SDTAQ (in Ref. 6; AAA28887/AAA28889/
FT AAA28891)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 297 AA; 32309 MW; F121DDF2B177BE8C CRC64;
MRLELLVVLL VGLAALAPSG STCCKTEPPR CETEPPRCET EPPRCETEPP RCETEPPRCE
TTTPKCETTP PTCRTEPPTC KTEPPTCRTE PPTCKTKPPT CRTEPPTCRT EPPTCKTKPP
TCKTEPPTCK TEPPTCRTEP PTCKTEPPTC RTEPPTCKTE PPTCKTEPPT CKTEPPCEKH
CTKRIKRHRT KRTKRSKSTK KIVHHHNRPG TTPESGCGCG SKNESGGGGS GCILKDLLTP
KCPDSKPKPQ ASPKCKSDPK PKAASKTTSK PKPKACDSGK KNTTKKPRKT QPQKGGC
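
The REPEAT and REGION features of this entry describe 22 tandem units of 7 residues at positions 26..179, following the pattern T-[ETK]-[PT]-P-[RKT]-C-[ERK], with the last unit flagged as approximate. The Python sketch below is not part of the UniProt record. It is a minimal illustration of how those annotations could be checked against the SQ block, assuming 1-based inclusive coordinates as used in the FT lines. The names `SQ_BLOCK`, `MOTIF`, and `repeat_units` are hypothetical helpers introduced only for this example.

```python
import re

# Sequence lines copied from the SQ block of this entry; the spaces between
# 10-residue groups are removed below.
SQ_BLOCK = """
MRLELLVVLL VGLAALAPSG STCCKTEPPR CETEPPRCET EPPRCETEPP RCETEPPRCE
TTTPKCETTP PTCRTEPPTC KTEPPTCRTE PPTCKTKPPT CRTEPPTCRT EPPTCKTKPP
TCKTEPPTCK TEPPTCRTEP PTCKTEPPTC RTEPPTCKTE PPTCKTEPPT CKTEPPCEKH
CTKRIKRHRT KRTKRSKSTK KIVHHHNRPG TTPESGCGCG SKNESGGGGS GCILKDLLTP
KCPDSKPKPQ ASPKCKSDPK PKAASKTTSK PKPKACDSGK KNTTKKPRKT QPQKGGC
"""
SEQ = "".join(SQ_BLOCK.split())

# Repeat consensus from the REGION 26..179 note, written as a regular expression.
MOTIF = re.compile(r"T[ETK][PT]P[RKT]C[ERK]")


def repeat_units(seq, start=26, count=22, size=7):
    """Return `count` consecutive units of `size` residues.

    `start` is a 1-based position, as in the FT lines of the entry.
    """
    first = start - 1  # convert to a 0-based Python index
    return [seq[first + i * size: first + (i + 1) * size] for i in range(count)]


units = repeat_units(SEQ)
# True where a unit matches the consensus exactly, False otherwise.
exact = [bool(MOTIF.fullmatch(unit)) for unit in units]

if __name__ == "__main__":
    print("sequence length:", len(SEQ))
    for number, (unit, ok) in enumerate(zip(units, exact), start=1):
        print(number, unit, "exact" if ok else "approximate")
```

Under these assumptions, units 1 to 21 match the consensus exactly, while unit 22 (positions 173..179, TEPPCEK) does not, which agrees with its "22; approximate" note.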