SUWA_DROME
ID SUWA_DROME Reviewed; 963 AA.
AC P12297; Q24543; Q7KW21; Q8MSS2; Q9I7D2; Q9TVZ2;
DT 01-OCT-1989, integrated into UniProtKB/Swiss-Prot.
DT 19-JUL-2005, sequence version 3.
DT 03-AUG-2022, entry version 176.
DE RecName: Full=Protein suppressor of white apricot;
GN Name=su(w[a]); Synonyms=su(wa); ORFNames=CG3019;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOFORM A).
RX PubMed=2832151; DOI=10.1002/j.1460-2075.1987.tb02755.x;
RA Chou T.-B., Zachar Z., Bingham P.M.;
RT "Developmental expression of a regulatory gene is programmed at the level
RT of splicing.";
RL EMBO J. 6:4095-4104(1987).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Oregon-R;
RX PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA Glover D.M.;
RT "From sequence to chromosome: the tip of the X chromosome of D.
RT melanogaster.";
RL Science 287:2220-2222(2000).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM A).
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [6]
RP FUNCTION, AND ALTERNATIVE SPLICING.
RX PubMed=3443103; DOI=10.1002/j.1460-2075.1987.tb02756.x;
RA Zachar Z., Chou T.-B., Bingham P.M.;
RT "Evidence that a regulatory gene autoregulates splicing of its
RT transcript.";
RL EMBO J. 6:4105-4111(1987).
RN [7]
RP CHARACTERIZATION OF RS DOMAIN.
RX PubMed=1655279; DOI=10.1016/0092-8674(91)90185-2;
RA Li H., Bingham P.M.;
RT "Arginine/serine-rich domains of the su(wa) and tra RNA processing
RT regulators target proteins to a subnuclear compartment implicated in
RT splicing.";
RL Cell 67:335-342(1991).
RN [8]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-438; SER-447; SER-448;
RP SER-450; SER-649; SER-912; SER-914 AND SER-916, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Regulator of pre-mRNA splicing (and, possibly, of other RNA
CC processing events). Regulates its own expression at the level of RNA
CC processing. {ECO:0000269|PubMed:3443103}.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle. Note=Speckled subnuclear
CC compartment.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=A;
CC IsoId=P12297-2; Sequence=Displayed;
CC Name=B; Synonyms=C;
CC IsoId=P12297-3; Sequence=VSP_004436;
CC -!- DEVELOPMENTAL STAGE: Three mRNAs are produced during development. The
CC smallest of these (3.5 kb RNA) is the majority species during
CC precellular blastoderm development, after which its levels drop rapidly,
CC but it persists as a minority species throughout the rest of the life of
CC the organism. The larger two transcripts (4.4 and 5.2 kb RNAs) first
CC appear around cellular blastoderm; their levels increase substantially
CC during the next few hours, and they are the preponderant RNA species
CC throughout the remainder of the life of the organism.
CC -!- DOMAIN: The RS domain directs localization of proteins to the speckled
CC subnuclear compartment. The purpose of this localization is to allow
CC colocalization and co-concentration of components of the splicing and
CC splicing regulatory machinery, permitting relatively high rates and/or
CC efficiencies of reaction and interaction.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAS65241.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAA29812.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAA29813.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB65879.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB65880.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X06589; CAA29812.1; ALT_SEQ; Genomic_DNA.
DR EMBL; X06589; CAA29813.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AE014298; AAG22382.2; -; Genomic_DNA.
DR EMBL; AE014298; AAS65241.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL109630; CAB65879.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL132651; CAB65879.1; JOINED; Genomic_DNA.
DR EMBL; AL132651; CAB65880.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL109630; CAB65880.1; JOINED; Genomic_DNA.
DR EMBL; AY118635; AAM50004.1; -; mRNA.
DR PIR; S06028; S06028.
DR RefSeq; NP_476756.2; NM_057408.6. [P12297-2]
DR RefSeq; NP_996323.1; NM_206600.3.
DR RefSeq; NP_996324.1; NM_206601.3.
DR AlphaFoldDB; P12297; -.
DR SMR; P12297; -.
DR BioGRID; 57618; 12.
DR IntAct; P12297; 8.
DR STRING; 7227.FBpp0070171; -.
DR iPTMnet; P12297; -.
DR PaxDb; P12297; -.
DR DNASU; 31054; -.
DR EnsemblMetazoa; FBtr0070176; FBpp0070171; FBgn0003638. [P12297-2]
DR EnsemblMetazoa; FBtr0302884; FBpp0292012; FBgn0003638. [P12297-3]
DR GeneID; 31054; -.
DR KEGG; dme:Dmel_CG3019; -.
DR CTD; 31054; -.
DR FlyBase; FBgn0003638; su(w[a]).
DR VEuPathDB; VectorBase:FBgn0003638; -.
DR eggNOG; KOG1847; Eukaryota.
DR GeneTree; ENSGT00940000153892; -.
DR HOGENOM; CLU_012240_0_0_1; -.
DR InParanoid; P12297; -.
DR OMA; RKAAMFI; -.
DR PhylomeDB; P12297; -.
DR SignaLink; P12297; -.
DR BioGRID-ORCS; 31054; 1 hit in 1 CRISPR screen.
DR ChiTaRS; su(w[a]); fly.
DR GenomeRNAi; 31054; -.
DR PRO; PR:P12297; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0003638; Expressed in brain and 17 other tissues.
DR ExpressionAtlas; P12297; baseline and differential.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000395; P:mRNA 5'-splice site recognition; IBA:GO_Central.
DR Gene3D; 1.10.10.790; -; 2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR040397; SWAP.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR InterPro; IPR019147; SWAP_N_domain.
DR PANTHER; PTHR13161; PTHR13161; 1.
DR Pfam; PF09750; DRY_EERY; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM01141; DRY_EERY; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; SSF109905; 2.
DR PROSITE; PS50128; SURP; 2.
PE 1: Evidence at protein level;
KW Alternative splicing; mRNA processing; mRNA splicing; Nucleus;
KW Phosphoprotein; Reference proteome; Repeat; RNA-binding; Transcription;
KW Transcription regulation.
FT CHAIN 1..963
FT /note="Protein suppressor of white apricot"
FT /id="PRO_0000072324"
FT REPEAT 234..276
FT /note="SURP motif 1"
FT REPEAT 483..523
FT /note="SURP motif 2"
FT REGION 290..322
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 360..430
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 445..470
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 593..613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 634..662
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 716..963
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 303..317
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 360..408
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 634..652
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 749..767
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 785..799
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 885..920
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 928..942
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 943..963
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 438
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 447
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 448
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 450
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 649
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 912
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 914
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 916
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 1..215
FT /note="Missing (in isoform B)"
FT /evidence="ECO:0000305"
FT /id="VSP_004436"
FT CONFLICT 49
FT /note="S -> SS (in Ref. 1; CAA29812/CAA29813)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 963 AA; 106141 MW; 9682B6516D9F085F CRC64;
MLPYNVRNAG GGSVGGILRR TGQGSGTGST ILGNGNSPGA LGAGKVSSSL ENHRQPPLEL
LVFGYACKIF RDDEKAREMD HGKQLIPWMG DVNLKIDRYD VRGALCELAP HEAPPGGYGN
RLEYLSAEEQ RAEQLCEEER YLFLYNNEEE LRLRQEEDLK RLQQETSGGC FSQVGFQYDG
QSAASTSIGG SSTATSQLSP NSEESELPFV LPYTLMMAPP LDMQLPETMK QHAIIEKTAR
FIATQGAQME ILIKAKQANN TQFDFLTQGG HLQPYYRHLL AAIKAAKFPP APQTPLDQQN
TDKEAPSADD HSEEVAGGRR NPNQVVITVP TIKYKPSANC AYTQLISKIK GVPLQAVLQE
DESSNPGNSQ HSGGTASPAL SCRSEGHNSQ GGEFTPVLLQ YNGSTFTHEE ESSNREQQDD
NDVNGGEPPQ VELLKNTSAL ALAQNYSSES EEEEDQVQPE KEEEKKPEPV LTFPVPKDSL
RHIIDKTATY VIKNGRQFEE TLRTKSVDRF SFLLPANEYY PYYLYKVTGD VDAASKEEKT
RKAAAVAAAL MSKKGLSFGG AAAAVSGSNL DKAPVSFSIR ARDDQCPLQH TLPQEASDEE
TSSNAAGVEH VRPGMPDSVQ RAIKQVETQL LARTAGQKGN ITASPSCSSP QKEQRQAEER
VKDKLAQIAR EKLNGMISRE KQLQLERKRK ALAFLNQIKG EGAIVGSAVP VVGPNPPESA
AGAATADSGD ESGDSVRSIP ITYFGPDDDD EVGEQRPEMR LIGSTQKDEE DDDEEDGGDL
EKYNLLNDDS TNTFTSKPVL PPTAAPPPAA VLLSDDDDVQ LVATTSTRSS SSRHLKTHRR
SRSRSKNVRS SDSSPSSRES SRRRRQKSSR LSREPSSNPP RKSQHSSTQR KKTPKKRRRS
KSRSRSKSIR RSRSISILRN NRRSRSRSPS CRNAEQRRQQ DRRRTPTKKS HKRHKRRRRS
SSP