SUGP1_HUMAN
ID SUGP1_HUMAN Reviewed; 645 AA.
AC Q8IWZ8; O60378; Q6P3X9; Q8TCQ4; Q8WWT4; Q8WWT5; Q9NTG3;
DT 15-MAR-2005, integrated into UniProtKB/Swiss-Prot.
DT 15-MAR-2005, sequence version 2.
DT 03-AUG-2022, entry version 152.
DE RecName: Full=SURP and G-patch domain-containing protein 1;
DE AltName: Full=RNA-binding protein RBP;
DE AltName: Full=Splicing factor 4;
GN Name=SUGP1; Synonyms=SF4;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND TISSUE SPECIFICITY.
RX PubMed=12594045; DOI=10.1016/s0378-1119(02)01230-1;
RA Sampson N.D., Hewitt J.E.;
RT "SF4 and SFRS14, two related putative splicing factors on human chromosome
RT 19p13.11.";
RL Gene 305:91-100(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 3-645 (ISOFORMS 1 AND 2).
RA Gu Y., Nguyen C.-T.;
RT "Novel isoforms of a human RNA-binding protein.";
RL Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15057824; DOI=10.1038/nature02399;
RA Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E.,
RA Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A.,
RA Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S.,
RA Carrano A.V., Caoile C., Chan Y.M., Christensen M., Cleland C.A.,
RA Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J.,
RA Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M.,
RA Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W.,
RA Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V.,
RA Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D.,
RA McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I.,
RA Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L.,
RA Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A.,
RA She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M.,
RA Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J.,
RA Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E.,
RA Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M.,
RA Rubin E.M., Lucas S.M.;
RT "The DNA sequence and biology of human chromosome 19.";
RL Nature 428:529-535(2004).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 3-645 (ISOFORM 1).
RC TISSUE=PNS;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 254-645.
RC TISSUE=Testis;
RX PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D.,
RA Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A.,
RA Wiemann S., Schupp I.;
RT "The full-ORF clone resource of the German cDNA consortium.";
RL BMC Genomics 8:399-399(2007).
RN [6]
RP IDENTIFICATION IN A COMPLEX WITH THE SPLICEOSOME, AND IDENTIFICATION BY
RP MASS SPECTROMETRY.
RX PubMed=12176931; DOI=10.1101/gr.473902;
RA Rappsilber J., Ryder U., Lamond A.I., Mann M.;
RT "Large-scale proteomic analysis of the human spliceosome.";
RL Genome Res. 12:1231-1245(2002).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18691976; DOI=10.1016/j.molcel.2008.07.007;
RA Daub H., Olsen J.V., Bairlein M., Gnad F., Oppermann F.S., Korner R.,
RA Greff Z., Keri G., Stemmann O., Mann M.;
RT "Kinase-selective enrichment enables quantitative phosphoproteomics of the
RT kinome across the cell cycle.";
RL Mol. Cell 31:438-448(2008).
RN [8]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-409 AND SER-485, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18669648; DOI=10.1073/pnas.0805139105;
RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E.,
RA Elledge S.J., Gygi S.P.;
RT "A quantitative atlas of mitotic phosphorylation.";
RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008).
RN [9]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19413330; DOI=10.1021/ac9004309;
RA Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.;
RT "Lys-N and trypsin cover complementary parts of the phosphoproteome in a
RT refined SCX-based approach.";
RL Anal. Chem. 81:4493-4501(2009).
RN [10]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-409; SER-411 AND SER-414, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Leukemic T-cell;
RX PubMed=19690332; DOI=10.1126/scisignal.2000007;
RA Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K.,
RA Rodionov V., Han D.K.;
RT "Quantitative phosphoproteomic analysis of T cell receptor signaling
RT reveals system-wide modulation of protein-protein interactions.";
RL Sci. Signal. 2:RA46-RA46(2009).
RN [11]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-409; SER-411 AND SER-485, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=20068231; DOI=10.1126/scisignal.2000475;
RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L.,
RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.;
RT "Quantitative phosphoproteomics reveals widespread full phosphorylation
RT site occupancy during mitosis.";
RL Sci. Signal. 3:RA3-RA3(2010).
RN [12]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=21269460; DOI=10.1186/1752-0509-5-17;
RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T.,
RA Bennett K.L., Superti-Furga G., Colinge J.;
RT "Initial characterization of the human central proteome.";
RL BMC Syst. Biol. 5:17-17(2011).
RN [13]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=21406692; DOI=10.1126/scisignal.2001570;
RA Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., Johansen P.T.,
RA Kratchmarova I., Kassem M., Mann M., Olsen J.V., Blagoev B.;
RT "System-wide temporal characterization of the proteome and phosphoproteome
RT of human embryonic stem cell differentiation.";
RL Sci. Signal. 4:RS3-RS3(2011).
RN [14]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-132; SER-256; SER-326;
RP SER-409; SER-411 AND SER-485, AND IDENTIFICATION BY MASS SPECTROMETRY
RP [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma, and Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
RN [15]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Liver;
RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014;
RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., Wang L.,
RA Ye M., Zou H.;
RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human liver
RT phosphoproteome.";
RL J. Proteomics 96:253-262(2014).
CC -!- FUNCTION: Plays a role in pre-mRNA splicing.
CC -!- SUBUNIT: Component of the spliceosome. {ECO:0000269|PubMed:12176931}.
CC -!- INTERACTION:
CC Q8IWZ8; O43143: DHX15; NbExp=2; IntAct=EBI-2691671, EBI-1237044;
CC Q8IWZ8; P98175: RBM10; NbExp=2; IntAct=EBI-2691671, EBI-721525;
CC Q8IWZ8; Q96I25: RBM17; NbExp=2; IntAct=EBI-2691671, EBI-740272;
CC Q8IWZ8; P26368: U2AF2; NbExp=3; IntAct=EBI-2691671, EBI-742339;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1; Synonyms=RNA-binding protein splice variant A;
CC IsoId=Q8IWZ8-1; Sequence=Displayed;
CC Name=2; Synonyms=RNA-binding protein splice variant B;
CC IsoId=Q8IWZ8-2; Sequence=VSP_013109, VSP_013110;
CC -!- TISSUE SPECIFICITY: Detected in adult testis and heart, and in adult
CC and fetal brain, kidney and skeletal muscle.
CC {ECO:0000269|PubMed:12594045}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC08052.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=AAL68960.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=AAL68961.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF521128; AAN77123.1; -; mRNA.
DR EMBL; AY072916; AAL68960.1; ALT_INIT; mRNA.
DR EMBL; AY072917; AAL68961.1; ALT_INIT; mRNA.
DR EMBL; AC004475; AAC08052.1; ALT_SEQ; Genomic_DNA.
DR EMBL; BC063784; AAH63784.1; -; mRNA.
DR EMBL; AL137286; CAB70678.1; -; mRNA.
DR EMBL; AL713757; CAD28528.1; -; mRNA.
DR CCDS; CCDS12399.1; -. [Q8IWZ8-1]
DR PIR; T02299; T02299.
DR RefSeq; NP_757386.2; NM_172231.3. [Q8IWZ8-1]
DR AlphaFoldDB; Q8IWZ8; -.
DR SMR; Q8IWZ8; -.
DR BioGRID; 121767; 144.
DR IntAct; Q8IWZ8; 30.
DR MINT; Q8IWZ8; -.
DR STRING; 9606.ENSP00000247001; -.
DR GlyGen; Q8IWZ8; 5 sites, 2 O-linked glycans (5 sites).
DR iPTMnet; Q8IWZ8; -.
DR PhosphoSitePlus; Q8IWZ8; -.
DR BioMuta; SUGP1; -.
DR DMDM; 61216666; -.
DR EPD; Q8IWZ8; -.
DR jPOST; Q8IWZ8; -.
DR MassIVE; Q8IWZ8; -.
DR MaxQB; Q8IWZ8; -.
DR PaxDb; Q8IWZ8; -.
DR PeptideAtlas; Q8IWZ8; -.
DR PRIDE; Q8IWZ8; -.
DR ProteomicsDB; 70945; -. [Q8IWZ8-1]
DR ProteomicsDB; 70946; -. [Q8IWZ8-2]
DR Antibodypedia; 1687; 164 antibodies from 27 providers.
DR DNASU; 57794; -.
DR Ensembl; ENST00000247001.10; ENSP00000247001.3; ENSG00000105705.16. [Q8IWZ8-1]
DR Ensembl; ENST00000588731.6; ENSP00000465413.2; ENSG00000105705.16. [Q8IWZ8-2]
DR GeneID; 57794; -.
DR KEGG; hsa:57794; -.
DR MANE-Select; ENST00000247001.10; ENSP00000247001.3; NM_172231.4; NP_757386.2.
DR UCSC; uc002nmh.4; human. [Q8IWZ8-1]
DR CTD; 57794; -.
DR DisGeNET; 57794; -.
DR GeneCards; SUGP1; -.
DR HGNC; HGNC:18643; SUGP1.
DR HPA; ENSG00000105705; Low tissue specificity.
DR MIM; 607992; gene.
DR neXtProt; NX_Q8IWZ8; -.
DR OpenTargets; ENSG00000105705; -.
DR PharmGKB; PA165394338; -.
DR VEuPathDB; HostDB:ENSG00000105705; -.
DR eggNOG; KOG0965; Eukaryota.
DR GeneTree; ENSGT00410000025695; -.
DR HOGENOM; CLU_028403_0_0_1; -.
DR InParanoid; Q8IWZ8; -.
DR OMA; QWLEIKI; -.
DR OrthoDB; 1232201at2759; -.
DR PhylomeDB; Q8IWZ8; -.
DR TreeFam; TF326321; -.
DR PathwayCommons; Q8IWZ8; -.
DR Reactome; R-HSA-72163; mRNA Splicing - Major Pathway.
DR SignaLink; Q8IWZ8; -.
DR BioGRID-ORCS; 57794; 566 hits in 1084 CRISPR screens.
DR ChiTaRS; SUGP1; human.
DR GenomeRNAi; 57794; -.
DR Pharos; Q8IWZ8; Tbio.
DR PRO; PR:Q8IWZ8; -.
DR Proteomes; UP000005640; Chromosome 19.
DR RNAct; Q8IWZ8; protein.
DR Bgee; ENSG00000105705; Expressed in lower esophagus mucosa and 174 other tissues.
DR ExpressionAtlas; Q8IWZ8; baseline and differential.
DR Genevisible; Q8IWZ8; HS.
DR GO; GO:0005654; C:nucleoplasm; IDA:HPA.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; HDA:UniProtKB.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 1.10.10.790; -; 2.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; PTHR23340; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; SSF109905; 2.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 2.
PE 1: Evidence at protein level;
KW Alternative splicing; mRNA processing; mRNA splicing; Nucleus;
KW Phosphoprotein; Reference proteome; Repeat; Spliceosome.
FT CHAIN 1..645
FT /note="SURP and G-patch domain-containing protein 1"
FT /id="PRO_0000097701"
FT REPEAT 191..233
FT /note="SURP motif 1"
FT REPEAT 266..309
FT /note="SURP motif 2"
FT DOMAIN 562..609
FT /note="G-patch"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00092"
FT REGION 49..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 97..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 150..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 582..607
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 380..386
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255"
FT COMPBIAS 54..78
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 132
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 256
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 326
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 409
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648,
FT ECO:0007744|PubMed:19690332, ECO:0007744|PubMed:20068231,
FT ECO:0007744|PubMed:23186163"
FT MOD_RES 411
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:19690332,
FT ECO:0007744|PubMed:20068231, ECO:0007744|PubMed:23186163"
FT MOD_RES 414
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:19690332"
FT MOD_RES 485
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648,
FT ECO:0007744|PubMed:20068231, ECO:0007744|PubMed:23186163"
FT VAR_SEQ 181..222
FT /note="SPPEGAETRKVIEKLARFVAEGGPELEKVAMEDYKDNPAFAF -> CLTKTV
FT RPTPPSSWCFVPGLGTPDSLLHLTLFSPHPLCRCGP (in isoform 2)"
FT /evidence="ECO:0000303|Ref.2"
FT /id="VSP_013109"
FT VAR_SEQ 223..645
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|Ref.2"
FT /id="VSP_013110"
FT VARIANT 290
FT /note="R -> H (in dbSNP:rs17751061)"
FT /id="VAR_051339"
FT VARIANT 568
FT /note="Q -> H (in dbSNP:rs1044980)"
FT /id="VAR_051340"
FT CONFLICT 497
FT /note="E -> K (in Ref. 1; AAN77123)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 645 AA; 72471 MW; 27AF5ED41DFDDCB7 CRC64;
MSLKMDNRDV AGKANRWFGV APPKSGKMNM NILHQEELIA QKKREIEAKM EQKAKQNQVA
SPQPPHPGEI TNAHNSSCIS NKFANDGSFL QQFLKLQKAQ TSTDAPTSAP SAPPSTPTPS
AGKRSLLISR RTGLGLASLP GPVKSYSHAK QLPVAHRPSV FQSPDEDEEE DYEQWLEIKV
SPPEGAETRK VIEKLARFVA EGGPELEKVA MEDYKDNPAF AFLHDKNSRE FLYYRKKVAE
IRKEAQKSQA ASQKVSPPED EEVKNLAEKL ARFIADGGPE VETIALQNNR ENQAFSFLYE
PNSQGYKYYR QKLEEFRKAK ASSTGSFTAP DPGLKRKSPP EALSGSLPPA TTCPASSTPA
PTIIPAPAAP GKPASAATVK RKRKSRWGPE EDKVELPPAE LVQRDVDASP SPLSVQDLKG
LGYEKGKPVG LVGVTELSDA QKKQLKEQQE MQQMYDMIMQ HKRAMQDMQL LWEKAVQQHQ
HGYDSDEEVD SELGTWEHQL RRMEMDKTRE WAEQLTKMGR GKHFIGDFLP PDELEKFMET
FKALKEGREP DYSEYKEFKL TVENIGYQML MKMGWKEGEG LGSEGQGIKN PVNKGTTTVD
GAGFGIDRPA ELSKEDDEYE AFRKRMMLAY RFRPNPLNNP RRPYY
//
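The two VAR_SEQ features above (VSP_013109 and VSP_013110) describe isoform 2 as edits against the displayed isoform 1 sequence. A minimal Python sketch of that derivation follows. It is an illustration, not part of the UniProt record, and assumes that VAR_SEQ coordinates are 1-based and inclusive on the displayed sequence. The names `parse_sequence`, `apply_var_seq`, and `ISOFORM2_EDITS` are hypothetical helpers introduced here.

```python
# Sketch only: derive isoform 2 (Q8IWZ8-2) from the displayed isoform 1
# sequence by applying the two VAR_SEQ features of this entry.
# Assumption: coordinates are 1-based and inclusive on the displayed sequence.

# Sequence data copied from the SQ block of this entry.
SQ_BLOCK = """
MSLKMDNRDV AGKANRWFGV APPKSGKMNM NILHQEELIA QKKREIEAKM EQKAKQNQVA
SPQPPHPGEI TNAHNSSCIS NKFANDGSFL QQFLKLQKAQ TSTDAPTSAP SAPPSTPTPS
AGKRSLLISR RTGLGLASLP GPVKSYSHAK QLPVAHRPSV FQSPDEDEEE DYEQWLEIKV
SPPEGAETRK VIEKLARFVA EGGPELEKVA MEDYKDNPAF AFLHDKNSRE FLYYRKKVAE
IRKEAQKSQA ASQKVSPPED EEVKNLAEKL ARFIADGGPE VETIALQNNR ENQAFSFLYE
PNSQGYKYYR QKLEEFRKAK ASSTGSFTAP DPGLKRKSPP EALSGSLPPA TTCPASSTPA
PTIIPAPAAP GKPASAATVK RKRKSRWGPE EDKVELPPAE LVQRDVDASP SPLSVQDLKG
LGYEKGKPVG LVGVTELSDA QKKQLKEQQE MQQMYDMIMQ HKRAMQDMQL LWEKAVQQHQ
HGYDSDEEVD SELGTWEHQL RRMEMDKTRE WAEQLTKMGR GKHFIGDFLP PDELEKFMET
FKALKEGREP DYSEYKEFKL TVENIGYQML MKMGWKEGEG LGSEGQGIKN PVNKGTTTVD
GAGFGIDRPA ELSKEDDEYE AFRKRMMLAY RFRPNPLNNP RRPYY
"""

# (start, end, replacement) taken from the VAR_SEQ features of this entry.
ISOFORM2_EDITS = [
    (181, 222, "CLTKTVRPTPPSSWCFVPGLGTPDSLLHLTLFSPHPLCRCGP"),  # VSP_013109
    (223, 645, ""),  # VSP_013110, "Missing (in isoform 2)"
]


def parse_sequence(block: str) -> str:
    """Join the space-separated 10-residue groups into one string."""
    return "".join(block.split())


def apply_var_seq(seq: str, edits) -> str:
    """Apply (start, end, replacement) edits given in canonical coordinates.

    Edits are applied from the C-terminal end backwards so that earlier
    coordinates stay valid after each replacement.
    """
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


canonical = parse_sequence(SQ_BLOCK)
isoform2 = apply_var_seq(canonical, ISOFORM2_EDITS)

if __name__ == "__main__":
    print(len(canonical), len(isoform2))
```

Under these assumptions the canonical sequence length agrees with the "645 AA" stated on the ID and SQ lines, and isoform 2 keeps residues 1..180 unchanged before the replaced segment.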