SAGE1_HUMAN
ID SAGE1_HUMAN Reviewed; 904 AA.
AC Q9NXZ1; Q5JNW0;
DT 15-MAY-2007, integrated into UniProtKB/Swiss-Prot.
DT 15-MAY-2007, sequence version 2.
DT 03-AUG-2022, entry version 115.
DE RecName: Full=Sarcoma antigen 1;
DE AltName: Full=Cancer/testis antigen 14;
DE Short=CT14;
GN Name=SAGE1; Synonyms=SAGE;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], TISSUE SPECIFICITY, AND VARIANT SER-805.
RX PubMed=10919659;
RA Martelange V.M.F., De Smet C., De Plaen E., Lurquin C., Boon T.;
RT "Identification on a human sarcoma of two new genes with tumor-specific
RT expression.";
RL Cancer Res. 60:3848-3855(2000).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15772651; DOI=10.1038/nature03440;
RA Ross M.T., Grafham D.V., Coffey A.J., Scherer S., McLay K., Muzny D.,
RA Platzer M., Howell G.R., Burrows C., Bird C.P., Frankish A., Lovell F.L.,
RA Howe K.L., Ashurst J.L., Fulton R.S., Sudbrak R., Wen G., Jones M.C.,
RA Hurles M.E., Andrews T.D., Scott C.E., Searle S., Ramser J., Whittaker A.,
RA Deadman R., Carter N.P., Hunt S.E., Chen R., Cree A., Gunaratne P.,
RA Havlak P., Hodgson A., Metzker M.L., Richards S., Scott G., Steffen D.,
RA Sodergren E., Wheeler D.A., Worley K.C., Ainscough R., Ambrose K.D.,
RA Ansari-Lari M.A., Aradhya S., Ashwell R.I., Babbage A.K., Bagguley C.L.,
RA Ballabio A., Banerjee R., Barker G.E., Barlow K.F., Barrett I.P.,
RA Bates K.N., Beare D.M., Beasley H., Beasley O., Beck A., Bethel G.,
RA Blechschmidt K., Brady N., Bray-Allen S., Bridgeman A.M., Brown A.J.,
RA Brown M.J., Bonnin D., Bruford E.A., Buhay C., Burch P., Burford D.,
RA Burgess J., Burrill W., Burton J., Bye J.M., Carder C., Carrel L.,
RA Chako J., Chapman J.C., Chavez D., Chen E., Chen G., Chen Y., Chen Z.,
RA Chinault C., Ciccodicola A., Clark S.Y., Clarke G., Clee C.M., Clegg S.,
RA Clerc-Blankenburg K., Clifford K., Cobley V., Cole C.G., Conquer J.S.,
RA Corby N., Connor R.E., David R., Davies J., Davis C., Davis J., Delgado O.,
RA Deshazo D., Dhami P., Ding Y., Dinh H., Dodsworth S., Draper H.,
RA Dugan-Rocha S., Dunham A., Dunn M., Durbin K.J., Dutta I., Eades T.,
RA Ellwood M., Emery-Cohen A., Errington H., Evans K.L., Faulkner L.,
RA Francis F., Frankland J., Fraser A.E., Galgoczy P., Gilbert J., Gill R.,
RA Gloeckner G., Gregory S.G., Gribble S., Griffiths C., Grocock R., Gu Y.,
RA Gwilliam R., Hamilton C., Hart E.A., Hawes A., Heath P.D., Heitmann K.,
RA Hennig S., Hernandez J., Hinzmann B., Ho S., Hoffs M., Howden P.J.,
RA Huckle E.J., Hume J., Hunt P.J., Hunt A.R., Isherwood J., Jacob L.,
RA Johnson D., Jones S., de Jong P.J., Joseph S.S., Keenan S., Kelly S.,
RA Kershaw J.K., Khan Z., Kioschis P., Klages S., Knights A.J., Kosiura A.,
RA Kovar-Smith C., Laird G.K., Langford C., Lawlor S., Leversha M., Lewis L.,
RA Liu W., Lloyd C., Lloyd D.M., Loulseged H., Loveland J.E., Lovell J.D.,
RA Lozado R., Lu J., Lyne R., Ma J., Maheshwari M., Matthews L.H.,
RA McDowall J., McLaren S., McMurray A., Meidl P., Meitinger T., Milne S.,
RA Miner G., Mistry S.L., Morgan M., Morris S., Mueller I., Mullikin J.C.,
RA Nguyen N., Nordsiek G., Nyakatura G., O'dell C.N., Okwuonu G., Palmer S.,
RA Pandian R., Parker D., Parrish J., Pasternak S., Patel D., Pearce A.V.,
RA Pearson D.M., Pelan S.E., Perez L., Porter K.M., Ramsey Y., Reichwald K.,
RA Rhodes S., Ridler K.A., Schlessinger D., Schueler M.G., Sehra H.K.,
RA Shaw-Smith C., Shen H., Sheridan E.M., Shownkeen R., Skuce C.D.,
RA Smith M.L., Sotheran E.C., Steingruber H.E., Steward C.A., Storey R.,
RA Swann R.M., Swarbreck D., Tabor P.E., Taudien S., Taylor T., Teague B.,
RA Thomas K., Thorpe A., Timms K., Tracey A., Trevanion S., Tromans A.C.,
RA d'Urso M., Verduzco D., Villasana D., Waldron L., Wall M., Wang Q.,
RA Warren J., Warry G.L., Wei X., West A., Whitehead S.L., Whiteley M.N.,
RA Wilkinson J.E., Willey D.L., Williams G., Williams L., Williamson A.,
RA Williamson H., Wilming L., Woodmansey R.L., Wray P.W., Yen J., Zhang J.,
RA Zhou J., Zoghbi H., Zorilla S., Buck D., Reinhardt R., Poustka A.,
RA Rosenthal A., Lehrach H., Meindl A., Minx P.J., Hillier L.W., Willard H.F.,
RA Wilson R.K., Waterston R.H., Rice C.M., Vaudin M., Coulson A., Nelson D.L.,
RA Weinstock G., Sulston J.E., Durbin R.M., Hubbard T., Gibbs R.A., Beck S.,
RA Rogers J., Bentley D.R.;
RT "The DNA sequence of the human X chromosome.";
RL Nature 434:325-337(2005).
RN [3]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18669648; DOI=10.1073/pnas.0805139105;
RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E.,
RA Elledge S.J., Gygi S.P.;
RT "A quantitative atlas of mitotic phosphorylation.";
RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008).
RN [4]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-45; SER-64 AND SER-238, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
RN [5]
RP SUMOYLATION [LARGE SCALE ANALYSIS] AT LYS-778, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=28112733; DOI=10.1038/nsmb.3366;
RA Hendriks I.A., Lyon D., Young C., Jensen L.J., Vertegaal A.C.,
RA Nielsen M.L.;
RT "Site-specific mapping of the human SUMO proteome reveals co-modification
RT with phosphorylation.";
RL Nat. Struct. Mol. Biol. 24:325-336(2017).
CC -!- TISSUE SPECIFICITY: Expressed mainly in bladder, lung, head and neck
CC carcinomas. Not expressed in normal tissues except for testis.
CC {ECO:0000269|PubMed:10919659}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ278111; CAB92443.1; -; mRNA.
DR EMBL; AL953870; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR CCDS; CCDS14652.1; -.
DR RefSeq; NP_061136.2; NM_018666.2.
DR RefSeq; XP_016885110.1; XM_017029621.1.
DR RefSeq; XP_016885111.1; XM_017029622.1.
DR AlphaFoldDB; Q9NXZ1; -.
DR SMR; Q9NXZ1; -.
DR BioGRID; 120691; 3.
DR IntAct; Q9NXZ1; 3.
DR STRING; 9606.ENSP00000323191; -.
DR iPTMnet; Q9NXZ1; -.
DR PhosphoSitePlus; Q9NXZ1; -.
DR BioMuta; SAGE1; -.
DR DMDM; 147732638; -.
DR jPOST; Q9NXZ1; -.
DR MassIVE; Q9NXZ1; -.
DR MaxQB; Q9NXZ1; -.
DR PaxDb; Q9NXZ1; -.
DR PeptideAtlas; Q9NXZ1; -.
DR PRIDE; Q9NXZ1; -.
DR ProteomicsDB; 83145; -.
DR Antibodypedia; 552; 52 antibodies from 11 providers.
DR DNASU; 55511; -.
DR Ensembl; ENST00000324447.7; ENSP00000323191.3; ENSG00000181433.11.
DR Ensembl; ENST00000370709.4; ENSP00000359743.3; ENSG00000181433.11.
DR GeneID; 55511; -.
DR KEGG; hsa:55511; -.
DR MANE-Select; ENST00000370709.4; ENSP00000359743.3; NM_001381902.1; NP_001368831.1.
DR UCSC; uc065bgk.1; human.
DR CTD; 55511; -.
DR DisGeNET; 55511; -.
DR GeneCards; SAGE1; -.
DR HGNC; HGNC:30369; SAGE1.
DR HPA; ENSG00000181433; Tissue enriched (testis).
DR MIM; 300359; gene.
DR neXtProt; NX_Q9NXZ1; -.
DR OpenTargets; ENSG00000181433; -.
DR PharmGKB; PA134909712; -.
DR VEuPathDB; HostDB:ENSG00000181433; -.
DR eggNOG; KOG3768; Eukaryota.
DR GeneTree; ENSGT00390000016655; -.
DR HOGENOM; CLU_015595_0_0_1; -.
DR InParanoid; Q9NXZ1; -.
DR OMA; STRDLCM; -.
DR OrthoDB; 124883at2759; -.
DR PhylomeDB; Q9NXZ1; -.
DR TreeFam; TF323386; -.
DR PathwayCommons; Q9NXZ1; -.
DR SignaLink; Q9NXZ1; -.
DR BioGRID-ORCS; 55511; 16 hits in 713 CRISPR screens.
DR GenomeRNAi; 55511; -.
DR Pharos; Q9NXZ1; Tdark.
DR PRO; PR:Q9NXZ1; -.
DR Proteomes; UP000005640; Chromosome X.
DR RNAct; Q9NXZ1; protein.
DR Bgee; ENSG00000181433; Expressed in right testis and 26 other tissues.
DR ExpressionAtlas; Q9NXZ1; baseline and differential.
DR Genevisible; Q9NXZ1; HS.
DR GO; GO:0032039; C:integrator complex; IBA:GO_Central.
DR GO; GO:0016604; C:nuclear body; IDA:HPA.
DR GO; GO:0005654; C:nucleoplasm; IDA:HPA.
DR GO; GO:0034472; P:snRNA 3'-end processing; IBA:GO_Central.
DR InterPro; IPR029307; INT_SG_DDX_CT_C.
DR Pfam; PF15300; INT_SG_DDX_CT_C; 1.
PE 1: Evidence at protein level;
KW Isopeptide bond; Phosphoprotein; Reference proteome; Ubl conjugation.
FT CHAIN 1..904
FT /note="Sarcoma antigen 1"
FT /id="PRO_0000286971"
FT REGION 24..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..40
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 44..62
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 45
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 64
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 238
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT CROSSLNK 778
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0007744|PubMed:28112733"
FT VARIANT 741
FT /note="N -> K (in dbSNP:rs35470903)"
FT /id="VAR_032243"
FT VARIANT 805
FT /note="L -> S (in dbSNP:rs4829799)"
FT /evidence="ECO:0000269|PubMed:10919659"
FT /id="VAR_032244"
SQ SEQUENCE 904 AA; 99225 MW; 0537E3F0E1858CE5 CRC64;
MQASPLQTSQ PTPPEELHAA AYVFTNDGQQ MRSDEVNLVA TGHQSKKKHS RKSKRHSSSK
RRKSMSSWLD KQEDAAVTHS ICEERINNGQ PVADNVLSTA PPWPDATIAH NIREERMENG
QSRTDKVLST APPQLVHMAA AGIPSMSTRD LHSTVTHNIR EERMENGQPQ PDNVLSTGPT
GLINMAATPI PAMSARDLYA TVTHNVCEQK MENVQPAPDN VLLTLRPRRI NMTDTGISPM
STRDPYATIT YNVPEEKMEK GQPQPDNILS TASTGLINVA GAGTPAISTN GLYSTVPHNV
CEEKMENDQP QPNNVLSTVQ PVIIYLTATG IPGMNTRDQY ATITHNVCEE RVVNNQPLPS
NALSTVLPGL AYLATADMPA MSTRDQHATI IHNLREEKKD NSQPTPDNVL SAVTPELINL
AGAGIPPMST RDQYATVNHH VHEARMENGQ RKQDNVLSNV LSGLINMAGA SIPAMSSRDL
YATITHSVRE EKMESGKPQT DKVISNDAPQ LGHMAAGGIP SMSTKDLYAT VTQNVHEERM
ENNQPQPSYD LSTVLPGLTY LTVAGIPAMS TRDQYATVTH NVHEEKIKNG QAASDNVFST
VPPAFINMAA TGVSSMSTRD QYAAVTHNIR EEKINNSQPA PGNILSTAPP WLRHMAAAGI
SSTITRDLYV TATHSVHEEK MTNGQQAPDN SLSTVPPGCI NLSGAGISCR STRDLYATVI
HDIQEEEMEN DQTPPDGFLS NSDSPELINM TGHCMPPNAL DSFSHDFTSL SKDELLYKPD
SNEFAVGTKN YSVSAGDPPV TVMSLVETVP NTPQISPAMA KKINDDIKYQ LMKEVRRFGQ
NYERIFILLE EVQGSMKVKR QFVEFTIKEA ARFKKVVLIQ QLEKALKEID SHCHLRKVKH
MRKR