SARG_HUMAN
ID SARG_HUMAN Reviewed; 601 AA.
AC Q9BW04; C9JV41; Q658X3;
DT 05-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 18-MAY-2010, sequence version 2.
DT 03-AUG-2022, entry version 145.
DE RecName: Full=Specifically androgen-regulated gene protein;
GN Name=SARG; Synonyms=C1orf116;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2), FUNCTION, SUBCELLULAR
RP LOCATION, ALTERNATIVE SPLICING, TISSUE SPECIFICITY, AND VARIANT PRO-444.
RC TISSUE=Prostatic adenocarcinoma;
RX PubMed=15525603; DOI=10.1677/jme.1.01478;
RA Steketee K., Ziel-van der Made A.C., van der Korput H.A., Houtsmuller A.B.,
RA Trapman J.;
RT "A bioinformatics-based functional analysis shows that the specifically
RT androgen-regulated gene SARG contains an active direct repeat androgen
RT response element in the first intron.";
RL J. Mol. Endocrinol. 33:477-491(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND VARIANT PRO-444.
RC TISSUE=Trachea;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT PRO-444.
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND VARIANT PRO-444.
RC TISSUE=Lung;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 224-601 (ISOFORM 1), AND VARIANT
RP PRO-444.
RC TISSUE=Stomach;
RX PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D.,
RA Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A.,
RA Wiemann S., Schupp I.;
RT "The full-ORF clone resource of the German cDNA consortium.";
RL BMC Genomics 8:399-399(2007).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=21269460; DOI=10.1186/1752-0509-5-17;
RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T.,
RA Bennett K.L., Superti-Furga G., Colinge J.;
RT "Initial characterization of the human central proteome.";
RL BMC Syst. Biol. 5:17-17(2011).
RN [8]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-131; SER-133 AND SER-519, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
CC -!- FUNCTION: Putative androgen-specific receptor.
CC {ECO:0000269|PubMed:15525603}.
CC -!- INTERACTION:
CC Q9BW04; Q86YM7: HOMER1; NbExp=7; IntAct=EBI-2320464, EBI-746815;
CC Q9BW04; Q9NSC5: HOMER3; NbExp=3; IntAct=EBI-2320464, EBI-748420;
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000269|PubMed:15525603}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Comment=Additional isoforms seem to exist.;
CC Name=1;
CC IsoId=Q9BW04-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9BW04-2; Sequence=VSP_031228;
CC -!- TISSUE SPECIFICITY: Highly expressed in prostate.
CC {ECO:0000269|PubMed:15525603}.
CC -!- INDUCTION: Expression is up-regulated by androgen, but not by
CC glucocorticoids.
CC -!- SIMILARITY: Belongs to the SARG family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY352640; AAR11484.1; -; mRNA.
DR EMBL; AK093826; BAC04233.1; -; mRNA.
DR EMBL; AC098935; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471100; EAW93511.1; -; Genomic_DNA.
DR EMBL; BC000765; AAH00765.1; -; mRNA.
DR EMBL; AL832940; CAH56284.1; -; mRNA.
DR CCDS; CCDS1475.1; -. [Q9BW04-1]
DR CCDS; CCDS44306.1; -. [Q9BW04-2]
DR RefSeq; NP_001077393.1; NM_001083924.1. [Q9BW04-2]
DR RefSeq; NP_076427.2; NM_023938.5. [Q9BW04-1]
DR RefSeq; XP_005273316.1; XM_005273259.1. [Q9BW04-2]
DR RefSeq; XP_006711593.1; XM_006711530.1. [Q9BW04-1]
DR RefSeq; XP_011508275.1; XM_011509973.2. [Q9BW04-2]
DR AlphaFoldDB; Q9BW04; -.
DR BioGRID; 122545; 17.
DR IntAct; Q9BW04; 7.
DR MINT; Q9BW04; -.
DR STRING; 9606.ENSP00000352447; -.
DR GlyGen; Q9BW04; 1 site, 1 O-linked glycan (1 site).
DR iPTMnet; Q9BW04; -.
DR PhosphoSitePlus; Q9BW04; -.
DR BioMuta; C1orf116; -.
DR DMDM; 296452897; -.
DR EPD; Q9BW04; -.
DR jPOST; Q9BW04; -.
DR MassIVE; Q9BW04; -.
DR MaxQB; Q9BW04; -.
DR PaxDb; Q9BW04; -.
DR PeptideAtlas; Q9BW04; -.
DR PRIDE; Q9BW04; -.
DR ProteomicsDB; 79245; -. [Q9BW04-1]
DR ProteomicsDB; 79246; -. [Q9BW04-2]
DR Antibodypedia; 1991; 71 antibodies from 20 providers.
DR DNASU; 79098; -.
DR Ensembl; ENST00000359470.6; ENSP00000352447.5; ENSG00000182795.13. [Q9BW04-1]
DR Ensembl; ENST00000461135.2; ENSP00000436862.1; ENSG00000182795.13. [Q9BW04-2]
DR GeneID; 79098; -.
DR KEGG; hsa:79098; -.
DR MANE-Select; ENST00000359470.6; ENSP00000352447.5; NM_023938.6; NP_076427.2.
DR UCSC; uc001hfd.3; human. [Q9BW04-1]
DR CTD; 79098; -.
DR DisGeNET; 79098; -.
DR GeneCards; C1orf116; -.
DR HGNC; HGNC:28667; C1orf116.
DR HPA; ENSG00000182795; Tissue enhanced (esophagus, lung, stomach).
DR MIM; 611680; gene.
DR neXtProt; NX_Q9BW04; -.
DR OpenTargets; ENSG00000182795; -.
DR PharmGKB; PA142672500; -.
DR VEuPathDB; HostDB:ENSG00000182795; -.
DR eggNOG; ENOG502RGW5; Eukaryota.
DR GeneTree; ENSGT00390000017874; -.
DR HOGENOM; CLU_035136_0_0_1; -.
DR InParanoid; Q9BW04; -.
DR OMA; NSHTPGE; -.
DR OrthoDB; 484727at2759; -.
DR PhylomeDB; Q9BW04; -.
DR TreeFam; TF336615; -.
DR PathwayCommons; Q9BW04; -.
DR SignaLink; Q9BW04; -.
DR BioGRID-ORCS; 79098; 7 hits in 1049 CRISPR screens.
DR ChiTaRS; C1orf116; human.
DR GenomeRNAi; 79098; -.
DR Pharos; Q9BW04; Tbio.
DR PRO; PR:Q9BW04; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; Q9BW04; protein.
DR Bgee; ENSG00000182795; Expressed in pancreatic ductal cell and 126 other tissues.
DR Genevisible; Q9BW04; HS.
DR GO; GO:0005737; C:cytoplasm; IDA:UniProtKB.
DR GO; GO:0005829; C:cytosol; IDA:HPA.
DR GO; GO:0070062; C:extracellular exosome; HDA:UniProtKB.
DR GO; GO:0005886; C:plasma membrane; IDA:HPA.
DR InterPro; IPR026152; SARG.
DR PANTHER; PTHR21555; PTHR21555; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Cytoplasm; Phosphoprotein; Receptor;
KW Reference proteome.
FT CHAIN 1..601
FT /note="Specifically androgen-regulated gene protein"
FT /id="PRO_0000318575"
FT REGION 1..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 56..538
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 551..601
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..41
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..77
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..122
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 136..158
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 222..262
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 342..356
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 583..601
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 131
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 133
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 519
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT VAR_SEQ 1..246
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15525603"
FT /id="VSP_031228"
FT VARIANT 87
FT /note="P -> S (in dbSNP:rs706846)"
FT /id="VAR_038775"
FT VARIANT 107
FT /note="T -> A (in dbSNP:rs35299018)"
FT /id="VAR_038776"
FT VARIANT 157
FT /note="N -> T (in dbSNP:rs34660159)"
FT /id="VAR_038777"
FT VARIANT 258
FT /note="R -> G (in dbSNP:rs12062114)"
FT /id="VAR_038778"
FT VARIANT 434
FT /note="N -> S (in dbSNP:rs35267170)"
FT /id="VAR_038779"
FT VARIANT 444
FT /note="S -> P (in dbSNP:rs2842726)"
FT /evidence="ECO:0000269|PubMed:14702039,
FT ECO:0000269|PubMed:15489334, ECO:0000269|PubMed:15525603,
FT ECO:0000269|PubMed:17974005, ECO:0000269|Ref.4"
FT /id="VAR_038780"
FT VARIANT 514
FT /note="F -> S (in dbSNP:rs11799966)"
FT /id="VAR_038781"
SQ SEQUENCE 601 AA; 63964 MW; 01CEBFF2712332AC CRC64;
MPERELWPAG TGSEPVTRVG SCDSMMSSTS TRSGSSDSSY DFLSTEEKEC LLFLEETIGS
LDTEADSGLS TDESEPATTP RGFRALPITQ PTPRGGPEET ITQQGRTPRT VTESSSSHPP
EPQGLGLRSG SYSLPRNIHI ARSQNFRKST TQASSHNPGE PGRLAPEPEK EQVSQSSQPR
QAPASPQEAA LDLDVVLIPP PEAFRDTQPE QCREASLPEG PGQQGHTPQL HTPSSSQERE
QTPSEAMSQK AKETVSTRYT QPQPPPAGLP QNARAEDAPL SSGEDPNSRL APLTTPKPRK
LPPNIVLKSS RSSFHSDPQH WLSRHTEAAP GDSGLISCSL QEQRKARKEA LEKLGLPQDQ
DEPGLHLSKP TSSIRPKETR AQHLSPAPGL AQPAAPAQAS AAIPAAGKAL AQAPAPAPGP
AQGPLPMKSP APGNVAASKS MPISIPKAPR ANSALTPPKP ESGLTLQESN TPGLRQMNFK
SNTLERSGVG LSSYLSTEKD ASPKTSTSLG KGSFLDKISP SVLRNSRPRP ASLGTGKDFA
GIQVGKLADL EQEQSSKRLS YQGQSRDKLP RPPCVSVKIS PKGVPNEHRR EALKKLGLLK
E