CA226_HUMAN
ID CA226_HUMAN Reviewed; 272 AA.
AC A1L170; B4DF31;
DT 20-MAY-2008, integrated into UniProtKB/Swiss-Prot.
DT 06-FEB-2007, sequence version 1.
DT 03-AUG-2022, entry version 109.
DE RecName: Full=Uncharacterized protein C1orf226;
GN Name=C1orf226;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC TISSUE=Cerebellum;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [4]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-249, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18220336; DOI=10.1021/pr0705441;
RA Cantin G.T., Yi W., Lu B., Park S.K., Xu T., Lee J.-D., Yates J.R. III;
RT "Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient
RT phosphoproteomic analysis.";
RL J. Proteome Res. 7:1346-1351(2008).
RN [5]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18691976; DOI=10.1016/j.molcel.2008.07.007;
RA Daub H., Olsen J.V., Bairlein M., Gnad F., Oppermann F.S., Korner R.,
RA Greff Z., Keri G., Stemmann O., Mann M.;
RT "Kinase-selective enrichment enables quantitative phosphoproteomics of the
RT kinome across the cell cycle.";
RL Mol. Cell 31:438-448(2008).
RN [6]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-222; SER-223; SER-249 AND
RP SER-258, AND IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18669648; DOI=10.1073/pnas.0805139105;
RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E.,
RA Elledge S.J., Gygi S.P.;
RT "A quantitative atlas of mitotic phosphorylation.";
RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19413330; DOI=10.1021/ac9004309;
RA Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.;
RT "Lys-N and trypsin cover complementary parts of the phosphoproteome in a
RT refined SCX-based approach.";
RL Anal. Chem. 81:4493-4501(2009).
RN [8]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-249, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=20068231; DOI=10.1126/scisignal.2000475;
RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L.,
RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.;
RT "Quantitative phosphoproteomics reveals widespread full phosphorylation
RT site occupancy during mitosis.";
RL Sci. Signal. 3:RA3-RA3(2010).
RN [9]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-223, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma, and Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
RN [10]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-249, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Liver;
RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014;
RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., Wang L.,
RA Ye M., Zou H.;
RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human liver
RT phosphoproteome.";
RL J. Proteomics 96:253-262(2014).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=A1L170-1; Sequence=Displayed;
CC Name=2;
CC IsoId=A1L170-2; Sequence=VSP_040791;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK293907; BAG57292.1; -; mRNA.
DR EMBL; AL512785; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC127743; AAI27744.1; -; mRNA.
DR EMBL; BC127744; AAI27745.1; -; mRNA.
DR CCDS; CCDS44268.1; -. [A1L170-2]
DR CCDS; CCDS53422.1; -. [A1L170-1]
DR RefSeq; NP_001078844.1; NM_001085375.1. [A1L170-1]
DR RefSeq; NP_001128712.1; NM_001135240.1. [A1L170-2]
DR AlphaFoldDB; A1L170; -.
DR BioGRID; 134762; 50.
DR IntAct; A1L170; 11.
DR MINT; A1L170; -.
DR STRING; 9606.ENSP00000413150; -.
DR iPTMnet; A1L170; -.
DR PhosphoSitePlus; A1L170; -.
DR BioMuta; C1orf226; -.
DR EPD; A1L170; -.
DR jPOST; A1L170; -.
DR MassIVE; A1L170; -.
DR MaxQB; A1L170; -.
DR PaxDb; A1L170; -.
DR PeptideAtlas; A1L170; -.
DR PRIDE; A1L170; -.
DR ProteomicsDB; 134; -. [A1L170-1]
DR ProteomicsDB; 135; -. [A1L170-2]
DR Antibodypedia; 64530; 39 antibodies from 8 providers.
DR DNASU; 400793; -.
DR Ensembl; ENST00000426197.2; ENSP00000413150.2; ENSG00000239887.6. [A1L170-2]
DR Ensembl; ENST00000458626.4; ENSP00000437071.1; ENSG00000239887.6. [A1L170-1]
DR GeneID; 400793; -.
DR KEGG; hsa:400793; -.
DR MANE-Select; ENST00000458626.4; ENSP00000437071.1; NM_001085375.2; NP_001078844.1.
DR UCSC; uc001gby.2; human. [A1L170-1]
DR CTD; 400793; -.
DR DisGeNET; 400793; -.
DR GeneCards; C1orf226; -.
DR HGNC; HGNC:34351; C1orf226.
DR HPA; ENSG00000239887; Tissue enhanced (liver).
DR neXtProt; NX_A1L170; -.
DR OpenTargets; ENSG00000239887; -.
DR PharmGKB; PA164717033; -.
DR VEuPathDB; HostDB:ENSG00000239887; -.
DR eggNOG; KOG4815; Eukaryota.
DR GeneTree; ENSGT00510000046975; -.
DR HOGENOM; CLU_882664_0_0_1; -.
DR InParanoid; A1L170; -.
DR OMA; LEAPMEV; -.
DR OrthoDB; 1123753at2759; -.
DR PhylomeDB; A1L170; -.
DR TreeFam; TF343380; -.
DR PathwayCommons; A1L170; -.
DR SignaLink; A1L170; -.
DR BioGRID-ORCS; 400793; 10 hits in 1064 CRISPR screens.
DR ChiTaRS; C1orf226; human.
DR GenomeRNAi; 400793; -.
DR Pharos; A1L170; Tdark.
DR PRO; PR:A1L170; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; A1L170; protein.
DR Bgee; ENSG00000239887; Expressed in oocyte and 108 other tissues.
DR Genevisible; A1L170; HS.
DR InterPro; IPR027851; DUF4628.
DR Pfam; PF15429; DUF4628; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Phosphoprotein; Reference proteome.
FT CHAIN 1..272
FT /note="Uncharacterized protein C1orf226"
FT /id="PRO_0000334679"
FT REGION 1..47
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 111..135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 147..272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..17
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..165
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 203..217
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 222
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648"
FT MOD_RES 223
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648,
FT ECO:0007744|PubMed:23186163"
FT MOD_RES 249
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18220336,
FT ECO:0007744|PubMed:18669648, ECO:0007744|PubMed:20068231,
FT ECO:0007744|PubMed:24275569"
FT MOD_RES 258
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648"
FT VAR_SEQ 1
FT /note="M -> MLVLVAQAGGDTLKVTGHYVKCVFLLFVSHLLSEPLSRTVDHGM
FT (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_040791"
SQ SEQUENCE 272 AA; 29057 MW; 01D9EA3C6AD46ED1 CRC64;
MFENLNTALT PKLQASRSFP HLSKPVAPGS APLGSGEPGG PGLWVGSSQH LKNLGKAMGA
KVNDFLRRKE PSSLGSVGVT EINKTAGAQL ASGTDAAPEA WLEDERSVLQ ETFPRLDPPP
PITRKRTPRA LKTTQDMLIS SQPVLSSLEY GTEPSPGQAQ DSAPTAQPDV PADASQPEAT
MEREERGKVL PNGEVSLSVP DLIHKDSQDE SKLKMTECRR ASSPSLIERN GFKLSLSPIS
LAESWEDGSP PPQARTSSLD NEGPHPDLLS FE