ID CH033_HUMAN Reviewed; 229 AA.
AC Q9H7E9; A6NGC0; Q96BT8;
DT 02-OCT-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 146.
DE RecName: Full=UPF0488 protein C8orf33;
GN Name=C8orf33;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Endothelial cell;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RA Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.;
RT "Cloning of human full open reading frames in Gateway(TM) system entry
RT vector (pDONR201).";
RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16421571; DOI=10.1038/nature04406;
RA Nusbaum C., Mikkelsen T.S., Zody M.C., Asakawa S., Taudien S., Garber M.,
RA Kodira C.D., Schueler M.G., Shimizu A., Whittaker C.A., Chang J.L.,
RA Cuomo C.A., Dewar K., FitzGerald M.G., Yang X., Allen N.R., Anderson S.,
RA Asakawa T., Blechschmidt K., Bloom T., Borowsky M.L., Butler J., Cook A.,
RA Corum B., DeArellano K., DeCaprio D., Dooley K.T., Dorris L. III,
RA Engels R., Gloeckner G., Hafez N., Hagopian D.S., Hall J.L., Ishikawa S.K.,
RA Jaffe D.B., Kamat A., Kudoh J., Lehmann R., Lokitsang T., Macdonald P.,
RA Major J.E., Matthews C.D., Mauceli E., Menzel U., Mihalev A.H.,
RA Minoshima S., Murayama Y., Naylor J.W., Nicol R., Nguyen C., O'Leary S.B.,
RA O'Neill K., Parker S.C.J., Polley A., Raymond C.K., Reichwald K.,
RA Rodriguez J., Sasaki T., Schilhabel M., Siddiqui R., Smith C.L.,
RA Sneddon T.P., Talamas J.A., Tenzin P., Topham K., Venkataraman V., Wen G.,
RA Yamazaki S., Young S.K., Zeng Q., Zimmer A.R., Rosenthal A., Birren B.W.,
RA Platzer M., Shimizu N., Lander E.S.;
RT "DNA sequence and analysis of human chromosome 8.";
RL Nature 439:331-335(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 76-229 (ISOFORM 2).
RC TISSUE=Brain, and Uterus;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [6]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, CLEAVAGE OF INITIATOR
RP METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS SPECTROMETRY
RP [LARGE SCALE ANALYSIS].
RX PubMed=19413330; DOI=10.1021/ac9004309;
RA Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.;
RT "Lys-N and trypsin cover complementary parts of the phosphoproteome in a
RT refined SCX-based approach.";
RL Anal. Chem. 81:4493-4501(2009).
RN [7]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-82, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=20068231; DOI=10.1126/scisignal.2000475;
RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L.,
RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.;
RT "Quantitative phosphoproteomics reveals widespread full phosphorylation
RT site occupancy during mitosis.";
RL Sci. Signal. 3:RA3-RA3(2010).
RN [8]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=21269460; DOI=10.1186/1752-0509-5-17;
RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T.,
RA Bennett K.L., Superti-Furga G., Colinge J.;
RT "Initial characterization of the human central proteome.";
RL BMC Syst. Biol. 5:17-17(2011).
RN [9]
RP METHYLATION [LARGE SCALE ANALYSIS] AT ARG-27, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Colon carcinoma;
RX PubMed=24129315; DOI=10.1074/mcp.o113.027870;
RA Guo A., Gu H., Zhou J., Mulhern D., Wang Y., Lee K.A., Yang V., Aguiar M.,
RA Kornhauser J., Jia X., Ren J., Beausoleil S.A., Silva J.C., Vemulapalli V.,
RA Bedford M.T., Comb M.J.;
RT "Immunoaffinity enrichment and mass spectrometry analysis of protein
RT methylation.";
RL Mol. Cell. Proteomics 13:372-387(2014).
RN [10]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=25944712; DOI=10.1002/pmic.201400617;
RA Vaca Jacome A.S., Rabilloud T., Schaeffer-Reiss C., Rompais M., Ayoub D.,
RA Lane L., Bairoch A., Van Dorsselaer A., Carapito C.;
RT "N-terminome analysis of the human mitochondrial proteome.";
RL Proteomics 15:2519-2524(2015).
CC -!- INTERACTION:
CC Q9H7E9; Q8NHQ1: CEP70; NbExp=5; IntAct=EBI-715389, EBI-739624;
CC Q9H7E9; Q9NRI5-2: DISC1; NbExp=3; IntAct=EBI-715389, EBI-11988027;
CC Q9H7E9; Q8IZU0: FAM9B; NbExp=3; IntAct=EBI-715389, EBI-10175124;
CC Q9H7E9; Q6NT76: HMBOX1; NbExp=6; IntAct=EBI-715389, EBI-2549423;
CC Q9H7E9; Q13422: IKZF1; NbExp=3; IntAct=EBI-715389, EBI-745305;
CC Q9H7E9; Q13422-7: IKZF1; NbExp=3; IntAct=EBI-715389, EBI-11522367;
CC Q9H7E9; Q8NC69: KCTD6; NbExp=3; IntAct=EBI-715389, EBI-2511344;
CC Q9H7E9; P60409: KRTAP10-7; NbExp=3; IntAct=EBI-715389, EBI-10172290;
CC Q9H7E9; Q9BRK4: LZTS2; NbExp=3; IntAct=EBI-715389, EBI-741037;
CC Q9H7E9; P23508: MCC; NbExp=3; IntAct=EBI-715389, EBI-307531;
CC Q9H7E9; Q99750: MDFI; NbExp=5; IntAct=EBI-715389, EBI-724076;
CC Q9H7E9; Q96RE7: NACC1; NbExp=3; IntAct=EBI-715389, EBI-7950997;
CC Q9H7E9; Q9NRD5: PICK1; NbExp=3; IntAct=EBI-715389, EBI-79165;
CC Q9H7E9; Q8ND90: PNMA1; NbExp=3; IntAct=EBI-715389, EBI-302345;
CC Q9H7E9; Q15025: TNIP1; NbExp=3; IntAct=EBI-715389, EBI-357849;
CC Q9H7E9; P36406: TRIM23; NbExp=3; IntAct=EBI-715389, EBI-740098;
CC Q9H7E9; Q8WV44: TRIM41; NbExp=3; IntAct=EBI-715389, EBI-725997;
CC Q9H7E9; Q8TF50: ZNF526; NbExp=3; IntAct=EBI-715389, EBI-11035148;
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q9H7E9-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9H7E9-2; Sequence=VSP_028169;
CC -!- SIMILARITY: Belongs to the UPF0488 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK024642; BAB14943.1; -; mRNA.
DR EMBL; CR457330; CAG33611.1; -; mRNA.
DR EMBL; AC139103; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471162; EAW82023.1; -; Genomic_DNA.
DR EMBL; BC010001; AAH10001.1; -; mRNA.
DR EMBL; BC015181; AAH15181.1; -; mRNA.
DR CCDS; CCDS34974.1; -. [Q9H7E9-1]
DR RefSeq; NP_075568.1; NM_023080.2. [Q9H7E9-1]
DR AlphaFoldDB; Q9H7E9; -.
DR SMR; Q9H7E9; -.
DR BioGRID; 122420; 173.
DR IntAct; Q9H7E9; 77.
DR MINT; Q9H7E9; -.
DR STRING; 9606.ENSP00000330361; -.
DR GlyGen; Q9H7E9; 1 site, 1 O-linked glycan (1 site).
DR iPTMnet; Q9H7E9; -.
DR PhosphoSitePlus; Q9H7E9; -.
DR BioMuta; C8orf33; -.
DR DMDM; 74733630; -.
DR EPD; Q9H7E9; -.
DR jPOST; Q9H7E9; -.
DR MassIVE; Q9H7E9; -.
DR MaxQB; Q9H7E9; -.
DR PaxDb; Q9H7E9; -.
DR PeptideAtlas; Q9H7E9; -.
DR PRIDE; Q9H7E9; -.
DR ProteomicsDB; 81116; -. [Q9H7E9-1]
DR ProteomicsDB; 81117; -. [Q9H7E9-2]
DR Antibodypedia; 28690; 77 antibodies from 12 providers.
DR DNASU; 65265; -.
DR Ensembl; ENST00000331434.7; ENSP00000330361.6; ENSG00000182307.14. [Q9H7E9-1]
DR GeneID; 65265; -.
DR KEGG; hsa:65265; -.
DR MANE-Select; ENST00000331434.7; ENSP00000330361.6; NM_023080.3; NP_075568.1.
DR UCSC; uc003zfc.5; human. [Q9H7E9-1]
DR CTD; 65265; -.
DR DisGeNET; 65265; -.
DR GeneCards; C8orf33; -.
DR HGNC; HGNC:26104; C8orf33.
DR HPA; ENSG00000182307; Low tissue specificity.
DR neXtProt; NX_Q9H7E9; -.
DR OpenTargets; ENSG00000182307; -.
DR PharmGKB; PA142672352; -.
DR VEuPathDB; HostDB:ENSG00000182307; -.
DR eggNOG; ENOG502S1RU; Eukaryota.
DR GeneTree; ENSGT00390000000306; -.
DR HOGENOM; CLU_082144_1_1_1; -.
DR InParanoid; Q9H7E9; -.
DR OMA; LKMQRPT; -.
DR OrthoDB; 583605at2759; -.
DR PhylomeDB; Q9H7E9; -.
DR TreeFam; TF326272; -.
DR PathwayCommons; Q9H7E9; -.
DR SignaLink; Q9H7E9; -.
DR BioGRID-ORCS; 65265; 102 hits in 1068 CRISPR screens.
DR ChiTaRS; C8orf33; human.
DR GenomeRNAi; 65265; -.
DR Pharos; Q9H7E9; Tdark.
DR PRO; PR:Q9H7E9; -.
DR Proteomes; UP000005640; Chromosome 8.
DR RNAct; Q9H7E9; protein.
DR Bgee; ENSG00000182307; Expressed in right adrenal gland cortex and 194 other tissues.
DR ExpressionAtlas; Q9H7E9; baseline and differential.
DR Genevisible; Q9H7E9; HS.
DR InterPro; IPR029274; DUF4615.
DR PANTHER; PTHR13602; PTHR13602; 1.
DR Pfam; PF15393; DUF4615; 1.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Methylation; Phosphoprotein;
KW Reference proteome.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0007744|PubMed:19413330"
FT CHAIN 2..229
FT /note="UPF0488 protein C8orf33"
FT /id="PRO_0000304982"
FT REGION 1..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..54
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 2
FT /note="N-acetylalanine"
FT /evidence="ECO:0007744|PubMed:19413330"
FT MOD_RES 27
FT /note="Omega-N-methylarginine"
FT /evidence="ECO:0007744|PubMed:24129315"
FT MOD_RES 82
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:20068231"
FT VAR_SEQ 106
FT /note="Q -> QARAGFGREGWNPR (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_028169"
SQ SEQUENCE 229 AA; 24993 MW; EFAFCF3ED552AC45 CRC64;
MAALGHLAGE AAAAPGPGTP CASRGARLPG PVSSARNPST VCLCPEQPTC SNADSRAHPL
GDEGGTASKK QKNKKKTRNR ASVANGGEKA SEKLAPEEVP LSAEAQAQQL AQELAWCVEQ
LELGLKRQKP TPKQKEQAIG AIRTLRSKRT PLPRKRQLMH SLFGDYRAQM EAEWREALRA
LRAAAYSAQV QPVDGATRKK SQRVCRPRSI WRAKATLDMP DEEFRFNFF
//