ID GPA33_HUMAN Reviewed; 319 AA.
AC Q99795; Q5VZP6;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-1997, sequence version 1.
DT 03-AUG-2022, entry version 185.
DE RecName: Full=Cell surface A33 antigen;
DE AltName: Full=Glycoprotein A33;
DE Flags: Precursor;
GN Name=GPA33;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], AND PARTIAL PROTEIN SEQUENCE.
RC TISSUE=Colon carcinoma;
RX PubMed=9012807; DOI=10.1073/pnas.94.2.469;
RA Heath J.K., White S.J., Johnstone C.N., Catimel B., Simpson R.J.,
RA Moritz R.L., Tu G.-F., Ji H., Whitehead R.H., Groenen L.C., Scott A.M.,
RA Ritter G., Cohen L., Welt S., Old L.J., Nice E.C., Burgess A.W.;
RT "The human A33 antigen is a transmembrane glycoprotein and a novel member
RT of the immunoglobulin superfamily.";
RL Proc. Natl. Acad. Sci. U.S.A. 94:469-474(1997).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Thymus;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Lung;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [6]
RP GLYCOSYLATION AT ASN-112, AND PALMITOYLATION.
RX PubMed=9245713; DOI=10.1006/bbrc.1997.6966;
RA Ritter G., Cohen L.S., Nice E.C., Catimel B., Burgess A.W., Moritz R.L.,
RA Ji H., Heath J.K., White S.J., Welt S., Old L.J., Simpson R.J.;
RT "Characterization of posttranslational modifications of human A33 antigen,
RT a novel palmitoylated surface glycoprotein of human gastrointestinal
RT epithelium.";
RL Biochem. Biophys. Res. Commun. 236:682-686(1997).
CC -!- FUNCTION: May play a role in cell-cell recognition and signaling.
CC -!- INTERACTION:
CC Q99795; P54852: EMP3; NbExp=3; IntAct=EBI-4289554, EBI-3907816;
CC Q99795; O75355-2: ENTPD3; NbExp=3; IntAct=EBI-4289554, EBI-12279764;
CC Q99795; Q13021: MALL; NbExp=3; IntAct=EBI-4289554, EBI-750078;
CC Q99795; Q8IZ57: NRSN1; NbExp=3; IntAct=EBI-4289554, EBI-10264528;
CC Q99795; P42857: NSG1; NbExp=3; IntAct=EBI-4289554, EBI-6380741;
CC Q99795; Q9NUX5: POT1; NbExp=2; IntAct=EBI-4289554, EBI-752420;
CC Q99795; Q9NRQ5: SMCO4; NbExp=3; IntAct=EBI-4289554, EBI-8640191;
CC Q99795; O00526: UPK2; NbExp=3; IntAct=EBI-4289554, EBI-10179682;
CC -!- SUBCELLULAR LOCATION: Membrane; Single-pass type I membrane protein.
CC -!- TISSUE SPECIFICITY: Expressed in normal gastrointestinal epithelium and
CC in 95% of colon cancers.
CC -!- PTM: N-glycosylated, contains approximately 8 kDa of N-linked
CC carbohydrate. {ECO:0000269|PubMed:9245713}.
CC -!- PTM: Palmitoylated. {ECO:0000269|PubMed:9245713}.
CC -!- WEB RESOURCE: Name=Atlas of Genetics and Cytogenetics in Oncology and
CC Haematology;
CC URL="http://atlasgeneticsoncology.org/Genes/GPA33ID40735ch1q23.html";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U79725; AAC50957.1; -; mRNA.
DR EMBL; AK312833; BAG35687.1; -; mRNA.
DR EMBL; AL158837; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471067; EAW90783.1; -; Genomic_DNA.
DR EMBL; BC069705; AAH69705.1; -; mRNA.
DR EMBL; BC069723; AAH69723.1; -; mRNA.
DR EMBL; BC069745; AAH69745.1; -; mRNA.
DR EMBL; BC069761; AAH69761.1; -; mRNA.
DR EMBL; BC069789; AAH69789.1; -; mRNA.
DR EMBL; BC074830; AAH74830.1; -; mRNA.
DR EMBL; BC074876; AAH74876.1; -; mRNA.
DR EMBL; BC107164; AAI07165.1; -; mRNA.
DR EMBL; BC107165; AAI07166.1; -; mRNA.
DR CCDS; CCDS1258.1; -.
DR RefSeq; NP_005805.1; NM_005814.2.
DR AlphaFoldDB; Q99795; -.
DR SMR; Q99795; -.
DR BioGRID; 115517; 17.
DR IntAct; Q99795; 8.
DR STRING; 9606.ENSP00000356842; -.
DR ChEMBL; CHEMBL3712927; -.
DR GlyGen; Q99795; 3 sites.
DR iPTMnet; Q99795; -.
DR PhosphoSitePlus; Q99795; -.
DR SwissPalm; Q99795; -.
DR BioMuta; GPA33; -.
DR DMDM; 2842765; -.
DR jPOST; Q99795; -.
DR MassIVE; Q99795; -.
DR MaxQB; Q99795; -.
DR PaxDb; Q99795; -.
DR PeptideAtlas; Q99795; -.
DR PRIDE; Q99795; -.
DR ProteomicsDB; 78476; -.
DR ABCD; Q99795; 5 sequenced antibodies.
DR Antibodypedia; 1116; 309 antibodies from 30 providers.
DR DNASU; 10223; -.
DR Ensembl; ENST00000367868.4; ENSP00000356842.3; ENSG00000143167.12.
DR GeneID; 10223; -.
DR KEGG; hsa:10223; -.
DR MANE-Select; ENST00000367868.4; ENSP00000356842.3; NM_005814.3; NP_005805.1.
DR UCSC; uc001gea.2; human.
DR CTD; 10223; -.
DR DisGeNET; 10223; -.
DR GeneCards; GPA33; -.
DR HGNC; HGNC:4445; GPA33.
DR HPA; ENSG00000143167; Tissue enriched (intestine).
DR MIM; 602171; gene.
DR neXtProt; NX_Q99795; -.
DR OpenTargets; ENSG00000143167; -.
DR PharmGKB; PA28826; -.
DR VEuPathDB; HostDB:ENSG00000143167; -.
DR eggNOG; ENOG502QR0Y; Eukaryota.
DR GeneTree; ENSGT00940000160248; -.
DR HOGENOM; CLU_040549_2_0_1; -.
DR InParanoid; Q99795; -.
DR OMA; TEMSGYY; -.
DR OrthoDB; 841952at2759; -.
DR PhylomeDB; Q99795; -.
DR TreeFam; TF330875; -.
DR PathwayCommons; Q99795; -.
DR SignaLink; Q99795; -.
DR BioGRID-ORCS; 10223; 7 hits in 1059 CRISPR screens.
DR ChiTaRS; GPA33; human.
DR GeneWiki; GPA33; -.
DR GenomeRNAi; 10223; -.
DR Pharos; Q99795; Tbio.
DR PRO; PR:Q99795; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; Q99795; protein.
DR Bgee; ENSG00000143167; Expressed in ileal mucosa and 81 other tissues.
DR ExpressionAtlas; Q99795; baseline and differential.
DR Genevisible; Q99795; HS.
DR GO; GO:0070062; C:extracellular exosome; HDA:UniProtKB.
DR GO; GO:0005887; C:integral component of plasma membrane; TAS:ProtInc.
DR GO; GO:0005886; C:plasma membrane; IBA:GO_Central.
DR GO; GO:0038023; F:signaling receptor activity; TAS:ProtInc.
DR Gene3D; 2.60.40.10; -; 2.
DR InterPro; IPR042474; A33.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013106; Ig_V-set.
DR PANTHER; PTHR44969; PTHR44969; 1.
DR Pfam; PF07686; V-set; 1.
DR SMART; SM00409; IG; 2.
DR SMART; SM00408; IGc2; 2.
DR SMART; SM00406; IGv; 1.
DR SUPFAM; SSF48726; SSF48726; 2.
DR PROSITE; PS50835; IG_LIKE; 2.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Disulfide bond; Glycoprotein;
KW Immunoglobulin domain; Lipoprotein; Membrane; Palmitate;
KW Reference proteome; Signal; Transmembrane; Transmembrane helix.
FT SIGNAL 1..21
FT CHAIN 22..319
FT /note="Cell surface A33 antigen"
FT /id="PRO_0000014770"
FT TOPO_DOM 22..235
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 236..256
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 257..319
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT DOMAIN 22..134
FT /note="Ig-like V-type"
FT DOMAIN 140..227
FT /note="Ig-like C2-type"
FT REGION 267..319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 112
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:9245713"
FT CARBOHYD 200
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 223
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT DISULFID 43..117
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114"
FT DISULFID 146..222
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114"
FT DISULFID 162..211
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114"
FT VARIANT 20
FT /note="D -> N (in dbSNP:rs2274531)"
FT /id="VAR_020079"
FT VARIANT 165
FT /note="K -> N (in dbSNP:rs2228399)"
FT /id="VAR_049874"
SQ SEQUENCE 319 AA; 35632 MW; 9BFC7AAF45C2408E CRC64;
MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK
LLLTHTERVV IWPFSNKNYI HGELYKNRVS ISNNAEQSDA SITIDQLTMA DNGTYECSVS
LMSDLEGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI
LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSNEEGTQ FCNITVAVRS PSMNVALYVG
IAVGVVAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD
YRQEEQRSTG RESPDHLDQ
//