CEA21_HUMAN
ID CEA21_HUMAN Reviewed; 293 AA.
AC Q3KPI0; B7WNQ6; O75296; Q6UY47; Q96ER7;
DT 21-AUG-2007, integrated into UniProtKB/Swiss-Prot.
DT 23-FEB-2022, sequence version 3.
DT 03-AUG-2022, entry version 139.
DE RecName: Full=Carcinoembryonic antigen-related cell adhesion molecule 21;
DE Flags: Precursor;
GN Name=CEACAM21; ORFNames=UNQ3098/PRO10075;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RX PubMed=12975309; DOI=10.1101/gr.1293003;
RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J.,
RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,
RA Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E., Heldens S., Huang A.,
RA Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D.,
RA Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L.,
RA Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C.,
RA Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J.,
RA Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.;
RT "The secreted protein discovery initiative (SPDI), a large-scale effort to
RT identify novel human secreted and transmembrane proteins: a bioinformatics
RT assessment.";
RL Genome Res. 13:2265-2270(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANTS THR-121 AND
RP MET-198.
RX PubMed=15057824; DOI=10.1038/nature02399;
RA Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E.,
RA Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A.,
RA Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S.,
RA Carrano A.V., Caoile C., Chan Y.M., Christensen M., Cleland C.A.,
RA Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J.,
RA Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M.,
RA Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W.,
RA Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V.,
RA Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D.,
RA McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I.,
RA Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L.,
RA Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A.,
RA She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M.,
RA Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J.,
RA Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E.,
RA Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M.,
RA Rubin E.M., Lucas S.M.;
RT "The DNA sequence and biology of human chromosome 19.";
RL Nature 428:529-535(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 3).
RC TISSUE=Testis;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Single-pass type I
CC membrane protein {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q3KPI0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q3KPI0-2; Sequence=VSP_027304;
CC Name=3;
CC IsoId=Q3KPI0-3; Sequence=VSP_027303;
CC -!- SIMILARITY: Belongs to the immunoglobulin superfamily. CEA family.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC34569.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY358084; AAQ88451.1; -; mRNA.
DR EMBL; AC005626; AAC34569.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AC243960; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC012001; AAH12001.1; -; mRNA.
DR EMBL; BC106727; AAI06728.1; -; mRNA.
DR CCDS; CCDS46086.1; -. [Q3KPI0-1]
DR CCDS; CCDS46087.1; -. [Q3KPI0-2]
DR RefSeq; NP_001091976.3; NM_001098506.3.
DR RefSeq; NP_001275702.2; NM_001288773.2.
DR RefSeq; NP_001277042.1; NM_001290113.1.
DR RefSeq; NP_291021.4; NM_033543.5.
DR RefSeq; XP_005278453.1; XM_005278396.4.
DR RefSeq; XP_005278454.1; XM_005278397.3.
DR AlphaFoldDB; Q3KPI0; -.
DR SMR; Q3KPI0; -.
DR BioGRID; 124686; 85.
DR IntAct; Q3KPI0; 33.
DR STRING; 9606.ENSP00000385739; -.
DR GlyGen; Q3KPI0; 1 site.
DR iPTMnet; Q3KPI0; -.
DR PhosphoSitePlus; Q3KPI0; -.
DR BioMuta; CEACAM21; -.
DR DMDM; 156630480; -.
DR EPD; Q3KPI0; -.
DR MassIVE; Q3KPI0; -.
DR PaxDb; Q3KPI0; -.
DR PeptideAtlas; Q3KPI0; -.
DR PRIDE; Q3KPI0; -.
DR ProteomicsDB; 61718; -. [Q3KPI0-1]
DR ProteomicsDB; 61719; -. [Q3KPI0-2]
DR ProteomicsDB; 61720; -. [Q3KPI0-3]
DR TopDownProteomics; Q3KPI0-2; -. [Q3KPI0-2]
DR Antibodypedia; 30791; 153 antibodies from 24 providers.
DR DNASU; 90273; -.
DR Ensembl; ENST00000187608.13; ENSP00000187608.9; ENSG00000007129.18. [Q3KPI0-2]
DR Ensembl; ENST00000401445.4; ENSP00000385739.2; ENSG00000007129.18. [Q3KPI0-1]
DR Ensembl; ENST00000457737.5; ENSP00000390697.1; ENSG00000007129.18. [Q3KPI0-3]
DR Ensembl; ENST00000611554.3; ENSP00000483598.2; ENSG00000278565.4. [Q3KPI0-2]
DR Ensembl; ENST00000627877.2; ENSP00000487327.1; ENSG00000278565.4. [Q3KPI0-3]
DR Ensembl; ENST00000629689.1; ENSP00000487532.1; ENSG00000278565.4. [Q3KPI0-1]
DR GeneID; 90273; -.
DR KEGG; hsa:90273; -.
DR MANE-Select; ENST00000401445.4; ENSP00000385739.2; NM_001098506.4; NP_001091976.3.
DR UCSC; uc002ore.5; human. [Q3KPI0-1]
DR CTD; 90273; -.
DR DisGeNET; 90273; -.
DR GeneCards; CEACAM21; -.
DR HGNC; HGNC:28834; CEACAM21.
DR HPA; ENSG00000007129; Tissue enhanced (bone marrow, lymphoid tissue).
DR MIM; 618191; gene.
DR neXtProt; NX_Q3KPI0; -.
DR OpenTargets; ENSG00000007129; -.
DR PharmGKB; PA142672135; -.
DR VEuPathDB; HostDB:ENSG00000007129; -.
DR eggNOG; ENOG502T1YP; Eukaryota.
DR GeneTree; ENSGT00960000186634; -.
DR HOGENOM; CLU_024555_6_0_1; -.
DR InParanoid; Q3KPI0; -.
DR OrthoDB; 998214at2759; -.
DR PhylomeDB; Q3KPI0; -.
DR TreeFam; TF336859; -.
DR PathwayCommons; Q3KPI0; -.
DR SignaLink; Q3KPI0; -.
DR BioGRID-ORCS; 90273; 6 hits in 1064 CRISPR screens.
DR ChiTaRS; CEACAM21; human.
DR GenomeRNAi; 90273; -.
DR Pharos; Q3KPI0; Tdark.
DR PRO; PR:Q3KPI0; -.
DR Proteomes; UP000005640; Chromosome 19.
DR RNAct; Q3KPI0; protein.
DR Bgee; ENSG00000007129; Expressed in lymph node and 97 other tissues.
DR ExpressionAtlas; Q3KPI0; baseline and differential.
DR Genevisible; Q3KPI0; HS.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.10; -; 2.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR013106; Ig_V-set.
DR Pfam; PF13895; Ig_2; 1.
DR Pfam; PF07686; V-set; 1.
DR SMART; SM00409; IG; 2.
DR SMART; SM00408; IGc2; 1.
DR SUPFAM; SSF48726; SSF48726; 2.
DR PROSITE; PS50835; IG_LIKE; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Disulfide bond; Glycoprotein; Immunoglobulin domain;
KW Membrane; Reference proteome; Signal; Transmembrane; Transmembrane helix.
FT SIGNAL 1..34
FT /evidence="ECO:0000255"
FT CHAIN 35..293
FT /note="Carcinoembryonic antigen-related cell adhesion
FT molecule 21"
FT /id="PRO_0000297615"
FT TOPO_DOM 35..240
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 241..261
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 262..293
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT DOMAIN 147..231
FT /note="Ig-like C2-type"
FT REGION 267..293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 271..293
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 111
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT DISULFID 166..214
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114"
FT VAR_SEQ 142..293
FT /note="ESVAQPSIQASSTTVTEKGSVVLTCHTNNTGTSFQWIFNNQRLQVTKRMKLS
FT WFNHVLTIDPIRQEDAGEYQCEVSNPVSSNRSDPLKLTVKSDDNTLGILIGVLVGSLLV
FT AALVCFLLLRKTGRASDQSDFREQQPPASTPGHGPSDSSIS -> GECSKFDSEISEDA
FT AWPQDTFCWSLYPQSQWLSPPSKPAAPQSQRRAPWS (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_027303"
FT VAR_SEQ 234..235
FT /note="SD -> Y (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:12975309"
FT /id="VSP_027304"
FT VARIANT 121
FT /note="N -> T (in dbSNP:rs714106)"
FT /evidence="ECO:0000269|PubMed:15057824"
FT /id="VAR_034651"
FT VARIANT 198
FT /note="V -> M (in dbSNP:rs2302188)"
FT /evidence="ECO:0000269|PubMed:15057824"
FT /id="VAR_034652"
SQ SEQUENCE 293 AA; 32354 MW; C1883C58203451C0 CRC64;
MGPPSACPHR ECIPWQGLLL TASLLTFWNA PTTAWLFIAS APFEVAEGEN VHLSVVYLPE
NLYSYGWYKG KTVEPNQLIA AYVIDTHVRT PGPAYSGRET ISPSGDLHFQ NVTLEDTGYY
NLQVTYRNSQ IEQASHHLRV YESVAQPSIQ ASSTTVTEKG SVVLTCHTNN TGTSFQWIFN
NQRLQVTKRM KLSWFNHVLT IDPIRQEDAG EYQCEVSNPV SSNRSDPLKL TVKSDDNTLG
ILIGVLVGSL LVAALVCFLL LRKTGRASDQ SDFREQQPPA STPGHGPSDS SIS