CEI_HUMAN
ID CEI_HUMAN Reviewed; 138 AA.
AC Q86SI9;
DT 10-JUL-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2003, sequence version 1.
DT 03-AUG-2022, entry version 114.
DE RecName: Full=Protein CEI;
DE AltName: Full=Coordinated expression to IRXA2 protein;
DE Flags: Precursor;
GN Name=C5orf38; Synonyms=CEI;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORMS 1; 2 AND 3), ALTERNATIVE
RP SPLICING, AND TISSUE SPECIFICITY.
RC TISSUE=Kidney;
RX PubMed=16515847; DOI=10.1016/j.gene.2005.11.033;
RA Wu Q., Tommerup N., Ming Wang S., Hansen L.;
RT "A novel primate specific gene, CEI, is located in the homeobox gene IRXA2
RT promoter in Homo sapiens.";
RL Gene 371:167-173(2006).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Brain;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- INTERACTION:
CC Q86SI9; Q9UHD4: CIDEB; NbExp=3; IntAct=EBI-17872065, EBI-7062247;
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1; Synonyms=CEIa;
CC IsoId=Q86SI9-1; Sequence=Displayed;
CC Name=2; Synonyms=CEIb;
CC IsoId=Q86SI9-2; Sequence=VSP_026648;
CC Name=3; Synonyms=CEIc;
CC IsoId=Q86SI9-3; Sequence=VSP_026649;
CC -!- TISSUE SPECIFICITY: Isoform 1 is highly expressed in small intestine,
CC testis and kidney, medium expressed in brain and heart and low
CC expressed in colon; it could not be detected in liver, adrenal gland
CC and pancreas. {ECO:0000269|PubMed:16515847}.
CC -!- MISCELLANEOUS: According to PubMed:16515847, this gene is only
CC represented in primates genome and it is highly conserved. Also found
CC in bats (according to Ensembl).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY249324; AAP15241.1; -; Genomic_DNA.
DR EMBL; AY249325; AAP15242.1; -; mRNA.
DR EMBL; BC101608; AAI01609.1; -; mRNA.
DR EMBL; BC101634; AAI01635.1; -; mRNA.
DR RefSeq; NP_001281266.1; NM_001294337.1. [Q86SI9-3]
DR RefSeq; NP_001293078.1; NM_001306149.1.
DR RefSeq; NP_848664.1; NM_178569.3. [Q86SI9-1]
DR RefSeq; XP_005248313.1; XM_005248256.3.
DR AlphaFoldDB; Q86SI9; -.
DR BioGRID; 127503; 1.
DR IntAct; Q86SI9; 1.
DR STRING; 9606.ENSP00000334267; -.
DR iPTMnet; Q86SI9; -.
DR BioMuta; C5orf38; -.
DR PaxDb; Q86SI9; -.
DR PRIDE; Q86SI9; -.
DR ProteomicsDB; 69594; -. [Q86SI9-2]
DR DNASU; 153571; -.
DR GeneID; 153571; -.
DR UCSC; uc003jdc.3; human. [Q86SI9-1]
DR CTD; 153571; -.
DR DisGeNET; 153571; -.
DR GeneCards; C5orf38; -.
DR HGNC; HGNC:24226; C5orf38.
DR MIM; 610522; gene.
DR neXtProt; NX_Q86SI9; -.
DR PharmGKB; PA162380158; -.
DR VEuPathDB; HostDB:ENSG00000186493; -.
DR eggNOG; ENOG502TDUE; Eukaryota.
DR HOGENOM; CLU_2793298_0_0_1; -.
DR InParanoid; Q86SI9; -.
DR OMA; QWHYRVH; -.
DR OrthoDB; 1471538at2759; -.
DR PhylomeDB; Q86SI9; -.
DR TreeFam; TF338586; -.
DR PathwayCommons; Q86SI9; -.
DR SignaLink; Q86SI9; -.
DR BioGRID-ORCS; 153571; 11 hits in 1052 CRISPR screens.
DR ChiTaRS; C5orf38; human.
DR GenomeRNAi; 153571; -.
DR Pharos; Q86SI9; Tdark.
DR PRO; PR:Q86SI9; -.
DR Proteomes; UP000005640; Chromosome 5.
DR RNAct; Q86SI9; protein.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
PE 1: Evidence at protein level;
KW Alternative splicing; Reference proteome; Secreted; Signal.
FT SIGNAL 1..35
FT /evidence="ECO:0000255"
FT CHAIN 36..138
FT /note="Protein CEI"
FT /id="PRO_5000090517"
FT VAR_SEQ 67..135
FT /note="RFVLSKHWGDDCYLTNRLWQDLKPPSHVENGQELRLAPPVQWALQVQGNQLQ
FT TAVLCLRMAPPEPAGSR -> S (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16515847"
FT /id="VSP_026649"
FT VAR_SEQ 112..138
FT /note="VQGNQLQTAVLCLRMAPPEPAGSRQRI -> PKNLERVYVDTQVSASGDFLR
FT GRARGTAGPGGSGSGSPRGRGRLRRPGRSPGAAPSSVSRGRKEATQARSRARGRRGGAV
FT ARVCRPESRQRWARPTSSPGGLIRGRRKNGIEAFQ (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16515847"
FT /id="VSP_026648"
SQ SEQUENCE 138 AA; 15091 MW; 2F84F3ED685B3CE3 CRC64;
MVAPAARVFL RAVRAALTST VPDLLCLLAR GSPRGLASGR LPLAVHSAQH GPGSGAPWLR
IARRALRFVL SKHWGDDCYL TNRLWQDLKP PSHVENGQEL RLAPPVQWAL QVQGNQLQTA
VLCLRMAPPE PAGSRQRI