ID YE2H4_CAEEL Reviewed; 518 AA.
AC Q19040;
DT 03-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 05-OCT-2010, sequence version 3.
DT 03-AUG-2022, entry version 107.
DE RecName: Full=Uncharacterized protein E02H4.4;
DE Flags: Precursor;
GN ORFNames=E02H4.4;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-295, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=12754521; DOI=10.1038/nbt829;
RA Kaji H., Saito H., Yamauchi Y., Shinkawa T., Taoka M., Hirabayashi J.,
RA Kasai K., Takahashi N., Isobe T.;
RT "Lectin affinity capture, isotope-coded tagging and mass spectrometry to
RT identify N-linked glycoproteins.";
RL Nat. Biotechnol. 21:667-672(2003).
RN [3]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-295 AND ASN-362, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=17761667; DOI=10.1074/mcp.m600392-mcp200;
RA Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T.,
RA Taoka M., Takahashi N., Isobe T.;
RT "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis
RT elegans and suggests an atypical translocation mechanism for integral
RT membrane proteins.";
RL Mol. Cell. Proteomics 6:2100-2109(2007).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z68003; CAA91977.3; -; Genomic_DNA.
DR PIR; T20422; T20422.
DR RefSeq; NP_510379.3; NM_077978.3.
DR AlphaFoldDB; Q19040; -.
DR iPTMnet; Q19040; -.
DR EPD; Q19040; -.
DR PaxDb; Q19040; -.
DR EnsemblMetazoa; E02H4.4.1; E02H4.4.1; WBGene00008462.
DR GeneID; 183995; -.
DR KEGG; cel:CELE_E02H4.4; -.
DR UCSC; E02H4.4; c. elegans.
DR CTD; 183995; -.
DR WormBase; E02H4.4; CE44096; WBGene00008462; -.
DR eggNOG; ENOG502TG1S; Eukaryota.
DR GeneTree; ENSGT00550000075913; -.
DR HOGENOM; CLU_520963_0_0_1; -.
DR InParanoid; Q19040; -.
DR OMA; LNFYDAS; -.
DR OrthoDB; 1768695at2759; -.
DR PRO; PR:Q19040; -.
DR Proteomes; UP000001940; Chromosome X.
DR Bgee; WBGene00008462; Expressed in embryo and 3 other tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR Gene3D; 2.60.120.290; -; 1.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR SUPFAM; SSF49854; SSF49854; 1.
PE 1: Evidence at protein level;
KW Glycoprotein; Reference proteome; Secreted; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT CHAIN 22..518
FT /note="Uncharacterized protein E02H4.4"
FT /id="PRO_0000250572"
FT DOMAIN 389..517
FT /note="CUB"
FT CARBOHYD 30
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 142
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 295
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:12754521,
FT ECO:0000269|PubMed:17761667"
FT CARBOHYD 342
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 362
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:17761667"
FT CARBOHYD 410
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 503
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 518 AA; 56155 MW; 33CD47208583D6FC CRC64;
MWKWKVILLF LAEMFVSGVN GDCACYEANN VTGKSAPISN SYMTEDFSPC YLPCSYVAYS
GDETYGWTGL VLNWKTTTNS GGIIQIFDGP SATGTPIIQV AEGETIAAGI GKSLIKSSLP
VITIKYSQTG SGASMFILNI NNGTFVTNVY YGSISTAAGL APIAVSTTTT AMPTTRSYSN
PNFAYDPYFI THDIFVVINQ RTSGGMAALT SLNALAINFV TLLSTTTDIN SVSSSRLSLA
TLTPYDPYYS VQGGIWELSA TDVLNNVPRS GVTIEGDIDS ALIGLADLAF TVNKNASTDT
RNNVQRSVVL LTAEWPSNAL LGDNVASKFT ELGLNLLVVG YNLTDAETSQ LLRTDRWYNA
INSSDSKITN VAAFVNPFYF NNGANNFWCP PFGITNSVDG TYSWFQEPYN YTGPHGINGV
WTDPFDGQTG RYCNFANNQY VYQNNAGGSV KVQVYFELEA GKDFLNFYDA SNNLIASFTG
YEIAGSSFFT STTTLTARFT SDNKSIFRGF WVSITPQP