U376A_CAEEL
ID U376A_CAEEL Reviewed; 203 AA.
AC O01454;
DT 05-SEP-2006, integrated into UniProtKB/Swiss-Prot.
DT 27-JUL-2011, sequence version 4.
DT 03-AUG-2022, entry version 110.
DE RecName: Full=UPF0376 protein C03G6.5;
DE Flags: Precursor;
GN ORFNames=C03G6.5;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-84, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=12754521; DOI=10.1038/nbt829;
RA Kaji H., Saito H., Yamauchi Y., Shinkawa T., Taoka M., Hirabayashi J.,
RA Kasai K., Takahashi N., Isobe T.;
RT "Lectin affinity capture, isotope-coded tagging and mass spectrometry to
RT identify N-linked glycoproteins.";
RL Nat. Biotechnol. 21:667-672(2003).
RN [3]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-84, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RX PubMed=15888633; DOI=10.1093/glycob/cwi075;
RA Fan X., She Y.-M., Bagshaw R.D., Callahan J.W., Schachter H., Mahuran D.J.;
RT "Identification of the hydrophobic glycoproteins of Caenorhabditis
RT elegans.";
RL Glycobiology 15:952-964(2005).
RN [4]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-32 AND ASN-84, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=17761667; DOI=10.1074/mcp.m600392-mcp200;
RA Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T.,
RA Taoka M., Takahashi N., Isobe T.;
RT "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis
RT elegans and suggests an atypical translocation mechanism for integral
RT membrane proteins.";
RL Mol. Cell. Proteomics 6:2100-2109(2007).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the UPF0376 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO080297; CCD62706.1; -; Genomic_DNA.
DR RefSeq; NP_504881.3; NM_072480.3.
DR AlphaFoldDB; O01454; -.
DR STRING; 6239.C03G6.5; -.
DR iPTMnet; O01454; -.
DR EPD; O01454; -.
DR PaxDb; O01454; -.
DR PeptideAtlas; O01454; -.
DR PRIDE; O01454; -.
DR EnsemblMetazoa; C03G6.5.1; C03G6.5.1; WBGene00015393.
DR GeneID; 179123; -.
DR KEGG; cel:CELE_C03G6.5; -.
DR UCSC; C03G6.5; c. elegans.
DR CTD; 179123; -.
DR WormBase; C03G6.5; CE45476; WBGene00015393; -.
DR eggNOG; ENOG502THJ5; Eukaryota.
DR GeneTree; ENSGT00970000195826; -.
DR HOGENOM; CLU_078890_1_0_1; -.
DR InParanoid; O01454; -.
DR OrthoDB; 1767442at2759; -.
DR PhylomeDB; O01454; -.
DR PRO; PR:O01454; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00015393; Expressed in larva and 2 other tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR InterPro; IPR002542; DUF19.
DR InterPro; IPR016638; UPF0376.
DR Pfam; PF01579; DUF19; 1.
DR PIRSF; PIRSF015697; UCP015697; 1.
PE 1: Evidence at protein level;
KW Glycoprotein; Reference proteome; Secreted; Signal.
FT SIGNAL 1..20
FT /evidence="ECO:0000255"
FT CHAIN 21..203
FT /note="UPF0376 protein C03G6.5"
FT /id="PRO_0000248555"
FT CARBOHYD 32
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:17761667"
FT CARBOHYD 84
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:12754521,
FT ECO:0000269|PubMed:15888633, ECO:0000269|PubMed:17761667"
FT CARBOHYD 188
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 203 AA; 22819 MW; 385573D71D850AB2 CRC64;
MIGFLKFALI GTVLLGVANG ASVATASAKG SNCTVQDGYA ALMCLVRLSD FAEKVDNLDM
NDKTKLKEFK RSCDSLHSCY SNLNCTTKSD DEKDKYVESI KQYCDAVVYV SDGFSKCSDK
LNEKKSKCFD DWDPIPNKIH LEEDEAKIEK IKNEACKTYF GKDDCMKKEI TETCGKEEWD
SFRKQFVNLS GGLVSKCDFS RFE