CPG3_CAEEL
ID CPG3_CAEEL Reviewed; 292 AA.
AC Q21771;
DT 26-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1996, sequence version 1.
DT 03-AUG-2022, entry version 100.
DE RecName: Full=Chondroitin proteoglycan 3;
DE Flags: Precursor;
GN Name=cpg-3 {ECO:0000312|EMBL:CAA95840.1, ECO:0000312|WormBase:R06C7.4};
GN ORFNames=R06C7.4;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1] {ECO:0000305, ECO:0000312|EMBL:ABC65813.1}
RP NUCLEOTIDE SEQUENCE [MRNA], AND IDENTIFICATION BY MASS SPECTROMETRY.
RX PubMed=16785326; DOI=10.1083/jcb.200603003;
RA Olson S.K., Bishop J.R., Yates J.R., Oegema K., Esko J.D.;
RT "Identification of novel chondroitin proteoglycans in Caenorhabditis
RT elegans: embryonic cell division depends on CPG-1 and CPG-2.";
RL J. Cell Biol. 173:985-994(2006).
RN [2] {ECO:0000312|EMBL:CAA95840.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [3]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-174, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=17761667; DOI=10.1074/mcp.m600392-mcp200;
RA Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T.,
RA Taoka M., Takahashi N., Isobe T.;
RT "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis
RT elegans and suggests an atypical translocation mechanism for integral
RT membrane proteins.";
RL Mol. Cell. Proteomics 6:2100-2109(2007).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ340625; ABC65813.1; -; mRNA.
DR EMBL; Z71266; CAA95840.1; -; Genomic_DNA.
DR PIR; T23966; T23966.
DR RefSeq; NP_492047.1; NM_059646.5.
DR AlphaFoldDB; Q21771; -.
DR SMR; Q21771; -.
DR BioGRID; 37908; 4.
DR IntAct; Q21771; 2.
DR STRING; 6239.R06C7.4.1; -.
DR iPTMnet; Q21771; -.
DR PaxDb; Q21771; -.
DR PeptideAtlas; Q21771; -.
DR EnsemblMetazoa; R06C7.4.1; R06C7.4.1; WBGene00011063.
DR EnsemblMetazoa; R06C7.4.2; R06C7.4.2; WBGene00011063.
DR GeneID; 172465; -.
DR KEGG; cel:CELE_R06C7.4; -.
DR UCSC; R06C7.4.3; c. elegans.
DR CTD; 172465; -.
DR WormBase; R06C7.4; CE06247; WBGene00011063; cpg-3.
DR eggNOG; ENOG502RT8N; Eukaryota.
DR GeneTree; ENSGT00970000196222; -.
DR HOGENOM; CLU_1009114_0_0_1; -.
DR InParanoid; Q21771; -.
DR OMA; FVITEMT; -.
DR OrthoDB; 1392893at2759; -.
DR PRO; PR:Q21771; -.
DR Proteomes; UP000001940; Chromosome I.
DR Bgee; WBGene00011063; Expressed in germ line (C elegans) and 3 other tissues.
DR InterPro; IPR039260; Cpg-3.
DR PANTHER; PTHR37973; PTHR37973; 1.
PE 1: Evidence at protein level;
KW Glycoprotein; Proteoglycan; Reference proteome; Signal.
FT SIGNAL 1..17
FT /evidence="ECO:0000255"
FT CHAIN 18..292
FT /note="Chondroitin proteoglycan 3"
FT /id="PRO_0000320222"
FT REGION 28..103
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..55
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 66..85
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 174
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:17761667"
FT CARBOHYD 254
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 292 AA; 29731 MW; FC5C58A2FD070098 CRC64;
MRFVFIIALL LIGASLAHPA DPIRAKRDVS ASEDEFSGDS SGEISGESSG EASGEASGEA
SGEASGEASG ESSGETSGES SGDEETSGEG SGEEGSGDTS PVVPVDELTL QQLETLNTYA
QQVQAESQKL IHQANFVITE MTALSANAQN LGILSNIVLA NSQMVLDSAR LSLNETETET
GTSAPATCVS SAVCYGDSGC GSGKCIGALA GTCNCNSCVF GWPCQEDSAC GGFNGACNSI
TATCDCFAAY TKNNLTLAEA LTSFCNVETC NGAEDNVEKC HGLPCNYGFC VC