CPG4_CAEEL
ID CPG4_CAEEL Reviewed; 782 AA.
AC O16883;
DT 26-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2003, sequence version 3.
DT 03-AUG-2022, entry version 94.
DE RecName: Full=Chondroitin proteoglycan 4;
DE Flags: Precursor;
GN Name=cpg-4 {ECO:0000312|WormBase:C10F3.1}; ORFNames=C10F3.1;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1] {ECO:0000305, ECO:0000312|EMBL:ABC65814.1}
RP NUCLEOTIDE SEQUENCE [MRNA], IDENTIFICATION BY MASS SPECTROMETRY, AND
RP GLYCOSYLATION AT SER-691; SER-701; SER-704; SER-708; SER-714 AND SER-721.
RX PubMed=16785326; DOI=10.1083/jcb.200603003;
RA Olson S.K., Bishop J.R., Yates J.R., Oegema K., Esko J.D.;
RT "Identification of novel chondroitin proteoglycans in Caenorhabditis
RT elegans: embryonic cell division depends on CPG-1 and CPG-2.";
RL J. Cell Biol. 173:985-994(2006).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ340626; ABC65814.1; -; mRNA.
DR EMBL; FO080493; CCD64120.1; -; Genomic_DNA.
DR PIR; T32155; T32155.
DR RefSeq; NP_504556.3; NM_072155.4.
DR AlphaFoldDB; O16883; -.
DR STRING; 6239.C10F3.1; -.
DR iPTMnet; O16883; -.
DR EPD; O16883; -.
DR PaxDb; O16883; -.
DR PeptideAtlas; O16883; -.
DR EnsemblMetazoa; C10F3.1.1; C10F3.1.1; WBGene00015677.
DR GeneID; 178986; -.
DR KEGG; cel:CELE_C10F3.1; -.
DR UCSC; C10F3.1; c. elegans.
DR CTD; 178986; -.
DR WormBase; C10F3.1; CE08066; WBGene00015677; cpg-4.
DR eggNOG; ENOG502S6Z7; Eukaryota.
DR HOGENOM; CLU_404523_0_0_1; -.
DR InParanoid; O16883; -.
DR OrthoDB; 855831at2759; -.
DR PRO; PR:O16883; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00015677; Expressed in germ line (C elegans) and 3 other tissues.
DR InterPro; IPR029153; CPG4.
DR Pfam; PF15481; CPG4; 1.
PE 1: Evidence at protein level;
KW Glycoprotein; Proteoglycan; Reference proteome; Signal.
FT SIGNAL 1..18
FT /evidence="ECO:0000255"
FT CHAIN 19..782
FT /note="Chondroitin proteoglycan 4"
FT /id="PRO_0000320224"
FT REGION 513..726
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..536
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 544..616
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..655
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 684..721
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 76
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 208
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 462
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 468
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 474
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 503
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 559
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 691
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 699
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 701
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 704
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 708
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 714
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 721
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000269|PubMed:16785326"
FT CARBOHYD 743
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 782 AA; 83512 MW; 842E020AD9478794 CRC64;
MRLVYSLIFL LFIPFSHPNP IPIPTISPET TNAYLRAFLP WWPEKTDFTL RTAPTPEESE
AEIVGNLLES SGEKENVTEF ATEKEEIDPS TLRVHDLPPS PLDEFAPEGS PKSLVASGAR
SSDGNFIISF DEMGECPRDC SNDLRDALGI ILQDMSHVER YRQICGKYTN AITCVNEDTR
CNKEDRDMFE TMTSGLNYMC VEQKLAFNAT IKCIDDEAGV VQSECDTQCQ TKNLFMNWMM
KTAFQDTIQQ GVNGIVGAAT GTNANPLAFL QPVAGAAGGA PGGGWADMLA NIGQRPPSPQ
DAQQGFENFR QFTNDLCRIG DCMLDCIRSK FNTRCEGSAG TLLSEVFVRP IAATQNKLSI
LRPILGTFMP EQCGYLTNNA ELKKHRIDAT MDEELKRMYA EKIAKEARDR TAQDEILANL
VPLDENGVPL PRALPELKSI ESPLDVSVKT LDQLILDMYS NNKTEELNIS EKNNVTSTFS
EPSEKEDEAS TTVISVISPL HTNATDSEIL EHISEKSTEE SSGSSGEMSG DGSDNEASGE
GSGEYDASGS SGDNSGEFNS SGSSGEASEE GESSGSEDQG SGNYKMIESI ESSGEFSGSS
GEGSGDTASS DTSIDDKSII RSGEGSAESV SEILQEASGE DAPTLTPTSE ESTGYKIDHS
GFGESSGSSG ESIELRDSGE GSAEYDASGS SGDNSGDFNS SGSSGEASGV GESSGSEDQG
SGNYKKIEVI ESSGDYEFSG SSNESIEQSK EGSAASIYEI LQAASGEDTP TLTLLSEDST
GY
//