CPG4_CAEBR
ID CPG4_CAEBR Reviewed; 680 AA.
AC A8X9H4;
DT 26-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 05-OCT-2010, sequence version 2.
DT 25-MAY-2022, entry version 50.
DE RecName: Full=Chondroitin proteoglycan 4;
DE Flags: Precursor;
GN Name=cpg-4; ORFNames=CBG09272;
OS Caenorhabditis briggsae.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6238;
RN [1] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AF16;
RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045;
RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N.,
RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., D'Eustachio P.,
RA Fitch D.H.A., Fulton L.A., Fulton R.E., Griffiths-Jones S., Harris T.W.,
RA Hillier L.W., Kamath R., Kuwabara P.E., Mardis E.R., Marra M.A.,
RA Miner T.L., Minx P., Mullikin J.C., Plumb R.W., Rogers J., Schein J.E.,
RA Sohrmann M., Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K.,
RA Durbin R.M., Waterston R.H.;
RT "The genome sequence of Caenorhabditis briggsae: a platform for comparative
RT genomics.";
RL PLoS Biol. 1:166-192(2003).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HE600954; CAP29286.3; -; Genomic_DNA.
DR AlphaFoldDB; A8X9H4; -.
DR STRING; 6238.CBG09272; -.
DR EnsemblMetazoa; CBG09272.1; CBG09272.1; WBGene00030884.
DR WormBase; CBG09272; CBP37594; WBGene00030884; Cbr-cpg-4.
DR eggNOG; ENOG502S6Z7; Eukaryota.
DR HOGENOM; CLU_404523_0_0_1; -.
DR InParanoid; A8X9H4; -.
DR OMA; MCVEQEL; -.
DR OrthoDB; 855831at2759; -.
DR Proteomes; UP000008549; Chromosome V.
DR InterPro; IPR029153; CPG4.
DR Pfam; PF15481; CPG4; 1.
PE 3: Inferred from homology;
KW Glycoprotein; Proteoglycan; Reference proteome; Signal.
FT SIGNAL 1..18
FT /evidence="ECO:0000255"
FT CHAIN 19..680
FT /note="Chondroitin proteoglycan 4"
FT /id="PRO_0000320223"
FT REGION 460..680
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 467..534
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 659..680
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 42
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 59
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 72
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 167
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 205
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 458
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 472
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 486
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 498
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 526
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 527
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 556
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 604
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 640
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000250"
FT CARBOHYD 644
FT /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT /evidence="ECO:0000250"
FT CARBOHYD 664
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 680 AA; 73119 MW; 31AD526A8974C907 CRC64;
MLRVNLLILL CFVPFSLNNP LPTLPPDTAN AYLRSFLPWW PNNTDFSLRA APTPSESENS
TEAVLLGAEY ENGTDSTTGN QEDLDPATLR VQALPDSPLD ALSPENAPKS FVSESMNHHD
GNFIINFDEM GECPRDCSND LREALGIVLQ DMSHVERYHR ICEKYSNAST CVNEDGRCDK
DDRGMFEMMT SGLHYMCVEQ ELAFNATIKC IDDEAGLVQS ECDAQCQTKN LFMNWMMRTA
FQDTIQQGVN GIVGAATGTN ANPLAFLQGA EGAAGGTPTG WADMLATVEQ RPPSPQDAQQ
GFENFRQFTN DLCRIGDCML DCIRSKFNTR CEGSAGTLLS EVFVRPIAAS QNKLSILRPV
LGSFMPEQCN YLTNNADLKK HRIDSTMDEE LKRMYAEKMA KEIRDRNAQD ELLSNLVPLD
ENGVPLPRAL PELKSVDSPL DVSVKTLDQL ILDMYSKNET KKAETTKKSY PNTTTVAPKN
DDQAANTTAE TTKTTSANIT HVETTTLGNP KTEEVLSDVS LDTSGNNSTV ADSGEGSAEG
AGEGTEEDAE YSGSGNESTT EEEYSGSEEV SATDEGHPLA ADESNEGLLT GSGEPAEEAS
GTGNSSVEGS GSGLDSLRSY IETSGEASGG APGEASGEAS GEASGEVSGE EEFSGYSGES
PGENESSGEV PLTTTLHELY