CPG4_CAEEL
ID   CPG4_CAEEL              Reviewed;         782 AA.
AC   O16883;
DT   26-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT   01-OCT-2003, sequence version 3.
DT   03-AUG-2022, entry version 94.
DE   RecName: Full=Chondroitin proteoglycan 4;
DE   Flags: Precursor;
GN   Name=cpg-4 {ECO:0000312|WormBase:C10F3.1}; ORFNames=C10F3.1;
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
OX   NCBI_TaxID=6239;
RN   [1] {ECO:0000305, ECO:0000312|EMBL:ABC65814.1}
RP   NUCLEOTIDE SEQUENCE [MRNA], IDENTIFICATION BY MASS SPECTROMETRY, AND
RP   GLYCOSYLATION AT SER-691; SER-701; SER-704; SER-708; SER-714 AND SER-721.
RX   PubMed=16785326; DOI=10.1083/jcb.200603003;
RA   Olson S.K., Bishop J.R., Yates J.R., Oegema K., Esko J.D.;
RT   "Identification of novel chondroitin proteoglycans in Caenorhabditis
RT   elegans: embryonic cell division depends on CPG-1 and CPG-2.";
RL   J. Cell Biol. 173:985-994(2006).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bristol N2;
RX   PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG   The C. elegans sequencing consortium;
RT   "Genome sequence of the nematode C. elegans: a platform for investigating
RT   biology.";
RL   Science 282:2012-2018(1998).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DQ340626; ABC65814.1; -; mRNA.
DR   EMBL; FO080493; CCD64120.1; -; Genomic_DNA.
DR   PIR; T32155; T32155.
DR   RefSeq; NP_504556.3; NM_072155.4.
DR   AlphaFoldDB; O16883; -.
DR   STRING; 6239.C10F3.1; -.
DR   iPTMnet; O16883; -.
DR   EPD; O16883; -.
DR   PaxDb; O16883; -.
DR   PeptideAtlas; O16883; -.
DR   EnsemblMetazoa; C10F3.1.1; C10F3.1.1; WBGene00015677.
DR   GeneID; 178986; -.
DR   KEGG; cel:CELE_C10F3.1; -.
DR   UCSC; C10F3.1; c. elegans.
DR   CTD; 178986; -.
DR   WormBase; C10F3.1; CE08066; WBGene00015677; cpg-4.
DR   eggNOG; ENOG502S6Z7; Eukaryota.
DR   HOGENOM; CLU_404523_0_0_1; -.
DR   InParanoid; O16883; -.
DR   OrthoDB; 855831at2759; -.
DR   PRO; PR:O16883; -.
DR   Proteomes; UP000001940; Chromosome V.
DR   Bgee; WBGene00015677; Expressed in germ line (C elegans) and 3 other tissues.
DR   InterPro; IPR029153; CPG4.
DR   Pfam; PF15481; CPG4; 1.
PE   1: Evidence at protein level;
KW   Glycoprotein; Proteoglycan; Reference proteome; Signal.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000255"
FT   CHAIN           19..782
FT                   /note="Chondroitin proteoglycan 4"
FT                   /id="PRO_0000320224"
FT   REGION          513..726
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        519..536
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        544..616
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        641..655
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        684..721
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CARBOHYD        76
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        208
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        462
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        468
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        474
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        503
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        559
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        691
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        699
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   CARBOHYD        701
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        704
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        708
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        714
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        721
FT                   /note="O-linked (Xyl...) (chondroitin sulfate) serine"
FT                   /evidence="ECO:0000269|PubMed:16785326"
FT   CARBOHYD        743
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
SQ   SEQUENCE   782 AA;  83512 MW;  842E020AD9478794 CRC64;
     MRLVYSLIFL LFIPFSHPNP IPIPTISPET TNAYLRAFLP WWPEKTDFTL RTAPTPEESE
     AEIVGNLLES SGEKENVTEF ATEKEEIDPS TLRVHDLPPS PLDEFAPEGS PKSLVASGAR
     SSDGNFIISF DEMGECPRDC SNDLRDALGI ILQDMSHVER YRQICGKYTN AITCVNEDTR
     CNKEDRDMFE TMTSGLNYMC VEQKLAFNAT IKCIDDEAGV VQSECDTQCQ TKNLFMNWMM
     KTAFQDTIQQ GVNGIVGAAT GTNANPLAFL QPVAGAAGGA PGGGWADMLA NIGQRPPSPQ
     DAQQGFENFR QFTNDLCRIG DCMLDCIRSK FNTRCEGSAG TLLSEVFVRP IAATQNKLSI
     LRPILGTFMP EQCGYLTNNA ELKKHRIDAT MDEELKRMYA EKIAKEARDR TAQDEILANL
     VPLDENGVPL PRALPELKSI ESPLDVSVKT LDQLILDMYS NNKTEELNIS EKNNVTSTFS
     EPSEKEDEAS TTVISVISPL HTNATDSEIL EHISEKSTEE SSGSSGEMSG DGSDNEASGE
     GSGEYDASGS SGDNSGEFNS SGSSGEASEE GESSGSEDQG SGNYKMIESI ESSGEFSGSS
     GEGSGDTASS DTSIDDKSII RSGEGSAESV SEILQEASGE DAPTLTPTSE ESTGYKIDHS
     GFGESSGSSG ESIELRDSGE GSAEYDASGS SGDNSGDFNS SGSSGEASGV GESSGSEDQG
     SGNYKKIEVI ESSGDYEFSG SSNESIEQSK EGSAASIYEI LQAASGEDTP TLTLLSEDST
     GY
 
 