PSGP_ONCMY
ID PSGP_ONCMY Reviewed; 542 AA.
AC P12027;
DT 01-OCT-1989, integrated into UniProtKB/Swiss-Prot.
DT 01-FEB-1991, sequence version 2.
DT 03-AUG-2022, entry version 70.
DE RecName: Full=Polysialoglycoprotein;
DE Short=PSGP;
DE AltName: Full=Apopolysialoglycoprotein;
DE Short=apoPSGP;
DE Flags: Precursor;
OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Protacanthopterygii; Salmoniformes;
OC Salmonidae; Salmoninae; Oncorhynchus.
OX NCBI_TaxID=8022;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC TISSUE=Egg;
RX PubMed=3182867; DOI=10.1016/s0021-9258(19)77890-9;
RA Sorimachi H., Emori Y., Kawasaki H., Kitajima K., Inoue S., Suzuki K.,
RA Inoue Y.;
RT "Molecular cloning and characterization of cDNAs coding for apo-
RT polysialoglycoprotein of rainbow trout eggs. Multiple mRNA species
RT transcribed from multiple genes contain diverged numbers of exact 39-base
RT (13-amino acid) repeats.";
RL J. Biol. Chem. 263:17678-17684(1988).
RN [2]
RP PROTEIN SEQUENCE OF 174-510.
RC TISSUE=Egg;
RX PubMed=3514613; DOI=10.1016/s0021-9258(19)57208-8;
RA Kitajima K., Inoue Y., Inoue S.;
RT "Polysialoglycoproteins of Salmonidae fish eggs. Complete structure of 200-
RT kDa polysialoglycoprotein from the unfertilized eggs of rainbow trout
RT (Salmo gairdneri).";
RL J. Biol. Chem. 261:5262-5269(1986).
RN [3]
RP GENE FAMILY ORGANIZATION.
RX PubMed=2299671; DOI=10.1016/0022-2836(90)90009-b;
RA Sorimachi H., Emori Y., Kawasaki H., Suzuki K., Inoue Y.;
RT "Organization and primary sequence of multiple genes coding for the
RT apopolysialoglycoproteins of rainbow trout.";
RL J. Mol. Biol. 211:35-48(1990).
CC -!- FUNCTION: In response to egg activation, PSGP is discharged by
CC exocytosis into the perivitelline space, where it undergoes rapid
CC proteolysis into glycotridecapeptides. During fertilization and/or
CC early development the glycotridecapeptides prevent polyspermy or are
CC involved in the formation of a fertilization membrane.
CC -!- TISSUE SPECIFICITY: Cortical alveoli of immature ovaries.
CC -!- PTM: Most sialic acid residues exist in the form of polysialyl groups
CC partly capped with deaminoneuraminic acid.
CC -!- MISCELLANEOUS: The core of the PSGP protein contains an average of 25
CC exact tandem repeats of the same glycotridecapeptide, where the Ser and
CC the Thr residues are attachment sites of a polysialylglycan chain.
CC -!- MISCELLANEOUS: Multiple genes for PSGP are transcribed into multiple
CC mRNAs containing diverged numbers of repeating units.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; J04051; AAA49548.1; -; mRNA.
DR PIR; S08207; S08207.
DR RefSeq; NP_001118159.1; NM_001124687.1.
DR AlphaFoldDB; P12027; -.
DR Ensembl; ENSOMYT00000118057; ENSOMYP00000120621; ENSOMYG00000071125.
DR Ensembl; ENSOMYT00000122036; ENSOMYP00000111858; ENSOMYG00000062830.
DR Ensembl; ENSOMYT00000129702; ENSOMYP00000113243; ENSOMYG00000064337.
DR Ensembl; ENSOMYT00000144330; ENSOMYP00000133643; ENSOMYG00000069809.
DR Ensembl; ENSOMYT00000146922; ENSOMYP00000137352; ENSOMYG00000070369.
DR Ensembl; ENSOMYT00000157122; ENSOMYP00000121029; ENSOMYG00000068122.
DR GeneID; 100136727; -.
DR KEGG; omy:100136727; -.
DR OrthoDB; 1699360at2759; -.
DR InterPro; IPR009900; PSGP.
DR Pfam; PF07276; PSGP; 33.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Glycoprotein; Repeat; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT PROPEP 22..120
FT /id="PRO_0000022167"
FT CHAIN 121..536
FT /note="Polysialoglycoprotein"
FT /id="PRO_0000022168"
FT PROPEP 537..542
FT /id="PRO_0000022169"
FT REPEAT 121..133
FT /note="1"
FT REPEAT 134..146
FT /note="2"
FT REPEAT 147..159
FT /note="3"
FT REPEAT 160..172
FT /note="4"
FT REPEAT 173..185
FT /note="5"
FT REPEAT 186..198
FT /note="6"
FT REPEAT 199..211
FT /note="7"
FT REPEAT 212..224
FT /note="8"
FT REPEAT 225..237
FT /note="9"
FT REPEAT 238..250
FT /note="10"
FT REPEAT 251..263
FT /note="11"
FT REPEAT 264..276
FT /note="12"
FT REPEAT 277..289
FT /note="13"
FT REPEAT 290..302
FT /note="14"
FT REPEAT 303..315
FT /note="15"
FT REPEAT 316..328
FT /note="16"
FT REPEAT 329..341
FT /note="17"
FT REPEAT 342..354
FT /note="18"
FT REPEAT 355..367
FT /note="19"
FT REPEAT 368..380
FT /note="20"
FT REPEAT 381..393
FT /note="21"
FT REPEAT 394..406
FT /note="22"
FT REPEAT 407..419
FT /note="23"
FT REPEAT 420..432
FT /note="24"
FT REPEAT 433..445
FT /note="25"
FT REPEAT 446..458
FT /note="26"
FT REPEAT 459..471
FT /note="27"
FT REPEAT 472..484
FT /note="28"
FT REPEAT 485..497
FT /note="29"
FT REPEAT 498..510
FT /note="30"
FT REPEAT 511..523
FT /note="31"
FT REPEAT 524..536
FT /note="32"
FT REGION 70..542
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 121..536
FT /note="32 X 13 AA tandem repeats of D-D-A-T-S-E-A-A-T-G-P-
FT S-G"
FT COMPBIAS 77..95
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..529
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 124
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 125
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 129
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 137
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 138
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 142
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 150
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 151
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 155
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 163
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 164
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 168
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 176
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 177
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 181
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 189
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 190
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 194
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 202
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 203
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 207
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 215
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 216
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 220
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 228
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 229
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 233
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 241
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 242
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 246
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 254
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 255
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 259
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 267
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 268
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 272
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 280
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 281
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 285
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 293
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 294
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 298
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 306
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 307
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 311
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 319
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 320
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 324
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 332
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 333
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 337
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 345
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 346
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 350
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 358
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 359
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 363
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 371
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 372
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 376
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 384
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 385
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 389
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 397
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 398
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 402
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 410
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 411
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 415
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 423
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 424
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 428
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 436
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 437
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 441
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 449
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 450
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 454
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 462
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 463
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 467
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 475
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 476
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 480
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 488
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 489
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 493
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 501
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 502
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 506
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 514
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 515
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 519
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 527
FT /note="O-linked (GalNAc...) threonine"
FT CARBOHYD 528
FT /note="O-linked (GalNAc...) serine"
FT CARBOHYD 532
FT /note="O-linked (GalNAc...) threonine"
SQ SEQUENCE 542 AA; 50483 MW; 083510D3F21310E0 CRC64;
MIMGGVRELL LVVMTVGVVK VSCYPVGKSQ KQDQVSLQRR LGELSSNDVS IVHALALLRS
IGSDAKQARE EYLETNEVES QASPNHGSSP ANDALSSEEK LRRVSSDDAA TSEAATGPSG
DDATSEAATG PSGDDATSEA ATGPSGDDAT SEAATGPSGD DATSEAATGP SGDDATSEAA
TGPSGDDATS EAATGPSGDD ATSEAATGPS GDDATSEAAT GPSGDDATSE AATGPSGDDA
TSEAATGPSG DDATSEAATG PSGDDATSEA ATGPSGDDAT SEAATGPSGD DATSEAATGP
SGDDATSEAA TGPSGDDATS EAATGPSGDD ATSEAATGPS GDDATSEAAT GPSGDDATSE
AATGPSGDDA TSEAATGPSG DDATSEAATG PSGDDATSEA ATGPSGDDAT SEAATGPSGD
DATSEAATGP SGDDATSEAA TGPSGDDATS EAATGPSGDD ATSEAATGPS GDDATSEAAT
GPSGDDATSE AATGPSGDDA TSEAATGPSG DDATSEAATG PSGDDATSEA ATGPSGDDAM
DI
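
The repeat layout annotated above (32 tandem copies of a 13-residue unit spanning 121..536, with three O-linked sites per copy) can be checked with a little arithmetic. The following is a minimal sketch, not part of the UniProt record: the constants are copied from the FT lines of this entry, while the function names and the 0-based offsets are assumptions introduced purely for illustration.

```python
# Values below are copied from the FT lines of this entry; the function
# names are hypothetical and exist only for this illustration.
MOTIF = "DDATSEAATGPSG"      # 13-residue repeat unit (REGION 121..536)
FIRST_REPEAT_START = 121     # start of REPEAT 1
REPEAT_COUNT = 32            # "32 X 13 AA tandem repeats"


def repeat_ranges(start=FIRST_REPEAT_START, count=REPEAT_COUNT, unit=len(MOTIF)):
    """Return 1-based inclusive (start, end) coordinates of each repeat."""
    return [(start + i * unit, start + (i + 1) * unit - 1) for i in range(count)]


def glyco_sites(ranges, motif=MOTIF, offsets=(3, 4, 8)):
    """Return (position, residue) pairs for the annotated O-linked sites.

    The offsets are 0-based indices into the motif (Thr, Ser, Thr); they
    are inferred from CARBOHYD 124, 125 and 129 in the first repeat.
    """
    return [(start + off, motif[off]) for start, _ in ranges for off in offsets]


ranges = repeat_ranges()
sites = glyco_sites(ranges)

print(ranges[0], ranges[-1])   # first and last REPEAT coordinates
print(len(sites), sites[:3])   # number of sites and the first three
```

Under these assumptions the first and last ranges should coincide with REPEAT 1 (121..133) and REPEAT 32 (524..536), and the 96 generated positions should line up with the CARBOHYD features, the last being threonine 532.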