EGG2_SCHMA
ID EGG2_SCHMA Reviewed; 177 AA.
AC P12796; Q26577;
DT 01-OCT-1989, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-1989, sequence version 1.
DT 03-AUG-2022, entry version 57.
DE RecName: Full=Eggshell protein;
DE AltName: Full=Chorion protein;
DE Flags: Precursor;
OS Schistosoma mansoni (Blood fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6183;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=2850476; DOI=10.1128/mcb.8.8.3008-3016.1988;
RA Bobek L.A., Rekosh D.M., LoVerde P.T.;
RT "Small gene family encoding an eggshell (chorion) protein of the human
RT parasite Schistosoma mansoni.";
RL Mol. Cell. Biol. 8:3008-3016(1988).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA].
RX PubMed=3461449; DOI=10.1073/pnas.83.15.5544;
RA Bobek L., Rekosh D.M., van Keulen H., LoVerde P.T.;
RT "Characterization of a female-specific cDNA derived from a developmentally
RT regulated mRNA in the human blood fluke Schistosoma mansoni.";
RL Proc. Natl. Acad. Sci. U.S.A. 83:5544-5548(1986).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M21607; AAA29862.1; -; Genomic_DNA.
DR EMBL; M14309; AAA74695.1; -; mRNA.
DR PIR; A31204; A31204.
DR AlphaFoldDB; P12796; -.
DR EnsemblMetazoa; Smp_316140.1; Smp_316140.1; Smp_316140.
DR EnsemblMetazoa; Smp_316150.1; Smp_316150.1; Smp_316150.
DR WBParaSite; Smp_316140.1; Smp_316140.1; Smp_316140.
DR WBParaSite; Smp_316150.1; Smp_316150.1; Smp_316150.
DR Proteomes; UP000008854; Unassembled WGS sequence.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat; Signal.
FT SIGNAL 1..18
FT CHAIN 19..177
FT /note="Eggshell protein"
FT /id="PRO_0000021153"
FT REPEAT 25..41
FT /note="1"
FT REPEAT 42..59
FT /note="2"
FT REPEAT 60..75
FT /note="3"
FT REPEAT 76..91
FT /note="4"
FT REPEAT 92..112
FT /note="5"
FT REGION 25..112
FT /note="5 X approximate tandem repeats"
FT REGION 149..177
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 19
FT /note="H -> Y (in Ref. 2; AAA74695)"
FT /evidence="ECO:0000305"
FT CONFLICT 54
FT /note="G -> S (in Ref. 2; AAA74695)"
FT /evidence="ECO:0000305"
FT CONFLICT 117
FT /note="F -> L (in Ref. 2; AAA74695)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 177 AA; 16310 MW; 11B2577636097308 CRC64;
MKQSLTLVFL VAIGYATAHT TSHDYSGGYG GGCYGSDCDS GYGDSGYGGG CTGGDCGGGY
GGGYGGGCSG GDCGNYGGGY GGDCNGGDCG NYGGGYGGGN GGGCSGGNCG GGFDEAFPAP
YGGDYGNGGN GFGKGGSKGN NYGKGYGGGS GKGKGGGKGG KGGKGGTYKP SHYGGGY