SWP2_ENCIN
ID SWP2_ENCIN Reviewed; 1002 AA.
AC Q95WA4;
DT 16-JUN-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-DEC-2001, sequence version 1.
DT 25-MAY-2022, entry version 33.
DE RecName: Full=Spore wall protein 2;
DE Flags: Precursor;
GN Name=SWP2;
OS Encephalitozoon intestinalis (Microsporidian parasite).
OC Eukaryota; Fungi; Fungi incertae sedis; Microsporidia; Unikaryonidae;
OC Encephalitozoon.
OX NCBI_TaxID=58839;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], IDENTIFICATION AS AN ANTIGEN,
RP SUBCELLULAR LOCATION, SUBUNIT, AND DEVELOPMENTAL STAGE.
RX PubMed=11598081; DOI=10.1128/iai.69.11.7057-7066.2001;
RA Hayman J.R., Hayes S.F., Amon J., Nash T.E.;
RT "Developmental expression of two spore wall proteins during maturation of
RT the microsporidian Encephalitozoon intestinalis.";
RL Infect. Immun. 69:7057-7066(2001).
CC -!- FUNCTION: Spore wall component.
CC -!- SUBUNIT: Component of a complex composed of at least SWP1 and SWP2.
CC {ECO:0000269|PubMed:11598081}.
CC -!- SUBCELLULAR LOCATION: Spore, perispore {ECO:0000269|PubMed:11598081}.
CC -!- DEVELOPMENTAL STAGE: Synthesized in the fully form of sporont.
CC {ECO:0000269|PubMed:11598081}.
CC -!- MISCELLANEOUS: SWP2 is a major antigen recognized during host
CC infection.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF355749; AAL27282.1; -; Genomic_DNA.
DR AlphaFoldDB; Q95WA4; -.
DR VEuPathDB; MicrosporidiaDB:Eint_101630; -.
DR GO; GO:0030435; P:sporulation resulting in formation of a cellular spore; IEA:UniProtKB-KW.
PE 1: Evidence at protein level;
KW Glycoprotein; Repeat; Signal; Sporulation.
FT SIGNAL 1..18
FT /evidence="ECO:0000255"
FT CHAIN 19..1002
FT /note="Spore wall protein 2"
FT /id="PRO_5000060239"
FT REPEAT 357..368
FT /note="1"
FT REPEAT 369..380
FT /note="2"
FT REPEAT 381..392
FT /note="3"
FT REPEAT 393..404
FT /note="4"
FT REPEAT 405..416
FT /note="5"
FT REPEAT 417..428
FT /note="6"
FT REPEAT 429..440
FT /note="7"
FT REPEAT 441..452
FT /note="8"
FT REPEAT 453..464
FT /note="9"
FT REPEAT 465..476
FT /note="10"
FT REPEAT 477..488
FT /note="11"
FT REPEAT 489..500
FT /note="12"
FT REPEAT 501..512
FT /note="13"
FT REPEAT 513..524
FT /note="14"
FT REPEAT 525..536
FT /note="15"
FT REPEAT 537..548
FT /note="16"
FT REPEAT 549..560
FT /note="17"
FT REPEAT 561..572
FT /note="18"
FT REPEAT 576..587
FT /note="19"
FT REPEAT 588..599
FT /note="20"
FT REPEAT 600..611
FT /note="21"
FT REPEAT 615..626
FT /note="22"
FT REPEAT 627..638
FT /note="23"
FT REPEAT 639..650
FT /note="24"
FT REPEAT 651..662
FT /note="25; degenerate"
FT REPEAT 663..674
FT /note="26; degenerate"
FT REPEAT 678..689
FT /note="27"
FT REPEAT 693..704
FT /note="28"
FT REPEAT 708..719
FT /note="29"
FT REPEAT 723..734
FT /note="30"
FT REPEAT 735..746
FT /note="31"
FT REPEAT 747..758
FT /note="32"
FT REPEAT 759..770
FT /note="33"
FT REPEAT 774..785
FT /note="34"
FT REPEAT 786..797
FT /note="35"
FT REPEAT 801..812
FT /note="36"
FT REPEAT 816..827
FT /note="37"
FT REPEAT 831..842
FT /note="38"
FT REPEAT 843..854
FT /note="39"
FT REPEAT 855..866
FT /note="40"
FT REPEAT 867..878
FT /note="41"
FT REPEAT 882..893
FT /note="42"
FT REPEAT 894..905
FT /note="43"
FT REPEAT 906..915
FT /note="44"
FT REPEAT 918..929
FT /note="45"
FT REPEAT 930..941
FT /note="46"
FT REPEAT 942..953
FT /note="47"
FT REPEAT 957..969
FT /note="48"
FT REPEAT 972..983
FT /note="49; degenerate"
FT REPEAT 987..998
FT /note="50; degenerate"
FT REGION 349..1002
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 357..998
FT /note="50 X 12 AA tandem repeats of E-E-G-E-D-K-D-N-T-G-E-
FT G"
FT COMPBIAS 354..1002
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 308
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 1002 AA; 107177 MW; 3AE8E3BD8531D73D CRC64;
MIKLSLLLSL ASFTAVLANQ RPRCQRCPVS SSKYFQQNNL LESRFQNEVQ RLCARRVREE
SSSESSSSSS SEDCSRRRRR PHREWEDSCS SSYSSCSSTD SCSSSAPCPP PVAQRCDIEL
KTPIILMGER IYEFLKNYED QYKKAVLLFL TNILSQISGF NPVFPGGDYD ALIEQLKTLG
VTVPANTAAE LAAIDAAESS ALTRAIQANA QKVISDLLTR VSAMCYLDIM SLVNSGLLAS
QVSSVFNNIQ PIITIAGNDL FAKQMAVFQK LSKTLISTAV TNALQGNRAK FTRFYTTQTS
NLQTSVQNSS KTLTSELKKL ATDTETAFTA FANAEISTPV RRIFRRSITS SGFEDAEEGE
DKDNTGEGEE GEDKDNTGEG EEGEDKDNTG EGEEGEDKDN TGEGEEGEDK DNTGEGEEGE
DKDNTGEGEE GEDKDNTGEG EEGEDKDNTG EGEEGEDKDN TGEGEEGEDK DNTGEGEEGE
DKDNTGEGEE GEDKDNTGEG EEGEDKDNTG EGEEGEDKDN TGEGEEGEDK DNTGEGEEGE
DKDNTGEGEE GEDKDNTGEG EEGEDKDNTG EGEEGEEGED KDNTGEGEEG EDKDNTGEGE
EGEDKDNTGE GEEGEEGEDK DNTGEGEEGE DKDNTGEGEE GEDKDNTGEG EEGEDKDNTG
DAEEGEEGED KDSTGEGEEG EDKDNTGEGE EGEEGEDKDN TGEGEEGEEG EDKDNTGEGE
EGEEGEDKDN TGEGEEGEDK DNTGEGEEGE DKDNTGEGDE GEDKDNTGEG EEGEEGEDKD
NTGEGEEGED KDNTGEGEEG EEGEDKDNTG EGEEGEEGED KDNTGEGEEG EEGEDKDNTG
EGEEGEDKDN TGEGEEGEDK DNTGEGEEGE DKDNTGEGEE GEEGEDKDNT GEGEEGEDKD
NTGEGEEGED KDNTGEGEEG EDKDNTGEGE EGEDKDNTGE GEEGEDKDNT GEGEEGEEGE
DKDNTGDAEE GEEGEDKDNT GDAEEGEEGE DKDNTEEGEE TT