POLN_MIDDV
ID POLN_MIDDV Reviewed; 995 AA.
AC P03318;
DT 21-JUL-1986, integrated into UniProtKB/Swiss-Prot.
DT 10-APR-2019, sequence version 2.
DT 03-AUG-2022, entry version 107.
DE RecName: Full=Polyprotein nsP1234;
DE Short=P1234;
DE AltName: Full=Non-structural polyprotein;
DE Contains:
DE RecName: Full=Polyprotein P123';
DE Short=P123';
DE Contains:
DE RecName: Full=Polyprotein P123;
DE Short=P123;
DE Contains:
DE RecName: Full=Non-structural protein 3';
DE Short=nsP3';
DE EC=3.1.3.84 {ECO:0000305};
DE Contains:
DE RecName: Full=Non-structural protein 3;
DE Short=nsP3;
DE EC=3.1.3.84 {ECO:0000250|UniProtKB:Q8JUX6};
DE Contains:
DE RecName: Full=RNA-directed RNA polymerase nsP4;
DE EC=2.7.7.19 {ECO:0000250|UniProtKB:P03317};
DE EC=2.7.7.48 {ECO:0000255|PROSITE-ProRule:PRU00539};
DE AltName: Full=Non-structural protein 4;
DE Short=nsP4;
DE Flags: Fragment;
OS Middelburg virus.
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
OC Martellivirales; Togaviridae; Alphavirus.
OX NCBI_TaxID=11023;
OH NCBI_TaxID=7158; Aedes.
OH NCBI_TaxID=149459; Mansonia.
OH NCBI_TaxID=9940; Ovis aries (Sheep).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=6577423; DOI=10.1073/pnas.80.17.5271;
RA Strauss E.G., Rice C.M., Strauss J.H.;
RT "Sequence coding for the alphavirus nonstructural proteins is interrupted
RT by an opal termination codon.";
RL Proc. Natl. Acad. Sci. U.S.A. 80:5271-5275(1983).
CC -!- FUNCTION: Polyprotein P1234: Inactive precursor of the viral replicase,
CC which is activated by cleavages carried out by the viral protease nsP2.
CC {ECO:0000250|UniProtKB:Q8JUX6}.
CC -!- FUNCTION: [Polyprotein P123]: The early replication complex formed by
CC the polyprotein P123 and nsP4 synthesizes minus-strand RNAs (By
CC similarity). As soon P123 is cleaved into mature proteins, the plus-
CC strand RNAs synthesis begins (By similarity).
CC {ECO:0000250|UniProtKB:P03317}.
CC -!- FUNCTION: [Polyprotein P123']: The early replication complex formed by
CC the polyprotein P123' and nsP4 synthesizes minus-strand RNAs
CC (Probable). Polyprotein P123' is a short-lived polyprotein that
CC accumulates during early stage of infection (Probable). As soon P123'
CC is cleaved into mature proteins, the plus-strand RNAs synthesis begins
CC (Probable). {ECO:0000305}.
CC -!- FUNCTION: [Non-structural protein 3']: Seems to be essential for minus-
CC strand RNAs and subgenomic 26S mRNAs synthesis (By similarity).
CC Displays mono-ADP-ribosylhydrolase activity (Probable). ADP-
CC ribosylation is a post-translational modification that controls various
CC processes of the host cell and the virus probably needs to revert it
CC for optimal viral replication (Probable). Binds proteins of FXR family
CC and sequesters them into the viral RNA replication complexes thereby
CC inhibiting the formation of host stress granules on viral mRNAs
CC (Probable). The nsp3'-FXR complexes bind viral RNAs and probably
CC orchestrate the assembly of viral replication complexes, thanks to the
CC ability of FXR family members to self-assemble and bind DNA (Probable).
CC {ECO:0000250|UniProtKB:P03317, ECO:0000305}.
CC -!- FUNCTION: [Non-structural protein 3]: Seems to be essential for minus-
CC strand RNAs and subgenomic 26S mRNAs synthesis (By similarity).
CC Displays mono-ADP-ribosylhydrolase activity (By similarity). ADP-
CC ribosylation is a post-translantional modification that controls
CC various processes of the host cell and the virus probably needs to
CC revert it for optimal viral replication (By similarity). Binds proteins
CC of G3BP family and sequesters them into the viral RNA replication
CC complexes thereby inhibiting the formation of host stress granules on
CC viral mRNAs (By similarity). The nsp3-G3BP complexes bind viral RNAs
CC and probably orchestrate the assembly of viral replication complexes,
CC thanks to the ability of G3BP family members to self-assemble and bind
CC DNA (By similarity). {ECO:0000250|UniProtKB:P03317,
CC ECO:0000250|UniProtKB:Q8JUX6}.
CC -!- FUNCTION: [RNA-directed RNA polymerase nsP4]: RNA dependent RNA
CC polymerase (By similarity). Replicates genomic and antigenomic RNA by
CC recognizing replications specific signals. The early replication
CC complex formed by the polyprotein P123 and nsP4 synthesizes minus-
CC strand RNAs (By similarity). The late replication complex composed of
CC fully processed nsP1-nsP4 is responsible for the production of genomic
CC and subgenomic plus-strand RNAs (By similarity). The core catalytic
CC domain of nsP4 also possesses terminal adenylyltransferase (TATase)
CC activity that is probably involved in maintenance and repair of the
CC poly(A) tail, an element required for replication of the viral genome
CC (By similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate +
CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA-
CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395;
CC EC=2.7.7.48; Evidence={ECO:0000255|PROSITE-ProRule:PRU00539};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + RNA(n) = diphosphate + RNA(n)-3'-adenine ribonucleotide;
CC Xref=Rhea:RHEA:11332, Rhea:RHEA-COMP:14527, Rhea:RHEA-COMP:17347,
CC ChEBI:CHEBI:30616, ChEBI:CHEBI:33019, ChEBI:CHEBI:140395,
CC ChEBI:CHEBI:173115; EC=2.7.7.19;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=4-O-(ADP-D-ribosyl)-L-aspartyl-[protein] + H2O = ADP-D-ribose
CC + H(+) + L-aspartyl-[protein]; Xref=Rhea:RHEA:54428, Rhea:RHEA-
CC COMP:9867, Rhea:RHEA-COMP:13832, ChEBI:CHEBI:15377,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:29961, ChEBI:CHEBI:57967,
CC ChEBI:CHEBI:138102; Evidence={ECO:0000250|UniProtKB:P03317};
CC PhysiologicalDirection=left-to-right; Xref=Rhea:RHEA:54429;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=5-O-(ADP-D-ribosyl)-L-glutamyl-[protein] + H2O = ADP-D-ribose
CC + H(+) + L-glutamyl-[protein]; Xref=Rhea:RHEA:58248, Rhea:RHEA-
CC COMP:10208, Rhea:RHEA-COMP:15089, ChEBI:CHEBI:15377,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:29973, ChEBI:CHEBI:57967,
CC ChEBI:CHEBI:142540; Evidence={ECO:0000250|UniProtKB:P03317};
CC PhysiologicalDirection=left-to-right; Xref=Rhea:RHEA:58249;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ADP-beta-D-ribose 1''-phosphate + H2O = ADP-D-ribose +
CC phosphate; Xref=Rhea:RHEA:25029, ChEBI:CHEBI:15377,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:57967, ChEBI:CHEBI:58753; EC=3.1.3.84;
CC Evidence={ECO:0000250|UniProtKB:Q8JUX6};
CC PhysiologicalDirection=left-to-right; Xref=Rhea:RHEA:25030;
CC Evidence={ECO:0000250|UniProtKB:Q8JUX6};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC Name=Mn(2+); Xref=ChEBI:CHEBI:29035;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC Note=For nsP4 adenylyltransferase activity; Mn(2+) supports catalysis
CC at 60% of the levels observed with Mg(2+).
CC {ECO:0000250|UniProtKB:P03317};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000250|UniProtKB:P03317};
CC Note=For nsP4 RNA-directed RNA polymerase activity.
CC {ECO:0000250|UniProtKB:P03317};
CC -!- SUBUNIT: [Non-structural protein 3]: Interacts with mRNA-capping enzyme
CC nsP1 (By similarity). Interacts with host DDX1. Interacts with host
CC DDX3. Interacts (via C-terminus) with host G3BP1; this interaction
CC inhibits the formation of host stress granules on viral mRNAs and the
CC nsp3-G3BP1 complexes bind viral RNAs and probably orchestrate the
CC assembly of viral replication complexes (By similarity). Interacts (via
CC C-terminus) with host G3BP2; this interaction inhibits the formation of
CC host stress granules on viral mRNAs and the nsp3-G3BP2 complexes bind
CC viral RNAs and probably orchestrate the assembly of viral replication
CC complexes (By similarity). {ECO:0000250|UniProtKB:P03317,
CC ECO:0000250|UniProtKB:P27282}.
CC -!- SUBUNIT: [RNA-directed RNA polymerase nsP4]: Interacts with mRNA-
CC capping enzyme nsP1 (By similarity). Interacts with protease nsP2 (By
CC similarity). interacts with itself (By similarity).
CC {ECO:0000250|UniProtKB:Q8JUX6}.
CC -!- SUBCELLULAR LOCATION: [Polyprotein nsP1234]: Host cytoplasmic vesicle
CC membrane {ECO:0000305}; Peripheral membrane protein {ECO:0000305}.
CC Note=Part of cytoplasmic vesicles, which are probably formed at the
CC plasma membrane and internalized leading to late endosomal/lysosomal
CC spherules containing the replication complex. {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 3]: Host cytoplasmic
CC vesicle membrane {ECO:0000250|UniProtKB:P03317}; Peripheral membrane
CC protein {ECO:0000305}. Note=In the late phase of infection, the
CC polyprotein is quickly cleaved before localization to cellular
CC membranes. Then nsP3 and nsP3' form aggregates in cytoplasm (By
CC similarity). NsP3 is also part of cytoplasmic vesicles, which are
CC probably formed at the plasma membrane and internalized leading to late
CC endosomal/lysosomal spherules containing the replication complex (By
CC similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 3']: Host cytoplasmic
CC vesicle membrane {ECO:0000250|UniProtKB:P03317}; Peripheral membrane
CC protein {ECO:0000305}. Note=In the late phase of infection, the
CC polyprotein is quickly cleaved before localization to cellular
CC membranes. Then nsP3 and nsP3' form aggregates in cytoplasm (By
CC similarity). NsP3' is also part of cytoplasmic vesicles, which are
CC probably formed at the plasma membrane and internalized leading to late
CC endosomal/lysosomal spherules containing the replication complex (By
CC similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- SUBCELLULAR LOCATION: [RNA-directed RNA polymerase nsP4]: Host
CC cytoplasmic vesicle membrane; Peripheral membrane protein
CC {ECO:0000250|UniProtKB:P03317}. Note=NsP4 is part of cytoplasmic
CC vesicles, which are probably formed at the plasma membrane and
CC internalized leading to late endosomal/lysosomal spherules containing
CC the replication complex. {ECO:0000250|UniProtKB:P08411}.
CC -!- DOMAIN: [Non-structural protein 3]: In the N-terminus, the macro domain
CC displays a mono-ADP-ribosylhydrolase activity (By similarity). The
CC central part has a zinc-binding function (By similarity). The C-
CC terminus contains two FGDF motifs necessary and sufficient for
CC formation of the nsP3/G3BP1 complex (By similarity).
CC {ECO:0000250|UniProtKB:P03317, ECO:0000250|UniProtKB:P08411}.
CC -!- DOMAIN: [Non-structural protein 3']: In the N-terminus, the macro
CC domain displays a mono-ADP-ribosylhydrolase activity (By similarity).
CC The central part has a zinc-binding function (By similarity). The C-
CC terminus contains two FGDF motifs necessary and sufficient for
CC formation of the nsP3'/G3BP1 complex (By similarity).
CC {ECO:0000250|UniProtKB:P03317, ECO:0000250|UniProtKB:P08411}.
CC -!- PTM: Polyprotein P1234: Specific enzymatic cleavages in vivo yield
CC mature proteins (By similarity). The processing of the polyprotein is
CC temporally regulated (By similarity). In early stages (1.7 hpi), P1234
CC is first cleaved in trans through its nsP2 protease activity, releasing
CC P123' and nsP4, which associate to form the early replication complex
CC (By similarity). At the same time, P1234 is also cut at the nsP1/nsP2
CC site early in infection but with lower efficiency (By similarity).
CC After replication of the viral minus-strand RNAs (4 hpi), the
CC polyproteins are cut at the nsP1/nsP2 and nsP2/nsP3 sites very
CC efficiently, preventing accumulation of P123' and P1234 and allowing
CC the formation of the late replication complex (By similarity).
CC NsP3'/nsP4 site is not cleaved anymore and P34 is produced rather than
CC nsP4 (By similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- PTM: [Polyprotein P123]: Specific enzymatic cleavages in vivo yield
CC mature proteins (By similarity). The processing of the polyprotein is
CC temporally regulated (By similarity). In early stages (1.7 hpi), P123
CC is cleaved at the nsP1/nsP2 site with low efficiency (By similarity).
CC After replication of the viral minus-strand RNAs (4 hpi), the
CC polyproteins are cut at the nsP1/nsP2 and nsP2/nsP3 sites very
CC efficiently, preventing accumulation of P123 and allowing the formation
CC of the late replication complex (By similarity).
CC {ECO:0000250|UniProtKB:P03317}.
CC -!- PTM: [Polyprotein P123']: Specific enzymatic cleavages in vivo yield
CC mature proteins (By similarity). The processing of the polyprotein is
CC temporally regulated (By similarity). In early stages (1.7 hpi), P123'
CC is cleaved at the nsP1/nsP2 site with low efficiency (By similarity).
CC After replication of the viral minus-strand RNAs (4 hpi), the
CC polyproteins are cut at the nsP1/nsP2 and nsP2/nsP3 sites very
CC efficiently, preventing accumulation of P123' and allowing the
CC formation of the late replication complex (By similarity).
CC {ECO:0000250|UniProtKB:P03317}.
CC -!- PTM: [Non-structural protein 3]: Phosphorylated by host on serines and
CC threonines. {ECO:0000250|UniProtKB:P08411}.
CC -!- PTM: [Non-structural protein 3']: Phosphorylated by host on serines and
CC threonines. {ECO:0000250|UniProtKB:P08411}.
CC -!- PTM: [RNA-directed RNA polymerase nsP4]: Ubiquitinated; targets the
CC protein for rapid degradation via the ubiquitin system (By similarity).
CC Nsp4 is present in extremely low quantities due to low frequency of
CC translation through the amber stop-codon and the degradation by the
CC ubiquitin pathway (By similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- MISCELLANEOUS: Viral replication produces dsRNA in the late phase of
CC infection, resulting in a strong activation of host EIF2AK2/PKR,
CC leading to almost complete phosphorylation of EIF2A (By similarity).
CC This inactivates completely cellular translation initiation, resulting
CC shutoff of host proteins synthesis (By similarity). However,
CC phosphorylation of EIF2A is probably not the only mechanism responsible
CC for the host translation shutoff (By similarity). The viral translation
CC can still occur normally because it relies on a hairpin structure in
CC the coding region of sgRNA and is EIF2A-, EIF2D-, EIF4G- EIF4A-
CC independent (By similarity). {ECO:0000250|UniProtKB:P03317}.
CC -!- MISCELLANEOUS: The genome codes for P123, but readthrough of a
CC terminator codon UGA occurs between the codons for Ala-378 and Leu-380
CC giving rise to P1234 (Probable). P1234 is cleaved quickly by nsP2 into
CC P123' and nsP4 (By similarity). Further processing of p123' gives nsP1,
CC nsP2 and nsP3' which is 6 amino acids longer than nsP3 since the
CC cleavage site is after the readthrough (By similarity). This unusual
CC molecular mechanism ensures that few nsP4 are produced compared to
CC other non-structural proteins (By similarity). Mutant viruses with no
CC alternative termination site grow significantly slower than wild-type
CC virus (By similarity). The opal termination codon is frequently mutated
CC to a sense codon on passage in cell culture (By similarity). The
CC presence of the opal codon may be a requirement for viral maintenance
CC in both vertebrate and invertebrate hosts and a selective advantage may
CC be conferred in cell culture for the sense codon (By similarity).
CC {ECO:0000250|UniProtKB:O90368, ECO:0000250|UniProtKB:P03317,
CC ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; J02246; AAA96654.1; ALT_TERM; Genomic_RNA.
DR EMBL; J02246; AAA96653.1; -; Genomic_RNA.
DR PIR; A03918; MNWVM.
DR PRIDE; P03318; -.
DR GO; GO:0044162; C:host cell cytoplasmic vesicle membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000166; F:nucleotide binding; IEA:UniProtKB-KW.
DR GO; GO:0004652; F:polynucleotide adenylyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003968; F:RNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0006351; P:transcription, DNA-templated; IEA:InterPro.
DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR Gene3D; 3.40.220.10; -; 1.
DR Gene3D; 3.40.50.150; -; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR InterPro; IPR001788; Tymovirus_RNA-dep_RNA_pol.
DR Pfam; PF00978; RdRP_2; 1.
DR SUPFAM; SSF52949; SSF52949; 1.
DR SUPFAM; SSF56672; SSF56672; 1.
DR PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE 3: Inferred from homology;
KW Host cytoplasmic vesicle; Host membrane; Hydrolase; Membrane;
KW Metal-binding; Nucleotide-binding; Nucleotidyltransferase; RNA-binding;
KW RNA-directed RNA polymerase; Transferase; Ubl conjugation;
KW Viral RNA replication; Zinc.
FT CHAIN <1..995
FT /note="Polyprotein nsP1234"
FT /id="PRO_0000308394"
FT CHAIN <1..385
FT /note="Non-structural protein 3'"
FT /id="PRO_0000041212"
FT CHAIN <1..385
FT /note="Polyprotein P123'"
FT /id="PRO_0000446646"
FT CHAIN <1..378
FT /note="Non-structural protein 3"
FT /id="PRO_0000228780"
FT CHAIN <1..378
FT /note="Polyprotein P123"
FT /id="PRO_0000446647"
FT CHAIN 386..995
FT /note="RNA-directed RNA polymerase nsP4"
FT /id="PRO_0000041213"
FT DOMAIN 749..864
FT /note="RdRp catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00539"
FT MOTIF 347..350
FT /note="FGDF; binding to host G3BP1"
FT /evidence="ECO:0000250|UniProtKB:P08411"
FT MOTIF 364..367
FT /note="FGDF; binding to host G3BP1"
FT /evidence="ECO:0000250|UniProtKB:P08411"
FT BINDING 30
FT /ligand="ADP-D-ribose"
FT /ligand_id="ChEBI:CHEBI:57967"
FT /evidence="ECO:0000250|UniProtKB:P36328"
FT BINDING 32
FT /ligand="ADP-D-ribose"
FT /ligand_id="ChEBI:CHEBI:57967"
FT /evidence="ECO:0000250|UniProtKB:P36328"
FT BINDING 180
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000250|UniProtKB:P03317"
FT BINDING 182
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000250|UniProtKB:P03317"
FT BINDING 205
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000250|UniProtKB:P03317"
FT BINDING 223
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000250|UniProtKB:P03317"
FT SITE 385..386
FT /note="Cleavage; by protease nsP2"
FT /evidence="ECO:0000250|UniProtKB:P03317"
FT NON_TER 1
SQ SEQUENCE 995 AA; 111393 MW; FE6FF003A4D834E7 CRC64;
DADLAAVYRA VASLADETVR TMAIPLLSTG TFAGGKDRVL QSLNHLFTAL DTTDVDVTIY
CRDKSWEKKI QEAIDMRTAT ELLDDDTTVM KELTRVHPDS CLVGRSGFST VDGRLHSYLE
GTRFHQTAVD VAERPTLWPR REEANEQITH YVLGESMEAI RTKCPVDDTD SSAPPCTVPC
LCRYAMTPER VHRLRAAQVK QFTVCSSFPL PKYKIPGVQR VACSAVMLFN HDVPALVSPR
KYREPSISSE SSSSGLSVFD LDIGSDSEYE PMEPVQPEPL IDLAVVEETA PVRLERVAPV
AAPRRARATP FTLEQRVVAP VPAPRTMPVR PPRRKKAATR TPERISFGDL DAECMAIIND
DLTFGDFGAG EFERLTSAXL DRAGAYIFSS DTGPGHLQQR SVRQTRLADC VAEDVHEERV
FAPKCDKEKE RLLLLQMQMA PTEANKSRYQ SRKVENMKAE VIDRLLGGAK LFVTPTTDCR
YVTHKHPKPM YSTSVAFYLS SAKTAVAACN EFLSRNYPTV TSYQITDEYD AYLDMVDGSE
SCLDRAAFCP SKLRSFPKKH SYHRAEIRSA VPSPFQNTLQ NVLAAATKRN CNVTQMRELP
TLDSAVFNVE CFKKYACNND YWDEFAQKPI RLTTENITSY VTRLKGPKAA ALFAKTYDLK
PLQEVPMDRF VVDMKRDVKV TPGTKHTEER PKVQVIQAAE PLATAYLCGI HRELVRRLNA
VLLPNVHTLF DMSAEDFDAI ISEHFRPGDA VLETDIASFD KSQDDSLAYT GLMLLEDLGV
DQPLLELIEA SFGEITSTHL PTGTRFKFGA MMKSGMFLTL FVNTMLNMTI ASRVLEERLT
NSKCAAFIGD DNIVHGVKSD KLLAERCAAW MNMEVKIIDA VMCERPPYFC GGFIVFDQVT
GTCCRVADPL KRLFKLGKPL PAEDKQDEDR RRALADEAQR WNRVGIQADL EAAMNSRYEV
EGIRNVITAL TTLSRNYHNF RHLRGPVIDL YGGPK