ID YB11B_YEAST Reviewed; 1755 AA.
AC Q12490; D6VPZ6;
DT 06-MAR-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1996, sequence version 1.
DT 03-AUG-2022, entry version 155.
DE RecName: Full=Transposon Ty1-BL Gag-Pol polyprotein;
DE AltName: Full=Gag-Pol-p199;
DE AltName: Full=TY1A-TY1B;
DE AltName: Full=Transposon Ty1 TYA-TYB polyprotein;
DE AltName: Full=p190;
DE Contains:
DE RecName: Full=Capsid protein;
DE Short=CA;
DE AltName: Full=Gag-p45;
DE AltName: Full=p54;
DE Contains:
DE RecName: Full=Ty1 protease;
DE Short=PR;
DE EC=3.4.23.-;
DE AltName: Full=Pol-p20;
DE AltName: Full=p23;
DE Contains:
DE RecName: Full=Integrase;
DE Short=IN;
DE AltName: Full=Pol-p71;
DE AltName: Full=p84;
DE AltName: Full=p90;
DE Contains:
DE RecName: Full=Reverse transcriptase/ribonuclease H;
DE Short=RT;
DE Short=RT-RH;
DE EC=2.7.7.49;
DE EC=2.7.7.7;
DE EC=3.1.26.4;
DE AltName: Full=Pol-p63;
DE AltName: Full=p60;
GN Name=TY1B-BL; Synonyms=YBLWTy1-1 POL; OrderedLocusNames=YBL005W-B;
GN ORFNames=YBL004W-A, YBL0325;
OS Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.
OX NCBI_TaxID=559292;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 204508 / S288c;
RX PubMed=7813418; DOI=10.1002/j.1460-2075.1994.tb06923.x;
RA Feldmann H., Aigle M., Aljinovic G., Andre B., Baclet M.C., Barthe C.,
RA Baur A., Becam A.-M., Biteau N., Boles E., Brandt T., Brendel M.,
RA Brueckner M., Bussereau F., Christiansen C., Contreras R., Crouzet M.,
RA Cziepluch C., Demolis N., Delaveau T., Doignon F., Domdey H.,
RA Duesterhus S., Dubois E., Dujon B., El Bakkoury M., Entian K.-D.,
RA Feuermann M., Fiers W., Fobo G.M., Fritz C., Gassenhuber J., Glansdorff N.,
RA Goffeau A., Grivell L.A., de Haan M., Hein C., Herbert C.J.,
RA Hollenberg C.P., Holmstroem K., Jacq C., Jacquet M., Jauniaux J.-C.,
RA Jonniaux J.-L., Kallesoee T., Kiesau P., Kirchrath L., Koetter P.,
RA Korol S., Liebl S., Logghe M., Lohan A.J.E., Louis E.J., Li Z.Y.,
RA Maat M.J., Mallet L., Mannhaupt G., Messenguy F., Miosga T., Molemans F.,
RA Mueller S., Nasr F., Obermaier B., Perea J., Pierard A., Piravandi E.,
RA Pohl F.M., Pohl T.M., Potier S., Proft M., Purnelle B., Ramezani Rad M.,
RA Rieger M., Rose M., Schaaff-Gerstenschlaeger I., Scherens B.,
RA Schwarzlose C., Skala J., Slonimski P.P., Smits P.H.M., Souciet J.-L.,
RA Steensma H.Y., Stucka R., Urrestarazu L.A., van der Aart Q.J.M.,
RA Van Dyck L., Vassarotti A., Vetter I., Vierendeels F., Vissers S.,
RA Wagner G., de Wergifosse P., Wolfe K.H., Zagulski M., Zimmermann F.K.,
RA Mewes H.-W., Kleine K.;
RT "Complete DNA sequence of yeast chromosome II.";
RL EMBO J. 13:5795-5809(1994).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=ATCC 204508 / S288c;
RX PubMed=24374639; DOI=10.1534/g3.113.008995;
RA Engel S.R., Dietrich F.S., Fisk D.G., Binkley G., Balakrishnan R.,
RA Costanzo M.C., Dwight S.S., Hitz B.C., Karra K., Nash R.S., Weng S.,
RA Wong E.D., Lloyd P., Skrzypek M.S., Miyasato S.R., Simison M., Cherry J.M.;
RT "The reference genome sequence of Saccharomyces cerevisiae: Then and now.";
RL G3 (Bethesda) 4:389-398(2014).
RN [3]
RP NOMENCLATURE.
RX PubMed=9582191; DOI=10.1101/gr.8.5.464;
RA Kim J.M., Vanguri S., Boeke J.D., Gabriel A., Voytas D.F.;
RT "Transposable elements and genome organization: a comprehensive survey of
RT retrotransposons revealed by the complete Saccharomyces cerevisiae genome
RT sequence.";
RL Genome Res. 8:464-478(1998).
RN [4]
RP REVIEW.
RX PubMed=16093660; DOI=10.1159/000084940;
RA Lesage P., Todeschini A.L.;
RT "Happy together: the life and times of Ty retrotransposons and their
RT hosts.";
RL Cytogenet. Genome Res. 110:70-90(2005).
RN [5]
RP REVIEW, AND DOMAINS.
RX PubMed=16093680; DOI=10.1159/000084960;
RA Wilhelm F.-X., Wilhelm M., Gabriel A.;
RT "Reverse transcriptase and integrase of the Saccharomyces cerevisiae Ty1
RT element.";
RL Cytogenet. Genome Res. 110:269-287(2005).
RN [6]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC STRAIN=ADR376;
RX PubMed=17330950; DOI=10.1021/pr060559j;
RA Li X., Gerber S.A., Rudner A.D., Beausoleil S.A., Haas W., Villen J.,
RA Elias J.E., Gygi S.P.;
RT "Large-scale phosphorylation analysis of alpha-factor-arrested
RT Saccharomyces cerevisiae.";
RL J. Proteome Res. 6:1190-1197(2007).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=18407956; DOI=10.1074/mcp.m700468-mcp200;
RA Albuquerque C.P., Smolka M.B., Payne S.H., Bafna V., Eng J., Zhou H.;
RT "A multidimensional chromatography technology for in-depth phosphoproteome
RT analysis.";
RL Mol. Cell. Proteomics 7:1389-1396(2008).
RN [8]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19779198; DOI=10.1126/science.1172867;
RA Holt L.J., Tuch B.B., Villen J., Johnson A.D., Gygi S.P., Morgan D.O.;
RT "Global analysis of Cdk1 substrate phosphorylation sites provides insights
RT into evolution.";
RL Science 325:1682-1686(2009).
CC -!- FUNCTION: Capsid protein (CA) is the structural component of the virus-
CC like particle (VLP), forming the shell that encapsulates the
CC retrotransposon's dimeric RNA genome. The particles are assembled from
CC trimer-clustered units and there are holes in the capsid shells that
CC allow for the diffusion of macromolecules. CA also has nucleocapsid-
CC like chaperone activity, promoting primer tRNA(i)-Met annealing to the
CC multipartite primer-binding site (PBS), dimerization of Ty1 RNA and
CC initiation of reverse transcription (By similarity). {ECO:0000250}.
CC -!- FUNCTION: The aspartyl protease (PR) mediates the proteolytic cleavages
CC of the Gag and Gag-Pol polyproteins after assembly of the VLP.
CC {ECO:0000250}.
CC -!- FUNCTION: Reverse transcriptase/ribonuclease H (RT) is a
CC multifunctional enzyme that catalyzes the conversion of the retro-
CC element's RNA genome into dsDNA within the VLP. The enzyme displays a
CC DNA polymerase activity that can copy either DNA or RNA templates, and
CC a ribonuclease H (RNase H) activity that cleaves the RNA strand of RNA-
CC DNA heteroduplexes during plus-strand synthesis and hydrolyzes RNA
CC primers. The conversion leads to a linear dsDNA copy of the
CC retrotransposon that includes long terminal repeats (LTRs) at both ends
CC (By similarity). {ECO:0000250}.
CC -!- FUNCTION: Integrase (IN) targets the VLP to the nucleus, where a
CC subparticle preintegration complex (PIC) containing at least integrase
CC and the newly synthesized dsDNA copy of the retrotransposon must
CC transit the nuclear membrane. Once in the nucleus, integrase performs
CC the integration of the dsDNA into the host genome (By similarity).
CC {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.49;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.7;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endonucleolytic cleavage to 5'-phosphomonoester.; EC=3.1.26.4;
CC -!- SUBUNIT: The capsid protein forms a homotrimer, from which the VLPs are
CC assembled. The protease is a homodimer, whose active site consists of
CC two apposed aspartic acid residues (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm. Nucleus {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Comment=The Gag-Pol polyprotein is generated by a +1 ribosomal
CC frameshift. The ratio of Gag:Gag-Pol varies between 20:1 and 5:1 (By
CC similarity). {ECO:0000250};
CC Name=Transposon Ty1-BL Gag-Pol polyprotein;
CC IsoId=Q12490-1; Sequence=Displayed;
CC Name=Transposon Ty1-BL Gag polyprotein;
CC IsoId=Q12266-1; Sequence=External;
CC -!- DOMAIN: The C-terminal RNA-binding region of CA is sufficient for all
CC its nucleocapsid-like chaperone activities. {ECO:0000250}.
CC -!- DOMAIN: Integrase core domain contains the D-x(n)-D-x(35)-E motif,
CC named for the phylogenetically conserved glutamic acid and aspartic
CC acid residues and the invariant 35 amino acid spacing between the
CC second and third acidic residues. Each acidic residue of the D,D(35)E
CC motif is independently essential for the 3'-processing and strand
CC transfer activities of purified integrase protein (By similarity).
CC {ECO:0000250}.
CC -!- PTM: Initially, virus-like particles (VLPs) are composed of the
CC structural unprocessed proteins Gag and Gag-Pol, and also contain the
CC host initiator methionine tRNA (tRNA(i)-Met) which serves as a primer
CC for minus-strand DNA synthesis, and a dimer of genomic Ty RNA.
CC Processing of the polyproteins occurs within the particle and proceeds
CC by an ordered pathway, called maturation. First, the protease (PR) is
CC released by autocatalytic cleavage of the Gag-Pol polyprotein yielding
CC capsid protein p45 and a Pol-p154 precursor protein. This cleavage is a
CC prerequisite for subsequent processing of Pol-p154 at the remaining
CC sites to release the mature structural and catalytic proteins.
CC Maturation takes place prior to the RT reaction and is required to
CC produce transposition-competent VLPs (By similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: Retrotransposons are mobile genetic entities that are
CC able to replicate via an RNA intermediate and a reverse transcription
CC step. In contrast to retroviruses, retrotransposons are non-infectious,
CC lack an envelope and remain intracellular. Ty1 retrotransposons belong
CC to the copia elements (Pseudoviridae).
CC -!- MISCELLANEOUS: [Isoform Transposon Ty1-BL Gag-Pol polyprotein]:
CC Produced by +1 ribosomal frameshifting between codons Leu-435 and Gly-
CC 436 of the YBL005W-A ORF.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z35765; CAA84820.1; -; Genomic_DNA.
DR EMBL; Z35766; CAA84824.1; -; Genomic_DNA.
DR EMBL; BK006936; DAA07116.1; -; Genomic_DNA.
DR PIR; S40969; S40969.
DR PIR; S45736; S45736.
DR RefSeq; NP_009549.1; NM_001180048.2. [Q12490-1]
DR AlphaFoldDB; Q12490; -.
DR BioGRID; 32696; 10.
DR IntAct; Q12490; 3.
DR MINT; Q12490; -.
DR STRING; 4932.YBL005W-B; -.
DR iPTMnet; Q12490; -.
DR MaxQB; Q12490; -.
DR PaxDb; Q12490; -.
DR PRIDE; Q12490; -.
DR GeneID; 852280; -.
DR KEGG; sce:YBL005W-B; -.
DR SGD; S000002147; YBL005W-B.
DR VEuPathDB; FungiDB:YBL005W-B; -.
DR eggNOG; KOG0017; Eukaryota.
DR HOGENOM; CLU_244151_0_0_1; -.
DR InParanoid; Q12490; -.
DR OMA; FHGSACA; -.
DR ChiTaRS; YBL005W-B; yeast.
DR Proteomes; UP000002311; Chromosome II.
DR RNAct; Q12490; protein.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IDA:SGD.
DR GO; GO:0000943; C:retrotransposon nucleocapsid; ISS:SGD.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; ISS:SGD.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0008233; F:peptidase activity; ISS:SGD.
DR GO; GO:0004540; F:ribonuclease activity; ISS:SGD.
DR GO; GO:0003723; F:RNA binding; ISS:SGD.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; ISS:SGD.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:UniProtKB-EC.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0032197; P:transposition, RNA-mediated; ISS:SGD.
DR Gene3D; 3.30.420.10; -; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR015820; TYA.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR Pfam; PF01021; TYA; 1.
DR SUPFAM; SSF53098; SSF53098; 1.
DR SUPFAM; SSF56672; SSF56672; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 1: Evidence at protein level;
KW Aspartyl protease; ATP-binding; Cytoplasm; DNA integration;
KW DNA recombination; DNA-binding; DNA-directed DNA polymerase; Endonuclease;
KW Hydrolase; Magnesium; Metal-binding; Multifunctional enzyme; Nuclease;
KW Nucleotide-binding; Nucleotidyltransferase; Nucleus; Protease;
KW Reference proteome; Ribosomal frameshifting; RNA-binding;
KW RNA-directed DNA polymerase; Transferase; Transposable element;
KW Transposition; Viral release from host cell; Virion maturation; Zinc;
KW Zinc-finger.
FT CHAIN 1..1755
FT /note="Transposon Ty1-BL Gag-Pol polyprotein"
FT /id="PRO_0000278985"
FT CHAIN 1..401
FT /note="Capsid protein"
FT /evidence="ECO:0000250"
FT /id="PRO_0000278986"
FT CHAIN 402..582
FT /note="Ty1 protease"
FT /evidence="ECO:0000250"
FT /id="PRO_0000278987"
FT CHAIN 583..1217
FT /note="Integrase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000278988"
FT CHAIN 1218..1755
FT /note="Reverse transcriptase/ribonuclease H"
FT /evidence="ECO:0000250"
FT /id="PRO_0000278989"
FT DOMAIN 660..835
FT /note="Integrase catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT DOMAIN 1338..1476
FT /note="Reverse transcriptase Ty1/copia-type"
FT DOMAIN 1610..1752
FT /note="RNase H Ty1/copia-type"
FT REGION 20..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 137..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 299..401
FT /note="RNA-binding"
FT /evidence="ECO:0000250"
FT REGION 350..420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 583..640
FT /note="Integrase-type zinc finger-like"
FT REGION 956..1172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 1178..1212
FT /note="Bipartite nuclear localization signal"
FT /evidence="ECO:0000250"
FT COMPBIAS 20..67
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..375
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 376..420
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 969..983
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 993..1015
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1042..1056
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1057..1080
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1094..1111
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1149..1172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 461
FT /note="For protease activity; shared with dimeric partner"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10094"
FT BINDING 671
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 736
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1346
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1427
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1428
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1610
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1652
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1685
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT SITE 401..402
FT /note="Cleavage; by Ty1 protease"
FT /evidence="ECO:0000250"
FT SITE 582..583
FT /note="Cleavage; by Ty1 protease"
FT /evidence="ECO:0000250"
FT SITE 1217..1218
FT /note="Cleavage; by Ty1 protease"
FT /evidence="ECO:0000250"
SQ SEQUENCE 1755 AA; 198969 MW; FECE570875A82B6F CRC64;
MESQQLSQHS PIFHGSACAS VTSKEVQTTQ DPLDISASKT EECEKVSTQA NSQQPTTPPS
SAVPENHHHA SPQAAQVPLP QNGPYPQQRM MNTQQANISG WPVYGHPSLM PYPPYQMSPM
YAPPGAQSQF TQYPQYVGTH LNTPSPESGN SFPDSSSAKS NMTSTNQHVR PPPILTSPND
FLNWVKIYIK FLQNSNLGDI IPTATRKAVR QMTDDELTFL CHTFQLFAPS QFLPPWVKDI
LSVDYTDIMK ILSKSINKMQ SDTQEVNDIT TLATLHYNGS TPADAFEAEV TNILDRLNNN
GIPINNKVAC QFIMRGLSGE YKFLPYARHR CIHMTVADLF SDIHSMYEEQ QESKRNKSTY
RRSPSDEKKD SRTYTNTTKP KSITRNSQKP NNSQSRTARA HNVSTFNNSP GPDNDLIRGS
TTEPIQLKNT HDLHLGQELT ESTVNHTNHS DDELPGHLLL DSGASRTLIR SAHHIHSASS
NPDINVVDAQ KRNIPINAIG DLQFHFQDNT KTSIKVLHTP NIAYDLLSLN ELAAVDITAC
FTKNVLERSD GTVLAPIVKY GDFYWVSKKY LLPSNISVPT INNVHTSEST RKYPYPFIHR
MLAHANAQTI RYSLKNNTIT YFNESDVDWS SAIDYQCPDC LIGKSTKHRH IKGSRLKYQN
SYEPFQYLHT DIFGPVHNLP KSAPSYFISF TDETTKFRWV YPLHDRREDS ILDVFTTILA
FIKNQFQASV LVIQMDRGSE YTNRTLHKFL EKNGITPCYT TTADSRAHGV AERLNRTLLD
DCRTQLQCSG LPNHLWFSAI EFSTIVRNSL ASPKSKKSAR QHAGLAGLDI STLLPFGQPV
IVNDHNPNSK IHPRGIPGYA LHPSRNSYGY IIYLPSLKKT VDTTNYVILQ GKESRLDQFN
YDALTFDEDL NRLTASYQSF IASNEIQQSN DLNIESDHDF QSDIELYPEQ PRNVLSKAVS
PTDSTPPSTH TEDSKRVSKT NIRAPREVDP NISESNILPS KKRSSTPQIS DIESTDSGGM
HRLDVPLLAP MSQSNTHESS YASKSKDFRH SDSYSDNETN HTNVPISSTG GTNNKTVPQT
SEQETEKRII HRSPSIDTSS SESNSLHHVV PIKTSDTCPK ENTEESIIAD LPLPDLPPEP
PTELSDSFKE LPPINSRQTN SSLGGIGDSN AYTTINSKKR SLEDNETEIK VSRDTWNTKN
MRSLEPPRSK KRIHLIAAVK AVKSIKPIRT TLRYDEAITY NKDIKEKEKY IEAYHKEVNQ
LLKMKTWDTD KYYDRKEIDP KRVINSMFIF NRKRDGTHKA RFVARGDIQH PDTYDSGMQS
NTVHHYALMT SLSLALDNNY HITQLDISSA YLYADIKEEL YIRPPPHLGM NDKLIRLKKS
LYGLKQSGAN WYETIKSYLI KQCGMEEVRG WSCVFKNSQV TICLFVDDMV LFSKNLNSNK
RIIDKLKMQY DTKIINLGES DEEIQYDILG LEIKYQRGKY MKLGMENSLT EKIPKLNVPL
NPKGRKLSAP GQPGLYIDQQ ELELEEDDYK MKVHEMQKLI GLASYVGYKF RFDLLYYINT
LAQHILFPSK QVLDMTYELI QFIWNTRDKQ LIWHKSKPVK PTNKLVVISD ASYGNQPYYK
SQIGNIYLLN GKVIGGKSTK ASLTCTSTTE AEIHAISESV PLLNNLSYLI QELDKKPITK
GLLTDSKSTI SIIISNNEEK FRNRFFGTKA MRLRDEVSGN HLHVCYIETK KNIADVMTKP
LPIKTFKLLT NKWIH