SET1_PLAF7
ID SET1_PLAF7 Reviewed; 6753 AA.
AC C6KTD2;
DT 03-NOV-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-SEP-2009, sequence version 1.
DT 25-MAY-2022, entry version 74.
DE RecName: Full=Putative histone-lysine N-methyltransferase 1;
DE Short=PfSET1;
DE EC=2.1.1.-;
GN Name=SET1; ORFNames=PFF1440w;
OS Plasmodium falciparum (isolate 3D7).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=36329;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3D7;
RX PubMed=12368864; DOI=10.1038/nature01097;
RA Gardner M.J., Hall N., Fung E., White O., Berriman M., Hyman R.W.,
RA Carlton J.M., Pain A., Nelson K.E., Bowman S., Paulsen I.T., James K.D.,
RA Eisen J.A., Rutherford K.M., Salzberg S.L., Craig A., Kyes S., Chan M.-S.,
RA Nene V., Shallom S.J., Suh B., Peterson J., Angiuoli S., Pertea M.,
RA Allen J., Selengut J., Haft D., Mather M.W., Vaidya A.B., Martin D.M.A.,
RA Fairlamb A.H., Fraunholz M.J., Roos D.S., Ralph S.A., McFadden G.I.,
RA Cummings L.M., Subramanian G.M., Mungall C., Venter J.C., Carucci D.J.,
RA Hoffman S.L., Newbold C., Davis R.W., Fraser C.M., Barrell B.G.;
RT "Genome sequence of the human malaria parasite Plasmodium falciparum.";
RL Nature 419:498-511(2002).
RN [2] {ECO:0000312|EMBL:CAG25109.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3D7;
RX PubMed=12368867; DOI=10.1038/nature01095;
RA Hall N., Pain A., Berriman M., Churcher C.M., Harris B., Harris D.,
RA Mungall K.L., Bowman S., Atkin R., Baker S., Barron A., Brooks K.,
RA Buckee C.O., Burrows C., Cherevach I., Chillingworth C., Chillingworth T.,
RA Christodoulou Z., Clark L., Clark R., Corton C., Cronin A., Davies R.M.,
RA Davis P., Dear P., Dearden F., Doggett J., Feltwell T., Goble A.,
RA Goodhead I., Gwilliam R., Hamlin N., Hance Z., Harper D., Hauser H.,
RA Hornsby T., Holroyd S., Horrocks P., Humphray S., Jagels K., James K.D.,
RA Johnson D., Kerhornou A., Knights A., Konfortov B., Kyes S., Larke N.,
RA Lawson D., Lennard N., Line A., Maddison M., Mclean J., Mooney P.,
RA Moule S., Murphy L., Oliver K., Ormond D., Price C., Quail M.A.,
RA Rabbinowitsch E., Rajandream M.A., Rutter S., Rutherford K.M., Sanders M.,
RA Simmonds M., Seeger K., Sharp S., Smith R., Squares R., Squares S.,
RA Stevens K., Taylor K., Tivey A., Unwin L., Whitehead S., Woodward J.R.,
RA Sulston J.E., Craig A., Newbold C., Barrell B.G.;
RT "Sequence of Plasmodium falciparum chromosomes 1, 3-9 and 13.";
RL Nature 419:527-531(2002).
RN [3] {ECO:0000305}
RP SYNTHESIS OF 3412-3441, AND POSSIBLE CANDIDATE MALARIA EPITOPE.
RX PubMed=17653272; DOI=10.1371/journal.pone.0000645;
RA Villard V., Agak G.W., Frank G., Jafarshad A., Servis C., Nebie I.,
RA Sirima S.B., Felger I., Arevalo-Herrera M., Herrera S., Heitz F.,
RA Baecker V., Druilhe P., Kajava A.V., Corradin G.;
RT "Rapid identification of malaria vaccine candidates based on alpha-helical
RT coiled coil protein motif.";
RL PLoS ONE 2:E645-E645(2007).
RN [4]
RP DEVELOPMENTAL STAGE.
RX PubMed=18299133; DOI=10.1016/j.ijpara.2008.01.002;
RA Cui L., Fan Q., Cui L., Miao J.;
RT "Histone lysine methyltransferases and demethylases in Plasmodium
RT falciparum.";
RL Int. J. Parasitol. 38:1083-1097(2008).
CC -!- FUNCTION: Probable histone methyltransferase. {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) + N(6)-
CC methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61929;
CC -!- DEVELOPMENTAL STAGE: constitutive pattern of expression.
CC {ECO:0000269|PubMed:18299133}.
CC -!- BIOTECHNOLOGY: Possible candidate for an effective malaria vaccine as
CC determined by epitope response in sera. {ECO:0000269|PubMed:17653272}.
CC -!- SIMILARITY: Belongs to the class V-like SAM-binding methyltransferase
CC superfamily. {ECO:0000255|PROSITE-ProRule:PRU00190}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL844505; CAG25109.2; -; Genomic_DNA.
DR RefSeq; XP_966279.2; XM_961186.2.
DR SMR; C6KTD2; -.
DR BioGRID; 1210423; 12.
DR STRING; 5833.PFF1440w; -.
DR PRIDE; C6KTD2; -.
DR EnsemblProtists; CAG25109; CAG25109; PF3D7_0629700.
DR GeneID; 3885750; -.
DR KEGG; pfa:PF3D7_0629700; -.
DR VEuPathDB; PlasmoDB:PF3D7_0629700; -.
DR HOGENOM; CLU_222997_0_0_1; -.
DR InParanoid; C6KTD2; -.
DR OMA; CCESIYN; -.
DR Proteomes; UP000001450; Chromosome 6.
DR GO; GO:0035097; C:histone methyltransferase complex; IBA:GO_Central.
DR GO; GO:0042800; F:histone methyltransferase activity (H3-K4 specific); IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0019904; F:protein domain specific binding; ISS:GeneDB.
DR GO; GO:0006325; P:chromatin organization; IEA:UniProtKB-KW.
DR GO; GO:0051568; P:histone H3-K4 methylation; IBA:GO_Central.
DR GO; GO:0045893; P:positive regulation of transcription, DNA-templated; IBA:GO_Central.
DR Gene3D; 3.30.40.10; -; 2.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR SMART; SM00297; BROMO; 1.
DR SMART; SM00249; PHD; 4.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF47370; SSF47370; 1.
DR SUPFAM; SSF57903; SSF57903; 1.
DR SUPFAM; SSF82199; SSF82199; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 1: Evidence at protein level;
KW Bromodomain; Chromatin regulator; Merozoite; Metal-binding;
KW Methyltransferase; Reference proteome; Repeat; S-adenosyl-L-methionine;
KW Transcription; Transcription regulation; Transferase; Zinc; Zinc-finger.
FT CHAIN 1..6753
FT /note="Putative histone-lysine N-methyltransferase 1"
FT /id="PRO_0000388753"
FT DOMAIN 1832..1902
FT /note="Bromo"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00035"
FT DOMAIN 6612..6729
FT /note="SET"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00190"
FT DOMAIN 6737..6753
FT /note="Post-SET"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00155"
FT ZN_FING 1671..1728
FT /note="PHD-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00146"
FT ZN_FING 1761..1819
FT /note="PHD-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00146"
FT ZN_FING 1764..1817
FT /note="RING-type; degenerate"
FT /evidence="ECO:0000255"
FT ZN_FING 2510..2579
FT /note="PHD-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00146"
FT ZN_FING 5496..5532
FT /note="C2HC pre-PHD-type; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01146"
FT ZN_FING 5558..5610
FT /note="PHD-type 4; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01146"
FT REGION 1..28
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 418..458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 470..522
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 736..1024
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1487..1545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1557..1642
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2694..2768
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3053..3072
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4182..4254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4295..4349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5331..5460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5905..5958
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 6103..6133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 6212..6235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..449
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 471..506
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 736..798
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 812..842
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 843..865
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 883..900
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 901..946
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 963..1008
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1515..1539
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1562..1614
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1615..1632
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2694..2710
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2719..2738
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4182..4227
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4236..4254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4295..4325
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4334..4349
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5331..5402
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5429..5460
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 6114..6132
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 6212..6230
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 6753 AA; 796045 MW; EF689CABCFA20F00 CRC64;
MSENSDEGKM QKGKGNEMNE DKIICKKEES NSSSYKFISF IKTIEDVKEK WKSSNLSRMD
VLKLKNDDIN NDDNMNNMDN MNNMNNMNNI NNMNNMNNIN NMNNMNNINN MNNMNNMNNM
DNMNNNEKHP CSNNNYSFNN YIKESINYNT CTGYDKNNIM LNFLDKHKKS IFLKSRNNVN
TFQSDINSSK TSSFFSCMSL KNMNYHNDIN KYSNYRMMKN INGLHKLMIL KNKNKDKKLF
TNTSDKKNYY CLNNGDHIIN EKDSFSDTYI SYGCRKRRWK KKKKNIYNML FYDESINDYP
YKNLEKTTFS NSSKIYELFF NKYKKKKILN IECSGNKYKG LLRTNHFPPY ANFNFTIRRK
LQTSNVLGHF MISKCNFNRT ICYRKYIKCV NKYKNNKRNR VINMYVKKEN VDSIKGTFGN
MNNGVHHNNS RRRLNNTSKN NISNNNNNML KKKKGKKNYK GSFDQMIQED TTLDQAKKES
IKTVSKNERK NNMNKHSHDN KVSKLNKRMS NKRRNNKNCN PSNDMCNEDD VIEKICTTEE
VNDDEKKKKL SRHKKFVCER KKGVILQNNN KCKKKNDDNI INNNNDDVNN CGDNINDHHD
NRKDYDTTED QNKCPKILSI NFIKCDEKLF EKIYNDMFLR IGKIVTNKRK KYFLIYKIVK
DSFFRILILN TEKLEKNILQ VMAVQDVILC DKNVKIYSDP IYIRNNNKLY AIKFLKIFSL
KYMKKVKNMS EVNFSDDDEC KKENKDNISE SSKRSNNIGE KKMLHVEKSE EHDDMTSDSN
KEDTKIEEGR KKSNEVNIDV DDGEEEENVN NNDNNNDNNN DNDNSSDNNN NDDGSNDTES
CSKINKSKYK GKEKKDVKEN TDDKNLSDSN SNNSKKKFKV LNKAIKKDND KKKKYEKKNI
EGNSNNNMIL VRSNSSSTST SNSSSKSKSS NCRNKKNNQI SICSKMDEKN SEQKKKNIKK
KNKTCNEGKS KKDSTKLNCV KKVKNKSTDK KNGKSKINIK NEKKKKINNS KINKGRKGIN
KKDKGKGDDN NYVCLIYDDE KKFYFNFKKF KDIINVIKGS DESTRFYVDT NNNNHNNNMK
KKMKLFKKVE KSLYLENLDI DTDEILFRRP KFFNCIIEES FENYDINYEG EDGNFYVFKN
KLNKIRKKVT IKNETDSSDM YIELKDEEKG FKYIGYRVSF ELKKDNKKKA QVKVGIIKYY
SPKYKQFFIH HLENYKLYQT SDLSKGNNNN NNNNMKENGF VKCGRNSYSR ACSKSLNSSI
YINKYKNVEL NEKITNIIDD DNNNNINSSC IYKNNLSNEN NLCADKVLCD GNYNMLENDV
DEMNNVDQNQ KKRNDLVFSD VKGWYSPYFY NIKILNNYKE FERFDILEKC DKKKEMTDHV
NNTLNKNEIC SICSKRILFM KGHDHYSCSL SSAIYDMCSE AEKKEIDENN CIDIYWGIKC
FTCEKKYHAN CLEDDVLITK WFDKNILMKE YKKFIYKNSL KKEKRYFNNN GKNKRTKNGK
KKKNTIHKLE DKNNSHVVST ASNSHSIEVS SSESAKKGNE KNTATCKKRK TSCSALYKKV
KKGKNKNGEN KNGENKNGDI KNDDIKNDDI KNDDIRNDDD KNDENEENTK ECKNESNNID
NNNSSNDSLS DVDNNKDNGK SKNKKYRRCI NYIPSVEHSD ITYKKFICKD CYRCIYCCES
IYDYKQTPNV ANYVICKNCN MVAHGSCCFP NVPDIYLFNW KCDDCLKCNK CNYSNLCYIN
YNEWELHLDC CINCYKEYEK KNFCIMCNEK YDEDDSKKWV QCDVCKFWIH LSCDKNESRN
IETLSNKNID YKCPTCSIGT FHDKIERILY LLFLLDKYKN FTFHVPINYS IYWRIVKIPM
NLYIMKKKIW EKKYDTILDF LYDFMLIIHN AKMVHMPNTP IYKNACIFEK KGRVIIKNMF
NMTNEYLNKC IEDCVENYKN EINNLDSFQI GHDNNNNNDN NINNNNKMEG VNNESVIFMN
DGCNNKLYNK EGTNMTCVNM DSINKNLNDM NNNNNNNNKM EVFCGQNNIK LNEYYINKEG
YNIISNDNNM NYDNYNNVQN IGMKKMYTNI NDYNSSNVPN ESVYNKENFI NNSSIYNINE
NNTYDLNCDK KLIFDNKYNL SAYQNEGDIM NYNGMYQKNI NISNPPYNNN NNNNNNMVKE
GEKYILNNDD MNNISISKDN DINNNCNNIA NIYRKRKLEQ YKKDITQYEL YELFDFKNDS
FFINRNKEMF SCSSNDHNLI DYNILYVKLN GKIYFNTFYD KHDEFDVCKI LKVHFLKNNR
RGGGNHIMCP KIRNNQNLKE DVTQCDEEKI EQNNDNGCDD EYDNNNNNNN NIGSSSISSI
NKIHMNDTYN NSINDNSLRH NNNCSVFINS NIFMIDVLNE KVKINNIVKE TKRIISPINN
VLKFLKCIQI VFFYDNNTNE CTEKEKNVIS SNLCNNNFEK YVNINSIDHN NMSGEKKRKS
VEEIMSVVDN KSYYGFNKCS EYILYSSNEY DRAKKKENKR IKLLKNDILK ECCYICGSIE
YKNNFIYCCI CGISVHYSCA NIVHPFLFNL NDYKDHKKEI NNILNIITRN FKCDNCIKCD
NCLLHFDSSL KHNLYFKLKN LNVSCNNNRM ERRTYNYLYD VKIKVFTTNK EGTKQTGKCV
MKTKENDIIK LDEDNKQKDE LNEANKECFY KQDDVHVEKN CDELLYKNSF NSEECNKNEK
KKNDNNVDEN DDNVDKNDDN NNNNNNGDDN NNIDNTLVDG DMNKLENDLN NSNDFSINEE
KKNNKDTKKY MISSKEEINK EIENVSNQMD NKNNDVDKKK NISNEEIILD NTKNSCHDND
SNVLYNESVK KSFNACKIEK KEGIEKDDNL GHVNKLRNKR LIKILSIREE GNKLVKCFCC
GKPSHDECFY IIDNNTYKNK ICTLKKTVKN YVRKKNVENK CNTKVEMNSD VILLDDNIKD
HCSVNKTGDE HMNNNEFIDI SKDKISEHNI LDESFNSGVV CSDKNRKDQV VFIDGDVKNN
DISIIEENVT YYTSKEVNNN SVLKNERDVE STSELYIGGD KHVYNKLETD NAEMESNNNN
NNNNNNSDCN NNVSTLSTAA VNTINRSHMS PQKNDNDMNK INQDDIIHTK MNDKKNVKDD
NGNMTNLNNN IDKNDRNLDT ILKEHSVIMQ KLDELKENRN KEHDGEFYNN LILNNQFLIH
SFKVEEGVEI NKDSIIINKK MFNKYILNKI YELLDNKKRV RIKDLFNLFN LDEFRCSFIL
YEVLNALNTY LNEKLKEYNL TKIRNGTYKK YENEIRNENF CLFNKYLIVN NRDKINITKV
VLKIQSLITE ILSDAMNGNI VDEKDKSKKK ESISSDFNVV NNVKEYNIQN GCINIDINNY
FKGEYINFKP ILCSSQFNLD TNAQIFLQNK ANLSMFQKSA SYNNNFENNL ENNFVNNKMN
NMNNMKNNMN NMNNIMNNIM NNNMNNIMNN IMNNNMNNII NNNNIFNNDV SNNVDMQHKS
DQICIFNSNN IHSVPIFNNK PYMDNNFNNM VLVNKSNDIN GDDILCNMKN LYNKSVCNKQ
EKNGYSVVHK NICDVNFPYN DTKIWNGDIT NKSKTYTYTN INNNGSVIDY RKWSSVNRSV
SMNNMNCIRS NTNPRILSGT DHILKNNHMN KRNNVNNTYG VNNVNNVNNV NNTNNVSCFM
MRRKIRSNSL HDMNDKMNKM NDNINININN LNIINNNINI CNMNYPIDRV DSMNNTFNRN
NIIISQNTKE NTNILNFNGN DFCNNNNNNN NIINKENNFG TSNFNSPFHV GNLAKSYSYN
NTMSEKNMNE VICPNVRNNM SNMNNMNNMN NMNNMNNMNN MNNMNNMDSI NNVISYSCTN
PNMKDINFNS MRRSSSTPKK STGLLKNYFN IDIDQYNKTN SHIMNYNVSI NNDMNNVYIN
NNNNHNNNNN NSCNNFINND VINMNNVGTF YNFNQNAESY NNISNNIKCS NINNIIINNN
MNVNNLNYFC NNKEVGFKEN DLNINQKVHT INKDPYEMNH SKMYVTFPYC NTKDNNSNKS
SLKSNVLRLD KIRNRNIKKP LLYNRSSSMH SSDNLIYNVH NNNRNSPAEF TSDIIINKER
NMENNYNTSI NYVNNNITNN IIVANNNCYH TANNFIQINY SPSSSNIVNI NKDSYKYDIS
LENCIDLGST NTCMYNLNNI HNKDINNMPL EDCFSHIGSC PNEQNEHEDE INEDNKKDVT
KQQKKRKLNS VSKKDMLIKK EMNADDNINC KENTLQNESP KKDDELREND LKTTTENIKS
NEVEDKEFVD KKKKRKLSVK VKVNVNVKVE LQDTENDENK EKGIKKEKND EEKKNDAEKK
KKENKKGREK SVKVRKTKNQ TQVERENEKE NLMENVTNDK TSDIINNKTS DIINNKTSDI
INNKTSDIIN NKTSDIINNK TSDIINNKTS DIINDKTNDI INNKTSDCLS LVQNNKPVIH
IDCNTSLDIT DGYNNLISCE GRKKGKYNIE ENNINDDNMF QYSDAFDSEG SIKYKEEYDK
IYIPNNKNKI NNINNFLIKN NLLLKSKFMR ITPNTYLCRN CVLLYQNDFS YNTYEEEINK
MERSILNEYE KGVIKKNEYV HDAIQNDKVV NTDENIMTNV LADKVDDMKI VEKLNCPLNV
EEKGIEENIL YVKREGDELI NYGKHNDQMK EEEESEVVLG DVDFLKNEKK DNLILPYDEK
YTGVNNNNSS IIMSLKKCDK NITKKKGKDK NFNNIKYFKI DNNKSLWYEN YMYWMNKISL
YNNLFFTNMY YKENVIKCID MNNLFLNKMN IYKCSICCML YHCSNMDRNN FFIKNEDKMR
KKTKKNDCLK LLYICNLCTI KYDYVLKIIT YKENNKCWNE FYTDNYYKNN FYELVYFIIK
IIYKNIYINN FFNIFCSIMD NIFKEHKTLN MKLLSLFLFS NKYFKYEEFY HFFNNKMKKD
DRVFKKTFHM KNVCFMPRYS KKSIMYYIFS LFCNKIYNIN KKKKCIHNKR SYIHNKQSYI
HNKRTDVMYD NNVYFHLYEL ARKKLYDYSE KSQKPFDEII NMCLYLLYLY YLCNVLYKCV
RINNVLKGDK DKGDILCDNK KMKYYKKRKN IKLFLNIINS NEYINVNKIF HGKCIYELPF
YVNKERIKKK RNSVREFINN NDNNQENNAD KKKDHHIYNQ NYNHNSYLCD IGKVDESLHS
KEDNKKDVII TNNASIDSTS PSINMNKSVV SSIYSYNSNK EKKKNMRKCI KGLHHTIKNK
LNLYVKLMLD KYIQESSNYI KNENKDIKKT IESKNKDDKI CLLCNFSNYI YKGRLIPFYD
IYIHSECLKW SLNCTQCCYE ENKNKTIVNN DNGTKVDVNL DNADDIINNN NMNMLDNNMN
GPIKNNEENN NNNDNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
KSKKNTQKKK DHVNDVKINQ NNSNNKNNKK KKTSKDNEEL KSDNTKNNKT KDSDGNNNDK
TKLEKINLIH NKQSNEISCK IDNNNIINDI STNNPYMKEK KCKNKEKNRG SKNNNIKNIK
LIDMCEWKED RNFYNIYENM IEVDEDNVKE IIFDSIKSTC FLCGYNNASV YCSNEDCNVK
FHLNCAFYST VIKDPSNNPF FRYLKCFNLV EFNKDTIFYK NMYHDPNVNN SVSSHVCTDN
RYYKEMIEKN YVDIFPVHII YKIKKIWCNK CWNKKKIYNL FYIQKCFMSP YYYDNIKTEE
SVPNKITPMK KDKSDITNEV KEEKDDDILY DSKVHKEYFT LRNILMDDNL LNNIIRNKKS
CSMEIQENND KMKGDNNIDN EDVRNVLCDG EERVSYNRNK LNDILLKMDM KDILKYFIDF
YFENGSYYIL DNILYSVNHC VKIKYKKKSL YNIQDVLNKE TKIGTMMREL EGLISKYVNR
SNDNNYMDRK YDMLLLKNIK SDINNDNNND INNNDNNNNE NNNENINDNN NNNNNNNNNN
NNNNSNNNNN NNYYYYHNNE GKYNEQGSYQ HIDIQNIIND KSVENLIKGY FILKNFLHIG
NNNITLQNED LLILKKSENS FVHNLYDNDK VYNVHVPYVY TKKMNTSYMN KKENDKKYNK
SVNKNSCKSK NSILDNINFD KNKNKITKKY TAFIIDNNEY TTDCSNEENN TSDDEENENR
KNENDDDNIP EHIKMNNIMN SQQKKENDFK NINLYFQLTN VIKKVSINKL EGNFFNYEEK
GNLLGSNVSK IKMNELLECN VGEENFCDDD QKFSDNKNYA SDDEEKKKKK RKNQTRFYNY
PKRISTTNNN KNVNVLVNSL NNNLINKKEY FLNIIMNENN DLYMKKINEK YFPNYHPKKR
KKKKLDNTSY INHNYNYNYP YNYNLLSNNS KSRILKVGCH NILNIGDILK YDGDKIIYPC
GYLNMRIFYN LPSYYLFQIY KNANIDDINR KTKLLEKIFL QLRATYIFSI TLREQNFFFS
ILLFPLINID YFSESDATNF ILAEGYNINE VYMKFLSLFN SQNYICDDMN NDYSHYNAKY
GNIYKCLETY ILKSVEHNKF IDSHTFFGLT LPCVVYQIKY KLFKYMYKHL SEKIKTYIKK
SKDSVMKKRI KGCTREVVYN DNVLCKYSNL DTTIFKENEK ENEKNIRKTV KYKYNINSAM
SYRYLMNISS NLRLYVKKSS IHGYGLYTCE FINEGEPVIE YIGEYIRNII SDKREKYYDK
IESSCYMFRL NENIIIDATK WGNVSRFINH SCEPNCFCKI VSCDQNLKHI VIFAKRDIAA
HEEITYDYQF GVESEGKKLI CLCGSSTCLG RMN