SRFAB_BACSU
ID SRFAB_BACSU Reviewed; 3583 AA.
AC Q04747;
DT 01-FEB-1995, integrated into UniProtKB/Swiss-Prot.
DT 16-JUN-2009, sequence version 3.
DT 03-AUG-2022, entry version 157.
DE RecName: Full=Surfactin synthase subunit 2;
GN Name=srfAB; Synonyms=comL, srfA2; OrderedLocusNames=BSU03490;
OS Bacillus subtilis (strain 168).
OC Bacteria; Firmicutes; Bacilli; Bacillales; Bacillaceae; Bacillus.
OX NCBI_TaxID=224308;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-3073.
RC STRAIN=168;
RX PubMed=8441623; DOI=10.1093/nar/21.1.93;
RA Fuma S., Fujishima Y., Corbell N., D'Souza C., Nakano M.M., Zuber P.,
RA Yamane K.;
RT "Nucleotide sequence of 5' portion of srfA that contains the region
RT required for competence establishment in Bacillus subtilis.";
RL Nucleic Acids Res. 21:93-97(1993).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=168 / JH642;
RX PubMed=8355609; DOI=10.1111/j.1365-2958.1993.tb01629.x;
RA Cosmina P., Rodriguez F., de Ferra F., Grandi G., Perego M., Venema G.,
RA van Sinderen D.;
RT "Sequence and analysis of the genetic locus responsible for surfactin
RT synthesis in Bacillus subtilis.";
RL Mol. Microbiol. 8:821-831(1993).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=168;
RX PubMed=8969502; DOI=10.1099/13500872-142-11-3047;
RA Yamane K., Kumano M., Kurita K.;
RT "The 25 degrees-36 degrees region of the Bacillus subtilis chromosome:
RT determination of the sequence of a 146 kb segment and identification of 113
RT genes.";
RL Microbiology 142:3047-3056(1996).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=168;
RX PubMed=9384377; DOI=10.1038/36786;
RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G., Azevedo V.,
RA Bertero M.G., Bessieres P., Bolotin A., Borchert S., Borriss R.,
RA Boursier L., Brans A., Braun M., Brignell S.C., Bron S., Brouillet S.,
RA Bruschi C.V., Caldwell B., Capuano V., Carter N.M., Choi S.-K.,
RA Codani J.-J., Connerton I.F., Cummings N.J., Daniel R.A., Denizot F.,
RA Devine K.M., Duesterhoeft A., Ehrlich S.D., Emmerson P.T., Entian K.-D.,
RA Errington J., Fabret C., Ferrari E., Foulger D., Fritz C., Fujita M.,
RA Fujita Y., Fuma S., Galizzi A., Galleron N., Ghim S.-Y., Glaser P.,
RA Goffeau A., Golightly E.J., Grandi G., Guiseppi G., Guy B.J., Haga K.,
RA Haiech J., Harwood C.R., Henaut A., Hilbert H., Holsappel S., Hosono S.,
RA Hullo M.-F., Itaya M., Jones L.-M., Joris B., Karamata D., Kasahara Y.,
RA Klaerr-Blanchard M., Klein C., Kobayashi Y., Koetter P., Koningstein G.,
RA Krogh S., Kumano M., Kurita K., Lapidus A., Lardinois S., Lauber J.,
RA Lazarevic V., Lee S.-M., Levine A., Liu H., Masuda S., Mauel C.,
RA Medigue C., Medina N., Mellado R.P., Mizuno M., Moestl D., Nakai S.,
RA Noback M., Noone D., O'Reilly M., Ogawa K., Ogiwara A., Oudega B.,
RA Park S.-H., Parro V., Pohl T.M., Portetelle D., Porwollik S.,
RA Prescott A.M., Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M.,
RA Reynolds S., Rieger M., Rivolta C., Rocha E., Roche B., Rose M., Sadaie Y.,
RA Sato T., Scanlan E., Schleich S., Schroeter R., Scoffone F., Sekiguchi J.,
RA Sekowska A., Seror S.J., Serror P., Shin B.-S., Soldo B., Sorokin A.,
RA Tacconi E., Takagi T., Takahashi H., Takemaru K., Takeuchi M.,
RA Tamakoshi A., Tanaka T., Terpstra P., Tognoni A., Tosato V., Uchiyama S.,
RA Vandenbol M., Vannier F., Vassarotti A., Viari A., Wambutt R., Wedler E.,
RA Wedler H., Weitzenegger T., Winters P., Wipat A., Yamamoto H., Yamane K.,
RA Yasumoto K., Yata K., Yoshida K., Yoshikawa H.-F., Zumstein E.,
RA Yoshikawa H., Danchin A.;
RT "The complete genome sequence of the Gram-positive bacterium Bacillus
RT subtilis.";
RL Nature 390:249-256(1997).
RN [5]
RP SEQUENCE REVISION.
RX PubMed=19383706; DOI=10.1099/mic.0.027839-0;
RA Barbe V., Cruveiller S., Kunst F., Lenoble P., Meurice G., Sekowska A.,
RA Vallenet D., Wang T., Moszer I., Medigue C., Danchin A.;
RT "From a consortium sequence to a unified sequence: the Bacillus subtilis
RT 168 reference genome a decade later.";
RL Microbiology 155:1758-1775(2009).
RN [6]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 507-795.
RC STRAIN=ATCC 21332 / IAM 1213;
RX PubMed=1601288; DOI=10.1016/0378-1097(92)90508-l;
RA Borchert S., Patil S.S., Marahiel M.A.;
RT "Identification of putative multifunctional peptide synthetase genes using
RT highly conserved oligonucleotide sequences derived from known
RT synthetases.";
RL FEMS Microbiol. Lett. 71:175-180(1992).
RN [7]
RP PHOSPHOPANTETHEINYLATION [LARGE SCALE ANALYSIS] AT SER-999 AND SER-2040,
RP AND IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=168;
RX PubMed=17218307; DOI=10.1074/mcp.m600464-mcp200;
RA Macek B., Mijakovic I., Olsen J.V., Gnad F., Kumar C., Jensen P.R.,
RA Mann M.;
RT "The serine/threonine/tyrosine phosphoproteome of the model bacterium
RT Bacillus subtilis.";
RL Mol. Cell. Proteomics 6:697-707(2007).
CC -!- FUNCTION: This protein is a multifunctional enzyme able to activate and
CC polymerize the amino acids Leu, Glu, Asp and Val. The activation site
CC for each of these amino acids is a separate domain.
CC -!- COFACTOR:
CC Name=pantetheine 4'-phosphate; Xref=ChEBI:CHEBI:47942;
CC Note=Binds 3 phosphopantetheines covalently.;
CC -!- PATHWAY: Antibiotic biosynthesis; surfactin biosynthesis.
CC -!- SIMILARITY: Belongs to the ATP-dependent AMP-binding enzyme family.
CC {ECO:0000305}.
CC -!- CAUTION: The phosphoserine observed at Ser-999 and Ser-2040 in
CC PubMed:17218307 may result from the secondary neutral loss of
CC pantetheine from the phosphodiester-linked cofactor. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D13262; BAA02523.1; -; Genomic_DNA.
DR EMBL; X70356; CAA49817.1; -; Genomic_DNA.
DR EMBL; D50453; BAA08983.1; -; Genomic_DNA.
DR EMBL; AL009126; CAB12143.2; -; Genomic_DNA.
DR EMBL; X65835; CAA46678.1; -; Genomic_DNA.
DR PIR; I40486; I40486.
DR RefSeq; NP_388231.2; NC_000964.3.
DR RefSeq; WP_010886403.1; NZ_CP053102.1.
DR SMR; Q04747; -.
DR IntAct; Q04747; 3.
DR MINT; Q04747; -.
DR STRING; 224308.BSU03490; -.
DR jPOST; Q04747; -.
DR PaxDb; Q04747; -.
DR PRIDE; Q04747; -.
DR EnsemblBacteria; CAB12143; CAB12143; BSU_03490.
DR GeneID; 938303; -.
DR KEGG; bsu:BSU03490; -.
DR PATRIC; fig|224308.179.peg.367; -.
DR eggNOG; COG1020; Bacteria.
DR InParanoid; Q04747; -.
DR OMA; FNMQNME; -.
DR PhylomeDB; Q04747; -.
DR BioCyc; BSUB:BSU03490-MON; -.
DR UniPathway; UPA00181; -.
DR Proteomes; UP000001570; Chromosome.
DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW.
DR GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR GO; GO:0017000; P:antibiotic biosynthetic process; IEA:UniProtKB-KW.
DR GO; GO:1901576; P:organic substance biosynthetic process; IEA:UniProt.
DR GO; GO:0030435; P:sporulation resulting in formation of a cellular spore; IEA:UniProtKB-KW.
DR Gene3D; 1.10.1200.10; -; 3.
DR Gene3D; 3.30.300.30; -; 3.
DR Gene3D; 3.30.559.10; -; 4.
DR InterPro; IPR010071; AA_adenyl_domain.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR025110; AMP-bd_C.
DR InterPro; IPR045851; AMP-bd_C_sf.
DR InterPro; IPR020845; AMP-binding_CS.
DR InterPro; IPR000873; AMP-dep_Synth/Lig.
DR InterPro; IPR023213; CAT-like_dom_sf.
DR InterPro; IPR001242; Condensatn.
DR InterPro; IPR010060; NRPS_synth.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR Pfam; PF00501; AMP-binding; 3.
DR Pfam; PF13193; AMP-binding_C; 3.
DR Pfam; PF00668; Condensation; 4.
DR Pfam; PF00550; PP-binding; 3.
DR SMART; SM00823; PKS_PP; 3.
DR SUPFAM; SSF47336; SSF47336; 3.
DR TIGRFAMs; TIGR01733; AA-adenyl-dom; 3.
DR TIGRFAMs; TIGR01720; NRPS-para261; 1.
DR PROSITE; PS00455; AMP_BINDING; 3.
DR PROSITE; PS50075; CARRIER; 3.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 3.
PE 1: Evidence at protein level;
KW Antibiotic biosynthesis; Ligase; Multifunctional enzyme;
KW Phosphopantetheine; Phosphoprotein; Reference proteome; Repeat;
KW Sporulation.
FT CHAIN 1..3583
FT /note="Surfactin synthase subunit 2"
FT /id="PRO_0000193100"
FT DOMAIN 965..1039
FT /note="Carrier 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 2005..2080
FT /note="Carrier 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 3034..3108
FT /note="Carrier 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT REGION ?..1040
FT /note="Domain 1 (valine-activating)"
FT REGION ?..2091
FT /note="Domain 2 (aspartate-activating)"
FT REGION ?..3110
FT /note="Domain 3 (D-leucine-activating)"
FT MOD_RES 999
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258,
FT ECO:0000269|PubMed:17218307"
FT MOD_RES 2040
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258,
FT ECO:0000269|PubMed:17218307"
FT MOD_RES 3069
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT CONFLICT 33
FT /note="F -> S (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 42
FT /note="G -> A (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 110
FT /note="D -> Q (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 113
FT /note="R -> A (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 115
FT /note="G -> A (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 139
FT /note="V -> A (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 259
FT /note="W -> L (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 309
FT /note="A -> R (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 478..480
FT /note="TPA -> SRP (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 507..513
FT /note="LVEHGLQ -> ACRTRPS (in Ref. 6; CAA46678)"
FT /evidence="ECO:0000305"
FT CONFLICT 595
FT /note="Missing (in Ref. 6; CAA46678)"
FT /evidence="ECO:0000305"
FT CONFLICT 648
FT /note="R -> A (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 680..682
FT /note="ETL -> RHV (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 694..698
FT /note="EQSIT -> DKRIS (in Ref. 6; CAA46678)"
FT /evidence="ECO:0000305"
FT CONFLICT 788
FT /note="M -> L (in Ref. 6; CAA46678)"
FT /evidence="ECO:0000305"
FT CONFLICT 939..940
FT /note="PL -> LV (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1038
FT /note="N -> I (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1133
FT /note="Q -> H (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1310
FT /note="V -> C (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1333
FT /note="V -> G (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1384
FT /note="P -> R (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1582
FT /note="E -> G (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1677
FT /note="E -> KRRADG (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1695
FT /note="S -> C (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1750
FT /note="K -> F (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 1782
FT /note="T -> S (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1796..1817
FT /note="GAIAGRVDLYEPDAFAKRPTIG -> APSPGGLICMSRCICETPDNR (in
FT Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 1910..1911
FT /note="PK -> LG (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2070
FT /note="R -> C (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2074
FT /note="A -> V (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2134..2141
FT /note="SRLRDSLN -> WPARLTP (in Ref. 2; CAA49817 and 3;
FT BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2134..2135
FT /note="SR -> WP (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2441
FT /note="Q -> E (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2481..2485
FT /note="ATDLF -> RQICS (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2542..2562
FT /note="TVHQLFEETVQRHKDRPAVTY -> DGCISYSKRLSSATKTARLSHT (in
FT Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2604..2611
FT /note="MSAAVLGV -> KCPPRCSAS (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2640..2641
FT /note="KL -> NV (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2709
FT /note="H -> D (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2719
FT /note="H -> D (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2872..2877
FT /note="GELCVA -> RALRG (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2895..2896
FT /note="RF -> L (in Ref. 1; BAA02523)"
FT /evidence="ECO:0000305"
FT CONFLICT 2954..2956
FT /note="QDA -> EDR (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 2960
FT /note="A -> R (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 3236..3237
FT /note="EQ -> AE (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
FT CONFLICT 3533
FT /note="Q -> E (in Ref. 2; CAA49817 and 3; BAA08983)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 3583 AA; 400944 MW; A257AC7643C4C64C CRC64;
MSKKSIQKVY ALTPMQEGML YHAMLDPHSS SYFTQLELGI HGAFDLEIFE KSVNELIRSY
DILRTVFVHQ QLQKPRQVVL AERKTKVHYE DISHADENRQ KEHIERYKQD VQRQGFNLAK
DILFKVAVFR LAADQLYLVW SNHHIMMDGW SMGVLMKSLF QNYEALRAGR TPANGQGKPY
SDYIKWLGKQ DNEEAESYWS ERLAGFEQPS VLPGRLPVKK DEYVNKEYSF TWDETLVARI
QQTANLHQVT GPNLFQAVWG IVLSKYNFTD DVIFGTVVSG RPSEINGIET MAGLFINTIP
VRVKVERDAA FADIFTAVQQ HAVEAERYDY VPLYEIQKRS ALDGNLLNHL VAFENYPLDQ
ELENGSMEDR LGFSIKVESA FEQTSFDFNL IVYPGKTWTV KIKYNGAAFD SAFIERTAEH
LTRMMEAAVD QPAAFVREYG LVGDEEQRQI VEVFNSTKAE LPEGMAVHQV FEEQAKRTPA
STAVVYEGTK LTYRELNAAA NRLARKLVEH GLQKGETAAI MNDRSVETVV GMLAVLKAGA
AYVPLDPALP GDRLRFMAED SSVRMVLIGN SYTGQAHQLQ VPVLTLDIGF EESEAADNLN
LPSAPSDLAY IMYTSGSTGK PKGVMIEHKS ILRLVKNAGY VPVTEEDRMA QTGAVSFDAG
TFEVFGALLN GAALYPVKKE TLLDAKQFAA FLREQSITTM WLTSPLFNQL AAKDAGMFGT
LRHLIIGGDA LVPHIVSKVK QASPSLSLWN GYGPTENTTF STSFLIDREY GGSIPIGKPI
GNSTAYIMDE QQCLQPIGAP GELCVGGIGV ARGYVNLPEL TEKQFLEDPF RPGERIYRTG
DLARWLPDGN IEFLGRIDNQ VKVRGFRIEL GEIETKLNMA EHVTEAAVII RKNKADENEI
CAYFTADREV AVSELRKTLS QSLPDYMVPA HLIQMDSLPL TPNGKINKKE LPAPQSEAVQ
PEYAAPKTES EKKLAEIWEG ILGVKAGVTD NFFMIGGHSL KAMMMTAKIQ EHFHKEVPIK
VLFEKPTIQE LALYLEENES KEEQTFEPIR QASYQQHYPV SPAQRRMYIL NQLGQANTSY
NVPAVLLLEG EVDKDRLENA IQQLINRHEI LRTSFDMIDG EVVQTVHKNI SFQLEAAKGR
EEDAEEIIKA FVQPFELNRA PLVRSKLVQL EEKRHLLLID MHHIITDGSS TGILIGDLAK
IYQGADLELP QIHYKDYAVW HKEQTNYQKD EEYWLDVFKG ELPILDLPAD FERPAERSFA
GERVMFGLDK QITAQIKSLM AETDTTMYMF LLAAFNVLLS KYASQDDIIV GSPTAGRTHP
DLQGVPGMFV NTVALRTAPA GDKTFAQFLE EVKTASLQAF EHQSYPLEEL IEKLPLTRDT
SRSPLFSVMF NMQNMEIPSL RLGDLKISSY SMLHHVAKFD LSLEAVEREE DIGLSFDYAT
ALFKDETIRR WSRHFVNIIK AAAANPNVRL SDVDLLSSAE TAALLEERHM TQITEATFAA
LFEKQAQQTP DHSAVKAGGN LLTYRELDEQ ANQLAHHLRA QGAGNEDIVA IVMDRSAEVM
VSILGVMKAG AAFLPIDPDT PEERIRYSLE DSGAKFAVVN ERNMTAIGQY EGIIVSLDDG
KWRNESKERP SSISGSRNLA YVIYTSGTTG KPKGVQIEHR NLTNYVSWFS EEAGLTENDK
TVLLSSYAFD LGYTSMFPVL LGGGELHIVQ KETYTAPDEI AHYIKEHGIT YIKLTPSLFH
TIVNTASFAK DANFESLRLI VLGGEKIIPT DVIAFRKMYG HTEFINHYGP TEATIGAIAG
RVDLYEPDAF AKRPTIGRPI ANAGALVLNE ALKLVPPGAS GQLYITGQGL ARGYLNRPQL
TAERFVENPY SPGSLMYKTG DVVRRLSDGT LAFIGRADDQ VKIRGYRIEP KEIETVMLSL
SGIQEAVVLA VSEGGLQELC AYYTSDQDIE KAELRYQLSL TLPSHMIPAF FVQVDAIPLT
ANGKTDRNAL PKPNAAQSGG KALAAPETAL EESLCRIWQK TLGIEAIGID DNFFDLGGHS
LKGMMLIANI QAELEKSVPL KALFEQPTVR QLAAYMEASA VSGGHQVLKP ADKQDMYPLS
SAQKRMYVLN QLDRQTISYN MPSVLLMEGE LDISRLRDSL NQLVNRHESL RTSFMEANGE
PVQRIIEKAE VDLHVFEAKE DEADQKIKEF IRPFDLNDAP LIRAALLRIE AKKHLLLLDM
HHIIADGVSR GIFVKELALL YKGEQLPEPT LHYKDFAVWQ NEAEQKERMK EHEAYWMSVL
SGELPELDLP LDYARPPVQS FKGDTIRFRT GSETAKAVEK LLAETGTTLH MVLHAVFHVF
LSKISGQRDI VIGSVTAGRT NADVQDMPGM FVNTLALRME AKEQQTFAEL LELAKQTNLS
ALEHQEYPFE DLVNQLDLPR DMSRNPLFNV MVTTENPDKE QLTLQNLSIS PYEAHQGTSK
FDLTLGGFTD ENGIGLQLEY ATDLFAKETA EKWSEYVLRL LKAVADNPNQ PLSSLLLVTE
TEKQALLEAW KGKALPVPTD KTVHQLFEET VQRHKDRPAV TYNGQSWTYG ELNAKANRLA
RILMDCGISP DDRVGVLTKP SLEMSAAVLG VLKAGAAFVP IDPDYPDQRI EYILQDSGAK
LLLKQEGISV PDSYTGDVIL LDGSRTILSL PLDENDEGNP ETAVTAENLA YMIYTSGTTG
QPKGVMVEHH ALVNLCFWHH DAFSMTAEDR SAKYAGFGFD ASIWEMFPTW TIGAELHVID
EAIRLDIVRL NDYFETNGVT ITFLPTQLAE QFMELENTSL RVLLTGGDKL KRAVKKPYTL
VNNYGPTENT VVATSAEIHP EEGSLSIGRA IANTRVYILG EGNQVQPEGV AGELCVAGRG
LARGYLNRED ETAKRFVADP FVPGERMYRT GDLVKWVNGG IEYIGRIDQQ VKVRGYRIEL
SEIEVQLAQL SEVQDAAVTA VKDKGGNTAI AAYVTPETAD IEALKSTLKE TLPDYMIPAF
WVTLNELPVT ANGKVDRKAL PEPDIEAGSG EYKAPTTDME ELLAGIWQDV LGMSEVGVTD
NFFSLGGDSI KGIQMASRLN QHGWKLEMKD LFQHPTIEEL TQYVERAEGK QADQGPVEGE
VILTPIQRWF FEKNFTNKHH WNQSVMLHAK KGFDPERVEK TLQALIEHHD ALRMVYREEN
GDIVQVYKPI GESKVSFEIV DLYGSDEEML RSQIKLLANK LQSSLDLRNG PLLKAEQYRT
EAGDHLLIAV HHLVVDGVSW RILLEDFASG YMQAEKEESL VFPQKTNSFK DWAEELAAFS
QSAHLLQQAE YWSQIAAEQV SPLPKDCETE QRIVKDTSSV LCELTAEDTK HLLTDVHQPY
GTEINDILLS ALGLTMKEWT KGAKIGINLE GHGREDIIPN VNISRTVGWF TAQYPVVLDI
SDADASAVIK TVKENLRRIP DKGVGYGILR YFTETAETKG FTPEISFNYL GQFDSEVKTD
FFEPSAFDMG RQVSGESEAL YALSFSGMIR NGRFVLSCSY NEKEFERATV EEQMERFKEN
LLMLIRHCTE KEDKEFTPSD FSAEDLEMDE MGDIFDMLEE NLK