NRPS1_ASPFU
ID NRPS1_ASPFU Reviewed; 6269 AA.
AC Q4WT66;
DT 18-APR-2012, integrated into UniProtKB/Swiss-Prot.
DT 05-JUL-2005, sequence version 1.
DT 23-FEB-2022, entry version 108.
DE RecName: Full=Nonribosomal peptide synthetase 1;
DE EC=6.3.2.-;
GN Name=NRPS1; Synonyms=pes1, pesB; ORFNames=AFUA_1G10380;
OS Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC
OS A1100) (Aspergillus fumigatus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus;
OC Aspergillus subgen. Fumigati.
OX NCBI_TaxID=330879;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100;
RX PubMed=16372009; DOI=10.1038/nature04332;
RA Nierman W.C., Pain A., Anderson M.J., Wortman J.R., Kim H.S., Arroyo J.,
RA Berriman M., Abe K., Archer D.B., Bermejo C., Bennett J.W., Bowyer P.,
RA Chen D., Collins M., Coulsen R., Davies R., Dyer P.S., Farman M.L.,
RA Fedorova N., Fedorova N.D., Feldblyum T.V., Fischer R., Fosker N.,
RA Fraser A., Garcia J.L., Garcia M.J., Goble A., Goldman G.H., Gomi K.,
RA Griffith-Jones S., Gwilliam R., Haas B.J., Haas H., Harris D.E.,
RA Horiuchi H., Huang J., Humphray S., Jimenez J., Keller N., Khouri H.,
RA Kitamoto K., Kobayashi T., Konzack S., Kulkarni R., Kumagai T., Lafton A.,
RA Latge J.-P., Li W., Lord A., Lu C., Majoros W.H., May G.S., Miller B.L.,
RA Mohamoud Y., Molina M., Monod M., Mouyna I., Mulligan S., Murphy L.D.,
RA O'Neil S., Paulsen I., Penalva M.A., Pertea M., Price C., Pritchard B.L.,
RA Quail M.A., Rabbinowitsch E., Rawlins N., Rajandream M.A., Reichard U.,
RA Renauld H., Robson G.D., Rodriguez de Cordoba S., Rodriguez-Pena J.M.,
RA Ronning C.M., Rutter S., Salzberg S.L., Sanchez M., Sanchez-Ferrero J.C.,
RA Saunders D., Seeger K., Squares R., Squares S., Takeuchi M., Tekaia F.,
RA Turner G., Vazquez de Aldana C.R., Weidman J., White O., Woodward J.R.,
RA Yu J.-H., Fraser C.M., Galagan J.E., Asai K., Machida M., Hall N.,
RA Barrell B.G., Denning D.W.;
RT "Genomic sequence of the pathogenic and allergenic filamentous fungus
RT Aspergillus fumigatus.";
RL Nature 438:1151-1156(2005).
RN [2]
RP PROTEIN SEQUENCE OF 867-876; 1193-1201; 3391-3402 AND 3696-3707, DISRUPTION
RP PHENOTYPE, AND FUNCTION.
RX PubMed=16759234; DOI=10.1111/j.1742-4658.2006.05315.x;
RA Reeves E.P., Reiber K., Neville C., Scheibner O., Kavanagh K., Doyle S.;
RT "A nonribosomal peptide synthetase (Pes1) confers protection against
RT oxidative stress in Aspergillus fumigatus.";
RL FEBS J. 273:3038-3053(2006).
RN [3]
RP DOMAIN, AND PHOSPHOPANTETHEINYLATION.
RX PubMed=15719355; DOI=10.1002/cbic.200400147;
RA Neville C., Murphy A., Kavanagh K., Doyle S.;
RT "A 4'-phosphopantetheinyl transferase mediates non-ribosomal peptide
RT synthetase activation in Aspergillus fumigatus.";
RL ChemBioChem 6:679-685(2005).
RN [4]
RP NOMENCLATURE.
RX PubMed=16962256; DOI=10.1016/j.gene.2006.07.008;
RA Cramer R.A. Jr., Stajich J.E., Yamanaka Y., Dietrich F.S., Steinbach W.J.,
RA Perfect J.R.;
RT "Phylogenomic analysis of non-ribosomal peptide synthetases in the genus
RT Aspergillus.";
RL Gene 383:24-32(2006).
RN [5]
RP REVIEW ON FUNCTION, AND DOMAIN.
RX PubMed=17464044; DOI=10.1099/mic.0.2006/006908-0;
RA Stack D., Neville C., Doyle S.;
RT "Nonribosomal peptide synthesis in Aspergillus fumigatus and other fungi.";
RL Microbiology 153:1297-1306(2007).
CC -!- FUNCTION: Nonribosomal peptide synthesis (NRPS) is a key mechanism
CC responsible for the biosynthesis of bioactive metabolites which are
CC potentially contributing to organismal virulence. Contributes to
CC improved fungal tolerance against oxidative stress, during the
CC infection process. {ECO:0000269|PubMed:16759234}.
CC -!- DOMAIN: NRP synthetases are composed of discrete domains (adenylation
CC (A), thiolation (T) or peptidyl carrier protein (PCP) and condensation
CC (C) domains) which when grouped together are referred to as a single
CC module. Each module is responsible for the recognition (via the A
CC domain) and incorporation of a single amino acid into the growing
CC peptide product. Thus, an NRP synthetase is generally composed of one
CC or more modules and can terminate in a thioesterase domain (TE) that
CC releases the newly synthesized peptide from the enzyme. Occasionally,
CC epimerase (E) domains (responsible for l- to d- amino acid conversion)
CC are present within the NRP synthetase. NRPS1 has the following
CC architecture: A-T-E-C-A-C-A-T-C-A-T-E-C-T-C-T.
CC {ECO:0000269|PubMed:15719355, ECO:0000269|PubMed:17464044}.
CC -!- PTM: The thiolation domains are 4'-phosphopantetheinylated.
CC -!- DISRUPTION PHENOTYPE: Decreases the virulence, increases the
CC susceptibility to oxidative stress and exhibites altered conidial
CC morphology and hydrophobicity. {ECO:0000269|PubMed:16759234}.
CC -!- SIMILARITY: Belongs to the NRP synthetase family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAHF01000004; EAL90366.1; -; Genomic_DNA.
DR RefSeq; XP_752404.1; XM_747311.1.
DR SMR; Q4WT66; -.
DR STRING; 746128.CADAFUBP00000962; -.
DR PRIDE; Q4WT66; -.
DR EnsemblFungi; EAL90366; EAL90366; AFUA_1G10380.
DR GeneID; 3510659; -.
DR KEGG; afm:AFUA_1G10380; -.
DR eggNOG; KOG1178; Eukaryota.
DR HOGENOM; CLU_000022_60_6_1; -.
DR InParanoid; Q4WT66; -.
DR OMA; DIVSWRI; -.
DR OrthoDB; 4243at2759; -.
DR PHI-base; PHI:2511; -.
DR Proteomes; UP000002530; Chromosome 1.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0000036; F:acyl carrier activity; IBA:GO_Central.
DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW.
DR GO; GO:0031177; F:phosphopantetheine binding; IBA:GO_Central.
DR Gene3D; 1.10.1200.10; -; 5.
DR Gene3D; 3.30.300.30; -; 4.
DR Gene3D; 3.30.559.10; -; 6.
DR Gene3D; 3.40.50.12780; -; 4.
DR InterPro; IPR010071; AA_adenyl_domain.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR045851; AMP-bd_C_sf.
DR InterPro; IPR020845; AMP-binding_CS.
DR InterPro; IPR000873; AMP-dep_Synth/Lig.
DR InterPro; IPR042099; ANL_N_sf.
DR InterPro; IPR023213; CAT-like_dom_sf.
DR InterPro; IPR001242; Condensatn.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR Pfam; PF00501; AMP-binding; 4.
DR Pfam; PF00668; Condensation; 7.
DR Pfam; PF00550; PP-binding; 5.
DR SMART; SM00823; PKS_PP; 4.
DR SUPFAM; SSF47336; SSF47336; 5.
DR TIGRFAMs; TIGR01733; AA-adenyl-dom; 3.
DR PROSITE; PS00455; AMP_BINDING; 1.
DR PROSITE; PS50075; CARRIER; 5.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 2.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Ligase; Phosphopantetheine; Phosphoprotein;
KW Reference proteome; Repeat; Virulence.
FT CHAIN 1..6269
FT /note="Nonribosomal peptide synthetase 1"
FT /id="PRO_0000416542"
FT DOMAIN 803..879
FT /note="Carrier 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 3392..3468
FT /note="Carrier 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 4487..4563
FT /note="Carrier 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 5552..5628
FT /note="Carrier 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT DOMAIN 6139..6220
FT /note="Carrier 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT REGION 249..781
FT /note="Adenylation 1"
FT REGION 894..1342
FT /note="Epimerase 1"
FT REGION 1373..1775
FT /note="Condensation 1"
FT REGION 1725..2333
FT /note="Adenylation 2"
FT REGION 2364..2386
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2597..2670
FT /note="Condensation 2"
FT REGION 2845..3368
FT /note="Adenylation 3"
FT REGION 3512..3898
FT /note="Condensation 3"
FT REGION 3919..4454
FT /note="Adenylation 4"
FT REGION 4578..5024
FT /note="Epimerase 2"
FT REGION 5052..5466
FT /note="Condensation 4"
FT REGION 5628..5658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5720..6067
FT /note="Condensation 5"
FT COMPBIAS 2367..2386
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 840
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT MOD_RES 3429
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT MOD_RES 4524
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
FT MOD_RES 5589
FT /note="O-(pantetheine 4'-phosphoryl)serine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00258"
SQ SEQUENCE 6269 AA; 697525 MW; 55390232D5C21535 CRC64;
MAQSTVSCYL PNFGATCDGP KRPLSLKLRT DALQSSGLLS AWREGQLTQL LQASWALILH
RYTGSEDICF GYHQIGNVSQ DLLQSLEAVD PALCKVSVKE GISLKMFFDQ FKTHNSPDSP
LEIDRSSRKA SENARLYNTI LMIQICHDSN GTSPAIPLPS SLPIALPQEV STERLVLYLP
VQPSETYLQS RVRLHVKVLQ DEVNIFFEWW NNDMPTAQMK SIAGAFGHIL TDLLAANDTA
VDDLDIFPEN DWSRVCSFNS VLPKKHERCI HEMIYDRTLL QPENEAVCSW DGSLTYKELD
LLSSKVAYDL QQRGVGPEVC VALCFEKSKW YTVAMLAVLK AGGAFVPMDP SHPTARLQSL
VEGVQAHIML CSRSQTGKLQ TVAETLIPLD EETVDGLPDL PTSTFSSTTV KSSNAAYVIF
TSGSTGQPKG TLLEHRAYVS GALAHGPVFG LNSSTRVLQF ASHAFDASLV DILSSLIFGS
CICIPSEEAR LNDIAGVINE MKVNHASLTP SFVGFLDPAA VPGLESLVLA GEAMSPQHLA
TWSHIKLVNG YGPTESSVAA ALNPNMSSSS DCRDIGLPVG VRFWVVNPAN HDQLVPVGCP
GELVLEGPTL ARCYINNPQK TSDSFIFNPC WTKRDPNGGS DRRFYKTGDL VRYNSESGSL
TYIGRKDAQV KFHGQRVELG EIESQLSADT DIKHCTVLLP KSGFAQGKLV TVVSLSAGPG
QALEADAVPL KLIEHREKLR YVKSIQERLS IRLPTYMVPG VWLCVEALPM LVSGKLDRKS
IATWVASMSE DPEVHATEAA NARAANSTED QLVSIWSRVL NVPKDRISVD ESFLSLGGDS
IAAITCVGHC KKQGIGLTVQ EILRSKSIRE LATRVKGVTQ PVAYHEMIEE PFGLSPIQKL
HFMVRKEGQG YFNQSVVTRI DRQINDQDMR RAVEAVVMRH SMLRSRLVDP STGNSLQLRI
TEDVAGSYRW RTHYMTAQNE IENAIAESQL CINAFVGPVF AVDFCYVDED SHNLLSLVAH
HLVVDIVSWR IILEDLEDFL LNPQGFVLQN SSLPFQTWCR LQDEQCESVA FENDVQLEDL
PAPDLAYWGM EHRQMTYGDV ICETFELDPG STQSILLECH QSLRTEPVDL FLAALVHSFG
QTFPERTLPV IYNEGHGREV WDSSLDISRT VGWFTTLYPI FVQEIVSEDP ARTVARVKDL
RRQVSDNGRQ KFASRMFTGK GQQTCRHHYP LEMTFNYVGQ HRDLQKQDGL FQLMGHMAGE
AGQGGGAADF GEETPRFALL EISALVVQGQ LRFTFSFNRF MRHQSGIHAW ISRCHQLLAS
LGQKLQSLAP QPTLSDFPML SLTYEELDKM VSAKLPIAGI TSLDLVEDVY PCSRMQQGIL
LSQSRNTSVY AVHDTFEVKG VGIKPSVDRL IIAWQKVVSR HAMLRTVFLE NLTSQDLFCQ
VVLKEYNPAP ALLSCSEERD VLPTFDNQQP VNYRDPRPAH RFSICETANG KLFCRLEISH
AAMDGTSISL IVRDLQSAYS GQLQEDRKPM FKDYMRYLQS CSHSAGLNYW LPYLSDVKPC
HFPVLNDGRP SNKRLQVIRL DFSSLKELQA LCESCGLTLS TAFSTAWGLT LRSFCGSNEV
CFSYMASLRD VSVDEIGSVV GPVINLLACR MKVTEDVCLE DVLHQVQNDY MESLPYRHTS
LIDIQHALKL SDTILLNSGI SYRKLPPKTL SNRDEMRLVE VGKIHDPAEF PVYVNIEATD
DVAYIDLNYW TTSLSEGQAQ NVASTFLQSL ENIIHHHDEK ICRLDQLSAQ NKQQIAVWNN
TLPKAVEKCI HEILEDKVKQ CPEAVAIAAW DGNLTYAKLN ELSSLLAFYL TKLGVGPGLL
VPIDLDKSSW QIVAILAVLR AGGICLPVDA AQPYEFIEKL LIDKDIQVAL ASPNKAQLLE
RTIPYVVPVG RSLFDYLPRF DDMPHVSHKA MDHAYVVFTG GSVKEPKGVM LQHLTVLTRA
ENFASALELN KATKVYQSAT YTSDMFLNVL FGTMMRGGCV CIPANDGFNN LPRSINASRA
NTVVMTPSLA SLLQPSEVPE VQLLALYGEI LTNQVRTIWS EKVRTHSLYG AAECSSSCIH
ASDCQTLGET RNLGLAAGCI TWLVNLSDHD LLVPIGSVGE VVIEGPVIAS GYLLHNGHVK
GGFIENPVWR MNFERDEPSN DLEKRDESST LTRRMFKTGD LARYNSDGIL VYMGRKERQT
QRLQADIWDV QQCIDTFSLP GHPCVVEPIR CLDDDESVEH LAVFVQFAST HLEKADGQRS
VIGQPSSQFF DSVTKMHTYL LSVLPVTQVP RLYIPVPSMP LTSTGLLDRW FLRNEAQNLP
AQTRIEFDLK SFHDFWRVEL AHPKPSPSQL LPSSTSATHR SSGTSTYEWN GHIEWRDISK
LESASLEAAL LAAWALTISG YTRSDDVIFG ELLLEQDSSG LDSTSDKAAP VVVPRRLQIA
EDMSIAELMR KTQERLVAAS PFQRAGLQRI RNVSADTSRA CSFNNLFCFT RFDCKVQTLA
LAYPLGIFCI VANSELQLSA CYDEQILSAP QVERILVQFA RYVEYLKADL RSQETIGDMA
LRKNQTSYLS SPETVYWRKY LADVESCVFP SLNPDGERSG FSSAKLAIEN LADLRRLCQK
IEVTEDIVLQ LVWGLVLRCY TGSEEVCYGY YQASPQPEGS KYLKVLPSRF LLKDDSDLES
IAQQRKSELD EAMEHPISQI ELQLELGFDW YSLLNTVFKF DRFAELPNDN NSTLDLLNDT
EKGIWTIVVN PRFSFVSADI LFEYRTDALS EANITSVVDC FQHILEEIIN NDPVGHKIGD
INFFGERACQ QVREWNAALP ERPDRCAHEI IEQQVLSHPT SPAICSWDGE FTYEQLDRLS
TKLAKHLVCL GVKPEIFVGL CFEKSAWAVI AQVAVLKAGG AFASLDPSHP DARLRGLVDD
IGAHIFLCSA KYLDKARQIS RAAYIVSEET LAELPDVSST ASMTRPSIHN AAYAIFTSGT
TGKPKVTVLE HIALSVSSPA FARSMGMDTT TRALQFSSYT FDVSIKEIII VLMTGGCVCV
PSDEERMNDL SGAIRRLNAN FISCPPSVSN TIQPESVPSV KTVVMGGEKM TASHIDRWGD
RFVINAYGPS ESTVMATMSV KVDEAGVRVN NDCNSIGAAI CGRTWVVDPN NYQRLLPIGA
VGELVLEGCN VARGYLNNDQ KTKESFISDP AWTKAPGLKE LFKRKERMYR TGDLVRYNPD
GTICFISRKD TQIKFNGQRI ELEEIEQQCI SFLSGGTQVA VEVVEPESKA VARSIAAFFT
VDNQSGQDRP DLESQLLVPM SEATREKVQK LREALIKALP PIMIPRLFFP VSHLPFSNSG
KLDRKKLRAT VETLPKDQLK SYATLTAGSR QASDEGVEGT LRSLWEEALG LASGSVSAED
SFFSLGGDSF SAMKLVGAAN SQGISLTFAD VYEDPVFMNM AKRCGMLQGR SGRQTVTPFS
LLPASVDREQ LLEEVAEQCG VPRASIVDLY PCSPVQEGLL TLSVKQNGAY IAQPIFRLSE
GIDLDMFKAA WQQVVDELDI LRTRIVHTES LNFLQAVIDK EEISWASATT LDELTAESPE
LPRHNGGRLT GYAIAASQTG RYFCWTIHHA LYDGWSIPLV LRRVEEVYTN STASARTVPY
SLFINYLLER SMADSDEYWK SQLANLSCSP FPQSRNPLPD SVRVGNRHHS SMKISRAASG
VDLTIPELIR AAWAIVVSAH TGSSDVCFGE TLMGRNIDLA GVTDIAGPVL TTVPTRIQVD
NELPITQYLE NMHHLTTTML PHQHSGLQQI RKLNSDTASA CEFQNLLVIQ TGEGQLNKAL
WVAEPIQTSG DFFTHPLVVE CKVDTSEVSI TMHHDEIVLN SWQTEKLIGQ FSFVLEQLLS
INKGETRKLS ELEIFSPLDS KEVALWNKRH PEAVEKCAHD IISERCSTHP DAPAVCAWDG
EVSYKEMYTL ASSFASYLAC RGVGPETLVP ICLDKSLWAI ITILGILIAG GAYVPLDPAH
PTSRHEEILT EVDARILICS PQYQSRYSSI VKTIIPVSKE TIRAYFALNY QAKGLRRVTP
FNMAYAIFTS GSTGRAKGII IDHRALASSA MAFGPIVHLN ETSRAFQFAS LTFDAAVMEI
LATLMHGGCI CIPSEDERLN DVAGAIRRMN VTWTFLTPSI ASIIEPSTVP SLEVLACGGE
KLSREVVTKW AHRVKLINGY GPTETTIFAV LNNVSPTTDP ACIGYGIPCT LTWVVDPENH
DRLTPLGAIG ELALEGPALA REYLKNPKKT AEAFVDEPAW MKHFQSTLPS PRRIYKTGDL
VRYNPDGSVE YISRKDYQVK LHGQRMELGE IEHRLHEDDR VRHAIVILPK EGLLKGRLVT
ILSLNSLKSG SSIISDNACE LISREDLARV AYSELITIQK NLEAQLPIYM VPQTWAVIKK
LPMLVSGKLD RKKITHWIEN IDEPTYDRIM QDYDNIKRGH VEDSVNEDKS TAAKILQDIY
AQVLNLPSNK VDPKRSFVSL GGDSITGMAV ISRARKQGLN LTLHKILQSK SIVELIQAAE
VETSSIQVEE KANKYFSLSP IQNLYFKSAR TFKETGRFNQ GMTVRVTRKV EPNVVKDALK
AVASQHSMLR ARFSRSANGK WQQRITNDIE SSVRVGIHSV MSSHEMLGKI ANTQSSLDIE
NGPIIAADLF TVNGEQVLFL VANHLCVDMV SWRIILQDIQ EVIEAGSLSS EKPFSFQSWC
ELQLENSRSE ADKAKLPFAI EPPNLSYWGM ESVPNHYGQI RMESFVVGED TTSFILGDCH
EMLRTETIDI LLAAVAQSFR RVFTDRRMPT IYNEGHGRES WDSNLDLSRT VGWFTTLCPL
QVDECSGSDF VDTTKRVKDL RRKIKDNGRS YFARSLLQAN NTEPSDFPVP LEIVFNYLGR
LQQLERDDSL FKHYGEAFDE EKFRLAGDMG SDTPRFALLE ISALVVNDKL QVSFTYNRQM
QRESQIFQWI SECRRVLEID VLRFKDTVPE PTLSDFPLLP ITYDGLKKLT STTLPRAGVK
TFSQVEDIYP CSSVQEGILL SQLRDPSAYM FHVIFEVRSP GGSGKVDPNM LRSAWSAVIN
RHPILRTLFI DSNYANGTFD QLVLRKVDEG AIILHCNDSD ALVKLDTIKL SEINAGRCPK
LLHQLTVCST DSGRVLIKLE MNHAIIDGGS VDLLLRDLAM AYKHQLPEGS GPHFSDYIKF
VRGKSQSQAL SHWRQYLSDA HPCHLTFSEG TGGSRQLGSV MVPFSRYSEL QQFCEKNSVT
LANLTLAAWA IVLQSFTGSN DVCFGYPSAG RDAPVPGIQD AVGIFLNMLC CRVRFSPGKT
LLEVSKTVQD DYIKNLPYQD CSLASIQHEL GQKELFNTTI SIQNHHAVSE ESGNDLLSFD
VQTAHDPTEY PVTVNVETAK GHEGILLRYW TDAVPEDKAN NLANAIAHIF SCFIDKPSQL
VSELDLRGGQ LMKGGQFIDS KSLQELIDRR VTEIISQMLK EGTLAIPAAD NGKGQFKGTT
RAQPAKVTIK ARGMHRERDL SDSTATLTED HSKLMEPEKR LWKIWCSALG LASDTIQRQA
SFFKLGGDSI TAMKMVSAAR EDGLTLTMAD VFNNPVFEDM LAAISASNSS SALEPDSPAD
SNNEKPAEPP RLVELERNPP PPEISLLKTV PLNDSALQDG ICPKIGVFKG GIADVLPVTD
FQAMSITATL FKSRWMLNYF FFDGRGSLDL RRLRESLLRV VDAFDILRTV FVCFNGQFFQ
VVLRKIRPNI FVHETEKNLD EYTEYLQQQD REQEARQGEQ YVKFYIVKKK GSNHHRILIR
MSHAQFDGLC LPTIMSAIKL GYEGSTLPPA PSFANYMRML SGAITPDHYQ HWTNLLKGSK
MTQVIRRTEE NTYRYIGAFR EQRKTLEIQP SVLENVTIAT VMQAAWAVTL AKLSAQSDVV
FGLTISGRNT SIPGIENTVG PCLNVIPIRV TFKEGWTGVD LFRYLQDQQI ANMPFEALGF
REIIRRCTDW PSSTYFTTSV FHQNVEYEGQ MQLDNTTYRM GGAGVVDNFT DMTLFSKSSS
DGKLGVSLGY SDKGPIGPKF AAKVLSMVCD TVQSLLANSR VGLPSPSMLR SLPCQSVHDV
PGMTDEMFLS THLKDRSISD ILVHSDVVTQ AWQQVLPRNN ADEPQSSFQL DSSFFELGGD
VFDMAQVVWL LEQEGLQVHI EDLLEHPTFL GQMAVLTLHN SRTSDSMEEI VPVEEIPLPA
ARTGTSSSLG KALTLAKKVT RWSALSARG