ARO1_YARLI
ID ARO1_YARLI Reviewed; 1556 AA.
AC Q6C1X5;
DT 05-APR-2011, integrated into UniProtKB/Swiss-Prot.
DT 16-AUG-2004, sequence version 1.
DT 03-AUG-2022, entry version 129.
DE RecName: Full=Pentafunctional AROM polypeptide {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=DHQS {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.3.4 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-phosphoshikimate 1-carboxyvinyltransferase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.5.1.19 {ECO:0000255|HAMAP-Rule:MF_03143};
DE AltName: Full=5-enolpyruvylshikimate-3-phosphate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSP synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSPS {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate kinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=SK {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.7.1.71 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate dehydratase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=3-dehydroquinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.1.10 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate dehydrogenase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=1.1.1.25 {ECO:0000255|HAMAP-Rule:MF_03143};
GN Name=ARO1 {ECO:0000255|HAMAP-Rule:MF_03143};
GN OrderedLocusNames=YALI0F12639g;
OS Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida lipolytica).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Dipodascaceae; Yarrowia.
OX NCBI_TaxID=284591;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CLIB 122 / E 150;
RX PubMed=15229592; DOI=10.1038/nature02579;
RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., Lafontaine I.,
RA de Montigny J., Marck C., Neuveglise C., Talla E., Goffard N., Frangeul L.,
RA Aigle M., Anthouard V., Babour A., Barbe V., Barnay S., Blanchin S.,
RA Beckerich J.-M., Beyne E., Bleykasten C., Boisrame A., Boyer J.,
RA Cattolico L., Confanioleri F., de Daruvar A., Despons L., Fabre E.,
RA Fairhead C., Ferry-Dumazet H., Groppi A., Hantraye F., Hennequin C.,
RA Jauniaux N., Joyet P., Kachouri R., Kerrest A., Koszul R., Lemaire M.,
RA Lesur I., Ma L., Muller H., Nicaud J.-M., Nikolski M., Oztas S.,
RA Ozier-Kalogeropoulos O., Pellenz S., Potier S., Richard G.-F.,
RA Straub M.-L., Suleau A., Swennen D., Tekaia F., Wesolowski-Louvel M.,
RA Westhof E., Wirth B., Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M.,
RA Thierry A., Bouchier C., Caudron B., Scarpelli C., Gaillardin C.,
RA Weissenbach J., Wincker P., Souciet J.-L.;
RT "Genome evolution in yeasts.";
RL Nature 430:35-44(2004).
CC -!- FUNCTION: The AROM polypeptide catalyzes 5 consecutive enzymatic
CC reactions in prechorismate polyaromatic amino acid biosynthesis.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate = 3-
CC dehydroquinate + phosphate; Xref=Rhea:RHEA:21968, ChEBI:CHEBI:32364,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:58394; EC=4.2.3.4;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-dehydroquinate = 3-dehydroshikimate + H2O;
CC Xref=Rhea:RHEA:21096, ChEBI:CHEBI:15377, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:32364; EC=4.2.1.10; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=NADP(+) + shikimate = 3-dehydroshikimate + H(+) + NADPH;
CC Xref=Rhea:RHEA:17737, ChEBI:CHEBI:15378, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:57783, ChEBI:CHEBI:58349; EC=1.1.1.25;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + shikimate = 3-phosphoshikimate + ADP + H(+);
CC Xref=Rhea:RHEA:13121, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:145989, ChEBI:CHEBI:456216;
CC EC=2.7.1.71; Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-phosphoshikimate + phosphoenolpyruvate = 5-O-(1-
CC carboxyvinyl)-3-phosphoshikimate + phosphate; Xref=Rhea:RHEA:21256,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:57701, ChEBI:CHEBI:58702,
CC ChEBI:CHEBI:145989; EC=2.5.1.19; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Note=Binds 2 Zn(2+) ions per subunit.;
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 2/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 3/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 4/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 5/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 6/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBUNIT: Homodimer. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the N-terminal section; belongs to the sugar phosphate
CC cyclases superfamily. Dehydroquinate synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 2nd section; belongs to the EPSP synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 3rd section; belongs to the shikimate kinase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 4th section; belongs to the type-I 3-dehydroquinase
CC family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the C-terminal section; belongs to the shikimate
CC dehydrogenase family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CR382132; CAG78144.1; -; Genomic_DNA.
DR RefSeq; XP_505337.1; XM_505337.1.
DR AlphaFoldDB; Q6C1X5; -.
DR SMR; Q6C1X5; -.
DR STRING; 4952.CAG78144; -.
DR PRIDE; Q6C1X5; -.
DR EnsemblFungi; CAG78144; CAG78144; YALI0_F12639g.
DR GeneID; 2908044; -.
DR KEGG; yli:YALI0F12639g; -.
DR VEuPathDB; FungiDB:YALI0_F12639g; -.
DR HOGENOM; CLU_001201_1_2_1; -.
DR InParanoid; Q6C1X5; -.
DR OMA; YCYDDHR; -.
DR UniPathway; UPA00053; UER00085.
DR UniPathway; UPA00053; UER00086.
DR UniPathway; UPA00053; UER00087.
DR UniPathway; UPA00053; UER00088.
DR UniPathway; UPA00053; UER00089.
DR Proteomes; UP000001300; Chromosome F.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003855; F:3-dehydroquinate dehydratase activity; IBA:GO_Central.
DR GO; GO:0003856; F:3-dehydroquinate synthase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003866; F:3-phosphoshikimate 1-carboxyvinyltransferase activity; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004764; F:shikimate 3-dehydrogenase (NADP+) activity; IBA:GO_Central.
DR GO; GO:0004765; F:shikimate kinase activity; IBA:GO_Central.
DR GO; GO:0009073; P:aromatic amino acid family biosynthetic process; IEA:UniProtKB-UniRule.
DR GO; GO:0008652; P:cellular amino acid biosynthetic process; IEA:UniProtKB-KW.
DR GO; GO:0009423; P:chorismate biosynthetic process; IBA:GO_Central.
DR GO; GO:0016310; P:phosphorylation; IEA:UniProtKB-KW.
DR CDD; cd00502; DHQase_I; 1.
DR CDD; cd01556; EPSP_synthase; 1.
DR CDD; cd00464; SK; 1.
DR Gene3D; 3.20.20.70; -; 1.
DR Gene3D; 3.40.50.300; -; 1.
DR Gene3D; 3.65.10.10; -; 2.
DR HAMAP; MF_00210; EPSP_synth; 1.
DR HAMAP; MF_03143; Pentafunct_AroM; 1.
DR HAMAP; MF_00222; Shikimate_DH_AroE; 1.
DR HAMAP; MF_00109; Shikimate_kinase; 1.
DR InterPro; IPR018508; 3-dehydroquinate_DH_AS.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR046346; Aminiacid_DH-like_N_sf.
DR InterPro; IPR016037; DHQ_synth_AroB.
DR InterPro; IPR030960; DHQS/DOIS.
DR InterPro; IPR001381; DHquinase_I.
DR InterPro; IPR001986; Enolpyruvate_Tfrase_dom.
DR InterPro; IPR036968; Enolpyruvate_Tfrase_sf.
DR InterPro; IPR006264; EPSP_synthase.
DR InterPro; IPR023193; EPSP_synthase_CS.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR008289; Pentafunct_AroM.
DR InterPro; IPR013792; RNA3'P_cycl/enolpyr_Trfase_a/b.
DR InterPro; IPR041121; SDH_C.
DR InterPro; IPR031322; Shikimate/glucono_kinase.
DR InterPro; IPR013708; Shikimate_DH-bd_N.
DR InterPro; IPR010110; Shikimate_DH_AroM-type.
DR InterPro; IPR022893; Shikimate_DH_fam.
DR InterPro; IPR000623; Shikimate_kinase/TSH1.
DR InterPro; IPR023000; Shikimate_kinase_CS.
DR InterPro; IPR006151; Shikm_DH/Glu-tRNA_Rdtase.
DR Pfam; PF01761; DHQ_synthase; 1.
DR Pfam; PF01487; DHquinase_I; 1.
DR Pfam; PF00275; EPSP_synthase; 1.
DR Pfam; PF18317; SDH_C; 1.
DR Pfam; PF01488; Shikimate_DH; 1.
DR Pfam; PF08501; Shikimate_dh_N; 1.
DR Pfam; PF01202; SKI; 1.
DR PIRSF; PIRSF000514; Pentafunct_AroM; 1.
DR SUPFAM; SSF51735; SSF51735; 1.
DR SUPFAM; SSF52540; SSF52540; 1.
DR SUPFAM; SSF53223; SSF53223; 1.
DR SUPFAM; SSF55205; SSF55205; 1.
DR TIGRFAMs; TIGR01356; aroA; 1.
DR TIGRFAMs; TIGR01357; aroB; 1.
DR TIGRFAMs; TIGR01093; aroD; 1.
DR TIGRFAMs; TIGR01809; Shik-DH-AROM; 1.
DR PROSITE; PS01028; DEHYDROQUINASE_I; 1.
DR PROSITE; PS00104; EPSP_SYNTHASE_1; 1.
DR PROSITE; PS00885; EPSP_SYNTHASE_2; 1.
DR PROSITE; PS01128; SHIKIMATE_KINASE; 1.
PE 3: Inferred from homology;
KW Amino-acid biosynthesis; Aromatic amino acid biosynthesis; ATP-binding;
KW Cytoplasm; Kinase; Lyase; Metal-binding; Multifunctional enzyme; NADP;
KW Nucleotide-binding; Oxidoreductase; Reference proteome; Transferase; Zinc.
FT CHAIN 1..1556
FT /note="Pentafunctional AROM polypeptide"
FT /id="PRO_0000406747"
FT REGION 1..387
FT /note="3-dehydroquinate synthase"
FT REGION 400..837
FT /note="EPSP synthase"
FT REGION 858..1049
FT /note="Shikimate kinase"
FT REGION 1050..1266
FT /note="3-dehydroquinase"
FT REGION 1279..1556
FT /note="Shikimate dehydrogenase"
FT ACT_SITE 263
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 278
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 819
FT /note="For EPSP synthase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1171
FT /note="Proton acceptor; for 3-dehydroquinate dehydratase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1200
FT /note="Schiff-base intermediate with substrate; for 3-
FT dehydroquinate dehydratase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 46..48
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 84..87
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 115..117
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 120
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 131
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 140..141
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 147
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 153
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 162
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 163
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 180..183
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 191
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 195..198
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 195
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 253
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 267..271
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 274
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 274
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 290
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 290
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 359
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 864..871
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
SQ SEQUENCE 1556 AA; 167465 MW; B23E27F7E54A9C5C CRC64;
MFAEGQIQKV PILGKESIHI GYKMQDHIVS EIVANIKSST YILVTDTNIE DLGYVESLKT
KFEAAFAKDG IKSRLLTYTV APGETSKSRA TKAAIEDWML SKGCTRDTVI LAVGGGVIGD
MIGYVAATFM RGVRFVQIPT TLLAMVDSSI GGKTAIDTPL GKNLVGAFWQ PVNIFIDTSF
LETLPVREFI NGMAEVIKTA AFYDAEEFTR LESASEIFLS TIKKRDAKDP RRVDLSPITD
TIGRIVLGSA RIKAAVVSAD EREGGLRNLL NFGHSIGHAY EAILTPYILH GECVAIGMVK
EAELSRYLGI LSPVAVARLA KCIKAYELPV SLDDATVKAR SHGKKCPVDD LLRIMGVDKK
NDGSTKKIVI LSAIGKTHEQ KASSVADKDI RFVLSEEVIV GEAPVGDKKS YTVTPPGSKS
ISNRAFVLTA LGKGPCKLRN LLHSDDTQHM LEAIELLGGA SFEWEADGET LLVTGNGGKL
TAPAQELYLG NAGTASRFLT TAATLVQKGD KDHVILTGNK RMQERPIGPL VDALRSNGAD
IAFQNAEGSL PLKIEAGVGL KGGLIEVAAT VSSQYVSSLL MCAPYAQTPV TLSLVGGKPI
SQFYIDMTIA MMADFGVVVT KDETKEHTYH IPQGVYTNPE EYVVESDASS ATYPLAYAAM
TGHTVTVPNI GSKSLQGDAR FAIDVLKAMG CTVEQTATST TVTGVPNLKA IAVDMEPMTD
AFLTACVVAA VSEGTTVITG IANQRVKECN RIEAMRVQLA KYGVVCRELE DGIEVDGISR
SDLKTPVSVH SYDDHRVAMS FSLLSSIMAA PVAIEERRCV EKTWPGWWDV LSGVFNVPLE
GVTLAKTVSK AESGLSKPSI FIVGMRGAGK THLGAQAANH LGYEFIDLDQ LLEKDLDTTI
PQLIADKGWD HFRAEELRLL KQCLNDKSEG YVISCGGGVV ETPAARDALQ TFKGVGGIVL
HVHRPVSRIL EYLNKDQSRP AFVDDLEAVW QRRKELYRSV SSNVFFAPHC DSAEATAKVQ
QMLGAFLDRV TGKSEFVIPH KDQFTSFLSL TFPDVSIAAT MLPSLSEGCS ALELRVDLLN
ENDEAIPSEE YVLSQLAILR QNVDLPILYT VRTKAQGGRF PDDKPVELAN LVNLGLKTAV
ELLDVELTYP AELVSSVGAS RGYTKLLGSH HDFPGALNWS SLEWENMYAR AEAVPVDVVK
LVGMAKSFSD NFALENFREA HTSSPLLAIN MGSHGQLSRV TNTLLTPVTH ADLPVAAAPG
QLSVEEINQT RSTIGMFNKN LSFFIVGTPI GHSKSPILHN TMFKKLGLPY EYSRFKTDDA
AAVNAKARAL LAQGNLGGIS VTIPLKQDII PFLDEVSPLA QQIGAVNTII PGPNGTLKGD
NTDILGLVNA LTRFGANSLD KKTALIVGAG GTSLAAVHGL RSLGFAKILI ANRTLSKAEA
IADKFDNVEA VTLDSFVANK YTPSVIVSCV PATTFSMLDE SNKLVSAALA ASPKGLVLEA
AYSAEATPLL KQVMDVEGWE FISGLYMLTE QGFEQFRLWT GIPAPKEVGE KAVLGN