ARO1_CRYNJ
ID ARO1_CRYNJ Reviewed; 1611 AA.
AC P0CM22; Q55XJ0; Q5KME5;
DT 28-JUN-2011, integrated into UniProtKB/Swiss-Prot.
DT 28-JUN-2011, sequence version 1.
DT 03-AUG-2022, entry version 68.
DE RecName: Full=Pentafunctional AROM polypeptide {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=DHQS {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.3.4 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-phosphoshikimate 1-carboxyvinyltransferase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.5.1.19 {ECO:0000255|HAMAP-Rule:MF_03143};
DE AltName: Full=5-enolpyruvylshikimate-3-phosphate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSP synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSPS {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate kinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=SK {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.7.1.71 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate dehydratase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=3-dehydroquinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.1.10 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate dehydrogenase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=1.1.1.25 {ECO:0000255|HAMAP-Rule:MF_03143};
GN OrderedLocusNames=CNB01990;
OS Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / ATCC
OS MYA-565) (Filobasidiella neoformans).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Tremellales; Cryptococcaceae; Cryptococcus;
OC Cryptococcus neoformans species complex.
OX NCBI_TaxID=214684;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JEC21 / ATCC MYA-565;
RX PubMed=15653466; DOI=10.1126/science.1103773;
RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D.,
RA Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E.,
RA Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J., D'Souza C.A.,
RA Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J., Huang J.C.,
RA Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I., Kwon-Chung K.J.,
RA Lengeler K.B., Maiti R., Marra M.A., Marra R.E., Mathewson C.A.,
RA Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L., Schein J.E.,
RA Shvartsbeyn A., Shin H., Shumway M., Specht C.A., Suh B.B., Tenney A.,
RA Utterback T.R., Wickes B.L., Wortman J.R., Wye N.H., Kronstad J.W.,
RA Lodge J.K., Heitman J., Davis R.W., Fraser C.M., Hyman R.W.;
RT "The genome of the basidiomycetous yeast and human pathogen Cryptococcus
RT neoformans.";
RL Science 307:1321-1324(2005).
CC -!- FUNCTION: The AROM polypeptide catalyzes 5 consecutive enzymatic
CC reactions in prechorismate polyaromatic amino acid biosynthesis.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate = 3-
CC dehydroquinate + phosphate; Xref=Rhea:RHEA:21968, ChEBI:CHEBI:32364,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:58394; EC=4.2.3.4;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-dehydroquinate = 3-dehydroshikimate + H2O;
CC Xref=Rhea:RHEA:21096, ChEBI:CHEBI:15377, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:32364; EC=4.2.1.10; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=NADP(+) + shikimate = 3-dehydroshikimate + H(+) + NADPH;
CC Xref=Rhea:RHEA:17737, ChEBI:CHEBI:15378, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:57783, ChEBI:CHEBI:58349; EC=1.1.1.25;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + shikimate = 3-phosphoshikimate + ADP + H(+);
CC Xref=Rhea:RHEA:13121, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:145989, ChEBI:CHEBI:456216;
CC EC=2.7.1.71; Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-phosphoshikimate + phosphoenolpyruvate = 5-O-(1-
CC carboxyvinyl)-3-phosphoshikimate + phosphate; Xref=Rhea:RHEA:21256,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:57701, ChEBI:CHEBI:58702,
CC ChEBI:CHEBI:145989; EC=2.5.1.19; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Note=Binds 2 Zn(2+) ions per subunit.;
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 2/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 3/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 4/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 5/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 6/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBUNIT: Homodimer. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the N-terminal section; belongs to the sugar phosphate
CC cyclases superfamily. Dehydroquinate synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 2nd section; belongs to the EPSP synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 3rd section; belongs to the shikimate kinase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 4th section; belongs to the type-I 3-dehydroquinase
CC family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the C-terminal section; belongs to the shikimate
CC dehydrogenase family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE017342; AAW41820.1; -; Genomic_DNA.
DR RefSeq; XP_569127.1; XM_569127.1.
DR AlphaFoldDB; P0CM22; -.
DR SMR; P0CM22; -.
DR STRING; 5207.AAW41820; -.
DR PaxDb; P0CM22; -.
DR EnsemblFungi; AAW41820; AAW41820; CNB01990.
DR GeneID; 3255768; -.
DR KEGG; cne:CNB01990; -.
DR VEuPathDB; FungiDB:CNB01990; -.
DR eggNOG; KOG0692; Eukaryota.
DR HOGENOM; CLU_001201_1_2_1; -.
DR InParanoid; P0CM22; -.
DR OMA; YCYDDHR; -.
DR OrthoDB; 39786at2759; -.
DR UniPathway; UPA00053; UER00085.
DR UniPathway; UPA00053; UER00086.
DR UniPathway; UPA00053; UER00087.
DR UniPathway; UPA00053; UER00088.
DR UniPathway; UPA00053; UER00089.
DR Proteomes; UP000002149; Chromosome 2.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003855; F:3-dehydroquinate dehydratase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003856; F:3-dehydroquinate synthase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003866; F:3-phosphoshikimate 1-carboxyvinyltransferase activity; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004764; F:shikimate 3-dehydrogenase (NADP+) activity; IEA:UniProtKB-UniRule.
DR GO; GO:0004765; F:shikimate kinase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0009073; P:aromatic amino acid family biosynthetic process; IEA:UniProtKB-UniRule.
DR GO; GO:0008652; P:cellular amino acid biosynthetic process; IEA:UniProtKB-KW.
DR GO; GO:0009423; P:chorismate biosynthetic process; IBA:GO_Central.
DR GO; GO:0016310; P:phosphorylation; IEA:UniProtKB-KW.
DR CDD; cd00502; DHQase_I; 1.
DR CDD; cd01556; EPSP_synthase; 1.
DR CDD; cd00464; SK; 1.
DR Gene3D; 3.20.20.70; -; 1.
DR Gene3D; 3.40.50.300; -; 1.
DR Gene3D; 3.65.10.10; -; 2.
DR HAMAP; MF_00210; EPSP_synth; 1.
DR HAMAP; MF_03143; Pentafunct_AroM; 1.
DR HAMAP; MF_00109; Shikimate_kinase; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR046346; Aminiacid_DH-like_N_sf.
DR InterPro; IPR016037; DHQ_synth_AroB.
DR InterPro; IPR030960; DHQS/DOIS.
DR InterPro; IPR001381; DHquinase_I.
DR InterPro; IPR001986; Enolpyruvate_Tfrase_dom.
DR InterPro; IPR036968; Enolpyruvate_Tfrase_sf.
DR InterPro; IPR006264; EPSP_synthase.
DR InterPro; IPR023193; EPSP_synthase_CS.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR008289; Pentafunct_AroM.
DR InterPro; IPR013792; RNA3'P_cycl/enolpyr_Trfase_a/b.
DR InterPro; IPR041121; SDH_C.
DR InterPro; IPR031322; Shikimate/glucono_kinase.
DR InterPro; IPR013708; Shikimate_DH-bd_N.
DR InterPro; IPR010110; Shikimate_DH_AroM-type.
DR InterPro; IPR000623; Shikimate_kinase/TSH1.
DR InterPro; IPR023000; Shikimate_kinase_CS.
DR InterPro; IPR006151; Shikm_DH/Glu-tRNA_Rdtase.
DR Pfam; PF01761; DHQ_synthase; 1.
DR Pfam; PF01487; DHquinase_I; 1.
DR Pfam; PF00275; EPSP_synthase; 1.
DR Pfam; PF18317; SDH_C; 1.
DR Pfam; PF01488; Shikimate_DH; 1.
DR Pfam; PF08501; Shikimate_dh_N; 1.
DR Pfam; PF01202; SKI; 1.
DR PIRSF; PIRSF000514; Pentafunct_AroM; 1.
DR SUPFAM; SSF51735; SSF51735; 1.
DR SUPFAM; SSF52540; SSF52540; 1.
DR SUPFAM; SSF53223; SSF53223; 1.
DR SUPFAM; SSF55205; SSF55205; 1.
DR TIGRFAMs; TIGR01356; aroA; 1.
DR TIGRFAMs; TIGR01357; aroB; 1.
DR TIGRFAMs; TIGR01093; aroD; 1.
DR TIGRFAMs; TIGR01809; Shik-DH-AROM; 1.
DR PROSITE; PS00104; EPSP_SYNTHASE_1; 1.
DR PROSITE; PS00885; EPSP_SYNTHASE_2; 1.
DR PROSITE; PS01128; SHIKIMATE_KINASE; 1.
PE 3: Inferred from homology;
KW Amino-acid biosynthesis; Aromatic amino acid biosynthesis; ATP-binding;
KW Cytoplasm; Kinase; Lyase; Metal-binding; Multifunctional enzyme; NADP;
KW Nucleotide-binding; Oxidoreductase; Reference proteome; Transferase; Zinc.
FT CHAIN 1..1611
FT /note="Pentafunctional AROM polypeptide"
FT /id="PRO_0000406716"
FT REGION 1..391
FT /note="3-dehydroquinate synthase"
FT REGION 404..863
FT /note="EPSP synthase"
FT REGION 892..1093
FT /note="Shikimate kinase"
FT REGION 1094..1318
FT /note="3-dehydroquinase"
FT REGION 1331..1611
FT /note="Shikimate dehydrogenase"
FT ACT_SITE 267
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 282
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 845
FT /note="For EPSP synthase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1220
FT /note="Proton acceptor; for 3-dehydroquinate dehydratase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1248
FT /note="Schiff-base intermediate with substrate; for 3-
FT dehydroquinate dehydratase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 47..49
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 84..87
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 115..117
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 120
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 131
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 140..141
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 147
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 153
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 162
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 163
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 180..183
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 191
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 195..198
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 195
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 257
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 271..275
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 278
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 278
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 294
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 294
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 363
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 899..906
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
SQ SEQUENCE 1611 AA; 173474 MW; 4FFB44F62947B992 CRC64;
MSSSSADVLK ISILGNESIH VGFHLLPYIF KTITTTLPSS TYVLITDTNL SAIYLNDFKA
SFEEAASEAD NKARFLVYEV APGEGAKSRK VKGEIEDWML DNKCTRDTVI LAFGGGVIGD
LTGFVAATFM RGVKFVQIPT TLLAMVDSSV GGKTAIDTPH GKNLIGAFWQ PSYIFVDLAF
LTTLPTREVS NGMAEVIKTA AIWKDDDFAL LESRSAEISL AASSRPTGVP TAGRFVSDRS
HAQSLLLQVV SGSIYVKAHI VTIDERETGL RNLVNFGHTI GHAIEAVLTP AMLHGECVSV
GIVLEAEVAR QLGILSQVAV GRLTRCLQAY GLPVSLSDRR ITALPASSQL SVDRLLDIMK
IDKKNSGPAK KIVLLSRIGK TYEEKASVVA DDVISKVLCE AVTVKAATPT KSPITMATPG
SKSISNRALV LAALGKGTCR VRNLLHSDDT AVMMNALVEL KGAVFSWEDG GDTIVVEGGG
GILSTPAKGK ELYLGNAGTA SRFLTTVCAM VSGSASSERS TVITGNARMK QRPIGPLVDA
LTANGAKVKY LESTGCLPLD IDTDGFRGGH IQLAASVSSQ YVSSILLCAP YAAEQVTLEL
TGGQVISQPY IDMTIAMMKQ FGATVERQKD EQGNLLDIYV IPKCTYVNPP EYSVESDASS
ATYPLAIAAI TGTTCTISNI GSSSLQGDAR FAKEILEPMG CIVEQTLTST KVTGPPVGTL
RALGNVDMEP MTDAFLTASV LAAVAVKPCL PERKVEGLPE TASRIYGIAN QRVKECNRIQ
AMRDQLAKFG IETDEFDDGI IIFGKPEASL FRGASIHCYD DHRVAMAFAV LSCIIDETII
EEKRCVEKTW PNFWDDLQNK IGVAVEGVEL ETHNQASTSA KPVSPIDQSQ SDRPIFLIGM
RGAGKTYVGR MAADILSGQF TDADDVFAQE SHQTVSEFVA ANGWDEFRKK ETEILSKFVE
EHRGNHVIAL GGGIVETETA RETLKAHVAK GGHVVHVTRA LEDIEAYLDS IGNTAVRPNW
GETFADVFKR REPWYQACSS HEFYNVLEAV GGQTHEEHTK AMRAECGRFF KFITGRESNR
PRLSVGNPTS FLSLTFPDVT PALIHLDELT EGADAVEFRV DLLSTTGQAP TRPALPPISF
VAKQLASLRL ATTLPIVFSV RSKDQGGMVP SDNAEAYGAL VRLGLRCACE YVDLEVCWPE
QLLDSIVQLK RETHIIASWH DWTGDMAWDG EEMKAKHVLC EKYGDVAKIV GTAKSGLDNA
KLAIFVGEVQ SHPGAKPLLA INMGAAGQLS RILNPILTPV THDALPSRAA PGQLTAREIL
QARALTGSLP AKKFVLFGSP IAHSVSPLLH NAGFATLGLP HTYRLHESEK VDQGVLEVIR
SPDFGGASVT IPLKLDIIPH LDSVSEDAKI IGAVNTVIPR GGKLHGENTD WQAIHQAAAQ
NLDADALSYG SSTALVIGAG GTCRAAIYAM HKLRFKTIYL FNRTPENAAK VKASFPESYN
IAVVTSLSSL PEAPVVVVST VPGNSLTLDT FSQGIYLPSE VLSRPKGVAI DLAYKPHMTA
LLHAAEKKEG WKVVPGVEIL CLQGFKQFEE WTGKRAPQKK MRKAVLDKYF A