ARO1_USTMA
ID ARO1_USTMA Reviewed; 1715 AA.
AC Q4P8F6; A0A0D1E1P2;
DT 05-APR-2011, integrated into UniProtKB/Swiss-Prot.
DT 19-JUL-2005, sequence version 1.
DT 03-AUG-2022, entry version 123.
DE RecName: Full=Pentafunctional AROM polypeptide {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=DHQS {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.3.4 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-phosphoshikimate 1-carboxyvinyltransferase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.5.1.19 {ECO:0000255|HAMAP-Rule:MF_03143};
DE AltName: Full=5-enolpyruvylshikimate-3-phosphate synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSP synthase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=EPSPS {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate kinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=SK {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=2.7.1.71 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=3-dehydroquinate dehydratase {ECO:0000255|HAMAP-Rule:MF_03143};
DE Short=3-dehydroquinase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=4.2.1.10 {ECO:0000255|HAMAP-Rule:MF_03143};
DE Includes:
DE RecName: Full=Shikimate dehydrogenase {ECO:0000255|HAMAP-Rule:MF_03143};
DE EC=1.1.1.25 {ECO:0000255|HAMAP-Rule:MF_03143};
GN ORFNames=UMAG_03607;
OS Ustilago maydis (strain 521 / FGSC 9021) (Corn smut fungus).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago.
OX NCBI_TaxID=237631;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=521 / FGSC 9021;
RX PubMed=17080091; DOI=10.1038/nature05248;
RA Kaemper J., Kahmann R., Boelker M., Ma L.-J., Brefort T., Saville B.J.,
RA Banuett F., Kronstad J.W., Gold S.E., Mueller O., Perlin M.H.,
RA Woesten H.A.B., de Vries R., Ruiz-Herrera J., Reynaga-Pena C.G.,
RA Snetselaar K., McCann M., Perez-Martin J., Feldbruegge M., Basse C.W.,
RA Steinberg G., Ibeas J.I., Holloman W., Guzman P., Farman M.L.,
RA Stajich J.E., Sentandreu R., Gonzalez-Prieto J.M., Kennell J.C., Molina L.,
RA Schirawski J., Mendoza-Mendoza A., Greilinger D., Muench K., Roessel N.,
RA Scherer M., Vranes M., Ladendorf O., Vincon V., Fuchs U., Sandrock B.,
RA Meng S., Ho E.C.H., Cahill M.J., Boyce K.J., Klose J., Klosterman S.J.,
RA Deelstra H.J., Ortiz-Castellanos L., Li W., Sanchez-Alonso P.,
RA Schreier P.H., Haeuser-Hahn I., Vaupel M., Koopmann E., Friedrich G.,
RA Voss H., Schlueter T., Margolis J., Platt D., Swimmer C., Gnirke A.,
RA Chen F., Vysotskaia V., Mannhaupt G., Gueldener U., Muensterkoetter M.,
RA Haase D., Oesterheld M., Mewes H.-W., Mauceli E.W., DeCaprio D., Wade C.M.,
RA Butler J., Young S.K., Jaffe D.B., Calvo S.E., Nusbaum C., Galagan J.E.,
RA Birren B.W.;
RT "Insights from the genome of the biotrophic fungal plant pathogen Ustilago
RT maydis.";
RL Nature 444:97-101(2006).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=521 / FGSC 9021;
RA Gueldener U., Muensterkoetter M., Walter M.C., Mannhaupt G., Kahmann R.;
RL Submitted (SEP-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: The AROM polypeptide catalyzes 5 consecutive enzymatic
CC reactions in prechorismate polyaromatic amino acid biosynthesis.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate = 3-
CC dehydroquinate + phosphate; Xref=Rhea:RHEA:21968, ChEBI:CHEBI:32364,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:58394; EC=4.2.3.4;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-dehydroquinate = 3-dehydroshikimate + H2O;
CC Xref=Rhea:RHEA:21096, ChEBI:CHEBI:15377, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:32364; EC=4.2.1.10; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=NADP(+) + shikimate = 3-dehydroshikimate + H(+) + NADPH;
CC Xref=Rhea:RHEA:17737, ChEBI:CHEBI:15378, ChEBI:CHEBI:16630,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:57783, ChEBI:CHEBI:58349; EC=1.1.1.25;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + shikimate = 3-phosphoshikimate + ADP + H(+);
CC Xref=Rhea:RHEA:13121, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:36208, ChEBI:CHEBI:145989, ChEBI:CHEBI:456216;
CC EC=2.7.1.71; Evidence={ECO:0000255|HAMAP-Rule:MF_03143};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3-phosphoshikimate + phosphoenolpyruvate = 5-O-(1-
CC carboxyvinyl)-3-phosphoshikimate + phosphate; Xref=Rhea:RHEA:21256,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:57701, ChEBI:CHEBI:58702,
CC ChEBI:CHEBI:145989; EC=2.5.1.19; Evidence={ECO:0000255|HAMAP-
CC Rule:MF_03143};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Note=Binds 2 Zn(2+) ions per subunit.;
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 2/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 3/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 4/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 5/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- PATHWAY: Metabolic intermediate biosynthesis; chorismate biosynthesis;
CC chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step
CC 6/7. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBUNIT: Homodimer. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the N-terminal section; belongs to the sugar phosphate
CC cyclases superfamily. Dehydroquinate synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 2nd section; belongs to the EPSP synthase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 3rd section; belongs to the shikimate kinase family.
CC {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the 4th section; belongs to the type-I 3-dehydroquinase
CC family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC -!- SIMILARITY: In the C-terminal section; belongs to the shikimate
CC dehydrogenase family. {ECO:0000255|HAMAP-Rule:MF_03143}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003148; KIS68520.1; -; Genomic_DNA.
DR RefSeq; XP_011390030.1; XM_011391728.1.
DR AlphaFoldDB; Q4P8F6; -.
DR SMR; Q4P8F6; -.
DR STRING; 5270.UM03607P0; -.
DR EnsemblFungi; KIS68520; KIS68520; UMAG_03607.
DR GeneID; 23564017; -.
DR KEGG; uma:UMAG_03607; -.
DR VEuPathDB; FungiDB:UMAG_03607; -.
DR eggNOG; KOG0692; Eukaryota.
DR HOGENOM; CLU_001201_1_2_1; -.
DR InParanoid; Q4P8F6; -.
DR OMA; YCYDDHR; -.
DR OrthoDB; 39786at2759; -.
DR UniPathway; UPA00053; UER00085.
DR UniPathway; UPA00053; UER00086.
DR UniPathway; UPA00053; UER00087.
DR UniPathway; UPA00053; UER00088.
DR UniPathway; UPA00053; UER00089.
DR Proteomes; UP000000561; Chromosome 9.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003855; F:3-dehydroquinate dehydratase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003856; F:3-dehydroquinate synthase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003866; F:3-phosphoshikimate 1-carboxyvinyltransferase activity; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004764; F:shikimate 3-dehydrogenase (NADP+) activity; IEA:UniProtKB-UniRule.
DR GO; GO:0004765; F:shikimate kinase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0009073; P:aromatic amino acid family biosynthetic process; IEA:UniProtKB-UniRule.
DR GO; GO:0008652; P:cellular amino acid biosynthetic process; IEA:UniProtKB-KW.
DR GO; GO:0009423; P:chorismate biosynthetic process; IBA:GO_Central.
DR GO; GO:0016310; P:phosphorylation; IEA:UniProtKB-KW.
DR CDD; cd00502; DHQase_I; 1.
DR CDD; cd01556; EPSP_synthase; 1.
DR CDD; cd00464; SK; 1.
DR Gene3D; 3.20.20.70; -; 1.
DR Gene3D; 3.40.50.300; -; 1.
DR Gene3D; 3.65.10.10; -; 2.
DR HAMAP; MF_00210; EPSP_synth; 1.
DR HAMAP; MF_03143; Pentafunct_AroM; 1.
DR HAMAP; MF_00109; Shikimate_kinase; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR046346; Aminiacid_DH-like_N_sf.
DR InterPro; IPR016037; DHQ_synth_AroB.
DR InterPro; IPR030960; DHQS/DOIS.
DR InterPro; IPR001381; DHquinase_I.
DR InterPro; IPR001986; Enolpyruvate_Tfrase_dom.
DR InterPro; IPR036968; Enolpyruvate_Tfrase_sf.
DR InterPro; IPR006264; EPSP_synthase.
DR InterPro; IPR023193; EPSP_synthase_CS.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR008289; Pentafunct_AroM.
DR InterPro; IPR013792; RNA3'P_cycl/enolpyr_Trfase_a/b.
DR InterPro; IPR041121; SDH_C.
DR InterPro; IPR031322; Shikimate/glucono_kinase.
DR InterPro; IPR013708; Shikimate_DH-bd_N.
DR InterPro; IPR010110; Shikimate_DH_AroM-type.
DR InterPro; IPR000623; Shikimate_kinase/TSH1.
DR InterPro; IPR023000; Shikimate_kinase_CS.
DR InterPro; IPR006151; Shikm_DH/Glu-tRNA_Rdtase.
DR Pfam; PF01761; DHQ_synthase; 1.
DR Pfam; PF01487; DHquinase_I; 1.
DR Pfam; PF00275; EPSP_synthase; 1.
DR Pfam; PF18317; SDH_C; 1.
DR Pfam; PF01488; Shikimate_DH; 1.
DR Pfam; PF08501; Shikimate_dh_N; 1.
DR Pfam; PF01202; SKI; 1.
DR PIRSF; PIRSF000514; Pentafunct_AroM; 1.
DR SUPFAM; SSF51735; SSF51735; 1.
DR SUPFAM; SSF52540; SSF52540; 1.
DR SUPFAM; SSF53223; SSF53223; 1.
DR SUPFAM; SSF55205; SSF55205; 1.
DR TIGRFAMs; TIGR01356; aroA; 1.
DR TIGRFAMs; TIGR01357; aroB; 1.
DR TIGRFAMs; TIGR01093; aroD; 1.
DR TIGRFAMs; TIGR01809; Shik-DH-AROM; 1.
DR PROSITE; PS00104; EPSP_SYNTHASE_1; 1.
DR PROSITE; PS00885; EPSP_SYNTHASE_2; 1.
DR PROSITE; PS01128; SHIKIMATE_KINASE; 1.
PE 3: Inferred from homology;
KW Amino-acid biosynthesis; Aromatic amino acid biosynthesis; ATP-binding;
KW Cytoplasm; Kinase; Lyase; Metal-binding; Multifunctional enzyme; NADP;
KW Nucleotide-binding; Oxidoreductase; Reference proteome; Transferase; Zinc.
FT CHAIN 1..1715
FT /note="Pentafunctional AROM polypeptide"
FT /id="PRO_0000406745"
FT REGION 1..421
FT /note="3-dehydroquinate synthase"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 434..895
FT /note="EPSP synthase"
FT REGION 948..1165
FT /note="Shikimate kinase"
FT REGION 1166..1389
FT /note="3-dehydroquinase"
FT REGION 1402..1715
FT /note="Shikimate dehydrogenase"
FT ACT_SITE 297
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 312
FT /note="Proton acceptor; for 3-dehydroquinate synthase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 877
FT /note="For EPSP synthase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1292
FT /note="Proton acceptor; for 3-dehydroquinate dehydratase
FT activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT ACT_SITE 1320
FT /note="Schiff-base intermediate with substrate; for 3-
FT dehydroquinate dehydratase activity"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 71..73
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 112..115
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 143..145
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 148
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 159
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 168..169
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 175
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 181
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 190
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 191
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 208..211
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 219
FT /ligand="NAD(+)"
FT /ligand_id="ChEBI:CHEBI:57540"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 223..226
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 223
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 287
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 301..305
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 308
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 308
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 324
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 324
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_note="catalytic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 393
FT /ligand="7-phospho-2-dehydro-3-deoxy-D-arabino-heptonate"
FT /ligand_id="ChEBI:CHEBI:58394"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
FT BINDING 955..962
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_03143"
SQ SEQUENCE 1715 AA; 186171 MW; A2ECE4C863275DE6 CRC64;
MTSTASAQQP VLRTKTPSYH APPSTDPLSG ATIHTLRCLD TPIHLGYHLI PHIAKTLLST
LPSSTYVLIT DQNLEARCSV TTAFRREFEK AAVHLSSSHS WRLLTKVIPP GEASKSRDGK
NAIEDWMFEH RVTRDAVVLA VGGGVIGDLV GFVAATFMRG LKFVQIPTTL LAMVDSAVGG
KTAIDHPLGK NLIGSFHQPN YVFIDAAWLK TLPVREFSNG MAEVVKTAAI WDPIDFAKLE
SSAPAIRSAV LGPFAKDAPL DQGRTLETRT ESQSLLLDVI RGSVGVKAHI VTIDEKETGL
RNLVNFGHTI GHAIEAVLTP EMLHGECISI GMILEAEVAR YMHGLSQVAI GRLTRCLKEY
DLPVSLSDSR VTRLAKSLDV TVDRLLDIMK VDKKNSGTNK KIVLLSSIGD TVQNMASTVS
DHIIRRVLSL AATVTPIHEQ PNKPKVTLST PGSKSISNRA LVLAALATNT CRLRNMLHSD
DTQVMMAGLH DLQAARFEFE DGGETIVVHG NAGALARPAN DKQIYLQNAG TAARFMATVV
SLVHNDGNQH PVVITGNKRM KERPIAALVD ALRSNGTSID YLEGHGCLPL AVKGTTHGFK
GGKIQLSATI SSQYVSSILL CAPYAAEQVV LELVGGQVIS QLYIDMTIAM MATFGIKVER
LLDPTTGRPS NTYRIPKGHY VSPDVYDIES DASSATYPLA IAAITGTECT VPNIGSASLQ
GDARFAKEVL EPMGCTVVQT ATSTTVIGPK LGQLRQIGLV DMEPMTDAFL TASVLLAVAA
HSPANGSTSN ARPSTRITGI ANQRVKECNR IRAMMDELAK FGVNTKEHDD GLEIFGIDYR
QLHANVRVHC YDDHRVAMAF SVLASLAPGA ILEEKRCVEK TWPNWWDDLE RKLGIRVHGV
DPECSPSLCI SPSHFTKASA ASTNGKPPTC LSQLTCGKAL TPRKYSKHAT IICIGMRASG
KTFLGAIGAA ALSRTFIDAD VVFNEKLGAK DGLGDFVREH GWPAFRQKEL EILQELIARH
PSGALISLGG GVVETEACRQ ILAEYAQTKG PVIYIVRDTN AIVKFLATSD RPAYGEPVMD
VYRRRNPWFS ECSSAELVSY SEGESLAIQP CAMNVPDPSF TIQKKFGLET EVARFFKFVV
GQDTNQIRDL VVDSRKGGRR TYFLSLTFPD VVPKLELIRS MESGSDALEF RADLLNPSGQ
PVTTPQIPPT EYVKNQLAAL RHRTSLPIVF TVRTHSQGGM FPDGKQHEYF ELISLALRHA
CEYIDLELGW DDDLLSAVVQ AKGNSQIIAS WHDWSGRLDW ESESTAKIYE KACRFGDIAK
IIGKATTMED NYSLERFRAK VSATATKPLL AVNMGSVGQL SRIVNPVFTP ITHEAMPFKA
APGQLSFRQV QTALSLIGQA DARRFALFGS PIGHSLSPLL HNTGFAALGL PHQYELLEST
EINDAVAAFV RSPDFGGASV TIPHKLNIIK LLDEVTDEAK TIGAVNTIIP IRDAQGVVTS
LVGDNTDWIA IETLARRSLR TVHLADPNLT GLVIGAGGSA RAALFALYRL GVKRILLFNR
TLANAEKLAR EVPPEWNVSV LTSLDEVAQL SADQSPSVVV SNIPAEGSTL DASSRGLIHL
PTCMLRNPAG GVVIDMSYKP HYTSLLQLAQ QVNTNHELAI GTNATKAPTR KLWAAVPGIT
ILLEQGCHQF HRWTGREAPR AQIEAAAWDV YLQRC