POLG_RHDVA
ID POLG_RHDVA Reviewed; 2344 AA.
AC Q86119; Q7THT7; Q86123; Q86124; Q9IBM0;
DT 29-MAR-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2003, sequence version 2.
DT 03-AUG-2022, entry version 122.
DE RecName: Full=Genome polyprotein;
DE AltName: Full=p254;
DE Contains:
DE RecName: Full=Protein p16;
DE Contains:
DE RecName: Full=Protein p23;
DE Contains:
DE RecName: Full=NTPase;
DE EC=3.6.1.15;
DE AltName: Full=2C-like protein;
DE AltName: Full=P2C;
DE AltName: Full=p37;
DE Contains:
DE RecName: Full=Precursor p41;
DE Contains:
DE RecName: Full=Protein p29;
DE Contains:
DE RecName: Full=Protein p23/2;
DE Contains:
DE RecName: Full=Protein p18;
DE Contains:
DE RecName: Full=Viral genome-linked protein;
DE AltName: Full=VPg;
DE AltName: Full=p13;
DE Contains:
DE RecName: Full=3C-like protease;
DE Short=3CLpro;
DE EC=3.4.22.66;
DE AltName: Full=Calicivirin;
DE AltName: Full=Thiol protease P3C;
DE AltName: Full=p15;
DE Contains:
DE RecName: Full=RNA-directed RNA polymerase;
DE EC=2.7.7.48;
DE AltName: Full=3Dpol;
DE AltName: Full=p58;
DE Contains:
DE RecName: Full=Capsid protein VP60;
GN ORFNames=ORF1;
OS Rabbit hemorrhagic disease virus (strain AST89) (Ra/LV/RHDV/AST89/1989/SP)
OS (RHDV-AST89).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Picornavirales; Caliciviridae; Lagovirus.
OX NCBI_TaxID=314538;
OH NCBI_TaxID=9986; Oryctolagus cuniculus (Rabbit).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA], PROTEIN SEQUENCE OF 719-724 AND
RP 1009-1114, MUTAGENESIS OF GLU-1767, AND PROTEOLYTIC PROCESSING OF
RP POLYPROTEIN.
RX PubMed=8551592; DOI=10.1128/jvi.70.2.1261-1265.1996;
RA Martin-Alonso J.M., Casais R., Boga J.A., Parra F.;
RT "Processing of rabbit hemorrhagic disease virus polyprotein.";
RL J. Virol. 70:1261-1265(1996).
RN [2]
RP SEQUENCE REVISION TO 1891; 2058 AND 2061.
RA Casais R., Martin-Alonso J.M., Boga J.A., Parra F.;
RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1650-2344.
RX PubMed=8077941; DOI=10.1099/0022-1317-75-9-2409;
RA Boga J.A., Casais R., Marin M.S., Martin-Alonso J.M., Carmenes R.,
RA Prieto M., Parra F.;
RT "Molecular cloning, sequence and expression of the capsid protein gene from
RT rabbit hemorrhagic disease virus (Spanish isolate AST/89).";
RL J. Gen. Virol. 75:2409-2413(1994).
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 1766-1797, AND PROTEIN SEQUENCE OF 1767-1780.
RX PubMed=8488721; DOI=10.1016/0168-1702(93)90034-k;
RA Parra F., Boga J.A., Marin M.S., Casais R.;
RT "The amino terminal sequence of VP60 from rabbit hemorrhagic disease virus
RT supports its putative subgenomic origin.";
RL Virus Res. 27:219-228(1993).
RN [5]
RP SUBGENOMIC ORIGIN OF VP60.
RX PubMed=1441736; DOI=10.1016/0168-1702(92)90144-x;
RA Boga J.A., Marin M.S., Casais R., Prieto M., Parra F.;
RT "In vitro translation of a subgenomic mRNA from purified virions of the
RT Spanish field isolate AST/89 of rabbit hemorrhagic disease virus (RHDV).";
RL Virus Res. 26:33-40(1992).
RN [6]
RP COVALENT RNA LINKAGE AT TYR-1014 (VPG), AND URIDYLYLATION AT TYR-1014.
RX PubMed=11369764; DOI=10.1074/jbc.m100707200;
RA Machin A., Martin Alonso J.M., Parra F.;
RT "Identification of the amino acid residue involved in rabbit hemorrhagic
RT disease virus VPg uridylylation.";
RL J. Biol. Chem. 276:27787-27792(2001).
RN [7]
RP SUBGENOMIC ORIGIN OF VP60.
RX PubMed=14744857; DOI=10.1074/jbc.m313674200;
RA Morales M., Barcena J., Ramirez M.A., Boga J.A., Parra F., Torres J.M.;
RT "Synthesis in vitro of rabbit hemorrhagic disease virus subgenomic RNA by
RT internal initiation on (-)sense genomic RNA: mapping of a subgenomic
RT promoter.";
RL J. Biol. Chem. 279:17013-17018(2004).
RN [8]
RP INTERACTION OF VP60 WITH HISTO-BLOOD GROUP ANTIGENS.
RX PubMed=18302385; DOI=10.1021/ja710854r;
RA Rademacher C., Krishna N.R., Palcic M., Parra F., Peters T.;
RT "NMR experiments reveal the molecular basis of receptor recognition by a
RT calicivirus.";
RL J. Am. Chem. Soc. 130:3669-3675(2008).
CC -!- FUNCTION: NTPase presumably plays a role in replication. {ECO:0000250}.
CC -!- FUNCTION: Viral genome-linked protein is covalently linked to the 5'-
CC end of the positive-strand, negative-strand genomic RNAs and subgenomic
CC RNA. Acts as a genome-linked replication primer. May recruit ribosome
CC to viral RNA thereby promoting viral proteins translation (By
CC similarity). {ECO:0000250}.
CC -!- FUNCTION: 3C-like protease processes the polyprotein: 3CLpro-RdRp (p72)
CC is first released by autocleavage, then all other proteins are cleaved.
CC {ECO:0000250}.
CC -!- FUNCTION: RNA-directed RNA polymerase replicates genomic and
CC antigenomic RNA by recognizing replications specific signals.
CC Transcribes also a subgenomic mRNA by initiating RNA synthesis
CC internally on antigenomic RNA. This sgRNA codes for structural
CC proteins. Catalyzes the covalent attachment VPg with viral RNAs (By
CC similarity). {ECO:0000255|PROSITE-ProRule:PRU00539}.
CC -!- FUNCTION: Capsid protein VP60 self assembles to form an icosahedral
CC capsid with a T=3 symmetry, about 35 nm in diameter, and consisting of
CC 180 capsid proteins. A smaller form of capsid with a diameter of 23 nm
CC might be capsid proteins assembled as icosahedron with T=1 symmetry.
CC The capsid encapsulate VP2 proteins and genomic or subgenomic RNA.
CC Attaches virion to target cells by binding histo-blood group antigens,
CC inducing endocytosis of the viral particle. Acidification of the
CC endosome induces conformational change of capsid protein thereby
CC injecting virus genomic RNA into host cytoplasm (By similarity).
CC {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate +
CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA-
CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395;
CC EC=2.7.7.48; Evidence={ECO:0000255|PROSITE-ProRule:PRU00539};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endopeptidase with a preference for cleavage when the P1
CC position is occupied by Glu-|-Xaa and the P1' position is occupied by
CC Gly-|-Yaa.; EC=3.4.22.66; Evidence={ECO:0000255|PROSITE-
CC ProRule:PRU01242};
CC -!- SUBUNIT: Binds to histo-blood group antigens at surface of target
CC cells. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein VP60]: Virion. Host cytoplasm
CC {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative promoter usage; Named isoforms=2;
CC Name=Genome polyprotein;
CC IsoId=Q86119-1; Sequence=Displayed;
CC Name=Subgenomic capsid protein VP60; Synonyms=VP1;
CC IsoId=Q86119-2; Sequence=VSP_034379;
CC -!- PTM: Specific enzymatic cleavages by its own cysteine protease yield
CC mature proteins. The protease cleaves itself from the nascent
CC polyprotein autocatalytically. Precursor p41 can be cleaved by viral
CC 3CLpro into protein p19 and VPg, or cleaved by host protease into
CC protein p23/2 and protein p18 (By similarity). {ECO:0000250}.
CC -!- PTM: VPg is uridylylated by the polymerase and is covalently attached
CC to the 5'-end of the polyadenylated genomic and subgenomic RNAs. This
CC uridylylated form acts as a nucleotide-peptide primer for the
CC polymerase.
CC -!- MISCELLANEOUS: Two different RNAs lead the expression of the capsid
CC protein. One arises from the cleavage of the polyprotein translated
CC from the genomic RNA and the other from the translation of a subgenomic
CC RNA derived from the (-)RNA template. Capsid protein expressed from the
CC subgenomic mRNA is produced in much larger amounts than the cleaved
CC one.
CC -!- MISCELLANEOUS: [Isoform Genome polyprotein]: Produced from the genomic
CC RNA.
CC -!- MISCELLANEOUS: [Isoform Subgenomic capsid protein VP60]: Produced from
CC the subgenomic RNA. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z49271; CAA89265.2; -; Genomic_RNA.
DR EMBL; Z49271; CAD91718.1; -; Genomic_RNA.
DR EMBL; Z24757; CAA80881.1; ALT_SEQ; Genomic_RNA.
DR EMBL; Z24757; CAA80883.1; ALT_SEQ; Genomic_RNA.
DR EMBL; X73046; CAA51524.1; -; mRNA.
DR PIR; S64740; S64740.
DR PDB; 3ZUE; EM; 10.30 A; A/B/C=1766-2344.
DR PDBsum; 3ZUE; -.
DR SMR; Q86119; -.
DR MEROPS; C24.001; -.
DR Proteomes; UP000008653; Genome.
DR GO; GO:0030430; C:host cell cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0017111; F:nucleoside-triphosphatase activity; IEA:RHEA.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0003724; F:RNA helicase activity; IEA:InterPro.
DR GO; GO:0003968; F:RNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0018144; P:RNA-protein covalent cross-linking; IEA:UniProtKB-KW.
DR GO; GO:0006351; P:transcription, DNA-templated; IEA:InterPro.
DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR CDD; cd00205; rhv_like; 1.
DR Gene3D; 2.60.120.20; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.40.50.300; -; 1.
DR InterPro; IPR004005; Calicivirus_coat.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR004004; Helic/Pol/Pept_Calicivir-typ.
DR InterPro; IPR000605; Helicase_SF3_ssDNA/RNA_vir.
DR InterPro; IPR014759; Helicase_SF3_ssRNA_vir.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR000317; Peptidase_C24.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR033703; Rhv-like.
DR InterPro; IPR001205; RNA-dir_pol_C.
DR InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR InterPro; IPR029053; Viral_coat.
DR Pfam; PF00915; Calici_coat; 1.
DR Pfam; PF03510; Peptidase_C24; 1.
DR Pfam; PF00680; RdRP_1; 1.
DR Pfam; PF00910; RNA_helicase; 1.
DR PRINTS; PR00916; 2CENDOPTASE.
DR PRINTS; PR00918; CALICVIRUSNS.
DR SUPFAM; SSF50494; SSF50494; 1.
DR SUPFAM; SSF52540; SSF52540; 1.
DR SUPFAM; SSF56672; SSF56672; 1.
DR PROSITE; PS51894; CV_3CL_PRO; 1.
DR PROSITE; PS50507; RDRP_SSRNA_POS; 1.
DR PROSITE; PS51218; SF3_HELICASE_2; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Alternative promoter usage; ATP-binding; Capsid protein;
KW Covalent protein-RNA linkage; Direct protein sequencing; Disulfide bond;
KW Helicase; Host cytoplasm; Hydrolase; Nucleotide-binding;
KW Nucleotidyltransferase; Phosphoprotein; Protease;
KW RNA-directed RNA polymerase; Thiol protease; Transferase;
KW Viral RNA replication; Virion.
FT CHAIN 1..2344
FT /note="Genome polyprotein"
FT /id="PRO_0000341999"
FT CHAIN 1..143
FT /note="Protein p16"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036941"
FT CHAIN 144..339
FT /note="Protein p23"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036942"
FT CHAIN 340..718
FT /note="NTPase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036943"
FT CHAIN 712..1108
FT /note="Precursor p41"
FT /evidence="ECO:0000250"
FT /id="PRO_0000342000"
FT CHAIN 712..936
FT /note="Protein p23/2"
FT /evidence="ECO:0000250"
FT /id="PRO_0000342001"
FT CHAIN 719..993
FT /note="Protein p29"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036944"
FT CHAIN 937..1108
FT /note="Protein p18"
FT /evidence="ECO:0000250"
FT /id="PRO_0000342002"
FT CHAIN 994..1108
FT /note="Viral genome-linked protein"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036945"
FT CHAIN 1109..1251
FT /note="3C-like protease"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036946"
FT CHAIN 1252..1767
FT /note="RNA-directed RNA polymerase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036947"
FT CHAIN 1768..2344
FT /note="Capsid protein VP60"
FT /evidence="ECO:0000250"
FT /id="PRO_0000036949"
FT DOMAIN 492..653
FT /note="SF3 helicase"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00551"
FT DOMAIN 1109..1244
FT /note="Peptidase C24"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01242"
FT DOMAIN 1495..1619
FT /note="RdRp catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00539"
FT REGION 1771..1796
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1777..1791
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 1135
FT /note="For 3CLpro activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01242"
FT ACT_SITE 1152
FT /note="For 3CLpro activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01242"
FT ACT_SITE 1212
FT /note="For 3CLpro activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01242"
FT BINDING 522..529
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00551"
FT SITE 143..144
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 339..340
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 718..719
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 936..937
FT /note="Cleavage; by host"
FT /evidence="ECO:0000250"
FT SITE 993..994
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 1108..1109
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 1251..1252
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT SITE 1767..1768
FT /note="Cleavage; by 3CLpro"
FT /evidence="ECO:0000250"
FT MOD_RES 1014
FT /note="O-(5'-phospho-RNA)-tyrosine"
FT /evidence="ECO:0000269|PubMed:11369764"
FT MOD_RES 1014
FT /note="O-UMP-tyrosine; transient"
FT /evidence="ECO:0000269|PubMed:11369764"
FT DISULFID 1584..1591
FT /evidence="ECO:0000250"
FT VAR_SEQ 1..1765
FT /note="Missing (in isoform Subgenomic capsid protein VP60)"
FT /evidence="ECO:0000305"
FT /id="VSP_034379"
FT MUTAGEN 1767
FT /note="E->G: Loss of cleavage between RNA-directed RNA
FT polymerase and VP60."
FT /evidence="ECO:0000269|PubMed:8551592"
SQ SEQUENCE 2344 AA; 256817 MW; E1C61F038111F092 CRC64;
MAAMSRLTGM TTAILPEKKP LNFFLDLRDK TPPCCIRATG KLAWPVFLGQ NGKEGPLETC
NKCGKWLNGF GCFGLEDLGD VCLCSIAQQK HKFGPVCLCN RAYIHDCGRW RRRSRFLKHY
KALNKVIPCA YQFDESFSTP VFEGEVDDLF VELGAPTSMG FMDKKLLKKG KKLMDKFVDV
DEPCLTSRDA SLLDSIASDN TIRAKLEEEY GVEMVQAARD RKDFMKNLRL ALDNRPANPV
TWYTKLGNIT EKGKQWAKKV VYGACKVTDP LKTLASILLV GLHNVIAVDT TVMLSTFKPV
NLLAILMDWT NDLTGFVTTL VRLLELYGVV QATVNLIVEG VKSFWDKVVC ATDRCFDLLK
RLFDTFEDSV PTGPTAGCLI FMAFVFSTVV GYLPNNSVIT TFMKGAGKLT TFAGVIGAIR
TLWITINQHM VAKDLTSIQQ KVMTVVKMAN EAATLDQLEI VSCLCSDLEN TLTNRCTLPS
YNQHLGILNA SQKVISDLHT MVLGKINMTK QRPQPVAVIF KGAPGIGKTY LVHRIARDLG
CQHPSTINFG LDHFDSYTGE EVAIADEFNT CGDGESWVEL FIQMVNTNPC PLNCDKAENK
NKVFNSKYLL CTTNSNMILN ATHPRAGAFY RRVMIVEARN KAVESWQATR HGSKPGRSCY
SKDMSHLTFQ VYPHNMPAPG FVFVGDKLVK SQVAPREYKY SELLDLIKSE HPDVASFEGA
NRFNFVYPDA QYDQALLMWK QYFVMYGCVA RLAKNFVDDI PYNQVHISRA SDPKIEGCVE
YQCKFQHLWR MVPQFVLGCV NMTNQLGTPL TQQQLDRITN GVEGVTVTTV NNILPFHSQT
TLINPSFIKL IWAVRKHLKG LSGVTKVAQF IWRVMTNPVD AYGSLVRTLT GAATFSDDPV
STTIICSNCT IQIHSCGGLL VRYSRDPVPV ASDNVDRGDQ GVDVFTDPNL ISGFSWRQIA
HLFVEVISHL CANHLVNLAT MAALGAVATK AFQGVKGKTK RGRGARVNLG NDEYDEWQAA
RREFVNAHDM TAEEYLAMKN KAAMGSDDQD SVMFRSWWTR RQLRPDEDQV TVVGRGGVRN
EVIRTRVRQT PKGPKTLDDG GFYDNDYEGL PGFMRHNGSG WMIHIGNGLY ISNTHTARSS
CSEVVTCSPT TDLCLVKGEA IRSVAQIAEG TPVCDWKKSP ISTYGIKKTL SDSTKIDVLA
YDGCTQTTHG DCGLPLYDSS GKIVAIHTGK LLGFSKMCTL IDLTITKGVY ETSNFFCGEP
IDYRGITAHR LVGAEPRPPV SGTRYAKVPG VPEEYKTGYR PANLGRSDPD SDKSLMNIAV
KNLQVYQQEP KLDKVDEFIE RAAADVLGYL RFLTKGERQA NLNFKAAFNT LDLSTSCGPF
VPGKKIDHVK DGVMDQVHAK HLYKCWSVAN SGKALHHIYA CGLKDELRPL DKVKEGKKRL
LWGCDVGVAV CAAAVFHNIC YKLKMVARFG PIAVGVDMTS RDVDVIINNL TSKASDFLCL
DYSKWDSTMS PCVVRLAIDI LADCCEQTEL TKSVVLTLKS HPMTILDAMI VQTKRGLPSG
MPFTSVINSI CHWLLWSAAV YKSCAEIGLH CSNLYEDAPF YTYGDDGVYA MTPMMVSLLP
AIIENLRDYG LSPTAADKTE FIDVCPLNKI SFLKRTFELT DIGWVSKLDK SSILRQLEWS
KTTSRHMMIE ETYDLAKEER GVQLEELQVA AAAHGQEFFN FVCKELERQQ AYTQFSVYSY
DAARKILADR KRVVSVVPDD EFVNVMEGKA RTAPQGEAAG TATTASVPGT TTDGMDPGVV
ATTSVVTAEN SSASIATAGI GGPPQQVDQQ ETWRTNFYYN DVFTWSVADA PGSILYTVQH
SPQNNPFTAV LSQMYAGWAG GMQFRFIVAG SGVFGGRLVA AVIPPGIEIG PGLEVRQFPH
VVIDARSLEP VTITMPDLRP NMYHPTGDPG LVPTLVLSVY NNLINPFGGS TSAIQVTVET
RPSEDFEFVM IRAPSSKTVD SISPAGLLTT PVLTGVGNDN RWNGQIVGLQ PVPGGFSTCN
RHWNLNGSTY GWSSPRFADI DHRRGSASYP GNNATNVLQF WYANAGSAID NPISQVAPDG
FPDMSFVPFN GPGIPAAGWV GFGAIWNSNS GAPNVTTVQA YELGFATGAP GNLQPTTNTS
GSQTVAKSIY AVVTGTAQNP AGLFVMASGV ISTPSANAIT YTPQPDRIVT TPGTPAAAPV
GKNTPIMFAS VVRRTGDVNA TAGSANGTQY GTGSQPLPVT IGLSLNNYSS ALMPGQFFVW
QLTFASGFME IGLSVDGYFY AGTGASTTLI DLTELIDVRP VGPRPSKSTL VFNLGGTANG
FSYV