POLG_HAVSC
ID POLG_HAVSC Reviewed; 839 AA.
AC P31788;
DT 01-JUL-1993, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-1993, sequence version 1.
DT 03-AUG-2022, entry version 99.
DE RecName: Full=Genome polyprotein;
DE Contains:
DE RecName: Full=Capsid protein VP0;
DE AltName: Full=VP4-VP2;
DE Contains:
DE RecName: Full=Capsid protein VP4;
DE AltName: Full=P1A;
DE AltName: Full=Virion protein 4;
DE Contains:
DE RecName: Full=Capsid protein VP2;
DE AltName: Full=P1B;
DE AltName: Full=Virion protein 2;
DE Contains:
DE RecName: Full=Capsid protein VP3;
DE AltName: Full=P1C;
DE AltName: Full=Virion protein 3;
DE Contains:
DE RecName: Full=Protein VP1-2A;
DE AltName: Full=VPX;
DE Contains:
DE RecName: Full=Capsid protein VP1;
DE AltName: Full=P1D;
DE AltName: Full=Virion protein 1;
DE Contains:
DE RecName: Full=Assembly signal 2A;
DE AltName: Full=pX {ECO:0000250|UniProtKB:P08617};
DE Contains:
DE RecName: Full=Protein 2BC;
DE Contains:
DE RecName: Full=Protein 2B;
DE Short=P2B;
DE Flags: Fragment;
OS Simian hepatitis A virus genotype IV (isolate CY-145) (SHAV) (Simian
OS hepatitis A virus (isolate Macaca/Philippines/CY-145/1988)).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Picornavirales; Picornaviridae; Hepatovirus.
OX NCBI_TaxID=31707;
OH NCBI_TaxID=9481; Callithrix.
OH NCBI_TaxID=9536; Cercopithecus hamlyni (Owl-faced monkey) (Hamlyn's monkey).
OH NCBI_TaxID=9534; Chlorocebus aethiops (Green monkey) (Cercopithecus aethiops).
OH NCBI_TaxID=9539; Macaca (macaques).
OH NCBI_TaxID=9598; Pan troglodytes (Chimpanzee).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=1649902; DOI=10.1099/0022-1317-72-7-1685;
RA Nainan O.V., Margolis H.S., Robertson B.H., Balayan M., Brinton M.A.;
RT "Sequence analysis of a new hepatitis A virus naturally infecting
RT cynomolgus macaques (Macaca fascicularis).";
RL J. Gen. Virol. 72:1685-1689(1991).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 764-818.
RX PubMed=1318940; DOI=10.1099/0022-1317-73-6-1365;
RA Robertson B.H., Jansen R.W., Khanna B., Totsuka A., Nainan O.V., Siegl G.,
RA Widell A., Margolis H.S., Isomura S., Ito K., Ishizu T., Moritsugu Y.,
RA Lemon S.M.;
RT "Genetic relatedness of hepatitis A virus strains recovered from different
RT geographical regions.";
RL J. Gen. Virol. 73:1365-1377(1992).
CC -!- FUNCTION: [Capsid protein VP1]: Capsid proteins VP1, VP2, and VP3 form
CC a closed capsid enclosing the viral positive strand RNA genome. All
CC these proteins contain a beta-sheet structure called beta-barrel jelly
CC roll. Together they form an icosahedral capsid (T=3) composed of 60
CC copies of each VP1, VP2, and VP3, with a diameter of approximately 300
CC Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3
CC are located at the quasi-sixfold axes. The naked capsid interacts with
CC the host receptor HAVCR1 to provide virion attachment to and probably
CC entry into the target cell. {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Capsid protein VP2]: Capsid proteins VP1, VP2, and VP3 form
CC a closed capsid enclosing the viral positive strand RNA genome. All
CC these proteins contain a beta-sheet structure called beta-barrel jelly
CC roll. Together they form an icosahedral capsid (T=3) composed of 60
CC copies of each VP1, VP2, and VP3, with a diameter of approximately 300
CC Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3
CC are located at the quasi-sixfold axes. The naked capsid interacts with
CC the host receptor HAVCR1 to provide virion attachment to and probably
CC entry into the target cell. {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Capsid protein VP3]: Capsid proteins VP1, VP2, and VP3 form
CC a closed capsid enclosing the viral positive strand RNA genome. All
CC these proteins contain a beta-sheet structure called beta-barrel jelly
CC roll. Together they form an icosahedral capsid (T=3) composed of 60
CC copies of each VP1, VP2, and VP3, with a diameter of approximately 300
CC Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3
CC are located at the quasi-sixfold axes. The naked capsid interacts with
CC the host receptor HAVCR1 to provide virion attachment to and probably
CC entry into the target cell. {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Capsid protein VP0]: VP0 precursor is a component of the
CC immature procapsids. {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Capsid protein VP4]: Plays a role in the assembly of the 12
CC pentamers into an icosahedral structure. Has not been detected in
CC mature virions, supposedly owing to its small size.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Protein VP1-2A]: Precursor component of immature procapsids
CC that corresponds to an extended form of the structural protein VP1.
CC After maturation, possibly by the host Cathepsin L, the assembly signal
CC 2A is cleaved to give rise to the mature VP1 protein.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Protein 2BC]: Affects membrane integrity and causes an
CC increase in membrane permeability. {ECO:0000250|UniProtKB:P08617}.
CC -!- FUNCTION: [Protein 2B]: Functions as a viroporin. Affects membrane
CC integrity and causes an increase in membrane permeability. Involved in
CC host intracellular membrane rearrangements probably to give rise to the
CC viral factories. Does not disrupt calcium homeostasis or glycoprotein
CC trafficking. Antagonizes the innate immune response of the host by
CC suppressing IFN-beta synthesis, which it achieves by interfering with
CC the DDX58/IFIH1 (RIG-I/MDA5) pathway. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBUNIT: [Protein 2B]: Homodimer. Homomultimer; probably interacts with
CC membranes in a multimeric form. Seems to assemble into amyloid-like
CC fibers. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBUNIT: [Protein VP1-2A]: Homopentamer. Homooligomer.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBUNIT: [Capsid protein VP1]: Interacts with capsid protein VP2.
CC Interacts with capsid protein VP3. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBUNIT: [Capsid protein VP2]: Interacts with capsid protein VP1.
CC Interacts with capsid protein VP3. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBUNIT: [Capsid protein VP3]: Interacts with capsid protein VP1.
CC Interacts with capsid protein VP2. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein VP2]: Virion
CC {ECO:0000250|UniProtKB:P08617}. Host endosome, host multivesicular body
CC {ECO:0000250|UniProtKB:P08617}. Note=The egress of newly formed virions
CC occurs through an exosome-like mechanism involving endosomal budding of
CC viral capsids into multivesicular bodies.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein VP3]: Virion
CC {ECO:0000250|UniProtKB:P08617}. Host endosome, host multivesicular body
CC {ECO:0000250|UniProtKB:P08617}. Note=The egress of newly formed virions
CC occurs through an exosome-like mechanism involving endosomal budding of
CC viral capsids into multivesicular bodies.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein VP1]: Virion
CC {ECO:0000250|UniProtKB:P08617}. Host endosome, host multivesicular body
CC {ECO:0000250|UniProtKB:P08617}. Note=The egress of newly formed virions
CC occurs through an exosome-like mechanism involving endosomal budding of
CC viral capsids into multivesicular bodies.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein VP4]: Virion
CC {ECO:0000250|UniProtKB:P08617}. Note=Present in the full mature virion.
CC The egress of newly formed virions occurs through an exosome-like
CC mechanism involving endosomal budding of viral capsids into
CC multivesicular bodies. {ECO:0000250|UniProtKB:P08617}.
CC -!- SUBCELLULAR LOCATION: [Protein 2B]: Host membrane
CC {ECO:0000250|UniProtKB:P08617}; Peripheral membrane protein
CC {ECO:0000250|UniProtKB:P08617}. Note=Probably localizes to
CC intracellular membrane vesicles that are induced after virus infection
CC as the site for viral RNA replication. {ECO:0000250|UniProtKB:P08617}.
CC -!- DOMAIN: [Protein VP1-2A]: The assembly signal 2A region mediates
CC pentamerization of P1-2A. {ECO:0000250|UniProtKB:P08617}.
CC -!- DOMAIN: [Genome polyprotein]: Late-budding domains (L domains) are
CC short sequence motifs essential for viral particle budding. They
CC recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex
CC Required for Transport) or ESCRT-associated proteins. The genome
CC polyprotein contains two L domains: a tandem of (L)YPX(n)L domain which
CC is known to bind the PDCD6IP/ALIX adaptater protein.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- DOMAIN: [Capsid protein VP2]: Late-budding domains (L domains) are
CC short sequence motifs essential for viral particle budding. They
CC recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex
CC Required for Transport) or ESCRT-associated proteins. Capsid protein
CC VP2 contains two L domains: a tandem of (L)YPX(n)L domain which is
CC known to bind the Alix adaptater protein.
CC {ECO:0000250|UniProtKB:P08617}.
CC -!- DOMAIN: [Protein 2B]: The C-terminus displays a membrane-penetrating
CC ability. {ECO:0000250|UniProtKB:P08617}.
CC -!- PTM: [Genome polyprotein]: Specific enzymatic cleavages by viral
CC protease in vivo yield a variety of precursors and mature proteins.
CC Polyprotein processing intermediates are produced, such as P1-2A which
CC is a functional precursor of the structural proteins, VP0 which is a
CC VP4-VP2 precursor, VP1-2A precursor, 3ABC precursor which is a stable
CC and catalytically active precursor of 3A, 3B and 3C proteins, 3AB and
CC 3CD precursors. The assembly signal 2A is removed from VP1-2A by a host
CC protease, possibly host Cathepsin L. This cleavage occurs over a region
CC of 3 amino-acids probably generating VP1 proteins with heterogeneous C-
CC termini. {ECO:0000250|UniProtKB:P08617}.
CC -!- PTM: [Capsid protein VP0]: During virion maturation, immature virions
CC are rendered infectious following cleavage of VP0 into VP4 and VP2.
CC This maturation seems to be an autocatalytic event triggered by the
CC presence of RNA in the capsid and is followed by a conformational
CC change of the particle. {ECO:0000250|UniProtKB:P03303}.
CC -!- PTM: [Protein VP1-2A]: The assembly signal 2A is removed from VP1-2A by
CC a host protease, possibly host Cathepsin L in naked virions. This
CC cleavage does not occur in enveloped virions. This cleavage occurs over
CC a region of 3 amino-acids probably generating VP1 proteins with
CC heterogeneous C-termini. {ECO:0000250|UniProtKB:P08617}.
CC -!- PTM: [Capsid protein VP4]: Unlike other picornaviruses, does not seem
CC to be myristoylated. {ECO:0000250|UniProtKB:P08617}.
CC -!- MISCELLANEOUS: [Genome polyprotein]: The need for an intact eIF4G
CC factor for the initiation of translation of HAV results in an inability
CC to shut off host protein synthesis by a mechanism similar to that of
CC other picornaviruses. {ECO:0000250|UniProtKB:P08617}.
CC -!- MISCELLANEOUS: [Genome polyprotein]: During infection, enveloped
CC virions (eHAV) are released from cells. These eHAV are cloaked in host-
CC derived membranes and resemble exosomes. The membrane of eHAV is devoid
CC of viral proteins and thus prevents their neutralization by antibodies.
CC eHAV budding is dependent on ESCRT-associated proteins VPS4B and
CC PDCD6IP/ALIX. eHAV are produced and released in the serum and plasma,
CC but not in bile and feces which only contain the naked, nonenveloped
CC virions. It is likely that eHAV also use HAVCR1 as a functional
CC receptor to infect cells, an evolutionary trait that may enhance HAV
CC infectivity. {ECO:0000250|UniProtKB:P08617}.
CC -!- SIMILARITY: Belongs to the picornaviridae polyprotein family.
CC {ECO:0000305}.
CC -!- CAUTION: It is uncertain whether Met-1 or Met-3 is the initiator.
CC {ECO:0000250|UniProtKB:P08617}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M59286; AAA45473.1; -; Genomic_RNA.
DR EMBL; L07732; -; NOT_ANNOTATED_CDS; Genomic_RNA.
DR PIR; JQ1180; GNNYS2.
DR SMR; P31788; -.
DR Proteomes; UP000007633; Genome.
DR GO; GO:0072494; C:host multivesicular body; IEA:UniProtKB-SubCell.
DR GO; GO:0044385; C:integral to membrane of host cell; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0039618; C:T=pseudo3 icosahedral viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0005216; F:ion channel activity; IEA:UniProtKB-KW.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0039707; P:pore formation by virus in membrane of host cell; IEA:UniProtKB-KW.
DR GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR CDD; cd00205; rhv_like; 2.
DR Gene3D; 2.60.120.20; -; 3.
DR InterPro; IPR024354; Hepatitis_A_VP1-2A.
DR InterPro; IPR001676; Picornavirus_capsid.
DR InterPro; IPR033703; Rhv-like.
DR InterPro; IPR029053; Viral_coat.
DR Pfam; PF12944; HAV_VP; 1.
DR Pfam; PF00073; Rhv; 2.
PE 3: Inferred from homology;
KW Capsid protein; Host endosome; Host membrane; Host-virus interaction;
KW Ion channel; Ion transport; Membrane; T=pseudo3 icosahedral capsid protein;
KW Transport; Viral attachment to host cell; Viral ion channel; Virion;
KW Virus entry into host cell.
FT CHAIN 1..>839
FT /note="Genome polyprotein"
FT /id="PRO_0000311031"
FT CHAIN 1..245
FT /note="Capsid protein VP0"
FT /id="PRO_0000311032"
FT CHAIN 1..23
FT /note="Capsid protein VP4"
FT /id="PRO_0000039990"
FT CHAIN 24..245
FT /note="Capsid protein VP2"
FT /id="PRO_0000039991"
FT CHAIN 246..491
FT /note="Capsid protein VP3"
FT /id="PRO_0000039992"
FT CHAIN 492..835
FT /note="Protein VP1-2A"
FT /id="PRO_0000311033"
FT CHAIN 492..765
FT /note="Capsid protein VP1"
FT /id="PRO_0000039993"
FT CHAIN 766..835
FT /note="Assembly signal 2A"
FT /id="PRO_0000039994"
FT CHAIN 836..>839
FT /note="Protein 2B"
FT /id="PRO_0000311034"
FT CHAIN 836..>839
FT /note="Protein 2BC"
FT /id="PRO_0000311035"
FT REGION 55..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 496..530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 766..836
FT /note="Involved in P1-2A pentamerization"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT REGION 773..792
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 167..171
FT /note="(L)YPX(n)L motif"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT MOTIF 200..205
FT /note="(L)YPX(n)L motif"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT COMPBIAS 55..69
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 496..514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 774..792
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT SITE 23..24
FT /note="Cleavage"
FT /evidence="ECO:0000255"
FT SITE 245..246
FT /note="Cleavage; by protease 3C"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT SITE 491..492
FT /note="Cleavage; by protease 3C"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT SITE 765..766
FT /note="Cleavage; partial; by host"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT SITE 769
FT /note="Important for VP1 folding and capsid assembly"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT SITE 835..836
FT /note="Cleavage; by protease 3C"
FT /evidence="ECO:0000250|UniProtKB:P08617"
FT NON_TER 839
SQ SEQUENCE 839 AA; 93825 MW; 2CACC4BD1E192DBC CRC64;
MNMARQGLFQ TVGSGLDHIL SLADVEEEQM IQSVDRTAVT GASYFTSVDQ SSVHTAEVGS
HQSEPLKTSV DKPGSKKTQG EKFFLIHSAD WLSTHALFHE VAKLDVVSLL YNEQFAVQGL
LRYHTYARFG IEIQVQINPT PFQQGGLICA MVPGDQGYGS IASLTVYPHG LLNCNINNVV
RIKVPFIYTR GAYHFKDPQY PVWELTIRVW SEFNIGTGTS AYTSLNVLAR FTDLELHGLT
PLSTQMMRNE FRVSTTENVV NLSNYEDARA KMSFALDQEN WRSDPSEGGG IKITHFSTWT
SIPTLAAQFA FNASASVGQQ IKVIPVDPYF YQMTNSNPDQ KYITALASIC QMFCFWRGDL
VFDFQVFPTK YHSGRLQFCF VPGNELIEVT SITLKQATTA PCAVMDITGV QSTLRFRVPW
ISDTPYRVNC YIKSSHQKGE YTAIEKLIVY CYNRLTSPSN VASHVRVNVY LSAINLECFA
PLYHAMDVTS QTGDDSGGFS TTVSTEQNVP DPQVGITTPK DLKGKANKGK MDVSGVQAPV
GAITTIEDPV LAKKVPETFP ELKPGESRHT SDHMSVYKFM GRSHFLCTFT FNANNREYTF
PITLSSTSNP PHGSPSTLRW FFNLFQLYRG PLDLTIIITG ATDVDGMAWF TPVGLAVDTP
WVEKQSALTI DYKTALGAIR FNTRRTGNIQ IRLPWYSYLY AVSGALDGLG DTTDSTFGLV
SIQIANYNHS DEYLSFSCYL SVTEQSEFFF PRAPLNSSAM MTSENMLDRI AGGDLESSVD
DPRTDEDRRF ESHIEKKPYK ELRLEVGKQR FKYAREELSN EILPPPRKLK GLFSQSKIS