HBSAG_WMHBV
ID HBSAG_WMHBV Reviewed; 391 AA.
AC O71305; O92934;
DT 26-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1998, sequence version 1.
DT 02-JUN-2021, entry version 75.
DE RecName: Full=Large envelope protein {ECO:0000255|HAMAP-Rule:MF_04075};
DE AltName: Full=L glycoprotein {ECO:0000255|HAMAP-Rule:MF_04075};
DE AltName: Full=L-HBsAg {ECO:0000255|HAMAP-Rule:MF_04075};
DE Short=LHB {ECO:0000255|HAMAP-Rule:MF_04075};
DE AltName: Full=Large S protein {ECO:0000255|HAMAP-Rule:MF_04075};
DE AltName: Full=Large surface protein {ECO:0000255|HAMAP-Rule:MF_04075};
DE AltName: Full=Major surface antigen {ECO:0000255|HAMAP-Rule:MF_04075};
GN Name=S {ECO:0000255|HAMAP-Rule:MF_04075};
OS Woolly monkey hepatitis B virus (isolate Louisville) (WMHBV).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Blubervirales; Hepadnaviridae; Orthohepadnavirus.
OX NCBI_TaxID=490134;
OH NCBI_TaxID=9519; Lagothrix lagotricha (Brown woolly monkey) (Humboldt's woolly monkey).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=9576957; DOI=10.1073/pnas.95.10.5757;
RA Lanford R.E., Chavez D., Brasky K.M., Burns R.B. III, Rico-Hesse R.;
RT "Isolation of a hepadnavirus from the woolly monkey, a New World primate.";
RL Proc. Natl. Acad. Sci. U.S.A. 95:5757-5761(1998).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=12829821; DOI=10.1128/jvi.77.14.7814-7819.2003;
RA Lanford R.E., Chavez D., Barrera A., Brasky K.M.;
RT "An infectious clone of woolly monkey hepatitis B virus.";
RL J. Virol. 77:7814-7819(2003).
RN [3]
RP REVIEW.
RX PubMed=8957666; DOI=10.1159/000150471;
RA Bruss V., Gerhardt E., Vieluf K., Wunderlich G.;
RT "Functions of the large hepatitis B virus surface protein in viral particle
RT morphogenesis.";
RL Intervirology 39:23-31(1996).
RN [4]
RP REVIEW.
RX PubMed=9498079; DOI=10.1007/978-1-4615-5383-0_20;
RA Block T.M., Lu X., Mehta A., Park J., Blumberg B.S., Dwek R.;
RT "Role of glycan processing in hepatitis B virus envelope protein
RT trafficking.";
RL Adv. Exp. Med. Biol. 435:207-216(1998).
RN [5]
RP REVIEW.
RX PubMed=15567498; DOI=10.1016/j.virusres.2004.08.016;
RA Bruss V.;
RT "Envelopment of the hepatitis B virus nucleocapsid.";
RL Virus Res. 106:199-209(2004).
RN [6]
RP REVIEW.
RX PubMed=16863502; DOI=10.1111/j.1349-7006.2006.00235.x;
RA Wang H.C., Huang W., Lai M.D., Su I.J.;
RT "Hepatitis B virus pre-S mutants, endoplasmic reticulum stress and
RT hepatocarcinogenesis.";
RL Cancer Sci. 97:683-688(2006).
CC -!- FUNCTION: The large envelope protein exists in two topological
CC conformations, one which is termed 'external' or Le-HBsAg and the other
CC 'internal' or Li-HBsAg. In its external conformation the protein
CC attaches the virus to cell receptors and thereby initiating infection.
CC This interaction determines the species specificity and liver tropism.
CC This attachment induces virion internalization predominantly through
CC caveolin-mediated endocytosis. The large envelope protein also assures
CC fusion between virion membrane and endosomal membrane. In its internal
CC conformation the protein plays a role in virion morphogenesis and
CC mediates the contact with the nucleocapsid like a matrix protein.
CC {ECO:0000255|HAMAP-Rule:MF_04075}.
CC -!- FUNCTION: The middle envelope protein plays an important role in the
CC budding of the virion. It is involved in the induction of budding in a
CC nucleocapsid independent way. In this process the majority of envelope
CC proteins bud to form subviral lipoprotein particles of 22 nm of
CC diameter that do not contain a nucleocapsid. {ECO:0000255|HAMAP-
CC Rule:MF_04075}.
CC -!- SUBUNIT: [Isoform L]: In its internal form (Li-HBsAg), interacts with
CC the capsid protein and with the isoform S. Interacts with host
CC chaperone CANX. {ECO:0000250|UniProtKB:P03141}.
CC -!- SUBUNIT: [Isoform M]: Associates with host chaperone CANX through its
CC pre-S2 N glycan; this association may be essential for isoform M proper
CC secretion. {ECO:0000250|UniProtKB:P03141}.
CC -!- SUBUNIT: [Isoform S]: Interacts with isoform L. Interacts with the
CC antigens of satellite virus HDV (HDVAgs); this interaction is required
CC for encapsidation of HDV genomic RNA. {ECO:0000250|UniProtKB:P03141}.
CC -!- SUBCELLULAR LOCATION: Virion membrane {ECO:0000255|HAMAP-
CC Rule:MF_04075}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing, Alternative initiation; Named isoforms=3;
CC Name=L; Synonyms=Large envelope protein, LHB, L-HBsAg;
CC IsoId=O71305-1; Sequence=Displayed;
CC Name=M; Synonyms=Middle envelope protein, MHB, M-HBsAg;
CC IsoId=O71305-2; Sequence=VSP_031471;
CC Name=S; Synonyms=Small envelope protein, SHB, S-HBsAg;
CC IsoId=O71305-3; Sequence=VSP_031470;
CC -!- DOMAIN: The large envelope protein is synthesized with the pre-S region
CC at the cytosolic side of the endoplasmic reticulum and, hence will be
CC within the virion after budding. Therefore the pre-S region is not N-
CC glycosylated. Later a post-translational translocation of N-terminal
CC pre-S and TM1 domains occur in about 50% of proteins at the virion
CC surface. These molecules change their topology by an unknown mechanism,
CC resulting in exposure of pre-S region at virion surface. For isoform M
CC in contrast, the pre-S2 region is translocated cotranslationally to the
CC endoplasmic reticulum lumen and is N-glycosylated. {ECO:0000255|HAMAP-
CC Rule:MF_04075}.
CC -!- PTM: Isoform M is N-terminally acetylated by host at a ratio of 90%,
CC and N-glycosylated by host at the pre-S2 region.
CC {ECO:0000250|UniProtKB:P03138, ECO:0000255|HAMAP-Rule:MF_04075}.
CC -!- PTM: Myristoylated. {ECO:0000255|HAMAP-Rule:MF_04075}.
CC -!- SIMILARITY: Belongs to the orthohepadnavirus major surface antigen
CC family. {ECO:0000255|HAMAP-Rule:MF_04075}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC16906.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC Sequence=AAO74857.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF046996; AAC16905.1; -; Genomic_DNA.
DR EMBL; AF046996; AAC16906.1; ALT_INIT; Genomic_DNA.
DR EMBL; AY226578; AAO74856.1; -; Genomic_DNA.
DR EMBL; AY226578; AAO74857.1; ALT_INIT; Genomic_DNA.
DR RefSeq; YP_009175035.1; NC_028129.1.
DR RefSeq; YP_009175036.1; NC_028129.1.
DR GeneID; 26101581; -.
DR GeneID; 26101584; -.
DR KEGG; vg:26101581; -.
DR KEGG; vg:26101584; -.
DR Proteomes; UP000008599; Genome.
DR Proteomes; UP000163759; Genome.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0075513; P:caveolin-mediated endocytosis of virus by host cell; IEA:UniProtKB-KW.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-UniRule.
DR HAMAP; MF_04075; HBV_HBSAG; 1.
DR InterPro; IPR000349; HBV_HBSAG.
DR Pfam; PF00695; vMSA; 1.
PE 3: Inferred from homology;
KW Acetylation; Alternative initiation; Alternative splicing;
KW Caveolin-mediated endocytosis of virus by host;
KW Fusion of virus membrane with host endosomal membrane;
KW Fusion of virus membrane with host membrane; Glycoprotein;
KW Host-virus interaction; Lipoprotein; Membrane; Myristate; Transmembrane;
KW Transmembrane helix; Viral attachment to host cell;
KW Viral penetration into host cytoplasm; Virion; Virus endocytosis by host;
KW Virus entry into host cell.
FT INIT_MET 1
FT /note="Removed; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT CHAIN 2..391
FT /note="Large envelope protein"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT /id="PRO_0000319294"
FT TOPO_DOM 2..242
FT /note="Intravirion; in internal conformation"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TOPO_DOM 2..170
FT /note="Virion surface; in external conformation"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TRANSMEM 171..191
FT /note="Helical; Name=TM1; Note=In external conformation"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TOPO_DOM 192..242
FT /note="Intravirion; in external conformation"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TRANSMEM 243..263
FT /note="Helical; Name=TM2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TOPO_DOM 264..339
FT /note="Virion surface"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TRANSMEM 340..360
FT /note="Helical"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TOPO_DOM 361..366
FT /note="Intravirion"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TRANSMEM 367..389
FT /note="Helical; Name=TM3"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT TOPO_DOM 390..391
FT /note="Virion surface"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT REGION 2..163
FT /note="Pre-S"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT REGION 2..108
FT /note="Pre-S1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT REGION 73..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 109..163
FT /note="Pre-S2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT LIPID 2
FT /note="N-myristoyl glycine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT CARBOHYD 311
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04075"
FT VAR_SEQ 1..163
FT /note="Missing (in isoform S)"
FT /evidence="ECO:0000305"
FT /id="VSP_031470"
FT VAR_SEQ 1..108
FT /note="Missing (in isoform M)"
FT /evidence="ECO:0000305"
FT /id="VSP_031471"
FT MOD_RES O71305-2:1
FT /note="N-acetylmethionine"
FT /evidence="ECO:0000250|UniProtKB:P03138"
SQ SEQUENCE 391 AA; 42557 MW; BB69E0686FBA1E9D CRC64;
MGLNQSTFNP LGFFPSHQLD PLFKANAGSA DWDKNPNKDP WPQAHDTAVG AFGPGLVPPH
GGLLGWSSQA QGLSVTVPDT PPPPSTNRDK GRKPTPATPP LRDTHPQAMT WNTSSFQSYL
QNPKVRGLYF PAGGSTSSIV NPVPTTASTT SSSFSTTGVP VSTMDITSSG FLGPLLALQA
VFFLLTKILT MPQSLDSLWT SLNFLGGTPA CPGLNSQSPT SSHSPTCCPP TCPGYRWMCL
RRSIIFLFIL LLCLIFLLVL LDYQGMLPVC PLLPTVTGTT TTTGPCRTCT PIVPGISSYP
SCCCTKPTDG NCTCIPIPSS WAFAKFLWDW ALARFSWLNS LLPFVQWFAG LSPTVWLLVI
WMMWFWGPSL FSILSPFLPL LPLFFWLWAY I