CAPSD_WMHBV
ID CAPSD_WMHBV Reviewed; 182 AA.
AC O71303;
DT 18-MAR-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1998, sequence version 1.
DT 29-SEP-2021, entry version 82.
DE RecName: Full=Capsid protein {ECO:0000255|HAMAP-Rule:MF_04076};
DE AltName: Full=Core antigen {ECO:0000255|HAMAP-Rule:MF_04076};
DE AltName: Full=Core protein {ECO:0000255|HAMAP-Rule:MF_04076};
DE AltName: Full=HBcAg {ECO:0000255|HAMAP-Rule:MF_04076};
DE AltName: Full=p21.5 {ECO:0000255|HAMAP-Rule:MF_04076};
GN Name=C {ECO:0000255|HAMAP-Rule:MF_04076};
OS Woolly monkey hepatitis B virus (isolate Louisville) (WMHBV).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Blubervirales; Hepadnaviridae; Orthohepadnavirus.
OX NCBI_TaxID=490134;
OH NCBI_TaxID=9519; Lagothrix lagotricha (Brown woolly monkey) (Humboldt's woolly monkey).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=9576957; DOI=10.1073/pnas.95.10.5757;
RA Lanford R.E., Chavez D., Brasky K.M., Burns R.B. III, Rico-Hesse R.;
RT "Isolation of a hepadnavirus from the woolly monkey, a New World primate.";
RL Proc. Natl. Acad. Sci. U.S.A. 95:5757-5761(1998).
CC -!- FUNCTION: Self assembles to form an icosahedral capsid. Most capsids
CC appear to be large particles with an icosahedral symmetry of T=4 and
CC consist of 240 copies of capsid protein, though a fraction forms
CC smaller T=3 particles consisting of 180 capsid proteins. Entering
CC capsids are transported along microtubules to the nucleus.
CC Phosphorylation of the capsid is thought to induce exposure of nuclear
CC localization signal in the C-terminal portion of the capsid protein
CC that allows binding to the nuclear pore complex via the importin
CC (karyopherin-) alpha and beta. Capsids are imported in intact form
CC through the nuclear pore into the nuclear basket, where it probably
CC binds NUP153. Only capsids that contain the mature viral genome can
CC release the viral DNA and capsid protein into the nucleoplasm. Immature
CC capsids get stuck in the basket. Capsids encapsulate the pre-genomic
CC RNA and the P protein. Pre-genomic RNA is reverse-transcribed into DNA
CC while the capsid is still in the cytoplasm. The capsid can then either
CC be directed to the nucleus, providing more genomes for transcription,
CC or bud through the endoplasmic reticulum to provide new virions.
CC {ECO:0000255|HAMAP-Rule:MF_04076}.
CC -!- SUBUNIT: Homodimerizes, then multimerizes. Interacts with cytosol
CC exposed regions of viral L glycoprotein present in the reticulum-to-
CC Golgi compartment. Interacts with human FLNB. Phosphorylated form
CC interacts with host importin alpha; this interaction depends on the
CC exposure of the NLS, which itself depends upon genome maturation and/or
CC phosphorylation of the capsid protein. Interacts with host NUP153.
CC {ECO:0000255|HAMAP-Rule:MF_04076}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000255|HAMAP-Rule:MF_04076}. Host
CC cytoplasm {ECO:0000255|HAMAP-Rule:MF_04076}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative initiation; Named isoforms=2;
CC Name=Capsid protein;
CC IsoId=O71303-1; Sequence=Displayed;
CC Name=External core antigen;
CC IsoId=P0C6J0-1; Sequence=External;
CC -!- PTM: Phosphorylated by host SRPK1, SRPK2, and maybe protein kinase C or
CC GAPDH. Phosphorylation is critical for pregenomic RNA packaging.
CC Protein kinase C phosphorylation is stimulated by HBx protein and may
CC play a role in transport of the viral genome to the nucleus at the late
CC step during the viral replication cycle. {ECO:0000255|HAMAP-
CC Rule:MF_04076}.
CC -!- SIMILARITY: Belongs to the orthohepadnavirus core antigen family.
CC {ECO:0000255|HAMAP-Rule:MF_04076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF046996; AAC16904.1; -; Genomic_DNA.
DR SMR; O71303; -.
DR Proteomes; UP000008599; Genome.
DR GO; GO:0030430; C:host cell cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0039619; C:T=4 icosahedral viral capsid; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0005198; F:structural molecule activity; IEA:UniProtKB-UniRule.
DR GO; GO:0075521; P:microtubule-dependent intracellular transport of viral material towards nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-UniRule.
DR GO; GO:0075732; P:viral penetration into host nucleus; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.4090.10; -; 1.
DR HAMAP; MF_04076; HBV_HBEAG; 1.
DR InterPro; IPR002006; Hepatitis_core.
DR InterPro; IPR036459; Viral_capsid_core_dom_sf_HBV.
DR Pfam; PF00906; Hepatitis_core; 2.
DR SUPFAM; SSF47852; SSF47852; 1.
PE 3: Inferred from homology;
KW Alternative initiation; Capsid protein;
KW Cytoplasmic inwards viral transport; DNA-binding; Host cytoplasm;
KW Host-virus interaction; Microtubular inwards viral transport;
KW Phosphoprotein; Repeat; RNA-binding; T=4 icosahedral capsid protein;
KW Viral penetration into host nucleus; Virion; Virus entry into host cell.
FT CHAIN 1..182
FT /note="Capsid protein"
FT /id="PRO_0000324381"
FT REPEAT 161..168
FT /note="1"
FT REPEAT 169..176
FT /note="2"
FT REGION 136..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 161..176
FT /note="2 X 8 AA repeats of S-P-R-R-R-[PR]-S-Q"
FT REGION 176..182
FT /note="RNA binding"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04076"
FT MOTIF 157..174
FT /note="Bipartite nuclear localization signal"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04076"
FT COMPBIAS 153..172
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 161
FT /note="Phosphoserine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04076"
FT MOD_RES 169
FT /note="Phosphoserine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04076"
SQ SEQUENCE 182 AA; 20845 MW; 4DDE5AFD12EEFAB3 CRC64;
MDIDPYKEFG ATVELLSFLP ADFFPSVRDL LDTASALYRE ALESSDHCSP HHTALRQTVL
CWGELMSLAS WVGTNLEDPA ARELVVSYVN DNMGLKVRQL LWFHISCLTF GRETVLEYLV
SFWVWIRTPP AYRPPNAPIL STLPETTVVR RRRPSGRRTP SPRRRRSQSP RRRRSQSPAS
SC