CAPSD_HEVMG
ID CAPSD_HEVMG Reviewed; 660 AA.
AC Q6J8F7; O36613; Q6J8G3;
DT 20-MAY-2008, integrated into UniProtKB/Swiss-Prot.
DT 05-JUL-2004, sequence version 1.
DT 29-SEP-2021, entry version 52.
DE RecName: Full=Capsid protein;
DE AltName: Full=Protein ORF2;
DE Short=pORF2;
DE Flags: Precursor;
GN ORFNames=ORF2;
OS Hepatitis E virus genotype 3 (isolate Swine/United States/swUS1) (HEV-3)
OS (Hepatitis E virus genotype 3 (isolate Swine/United States/Meng)).
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
OC Hepelivirales; Hepeviridae; Orthohepevirus; Hepatitis E virus.
OX NCBI_TaxID=512345;
OH NCBI_TaxID=69079; Bandicota bengalensis (lesser bandicoot rat).
OH NCBI_TaxID=9481; Callithrix.
OH NCBI_TaxID=9536; Cercopithecus hamlyni (Owl-faced monkey) (Hamlyn's monkey).
OH NCBI_TaxID=9534; Chlorocebus aethiops (Green monkey) (Cercopithecus aethiops).
OH NCBI_TaxID=9606; Homo sapiens (Human).
OH NCBI_TaxID=9539; Macaca (macaques).
OH NCBI_TaxID=10090; Mus musculus (Mouse).
OH NCBI_TaxID=9598; Pan troglodytes (Chimpanzee).
OH NCBI_TaxID=9520; Saimiri (squirrel monkeys).
OH NCBI_TaxID=9823; Sus scrofa (Pig).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=9275216; DOI=10.1073/pnas.94.18.9860;
RA Meng X.J., Purcell R.H., Halbur P.G., Lehman J.R., Webb D.M., Tsareva T.S.,
RA Haynes J.S., Thacker B.J., Emerson S.U.;
RT "A novel virus in swine is closely related to the human hepatitis E
RT virus.";
RL Proc. Natl. Acad. Sci. U.S.A. 94:9860-9865(1997).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=9811705; DOI=10.1128/jvi.72.12.9714-9721.1998;
RA Meng X.J., Halbur P.G., Shapiro M.S., Govindarajan S., Bruna J.D.,
RA Mushahwar I.K., Purcell R.H., Emerson S.U.;
RT "Genetic and experimental evidence for cross-species infection by swine
RT hepatitis E virus.";
RL J. Virol. 72:9714-9721(1998).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RC STRAIN=Isolate pSHEV-1, Isolate pSHEV-2, and Isolate pSHEV-3;
RX PubMed=15650181; DOI=10.1128/jvi.79.3.1552-1558.2005;
RA Huang Y.W., Haqshenas G., Kasorndorkbua C., Halbur P.G., Emerson S.U.,
RA Meng X.J.;
RT "Capped RNA transcripts of full-length cDNA clones of swine hepatitis E
RT virus are replication competent when transfected into Huh7 cells and
RT infectious when intrahepatically inoculated into pigs.";
RL J. Virol. 79:1552-1558(2005).
CC -!- FUNCTION: Major viral capsid protein that encapsidates the viral
CC genome. Binds to the 5' end of the genomic RNA (By similarity).
CC {ECO:0000250}.
CC -!- SUBUNIT: Homodimers. Homooligomer. Self-assembles to form the capsid.
CC The capsid is dominated by dimers that define the 30 morphological
CC units. The unglycosylated form interacts with the phosphorylated ORF3
CC protein (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000305}. Host cytoplasm
CC {ECO:0000250}. Host cell surface {ECO:0000250}. Note=Initially
CC cotranslationally translocated into the ER from where it is
CC retrotranslocated to the cytoplasm. A fraction is also observed on the
CC cell surface (By similarity). {ECO:0000250}.
CC -!- PTM: Glycosylated when overexpressed in mammalian cells. In vivo, the
CC glycosylated form is probably much less stable than the non-
CC glycosylated form, which is present in the cytosol and represents the
CC major product accumulated in the cell. May be present initially as a
CC glycosylated protein in the ER, and may become unglycosylated and
CC retrotranslocated to the cytoplasm by the endoplasmic reticulum-
CC associated degradation (ERAD) system. The non-glycosylated form may
CC therefore be the authentic intermediate in HEV capsid assembly (By
CC similarity). {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the hepevirus capsid protein family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF082843; AAC97210.1; -; Genomic_RNA.
DR EMBL; AY575857; AAT40995.1; -; Genomic_RNA.
DR EMBL; AY575858; AAT40998.1; -; Genomic_RNA.
DR EMBL; AY575859; AAT41001.1; -; Genomic_RNA.
DR SMR; Q6J8F7; -.
DR Proteomes; UP000001028; Genome.
DR Proteomes; UP000008858; Genome.
DR Proteomes; UP000008859; Genome.
DR Proteomes; UP000008989; Genome.
DR GO; GO:0030430; C:host cell cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0044228; C:host cell surface; IEA:UniProtKB-SubCell.
DR GO; GO:0039615; C:T=1 icosahedral viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR Gene3D; 2.60.120.20; -; 1.
DR InterPro; IPR004261; SP2.
DR InterPro; IPR029053; Viral_coat.
DR Pfam; PF03014; SP2; 1.
PE 3: Inferred from homology;
KW Capsid protein; Host cytoplasm; RNA-binding; Signal;
KW T=1 icosahedral capsid protein; Virion.
FT SIGNAL 1..23
FT /evidence="ECO:0000255"
FT CHAIN 24..660
FT /note="Capsid protein"
FT /id="PRO_0000334536"
FT REGION 19..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 64..125
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 368..394
FT /note="particle formation"
FT /evidence="ECO:0000250"
FT REGION 585..610
FT /note="Oligomerization"
FT /evidence="ECO:0000250"
FT VARIANT 51
FT /note="F -> L (in strain: Isolate pSHEV-1)"
FT VARIANT 59
FT /note="T -> A (in strain: Isolate pSHEV-1)"
FT VARIANT 390
FT /note="S -> L (in strain: Isolate pSHEV-1)"
FT CONFLICT 30
FT /note="R -> C (in Ref. 1 and 2; AAC97210)"
FT /evidence="ECO:0000305"
FT CONFLICT 74
FT /note="A -> V (in Ref. 1 and 2; AAC97210)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 660 AA; 70992 MW; E4406AC35EF3D49A CRC64;
MRPRAVLLLL FVLLPMLPAP PAGQPSGRRR GRRNGGAGGG FWGDRVDSQP FALPYIHPTN
PFAADVVSQP GAGARPRQPP RPLGSAWRDQ SQRPSTAPRR RSAPAGAAPL TAVSPAPDTA
PVPDVDSRGA ILRRQYNLST SPLTSSVAAG TNLVLYAAPL NPLLPLQDGT NTHIMATEAS
NYAQYRVVRA TIRYRPLVPN AVGGYAISIS FWPQTTTTPT SVDMNSITST DVRILVQPGI
ASELVIPSER LHYRNQGWRS VETTGVAEEE ATSGLVMLCI HGSPVNSYTN TPYTGALGLL
DFALELEFRN LTPGNTNTRV SRYTSTARHR LRRGADGTAE LTTTAATRFM KDLHFTGTNG
VGEVGRGIAL TLFNLADTLL GGLPTELISS AGGQLFYSRP VVSANGEPTV KLYTSVENAQ
QDKGITIPHD IDLGDSRVVI QDYDNQHEQD RPTPSPAPSR PFSVLRANDV LWLSLTAAEY
DQTTYGSSTN PMYVSDTVTL VNVATGAQAV ARSLDWSKVT LDGRPLTTIQ QYSKTFYVLP
LRGKLSFWEA GTTKAGYPYN YNTTASDQIL IENAAGHRVA ISTYTTSLGA GPTSISAVGV
LAPHSALAVL EDTVDYPARA HTFDDFCPEC RTLGLQGCAF QSTIAELQRL KMKVGKTRES