SPIKE_CVPIA
ID SPIKE_CVPIA Reviewed; 1349 AA.
AC Q8JSP8;
DT 17-APR-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2002, sequence version 1.
DT 03-AUG-2022, entry version 89.
DE RecName: Full=Spike glycoprotein {ECO:0000255|HAMAP-Rule:MF_04099};
DE Short=S glycoprotein {ECO:0000255|HAMAP-Rule:MF_04099};
DE AltName: Full=E2 {ECO:0000255|HAMAP-Rule:MF_04099};
DE AltName: Full=Peplomer protein {ECO:0000255|HAMAP-Rule:MF_04099};
DE Contains:
DE RecName: Full=Spike protein S1 {ECO:0000255|HAMAP-Rule:MF_04099};
DE Contains:
DE RecName: Full=Spike protein S2 {ECO:0000255|HAMAP-Rule:MF_04099};
DE Contains:
DE RecName: Full=Spike protein S2' {ECO:0000255|HAMAP-Rule:MF_04099};
DE Flags: Precursor;
GN Name=S {ECO:0000255|HAMAP-Rule:MF_04099};
OS Porcine hemagglutinating encephalomyelitis virus (strain IAF-404) (HEV).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Betacoronavirus; Embecovirus.
OX NCBI_TaxID=230236;
OH NCBI_TaxID=9823; Sus scrofa (Pig).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=12237422; DOI=10.1099/0022-1317-83-10-2411;
RA Sasseville A.M.-J., Boutin M., Gelinas A.-M., Dea S.;
RT "Sequence of the 3'-terminal end (8.1 kb) of the genome of porcine
RT haemagglutinating encephalomyelitis virus: comparison with other
RT haemagglutinating coronaviruses.";
RL J. Gen. Virol. 83:2411-2416(2002).
CC -!- FUNCTION: [Spike protein S1]: Attaches the virion to the cell membrane
CC by interacting with host receptor, initiating the infection.
CC {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- FUNCTION: [Spike protein S2]: Mediates fusion of the virion and
CC cellular membranes by acting as a class I viral fusion protein. Under
CC the current model, the protein has at least three conformational
CC states: pre-fusion native state, pre-hairpin intermediate state, and
CC post-fusion hairpin state. During viral and target cell membrane
CC fusion, the coiled coil regions (heptad repeats) assume a trimer-of-
CC hairpins structure, positioning the fusion peptide in close proximity
CC to the C-terminal region of the ectodomain. The formation of this
CC structure appears to drive apposition and subsequent fusion of viral
CC and target cell membranes. {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- FUNCTION: [Spike protein S2']: Acts as a viral fusion peptide which is
CC unmasked following S2 cleavage occurring upon virus endocytosis.
CC {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- SUBUNIT: Homotrimer; each monomer consists of a S1 and a S2 subunit.
CC The resulting peplomers protrude from the virus surface as spikes.
CC {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- SUBCELLULAR LOCATION: Virion membrane {ECO:0000255|HAMAP-
CC Rule:MF_04099}; Single-pass type I membrane protein {ECO:0000255|HAMAP-
CC Rule:MF_04099}. Host endoplasmic reticulum-Golgi intermediate
CC compartment membrane {ECO:0000255|HAMAP-Rule:MF_04099}; Single-pass
CC type I membrane protein {ECO:0000255|HAMAP-Rule:MF_04099}. Host cell
CC membrane {ECO:0000255|HAMAP-Rule:MF_04099}; Single-pass type I membrane
CC protein {ECO:0000255|HAMAP-Rule:MF_04099}. Note=Accumulates in the
CC endoplasmic reticulum-Golgi intermediate compartment, where it
CC participates in virus particle assembly. Some S oligomers are
CC transported to the host plasma membrane, where they may mediate cell-
CC cell fusion. {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- DOMAIN: Fusion peptide 1 (FP1) and fusion peptide 2 (FP2) function
CC cooperatively and have a membrane-ordering effect on lipid headgroups
CC and shallow hydrophobic regions of target bilayers. They are considered
CC as two domains of an extended, bipartite FP. The membrane-ordering
CC activity is calcium-dependent and also dependent on correct folding,
CC which is maintained by an internal disulfide bond in FP2.
CC {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- PTM: Specific enzymatic cleavages in vivo yield mature proteins. The
CC precursor is processed into S1 and S2 by host cell furin or another
CC cellular protease to yield the mature S1 and S2 proteins. Additionally,
CC a second cleavage leads to the release of a fusion peptide after viral
CC attachment to host cell receptor. {ECO:0000255|HAMAP-Rule:MF_04099}.
CC -!- PTM: The cytoplasmic Cys-rich domain is palmitoylated. Spike
CC glycoprotein is digested within host endosomes. {ECO:0000255|HAMAP-
CC Rule:MF_04099}.
CC -!- SIMILARITY: Belongs to the betacoronaviruses spike protein family.
CC {ECO:0000255|HAMAP-Rule:MF_04099}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF481863; AAM77000.1; -; Genomic_RNA.
DR SMR; Q8JSP8; -.
DR Proteomes; UP000007543; Genome.
DR GO; GO:0044173; C:host cell endoplasmic reticulum-Golgi intermediate compartment membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-UniRule.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0075509; P:endocytosis involved in viral entry into host cell; IEA:UniProtKB-UniRule.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0019064; P:fusion of virus membrane with host plasma membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0046813; P:receptor-mediated virion attachment to host cell; IEA:UniProtKB-UniRule.
DR CDD; cd21508; HEV_Spike_S1_RBD; 1.
DR CDD; cd21625; MHV-like_Spike_S1_NTD; 1.
DR Gene3D; 2.60.120.960; -; 1.
DR HAMAP; MF_04099; BETA_CORONA_SPIKE; 1.
DR InterPro; IPR032500; bCoV_S1_N.
DR InterPro; IPR042578; BETA_CORONA_SPIKE.
DR InterPro; IPR043607; CoV_S1_C.
DR InterPro; IPR043473; S2_sf_CoV.
DR InterPro; IPR043002; Spike_N_sf.
DR InterPro; IPR044339; Spike_S1_NTD_MHV-like.
DR InterPro; IPR018548; Spike_S1_RBD_bCoV.
DR InterPro; IPR044380; Spike_S1_RBD_HEV.
DR InterPro; IPR036326; Spike_S1_RBD_sf_bCoV.
DR InterPro; IPR002552; Spike_S2_CoV.
DR InterPro; IPR043614; Spike_S2_CoV_C.
DR InterPro; IPR044873; Spike_S2_CoV_HR1.
DR InterPro; IPR044874; Spike_S2_CoV_HR2.
DR Pfam; PF16451; bCoV_S1_N; 1.
DR Pfam; PF09408; bCoV_S1_RBD; 1.
DR Pfam; PF19209; CoV_S1_C; 1.
DR Pfam; PF01601; CoV_S2; 1.
DR Pfam; PF19214; CoV_S2_C; 1.
DR SUPFAM; SSF111474; SSF111474; 2.
DR SUPFAM; SSF143587; SSF143587; 1.
DR PROSITE; PS51921; BCOV_S1_CTD; 1.
DR PROSITE; PS51922; BCOV_S1_NTD; 1.
DR PROSITE; PS51923; COV_S2_HR1; 1.
PE 3: Inferred from homology;
KW Coiled coil; Disulfide bond;
KW Fusion of virus membrane with host endosomal membrane;
KW Fusion of virus membrane with host membrane; Glycoprotein;
KW Host cell membrane; Host membrane; Host-virus interaction; Lipoprotein;
KW Membrane; Palmitate; Signal; Transmembrane; Transmembrane helix;
KW Viral attachment to host cell; Viral envelope protein;
KW Viral penetration into host cytoplasm; Virion; Virulence;
KW Virus entry into host cell.
FT SIGNAL 1..13
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CHAIN 14..1349
FT /note="Spike glycoprotein"
FT /evidence="ECO:0000250"
FT /id="PRO_0000283904"
FT CHAIN 14..754
FT /note="Spike protein S1"
FT /evidence="ECO:0000250"
FT /id="PRO_0000283905"
FT CHAIN 755..1349
FT /note="Spike protein S2"
FT /evidence="ECO:0000250"
FT /id="PRO_0000283906"
FT CHAIN 900..1349
FT /note="Spike protein S2'"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT /id="PRO_0000444088"
FT TOPO_DOM 14..1293
FT /note="Extracellular"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT TRANSMEM 1294..1314
FT /note="Helical"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT TOPO_DOM 1315..1349
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT DOMAIN 15..298
FT /note="BetaCoV S1-NTD"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01270"
FT DOMAIN 329..603
FT /note="BetaCoV S1-CTD"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01269"
FT REGION 900..921
FT /note="Fusion peptide 1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT REGION 919..939
FT /note="Fusion peptide 2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT REGION 1000..1050
FT /note="Heptad repeat 1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT REGION 1244..1282
FT /note="Heptad repeat 2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT COILED 1029..1073
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT COILED 1255..1283
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT MOTIF 1345..1349
FT /note="KxHxx"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT SITE 754..755
FT /note="Cleavage; by host"
FT /evidence="ECO:0000250"
FT SITE 899..900
FT /note="Cleavage"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 22
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 59
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 133
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 198
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 437
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 456
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 512
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 611
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 635
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 662
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 682
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 700
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 725
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 774
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 881
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 923
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1180
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1210
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1220
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1239
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1253
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT CARBOHYD 1274
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
FT DISULFID 21..165
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01270"
FT DISULFID 160..193
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01270"
FT DISULFID 172..252
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01270"
FT DISULFID 286..296
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01270"
FT DISULFID 331..356
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01269"
FT DISULFID 374..427
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01269"
FT DISULFID 386..601
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01269"
FT DISULFID 924..935
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04099"
SQ SEQUENCE 1349 AA; 149649 MW; 78CDD0CD923DAC83 CRC64;
MFFILLITLP SVFAVIGDLK CNTSSINDVD TGVPSISSEV VDVTNGLGTF YVLDRVYLNT
TLLLNGYYPI SGATFRNVAL KGTRLLSTLW FKPPFLSPFN DGIFAKVKNS RFSKHGVIYS
EFPAITIGST FVNTSYSIVV KPHTSFINGN LQGFLQISVC QYTMCEYPQT ICHPNLGNQR
IELWHHDTDV VSCLYRRNFT YDVNADYLYF HFYQEGGTFY AYFTDTGFVT KFLFKLYLGT
VLSHYYVMPL TCDSALSLEY WVTPLTTRQF LLAFDQDGVL YHAVDCASDF MSEIMCKTSS
ITPPTGVYEL NGYTVQPVAT VYRRIPDLPN CDIEAWLNSK TVSSPLNWER KIFSNCNFNM
GRLMSFIQAD SFGCNNIDAS RLYGMCFGSI TIDKFAIPNS RKVDLQVGKS GYLQSFNYKI
DTAVSSCQLY YSLPAANVSV THYNPSSWNR RYGFINQSFG SRGLHDAVYS QQCFNTPNTY
CPCRTSQCIG GAGTGTCPVG TTVRKCFAAV TNATKCTCWC QPDPSTYKGV NAWTCPQSKV
SIQPGQHCPG LGLVEDDCSG NPCTCKPQAF IGWSSETCLQ NGRCNIFANF ILNDVNSGTT
CSTDLQQGNT NITTDVCVNY DLYGITGQGI LIEVNATYYN SWQNLLYDSS GNLYGFRDYL
SNRTFLIRSC YSGRVSAVFH ANSSEPALMF RNLKCSHVFN YTILRQIQLV NYFDSYLGCV
VNAYNNTASA VSTCDLTVGS GYCVDYVTAL RSRRSFTTGY RFTNFEPFAA NLVNDSIEPV
GGLYEIQIPS EFTIGNLEEF IQTSSPKVTI DCATFVCGDY AACRQQLAEY GSFCENINAI
LIEVNELLDT TQLQVANSLM NGVTLSTKIK DGINFNVDDI NFSSVLGCLG SECNRASTRS
AIEDLLFDKV KLSDVGFVQA YNNCTGGAEI RDLICVQSYN GIKVLPPLLS ENQISGYTSA
ATAASLFPPW TAAAGVPFYL NVQYRINGLG VTMDVLSQNQ KLIASAFNNA LDSIQEGFDA
TNSALVKIQA VVNANAEALN NLLQQLSNRF GAISASLQEI LSRLDALEAK AQIDRLINGR
LTALNAYVSQ QLSDSTLVKF SAAQAIEKVN ECVKSQSSRI NFCGNGNHII SLVQNAPYGL
YFIHFSYVPT KYVTAKVSPG LCIAGDIGIS PKSGYFINVN NSWMFTGSGY YYPEPITQNN
VVVMSTCAVN YTKAPDLMLN TSTPNLPDFK EELYQWFKNQ SSLAPDLSFD YINVTFLDLQ
DEMNRLQEAI KVLNHSYINL KDIGTYEYYV KWPWYVWLLI CLAGVVMLVL LFFICCCTGC
GTSCFKKCGG CFDDYTGHQE FVIKTSHDD