R1A_CVHN2
ID R1A_CVHN2 Reviewed; 4441 AA.
AC P0C6U4; Q14EB2;
DT 10-JUN-2008, integrated into UniProtKB/Swiss-Prot.
DT 10-JUN-2008, sequence version 1.
DT 03-AUG-2022, entry version 83.
DE RecName: Full=Replicase polyprotein 1a;
DE Short=pp1a;
DE AltName: Full=ORF1a polyprotein;
DE Contains:
DE RecName: Full=Non-structural protein 1;
DE Short=nsp1;
DE AltName: Full=p28;
DE Contains:
DE RecName: Full=Non-structural protein 2;
DE Short=nsp2;
DE AltName: Full=p65;
DE Contains:
DE RecName: Full=Non-structural protein 3;
DE Short=nsp3;
DE EC=3.4.19.12;
DE EC=3.4.22.69;
DE AltName: Full=PL1-PRO/PL2-PRO;
DE AltName: Full=PL1/PL2;
DE AltName: Full=Papain-like proteinases 1/2;
DE AltName: Full=p210;
DE Contains:
DE RecName: Full=Non-structural protein 4;
DE Short=nsp4;
DE AltName: Full=Peptide HD2;
DE AltName: Full=p44;
DE Contains:
DE RecName: Full=3C-like proteinase;
DE Short=3CL-PRO;
DE Short=3CLp;
DE EC=3.4.22.-;
DE AltName: Full=M-PRO;
DE AltName: Full=nsp5;
DE AltName: Full=p27;
DE Contains:
DE RecName: Full=Non-structural protein 6;
DE Short=nsp6;
DE Contains:
DE RecName: Full=Non-structural protein 7;
DE Short=nsp7;
DE AltName: Full=p10;
DE Contains:
DE RecName: Full=Non-structural protein 8;
DE Short=nsp8;
DE AltName: Full=p22;
DE Contains:
DE RecName: Full=Non-structural protein 9;
DE Short=nsp9;
DE AltName: Full=p12;
DE Contains:
DE RecName: Full=Non-structural protein 10;
DE Short=nsp10;
DE AltName: Full=Growth factor-like peptide;
DE Short=GFL;
DE AltName: Full=p15;
DE Contains:
DE RecName: Full=Non-structural protein 11;
DE Short=nsp11;
GN ORFNames=1a;
OS Human coronavirus HKU1 (isolate N2) (HCoV-HKU1).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Betacoronavirus; Embecovirus.
OX NCBI_TaxID=443240;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=16809319; DOI=10.1128/jvi.00509-06;
RA Woo P.C.Y., Lau S.K.P., Yip C.C.Y., Huang Y., Tsoi H.-W., Chan K.-H.,
RA Yuen K.-Y.;
RT "Comparative analysis of 22 coronavirus HKU1 genomes reveals a novel
RT genotype and evidence of natural recombination in coronavirus HKU1.";
RL J. Virol. 80:7136-7145(2006).
CC -!- FUNCTION: The papain-like proteinase 1 (PL1-PRO) and papain-like
CC proteinase 2 (PL2-PRO) are responsible for the cleavages located at the
CC N-terminus of the replicase polyprotein. In addition, PLP2 possesses a
CC deubiquitinating/deISGylating activity and processes both 'Lys-48'- and
CC 'Lys-63'-linked polyubiquitin chains from cellular substrates.
CC Antagonizes innate immune induction of type I interferon by blocking
CC the phosphorylation, dimerization and subsequent nuclear translocation
CC of host IRF-3 (By similarity). {ECO:0000250}.
CC -!- FUNCTION: [3C-like proteinase]: Responsible for the majority of
CC cleavages as it cleaves the C-terminus of replicase polyprotein at 11
CC sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-
CC [SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-
CC CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function
CC (By similarity). {ECO:0000255|PROSITE-ProRule:PRU00772}.
CC -!- FUNCTION: Nsp7-nsp8 hexadecamer may possibly confer processivity to the
CC polymerase, maybe by binding to dsRNA or by producing primers utilized
CC by the latter. {ECO:0000250}.
CC -!- FUNCTION: Nsp9 is a ssRNA-binding protein. {ECO:0000250}.
CC -!- FUNCTION: [Non-structural protein 1]: Binds to the 40S ribosomal
CC subunit and inhibits host translation. The nsp1-40S ribosome complex
CC further induces an endonucleolytic cleavage near the 5'UTR of host
CC mRNAs, targeting them for degradation. By suppressing host gene
CC expression, nsp1 facilitates efficient viral gene expression in
CC infected cells and evasion from host immune response (By similarity).
CC {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK the two peptides
CC corresponding to the two self-cleavage sites of the SARS 3C-like
CC proteinase are the two most reactive peptide substrates. The enzyme
CC exhibits a strong preference for substrates containing Gln at P1
CC position and Leu at P2 position.; EC=3.4.22.69;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Thiol-dependent hydrolysis of ester, thioester, amide, peptide
CC and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-
CC residue protein attached to proteins as an intracellular targeting
CC signal).; EC=3.4.19.12;
CC -!- SUBUNIT: 3CL-PRO exists as monomer and homodimer. Eight copies of nsp7
CC and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is
CC a dimer. Nsp10 forms a dodecamer (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 3]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 6]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 7]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 8]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 9]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 10]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Name=Replicase polyprotein 1a; Synonyms=pp1a, ORF1a polyprotein;
CC IsoId=P0C6U4-1; Sequence=Displayed;
CC Name=Replicase polyprotein 1ab; Synonyms=pp1ab;
CC IsoId=P0C6X3-1; Sequence=External;
CC -!- DOMAIN: The hydrophobic domains (HD) could mediate the membrane
CC association of the replication complex and thereby alter the
CC architecture of the host cell membrane.
CC -!- PTM: Specific enzymatic cleavages in vivo by its own proteases yield
CC mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically
CC processed (By similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: Isolate N2 belongs to genotype B.
CC -!- MISCELLANEOUS: [Isoform Replicase polyprotein 1a]: Produced by
CC conventional translation.
CC -!- SIMILARITY: Belongs to the coronaviruses polyprotein 1ab family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY884001; -; NOT_ANNOTATED_CDS; Genomic_RNA.
DR SMR; P0C6U4; -.
DR Proteomes; UP000006551; Genome.
DR GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:UniProtKB-EC.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0008242; F:omega peptidase activity; IEA:InterPro.
DR GO; GO:0003968; F:RNA-directed 5'-3' RNA polymerase activity; IEA:InterPro.
DR GO; GO:0003727; F:single-stranded RNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039595; P:induction by virus of catabolism of host mRNA; IEA:UniProtKB-KW.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039648; P:modulation by virus of host protein ubiquitination; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039657; P:suppression by virus of host gene expression; IEA:UniProtKB-KW.
DR GO; GO:0039579; P:suppression by virus of host ISG15-protein conjugation; IEA:UniProtKB-KW.
DR GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR GO; GO:0039548; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of IRF3 activity; IEA:UniProtKB-KW.
DR GO; GO:0019079; P:viral genome replication; IEA:InterPro.
DR GO; GO:0019082; P:viral protein processing; IEA:InterPro.
DR CDD; cd21560; betaCoV-Nsp6; 1.
DR CDD; cd21666; betaCoV_Nsp5_Mpro; 1.
DR CDD; cd21519; cv_beta_Nsp2_MHV-like; 1.
DR CDD; cd21473; cv_Nsp4_TM; 1.
DR CDD; cd21557; Macro_X_Nsp3-like; 1.
DR CDD; cd21467; Ubl1_cv_Nsp3_N-like; 1.
DR CDD; cd21466; Ubl2_cv_PLpro_N_Nsp3-like; 1.
DR Gene3D; 1.10.150.420; -; 1.
DR Gene3D; 1.10.1840.10; -; 1.
DR Gene3D; 1.10.8.1190; -; 2.
DR Gene3D; 1.10.8.370; -; 1.
DR Gene3D; 2.40.10.10; -; 2.
DR Gene3D; 2.40.10.250; -; 1.
DR Gene3D; 2.60.120.1680; -; 1.
DR Gene3D; 3.10.20.350; -; 1.
DR Gene3D; 3.10.20.540; -; 1.
DR Gene3D; 3.30.70.3540; -; 1.
DR Gene3D; 3.40.220.10; -; 1.
DR Gene3D; 3.40.50.11020; -; 1.
DR InterPro; IPR022570; B-CoV_A_NSP1.
DR InterPro; IPR043613; CoV_NSP2_C.
DR InterPro; IPR043611; CoV_NSP3_C.
DR InterPro; IPR032505; CoV_NSP4_C.
DR InterPro; IPR043612; CoV_NSP4_N.
DR InterPro; IPR022733; DPUP_SUD_C_bCoV.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR044371; Macro_X_NSP3-like.
DR InterPro; IPR036333; NSP10_sf_CoV.
DR InterPro; IPR044384; NSP2_MHV-like.
DR InterPro; IPR032592; NSP3_NAB_bCoV.
DR InterPro; IPR042570; NSP3_NAB_bCoV_sf.
DR InterPro; IPR044357; NSP3_Ubl1_dom_CoV.
DR InterPro; IPR044353; Nsp3_Ubl2_dom_CoV.
DR InterPro; IPR038083; NSP3A-like.
DR InterPro; IPR038123; NSP4_C_sf_CoV.
DR InterPro; IPR044367; NSP6_betaCoV.
DR InterPro; IPR043610; NSP6_CoV.
DR InterPro; IPR014828; NSP7_CoV.
DR InterPro; IPR037204; NSP7_sf_CoV.
DR InterPro; IPR014829; NSP8_CoV.
DR InterPro; IPR037230; NSP8_sf_CoV.
DR InterPro; IPR014822; NSP9_CoV.
DR InterPro; IPR036499; NSP9_sf_CoV.
DR InterPro; IPR002705; Pept_C30/C16_B_coronavir.
DR InterPro; IPR013016; Peptidase_C16_CoV.
DR InterPro; IPR008740; Peptidase_C30_CoV.
DR InterPro; IPR043477; Peptidase_C30_dom3_CoV.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043177; PLpro_N_sf_CoV.
DR InterPro; IPR043503; PLpro_palm_finger_dom_CoV.
DR InterPro; IPR043178; PLpro_thumb_sf_CoV.
DR InterPro; IPR018995; RNA_synth_NSP10_CoV.
DR Pfam; PF16251; bCoV_NAR; 1.
DR Pfam; PF09401; CoV_NSP10; 1.
DR Pfam; PF19212; CoV_NSP2_C; 1.
DR Pfam; PF19218; CoV_NSP3_C; 1.
DR Pfam; PF16348; CoV_NSP4_C; 1.
DR Pfam; PF19217; CoV_NSP4_N; 1.
DR Pfam; PF19213; CoV_NSP6; 1.
DR Pfam; PF08716; CoV_NSP7; 1.
DR Pfam; PF08717; CoV_NSP8; 1.
DR Pfam; PF08710; CoV_NSP9; 1.
DR Pfam; PF08715; CoV_peptidase; 1.
DR Pfam; PF11963; DUF3477; 1.
DR Pfam; PF01661; Macro; 1.
DR Pfam; PF01831; Peptidase_C16; 1.
DR Pfam; PF05409; Peptidase_C30; 1.
DR SMART; SM00506; A1pp; 1.
DR SUPFAM; SSF101816; SSF101816; 1.
DR SUPFAM; SSF140367; SSF140367; 1.
DR SUPFAM; SSF143076; SSF143076; 1.
DR SUPFAM; SSF144246; SSF144246; 1.
DR SUPFAM; SSF159936; SSF159936; 1.
DR SUPFAM; SSF50494; SSF50494; 1.
DR SUPFAM; SSF52949; SSF52949; 1.
DR PROSITE; PS51963; BCOV_NSP1_C; 1.
DR PROSITE; PS51942; BCOV_NSP3C_C; 1.
DR PROSITE; PS51994; BCOV_NSP3E_G2M; 1.
DR PROSITE; PS51945; BCOV_NSP3E_NAB; 1.
DR PROSITE; PS51952; COV_EXON_MTASE_COACT; 1.
DR PROSITE; PS51962; COV_NSP1; 1.
DR PROSITE; PS51991; COV_NSP2_C; 1.
DR PROSITE; PS51990; COV_NSP2_M; 1.
DR PROSITE; PS51989; COV_NSP2_N; 1.
DR PROSITE; PS51992; COV_NSP3_Y3; 1.
DR PROSITE; PS51943; COV_NSP3A_UBL; 1.
DR PROSITE; PS51944; COV_NSP3D_UBL; 1.
DR PROSITE; PS51946; COV_NSP4C; 1.
DR PROSITE; PS51949; COV_NSP7; 1.
DR PROSITE; PS51950; COV_NSP8; 1.
DR PROSITE; PS51951; COV_NSP9_SSRNA_BD; 1.
DR PROSITE; PS51442; M_PRO; 1.
DR PROSITE; PS51154; MACRO; 1.
DR PROSITE; PS51124; PEPTIDASE_C16; 2.
PE 3: Inferred from homology;
KW Activation of host autophagy by virus; Decay of host mRNAs by virus;
KW Eukaryotic host gene expression shutoff by virus;
KW Eukaryotic host translation shutoff by virus; Host cytoplasm;
KW Host gene expression shutoff by virus; Host membrane;
KW Host mRNA suppression by virus; Host-virus interaction; Hydrolase;
KW Inhibition of host innate immune response by virus;
KW Inhibition of host interferon signaling pathway by virus;
KW Inhibition of host IRF3 by virus; Inhibition of host ISG15 by virus;
KW Inhibition of host RLR pathway by virus; Membrane; Metal-binding;
KW Modulation of host ubiquitin pathway by viral deubiquitinase;
KW Modulation of host ubiquitin pathway by virus; Protease; Repeat;
KW Ribosomal frameshifting; RNA-binding; Thiol protease; Transmembrane;
KW Transmembrane helix; Ubl conjugation pathway; Viral immunoevasion; Zinc;
KW Zinc-finger.
FT CHAIN 1..4441
FT /note="Replicase polyprotein 1a"
FT /id="PRO_0000338206"
FT CHAIN 1..222
FT /note="Non-structural protein 1"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338207"
FT CHAIN 223..809
FT /note="Non-structural protein 2"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338208"
FT CHAIN 810..2808
FT /note="Non-structural protein 3"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338209"
FT CHAIN 2809..3304
FT /note="Non-structural protein 4"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338210"
FT CHAIN 3305..3607
FT /note="3C-like proteinase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338211"
FT CHAIN 3608..3894
FT /note="Non-structural protein 6"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338212"
FT CHAIN 3895..3986
FT /note="Non-structural protein 7"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338213"
FT CHAIN 3987..4180
FT /note="Non-structural protein 8"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338214"
FT CHAIN 4181..4290
FT /note="Non-structural protein 9"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338215"
FT CHAIN 4291..4427
FT /note="Non-structural protein 10"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338216"
FT CHAIN 4428..4441
FT /note="Non-structural protein 11"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338217"
FT TRANSMEM 2196..2216
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2257..2277
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2288..2308
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2371..2391
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2413..2433
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2814..2834
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3089..3109
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3121..3141
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3148..3168
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3173..3193
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3621..3641
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3646..3666
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3671..3691
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3714..3734
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3742..3762
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3770..3790
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3813..3833
FT /note="Helical"
FT /evidence="ECO:0000255"
FT DOMAIN 54..174
FT /note="CoV Nsp1 globular"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01307"
FT DOMAIN 192..222
FT /note="BetaCoV Nsp1 C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01308"
FT DOMAIN 226..488
FT /note="CoV Nsp2 N-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT DOMAIN 493..681
FT /note="CoV Nsp2 middle"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01334"
FT DOMAIN 697..809
FT /note="CoV Nsp2 C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01335"
FT DOMAIN 811..923
FT /note="Ubiquitin-like 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00214"
FT REPEAT 945..954
FT /note="1"
FT REPEAT 955..964
FT /note="2"
FT REPEAT 965..974
FT /note="3"
FT REPEAT 975..984
FT /note="4"
FT REPEAT 985..994
FT /note="5"
FT REPEAT 995..1004
FT /note="6"
FT REPEAT 1005..1014
FT /note="7"
FT REPEAT 1015..1024
FT /note="8"
FT REPEAT 1025..1034
FT /note="9"
FT REPEAT 1035..1044
FT /note="10"
FT REPEAT 1045..1054
FT /note="11"
FT DOMAIN 1093..1343
FT /note="Peptidase C16 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT DOMAIN 1321..1492
FT /note="Macro"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00490"
FT DOMAIN 1548..1619
FT /note="DPUP"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01289"
FT DOMAIN 1619..1674
FT /note="Ubiquitin-like 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00214"
FT DOMAIN 1688..1948
FT /note="Peptidase C16 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT DOMAIN 1962..2063
FT /note="Nucleic acid-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01290"
FT DOMAIN 2078..2227
FT /note="G2M"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01338"
FT DOMAIN 2705..2808
FT /note="CoV Nsp3 Y3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01336"
FT DOMAIN 3207..3304
FT /note="Nsp4C"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01291"
FT DOMAIN 3305..3607
FT /note="Peptidase C30"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT DOMAIN 3895..3983
FT /note="RdRp Nsp7 cofactor"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01294"
FT DOMAIN 3984..4180
FT /note="RdRp Nsp8 cofactor"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01295"
FT DOMAIN 4181..4290
FT /note="Nsp9 ssRNA-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01296"
FT DOMAIN 4291..4428
FT /note="ExoN/MTase coactivator"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT ZN_FING 1208..1236
FT /note="C4-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ZN_FING 1805..1841
FT /note="C4-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ZN_FING 4364..4380
FT /evidence="ECO:0000250"
FT ZN_FING 4406..4419
FT /evidence="ECO:0000250"
FT REGION 365..389
FT /note="C4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT REGION 945..1054
FT /note="11 X 10 AA tandem repeat of N-[DN]-D-E-D-V-V-T-G-D"
FT REGION 947..1036
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2196..2433
FT /note="HD1"
FT /evidence="ECO:0000250"
FT REGION 2814..3193
FT /note="HD2"
FT /evidence="ECO:0000250"
FT REGION 3621..3833
FT /note="HD3"
FT /evidence="ECO:0000250"
FT ACT_SITE 1131
FT /note="For PL1-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1282
FT /note="For PL1-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1727
FT /note="For PL2-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1884
FT /note="For PL2-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 3345
FT /note="For 3CL-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT ACT_SITE 3449
FT /note="For 3CL-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT BINDING 365
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 370
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 386
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 389
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 4364
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4367
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4373
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4380
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4406
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4409
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4417
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4419
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT SITE 222..223
FT /note="Cleavage; by PL1-PRO"
FT /evidence="ECO:0000250"
FT SITE 809..810
FT /note="Cleavage; by PL1-PRO"
FT /evidence="ECO:0000250"
FT SITE 2808..2809
FT /note="Cleavage; by PL2-PRO"
FT /evidence="ECO:0000250"
FT SITE 3304..3305
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3607..3608
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3894..3895
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3986..3987
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4180..4181
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4290..4291
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4427..4428
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
SQ SEQUENCE 4441 AA; 499423 MW; 9BED41AF10453162 CRC64;
MIKTSKYGLG FKWAPEFRWL LPDAAEELAS PMKSDEGGLC PSTGQAMESV GFVYDNHVKI
DCRCILGQEW HVQSNLIRDI FVHEDLHVVE VLTKTAVKSG TAILIKSPLH SLGGFPKGYV
MGLFRSYKTK RYVVHHLSMT TSTTNFGEDF LGWIVPFGFM PSYVHKWFQF CRLYIEESDL
IISNFKFDDY DFSVEDAYAE VHAEPKGKYS QKAYALLRQY RGIKPVLFVD QYGCDYSGKL
ADCLQAYGHY SLQDMRQKQS VWLANCDFDI VVAWHVVRDS RFVMRLQTIA TICGIKYVAQ
PTEDVVDGAV VIREPVHLLS ADAIVLKLPS LMKVMTHMDD FSIKSIYNVD LCDCGFVMQY
GYVDCFNDNC DFYGWVSGNM MDGFSCPLCC TVYDSSEVKA QSSGVIPENP VLFTNSTDTV
NPDSFNLYGY SVTPFGSCIY WSPRPGLWIP IIKSSVKSYD DLVYSGVVGC KSIVKETALI
THALYLDYVQ CKCGNLEQNH ILGVNNSWCR QLLLNRGDYN MLLKNIDLFV KRRADFACKF
AVCGDGFVPF LLDGLIPRSY YLIQSGIFFT SLMSQFSQEV SDMCLKMCIL FMDRVSVATF
YIEHYVNRLV TQFKLLGTTL VNKMVNWFNT MLDASAPATG WLLYQLLNGL FVVSQANFNF
VALIPDYAKI LVNKFYTFFK LLLECVTVDV LKDMPVLKTI NGLVCIVGNK FYNVSTGLIP
GFVLPCNAQE QQIYFFEGVA ESVIVEDDVI ENVKSSLSSY EYCQPPKSVE KICIIDNMYM
GKCGDKFFPI VMNDKNICLL DQAWRFPCAG RKVNFNEKPV VMEIPSLMTV KVMFDLDSTF
DDILGKVCSE FEVEKGVTVD DFVAVVCDAI ENALNSCKDH PVVGYQVRAF LNKLNENVVY
LFDEAGDEAM ASRMYCTFAI EDVEDVISSE AVEDTIDGVV EDTINDDEDV VTGDNDDEDV
VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV
VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEEI VTGDNDDQIV VTGDDVDDIE
SVYDFDTYKA LLVFNDVYND ALFVSYGSSV ETETYFKVNG LWSPTITHTN CWLRSVLLVM
QKLPFKFKDL AIENMWLSYK VGYNQSFVDY LLTTIPKAIV LPQGGYVADF AYWFLNQFDI
NAYANWCCLK CGFSFDLNGL DAVFFYGDIV SHVCKCGHNM TLIAADLPCT LHFSLFDDNF
CAFCTPKKIF IAACAVDVNV CHSVAVIGDE QIDGKFVTKF SGDKFDFIVG YGMSFSMSSF
ELAQLYGLCI TPNVCFVKGD IINVARLVKA DVIVNPANGH MLHGGGVAKA IAVAAGKKFS
KETAAMVKSK GVCQVGDCYV STGGKLCKTI LNIVGPDARQ DGRQSYVLLA RAYKHLNNYD
CCLSTLISAG IFSVPADVSL TYLLGVVDKQ VILVSNNKED FDIIQKCQIT SVVGTKALAV
RLTANVGRVI KFETDAYKLF LSGDDCFVSN SSVIQEVLLL RHDIQLNNDV RDYLLSKMTS
LPKDWRLINK FDVINGVKTV KYFECPNSIY ICSQGKDFGY VCDGSFYKAT VNQVCVLLAK
KIDVLLTVDG VNFKSISLTV GEVFGKILGN VFCDGIDVTK LKCSDFYADK ILYQYENLSL
ADISAVQSSF GFDQQQLLAY YNFLTVCKWS VVVNGPFFSF EQSHNNCYVN VACLMLQHIN
LKFNKWQWQE AWYEFRAGRP HRLVALVLAK GHFKFDEPSD ATDFIRVVLK QADLSGAICE
LELICDCGIK QESRVGVDAV MHFGTLAKTD LFNGYKIGCN CAGRIVHCTK LNVPFLICSN
TPLSKDLPDD VVAANMFMGV GVGHYTHLKC GSPYQHYDAC SVKKYTGVSG CLTDCLYLKN
LTQTFTSMLT NYFLDDVEMV AYNPDLSQYY CDNGKYYTKP IIKAQFKPFA KVDGVYTNFK
LVGHDICAQL NDKLGFNVDL PFVEYKVTVW PVATGDVVLA SDDLYVKRYF KGCETFGKPV
IWLCHDEASL NSLTYFNKPS FKSENRYSVL SVDSVSEESQ GNVVTSVMES QISTKEVKLK
GVRKTVKIED AIIVNDENSS IKVVKSLSLV DVWDMYLTGC DYVVWVANEL SRLVKSPTVR
EYIRYGIKPI TIPIDLLCLR DDNQTLLVPK IFKARAIEFY GFLKWLFIYV FSLLHFTNDK
TIFYTTEIAS KFTFNLFCLA LKNAFQTFRW SIFIKGFLVV ATVFLFWFNF LYINVIFSDF
YLPNISVFPI FVGRIVMWIK ATFGLVTICD FYSKLGVGFT SHFCNGSFIC ELCYSGFDML
DTYAAIDFVQ YEVDRRVLFD YVSLVKLIVE LVIGYSLYTV WFYPLFCLIG LQLFTTWLPD
LFMLETMHWL IRFIVFVANM LPAFVLLRFY IVVTAMYKVV GFIRHIVYGC NKAGCLFCYK
RNCSVRVKCS TIVGGVIRYY DITANGGTGF CVKHQWNCFN CHSFKPGNTF ITVEAAIELS
KELKRPVNPT DASHYVVTDI KQVGCMMRLF YDRDGQRVYD DVDASLFVDI NNLLHSKVKV
VPNLYVVVVE SDADRANFLN AVVFYAQSLY RPILLVDKKL ITTACNGISV TQTMFDVYVD
TFMSHFDVDR KSFNNFVNIA HASLREGVQL EKVLDTFVGC VRKCCSIDSD VETRFITKSM
ISAVAAGLEF TDENYNNLVP TYLKSDNIVA ADLGVLIQNG AKHVQGNVAK AANISCIWFI
DTFNQLTADL QHKLKKACVK TGLKLKLTFN KQEASVPILT TPFSLKGGVV LSNLLYILFF
ISLICFILLW ALLPTYSVYK SDIHLPAYAS FKVIDNGVVR DISVNDLCFA NKFFQFDQWY
ESTFGSFYYH NSMDCPIVVA VMDEDIGSTM FNVPTKVLRH GFHVLHFLTY AFASDSVQCY
TPHIQISYND FYASGCVLSS LCTMFKRGDG TPHPYCYSDG VMKNASLYTS LVPHTRYSLA
NSNGFIRFPD VISEGIVRIV RTRSMTYCRV GACEYAEEGI CFNFNSSWVL NNDYYRSMPG
TFCGRDLFDL FYQFFSSLIR PIDFFSLTAS SIFGAILAIV VVLVFYYLIK LKRAFGDYTS
VVVINVIVWC INFLMLFVFQ VYPICACVYA CFYFYVTLYF PSEISVIMHL QWIVMYGAIM
PFWFCVTYVA MVIANHVLWL FSYCRKIGVN VCNDSTFEET SLTTFMITKD SYCRLKNSVS
DVAYNRYLSL YNKYRYYSGK MDTAAYREAA CSQLAKAMET FNHNNGNDVL YQPPTASVST
SFLQSGIVKM VSPTSKIEPC IVSVTYGSMT LNGLWLDDKV YCPRHVICLS SNMNEPDYSA
LLCRVTLGDF TIMSGRMSLT VVSYQMQGCQ LVLTVSLQNP YTPKYTFGVV KPGETFTVLA
AYNGRPQGAF HVTMRSSYTI KGSFLCGSCG SVGYVLTGDS VKFVYMHQLE LSTGCHTGTD
FTGNFYGPYR DAQVVQLPVK DYVQTVNVIA WLYAAILNNC AWFVQNDVCS IEDFNVWAMT
NGFSQVKADL VLDALASMTG VSIETLLAAI KRLYMGFQGR QILGSCTFED ELAPSDVYQQ
LAGVKLQSKT KRFIKETIYW ILISTFLFSC IISAFVKWTI FMYINTHMIG VTLCVLCFVS
FMMLLVKHKH FYLTMYIIPV LCTLFYVNYL VVYKEGFRGL TYVWLSYFVP AVNFTYVYEV
FYGCILCVFA IFITMHSINH DIFSLMFLVG RIVTLISMWY FGSNLEEDVL LFITAFLGTY
TWTTILSLAI AKIVANWLSV NIFYFTDVPY IKLILLSYLF IGYILSCYWG FFSLLNSVFR
MPMGVYNYKI SVQELRYMNA NGLRPPRNSF EAILLNLKLL GIGGVPVIEV SQIQSKLTDV
KCANVVLLNC LQHLHVASNS RLWQYCSILH NEILSTSDLS VAFDKLAQLL IVLFANPAAV
DTKCLASIDE VSDDYVQDST VLQALQSEFV NMASFVEYEV AKKNLADAKN SGSVNQQQIK
QLEKACNIAK SVYERDKAVA RKLERMADLA LTNMYKEARI NDKKSKVVSA LQTMLFSMVR
KLDNQALNSI LDNAVKGCVP LNAIPALAAN TLTIIIPDKQ VFDKVVDNVY VAYAGSVWHI
QTVQDADGIN KQLTDISVDS NWPLVIIANR YNEVANAVMQ NNELMPHKLK IQVVNSGSDM
NCNIPTQCYY NNGSSGRIVY AVLSDVDGLK YTKIIKDDGN CVVLELDPPC KFSIQDVKGL
KIKYLYFIKG CNTLARGWVV GTLSSTIRLQ AGVATEYAAN SSILSLCAFS VDPKKTYLDY
IQQGGVPIIN CVKMLCDHAG TGMAITIKPE ATINQDSYGG ASVCIYCRAR VEHPDVDGLC
KLRGKFVQVP LGIKDPILYV LTHDVCQVCG FWRDGSCSCV GSGVAVQSKD LNFLNGFGVL
V