R1A_CVHN5
ID R1A_CVHN5 Reviewed; 4421 AA.
AC P0C6U5; Q0ZME9;
DT 10-JUN-2008, integrated into UniProtKB/Swiss-Prot.
DT 10-JUN-2008, sequence version 1.
DT 03-AUG-2022, entry version 92.
DE RecName: Full=Replicase polyprotein 1a;
DE Short=pp1a;
DE AltName: Full=ORF1a polyprotein;
DE Contains:
DE RecName: Full=Non-structural protein 1;
DE Short=nsp1;
DE AltName: Full=p28;
DE Contains:
DE RecName: Full=Non-structural protein 2;
DE Short=nsp2;
DE AltName: Full=p65;
DE Contains:
DE RecName: Full=Non-structural protein 3;
DE Short=nsp3;
DE EC=3.4.19.12;
DE EC=3.4.22.69;
DE AltName: Full=PL1-PRO/PL2-PRO;
DE AltName: Full=PL1/PL2;
DE AltName: Full=Papain-like proteinases 1/2;
DE AltName: Full=p210;
DE Contains:
DE RecName: Full=Non-structural protein 4;
DE Short=nsp4;
DE AltName: Full=Peptide HD2;
DE AltName: Full=p44;
DE Contains:
DE RecName: Full=3C-like proteinase;
DE Short=3CL-PRO;
DE Short=3CLp;
DE EC=3.4.22.-;
DE AltName: Full=M-PRO;
DE AltName: Full=nsp5;
DE AltName: Full=p27;
DE Contains:
DE RecName: Full=Non-structural protein 6;
DE Short=nsp6;
DE Contains:
DE RecName: Full=Non-structural protein 7;
DE Short=nsp7;
DE AltName: Full=p10;
DE Contains:
DE RecName: Full=Non-structural protein 8;
DE Short=nsp8;
DE AltName: Full=p22;
DE Contains:
DE RecName: Full=Non-structural protein 9;
DE Short=nsp9;
DE AltName: Full=p12;
DE Contains:
DE RecName: Full=Non-structural protein 10;
DE Short=nsp10;
DE AltName: Full=Growth factor-like peptide;
DE Short=GFL;
DE AltName: Full=p15;
DE Contains:
DE RecName: Full=Non-structural protein 11;
DE Short=nsp11;
GN ORFNames=1a;
OS Human coronavirus HKU1 (isolate N5) (HCoV-HKU1).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Betacoronavirus; Embecovirus.
OX NCBI_TaxID=443241;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=16809319; DOI=10.1128/jvi.00509-06;
RA Woo P.C.Y., Lau S.K.P., Yip C.C.Y., Huang Y., Tsoi H.-W., Chan K.-H.,
RA Yuen K.-Y.;
RT "Comparative analysis of 22 coronavirus HKU1 genomes reveals a novel
RT genotype and evidence of natural recombination in coronavirus HKU1.";
RL J. Virol. 80:7136-7145(2006).
CC -!- FUNCTION: The papain-like proteinase 1 (PL1-PRO) and papain-like
CC proteinase 2 (PL2-PRO) are responsible for the cleavages located at the
CC N-terminus of the replicase polyprotein. In addition, PLP2 possesses a
CC deubiquitinating/deISGylating activity and processes both 'Lys-48'- and
CC 'Lys-63'-linked polyubiquitin chains from cellular substrates.
CC Antagonizes innate immune induction of type I interferon by blocking
CC the phosphorylation, dimerization and subsequent nuclear translocation
CC of host IRF-3 (By similarity). {ECO:0000250}.
CC -!- FUNCTION: [3C-like proteinase]: Responsible for the majority of
CC cleavages as it cleaves the C-terminus of replicase polyprotein at 11
CC sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-
CC [SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-
CC CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function
CC (By similarity). {ECO:0000255|PROSITE-ProRule:PRU00772}.
CC -!- FUNCTION: Nsp7-nsp8 hexadecamer may possibly confer processivity to the
CC polymerase, maybe by binding to dsRNA or by producing primers utilized
CC by the latter. {ECO:0000250}.
CC -!- FUNCTION: Nsp9 is a ssRNA-binding protein. {ECO:0000250}.
CC -!- FUNCTION: [Non-structural protein 1]: Binds to the 40S ribosomal
CC subunit and inhibits host translation. The nsp1-40S ribosome complex
CC further induces an endonucleolytic cleavage near the 5'UTR of host
CC mRNAs, targeting them for degradation. By suppressing host gene
CC expression, nsp1 facilitates efficient viral gene expression in
CC infected cells and evasion from host immune response (By similarity).
CC {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK the two peptides
CC corresponding to the two self-cleavage sites of the SARS 3C-like
CC proteinase are the two most reactive peptide substrates. The enzyme
CC exhibits a strong preference for substrates containing Gln at P1
CC position and Leu at P2 position.; EC=3.4.22.69;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Thiol-dependent hydrolysis of ester, thioester, amide, peptide
CC and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-
CC residue protein attached to proteins as an intracellular targeting
CC signal).; EC=3.4.19.12;
CC -!- SUBUNIT: 3CL-PRO exists as monomer and homodimer. Eight copies of nsp7
CC and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is
CC a dimer. Nsp10 forms a dodecamer (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 3]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 6]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 7]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 8]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 9]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 10]: Host cytoplasm, host
CC perinuclear region {ECO:0000250}. Note=nsp7, nsp8, nsp9 and nsp10 are
CC localized in cytoplasmic foci, largely perinuclear. Late in infection,
CC they merge into confluent complexes.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Name=Replicase polyprotein 1a; Synonyms=pp1a, ORF1a polyprotein;
CC IsoId=P0C6U5-1; Sequence=Displayed;
CC Name=Replicase polyprotein 1ab; Synonyms=pp1ab;
CC IsoId=P0C6X4-1; Sequence=External;
CC -!- DOMAIN: The hydrophobic domains (HD) could mediate the membrane
CC association of the replication complex and thereby alter the
CC architecture of the host cell membrane.
CC -!- PTM: Specific enzymatic cleavages in vivo by its own proteases yield
CC mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically
CC processed (By similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: Isolate N5 belongs to genotype C. Genotype C probably
CC arose from recombination between genotypes A and B.
CC -!- MISCELLANEOUS: [Isoform Replicase polyprotein 1a]: Produced by
CC conventional translation.
CC -!- SIMILARITY: Belongs to the coronaviruses polyprotein 1ab family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ339101; -; NOT_ANNOTATED_CDS; Genomic_RNA.
DR SMR; P0C6U5; -.
DR PRIDE; P0C6U5; -.
DR Proteomes; UP000001985; Genome.
DR GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:UniProtKB-EC.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0008242; F:omega peptidase activity; IEA:InterPro.
DR GO; GO:0003968; F:RNA-directed 5'-3' RNA polymerase activity; IEA:InterPro.
DR GO; GO:0003727; F:single-stranded RNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039595; P:induction by virus of catabolism of host mRNA; IEA:UniProtKB-KW.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039648; P:modulation by virus of host protein ubiquitination; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039657; P:suppression by virus of host gene expression; IEA:UniProtKB-KW.
DR GO; GO:0039579; P:suppression by virus of host ISG15-protein conjugation; IEA:UniProtKB-KW.
DR GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR GO; GO:0039548; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of IRF3 activity; IEA:UniProtKB-KW.
DR GO; GO:0019079; P:viral genome replication; IEA:InterPro.
DR GO; GO:0019082; P:viral protein processing; IEA:InterPro.
DR CDD; cd21560; betaCoV-Nsp6; 1.
DR CDD; cd21666; betaCoV_Nsp5_Mpro; 1.
DR CDD; cd21519; cv_beta_Nsp2_MHV-like; 1.
DR CDD; cd21473; cv_Nsp4_TM; 1.
DR CDD; cd21557; Macro_X_Nsp3-like; 1.
DR CDD; cd21467; Ubl1_cv_Nsp3_N-like; 1.
DR CDD; cd21466; Ubl2_cv_PLpro_N_Nsp3-like; 1.
DR Gene3D; 1.10.150.420; -; 1.
DR Gene3D; 1.10.1840.10; -; 1.
DR Gene3D; 1.10.8.1190; -; 2.
DR Gene3D; 1.10.8.370; -; 1.
DR Gene3D; 2.40.10.10; -; 2.
DR Gene3D; 2.40.10.250; -; 1.
DR Gene3D; 2.60.120.1680; -; 1.
DR Gene3D; 3.10.20.350; -; 1.
DR Gene3D; 3.10.20.540; -; 1.
DR Gene3D; 3.30.70.3540; -; 1.
DR Gene3D; 3.40.220.10; -; 1.
DR Gene3D; 3.40.50.11020; -; 1.
DR InterPro; IPR022570; B-CoV_A_NSP1.
DR InterPro; IPR043613; CoV_NSP2_C.
DR InterPro; IPR043611; CoV_NSP3_C.
DR InterPro; IPR032505; CoV_NSP4_C.
DR InterPro; IPR043612; CoV_NSP4_N.
DR InterPro; IPR022733; DPUP_SUD_C_bCoV.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR044371; Macro_X_NSP3-like.
DR InterPro; IPR036333; NSP10_sf_CoV.
DR InterPro; IPR044384; NSP2_MHV-like.
DR InterPro; IPR032592; NSP3_NAB_bCoV.
DR InterPro; IPR042570; NSP3_NAB_bCoV_sf.
DR InterPro; IPR044357; NSP3_Ubl1_dom_CoV.
DR InterPro; IPR044353; Nsp3_Ubl2_dom_CoV.
DR InterPro; IPR038083; NSP3A-like.
DR InterPro; IPR038123; NSP4_C_sf_CoV.
DR InterPro; IPR044367; NSP6_betaCoV.
DR InterPro; IPR043610; NSP6_CoV.
DR InterPro; IPR014828; NSP7_CoV.
DR InterPro; IPR037204; NSP7_sf_CoV.
DR InterPro; IPR014829; NSP8_CoV.
DR InterPro; IPR037230; NSP8_sf_CoV.
DR InterPro; IPR014822; NSP9_CoV.
DR InterPro; IPR036499; NSP9_sf_CoV.
DR InterPro; IPR002705; Pept_C30/C16_B_coronavir.
DR InterPro; IPR013016; Peptidase_C16_CoV.
DR InterPro; IPR008740; Peptidase_C30_CoV.
DR InterPro; IPR043477; Peptidase_C30_dom3_CoV.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043177; PLpro_N_sf_CoV.
DR InterPro; IPR043503; PLpro_palm_finger_dom_CoV.
DR InterPro; IPR043178; PLpro_thumb_sf_CoV.
DR InterPro; IPR018995; RNA_synth_NSP10_CoV.
DR Pfam; PF16251; bCoV_NAR; 1.
DR Pfam; PF09401; CoV_NSP10; 1.
DR Pfam; PF19212; CoV_NSP2_C; 1.
DR Pfam; PF19218; CoV_NSP3_C; 1.
DR Pfam; PF16348; CoV_NSP4_C; 1.
DR Pfam; PF19217; CoV_NSP4_N; 1.
DR Pfam; PF19213; CoV_NSP6; 1.
DR Pfam; PF08716; CoV_NSP7; 1.
DR Pfam; PF08717; CoV_NSP8; 1.
DR Pfam; PF08710; CoV_NSP9; 1.
DR Pfam; PF08715; CoV_peptidase; 1.
DR Pfam; PF11963; DUF3477; 1.
DR Pfam; PF01661; Macro; 1.
DR Pfam; PF01831; Peptidase_C16; 1.
DR Pfam; PF05409; Peptidase_C30; 1.
DR SMART; SM00506; A1pp; 1.
DR SUPFAM; SSF101816; SSF101816; 1.
DR SUPFAM; SSF140367; SSF140367; 1.
DR SUPFAM; SSF143076; SSF143076; 1.
DR SUPFAM; SSF144246; SSF144246; 1.
DR SUPFAM; SSF159936; SSF159936; 1.
DR SUPFAM; SSF50494; SSF50494; 1.
DR SUPFAM; SSF52949; SSF52949; 1.
DR PROSITE; PS51963; BCOV_NSP1_C; 1.
DR PROSITE; PS51942; BCOV_NSP3C_C; 1.
DR PROSITE; PS51994; BCOV_NSP3E_G2M; 1.
DR PROSITE; PS51945; BCOV_NSP3E_NAB; 1.
DR PROSITE; PS51952; COV_EXON_MTASE_COACT; 1.
DR PROSITE; PS51962; COV_NSP1; 1.
DR PROSITE; PS51991; COV_NSP2_C; 1.
DR PROSITE; PS51990; COV_NSP2_M; 1.
DR PROSITE; PS51989; COV_NSP2_N; 1.
DR PROSITE; PS51992; COV_NSP3_Y3; 1.
DR PROSITE; PS51943; COV_NSP3A_UBL; 1.
DR PROSITE; PS51944; COV_NSP3D_UBL; 1.
DR PROSITE; PS51946; COV_NSP4C; 1.
DR PROSITE; PS51949; COV_NSP7; 1.
DR PROSITE; PS51950; COV_NSP8; 1.
DR PROSITE; PS51951; COV_NSP9_SSRNA_BD; 1.
DR PROSITE; PS51442; M_PRO; 1.
DR PROSITE; PS51154; MACRO; 1.
DR PROSITE; PS51124; PEPTIDASE_C16; 2.
PE 3: Inferred from homology;
KW Activation of host autophagy by virus; Decay of host mRNAs by virus;
KW Eukaryotic host gene expression shutoff by virus;
KW Eukaryotic host translation shutoff by virus; Host cytoplasm;
KW Host gene expression shutoff by virus; Host membrane;
KW Host mRNA suppression by virus; Host-virus interaction; Hydrolase;
KW Inhibition of host innate immune response by virus;
KW Inhibition of host interferon signaling pathway by virus;
KW Inhibition of host IRF3 by virus; Inhibition of host ISG15 by virus;
KW Inhibition of host RLR pathway by virus; Membrane; Metal-binding;
KW Modulation of host ubiquitin pathway by viral deubiquitinase;
KW Modulation of host ubiquitin pathway by virus; Protease; Repeat;
KW Ribosomal frameshifting; RNA-binding; Thiol protease; Transmembrane;
KW Transmembrane helix; Ubl conjugation pathway; Viral immunoevasion; Zinc;
KW Zinc-finger.
FT CHAIN 1..4421
FT /note="Replicase polyprotein 1a"
FT /id="PRO_0000338218"
FT CHAIN 1..222
FT /note="Non-structural protein 1"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338219"
FT CHAIN 223..809
FT /note="Non-structural protein 2"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338220"
FT CHAIN 810..2788
FT /note="Non-structural protein 3"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338221"
FT CHAIN 2789..3284
FT /note="Non-structural protein 4"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338222"
FT CHAIN 3285..3587
FT /note="3C-like proteinase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338223"
FT CHAIN 3588..3874
FT /note="Non-structural protein 6"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338224"
FT CHAIN 3875..3966
FT /note="Non-structural protein 7"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338225"
FT CHAIN 3967..4160
FT /note="Non-structural protein 8"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338226"
FT CHAIN 4161..4270
FT /note="Non-structural protein 9"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338227"
FT CHAIN 4271..4407
FT /note="Non-structural protein 10"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338228"
FT CHAIN 4408..4421
FT /note="Non-structural protein 11"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338229"
FT TRANSMEM 2176..2196
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2237..2257
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2268..2288
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2351..2371
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2393..2413
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2794..2814
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3069..3089
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3101..3121
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3128..3148
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3153..3173
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3601..3621
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3626..3646
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3651..3671
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3694..3714
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3722..3742
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3750..3770
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3793..3813
FT /note="Helical"
FT /evidence="ECO:0000255"
FT DOMAIN 54..174
FT /note="CoV Nsp1 globular"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01307"
FT DOMAIN 192..222
FT /note="BetaCoV Nsp1 C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01308"
FT DOMAIN 226..488
FT /note="CoV Nsp2 N-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT DOMAIN 493..681
FT /note="CoV Nsp2 middle"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01334"
FT DOMAIN 697..809
FT /note="CoV Nsp2 C-terminal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01335"
FT DOMAIN 811..923
FT /note="Ubiquitin-like 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00214"
FT REPEAT 945..954
FT /note="1"
FT REPEAT 955..964
FT /note="2"
FT REPEAT 965..974
FT /note="3"
FT REPEAT 975..984
FT /note="4"
FT REPEAT 985..994
FT /note="5"
FT REPEAT 995..1004
FT /note="6"
FT REPEAT 1005..1014
FT /note="7"
FT REPEAT 1015..1024
FT /note="8"
FT REPEAT 1025..1034
FT /note="9"
FT DOMAIN 1073..1323
FT /note="Peptidase C16 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT DOMAIN 1301..1472
FT /note="Macro"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00490"
FT DOMAIN 1528..1599
FT /note="DPUP"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01289"
FT DOMAIN 1599..1654
FT /note="Ubiquitin-like 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00214"
FT DOMAIN 1668..1928
FT /note="Peptidase C16 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT DOMAIN 1942..2043
FT /note="Nucleic acid-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01290"
FT DOMAIN 2058..2207
FT /note="G2M"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01338"
FT DOMAIN 2685..2788
FT /note="CoV Nsp3 Y3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01336"
FT DOMAIN 3187..3284
FT /note="Nsp4C"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01291"
FT DOMAIN 3285..3587
FT /note="Peptidase C30"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT DOMAIN 3875..3963
FT /note="RdRp Nsp7 cofactor"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01294"
FT DOMAIN 3964..4160
FT /note="RdRp Nsp8 cofactor"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01295"
FT DOMAIN 4161..4270
FT /note="Nsp9 ssRNA-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01296"
FT DOMAIN 4271..4408
FT /note="ExoN/MTase coactivator"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT ZN_FING 1188..1216
FT /note="C4-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ZN_FING 1785..1821
FT /note="C4-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ZN_FING 4344..4360
FT /evidence="ECO:0000250"
FT ZN_FING 4386..4399
FT /evidence="ECO:0000250"
FT REGION 365..389
FT /note="C4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT REGION 945..1034
FT /note="9 X 10 AA tandem repeat of N-[DN]-D-E-D-V-V-T-G-D"
FT REGION 947..1042
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2176..2413
FT /note="HD1"
FT /evidence="ECO:0000250"
FT REGION 2794..3173
FT /note="HD2"
FT /evidence="ECO:0000250"
FT REGION 3601..3813
FT /note="HD3"
FT /evidence="ECO:0000250"
FT ACT_SITE 1111
FT /note="For PL1-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1262
FT /note="For PL1-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1707
FT /note="For PL2-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 1864
FT /note="For PL2-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00444"
FT ACT_SITE 3325
FT /note="For 3CL-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT ACT_SITE 3429
FT /note="For 3CL-PRO activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00772"
FT BINDING 365
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 370
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 386
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 389
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01333"
FT BINDING 4344
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4347
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4353
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4360
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4386
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4389
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4397
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT BINDING 4399
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01297"
FT SITE 222..223
FT /note="Cleavage; by PL1-PRO"
FT /evidence="ECO:0000250"
FT SITE 809..810
FT /note="Cleavage; by PL1-PRO"
FT /evidence="ECO:0000250"
FT SITE 2788..2789
FT /note="Cleavage; by PL2-PRO"
FT /evidence="ECO:0000250"
FT SITE 3284..3285
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3587..3588
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3874..3875
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 3966..3967
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4160..4161
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4270..4271
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
FT SITE 4407..4408
FT /note="Cleavage; by 3CL-PRO"
FT /evidence="ECO:0000250"
SQ SEQUENCE 4421 AA; 497400 MW; 19FFC89AAA1E43AB CRC64;
MIKTSKYGLG FKWAPEFRWL LPDAAEELAS PMKSDEGGLC PSTGQAMESV GFVYDNHVKI
DCRCILGQEW HVQSNLIRDI FVHEDLHVVE VLTKTAVKSG TAILIKSPLH SLGGFPKGYV
MGLFRSYKTK RYVVHHLSMT TSTTNFGEDF FGWIVPFGFM PSYVHKWFQF CRLYIEESDL
IISNFKFDDY DFSVEAAYAE VHAEPKGKYS QKAYALLRQY RGIKPVLFVD QYGCDYSGKL
ADCLQAYGHY SLQDMRQKQS VWLANCDFDI VVAWHVVRDS RFVMRLQTIA TICGIKYVAQ
PTEDVVDGDV VIREPVHLLS ADAIVLKLPS LMKVMTHMDD FSIKSIYNVD LCDCGFVMQY
GYVDCFNDNC DFYGWVSGNM MDGFSCPLCC TVYDSSEVKA QSSGVIPENP VLFTNSTDTV
NPDSFNLYGY SVTPFGSCIY WSPRPGLWIP IIKSSVKSYD DLVYSGVVGC KSIVKETALI
THALYLDYVQ CKCGNLEQNH ILGVNNSWCR QLLLNRGDYN MLLKNIDLFV KRRADFACKF
AVCGDGFVPF LLDGLIPRSY YLIQSGIFFT SLMSQFSQEV SDMCLKMCIL FMDRVSVATF
YIEHYVNRLV TQFKLLGTTL VNKMVNWFNT MLDASAPATG WLLYQLLNGF FVVSQANLNF
VALIPDYAKI LVNKFYTFFK LLLECVTVDV LKDMPVLKTI NGLVCIVGNK FYNVSTGLIP
GFVLPCNAQE QQIYFFEGVA ESVIVEDDVI ENVKSSLSSY EYCQPPKSVE KICIIDNMYM
GKCGDKFFPI VMNDKNICLL DHAWRFPCAG RKVNFNEKPV VMEIPSLMTV KVMFDLDSTF
DDILGKVCSE FEVEKGVTVD DFVAVVCDAI ENALNSCKEH PVVGYQVRAF LNKLNENVVY
LFDEAGDEAM ASRMYCTFAI EDVEDVISSE AVEDTIDGVV EDTINDDEDV VTGDNDDEDV
VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV
VTGDNNDEDV VTGDNNDEES VTGDNDDQIV VTGDDVDDIE SIYDFDTYKA LLVFNDVYND
ALFVSYGSSV ETETYFKVNG LWSPTITHTN CWLRSVLLVM QKLPFKFKDL AIENMWLSYK
VGYNQSFVDY LLTTIPKAIV LPQGGYVADF AYWFLNQFDI NAYANWCCLK CGFSFDLNGL
DAVFFYGDIV SHVCKCGHNM TLIAADLPCT LHFSLFDDNF CAFCTPKKIF IAACAVDVNV
CHSVAVIGDE QIDGKFVTKF SGDKFDFIVG YGMSFSMSSF ELAQLYGLCI TPNVCFVKGD
IINVARLVKA DVIVNPANGH MLHGGGVAKA IAVAAGKKFS KETAAMVKSK GVCQVGDCYV
STGGKLCKTI LNIVGPDARQ DGRQSYVLLA RAYKHLNNYD CCLSTLISAG IFSVPADVSL
TYLLGVVDKQ VILVSNNKED FDIIQKCQIT SVVGTKALAV RLTANVGRVI KFETDAYKLF
LSGDDCFVSN SSVIQEVLLL RHDIQLNNDV RDYLLSKMTS LPKDWRLINK FDVINGVKTV
KYFECPNSIY ICSQGKDFGY VCDGSFYKAT VNQVCVLLAK KIDVLLTVDG VNFKSISLTV
GEVFGKILGN VFCDGIDVTK LKCSDFYADK ILYQYENLSL ADISAVQSSF GFDQQQLLAY
YNFLTVCKWS VVVNGPFFSF EQSHNNCYVN VACLMLQHIN LKFNKWQWQE AWYEFRAGRP
HRLVALVLAK GHFKFDEPSD ATDFIRVVLK QADLSGAICE LELICDCGIK QESRVGVDAV
MHFGTLAKTD LFNGYKIGCN CAGRIVHCTK LNVPFLICSN TPLSKDLPDD VVAANMFMGV
GVGHYTHLKC GSPYQHYDAC SVKKYTGVSG CLTDCLYLKN LTQTFTSMLT NYFLDDVEMV
AYNPDLSQYY CDNGKYYTKP IIKAQFKPFA KVDGVYTNFK LVGHDICAQL NDKLGFNVDL
PFVEYKVTVW PVATGDVVLA SDDLYVKRYF KGCETFGKPV IWFCHDEASL NSLTYFNKPS
FKSENRYSVL SVDSVSEESQ GNVVTSVMES QISTKEVKLK GVRKTVKIED AIIVNDENSS
IKVVKSLSLV DVWDMYLTGC DYVVWVANEL SRLVKSPTVR EYIRYGIKPI TIPIDLLCLR
DDNQTLLVPK IFKARAIEFY GFLKWLFIYV FSLLHFTNDK TIFYTTEIAS KFTFNLFCLA
LKNAFQTFRW SIFIKGFLVV ATVFLFWFNF LYINVIFSDF YLPNISVFPI FVGRIVMWIK
ATFGLVTICD FYSKLGVGFT SHFCNGSFIC ELCHSGFDML DTYAAIDFVQ YEVDRRVLFD
YVSLVKLIVE LVIGYSLYTV WFYPLFCLIG LQLFTTWLPD LFMLETMHWL IRFIVFVANM
LPAFVLLRFY IVVTAMYKVV GFIRHIVYGC NKAGCLFCYK RNCSVRVKCS TIVGGVIRYY
DITANGGTGF CVKHQWNCFN CHSFKPGNTF ITVEAAIELS KELKRPVNPT DASHYVVTDI
KQVGCMMRLF YDRDGQRVYD DVDASLFVDI NNLLHSKVKV VPNLYVVVVE SDADRANFLN
AVVFYAQSLY RPILLVDKKL ITTACNGISV TQIMFDVYVD TFMSHFDVDR KSFNNFVNIA
HASLREGVQL EKVLDTFVGC VRKCCSIDSD VETRFITKSM ISAVAAGLEF TDENYNNLVP
TYLKSDNIVA ADLGVLIQNG AKHVQGNVAK VANISCIWFI DAFNQLTADL QHKLKKACVK
TGLKLKLTFN KQEASVPILT TPFSLKGGVV LSNLLYILFF ISLICFILLW ALLPTYSVYK
SDIHLPAYAS FKVIDNGVVR DISVNDLCFA NKFFQFDQWY ESTFGSVYYH NSMDCPIVVA
VMDEDIGSTM FNVPTKVLRY GFHVLHFLTY AFASDSVQCY TPHIQISYND FYASGCVLSS
LCTMFKRGDG TPHPYCYTDG VMKNASLYTS LVPHTRYSLA NSNGFIRFPD VISEGIVRIV
RTRSMTYCRV GACEYAEEGI CFNFNSSWVL NNDYYRSMPG TFCGRDFFDL FYQFFSSLIR
PIDFFSLTAS SIFGAILAIV VVLVFYYLIK LKRAFGDYTS VVVINVIVWC INFLMLFVFQ
VYPICACVYA CFYFYVTLYF PSEISVIMHL QWIVMYGAIM PFWFCVTYVA MVIANHVLWL
FSYCRKIGVN VCSDSTFEET SLTTFMITKD SYCRLKNSVS DVAYNRYLSL YNKYRYYSGK
MDTAAYREAA CSQLAKAMET FNHNNGNDVL YQPPTASVST SFLQSGIVKM VSPTSKIEPC
LVSVTYGSMT LNGLWLDDKV YCPRHVICLS SNMNEPDYSA LLCRVTLGDF TIMSGRMSLT
VVSYQMQGCQ LVLTVSLQNP YTPKYTFGVV KPGETFTVLA AYNGRPQGAF HVTMRSSYTI
KGSFLCGSCG SVGYVLTGDS VKFVYMHQLE LSTGCHTGTD FNGNFYGPYR DAQVVQLPVK
DYVQTVNVIA WLYAAILNNC AWFVQNDVCS IEDFNVWAMT NGFSQVKADL VLDALASMTG
VSIETLLAAI KRLYMGFQGR QILGSCTFED ELAPSDVYQQ LAGVKLQSKT KRFIKETIYW
ILISTFLFSC IISAFVKWTI FMYINTHMIG VTLCVLCFVS FMMLLVKHKH FYLTMYIIPV
LCTLFYVNYL VVYKEGFRGF TYVWLSHFVP AVNFTYVYEV FYGCILCVFA IFITMHSINH
DIFSLMFLVG RIVTLISMWY FGSNLEEDVL LFITAFLGTY TWTTILSLAI AKIVANWLSV
NIFYFTDVPY IKLILLSYLF IGYILSCYWG FFSLLNSVFR MPMGVYNYKI SVQELRYMNA
NGLRPPRNSF EAILLNLKLL GIGGVPVIEV SQIQSKLTDV KCANVVLLNC LQHLHVASNS
KLWQYCSVLH NEILSTSDLS VAFDKLAQLL IVLFSNPAAV DTKCLASIDE VSDDYVQDST
VLQALQSEFV NMASFVEYEV AKKNLADAKN SGSVNQQQIK QLEKACNIAK SVYERDKAVA
RKLERMADLA LTNMYKEARI NDKKSKVVSA LQTMLFSMVR KLDNQALNSI LDNAVKGCVP
LSAIPALAAN TLTIIIPDKQ VFDKVVDNVY VTYAGSVWHI QTVQDADGIN KQLTDISVDS
NWPLVIIANR YNEVANAVMQ NNELMPHKLK IQVVNSGSDI NCNIPTQCYY NNVSSGRIVY
AVLSDVDGLK YTKIMKDDGN CVVLELDPPC KFSIQDVKGL KIKYLYFIKG CNTLARGWVV
GTLSSTIRLQ AGVATEYAAN SSILSLCAFS VDPKKTYLDY IQQGGVPIIN CVKMLCDHAG
TGMAITIKPE ATINQDSYGG ASVCIYCRAR VEHPDVDGIC KLRGKFVQVP LGIKDPILYV
LTHDVCQVCG FWRDGSCSCV GSSVAVQSKD LNFLNGLGVL V