R1A_WBV24
ID R1A_WBV24 Reviewed; 4555 AA.
AC Q008X5;
DT 31-MAY-2011, integrated into UniProtKB/Swiss-Prot.
DT 14-NOV-2006, sequence version 1.
DT 03-AUG-2022, entry version 61.
DE RecName: Full=Replicase polyprotein 1a;
DE Short=pp1a;
DE AltName: Full=ORF1a polyprotein;
DE Contains:
DE RecName: Full=Non-structural protein 1;
DE Short=nsp1;
DE Contains:
DE RecName: Full=Non-structural protein 2;
DE Short=nsp2;
DE Contains:
DE RecName: Full=3C-like serine proteinase;
DE Short=3CLSP;
DE EC=3.4.21.-;
DE AltName: Full=M-PRO;
DE AltName: Full=nsp3;
DE AltName: Full=p27;
DE Contains:
DE RecName: Full=Non-structural protein 4;
DE Short=nsp4;
DE Contains:
DE RecName: Full=Non-structural protein 5;
DE Short=nsp5;
DE Contains:
DE RecName: Full=Non-structural protein 6;
DE Short=nsp6;
DE Contains:
DE RecName: Full=Non-structural protein 7;
DE Short=nsp7;
DE Contains:
DE RecName: Full=Non-structural protein 8;
DE Short=nsp8;
GN ORFNames=1a;
OS White bream virus (isolate Blicca bjoerkna L./Germany/DF24/00) (WBV).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Tornidovirineae; Tobaniviridae; Piscanivirinae; Bafinivirus;
OC Blicbavirus.
OX NCBI_TaxID=766180;
OH NCBI_TaxID=58317; Blicca bjoerkna (white bream).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=16987966; DOI=10.1128/jvi.01758-06;
RA Schuetze H., Ulferts R., Schelle B., Bayer S., Granzow H., Hoffmann B.,
RA Mettenleiter T.C., Ziebuhr J.;
RT "Characterization of White bream virus reveals a novel genetic cluster of
RT nidoviruses.";
RL J. Virol. 80:11598-11609(2006).
RN [2]
RP CHARACTERIZATION, ACTIVE SITES, AND MUTAGENESIS OF HIS-3492; ASP-3509;
RP ASP-3518 AND SER-3589.
RX PubMed=21068254; DOI=10.1128/jvi.01716-10;
RA Ulferts R., Mettenleiter T.C., Ziebuhr J.;
RT "Characterization of Bafinivirus main protease autoprocessing activities.";
RL J. Virol. 85:1348-1359(2011).
CC -!- FUNCTION: The 3C-like serine proteinase is responsible for the majority
CC of cleavages.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 1]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 2]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Name=Replicase polyprotein 1a; Synonyms=pp1a, ORF1a polyprotein;
CC IsoId=Q008X5-1; Sequence=Displayed;
CC Name=Replicase polyprotein 1ab; Synonyms=pp1ab;
CC IsoId=Q008X6-1; Sequence=External;
CC -!- DOMAIN: The hydrophobic domains (HD) could mediate the membrane
CC association of the replication complex and thereby alter the
CC architecture of the host cell membrane. {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages in vivo by its own protease yield
CC mature proteins. 3CL-PRO is autocatalytically processed (By
CC similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: [Isoform Replicase polyprotein 1a]: Produced by
CC conventional translation.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ898157; ABI97393.1; -; Genomic_RNA.
DR RefSeq; YP_803214.1; NC_008516.1.
DR MEROPS; S75.001; -.
DR GeneID; 4443108; -.
DR KEGG; vg:4443108; -.
DR Proteomes; UP000000680; Genome.
DR GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd21557; Macro_X_Nsp3-like; 1.
DR Gene3D; 3.40.220.10; -; 1.
DR Gene3D; 3.40.50.150; -; 1.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR044371; Macro_X_NSP3-like.
DR InterPro; IPR041698; Methyltransf_25.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR Pfam; PF01661; Macro; 1.
DR Pfam; PF13649; Methyltransf_25; 1.
DR SMART; SM00506; A1pp; 1.
DR SUPFAM; SSF50494; SSF50494; 1.
DR SUPFAM; SSF52949; SSF52949; 1.
DR SUPFAM; SSF53335; SSF53335; 1.
DR PROSITE; PS51154; MACRO; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 1: Evidence at protein level;
KW Coiled coil; Host membrane; Hydrolase; Membrane; Metal-binding; Protease;
KW Reference proteome; Repeat; Ribosomal frameshifting; Thiol protease;
KW Transmembrane; Transmembrane helix.
FT CHAIN 1..4555
FT /note="Replicase polyprotein 1a"
FT /id="PRO_0000408895"
FT CHAIN 1..3071
FT /note="Non-structural protein 1"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408896"
FT CHAIN 3072..3442
FT /note="Non-structural protein 2"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408897"
FT CHAIN 3443..3709
FT /note="3C-like serine proteinase"
FT /evidence="ECO:0000305"
FT /id="PRO_0000408898"
FT CHAIN 3710..3985
FT /note="Non-structural protein 4"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408899"
FT CHAIN 3986..4157
FT /note="Non-structural protein 5"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408900"
FT CHAIN 4158..4375
FT /note="Non-structural protein 6"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408901"
FT CHAIN 4376..4452
FT /note="Non-structural protein 7"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408902"
FT CHAIN 4453..4555
FT /note="Non-structural protein 8"
FT /evidence="ECO:0000255"
FT /id="PRO_0000408903"
FT TRANSMEM 2542..2564
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2571..2593
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2644..2664
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2744..2764
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2773..2793
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3079..3099
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3247..3267
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3278..3298
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3319..3339
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3725..3745
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3751..3771
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3789..3809
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3813..3833
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3844..3864
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3888..3908
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3911..3931
FT /note="Helical"
FT /evidence="ECO:0000255"
FT DOMAIN 1637..1827
FT /note="Macro"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00490"
FT REGION 159..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 972..1037
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1120..1200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2037..2112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2293..2683
FT /note="HD1"
FT /evidence="ECO:0000250"
FT REGION 2889..3156
FT /note="HD2"
FT /evidence="ECO:0000250"
FT REGION 3546..3732
FT /note="HD3"
FT /evidence="ECO:0000250"
FT COILED 4212..4250
FT /evidence="ECO:0000255"
FT COMPBIAS 159..179
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 978..994
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 999..1017
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1134
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1144..1200
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 3492
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000305|PubMed:21068254"
FT ACT_SITE 3518
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000305|PubMed:21068254"
FT ACT_SITE 3589
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000305|PubMed:21068254"
FT SITE 3071..3072
FT /note="Cleavage; by 3C-like serine proteinase"
FT SITE 3442..3443
FT /note="Cleavage; by 3C-like serine proteinase"
FT SITE 3603
FT /note="Substrate binding"
FT /evidence="ECO:0000255"
FT SITE 3709..3710
FT /note="Cleavage; by 3C-like serine proteinase"
FT SITE 3985..3986
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4157..4158
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4375..4376
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4452..4453
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT MUTAGEN 3492
FT /note="H->A: Loss of proteolytic processing."
FT /evidence="ECO:0000269|PubMed:21068254"
FT MUTAGEN 3509
FT /note="D->A: No effect on proteolytic processing."
FT /evidence="ECO:0000269|PubMed:21068254"
FT MUTAGEN 3518
FT /note="D->A: Reduced proteolytic processing."
FT /evidence="ECO:0000269|PubMed:21068254"
FT MUTAGEN 3584
FT /note="T->A: Reduced proteolytic processing."
FT MUTAGEN 3589
FT /note="S->A: Loss of proteolytic processing."
FT /evidence="ECO:0000269|PubMed:21068254"
FT MUTAGEN 3603
FT /note="H->A: Loss of proteolytic processing."
SQ SEQUENCE 4555 AA; 508987 MW; 06287FC647203FF8 CRC64;
MSILFGNRQA NATKRSDMAS VARAVYEVDL ISTKYARRTQ ERLAHNKHAK PSYPSVFFGR
RMKAVKEPTF TPSTLFFEEA TLPKVLASKA KPDTGIKTRR VYVADSLTIN GHTYPIVGHF
VEMAVSKKEA FPIQPKRVKP KPLMAKPIPN IRRTFLTPEE RTNTPTTPTT TTTTPFVAGE
TAGPTIEYTP TSIDLPFAMP TVKQIKENAH TILREQDDCL RFAQTALFKH LGTVTHTTPN
HATTFQVKGR TSLLTFEWRK TTQSPLTDGH FYLQTANNHA ELMQPVEGKL TTIFTTTIQQ
GTTHSLHLIK QESARTLKTR KPLKLVTYKQ ETPTTTITPQ SLKKTITYIP GSFCINVAEP
TLQSVMRRQP LTPTPNDALL QIYHKLGCTT KSPNHASTFE LFGNTYTWYP VQHTNNLLHK
DPNRRFFLHI TGQTPQLLIR TERKTFLTLQ DEVTYISGKL FVMNHAPIQG EYKTQTAEWV
GSYNMAKTPK AIKPAKTVEY INTTPCHKPA TMPAPITYRQ CPYTWTLHEP SISKVQRNLF
HIPKTATNCL DRIQKALFPE IVTSNQHFPI GFTIQTDTTM QSYEWAILCK KVTRTYYLAV
QNHHATLWYK CAQVYMCLSD DIAPTTELQG SVYILNKVTD PGFYQNTRQW CGSNDDLHEP
KHLIKNLANG DVNNIAHCSL TPWTTTPLVY STQKNKLTRK LIHYYNATYT VQVPKENPNA
QPSTMKCAYK AYIDLATPTE LQIHLDLEIQ GKLYSQTAQK KGSKFNKLDI PTFGDILKGT
LYVFSSKVLL YEQPKRCTSV CFHLPNNAQS FFFNTETIQT FEDLFARISS EEVEDNLVLI
KGFVPCLGAI YITKDLKFIQ PELKEDFKHP TTYYTFTTTV DPEQVLYSLH PSFTDIVPPH
GFPYYTFAKL NNHIDAWNIT DDQADTLAEA QPIIFQWPTE EATITTPYKV LHYEHLEGLD
YISLSSFNTV ECPEETQSAH DSSSESESED EELPAHPLSN APSQASLSSV ASTAPPTSPT
SSPTPSPTPL QQQVGLKGKE VPVGGWVLVS EEETPSEEVD SPKLLPNEVP LSFDFDLPIE
PITRPISPEL QQPILTHYEH PTSPTPSVEI EIDFGSYENL TLQTEETVTT EVQPEPTPAP
TPEPTIVTET VQETPVPTET TQESTPESTP ESTPEPTPES TSESTLEPEH VATPSQSPTH
ITVTEITHEP ETPDSWSERY DSTSNIPEVF NQLSFGSTDS VKITTPKTET PDEPQQPTVE
TVSAAQQLLQ IVQTATPDIA QLMSELPPYR LICIGSYCPI LAENISKQLP TAVTTPTDAD
IPTVIFNVSE ETMDTVINTV KTKHQANHLT FSLTTIIALD VPKDKSLPLQ QIYDKLTQQD
YNTDFIYESH HRQPKESLTH ASVLSAYTAS YKTTAIKSIA DNAVVLDIGY GKGNDGPRYA
VRPLTVTGID TAARMLAIAD QNKPENVTLV KQGFFTHITK TSNTYTHVIA FNSLHYPLAS
SHPDTLVQRL PTCPANILIP CHHLLEGIQT PTYSVVKDED MWCVKVTKNE FIESSYNYDV
FVKALESKYH VTIGSLLDCV EKPSTRSITP TLWTAMRNFV NNDQEMQRIL SGYITFNLTP
LPPKVEIIND WLDNNATVTI NNPFASNEGV TFAVHNIGAI TTTEGEFIVN AANKQLNNGT
GVTGAIFAAH DKELKLTQAI KALPTYGASD KLESHQHVVQ TIIKNNNSTH AINILHAAAP
IKVKCTSKNP EVLLAHNETA QSELKETYKA IVDYAQLNKL THIYLPLFGA GAYGHKPLDS
LEAFLDAMRN RSPQSTTQYT LLLSDPVKPL DNPSFSYEFL NLLVTNLNIN KQFAQLIVNK
YHNTCALQSA IQMNTTTDTH NFLATFIYLL YTMPYSMNTF RQTHTPEEPF TPGKMVTVVD
TTIAFLTTLD ILPPCGQPCG YLPPSITKDG EYICACQKTS NWSLPFHFYN ARYNKVYHTG
LNNILTHKHS AFHKSRNAAH FIAKTGPSTS SYPVYMAPVP EILAYNASYR DSCQDNAIEE
QSDSQASQSP SSPVTIPVSL PTASPASSVK SALRSDIPIT TDQQSTTSAS ISTATTASTI
PTAPLTSSDS NTSVVTSLYG NMEELTYLDA SGTSQDFILS ETTPFIAHIY HNNEATFIPP
GYQLLDTNTN DPIEMYITPP RPIDGSPMIS LASTASTTPM TYPLLSIRLT TEELTSFFKT
KTDKFHLISH KSCLTVHLFD SPTLNSIAAD STSDAHLYQQ HLKDLYTFSD CCSMYTRTEV
YNCIEADTPL IRQSEQTKFH PINLDTLIEM VATFPPIVKR YSQTTTPDFT NLTVYFVSNG
DIITTPTGST SEQPPQLKIF LDYQTSSKFT TLVDLTLHEQ TEANTIITYH HGEHQLLKPN
PSAFYIEFQT YSSFFSRFQT FSTNFFWTLF INFLINVRFC ITADSAYFHW QGKPIETTNL
NIVYSIGRLD FVLSKHTTPW LTKPTDTLNP LTLIKNTLVQ PIAINFHGRI RPLQSTNTRF
GATHTPTKLP VHLLNTSLRT HYLSLLSLFQ CTFSVFLAYI ALLYSFSGHG IFTVVAYFTM
LFARYYITSF INFCTSQLTA TQVTQWFAAI KAKYTGIYES SQDRVLTVNV TGTNVPYIVK
YSTILTVTMY VAFMAFVWTV STYAAQYTAG ERYDRPPYQT VFQKTLNVLG LTETVTYYYP
YASLNEACAA STSILCRLGS PFNFHYPSDY TQVRTVQTDT SSPFWLFIIF MPPSFLFIVL
PWLILCTITP TVSIAQLLVP SIILNATIVF IYIRRKFTGH CCGPHTCIKH ADISRSLQFR
PTSQIQHSLT FCGTLCAKHN WYCNNSDSPT HTLGIQLAQL IETTYKLQPG TIKPDSSYTH
TTETATLPIM KVSTTSTDFT TSEPNTTVEH LHLQVIAHVT GTRISIESSS NKVQQQNTQH
TRLTNKPVTG FMHTTLLQKL KRQHKDELSS YLCNFVPSDN KKDCILPHSV VHMTLTENQR
TFLLKNFTFS TNVTVDPTTT GFIPSSLNIS TLPHKHFMIN VIESAMLAKL PKEVQDTLRT
THLETTTLER QAMSLTTQAI LTTFALIMAT FVVAFLAFFS TAQVGKTPYA GLNPTMVGNV
NAEPYIQPTT LENSILIPLH GASKVCWRAQ NGTLFFTDAI PTTECARAAV PYIGYKSEFT
QTCASSNLRY PFTVYLGSIK VMYLRDGISY LTSTLSHNSN TKKLCVQVGS NAVRCASVLP
TGASSNVAAL LMASVVVISM VLFYLYLLQI FKFYTNSVIM SFVIQLLTLL ATTVSTPLAV
TVQLFVITYG YTNWILLTLS LLNLTVLLST PVGITFVVIY GLYKAYTLFT SSGQGCVYNE
GGTIRFSGSF EQVANSTFPL TNASCVQLLS DLGITYQQLN VYASSRDRNV RRLAQALLHR
QLDSASECIL YEGCSGNTIT RQALQRIRQA VTVVVTPASQ NLCKITSNQA NGIGLSCTGT
FFTSTEIITC AHGIGTSDIT AVHKGITYDC KVKSINNDIA ILITTTVNLS VQNIKLDSSF
SQKSDNYQRN FVQFVSFVDQ QNSDAVTINN TVMLPSGHFF AIGTEAGESG SPYTLNGNII
GIHYGIDNAG SWMLASRPDG SFYVPATQHG NSAKVTFSTD AFAQQFPAIV TNKTSDQLVN
EIAATNNTAF ELDTTDVSHL SNLLKHLKNN NETPKSLSDY LPVAPVTQQT SVTVGQTLTS
PVNNMQTALY TLMLVSEVIT YILTPNSDLS VLISMFVTSA FLKFGASKLF YNTEMLRNTI
TTFVVYRYTT LLIALVFSQY YLHILSYALK LNTLVLTVIA LTFLVTPLVL LTIRRVYYYS
QNYLISCVFI ISCFASHTYT LYILTDTTVD FQTFIISEPI FSTLLCNLFI GFTLISVVPN
PLYVCVVFMY ILLDCEALGF IVTCFMASYL CPKPLRSLTT FLCTDTLVLT APAYLHWYGA
KGTQREYSVI YAVFDSILTP ETTIQIPVTI MEGIEQQVKF IFAVPKSANI QDEEEAYVEY
NQNSDIKDLA IKNEEKVCTI TGMFRKTRTI KGTLMESFYP RHQPDSYILN QVVRHNYVLA
HDPETIILTT VNPTHLESEP FQTIVKKVRK LHALYQEISE QSSDDTELCH AYILALIKST
VLEAQMPTEK INFVSSQTML SPAMILIFAE AYQILTESRS FTNHITPQSD IGSMQTTLAS
LAEMDTDEMT PQERKIHIKR MNVLKQEIAK MESASLKLEK FLDNMHKAEI SKRGKEDILL
KVSNMLRLHL NKVANAAHCT IQTPSAGLIT LASAFDVHSL CVTQHSESVL IQTPDDDTFL
VYVDGQIYTC YNPTDITGKK LVPINVMSPD NVQFPTYPVV FSLSKQDYAE EITEQNNIGY
TERTHNFKLK ELASGLAVTL DGLVVVTETK ELATAFKIGQ RYFKFLNTNK TPVARNNTHA
IIQLLRNNIS QQAVVRIGGS RVSNDHIAIS QVPVQTIGYL TYAGISVCRQ CATKQDHTCQ
YAGYFVQIPR EHVSNIFNLT DTPPCLHNKF TCTTCQPLQQ QSKTQQPPLN WWGSV