R1A_BRV1
ID R1A_BRV1 Reviewed; 4445 AA.
AC P0C6F4; Q3T8J2;
DT 10-JUN-2008, integrated into UniProtKB/Swiss-Prot.
DT 10-JUN-2008, sequence version 1.
DT 03-AUG-2022, entry version 62.
DE RecName: Full=Replicase polyprotein 1a;
DE Short=pp1a;
DE AltName: Full=ORF1a polyprotein;
DE Contains:
DE RecName: Full=Non-structural protein 1;
DE Short=nsp1;
DE Contains:
DE RecName: Full=Non-structural protein 2;
DE Short=nsp2;
DE Contains:
DE RecName: Full=3C-like serine proteinase;
DE Short=3CLSP;
DE EC=3.4.21.-;
DE AltName: Full=M-PRO;
DE AltName: Full=nsp3;
DE AltName: Full=p27;
DE Contains:
DE RecName: Full=Non-structural protein 4;
DE Short=nsp4;
DE Contains:
DE RecName: Full=Non-structural protein 5;
DE Short=nsp5;
DE Contains:
DE RecName: Full=Non-structural protein 6;
DE Short=nsp6;
DE Contains:
DE RecName: Full=Non-structural protein 7;
DE Short=nsp7;
DE Contains:
DE RecName: Full=Non-structural protein 8;
DE Short=nsp8;
DE Contains:
DE RecName: Full=Non-structural protein 9;
DE Short=nsp9;
GN ORFNames=1a;
OS Breda virus 1 (BRV-1).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Tornidovirineae; Tobaniviridae; Torovirinae; Torovirus;
OC Renitovirus.
OX NCBI_TaxID=360393;
OH NCBI_TaxID=9913; Bos taurus (Bovine).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=16137782; DOI=10.1016/j.virusres.2005.07.005;
RA Draker R., Roper R.L., Petric M., Tellier R.;
RT "The complete sequence of the bovine torovirus genome.";
RL Virus Res. 115:56-68(2006).
CC -!- FUNCTION: The 3C-like serine proteinase is responsible for the majority
CC of cleavages.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 1]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 2]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4]: Host membrane
CC {ECO:0000305}; Multi-pass membrane protein {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Name=Replicase polyprotein 1a; Synonyms=pp1a, ORF1a polyprotein;
CC IsoId=P0C6F4-1; Sequence=Displayed;
CC Name=Replicase polyprotein 1ab; Synonyms=pp1ab;
CC IsoId=P0C6V8-1; Sequence=External;
CC -!- DOMAIN: The hydrophobic domains (HD) could mediate the membrane
CC association of the replication complex and thereby alter the
CC architecture of the host cell membrane. {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages in vivo by its own protease yield
CC mature proteins. 3CL-PRO is autocatalytically processed (By
CC similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: [Isoform Replicase polyprotein 1a]: Produced by
CC conventional translation.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY427798; AAS17958.1; -; Genomic_RNA.
DR RefSeq; YP_337906.1; NC_007447.1.
DR MEROPS; S65.001; -.
DR GeneID; 3707765; -.
DR KEGG; vg:3707765; -.
DR Proteomes; UP000000355; Genome.
DR GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd21557; Macro_X_Nsp3-like; 1.
DR Gene3D; 3.40.220.10; -; 1.
DR InterPro; IPR009097; Cyclic_Pdiesterase.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR044371; Macro_X_NSP3-like.
DR InterPro; IPR039573; NS2A-like.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR Pfam; PF05213; Corona_NS2A; 1.
DR Pfam; PF01661; Macro; 1.
DR SMART; SM00506; A1pp; 1.
DR SUPFAM; SSF50494; SSF50494; 1.
DR SUPFAM; SSF52949; SSF52949; 1.
DR SUPFAM; SSF54001; SSF54001; 1.
DR SUPFAM; SSF55144; SSF55144; 1.
DR PROSITE; PS51154; MACRO; 1.
PE 3: Inferred from homology;
KW Host membrane; Hydrolase; Membrane; Metal-binding; Protease;
KW Reference proteome; Repeat; Ribosomal frameshifting; Thiol protease;
KW Transmembrane; Transmembrane helix.
FT CHAIN 1..4445
FT /note="Replicase polyprotein 1a"
FT /id="PRO_0000338027"
FT CHAIN 1..2753
FT /note="Non-structural protein 1"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338028"
FT CHAIN 2754..3131
FT /note="Non-structural protein 2"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338029"
FT CHAIN 3132..3418
FT /note="3C-like serine proteinase"
FT /evidence="ECO:0000250"
FT /id="PRO_0000338031"
FT CHAIN 3419..3677
FT /note="Non-structural protein 4"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338032"
FT CHAIN 3678..3854
FT /note="Non-structural protein 5"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338033"
FT CHAIN 3855..4036
FT /note="Non-structural protein 6"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338034"
FT CHAIN 4037..4121
FT /note="Non-structural protein 7"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338035"
FT CHAIN 4122..4274
FT /note="Non-structural protein 8"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338036"
FT CHAIN 4275..4445
FT /note="Non-structural protein 9"
FT /evidence="ECO:0000255"
FT /id="PRO_0000338037"
FT TRANSMEM 2191..2211
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2219..2239
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2266..2286
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2411..2431
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2521..2541
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2546..2566
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2769..2789
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2937..2957
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 2986..3006
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3022..3042
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3422..3442
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3456..3478
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3486..3506
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3514..3534
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3538..3558
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3573..3593
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 3598..3613
FT /note="Helical"
FT /evidence="ECO:0000255"
FT DOMAIN 1633..1814
FT /note="Macro"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00490"
FT REGION 2183..2565
FT /note="HD1"
FT /evidence="ECO:0000250"
FT REGION 2769..3042
FT /note="HD2"
FT /evidence="ECO:0000250"
FT REGION 3430..3613
FT /note="HD3"
FT /evidence="ECO:0000250"
FT ACT_SITE 3184
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000255"
FT ACT_SITE 3222
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000255"
FT ACT_SITE 3291
FT /note="Charge relay system; for 3C-like serine proteinase
FT activity"
FT /evidence="ECO:0000255"
FT SITE 2753..2754
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 3131..3132
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000250"
FT SITE 3418..3419
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000250"
FT SITE 3677..3678
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 3854..3855
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4036..4037
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4121..4122
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
FT SITE 4274..4275
FT /note="Cleavage; by 3C-like serine proteinase"
FT /evidence="ECO:0000255"
SQ SEQUENCE 4445 AA; 505933 MW; D4F915B6BBF9F9D9 CRC64;
MSKTSRELTN ETELHLCSST LDLISKSQLL AQCLGTPQNL VSLSKMVPSI LESPTLEPRY
TSTHSSSLQS LQLLALNTSS TLYKWTTGSI SKLRGHLERE LCRGLVPLND FIPKGNYVEL
SLMIPSVLTG QGTSTTTTLQ EMCSDMVQSC IKSMETDLLK GVLALKDQTS CQEYFLSANY
QSLIPPQPLV NAMRMSSVVD LSPLILENTR LLLKLSPFHG GTSVSYTSMI REFVDCSRRD
EKCLKRRLTK KQKRQEEGSF DANKVITLGG KMYRYRVVIL KCSDEVDDLI GFDGKVGEFD
YNFENVPHCW RDLVKRRCLI RAKATWNLAG GVDENLDHVY IDESQXDFRC ADGSSDSPSA
CVEDPHLEER IFSRVWLKQT SRFFGTKIQQ VSELFKSIGL PELETTYCGV NPVKVGNKWL
SFRDQGRSRV FFVYTDSNVY LATTRQKVCC DYILTKFKSV KWIGNKPDQC RVVKVLAWLI
SVNKVKNCTR VITPMLTVQG KISHRRVDYL DISVLDSYVS DTAGLNCVQK VKKFLSMYYN
CGADLGLLDN FLTPIECGTK QLVFERCNCP NHQFYVAQFD NHVVLGLGRP TGVVYPEEIP
SCANIYAVGF ATQKRVVEVH YYSEMDRHQL PQDYYYFAYD QEFQHVGGDD YVNHHLDDVE
DQPFPPVLFD DVYDSGDSLD DGGSDLDCFD VGYDFFWPEA PIPVPSPYGY YQGQRLRDLC
VAGGDFGCDC PRCDGTFIYH PFRPRHYHSF DEVGPFIQMC EFTLTYSGQN YNLFYGLEPK
VCLQDLVEAS DKLLQLLVRG QLENISLPND ILACLSSLKL GANIHPFLWP APFFNANGEW
VDIFGGGDFT VFGEDFCLKA KSMVESVYFL VENFFSVDCP IGNLYCNLHL DGDVKKMLWS
TIHMKYIYLA LIHSEKVFNI ILNSRQLSHQ ELVKLVIIGT FDVSIVAPCA CSGDCNHGKV
YNWTNLLSSV YRFVTLDQLV GLSYCEKRSL VLRKVQQYLE VEEGYQRPVQ LLMAPFYGFN
DNAEPDEQPL TGVFHQQVMQ MFDTCVMLDV ICGLKRPRAS VYNLFGVLAD YFRRPFTFRY
YQVAEFSGSE STQVFTDVTS ALTSKDPCSN RPYIYHDYAV CRVVEPRTAA VTTRGAIYPP
EVIEMIRSYL PIEFDVGVMN YVDGNCDFKY CNLEFCLSGR GLVKLDTGEL LDYKTNLFVV
RYKTLPLLYV TSNPIYLSDF SLDNAVCLTG DFKLSFDVEP GSTLFGLYFT NGRCYRDVWE
TLPRFGLGTL SPPKCHSKCE PFENLAEVFF FKRRVQLVPL VNNYTPVFRH RPDIPKVLTV
ELMPYYSSIG YQGFVAPKCV LPGCVATQYC KLRHQLDRCV QVTKLAVAYA FYFKPLNIGS
LYHLDPMRGT SYGKPAVVQF EPVGLIKEVN ILVYQFGKHV AIHYFPECPT YVAYGHYPSH
SVGVWLGYLP SVEECVIAQR NYRVYVPTCF RLSRTGCYHI QQDEDFERTH ITVSYHYARD
FDTKSLTPMF QMFSKIFGKS KQDLICALNS LSEESQSVLT LFCEEFDSAY TLQTISDEVS
FETSTSPELV ACVLAYAIGY ELCLTVKTDG ECESLDVGSS LEQVYVDYDV SKNVWDLSTH
LQDDSSDDLE LPFNQYYEFK VGRASVVLVQ DDFKSVYDFL KSEQGVDYVV NPANNQLKHG
GGIAKVISCM CGPKLTSWSN NYIKQYKKLG VTCAIRSPGF QLGKGVQIIH VVGPKSADSD
VVNKLEASWR SVFQNVKPDT TVLTSMLSTG IFGCSVTDSA TTLLSNLVDL DKDVVVFVVT
NVSDQYIEAL GVVESFQSAH GLPNFGNTCW FNALYQLLKS FAVKEQIVQD LVNCFDDFYE
CPTRQCVEWV CDQLGVVFGE QYDAVEMLVK IFDVFKCNVR VGYDCLARLQ QVALGSCREV
PADAVLMFFG QDKSGHWVAA RKVCGVWYTF DDKVVVKKDP DWSKVVLVLR ERGLFKATDF
ETPRPRRRRV AYRVPRDTIS QDAIMFLEER QFSSGTMLAH SCVESVESFH VEGVQPSPLQ
SVDGLDDVAD LSCDNHVCDN SDLQEPQVVV SQPSEVLTTS MSIECPVLEN SECSVETDLN
PVCEENEQVG ESGIKEQDGV TTSDSQQVFS KSLDPIIKQH EVESVEPQDL PVFSQQPQVM
LSMTWRDVLF QQYLGFKSDL LSLTHVNKFK IVVYLMVLWF VLLYCFSDFS LLSRFCLYVF
LLWLSHVVLV VKKLDLGLVN SGGESYVLRI LSSVKVPNCI AFNCDGVHWL ILKLLFYSFH
FYDFFVKTLV VVFQMPQLRC FTWPLLKLGF ADTFLSHHIL AFPTKQVSQS CLPVFGDERK
YIYVPYWCKE SFRTLVARAK QLTATGRTKT LDNWHYQCCS KTVKPSSCFN VRDFVFDDAC
NNHKHYGFFS ALWFYVVFYS GFVSFWLPLM FCYCALFMCT FKNLPVNITR PIRWTVLQQV
VDDLLSIITK PLFGRPACPP LSAYLTATTA DEAVRASRSL LGRFCTPVGF QQPIMNVENG
VAVSSLGFIN PLMWPLFIVV LLDNRFVWFF NVLSYIMLPV FVIILFYFYL RKICGCVNVK
GVVKNCTRHF QNFSKPLVAA GVHGNRTNFT YQPMQENWCD RHSWYCPKEE HYMTPEMAMF
IKNYYNLATS PMADTIWCDY VKSVPNMTWA NFKFSLFKSN ETVMCGPSSH ADSMLLSWYA
FLHGIRFAVN PSVIDIPSQT QPIYVSSDSD DSLDKGCDVS LRPTKNKGKF KKQSVAYFSA
GPVDLWYYVM LIIALGAIFV FMYSCFMVGQ YVVMPRDKFF GVNPTGYSYV NAQPYLHASP
PVLRNSDGMV LATPLKVPSI SYSVYRLLSG HLYFTKLIVA ENECTPPFGA XRLSHEFTCN
DFTYILPAHL RIFGRYIMLI HPDQLHMLPF EVEHSTHTRL CYVTGTNIVE CLPTFEIISP
YVFVVLVAIF TIVFLFLLRM YIVMYSYFKV FTYVVFKLLF VNTVMVLFVV CLPPLVPGVV
FVLALWLCDS VVFLLYLAVL SLFILPWFYV MLFVLIVGGF VFWWMMKSSD VVHLTPDGLT
FNGTFEQVSK CVFPLNPLIV NRLLLDCRMS HSDLVEKSKL KTTEGKLATE MMKVFMTGET
AYYQPSNFSF QSVFSKVVSP FTLHARPPMP MFRLYVYFNG QCVGTTCTGT GFAIDDSTIV
TAKHLFECDD LKPTHLSVEL SCRSYWCTWK EPNVLSWKFE GENAYISVEN LRDFYGIDFK
YLPFQQIECE FYKRMEAVTI YSIKYGSEFA TQAWQTVNGH FVCCNTEGGD SGAPLVWRDS
VIGVHQGLCD SFKTTLASDS KGVMMTEVKG YHVDPPVYYK PIIMSAAYNK FVADSDVSVG
ECTNYHNFVN EDFFSMHDEL EKVSFGDKMF RYCQSLPRYL EPLHYFHVPS FWQPFKKQSV
SSNVSWVVEN LHFIFSVYFL VCDFVAYWWL DDPFSVVLPL FFVVQLLSTV VLKNVLFWNT
SYLVTLAVTF YVHSEVAESM YLLGLFSDQI VNRVGLILVV SVMCLFVVVR VVVNVKRAIF
VVVVSVLLIV VNVVLGVVQF NSLVAVCMFD IYAVFAALLT PQPVVAIMML ILFDTKCLMS
FAFVVIVLSF RVFKNYKFVR VLHNLCNFDF VLTQLSLFRY RHHNQGNNPS HYEALWLFLK
ELYYGVQDVK YEVFSPQAGT YNVRFLTDMT EQDQLEAVEQ VQRRLQRFSI VQDKNSQRLV
LYSKNVDFLR SQIQHQRVLG ANPFIITTLT PKDIAIDNVE VHNPSQFKPE DLQAHMWFYS
KSPIFVGQVP IPTNVQTAAV LDTTYNCQDL TADEKNNVAA NLQIQNAALT LSLFEECNRF
LESELGDVPT LMWQSEDVVD VKQLEVQIEK LRVVLDGMQL GTSEYKATRK QINILQSQLD
KALAFERKLA KFLEKVDQQQ AITNETAKQL SAFKNLVKQV YESYMSSLKV RVVESNDASC
LLTSTDLPRK LVLMRPITGL DNIKIVEKAN GCEITAFGDT FTTGLGSNLA GLAYSSTQPL
SAYPFIFNLE GIFKQQANIG YKTVECNMSS DNGSVLYKGK IVAVPSEDNP DFVVCGKGYK
LDCGINVLMI PSIVRYITLN LTDHLQRQSL KPRRRLQYKQ QGVRLGGVNL GEHQAFSNEL
ISSVGYTTWV SSTVCTDKSH KHPWFVQIPS SEKDPEWFMH NTQVKNNQWV VDAKPTHWLV
DADTNEQLFA LALTDEEYLK AESILAKWSP ITQDVECWFK DLRGYYTVSG LQPLWPVCPK
KICSLKIVPI FQSQSVAYAD EPTHFLSLPV VNKNFLEAFY ELQEGFPGEK QVAPHISLTM
LKLTEEDVAK VEDILDEMVL PNTYATITNP HMMGQYYVFE VEGLQALHDE VVSVLRQHGI
ACDQTRMWKP HLTIGEIKDG SVFNKFKDFG ITCKLEDCDF VKLGAPKANA RYEFIATLPV
GDLNC