VE1_HPV28
ID VE1_HPV28 Reviewed; 662 AA.
AC P50763;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-1996, sequence version 1.
DT 03-AUG-2022, entry version 84.
DE RecName: Full=Replication protein E1 {ECO:0000255|HAMAP-Rule:MF_04000};
DE EC=3.6.4.12 {ECO:0000255|HAMAP-Rule:MF_04000};
DE AltName: Full=ATP-dependent helicase E1 {ECO:0000255|HAMAP-Rule:MF_04000};
GN Name=E1 {ECO:0000255|HAMAP-Rule:MF_04000};
OS Human papillomavirus 28.
OC Viruses; Monodnaviria; Shotokuvirae; Cossaviricota; Papovaviricetes;
OC Zurhausenvirales; Papillomaviridae; Firstpapillomavirinae;
OC Alphapapillomavirus.
OX NCBI_TaxID=37111;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RA Delius H.;
RL Submitted (OCT-1995) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: ATP-dependent DNA helicase required for initiation of viral
CC DNA replication. It forms a complex with the viral E2 protein. The E1-
CC E2 complex binds to the replication origin which contains binding sites
CC for both proteins. During the initial step, a dimer of E1 interacts
CC with a dimer of protein E2 leading to a complex that binds the viral
CC origin of replication with high specificity. Then, a second dimer of E1
CC displaces the E2 dimer in an ATP-dependent manner to form the E1
CC tetramer. Following this, two E1 monomers are added to each half of the
CC site, which results in the formation of two E1 trimers on the viral
CC ori. Subsequently, two hexamers will be created. The double hexamer
CC acts as a bi-directional helicase machinery and unwinds the viral DNA
CC and then recruits the host DNA polymerase to start replication.
CC {ECO:0000255|HAMAP-Rule:MF_04000}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.12;
CC Evidence={ECO:0000255|HAMAP-Rule:MF_04000};
CC -!- SUBUNIT: Can form hexamers. Interacts with E2 protein; this interaction
CC increases E1 DNA binding specificity. Interacts with host DNA
CC polymerase subunit POLA2. Interacts with host single stranded DNA-
CC binding protein RPA1. Interacts with host TOP1; this interaction
CC stimulates the enzymatic activity of TOP1. {ECO:0000255|HAMAP-
CC Rule:MF_04000}.
CC -!- SUBCELLULAR LOCATION: Host nucleus {ECO:0000255|HAMAP-Rule:MF_04000}.
CC -!- PTM: Phosphorylated. {ECO:0000255|HAMAP-Rule:MF_04000}.
CC -!- PTM: Sumoylated. {ECO:0000255|HAMAP-Rule:MF_04000}.
CC -!- SIMILARITY: Belongs to the papillomaviridae E1 protein family.
CC {ECO:0000255|HAMAP-Rule:MF_04000}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U31783; AAA79424.1; -; Genomic_DNA.
DR SMR; P50763; -.
DR Proteomes; UP000009158; Genome.
DR GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:RHEA.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003678; F:DNA helicase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0006260; P:DNA replication; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.10.510; -; 1.
DR Gene3D; 3.40.50.300; -; 1.
DR HAMAP; MF_04000; PPV_E1; 1.
DR InterPro; IPR001177; DNA_helicase_E1_C_Papillomavir.
DR InterPro; IPR014000; DNA_helicase_E1_N_Papillomavir.
DR InterPro; IPR014015; Helicase_SF3_DNA-vir.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR016393; Rep_E1_papillomaV.
DR InterPro; IPR037102; Znf_lg_T-Ag_D1_dom_sf.
DR Pfam; PF00519; PPV_E1_C; 1.
DR Pfam; PF00524; PPV_E1_N; 1.
DR PIRSF; PIRSF003383; Rep_E1_papillomaV; 1.
DR SUPFAM; SSF52540; SSF52540; 1.
DR PROSITE; PS51206; SF3_HELICASE_1; 1.
PE 3: Inferred from homology;
KW ATP-binding; DNA replication; DNA-binding; Early protein; Helicase;
KW Host nucleus; Hydrolase; Isopeptide bond; Nucleotide-binding;
KW Phosphoprotein; Ubl conjugation.
FT CHAIN 1..662
FT /note="Replication protein E1"
FT /id="PRO_0000133126"
FT DOMAIN 464..614
FT /note="SF3 helicase"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT REGION 28..60
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 160..197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 199..365
FT /note="DNA-binding region"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT REGION 637..662
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 88..90
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT MOTIF 110..119
FT /note="Nuclear export signal"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT COMPBIAS 38..54
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT BINDING 490..497
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT MOD_RES 94
FT /note="Phosphoserine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT MOD_RES 98
FT /note="Phosphoserine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT MOD_RES 111
FT /note="Phosphoserine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
FT CROSSLNK 571
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04000"
SQ SEQUENCE 662 AA; 74389 MW; CBF68D40684DB2EE CRC64;
MDDTSGTEGD ECSELERAGG WFMVEAIVDR RTGDKPSSDE DEDEDADEGE DFVDFIDDRP
VGDGQEVAQE LLLQQAAADD DVAVQAVKRK FAPSPYFSPV CMQPSIENEL SPRLDAIKLG
RQSGTAKRRL FQLPDSGYGQ TQVDTESGPL QVQDICETGT QDGRQDADEG SGRNVGGNGG
QEEERAGGDG EESQTQGVQT DKAACGVLAI LRASNQKATL LGKFKEQFGL GFNELIRHFK
SSKTVCLDWV VCVFGVYCTL AEGIKTLIQP QCDYAHIQVL SCQWGMTVLM LVRYKRAKNR
ETVAKGLSTL LNVPESHMLI EPPKLRSGPA ALYWYKTAMS NCSDVYGETP EWIVRQTMVG
HALEEAQFSL SEMVQYAYDH DITDESMLAF EYALLADTDA NAAAFLSSNC QAKYVKDACT
MCRHYKRGEQ ARMNMSEWIW FRGDKVQGDG DWKPIVQFLR YHDVEFIPFL CAFKTFLQGI
PKKSCLVFYG PADTGKSYFC MSLLRFLGGV VISYANSNSH FWLQPLADAK IGLLDDATSQ
CWCYIDTYLR NALDGNQVCI DRKHRALLQL KCPPLLITTN INPLEDDRWK YLRSRVQLFT
FKNKFPLTTQ GEPLYTLNDQ NWKCFFRRLW ARLSLTDPDD EEENGNPSEP FRCVPGQNAR
TL