DPOL_PYRHO
ID DPOL_PYRHO Reviewed; 1235 AA.
AC O59610;
DT 15-DEC-1998, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1998, sequence version 1.
DT 25-MAY-2022, entry version 139.
DE RecName: Full=DNA polymerase;
DE EC=2.7.7.7;
DE Contains:
DE RecName: Full=Pho pol intein;
DE AltName: Full=Pho Pol I intein;
GN Name=pol; OrderedLocusNames=PH1947; ORFNames=PHBT047;
OS Pyrococcus horikoshii (strain ATCC 700860 / DSM 12428 / JCM 9974 / NBRC
OS 100139 / OT-3).
OC Archaea; Euryarchaeota; Thermococci; Thermococcales; Thermococcaceae;
OC Pyrococcus.
OX NCBI_TaxID=70601;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700860 / DSM 12428 / JCM 9974 / NBRC 100139 / OT-3;
RX PubMed=9679194; DOI=10.1093/dnares/5.2.55;
RA Kawarabayasi Y., Sawada M., Horikawa H., Haikawa Y., Hino Y., Yamamoto S.,
RA Sekine M., Baba S., Kosugi H., Hosoyama A., Nagai Y., Sakai M., Ogura K.,
RA Otsuka R., Nakazawa H., Takamiya M., Ohfuku Y., Funahashi T., Tanaka T.,
RA Kudoh Y., Yamazaki J., Kushida N., Oguchi A., Aoki K., Yoshizawa T.,
RA Nakamura Y., Robb F.T., Horikoshi K., Masuchi Y., Shizuya H., Kikuchi H.;
RT "Complete sequence and gene organization of the genome of a hyper-
RT thermophilic archaebacterium, Pyrococcus horikoshii OT3.";
RL DNA Res. 5:55-76(1998).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.7;
CC -!- PTM: This protein undergoes a protein self splicing that involves a
CC post-translational excision of the intervening region (intein) followed
CC by peptide ligation. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the DNA polymerase type-B family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BA000001; BAA31074.1; -; Genomic_DNA.
DR PIR; C71210; C71210.
DR RefSeq; WP_010886013.1; NC_000961.1.
DR AlphaFoldDB; O59610; -.
DR SMR; O59610; -.
DR STRING; 70601.3258391; -.
DR EnsemblBacteria; BAA31074; BAA31074; BAA31074.
DR GeneID; 1442795; -.
DR KEGG; pho:PH1947; -.
DR eggNOG; arCOG00328; Archaea.
DR eggNOG; arCOG03145; Archaea.
DR OMA; YAHNSYY; -.
DR OrthoDB; 35869at2157; -.
DR BRENDA; 2.7.7.7; 5244.
DR Proteomes; UP000000752; Chromosome.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0000166; F:nucleotide binding; IEA:InterPro.
DR GO; GO:0006260; P:DNA replication; IEA:UniProtKB-KW.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR Gene3D; 1.10.132.60; -; 1.
DR Gene3D; 3.10.28.10; -; 1.
DR Gene3D; 3.30.420.10; -; 1.
DR Gene3D; 3.90.1600.10; -; 2.
DR InterPro; IPR006172; DNA-dir_DNA_pol_B.
DR InterPro; IPR017964; DNA-dir_DNA_pol_B_CS.
DR InterPro; IPR006133; DNA-dir_DNA_pol_B_exonuc.
DR InterPro; IPR006134; DNA-dir_DNA_pol_B_multi_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR042087; DNA_pol_B_thumb.
DR InterPro; IPR023211; DNA_pol_palm_dom_sf.
DR InterPro; IPR003586; Hint_dom_C.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR006142; INTEIN.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR041005; PI-TkoII_IV.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR Pfam; PF00136; DNA_pol_B; 2.
DR Pfam; PF03104; DNA_pol_B_exo1; 2.
DR Pfam; PF18714; PI-TkoII_IV; 1.
DR PRINTS; PR00106; DNAPOLB.
DR PRINTS; PR00379; INTEIN.
DR SMART; SM00305; HintC; 1.
DR SMART; SM00306; HintN; 1.
DR SMART; SM00486; POLBc; 1.
DR SUPFAM; SSF51294; SSF51294; 1.
DR SUPFAM; SSF53098; SSF53098; 1.
DR SUPFAM; SSF55608; SSF55608; 1.
DR SUPFAM; SSF56672; SSF56672; 2.
DR TIGRFAMs; TIGR01443; intein_Cterm; 1.
DR TIGRFAMs; TIGR01445; intein_Nterm; 1.
DR PROSITE; PS00116; DNA_POLYMERASE_B; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
PE 3: Inferred from homology;
KW Autocatalytic cleavage; DNA replication; DNA-binding;
KW DNA-directed DNA polymerase; Nucleotidyltransferase; Protein splicing;
KW Transferase.
FT CHAIN 1..492
FT /note="DNA polymerase, 1st part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000007322"
FT CHAIN 493..952
FT /note="Pho pol intein"
FT /evidence="ECO:0000255"
FT /id="PRO_0000007323"
FT CHAIN 953..1235
FT /note="DNA polymerase, 2nd part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000007324"
FT DOMAIN 773..887
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00273"
SQ SEQUENCE 1235 AA; 143087 MW; 73CC7AA14873CCE4 CRC64;
MILDADYITE DGKPIIRIFK KENGEFKVEY DRNFRPYIYA LLRDDSAIDE IKKITAQRHG
KVVRIVETEK IQRKFLGRPI EVWKLYLEHP QDVPAIRDKI REHPAVVDIF EYDIPFAKRY
LIDKGLTPME GNEKLTFLAV DIETLYHEGE EFGKGPVIMI SYADEEGAKV ITWKKIDLPY
VEVVSSEREM IKRLIRVIKE KDPDVIITYN GDNFDFPYLL KRAEKLGIKL LLGRDNSEPK
MQKMGDSLAV EIKGRIHFDL FPVIRRTINL PTYTLEAVYE AIFGKPKEKV YADEIAKAWE
TGEGLERVAK YSMEDAKVTY ELGREFFPME AQLARLVGQP VWDVSRSSTG NLVEWFLLRK
AYERNELAPN KPDEKEYERR LRESYEGGYV KEPEKGLWEG IVSLDFRSLY PSIIITHNVS
PDTLNREGCE EYDVAPKVGH RFCKDFPGFI PSLLGQLLEE RQKIKKRMKE SKDPVEKKLL
DYRQRAIKIL ANSILPDEWL PIVENEKVRF VKIGDFIDRE IEENAERVKR DGETEILEVK
DLKALSFNRE TKKSELKKVK ALIRHRYSGK VYSIKLKSGR RIKITSGHSL FSVKNGKLVK
VRGDELKPGD LVVVPGRLKL PESKQVLNLV ELLLKLPEEE TSNIVMMIPV KGRKNFFKGM
LKTLYWIFGE GERPRTAGRY LKHLERLGYV KLKRRGCEVL DWESLKRYRK LYETLIKNLK
YNGNSRAYMV EFNSLRDVVS LMPIEELKEW IIGEPRGPKI GTFIDVDDSF AKLLGYYISS
GDVEKDRVKF HSKDQNVLED IAKLAEKLFG KVRRGRGYIE VSGKISHAIF RVLAEGKRIP
EFIFTSPMDI KVAFLKGLNG NAEELTFSTK SELLVNQLIL LLNSIGVSDI KIEHEKGVYR
VYINKKESSN GDIVLDSVES IEVEKYEGYV YDLSVEDNEN FLVGFGLLYA HNSYYGYYGY
AKARWYCKEC AESVTAWGRQ YIDLVRRELE ARGFKVLYID TDGLYATIPG VKDWEEVKRR
ALEFVDYINS KLPGVLELEY EGFYARGFFV TKKKYALIDE EGKIVTRGLE IVRRDWSEIA
KETQARVLEA ILKHGNVEEA VKIVKDVTEK LTNYEVPPEK LVIYEQITRP INEYKAIGPH
VAVAKRLMAR GIKVKPGMVI GYIVLRGDGP ISKRAISIEE FDPRKHKYDA EYYIENQVLP
AVERILKAFG YKREDLRWQK TKQVGLGAWI KVKKS