POL_HTL1C
ID POL_HTL1C Reviewed; 1462 AA.
AC P14078; O56228;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 3.
DT 03-AUG-2022, entry version 160.
DE RecName: Full=Gag-Pro-Pol polyprotein;
DE AltName: Full=Pr160Gag-Pro-Pol;
DE Contains:
DE RecName: Full=Matrix protein p19;
DE Short=MA;
DE Contains:
DE RecName: Full=Capsid protein p24;
DE Short=CA;
DE Contains:
DE RecName: Full=Nucleocapsid protein p15-pro;
DE Short=NC';
DE Short=NC-pro;
DE Contains:
DE RecName: Full=Protease;
DE Short=PR;
DE EC=3.4.23.- {ECO:0000255|PROSITE-ProRule:PRU00275};
DE Contains:
DE RecName: Full=p1;
DE Contains:
DE RecName: Full=Reverse transcriptase/ribonuclease H, p49 subunit;
DE Short=p49 RT;
DE EC=2.7.7.49 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=2.7.7.7 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=3.1.26.4 {ECO:0000255|PROSITE-ProRule:PRU00408};
DE Contains:
DE RecName: Full=Reverse transcriptase/ribonuclease H, p62 subunit;
DE Short=p62 RT;
DE EC=2.7.7.49 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=2.7.7.7 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=3.1.26.4 {ECO:0000255|PROSITE-ProRule:PRU00408};
DE Contains:
DE RecName: Full=Integrase;
DE Short=IN;
DE EC=2.7.7.- {ECO:0000250|UniProtKB:P03363};
DE EC=3.1.-.- {ECO:0000250|UniProtKB:P03363};
GN Name=gag-pro-pol;
OS Human T-cell leukemia virus 1 (isolate Caribbea HS-35 subtype A) (HTLV-1).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Deltaretrovirus.
OX NCBI_TaxID=11927;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=2899128; DOI=10.1099/0022-1317-69-7-1695;
RA Malik K.T.A., Even J., Karpas A.;
RT "Molecular cloning and complete nucleotide sequence of an adult T cell
RT leukaemia virus/human T cell leukaemia virus type I (ATLV/HTLV-I) isolate
RT of Caribbean origin: relationship to other members of the ATLV/HTLV-I
RT subgroup.";
RL J. Gen. Virol. 69:1695-1710(1988).
CC -!- FUNCTION: [Gag-Pro-Pol polyprotein]: The matrix domain targets Gag,
CC Gag-Pro and Gag-Pro-Pol polyproteins to the plasma membrane via a
CC multipartite membrane-binding signal that includes its myristoylated
CC N-terminus. {ECO:0000250|UniProtKB:P03345}.
CC -!- FUNCTION: [Matrix protein p19]: Matrix protein.
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- FUNCTION: [Capsid protein p24]: Forms the spherical core of the virus
CC that encapsulates the genomic RNA-nucleocapsid complex.
CC {ECO:0000250|UniProtKB:P03362}.
CC -!- FUNCTION: [Nucleocapsid protein p15-pro]: Binds strongly to viral
CC nucleic acids and promotes their aggregation. Also destabilizes
CC nucleic acid duplexes via highly structured zinc-binding motifs.
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- FUNCTION: [Protease]: The aspartyl protease mediates proteolytic
CC cleavages of Gag and Gag-Pol polyproteins during or shortly after the
CC release of the virion from the plasma membrane. Cleavages take place as
CC an ordered, step-wise cascade to yield mature proteins. This process is
CC called maturation. Displays maximal activity during the budding process
CC just prior to particle release from the cell (Potential). Cleaves the
CC translation initiation factor eIF4G leading to the inhibition of host
CC cap-dependent translation (By similarity).
CC {ECO:0000250|UniProtKB:P03362, ECO:0000255|PROSITE-ProRule:PRU00275}.
CC -!- FUNCTION: [Reverse transcriptase/ribonuclease H, p49 subunit]: RT is a
CC multifunctional enzyme that converts the viral RNA genome into dsDNA in
CC the cytoplasm, shortly after virus entry into the cell. This enzyme
CC displays a DNA polymerase activity that can copy either DNA or RNA
CC templates, and a ribonuclease H (RNase H) activity that cleaves the RNA
CC strand of RNA-DNA heteroduplexes in a partially processive 3' to 5'
CC endonucleolytic mode. Conversion of viral genomic RNA into dsDNA requires
CC many steps. A tRNA-Pro binds to the primer-binding site (PBS) situated
CC at the 5'-end of the viral RNA. RT uses the 3' end of the tRNA primer
CC to perform a short round of RNA-dependent minus-strand DNA synthesis.
CC The reading proceeds through the U5 region and ends after the repeated
CC (R) region, which is present at both ends of viral RNA. The RNA portion
CC of the RNA-DNA heteroduplex is digested by RNase H, resulting in a
CC ssDNA product attached to the tRNA primer. This ssDNA/tRNA hybridizes
CC with the identical R region situated at the 3' end of viral RNA. This
CC template exchange, known as minus-strand DNA strong stop transfer, can
CC be either intra- or intermolecular. RT uses the 3' end of this newly
CC synthesized short ssDNA to perform the RNA-dependent minus-strand DNA
CC synthesis of the whole template. RNase H digests the RNA template
CC except for a polypurine tract (PPT) situated near the 3' end of the
CC genome. It is not clear if both polymerase and RNase H activities are
CC simultaneous. RNase H probably can proceed both in a polymerase-
CC dependent (RNA cut into small fragments by the same RT performing DNA
CC synthesis) and a polymerase-independent mode (cleavage of remaining RNA
CC fragments by free RTs). Secondly, RT performs DNA-directed plus-strand
CC DNA synthesis using the PPT that has not been removed by RNase H as
CC primer. PPT and tRNA primers are then removed by RNase H. The 3' and 5'
CC ssDNA PBS regions hybridize to form a circular dsDNA intermediate.
CC Strand displacement synthesis by RT to the PBS and PPT ends produces a
CC blunt ended, linear dsDNA copy of the viral genome that includes long
CC terminal repeats (LTRs) at both ends (By similarity). {ECO:0000250}.
CC -!- FUNCTION: [Reverse transcriptase/ribonuclease H, p62 subunit]: RT is a
CC multifunctional enzyme that converts the viral RNA genome into dsDNA in
CC the cytoplasm, shortly after virus entry into the cell. This enzyme
CC displays a DNA polymerase activity that can copy either DNA or RNA
CC templates, and a ribonuclease H (RNase H) activity that cleaves the RNA
CC strand of RNA-DNA heteroduplexes in a partially processive 3' to 5'
CC endonucleolytic mode. Conversion of viral genomic RNA into dsDNA requires
CC many steps. A tRNA-Pro binds to the primer-binding site (PBS) situated
CC at the 5'-end of the viral RNA. RT uses the 3' end of the tRNA primer
CC to perform a short round of RNA-dependent minus-strand DNA synthesis.
CC The reading proceeds through the U5 region and ends after the repeated
CC (R) region, which is present at both ends of viral RNA. The RNA portion
CC of the RNA-DNA heteroduplex is digested by RNase H, resulting in a
CC ssDNA product attached to the tRNA primer. This ssDNA/tRNA hybridizes
CC with the identical R region situated at the 3' end of viral RNA. This
CC template exchange, known as minus-strand DNA strong stop transfer, can
CC be either intra- or intermolecular. RT uses the 3' end of this newly
CC synthesized short ssDNA to perform the RNA-dependent minus-strand DNA
CC synthesis of the whole template. RNase H digests the RNA template
CC except for a polypurine tract (PPT) situated near the 3' end of the
CC genome. It is not clear if both polymerase and RNase H activities are
CC simultaneous. RNase H probably can proceed both in a polymerase-
CC dependent (RNA cut into small fragments by the same RT performing DNA
CC synthesis) and a polymerase-independent mode (cleavage of remaining RNA
CC fragments by free RTs). Secondly, RT performs DNA-directed plus-strand
CC DNA synthesis using the PPT that has not been removed by RNase H as
CC primer. PPT and tRNA primers are then removed by RNase H. The 3' and 5'
CC ssDNA PBS regions hybridize to form a circular dsDNA intermediate.
CC Strand displacement synthesis by RT to the PBS and PPT ends produces a
CC blunt ended, linear dsDNA copy of the viral genome that includes long
CC terminal repeats (LTRs) at both ends (By similarity). {ECO:0000250}.
CC -!- FUNCTION: [Integrase]: Catalyzes viral DNA integration into the host
CC chromosome, by performing a series of DNA cutting and joining
CC reactions. {ECO:0000250|UniProtKB:P03362}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endonucleolytic cleavage to 5'-phosphomonoester.; EC=3.1.26.4;
CC Evidence={ECO:0000255|PROSITE-ProRule:PRU00408};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.49; Evidence={ECO:0000255|PROSITE-
CC ProRule:PRU00405};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.7; Evidence={ECO:0000255|PROSITE-
CC ProRule:PRU00405};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000255|PROSITE-ProRule:PRU00405};
CC Note=The RT polymerase active site binds 2 magnesium ions.
CC {ECO:0000255|PROSITE-ProRule:PRU00405};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420; Evidence={ECO:0000250};
CC Note=Binds 2 magnesium ions for ribonuclease H (RNase H) activity.
CC {ECO:0000250};
CC -!- SUBUNIT: [Gag-Pro-Pol polyprotein]: Homodimer; the homodimers are part
CC of the immature particles. Interacts with human TSG101 and NEDD4; these
CC interactions are essential for budding and release of viral particles.
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- SUBUNIT: [Matrix protein p19]: Homodimer; further assembles as
CC homohexamers. {ECO:0000250|UniProtKB:P03345}.
CC -!- SUBCELLULAR LOCATION: [Matrix protein p19]: Virion
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein p24]: Virion
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- SUBCELLULAR LOCATION: [Nucleocapsid protein p15-pro]: Virion
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=3;
CC Comment=This strategy of translation probably allows the virus to
CC modulate the quantity of each viral protein. {ECO:0000305};
CC Name=Gag-Pol polyprotein;
CC IsoId=P14078-1; Sequence=Displayed;
CC Name=Gag-Pro polyprotein;
CC IsoId=P14074-1; Sequence=External;
CC Name=Gag polyprotein;
CC IsoId=P14076-1; Sequence=External;
CC -!- DOMAIN: Gag polyprotein: Late-budding domains (L domains) are short
CC sequence motifs essential for viral particle release. They can occur
CC individually or in close proximity within structural proteins. They
CC interact with cellular sorting proteins of the multivesicular body
CC (MVB) pathway. Most of these proteins are class E vacuolar protein
CC sorting factors belonging to ESCRT-I, ESCRT-II or ESCRT-III complexes.
CC Matrix protein p19 contains two L domains: a PTAP/PSAP motif which
CC interacts with the UEV domain of TSG101, and a PPXY motif which binds
CC to the WW domains of the ubiquitin ligase NEDD4.
CC {ECO:0000250|UniProtKB:P03345}.
CC -!- DOMAIN: [Capsid protein p24]: The capsid protein N-terminus seems to be
CC involved in Gag-Gag interactions. {ECO:0000250|UniProtKB:P03362}.
CC -!- PTM: [Matrix protein p19]: Phosphorylation of the matrix protein p19 by
CC MAPK1 seems to play a role in budding. {ECO:0000250|UniProtKB:P03345}.
CC -!- PTM: [Gag-Pro-Pol polyprotein]: Myristoylated. Myristoylation of the
CC matrix (MA) domain mediates the transport and binding of Gag
CC polyproteins to the host plasma membrane and is required for the
CC assembly of viral particles. {ECO:0000250|UniProtKB:P03345}.
CC -!- PTM: [Gag-Pro-Pol polyprotein]: Specific enzymatic cleavages by the
CC viral protease yield mature proteins. The polyprotein is cleaved during
CC and after budding; this process is termed maturation. The protease is
CC autoproteolytically processed at its N- and C-termini.
CC {ECO:0000250|UniProtKB:P03362}.
CC -!- MISCELLANEOUS: Reverse transcriptase/ribonuclease H: The reverse
CC transcriptase is an error-prone enzyme that lacks a proof-reading
CC function. A high mutation rate is a direct consequence of this
CC characteristic. RT also displays frequent template switching, leading to
CC a high recombination rate. Recombination mostly occurs between homologous
CC regions of the two copackaged RNA genomes. If these two RNA molecules
CC derive from different viral strains, reverse transcription will give
CC rise to highly recombinant proviral DNAs. {ECO:0000255|PROSITE-
CC ProRule:PRU00405}.
CC -!- MISCELLANEOUS: HTLV-1 lineages are divided into four clades, A
CC (Cosmopolitan), B (Central African group), C (Melanesian group) and D
CC (New Central African group). {ECO:0000305}.
CC -!- MISCELLANEOUS: [Isoform Gag-Pol polyprotein]: Produced by -1 ribosomal
CC frameshifting at the gag-pol genes boundary. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAA02931.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D13784; BAA02931.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AF033817; AAC82581.1; -; Genomic_DNA.
DR PIR; C28136; GNLJCN.
DR RefSeq; NP_057860.1; NC_001436.1.
DR PDB; 4ZNY; X-ray; 2.40 A; B=121-130.
DR PDB; 6VOY; EM; 3.70 A; A/B/C/D=1168-1462.
DR PDBsum; 4ZNY; -.
DR PDBsum; 6VOY; -.
DR BMRB; P14078; -.
DR SMR; P14078; -.
DR ELM; P14078; -.
DR MEROPS; A02.012; -.
DR PRIDE; P14078; -.
DR GeneID; 1724740; -.
DR KEGG; vg:1724740; -.
DR Proteomes; UP000001061; Genome.
DR Proteomes; UP000110593; Genome.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:UniProtKB-EC.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039657; P:suppression by virus of host gene expression; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR Gene3D; 2.40.70.10; -; 1.
DR Gene3D; 3.30.420.10; -; 2.
DR Gene3D; 3.30.70.270; -; 2.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR003139; D_retro_matrix.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF02228; Gag_p19; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00552; IN_DBD_C; 1.
DR Pfam; PF02022; Integrase_Zn; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00077; RVP; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF50122; SSF50122; 1.
DR SUPFAM; SSF50630; SSF50630; 1.
DR SUPFAM; SSF53098; SSF53098; 1.
DR SUPFAM; SSF56672; SSF56672; 1.
DR SUPFAM; SSF57756; SSF57756; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Aspartyl protease; Capsid protein; DNA integration;
KW DNA recombination; DNA-binding; Endonuclease;
KW Eukaryotic host gene expression shutoff by virus;
KW Eukaryotic host translation shutoff by virus;
KW Host gene expression shutoff by virus; Host-virus interaction; Hydrolase;
KW Lipoprotein; Magnesium; Metal-binding; Multifunctional enzyme; Myristate;
KW Nuclease; Nucleotidyltransferase; Phosphoprotein; Protease;
KW Reference proteome; Repeat; Ribosomal frameshifting;
KW RNA-directed DNA polymerase; Transferase; Viral genome integration;
KW Viral nucleoprotein; Virion; Virus entry into host cell; Zinc; Zinc-finger.
FT INIT_MET 1
FT /note="Removed; by host"
FT /evidence="ECO:0000255"
FT CHAIN 2..1462
FT /note="Gag-Pro-Pol polyprotein"
FT /id="PRO_0000259940"
FT CHAIN 2..130
FT /note="Matrix protein p19"
FT /id="PRO_0000259941"
FT CHAIN 131..344
FT /note="Capsid protein p24"
FT /id="PRO_0000259942"
FT CHAIN 345..449
FT /note="Nucleocapsid protein p15-pro"
FT /id="PRO_0000259943"
FT CHAIN 450..574
FT /note="Protease"
FT /id="PRO_0000259944"
FT PEPTIDE 575..582
FT /note="p1"
FT /id="PRO_0000259945"
FT CHAIN 583..1167
FT /note="Reverse transcriptase/ribonuclease H, p62 subunit"
FT /id="PRO_0000038875"
FT CHAIN 583..1021
FT /note="Reverse transcriptase/ribonuclease H, p49 subunit"
FT /id="PRO_0000442548"
FT CHAIN 1168..1462
FT /note="Integrase"
FT /id="PRO_0000038876"
FT DOMAIN 476..554
FT /note="Peptidase A2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00275"
FT DOMAIN 614..804
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT DOMAIN 1031..1165
FT /note="RNase H type-1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT DOMAIN 1219..1388
FT /note="Integrase catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT ZN_FING 355..372
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 378..395
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT DNA_BIND 1393..1443
FT /note="Integrase-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00506"
FT REGION 93..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 118..121
FT /note="PPXY motif"
FT /evidence="ECO:0000250|UniProtKB:P03345"
FT MOTIF 124..127
FT /note="PTAP/PSAP motif"
FT /evidence="ECO:0000250|UniProtKB:P03345"
FT COMPBIAS 95..126
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 481
FT /note="Protease; shared with dimeric partner"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00275"
FT BINDING 680
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 755
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 756
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 1040
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1074
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1096
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1157
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1230
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1287
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT SITE 130..131
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 344..345
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 449..450
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 574..575
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 582..583
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 1021..1022
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT SITE 1167..1168
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P03362"
FT MOD_RES 105
FT /note="Phosphoserine; by host MAPK1"
FT /evidence="ECO:0000250|UniProtKB:P03345"
FT LIPID 2
FT /note="N-myristoyl glycine; by host"
FT /evidence="ECO:0000255"
SQ SEQUENCE 1462 AA; 162686 MW; 89F03B47B8BA7805 CRC64;
MGQIFSRSAS PIPRPPRGLA AHHWLNFLQA AYRLEPGPSS YDFHQLKKFL KIALETPVWI
CPINYSLLAS LLPKGYPGRV NEILHILIQT QAQIPSRPAP PPPSSSTHDP PDSDPQIPPP
YVEPTAPQVL PVMHPHGAPP NHRPWQMKDL QAIKQEVSQA APGSPQFMQT IRLAVQQFDP
TAKDLQDLLQ YLCSSLVASL HHQQLDSLIS EAETRGITGY NPLAGPLRVQ ANNPQQQGLR
REYQQLWLAA FAALPGSAKD PSWASILQGL EEPYHAFVER LNIALDNGLP EGTPKDPILR
SLAYSNANKE CQKLLQARGH TNSPLGDMLR ACQAWTPKDK TKVLVVQPKK PPPNQPCFRC
GKAGHWSRDC TQPRPPPGPC PLCQDPTHWK RDCPRLKPTI PEPEPEEDAL LLDLPADIPH
PKNLHRGGGL TSPPTLQQVL PNQDPTSILP VIPLDPARRP VIKAQIDTQT SHPKTIEALL
DTGADMTVLP IALFSSNTPL KNTSVLGAGG QTQDHFKLTS LPVLIRLPFR TTPIVLTSCL
VDTKNNWAII GRDALQQCQG VLYLPEAKRP PVILPIQAPA VLGLEHLPRP PEISQFPLNP
ERLQALQHLV RKALEAGHIE PYTGPGNNPV FPVKKANGTW RFIHDLRATN SLTIDLSSSS
PGPPDLSSLP TTLAHLQTID LKDAFFQIPL PKQFQPYFAF TVPQQCNYGP GTRYAWRVLP
QGFKNSPTLF EMQLAHILQP IRQAFPQCTI LQYMDDILLA SPSHADLQLL SEATMASLIS
HGLPVSENKT QQTPGTIKFL GQIISPNHLT YDAVPKVPIR SRWALPELQA LLGEIQWVSK
GTPTLRQPLH SLYCALQRHT DPRDQIYLNP SQVQSLVQLR QALSQNCRSR LVQTLPLLGA
IMLTLTGTTT VVFQSKQQWP LVWLHAPLPH TSQCPWGQLL ASAVLLLDKY TLQSYGLLCQ
TIHHNISTQT FNQFIQTSDH PSVPILLHHS HRFKNLGAQT GELWNTFLKT TAPLAPVKAL
MPVFTLSPVI INTAPCLFSD GSTSQAAYIL WDKHILSQRS FPLPPPHKSA QRAELLGLLH
GLSSARSWRC LNIFLDSKYL YHYLRTLALG TFQGRSSQAP FQALLPRLLS RKVVYLHHVR
SHTNLPDPIS RLNALTDALL ITPVLQLSPA DLHSFTHCGQ TALTLQGATT TEASNILRSC
HACRKNNPQH QMPQGHIRRG LLPNHIWQGD ITHFKYKNTL YRLHVWVDTF SGAISATQKR
KETSSEAISS LLQAIAYLGK PSYINTDNGP AYISQDFLNM CTSLAIRHTT HVPYNPTSSG
LVERSNGILK TLLYKYFTDK PDLPMDNALS IALWTINHLN VLTNCHKTRW QLHHSPRLQP
IPETHSLSNK QTHWYYFKLP GLNSRQWKGP QEALQEAAGA ALIPVSASSA QWIPWRLLKR
AACPRPVGGP ADPKEKDHQH HG
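
The FT CHAIN, PEPTIDE and SITE coordinates in this entry can be cross-checked against each other: every annotated protease cleavage site should fall on the boundary where one mature product ends. The following is a minimal Python sketch of that check, not part of the UniProt record. The coordinates are copied by hand from the feature table above; the names `CHAINS`, `CLEAVAGE_SITES`, `chain_length` and `internal_chain_ends` are hypothetical helpers introduced only for illustration.

```python
# Mature product coordinates copied from the FT CHAIN/PEPTIDE lines
# (1-based, inclusive). Shortened labels are used for the two RT subunits.
CHAINS = {
    "Matrix protein p19": (2, 130),
    "Capsid protein p24": (131, 344),
    "Nucleocapsid protein p15-pro": (345, 449),
    "Protease": (450, 574),
    "p1": (575, 582),
    "RT/RNase H, p62 subunit": (583, 1167),
    "RT/RNase H, p49 subunit": (583, 1021),
    "Integrase": (1168, 1462),
}

# First residue of each FT SITE "N..N+1" cleavage annotation.
CLEAVAGE_SITES = [130, 344, 449, 574, 582, 1021, 1167]

# Last residue of the polyprotein (from the ID and SQ lines).
POLYPROTEIN_END = 1462


def chain_length(start, end):
    """Length in residues of a 1-based, inclusive range."""
    return end - start + 1


def internal_chain_ends(chains, polyprotein_end):
    """Sorted C-terminal positions of chains that end before the polyprotein does."""
    return sorted({end for _, end in chains.values() if end < polyprotein_end})


if __name__ == "__main__":
    for name, (start, end) in CHAINS.items():
        print(f"{name}: {start}..{end} ({chain_length(start, end)} aa)")
    consistent = internal_chain_ends(CHAINS, POLYPROTEIN_END) == CLEAVAGE_SITES
    print("Cleavage sites match chain boundaries:", consistent)
```

Because the p49 subunit shares its N-terminus with the p62 subunit (both start at residue 583), it has to be left out when summing chain lengths to compare against the 1461 residues that remain after removal of the initiator methionine.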