POL_VILVK
ID POL_VILVK Reviewed; 1506 AA.
AC P35956;
DT 01-JUN-1994, integrated into UniProtKB/Swiss-Prot.
DT 28-FEB-2018, sequence version 2.
DT 03-AUG-2022, entry version 145.
DE RecName: Full=Gag-Pol polyprotein;
DE Contains:
DE RecName: Full=Matrix protein p16;
DE Contains:
DE RecName: Full=Capsid protein p25;
DE Contains:
DE RecName: Full=Nucleocapsid protein p14;
DE Contains:
DE RecName: Full=Protease;
DE EC=3.4.23.- {ECO:0000255|PROSITE-ProRule:PRU00275};
DE AltName: Full=Retropepsin;
DE Contains:
DE RecName: Full=Reverse transcriptase/ribonuclease H;
DE Short=RT;
DE EC=2.7.7.49 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=2.7.7.7 {ECO:0000255|PROSITE-ProRule:PRU00405};
DE EC=3.1.26.4 {ECO:0000255|PROSITE-ProRule:PRU00408};
DE AltName: Full=Exoribonuclease H;
DE EC=3.1.13.2;
DE Contains:
DE RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase;
DE Short=dUTPase;
DE EC=3.6.1.23 {ECO:0000305};
DE Contains:
DE RecName: Full=Integrase;
DE Short=IN;
DE EC=2.7.7.- {ECO:0000250|UniProtKB:P03370};
DE EC=3.1.-.- {ECO:0000250|UniProtKB:P03370};
GN Name=pol;
OS Maedi visna virus (strain KV1772) (MVV) (Visna lentivirus).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus.
OX NCBI_TaxID=36374;
OH NCBI_TaxID=9940; Ovis aries (Sheep).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=8382414; DOI=10.1006/viro.1993.1106;
RA Andresson O.S., Elser J.E., Tobin G.J., Greenwood J.D., Gonda M.A.,
RA Georgsson G., Andresdottir V., Benediktsdottir E., Carlsdottir H.M.,
RA Maentylae E.O., Rafnar B., Palsson P.A., Casey J.W., Petursson G.;
RT "Nucleotide sequence and biological properties of a pathogenic proviral
RT molecular clone of neurovirulent visna virus.";
RL Virology 193:89-105(1993).
RN [2]
RP RIBOSOMAL FRAMESHIFTING.
RX PubMed=18495941; DOI=10.1261/rna.1042108;
RA Pennell S., Manktelow E., Flatt A., Kelly G., Smerdon S.J., Brierley I.;
RT "The stimulatory RNA of the Visna-Maedi retrovirus ribosomal frameshifting
RT signal is an unusual pseudoknot with an interstem element.";
RL RNA 14:1366-1377(2008).
RN [3]
RP X-RAY CRYSTALLOGRAPHY (4.9 ANGSTROMS) OF 1228-1506, AND SUBUNIT
RP (INTEGRASE).
RX PubMed=28059770; DOI=10.1126/science.aah7002;
RA Ballandras-Colas A., Maskell D.P., Serrao E., Locke J., Swuec P.,
RA Jonsson S.R., Kotecha A., Cook N.J., Pye V.E., Taylor I.A.,
RA Andresdottir V., Engelman A.N., Costa A., Cherepanov P.;
RT "A supramolecular assembly mediates lentiviral DNA integration.";
RL Science 355:93-95(2017).
CC -!- FUNCTION: [Gag-Pol polyprotein]: Mediates, with Gag polyprotein, the
CC essential events in virion assembly, including binding the plasma
CC membrane, making the protein-protein interactions necessary to create
CC spherical particles, recruiting the viral Env proteins, and packaging
CC the genomic RNA via direct interactions with the RNA packaging
CC sequence. {ECO:0000250|UniProtKB:P04585}.
CC -!- FUNCTION: [Matrix protein p16]: Targets the polyprotein to the plasma
CC membrane. {ECO:0000250|UniProtKB:P12497}.
CC -!- FUNCTION: [Capsid protein p25]: Forms the core that encapsulates the
CC genomic RNA-nucleocapsid complex in the virion.
CC {ECO:0000250|UniProtKB:P04585}.
CC -!- FUNCTION: [Nucleocapsid protein p14]: Encapsulates and protects viral
CC dimeric unspliced genomic RNA (gRNA). Binds these RNAs through its zinc
CC fingers. Acts as a nucleic acid chaperone which is involved in
CC rearrangement of nucleic acid secondary structure during gRNA
CC retrotranscription. Also facilitates template switch leading to
CC recombination. {ECO:0000250|UniProtKB:P04585}.
CC -!- FUNCTION: [Protease]: The aspartyl protease mediates proteolytic
CC cleavages of Gag and Gag-Pol polyproteins during or shortly after the
CC release of the virion from the plasma membrane. Cleavages take place as
CC an ordered, step-wise cascade to yield mature proteins. This process is
CC called maturation. Displays maximal activity during the budding process
CC just prior to particle release from the cell. {ECO:0000255|PROSITE-
CC ProRule:PRU00275}.
CC -!- FUNCTION: [Reverse transcriptase/ribonuclease H]: RT is a
CC multifunctional enzyme that converts the viral dimeric RNA genome into
CC dsDNA in the cytoplasm, shortly after virus entry into the cell. This
CC enzyme displays a DNA polymerase activity that can copy either DNA or
CC RNA templates, and a ribonuclease H (RNase H) activity that cleaves the
CC RNA strand of RNA-DNA heteroduplexes in a partially processive 3' to 5'
CC endonucleasic mode. Conversion of viral genomic RNA into dsDNA requires
CC many steps. A tRNA binds to the primer-binding site (PBS) situated at
CC the 5' end of the viral RNA. RT uses the 3' end of the tRNA primer to
CC perfom a short round of RNA-dependent minus-strand DNA synthesis. The
CC reading proceeds through the U5 region and ends after the repeated (R)
CC region which is present at both ends of viral RNA. The portion of the
CC RNA-DNA heteroduplex is digested by the RNase H, resulting in a ssDNA
CC product attached to the tRNA primer. This ssDNA/tRNA hybridizes with
CC the identical R region situated at the 3' end of viral RNA. This
CC template exchange, known as minus-strand DNA strong stop transfer, can
CC be either intra- or intermolecular. RT uses the 3' end of this newly
CC synthesized short ssDNA to perfom the RNA-dependent minus-strand DNA
CC synthesis of the whole template. RNase H digests the RNA template
CC except for a polypurine tract (PPT) situated at the 5' end of the
CC genome. It is not clear if both polymerase and RNase H activities are
CC simultaneous. RNase H probably can proceed both in a polymerase-
CC dependent (RNA cut into small fragments by the same RT performing DNA
CC synthesis) and a polymerase-independent mode (cleavage of remaining RNA
CC fragments by free RTs). Secondly, RT performs DNA-directed plus-strand
CC DNA synthesis using the PPT that has not been removed by RNase H as
CC primers. PPT and tRNA primers are then removed by RNase H. The 3' and
CC 5' ssDNA PBS regions hybridize to form a circular dsDNA intermediate.
CC Strand displacement synthesis by RT to the PBS and PPT ends produces a
CC blunt ended, linear dsDNA copy of the viral genome that includes long
CC terminal repeats (LTRs) at both ends. {ECO:0000255|PROSITE-
CC ProRule:PRU00405}.
CC -!- FUNCTION: [Integrase]: Catalyzes viral DNA integration into the host
CC chromosome, by performing a series of DNA cutting and joining
CC reactions. {ECO:0000250|UniProtKB:P03370}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3'-end directed exonucleolytic cleavage of viral RNA-DNA
CC hybrid.; EC=3.1.13.2;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endonucleolytic cleavage to 5'-phosphomonoester.; EC=3.1.26.4;
CC Evidence={ECO:0000255|PROSITE-ProRule:PRU00408};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=dUTP + H2O = diphosphate + dUMP + H(+); Xref=Rhea:RHEA:10248,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:33019,
CC ChEBI:CHEBI:61555, ChEBI:CHEBI:246422; EC=3.6.1.23;
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.49; Evidence={ECO:0000255|PROSITE-
CC ProRule:PRU00405};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) =
CC diphosphate + DNA(n+1); Xref=Rhea:RHEA:22508, Rhea:RHEA-COMP:17339,
CC Rhea:RHEA-COMP:17340, ChEBI:CHEBI:33019, ChEBI:CHEBI:61560,
CC ChEBI:CHEBI:173112; EC=2.7.7.7; Evidence={ECO:0000255|PROSITE-
CC ProRule:PRU00405};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000255|PROSITE-ProRule:PRU00405};
CC Note=The RT polymerase active site binds 2 magnesium ions.
CC {ECO:0000255|PROSITE-ProRule:PRU00405};
CC -!- SUBUNIT: [Integrase]: Homotetramer; further associates as a
CC homohexadecamer. {ECO:0000269|PubMed:28059770}.
CC -!- SUBCELLULAR LOCATION: [Matrix protein p16]: Virion {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein p25]: Virion {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Nucleocapsid protein p14]: Virion {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Name=Gag-Pol polyprotein;
CC IsoId=P35956-1; Sequence=Displayed;
CC Name=Gag polyprotein;
CC IsoId=P35955-1; Sequence=External;
CC -!- PTM: [Gag-Pol polyprotein]: Specific enzymatic cleavages by the viral
CC protease yield mature proteins. {ECO:0000305}.
CC -!- MISCELLANEOUS: The reverse transcriptase is an error-prone enzyme that
CC lacks a proof-reading function. High mutations rate is a direct
CC consequence of this characteristic. RT also displays frequent template
CC swiching leading to high recombination rate. Recombination mostly
CC occurs between homologous regions of the two copackaged RNA genomes. If
CC these two RNA molecules derive from different viral strains, reverse
CC transcription will give rise to highly recombinated proviral DNAs.
CC {ECO:0000255|PROSITE-ProRule:PRU00405}.
CC -!- MISCELLANEOUS: [Isoform Gag-Pol polyprotein]: Produced by a -1
CC ribosomal frameshifting between gag and pol.
CC {ECO:0000305|PubMed:18495941}.
CC -!- SIMILARITY: Belongs to the retroviral Pol polyprotein family.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA48359.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=AAB25460.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L06906; AAA48359.1; ALT_SEQ; Genomic_RNA.
DR EMBL; S55323; AAB25460.1; ALT_SEQ; Genomic_DNA.
DR PDB; 3HPG; X-ray; 3.28 A; A/B/C/D/E/F=1228-1445.
DR PDB; 3HPH; X-ray; 2.64 A; A/B/C/D=1228-1445.
DR PDB; 5LLJ; X-ray; 1.78 A; A/B=1444-1501.
DR PDB; 5M0Q; EM; 4.91 A; A/B/C/D/E/F/G/H/I/J/K/L/M/N/O/P=1228-1506.
DR PDB; 5M0R; EM; 8.20 A; A/B/C/D/E/F/G/H/I/J/K/L/M/N/O/P=1228-1506.
DR PDB; 5T3A; X-ray; 2.50 A; A=1285-1506.
DR PDBsum; 3HPG; -.
DR PDBsum; 3HPH; -.
DR PDBsum; 5LLJ; -.
DR PDBsum; 5M0Q; -.
DR PDBsum; 5M0R; -.
DR PDBsum; 5T3A; -.
DR SMR; P35956; -.
DR MEROPS; A02.006; -.
DR PRIDE; P35956; -.
DR EvolutionaryTrace; P35956; -.
DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0004170; F:dUTP diphosphatase activity; IEA:UniProtKB-EC.
DR GO; GO:0004533; F:exoribonuclease H activity; IEA:UniProtKB-EC.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:UniProtKB-EC.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0009117; P:nucleotide metabolic process; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR CDD; cd07557; trimeric_dUTPase; 1.
DR Gene3D; 1.10.10.200; -; 1.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR Gene3D; 2.30.30.10; -; 1.
DR Gene3D; 2.40.70.10; -; 1.
DR Gene3D; 2.70.40.10; -; 1.
DR Gene3D; 3.30.420.10; -; 2.
DR Gene3D; 3.30.70.270; -; 3.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR029054; dUTPase-like.
DR InterPro; IPR036157; dUTPase-like_sf.
DR InterPro; IPR033704; dUTPase_trimeric.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR017856; Integrase-like_N.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR010659; RVT_connect.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF00692; dUTPase; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF02022; Integrase_Zn; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00077; RVP; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF06815; RVT_connect; 1.
DR Pfam; PF00098; zf-CCHC; 2.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF46919; SSF46919; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF50122; SSF50122; 1.
DR SUPFAM; SSF50630; SSF50630; 1.
DR SUPFAM; SSF51283; SSF51283; 1.
DR SUPFAM; SSF53098; SSF53098; 2.
DR SUPFAM; SSF56672; SSF56672; 1.
DR SUPFAM; SSF57756; SSF57756; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
DR PROSITE; PS50876; ZF_INTEGRASE; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Aspartyl protease; Capsid protein; DNA integration;
KW DNA recombination; DNA-binding; Endonuclease; Hydrolase; Magnesium;
KW Metal-binding; Multifunctional enzyme; Nuclease; Nucleotide metabolism;
KW Nucleotidyltransferase; Protease; Repeat; Ribosomal frameshifting;
KW RNA-directed DNA polymerase; Transferase; Viral genome integration; Virion;
KW Virus entry into host cell; Zinc; Zinc-finger.
FT CHAIN 1..1506
FT /note="Gag-Pol polyprotein"
FT /id="PRO_0000443369"
FT CHAIN 1..143
FT /note="Matrix protein p16"
FT /id="PRO_0000443370"
FT CHAIN 144..363
FT /note="Capsid protein p25"
FT /id="PRO_0000443371"
FT CHAIN 364..442
FT /note="Nucleocapsid protein p14"
FT /id="PRO_0000443372"
FT CHAIN 443..540
FT /note="Protease"
FT /id="PRO_0000038869"
FT CHAIN 541..1091
FT /note="Reverse transcriptase/ribonuclease H"
FT /id="PRO_0000038870"
FT CHAIN 1092..1225
FT /note="Deoxyuridine 5'-triphosphate nucleotidohydrolase"
FT /id="PRO_0000038871"
FT CHAIN 1226..1506
FT /note="Integrase"
FT /id="PRO_0000038872"
FT DOMAIN 459..530
FT /note="Peptidase A2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00275"
FT DOMAIN 587..776
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT DOMAIN 971..1093
FT /note="RNase H type-1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT DOMAIN 1270..1430
FT /note="Integrase catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT ZN_FING 385..402
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 404..421
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 1228..1269
FT /note="Integrase-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00450"
FT DNA_BIND 1447..1499
FT /note="Integrase-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00506"
FT ACT_SITE 464
FT /note="Protease; shared with dimeric partner"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00275"
FT BINDING 652
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 727
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 728
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="1"
FT /ligand_note="catalytic; for reverse transcriptase
FT activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00405"
FT BINDING 980
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1012
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1032
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1085
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="2"
FT /ligand_note="catalytic; for RNase H activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00408"
FT BINDING 1237
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00450"
FT BINDING 1241
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00450"
FT BINDING 1265
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00450"
FT BINDING 1268
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00450"
FT BINDING 1291
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1343
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00457"
FT BINDING 1379
FT /ligand="Mg(2+)"
FT /ligand_id="ChEBI:CHEBI:18420"
FT /ligand_label="3"
FT /ligand_note="catalytic; for integrase activity"
FT /evidence="ECO:0000305"
FT SITE 363..364
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250|UniProtKB:P35955"
FT HELIX 1230..1240
FT /evidence="ECO:0007829|PDB:3HPH"
FT HELIX 1244..1250
FT /evidence="ECO:0007829|PDB:3HPH"
FT HELIX 1255..1262
FT /evidence="ECO:0007829|PDB:3HPH"
FT HELIX 1266..1269
FT /evidence="ECO:0007829|PDB:3HPG"
FT STRAND 1287..1295
FT /evidence="ECO:0007829|PDB:5T3A"
FT STRAND 1298..1305
FT /evidence="ECO:0007829|PDB:5T3A"
FT TURN 1306..1308
FT /evidence="ECO:0007829|PDB:5T3A"
FT STRAND 1311..1317
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1321..1335
FT /evidence="ECO:0007829|PDB:5T3A"
FT STRAND 1339..1342
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1346..1349
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1351..1359
FT /evidence="ECO:0007829|PDB:5T3A"
FT STRAND 1363..1367
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1372..1392
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1393..1395
FT /evidence="ECO:0007829|PDB:5T3A"
FT HELIX 1399..1411
FT /evidence="ECO:0007829|PDB:5T3A"
FT TURN 1417..1419
FT /evidence="ECO:0007829|PDB:3HPG"
FT HELIX 1422..1440
FT /evidence="ECO:0007829|PDB:5T3A"
FT STRAND 1448..1451
FT /evidence="ECO:0007829|PDB:5LLJ"
FT STRAND 1463..1467
FT /evidence="ECO:0007829|PDB:5LLJ"
FT STRAND 1473..1480
FT /evidence="ECO:0007829|PDB:5LLJ"
FT TURN 1481..1484
FT /evidence="ECO:0007829|PDB:5LLJ"
FT STRAND 1485..1490
FT /evidence="ECO:0007829|PDB:5LLJ"
FT HELIX 1491..1493
FT /evidence="ECO:0007829|PDB:5LLJ"
FT STRAND 1494..1497
FT /evidence="ECO:0007829|PDB:5LLJ"
SQ SEQUENCE 1506 AA; 171957 MW; A51F80B2AEB4952A CRC64;
MAKQGSKEKK GYPELKEVIK ATCKIRVGPG KETLTEGNCL WALKTIDFIF EDLKTEPWTI
TKMYTVWDRL KGLTPEETSK REFASLQATL ACIMCSQMGM KPETVQAAKG IISMKEGLHE
NKEAKGEKVE QLYPNLEKHR EVYPIVNLQA GGRSWKAVES VVFQQLQTVA MQHGLVSEDF
ERQLAYYATT WTSKDILEVL AMMPGNRAQK ELIQGKLNEE AERWVRQNPP GPNVLTVDQI
MGVGQTNQQA SQANMDQARQ ICLQWVITAL RSVRHMSHRP GNPMLVKQKN TESYEDFIAR
LLEAIDAEPV TDPIKTYLKV TLSYTNASTD CQKQMDRTLG TRVQQATVEE KMQACRDVGS
EGFKMQLLAQ ALRPQGKAGQ KGVNQKCYNC GKPGHLARQC RQGIICHHCG KRGHMQKDCR
QKKQQGKQQE GATCGAVRAP YVVTEAPPKI EIKVGTRWKK LLVDTGADKT IVTSHDMSGI
PKGRIILQGI GGIIEGEKWE QVHLQYKDKI IRGTIVVLAT SPVEVLGRDN MRELGIGLIM
ANLEEKKIPS TRVRLKEGCK GPHIAQWPLT QEKLEGLKEI VDRLEKEGKV GRAPPHWTCN
TPIFCIKKKS GKWRMLIDFR ELNKQTEDLA EAQLGLPHPG GLQRKKHVTI LDIGDAYFTI
PLYEPYRQYT CFTMLSPNNL GPCVRYYWKV LPQGWKLSPA VYQFTMQKIL RGWIEEHPMI
QFGIYMDDIY IGSDLGLEEH RGIVNELASY IAQYGFMLPE DKRQEGYPAK WLGFELHPEK
WKFQKHTLPE ITEGPITLNK LQKLVGDLVW RQSLIGKSIP NILKLMEGDR ALQSERYIES
IHVREWEACR QKLKEMEGNY YDEEKDIYGQ LDWGNKAIEY IVFQEKGKPL WVNVVHSIKN
LSQAQQIIKA AQKLTQEVII RTGKIPWILL PGREEDWILE LQMGNINWMP SFWSCYKGSV
RWKKRNVIAE VVPGPTYYTD GGKKNGRGSL GYITSTGEKF RIHEEGTNQQ LELRAIEEAC
KQGPEKMNIV TDSRYAYEFM LRNWDEEVIR NPIQARIMEL VHNKEKIGVH WVPGHKGIPQ
NEEIDRYISE IFLAKEGRGI LQKRAEDAGY DLICPQEISI PAGQVKRIAI DLKINLKKDQ
WAMIGTKSSF ANKGVFVQGG IIDSGYQGTI QVVIYNSNNK EVVIPQGRKF AQLILMPLIH
EELEPWGETR KTERGEQGFG STGMYWIENI PLAEEEHNKW HQDAVSLHLE FGIPRTAAED
IVQQCDVCQE NKMPSTLRGS NKRGIDHWQV DYTHYEDKII LVWVETNSGL IYAERVKGET
GQEFRVQTMK WYAMFAPKSL QSDNGPAFVA ESTQLLMKYL GIEHTTGIPW NPQSQALVER
THQTLKNTLE KLIPMFNAFE SALAGTLITL NIKRKGGLGT SPMDIFIFNK EQQRIQQQSK
SKQEKIRFCY YRTRKRGHPG EWQGPTQVLW GGDGAIVVKD RGTDRYLVIA NKDVKFIPPP
KEIQKE