GAG_HTL32

ID   GAG_HTL32               Reviewed;         422 AA.
AC   Q0R5R4;
DT   14-NOV-2006, integrated into UniProtKB/Swiss-Prot.
DT   23-JAN-2007, sequence version 3.
DT   23-FEB-2022, entry version 62.
DE   RecName: Full=Gag polyprotein;
DE   AltName: Full=Pr53Gag;
DE   Contains:
DE     RecName: Full=Matrix protein p19;
DE              Short=MA;
DE   Contains:
DE     RecName: Full=Capsid protein p24;
DE              Short=CA;
DE   Contains:
DE     RecName: Full=Nucleocapsid protein p15-gag;
DE              Short=NC-gag;
GN   Name=gag;
OS   Human T-cell leukemia virus 3 (strain 2026ND) (HTLV-3).
OC   Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC   Ortervirales; Retroviridae; Orthoretrovirinae; Deltaretrovirus.
OX   NCBI_TaxID=402036;
OH   NCBI_TaxID=9606; Homo sapiens (Human).
RN   [1]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX   PubMed=16840323; DOI=10.1128/jvi.00690-06;
RA   Switzer W.M., Qari S.H., Wolfe N.D., Burke D.S., Folks T.M., Heneine W.;
RT   "Ancient origin and molecular features of the novel human T-lymphotropic
RT   virus type 3 revealed by complete genome analysis.";
RL   J. Virol. 80:7427-7438(2006).
CC   -!- FUNCTION: Matrix protein p19 targets Gag, Gag-Pro and Gag-Pro-Pol
CC       polyproteins to the plasma membrane via a multipartite membrane binding
CC       signal, that includes its myristoylated N-terminus. Also mediates
CC       nuclear localization of the preintegration complex (By similarity).
CC       {ECO:0000250}.
CC   -!- FUNCTION: Capsid protein p24 forms the conical core of the virus that
CC       encapsulates the genomic RNA-nucleocapsid complex. {ECO:0000250}.
CC   -!- FUNCTION: Nucleocapsid protein p15 is involved in the packaging and
CC       encapsidation of two copies of the genome. {ECO:0000250}.
CC   -!- SUBUNIT: Interacts with human TSG101. This interaction is essential for
CC       budding and release of viral particles (By similarity). {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: [Matrix protein p19]: Virion {ECO:0000305}.
CC   -!- SUBCELLULAR LOCATION: [Capsid protein p24]: Virion {ECO:0000305}.
CC   -!- SUBCELLULAR LOCATION: [Nucleocapsid protein p15-gag]: Virion
CC       {ECO:0000305}.
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Ribosomal frameshifting; Named isoforms=3;
CC         Comment=This strategy of translation probably allows the virus to
CC         modulate the quantity of each viral protein.;
CC       Name=Gag polyprotein;
CC         IsoId=Q0R5R4-1; Sequence=Displayed;
CC       Name=Gag-Pro polyprotein;
CC         IsoId=Q0R5R3-1; Sequence=External;
CC       Name=Gag-Pol polyprotein;
CC         IsoId=Q0R5R2-1; Sequence=External;
CC   -!- DOMAIN: Late-budding domains (L domains) are short sequence motifs
CC       essential for viral particle release. They can occur individually or in
CC       close proximity within structural proteins. They interacts with sorting
CC       cellular proteins of the multivesicular body (MVB) pathway. Most of
CC       these proteins are class E vacuolar protein sorting factors belonging
CC       to ESCRT-I, ESCRT-II or ESCRT-III complexes. Matrix protein p19
CC       contains two L domains: a PTAP/PSAP motif which interacts with the UEV
CC       domain of TSG101, and a PPXY motif which binds to the WW domains of
CC       HECT (homologous to E6-AP C-terminus) E3 ubiquitin ligases (By
CC       similarity). {ECO:0000250}.
CC   -!- PTM: Specific enzymatic cleavages by the viral protease yield mature
CC       proteins. The polyprotein is cleaved during and after budding, this
CC       process is termed maturation (By similarity). {ECO:0000250}.
CC   -!- MISCELLANEOUS: [Isoform Gag polyprotein]: Produced by conventional
CC       translation.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DQ093792; AAZ77657.1; -; Genomic_DNA.
DR   SMR; Q0R5R4; -.
DR   Proteomes; UP000008029; Genome.
DR   GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0016032; P:viral process; IEA:InterPro.
DR   Gene3D; 1.10.1200.30; -; 1.
DR   Gene3D; 1.10.375.10; -; 1.
DR   InterPro; IPR003139; D_retro_matrix.
DR   InterPro; IPR045345; Gag_p24_C.
DR   InterPro; IPR000721; Gag_p24_N.
DR   InterPro; IPR008916; Retrov_capsid_C.
DR   InterPro; IPR008919; Retrov_capsid_N.
DR   InterPro; IPR010999; Retrovr_matrix.
DR   InterPro; IPR001878; Znf_CCHC.
DR   InterPro; IPR036875; Znf_CCHC_sf.
DR   Pfam; PF02228; Gag_p19; 1.
DR   Pfam; PF00607; Gag_p24; 1.
DR   Pfam; PF19317; Gag_p24_C; 1.
DR   Pfam; PF00098; zf-CCHC; 1.
DR   SMART; SM00343; ZnF_C2HC; 2.
DR   SUPFAM; SSF47836; SSF47836; 1.
DR   SUPFAM; SSF47943; SSF47943; 1.
DR   SUPFAM; SSF57756; SSF57756; 1.
DR   PROSITE; PS50158; ZF_CCHC; 1.
PE   3: Inferred from homology;
KW   Capsid protein; Host-virus interaction; Lipoprotein; Metal-binding;
KW   Myristate; Phosphoprotein; Reference proteome; Repeat;
KW   Ribosomal frameshifting; Viral nucleoprotein; Virion; Zinc; Zinc-finger.
FT   INIT_MET        1
FT                   /note="Removed; by host"
FT                   /evidence="ECO:0000250"
FT   CHAIN           2..123
FT                   /note="Matrix protein p19"
FT                   /evidence="ECO:0000250"
FT                   /id="PRO_0000259779"
FT   CHAIN           124..337
FT                   /note="Capsid protein p24"
FT                   /evidence="ECO:0000250"
FT                   /id="PRO_0000259780"
FT   CHAIN           338..422
FT                   /note="Nucleocapsid protein p15-gag"
FT                   /evidence="ECO:0000250"
FT                   /id="PRO_0000259781"
FT   ZN_FING         349..366
FT                   /note="CCHC-type 1"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT   ZN_FING         372..389
FT                   /note="CCHC-type 2"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT   REGION          95..116
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          388..422
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOTIF           98..101
FT                   /note="PTAP/PSAP motif"
FT   MOTIF           109..112
FT                   /note="PPXY motif"
FT   SITE            123..124
FT                   /note="Cleavage; by viral protease"
FT                   /evidence="ECO:0000250"
FT   SITE            337..338
FT                   /note="Cleavage; by viral protease"
FT                   /evidence="ECO:0000250"
FT   LIPID           2
FT                   /note="N-myristoyl glycine; by host"
FT                   /evidence="ECO:0000250"
SQ   SEQUENCE   422 AA;  46935 MW;  FA2644F00249CA32 CRC64;
     MGKTYSSPIN PIPKAPKGLA IHHWLNFLQA AYRLQPGPSE FDFHQLRKFL KLAIKTPVWL
     NPINYSVLAG LIPKNYPGRV HEIVAILIQE TPAREAPPSA PLAEDPQKPP PYPEQAQEAS
     QCLPILHPHG APAAHRPWQM KDLQAIKQEV SSSAPGSPQF MQTIRLAVQQ FDPTAKDLHD
     LLQYLCSSLV ASLHHQQLET LIAQAETQGI TGYNPLAGPL RIQANNPNQQ GLRKEYQNLW
     LSAFSALPGN TKDPTWAAIL QGPEEPFGSF VERLNVALDN GLPEGTPKDP ILRSLAYSNA
     NKECQKLLQA RGQTNSPLGE MLRACQTWTP RDKNKILMVQ PKKTPPPNQP CFRCGQVGHW
     SRDCKQPRPP PGPCPVCQDP THWKRDCPQL KTDTRDSEDL LLDLPCEAPN VRERKNSSGG
     ED