GAG_HTL32
ID GAG_HTL32 Reviewed; 422 AA.
AC Q0R5R4;
DT 14-NOV-2006, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 3.
DT 23-FEB-2022, entry version 62.
DE RecName: Full=Gag polyprotein;
DE AltName: Full=Pr53Gag;
DE Contains:
DE RecName: Full=Matrix protein p19;
DE Short=MA;
DE Contains:
DE RecName: Full=Capsid protein p24;
DE Short=CA;
DE Contains:
DE RecName: Full=Nucleocapsid protein p15-gag;
DE Short=NC-gag;
GN Name=gag;
OS Human T-cell leukemia virus 3 (strain 2026ND) (HTLV-3).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Deltaretrovirus.
OX NCBI_TaxID=402036;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=16840323; DOI=10.1128/jvi.00690-06;
RA Switzer W.M., Qari S.H., Wolfe N.D., Burke D.S., Folks T.M., Heneine W.;
RT "Ancient origin and molecular features of the novel human T-lymphotropic
RT virus type 3 revealed by complete genome analysis.";
RL J. Virol. 80:7427-7438(2006).
CC -!- FUNCTION: Matrix protein p19 targets Gag, Gag-Pro and Gag-Pro-Pol
CC polyproteins to the plasma membrane via a multipartite membrane binding
CC signal, that includes its myristoylated N-terminus. Also mediates
CC nuclear localization of the preintegration complex (By similarity).
CC {ECO:0000250}.
CC -!- FUNCTION: Capsid protein p24 forms the conical core of the virus that
CC encapsulates the genomic RNA-nucleocapsid complex. {ECO:0000250}.
CC -!- FUNCTION: Nucleocapsid protein p15 is involved in the packaging and
CC encapsidation of two copies of the genome. {ECO:0000250}.
CC -!- SUBUNIT: Interacts with human TSG101. This interaction is essential for
CC budding and release of viral particles (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Matrix protein p19]: Virion {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein p24]: Virion {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Nucleocapsid protein p15-gag]: Virion
CC {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=3;
CC Comment=This strategy of translation probably allows the virus to
CC modulate the quantity of each viral protein.;
CC Name=Gag polyprotein;
CC IsoId=Q0R5R4-1; Sequence=Displayed;
CC Name=Gag-Pro polyprotein;
CC IsoId=Q0R5R3-1; Sequence=External;
CC Name=Gag-Pol polyprotein;
CC IsoId=Q0R5R2-1; Sequence=External;
CC -!- DOMAIN: Late-budding domains (L domains) are short sequence motifs
CC essential for viral particle release. They can occur individually or in
CC close proximity within structural proteins. They interacts with sorting
CC cellular proteins of the multivesicular body (MVB) pathway. Most of
CC these proteins are class E vacuolar protein sorting factors belonging
CC to ESCRT-I, ESCRT-II or ESCRT-III complexes. Matrix protein p19
CC contains two L domains: a PTAP/PSAP motif which interacts with the UEV
CC domain of TSG101, and a PPXY motif which binds to the WW domains of
CC HECT (homologous to E6-AP C-terminus) E3 ubiquitin ligases (By
CC similarity). {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages by the viral protease yield mature
CC proteins. The polyprotein is cleaved during and after budding, this
CC process is termed maturation (By similarity). {ECO:0000250}.
CC -!- MISCELLANEOUS: [Isoform Gag polyprotein]: Produced by conventional
CC translation.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ093792; AAZ77657.1; -; Genomic_DNA.
DR SMR; Q0R5R4; -.
DR Proteomes; UP000008029; Genome.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0016032; P:viral process; IEA:InterPro.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR InterPro; IPR003139; D_retro_matrix.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF02228; Gag_p19; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF57756; SSF57756; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 3: Inferred from homology;
KW Capsid protein; Host-virus interaction; Lipoprotein; Metal-binding;
KW Myristate; Phosphoprotein; Reference proteome; Repeat;
KW Ribosomal frameshifting; Viral nucleoprotein; Virion; Zinc; Zinc-finger.
FT INIT_MET 1
FT /note="Removed; by host"
FT /evidence="ECO:0000250"
FT CHAIN 2..123
FT /note="Matrix protein p19"
FT /evidence="ECO:0000250"
FT /id="PRO_0000259779"
FT CHAIN 124..337
FT /note="Capsid protein p24"
FT /evidence="ECO:0000250"
FT /id="PRO_0000259780"
FT CHAIN 338..422
FT /note="Nucleocapsid protein p15-gag"
FT /evidence="ECO:0000250"
FT /id="PRO_0000259781"
FT ZN_FING 349..366
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 372..389
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT REGION 95..116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 98..101
FT /note="PTAP/PSAP motif"
FT MOTIF 109..112
FT /note="PPXY motif"
FT SITE 123..124
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250"
FT SITE 337..338
FT /note="Cleavage; by viral protease"
FT /evidence="ECO:0000250"
FT LIPID 2
FT /note="N-myristoyl glycine; by host"
FT /evidence="ECO:0000250"
SQ SEQUENCE 422 AA; 46935 MW; FA2644F00249CA32 CRC64;
MGKTYSSPIN PIPKAPKGLA IHHWLNFLQA AYRLQPGPSE FDFHQLRKFL KLAIKTPVWL
NPINYSVLAG LIPKNYPGRV HEIVAILIQE TPAREAPPSA PLAEDPQKPP PYPEQAQEAS
QCLPILHPHG APAAHRPWQM KDLQAIKQEV SSSAPGSPQF MQTIRLAVQQ FDPTAKDLHD
LLQYLCSSLV ASLHHQQLET LIAQAETQGI TGYNPLAGPL RIQANNPNQQ GLRKEYQNLW
LSAFSALPGN TKDPTWAAIL QGPEEPFGSF VERLNVALDN GLPEGTPKDP ILRSLAYSNA
NKECQKLLQA RGQTNSPLGE MLRACQTWTP RDKNKILMVQ PKKTPPPNQP CFRCGQVGHW
SRDCKQPRPP PGPCPVCQDP THWKRDCPQL KTDTRDSEDL LLDLPCEAPN VRERKNSSGG
ED