GAG_EIAV9
ID GAG_EIAV9 Reviewed; 486 AA.
AC P69730; P03351;
DT 21-JUL-1986, integrated into UniProtKB/Swiss-Prot.
DT 21-JUL-1986, sequence version 1.
DT 23-FEB-2022, entry version 85.
DE RecName: Full=Gag polyprotein;
DE Contains:
DE RecName: Full=Matrix protein p15;
DE Short=MA;
DE Contains:
DE RecName: Full=Capsid protein p26;
DE Short=CA;
DE Contains:
DE RecName: Full=p1;
DE Contains:
DE RecName: Full=Nucleocapsid protein p11;
DE Short=NC;
DE Contains:
DE RecName: Full=p9;
GN Name=gag;
OS Equine infectious anemia virus (isolate 1369) (EIAV).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus.
OX NCBI_TaxID=11670;
OH NCBI_TaxID=9793; Equus asinus (Donkey) (Equus africanus asinus).
OH NCBI_TaxID=9796; Equus caballus (Horse).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=3035786; DOI=10.1016/0042-6822(87)90202-9;
RA Kawakami T., Sherman L., Dahlberg J., Gazit A., Yaniv A., Tronick S.R.,
RA Aaronson S.A.;
RT "Nucleotide sequence analysis of equine infectious anemia virus proviral
RT DNA.";
RL Virology 158:300-312(1987).
RN [2]
RP INTERACTION OF P9 WITH HUMAN PDCD6IP/AIP1.
RX PubMed=14505569; DOI=10.1016/s0092-8674(03)00653-6;
RA Strack B., Calistri A., Craig S., Popova E., Goettlinger H.G.;
RT "AIP1/ALIX is a binding partner for HIV-1 p6 and EIAV p9 functioning in
RT virus budding.";
RL Cell 114:689-699(2003).
CC -!- FUNCTION: Matrix protein p15 forms the outer shell of the core of the
CC virus, lining the inner surface of the viral membrane. {ECO:0000250}.
CC -!- FUNCTION: Capsid protein p26 forms the conical core of the virus that
CC encapsulates the genomic RNA-nucleocapsid complex. {ECO:0000250}.
CC -!- FUNCTION: Nucleocapsid protein p11 encapsulates and protects viral
CC dimeric unspliced (genomic) RNA. Binds these RNAs through its zinc
CC fingers (By similarity). {ECO:0000250}.
CC -!- FUNCTION: p9 plays a role in budding of the assembled particle by
CC interacting with PDCD6IP/AIP1.
CC -!- SUBUNIT: [p9]: Interacts with human PDCD6IP/AIP1.
CC {ECO:0000269|PubMed:14505569}.
CC -!- INTERACTION:
CC P69730; Q8WUM4: PDCD6IP; Xeno; NbExp=2; IntAct=EBI-1220941, EBI-310624;
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000305}.
CC -!- DOMAIN: Late-budding domains (L domains) are short sequence motifs
CC essential for viral particle budding. They recruit proteins of the host
CC ESCRT machinery (Endosomal Sorting Complex Required for Transport) or
CC ESCRT-associated proteins. p9 contains one L domain: a LYPX(n)L motif,
CC which interacts with PDCD6IP/AIP1.
CC -!- SIMILARITY: Belongs to the equine lentivirus group gag polyprotein
CC family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M16575; AAB59861.1; -; Genomic_RNA.
DR PIR; A03949; FOLJEV.
DR RefSeq; NP_056901.1; NC_001450.1.
DR PDB; 6T61; EM; 3.70 A; A/B/C/D/E/F/G/H/I/J/K/L/M/N/O/P/Q/R=1-486.
DR PDB; 6T63; EM; 3.80 A; A/B/C/D/E/F/G/H/I/J/K/L/M/N/O/P/Q/R=1-486.
DR PDB; 6T64; EM; 3.70 A; A/B/C=1-486.
DR PDBsum; 6T61; -.
DR PDBsum; 6T63; -.
DR PDBsum; 6T64; -.
DR BMRB; P69730; -.
DR SMR; P69730; -.
DR IntAct; P69730; 1.
DR GeneID; 1489984; -.
DR KEGG; vg:1489984; -.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0039660; F:structural constituent of virion; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039702; P:viral budding via host ESCRT complex; IEA:UniProtKB-KW.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.150.90; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR InterPro; IPR014834; Gag_p15.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR012344; Matrix_HIV/RSV_N.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF08723; Gag_p15; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00098; zf-CCHC; 2.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF57756; SSF57756; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 1: Evidence at protein level;
KW 3D-structure; Capsid protein; Disulfide bond; Host-virus interaction;
KW Metal-binding; Repeat; Viral budding;
KW Viral budding via the host ESCRT complexes; Viral matrix protein;
KW Viral nucleoprotein; Viral release from host cell; Virion; Zinc;
KW Zinc-finger.
FT CHAIN 1..124
FT /note="Matrix protein p15"
FT /evidence="ECO:0000250"
FT /id="PRO_0000038772"
FT CHAIN 125..354
FT /note="Capsid protein p26"
FT /evidence="ECO:0000250"
FT /id="PRO_0000038773"
FT PEPTIDE 355..359
FT /note="p1"
FT /evidence="ECO:0000255"
FT /id="PRO_0000272313"
FT CHAIN 360..435
FT /note="Nucleocapsid protein p11"
FT /evidence="ECO:0000250"
FT /id="PRO_0000038774"
FT CHAIN 436..486
FT /note="p9"
FT /evidence="ECO:0000250"
FT /id="PRO_0000038775"
FT ZN_FING 381..398
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 400..417
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT REGION 414..460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 457..461
FT /note="LYPX(n)L motif"
FT COMPBIAS 417..460
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 322..342
FT /evidence="ECO:0000250"
SQ SEQUENCE 486 AA; 54809 MW; 97137FA5933D1DDA CRC64;
MGDPLTWSKA LKKLEKVTVQ GSQKLTTGNC NWALSLVDLF HDTNFVKEKD WQLRDVIPLL
EDVTQTLSGQ EREAFERTWW AISAVKMGLQ INNVVDGKAS FQLLRAKYEK KTANKKQSEP
SEEYPIMIDG AGNRNFRPLT PRGYTTWVNT IQTNGLLNEA SQNLFGILSV DCTSEEMNAF
LDVVPGQAGQ KQILLDAIDK IADDWDNRHP LPNAPLVAPP QGPIPMTARF IRGLGVPRER
QMEPAFDQFR QTYRQWIIEA MSEGIKVMIG KPKAQNIRQG AKEPYPEFVD RLLSQIKSEG
HPQEISKFLT DTLTIQNANE ECRNAMRHLR PEDTLEEKMY ACRDIGTTKQ KMMLLAKALQ
TGLAGPFKGG ALKGGPLKAA QTCYNCGKPG HLSSQCRAPK VCFKCKQPGH FSKQCRSVPK
NGKQGAQGRP QKQTFPIQQK SQHNKSVVQE TPQTQNLYPD LSEIKKEYNV KEKDQVEDLN
LDSLWE