GAG_MMTVG
ID GAG_MMTVG Reviewed; 353 AA.
AC P03343;
DT 21-JUL-1986, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 3.
DT 23-FEB-2022, entry version 100.
DE RecName: Full=Gag polyprotein;
DE Contains:
DE RecName: Full=Matrix protein p10;
DE Contains:
DE RecName: Full=Phosphorylated protein pp21;
DE Contains:
DE RecName: Full=Protein p3;
DE Contains:
DE RecName: Full=Protein p8;
DE Contains:
DE RecName: Full=Protein n;
DE Contains:
DE RecName: Full=Capsid protein p27;
DE Flags: Fragment;
GN Name=gag;
OS Mouse mammary tumor virus (strain GR) (MMTV).
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Betaretrovirus.
OX NCBI_TaxID=11760;
OH NCBI_TaxID=10090; Mus musculus (Mouse).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=6314267; DOI=10.1093/nar/11.20.6943;
RA Fasel N., Buetti E., Firzlaff J., Pearson K., Diggelmann H.;
RT "Nucleotide sequence of the 5' noncoding region and part of the gag gene of
RT mouse mammary tumor virus; identification of the 5' splicing site for
RT subgenomic mRNAs.";
RL Nucleic Acids Res. 11:6943-6955(1983).
CC -!- FUNCTION: [Matrix protein p10]: Matrix protein.
CC -!- FUNCTION: Nucleocapsid protein p14: Nucleocapsid protein. Binds to
CC single-stranded DNA.
CC -!- FUNCTION: [Capsid protein p27]: Capsid protein.
CC -!- SUBCELLULAR LOCATION: [Matrix protein p10]: Virion {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: [Capsid protein p27]: Virion {ECO:0000305}.
CC -!- DOMAIN: Late-budding domains (L domains) are short sequence motifs
CC essential for viral particle budding. They recruit proteins of the host
CC ESCRT machinery (Endosomal Sorting Complex Required for Transport) or
CC ESCRT-associated proteins. Gag-p27 contains one L domain: a PTAP/PSAP
CC motif, which interacts with the UEV domain of TSG101 (Potential).
CC {ECO:0000305}.
CC -!- PTM: p10 is myristoylated. {ECO:0000250}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X00018; CAA24916.1; -; Genomic_RNA.
DR PIR; A03941; FOMVGR.
DR SMR; P03343; -.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000166; F:nucleotide binding; IEA:UniProtKB-KW.
DR GO; GO:0039660; F:structural constituent of virion; IEA:UniProtKB-KW.
DR GO; GO:0039702; P:viral budding via host ESCRT complex; IEA:UniProtKB-KW.
DR Gene3D; 1.10.150.490; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR InterPro; IPR003322; B_retro_matrix.
DR InterPro; IPR038124; B_retro_matrix_sf.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR Pfam; PF02337; Gag_p10; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
PE 3: Inferred from homology;
KW Capsid protein; DNA-binding; Host-virus interaction; Lipoprotein;
KW Myristate; Nucleotide-binding; Phosphoprotein; Viral budding;
KW Viral budding via the host ESCRT complexes; Viral matrix protein;
KW Viral nucleoprotein; Viral release from host cell; Virion.
FT INIT_MET 1
FT /note="Removed; by host"
FT /evidence="ECO:0000250"
FT CHAIN 2..99
FT /note="Matrix protein p10"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040934"
FT CHAIN 100..195
FT /note="Phosphorylated protein pp21"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040935"
FT CHAIN 196..228
FT /note="Protein p3"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040936"
FT CHAIN 229..252
FT /note="Protein p8"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040937"
FT CHAIN 253..269
FT /note="Protein n"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040938"
FT CHAIN 270..>353
FT /note="Capsid protein p27"
FT /evidence="ECO:0000250"
FT /id="PRO_0000040939"
FT REGION 151..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 305..308
FT /note="PTAP/PSAP motif"
FT /evidence="ECO:0000255"
FT LIPID 2
FT /note="N-myristoyl glycine; by host"
FT /evidence="ECO:0000250"
FT NON_TER 353
SQ SEQUENCE 353 AA; 40375 MW; E20C6832DBDA08CB CRC64;
MGVSGSKGQK LFVSVLQRLL SERGLHVKES STIEFYQFLI KVSLGFPKKE DLNLQDWKRV
GREMKKYAAD DGTDSIPKQA YPIWLQLREI LTEQSDLVLL SAEAKSVTEE ELEEGLTGLL
SASSQEKTYG TRGTAYAEID TEADKLSEHI YDEPYEEKEK ADKNEEKDHV RKVKKIVQRK
ENSEHKRKEK DQKAFLATDW NDDDLSPEDW DNLEEQAAHY HDDDELILPV KRKVVKKKPL
ALRRKPLPPV GFAGAMAEAR EKGDLTFTFP VVFMGESDDD DTPVWEPLPL KTLKELQSAV
RTMGPSAPYT LEVVDMVASQ WLTPSDWHQT ARATLSPGDY VLWRTEYEEK SKE