GAK9_HUMAN
ID GAK9_HUMAN Reviewed; 666 AA.
AC P63126; Q9UKH4;
DT 13-SEP-2004, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 2.
DT 25-MAY-2022, entry version 109.
DE RecName: Full=Endogenous retrovirus group K member 9 Gag polyprotein;
DE AltName: Full=HERV-K(C6) Gag protein;
DE AltName: Full=HERV-K109 Gag protein;
DE AltName: Full=HERV-K_6q14.1 provirus ancestral Gag polyprotein;
DE Short=Gag polyprotein;
GN Name=ERVK-9;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=10469592; DOI=10.1016/s0960-9822(99)80390-x;
RA Barbulescu M., Turner G., Seaman M.I., Deinard A.S., Kidd K.K., Lenz J.;
RT "Many human endogenous retrovirus K (HERV-K) proviruses are unique to
RT humans.";
RL Curr. Biol. 9:861-868(1999).
RN [2]
RP CHARACTERIZATION.
RX PubMed=7983737; DOI=10.1128/jvi.69.1.414-421.1995;
RA Sauter M., Schommer S., Kremmer E., Remberger K., Doelken G., Lemm I.,
RA Buck M., Best B., Neumann-Haefelin D., Mueller-Lantzsch N.;
RT "Human endogenous retrovirus K10: expression of Gag protein and detection
RT of antibodies in patients with seminomas.";
RL J. Virol. 69:414-421(1995).
CC -!- FUNCTION: The products of the Gag polyproteins of infectious
CC retroviruses perform highly complex orchestrated tasks during the
CC assembly, budding, maturation, and infection stages of the viral
CC replication cycle. During viral assembly, the proteins form membrane
CC associations and self-associations that ultimately result in budding of
CC an immature virion from the infected cell. Gag precursors also function
CC during viral assembly to selectively bind and package two plus strands
CC of genomic RNA. Endogenous Gag proteins may have kept, lost or modified
CC their original function during evolution.
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor. Note=Cytoplasmic
CC membrane (in a transfection system). {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=2;
CC Comment=This protein is synthesized as a Gag polypeptide and as a
CC Gag-Pro-Pol polyprotein. The latter is the precursor of the Pro and
CC Pol proteins. It is thought, by similarity with type-B retroviruses,
CC to be generated by -1 frameshifts occurring at the Gag-Pro and
CC Pro-Pol gene boundaries.;
CC Name=2;
CC IsoId=P63126-1; Sequence=Displayed;
CC Name=1;
CC IsoId=P63128-1; Sequence=External;
CC -!- DOMAIN: HERV-K Gag polyprotein contains regions homologous to the
CC matrix (MA), capsid (CA) and nucleocapsid (NC) proteins from infectious
CC retroviruses. Evidence suggests that HERV-K(HML-2) Gag polyprotein can
CC be cleaved into mature MA, CA and NC under certain circumstances.
CC However, the exact boundaries as well as the size of processed Gag
CC proteins have not been precisely determined yet.
CC -!- PTM: Myristoylation is essential for retroviral assembly. Alteration of
CC the glycine residue leads to a block in the budding of particles and an
CC accumulation of Gag inside the cell (By similarity). {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages may yield mature proteins.
CC {ECO:0000305}.
CC -!- MISCELLANEOUS: This Gag protein is encoded by a human-specific
CC provirus.
CC -!- MISCELLANEOUS: [Isoform 2]: Gag polyprotein is produced by
CC conventional translation of the gag ORF.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral Gag protein family.
CC HERV class-II K(HML-2) gag subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAD51799.1; Type=Frameshift; Note=The frameshift probably occurs within the codons for the last amino acids in the Gag and Pro open reading frames.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF164615; AAD51799.1; ALT_SEQ; Genomic_DNA.
DR AlphaFoldDB; P63126; -.
DR SMR; P63126; -.
DR IntAct; P63126; 1.
DR BioMuta; HGNC:39005; -.
DR DMDM; 52000663; -.
DR jPOST; P63126; -.
DR MassIVE; P63126; -.
DR PeptideAtlas; P63126; -.
DR PRIDE; P63126; -.
DR GeneCards; ERVK-9; -.
DR HGNC; HGNC:39005; ERVK-9.
DR neXtProt; NX_P63126; -.
DR PhylomeDB; P63126; -.
DR Pharos; P63126; Tdark.
DR Proteomes; UP000005640; Unplaced.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0016032; P:viral process; IEA:InterPro.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.150.490; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR InterPro; IPR003322; B_retro_matrix.
DR InterPro; IPR038124; B_retro_matrix_sf.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF02337; Gag_p10; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF57756; SSF57756; 2.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 1: Evidence at protein level;
KW Cell membrane; ERV; Lipoprotein; Membrane; Metal-binding; Myristate;
KW Reference proteome; Repeat; Ribosomal frameshifting; Transposable element;
KW Zinc; Zinc-finger.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0000255"
FT CHAIN 2..666
FT /note="Endogenous retrovirus group K member 9 Gag
FT polyprotein"
FT /id="PRO_0000186747"
FT ZN_FING 544..561
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 580..597
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT REGION 165..264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 598..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 217..253
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT LIPID 2
FT /note="N-myristoyl glycine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 666 AA; 74005 MW; BF58ED0497788F84 CRC64;
MGQTKSKIKS KYASYLSFIK ILLKRGGVKV STKNLIKLFQ IIEQFCPWFP EQGTLDLKDW
KRIGKELKQA GRKGNIIPLT VWNDWAIIKA ALEPFQTEED SISVSDAPGS GIIDCNEKTR
KKSQKETESL HCEYVAEPVM AQSTQNVDYN QLQEVIYPET LKLEGKGPEL VGPSESKPRG
TSPLPAGQVP VTLQPQKQVK ENKTQPPVAY QYWPPAELQY RPPPESQYGY PGMPPAPQGR
APYPQPPTRR LNPTAPPSRQ GSELHEIIDK SRKEGDTEAW QFPVTLEPMP PGEGAQEGEP
PTVEARYKSF SIKILKDMKE GVKQYGPNSP YMRTLLDSIA HGHRLIPYDW EILAKSSLSP
SQFLQFKTWW IDGVQEQVRR NRAANPPVNI DADQLLGIGQ NWSTISQQAL MQNEAIEQVR
AICLRAWEKI QDPGSTCPSF NTVRQGSKEP YPDFVARLQD VAQKSIADEK ARKVIVELMA
YENANPECQS AIKPLKGKVP AGSDVISEYV KACDGIGGAM HKAMLMAQAI TGVVLGGQVR
TFGGKCYNCG QIGHLKKNCP VLNKQNITIQ ATTTGREPPD LCPRCKKGKH WASQCRSKFD
KNGQPLSGNE QRGQPQAPQQ TGAFPIQPFV PQGFQGQQPP LSQVFQGISQ LPQYNNCPPP
QVAVQQ
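
The SQ header above declares a length (666 AA), and the FT lines give 1-based, inclusive coordinates into the sequence (for example ZN_FING 544..561). A minimal sketch of how a reader might check both against a flat-file record follows. The function names `read_sequence` and `slice_feature` are hypothetical helpers, not part of any UniProt tooling, and the parser assumes only that the sequence lines follow the `SQ` line and end at `//` or at the end of the text.

```python
import re


def read_sequence(flat_text):
    """Return (declared_length, sequence) from a UniProt-style flat-file text.

    Hypothetical helper: assumes the residues follow the 'SQ' line and run
    until a '//' terminator or the end of the text.
    """
    declared = None
    residues = []
    in_sequence = False
    for line in flat_text.splitlines():
        if line.startswith("SQ"):
            # The SQ header carries the declared length, e.g. "SEQUENCE 666 AA;"
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA", line)
            declared = int(match.group(1)) if match else None
            in_sequence = True
        elif in_sequence:
            if line.startswith("//"):
                break
            # Sequence lines are grouped in blocks of 10; drop all whitespace
            residues.append("".join(line.split()))
    return declared, "".join(residues)


def slice_feature(sequence, start, end):
    """Extract a feature given 1-based, inclusive FT coordinates."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Tiny made-up record, used only to show the calling convention
    demo = "ID DEMO Reviewed; 12 AA.\nSQ SEQUENCE 12 AA; 0 MW; 0 CRC64;\nMGQTKSKIKS KY\n//\n"
    declared, seq = read_sequence(demo)
    print(declared, len(seq), slice_feature(seq, 2, 4))
```

Run against this entry, the declared length can be compared with `len(sequence)`, and `slice_feature(sequence, 544, 561)` would return the residues annotated as the first CCHC-type zinc finger.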