GAK21_HUMAN
ID GAK21_HUMAN Reviewed; 666 AA.
AC P62683;
DT 19-JUL-2004, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 2.
DT 25-MAY-2022, entry version 102.
DE RecName: Full=Endogenous retrovirus group K member 21 Gag polyprotein;
DE AltName: Full=HERV-K_12q14.1 provirus ancestral Gag polyprotein;
DE Short=Gag polyprotein;
GN Name=ERVK-21;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16541075; DOI=10.1038/nature04569;
RA Scherer S.E., Muzny D.M., Buhay C.J., Chen R., Cree A., Ding Y.,
RA Dugan-Rocha S., Gill R., Gunaratne P., Harris R.A., Hawes A.C.,
RA Hernandez J., Hodgson A.V., Hume J., Jackson A., Khan Z.M., Kovar-Smith C.,
RA Lewis L.R., Lozado R.J., Metzker M.L., Milosavljevic A., Miner G.R.,
RA Montgomery K.T., Morgan M.B., Nazareth L.V., Scott G., Sodergren E.,
RA Song X.-Z., Steffen D., Lovering R.C., Wheeler D.A., Worley K.C., Yuan Y.,
RA Zhang Z., Adams C.Q., Ansari-Lari M.A., Ayele M., Brown M.J., Chen G.,
RA Chen Z., Clerc-Blankenburg K.P., Davis C., Delgado O., Dinh H.H.,
RA Draper H., Gonzalez-Garay M.L., Havlak P., Jackson L.R., Jacob L.S.,
RA Kelly S.H., Li L., Li Z., Liu J., Liu W., Lu J., Maheshwari M.,
RA Nguyen B.-V., Okwuonu G.O., Pasternak S., Perez L.M., Plopper F.J.H.,
RA Santibanez J., Shen H., Tabor P.E., Verduzco D., Waldron L., Wang Q.,
RA Williams G.A., Zhang J., Zhou J., Allen C.C., Amin A.G., Anyalebechi V.,
RA Bailey M., Barbaria J.A., Bimage K.E., Bryant N.P., Burch P.E.,
RA Burkett C.E., Burrell K.L., Calderon E., Cardenas V., Carter K., Casias K.,
RA Cavazos I., Cavazos S.R., Ceasar H., Chacko J., Chan S.N., Chavez D.,
RA Christopoulos C., Chu J., Cockrell R., Cox C.D., Dang M., Dathorne S.R.,
RA David R., Davis C.M., Davy-Carroll L., Deshazo D.R., Donlin J.E.,
RA D'Souza L., Eaves K.A., Egan A., Emery-Cohen A.J., Escotto M., Flagg N.,
RA Forbes L.D., Gabisi A.M., Garza M., Hamilton C., Henderson N.,
RA Hernandez O., Hines S., Hogues M.E., Huang M., Idlebird D.G., Johnson R.,
RA Jolivet A., Jones S., Kagan R., King L.M., Leal B., Lebow H., Lee S.,
RA LeVan J.M., Lewis L.C., London P., Lorensuhewa L.M., Loulseged H.,
RA Lovett D.A., Lucier A., Lucier R.L., Ma J., Madu R.C., Mapua P.,
RA Martindale A.D., Martinez E., Massey E., Mawhiney S., Meador M.G.,
RA Mendez S., Mercado C., Mercado I.C., Merritt C.E., Miner Z.L., Minja E.,
RA Mitchell T., Mohabbat F., Mohabbat K., Montgomery B., Moore N., Morris S.,
RA Munidasa M., Ngo R.N., Nguyen N.B., Nickerson E., Nwaokelemeh O.O.,
RA Nwokenkwo S., Obregon M., Oguh M., Oragunye N., Oviedo R.J., Parish B.J.,
RA Parker D.N., Parrish J., Parks K.L., Paul H.A., Payton B.A., Perez A.,
RA Perrin W., Pickens A., Primus E.L., Pu L.-L., Puazo M., Quiles M.M.,
RA Quiroz J.B., Rabata D., Reeves K., Ruiz S.J., Shao H., Sisson I.,
RA Sonaike T., Sorelle R.P., Sutton A.E., Svatek A.F., Svetz L.A.,
RA Tamerisa K.S., Taylor T.R., Teague B., Thomas N., Thorn R.D., Trejos Z.Y.,
RA Trevino B.K., Ukegbu O.N., Urban J.B., Vasquez L.I., Vera V.A.,
RA Villasana D.M., Wang L., Ward-Moore S., Warren J.T., Wei X., White F.,
RA Williamson A.L., Wleczyk R., Wooden H.S., Wooden S.H., Yen J., Yoon L.,
RA Yoon V., Zorrilla S.E., Nelson D., Kucherlapati R., Weinstock G.,
RA Gibbs R.A.;
RT "The finished DNA sequence of human chromosome 12.";
RL Nature 440:346-351(2006).
RN [2]
RP CHARACTERIZATION.
RX PubMed=7983737; DOI=10.1128/jvi.69.1.414-421.1995;
RA Sauter M., Schommer S., Kremmer E., Remberger K., Doelken G., Lemm I.,
RA Buck M., Best B., Neumann-Haefelin D., Mueller-Lantzsch N.;
RT "Human endogenous retrovirus K10: expression of Gag protein and detection
RT of antibodies in patients with seminomas.";
RL J. Virol. 69:414-421(1995).
CC -!- FUNCTION: The products of the Gag polyproteins of infectious
CC retroviruses perform highly complex orchestrated tasks during the
CC assembly, budding, maturation, and infection stages of the viral
CC replication cycle. During viral assembly, the proteins form membrane
CC associations and self-associations that ultimately result in budding of
CC an immature virion from the infected cell. Gag precursors also function
CC during viral assembly to selectively bind and package two plus strands
CC of genomic RNA. Endogenous Gag proteins may have kept, lost or modified
CC their original function during evolution.
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor. Note=Cytoplasmic
CC membrane (in a transfection system). {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Ribosomal frameshifting; Named isoforms=1;
CC Comment=This protein is synthesized as a Gag polypeptide and as a
CC Gag-Pro-Pol polyprotein. The later is the precursor of the Pro and
CC Pol proteins. It is thought, by similarity with type-B retroviruses,
CC to be generated by -1 frameshifts occurring at the Gag-Pro and
CC Pro-Pol genes boundaries.;
CC Name=1;
CC IsoId=P62683-1; Sequence=Displayed;
CC -!- DOMAIN: HERV-K Gag polyprotein contains regions homologous to the
CC matrix (MA), capsid (CA) and nucleocapsid (NC) proteins from infectious
CC retroviruses. Evidence suggests that HERV-K(HML-2) Gag polyprotein can
CC be cleaved into mature MA, CA and NC under certain circumstances.
CC However, the exact boundaries as well as the size of processed Gag
CC proteins have not been precisely determined yet.
CC -!- PTM: Myristoylation is essential for retroviral assembly. Alteration of
CC the glycine residue leads to a block in the budding of particles and an
CC accumulation of Gag inside the cell (By similarity). {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages may yield mature proteins.
CC {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral Gag protein family.
CC HERV class-II K(HML-2) gag subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC025420; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; P62683; -.
DR SMR; P62683; -.
DR IntAct; P62683; 3.
DR BioMuta; HGNC:39035; -.
DR DMDM; 50400277; -.
DR jPOST; P62683; -.
DR MassIVE; P62683; -.
DR PeptideAtlas; P62683; -.
DR PRIDE; P62683; -.
DR GeneCards; ERVK-21; -.
DR HGNC; HGNC:39035; ERVK-21.
DR neXtProt; NX_P62683; -.
DR PhylomeDB; P62683; -.
DR Pharos; P62683; Tdark.
DR Proteomes; UP000005640; Unplaced.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0016032; P:viral process; IEA:InterPro.
DR Gene3D; 1.10.1200.30; -; 1.
DR Gene3D; 1.10.150.490; -; 1.
DR Gene3D; 1.10.375.10; -; 1.
DR InterPro; IPR003322; B_retro_matrix.
DR InterPro; IPR038124; B_retro_matrix_sf.
DR InterPro; IPR045345; Gag_p24_C.
DR InterPro; IPR000721; Gag_p24_N.
DR InterPro; IPR008916; Retrov_capsid_C.
DR InterPro; IPR008919; Retrov_capsid_N.
DR InterPro; IPR010999; Retrovr_matrix.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR Pfam; PF02337; Gag_p10; 1.
DR Pfam; PF00607; Gag_p24; 1.
DR Pfam; PF19317; Gag_p24_C; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF47836; SSF47836; 1.
DR SUPFAM; SSF47943; SSF47943; 1.
DR SUPFAM; SSF57756; SSF57756; 2.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 1: Evidence at protein level;
KW Cell membrane; ERV; Lipoprotein; Membrane; Metal-binding; Myristate;
KW Reference proteome; Repeat; Ribosomal frameshifting; Transposable element;
KW Zinc; Zinc-finger.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0000255"
FT CHAIN 2..666
FT /note="Endogenous retrovirus group K member 21 Gag
FT polyprotein"
FT /id="PRO_0000186744"
FT ZN_FING 544..561
FT /note="CCHC-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT ZN_FING 580..597
FT /note="CCHC-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT REGION 165..189
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 217..264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 598..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 217..253
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT LIPID 2
FT /note="N-myristoyl glycine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 666 AA; 74000 MW; D74458D3DA55403C CRC64;
MGQTKSKIKS KYASYLSFIK ILLKRGGVKV STKNLIKLFQ IIEQFCPWFP EQGTLDLKDW
KRIGKELKQA GRKGNIIPLT VWNDWAIIKA ALEPFQTEED SISVSDAPGS CIIDCNENTR
KKSQKETEGL HCEYAAEPVM AQSTQNVDYN QLQEVIYPET LKLEGKGPEL VGPSESKPRG
TSPLPAGQVP VTLQPQTQVK ENKTQPPVAY QYWPPAELQY RPPPESQYGY PGMPPAPQGR
APYPQPPTRR LNPTAPPSRQ GSELHEIIDK SRKEGDTEAW QFPVMLEPMP PGEGAQEGEP
PTVEARYKSF SIKMLKDMKE GVKQYGPNSP YMRTLLDSIA HGHRLIPYDW EILAKSSLLP
SQFLQFKTWW IDGVQEQVQR NRAANPPVNI DADQLLGIGQ NWSTISQQAL MQNEAIEQVR
AICLRAWEKI QDPGSTCPSF NTVRQSSKEP YPDFVARLQD VAQKSIADEK ARKVIVELMA
YENANPECQS AIKPLKGKVP AGSDVISEYV KACDGIGGAM HKAMLMAQAI TGVVLGGQVR
TFGGKCYNCG QIGHLKKNCP VLNKQNITIQ ATTTGREPPD LCPRCKKGKH WASQCRSKFD
KNGQPLSGNE QRGQPQAPQQ TGAFPIQPFV PQGFQGQQPP LSQVFQGISQ LPQYNNCPPP
QAAVQQ