ENK21_HUMAN
ID ENK21_HUMAN Reviewed; 698 AA.
AC P61565;
DT 24-MAY-2004, integrated into UniProtKB/Swiss-Prot.
DT 24-MAY-2004, sequence version 1.
DT 25-MAY-2022, entry version 93.
DE RecName: Full=Endogenous retrovirus group K member 21 Env polyprotein;
DE AltName: Full=EnvK1 protein;
DE AltName: Full=Envelope polyprotein;
DE AltName: Full=HERV-K_12q14.1 provirus ancestral Env polyprotein;
DE Contains:
DE RecName: Full=Surface protein;
DE Short=SU;
DE Contains:
DE RecName: Full=Transmembrane protein;
DE Short=TM;
DE Flags: Precursor;
GN Name=ERVK-21;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16541075; DOI=10.1038/nature04569;
RA Scherer S.E., Muzny D.M., Buhay C.J., Chen R., Cree A., Ding Y.,
RA Dugan-Rocha S., Gill R., Gunaratne P., Harris R.A., Hawes A.C.,
RA Hernandez J., Hodgson A.V., Hume J., Jackson A., Khan Z.M., Kovar-Smith C.,
RA Lewis L.R., Lozado R.J., Metzker M.L., Milosavljevic A., Miner G.R.,
RA Montgomery K.T., Morgan M.B., Nazareth L.V., Scott G., Sodergren E.,
RA Song X.-Z., Steffen D., Lovering R.C., Wheeler D.A., Worley K.C., Yuan Y.,
RA Zhang Z., Adams C.Q., Ansari-Lari M.A., Ayele M., Brown M.J., Chen G.,
RA Chen Z., Clerc-Blankenburg K.P., Davis C., Delgado O., Dinh H.H.,
RA Draper H., Gonzalez-Garay M.L., Havlak P., Jackson L.R., Jacob L.S.,
RA Kelly S.H., Li L., Li Z., Liu J., Liu W., Lu J., Maheshwari M.,
RA Nguyen B.-V., Okwuonu G.O., Pasternak S., Perez L.M., Plopper F.J.H.,
RA Santibanez J., Shen H., Tabor P.E., Verduzco D., Waldron L., Wang Q.,
RA Williams G.A., Zhang J., Zhou J., Allen C.C., Amin A.G., Anyalebechi V.,
RA Bailey M., Barbaria J.A., Bimage K.E., Bryant N.P., Burch P.E.,
RA Burkett C.E., Burrell K.L., Calderon E., Cardenas V., Carter K., Casias K.,
RA Cavazos I., Cavazos S.R., Ceasar H., Chacko J., Chan S.N., Chavez D.,
RA Christopoulos C., Chu J., Cockrell R., Cox C.D., Dang M., Dathorne S.R.,
RA David R., Davis C.M., Davy-Carroll L., Deshazo D.R., Donlin J.E.,
RA D'Souza L., Eaves K.A., Egan A., Emery-Cohen A.J., Escotto M., Flagg N.,
RA Forbes L.D., Gabisi A.M., Garza M., Hamilton C., Henderson N.,
RA Hernandez O., Hines S., Hogues M.E., Huang M., Idlebird D.G., Johnson R.,
RA Jolivet A., Jones S., Kagan R., King L.M., Leal B., Lebow H., Lee S.,
RA LeVan J.M., Lewis L.C., London P., Lorensuhewa L.M., Loulseged H.,
RA Lovett D.A., Lucier A., Lucier R.L., Ma J., Madu R.C., Mapua P.,
RA Martindale A.D., Martinez E., Massey E., Mawhiney S., Meador M.G.,
RA Mendez S., Mercado C., Mercado I.C., Merritt C.E., Miner Z.L., Minja E.,
RA Mitchell T., Mohabbat F., Mohabbat K., Montgomery B., Moore N., Morris S.,
RA Munidasa M., Ngo R.N., Nguyen N.B., Nickerson E., Nwaokelemeh O.O.,
RA Nwokenkwo S., Obregon M., Oguh M., Oragunye N., Oviedo R.J., Parish B.J.,
RA Parker D.N., Parrish J., Parks K.L., Paul H.A., Payton B.A., Perez A.,
RA Perrin W., Pickens A., Primus E.L., Pu L.-L., Puazo M., Quiles M.M.,
RA Quiroz J.B., Rabata D., Reeves K., Ruiz S.J., Shao H., Sisson I.,
RA Sonaike T., Sorelle R.P., Sutton A.E., Svatek A.F., Svetz L.A.,
RA Tamerisa K.S., Taylor T.R., Teague B., Thomas N., Thorn R.D., Trejos Z.Y.,
RA Trevino B.K., Ukegbu O.N., Urban J.B., Vasquez L.I., Vera V.A.,
RA Villasana D.M., Wang L., Ward-Moore S., Warren J.T., Wei X., White F.,
RA Williamson A.L., Wleczyk R., Wooden H.S., Wooden S.H., Yen J., Yoon L.,
RA Yoon V., Zorrilla S.E., Nelson D., Kucherlapati R., Weinstock G.,
RA Gibbs R.A.;
RT "The finished DNA sequence of human chromosome 12.";
RL Nature 440:346-351(2006).
RN [2]
RP CHARACTERIZATION.
RX PubMed=12970426; DOI=10.1128/jvi.77.19.10414-10422.2003;
RA de Parseval N., Lazar V., Casella J.-F., Benit L., Heidmann T.;
RT "Survey of human genes of retroviral origin: identification and
RT transcriptome of the genes with coding capacity for complete envelope
RT proteins.";
RL J. Virol. 77:10414-10422(2003).
RN [3]
RP FUNCTION.
RX PubMed=14557543; DOI=10.1073/pnas.2132646100;
RA Blaise S., de Parseval N., Benit L., Heidmann T.;
RT "Genomewide screening for fusogenic human endogenous retrovirus envelopes
RT identifies syncytin 2, a gene conserved on primate evolution.";
RL Proc. Natl. Acad. Sci. U.S.A. 100:13013-13018(2003).
RN [4]
RP SUBGENOMIC RNA.
RX PubMed=12629516; DOI=10.1038/sj.onc.1206241;
RA Wang-Johanning F., Frost A.R., Jian B., Epp L., Lu D.W., Johanning G.L.;
RT "Quantitation of HERV-K env gene expression and splicing in human breast
RT cancer.";
RL Oncogene 22:1528-1535(2003).
CC -!- FUNCTION: Retroviral envelope proteins mediate receptor recognition and
CC membrane fusion during early infection. Endogenous envelope proteins
CC may have kept, lost or modified their original function during
CC evolution. This endogenous envelope protein has lost its original
CC fusogenic properties. {ECO:0000269|PubMed:14557543}.
CC -!- FUNCTION: SU mediates receptor recognition. {ECO:0000250}.
CC -!- FUNCTION: TM anchors the envelope heterodimer to the viral membrane
CC through one transmembrane domain. The other hydrophobic domain, called
CC fusion peptide, mediates fusion of the viral membrane with the target
CC cell membrane (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: The surface (SU) and transmembrane (TM) proteins form a
CC heterodimer. SU and TM are attached by noncovalent interactions or by a
CC labile interchain disulfide bond (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Transmembrane protein]: Cell membrane
CC {ECO:0000250}; Single-pass type I membrane protein {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Surface protein]: Cell membrane {ECO:0000250};
CC Peripheral membrane protein {ECO:0000250}. Note=The surface protein is
CC not anchored to the membrane, but localizes to the extracellular
CC surface through its binding to TM. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: [Endogenous retrovirus group K member 21 Env
CC polyprotein]: Virion {ECO:0000250}.
CC -!- PTM: Specific enzymatic cleavages in vivo yield the mature SU and TM
CC proteins. {ECO:0000250}.
CC -!- MISCELLANEOUS: ERVK-21 has a type 2 genome. The HERV-K(HML-2) family
CC contains type 1 and type 2 genomes depending on the absence or presence
CC of 292 nucleotides at the 5'-end of the env gene resulting in Env
CC proteins of distinct sizes. Despite their overall retroviral envelope
CC structure HERV-K(HML-2) type 1 envelope proteins lack a predictable
CC signal sequence. Subgenomic RNA transcripts coding for full-length
CC envelope proteins have been detected for both type of genomes.
CC -!- SIMILARITY: Belongs to the beta type-B retroviral envelope protein
CC family. HERV class-II K(HML-2) env subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC025420; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; P61565; -.
DR IntAct; P61565; 1.
DR GlyGen; P61565; 11 sites.
DR iPTMnet; P61565; -.
DR PhosphoSitePlus; P61565; -.
DR BioMuta; HGNC:39035; -.
DR jPOST; P61565; -.
DR MassIVE; P61565; -.
DR PeptideAtlas; P61565; -.
DR PRIDE; P61565; -.
DR GeneCards; ERVK-21; -.
DR HGNC; HGNC:39035; ERVK-21.
DR neXtProt; NX_P61565; -.
DR PhylomeDB; P61565; -.
DR Pharos; P61565; Tdark.
DR Proteomes; UP000005640; Unplaced.
DR RNAct; P61565; protein.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR CDD; cd09909; HIV-1-like_HR1-HR2; 1.
DR InterPro; IPR000328; GP41-like.
DR InterPro; IPR029104; HERV-K_env.
DR Pfam; PF00517; GP41; 1.
DR Pfam; PF13804; HERV-K_env_2; 1.
PE 1: Evidence at protein level;
KW Cell membrane; Cleavage on pair of basic residues; Disulfide bond; ERV;
KW Glycoprotein; Membrane; Reference proteome; Signal; Transmembrane;
KW Transmembrane helix; Transposable element; Viral envelope protein; Virion.
FT SIGNAL 1..88
FT /evidence="ECO:0000255"
FT CHAIN 89..698
FT /note="Endogenous retrovirus group K member 21 Env
FT polyprotein"
FT /id="PRO_0000008497"
FT CHAIN 89..464
FT /note="Surface protein"
FT /evidence="ECO:0000250"
FT /id="PRO_0000008498"
FT CHAIN 465..698
FT /note="Transmembrane protein"
FT /evidence="ECO:0000250"
FT /id="PRO_0000008499"
FT TOPO_DOM 89..631
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 632..652
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 653..698
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 465..485
FT /note="Fusion peptide"
FT /evidence="ECO:0000255"
FT COMPBIAS 9..24
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT SITE 464..465
FT /note="Cleavage"
FT /evidence="ECO:0000250"
FT CARBOHYD 99
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 127
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 152
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 273
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 354
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 371
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 460
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 506
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 553
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 565
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 584
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 698 AA; 79236 MW; 367669EE890E465A CRC64;
MHPSEMQRKA PPRRRRHRNR APLTHKMNKM VTSEQMKLPS TKKAEPPTWA QLKKLTQLAT
KYLENTKVTQ TPESMLLAAL MIVSMVVSLP MPAGAAAANY TNWAYVPFPP LIRAVTWMDN
PIEVYVNDSV WVHGPIDDRC PAKPEEEGMM INISIGYHYP PICLGRAPGC LMPAVQNWLV
EVPTVSPISR FTYNMVSGMS LRPRVNYLQD FSYQRSLKFR PKGKPCPKEI PKESKNTEVL
VWEECVANSV VILQNNEFGT IIDWAPRGQF YHNCSGQTQS CPSAQVSPAV DSDLTESLDK
HKHKKLQSFY PWEWGEKGIS TPRPKIISPV SGPEHPELWR LTVASHHIRI WSGNQTLETR
DRKPFYTVDL NSSLTVPLQS CVKPPYMLVV GNIVIKPDSQ TITCENCRLL TCIDSTFNWQ
HRILLVRARE GVWIPVSMDR PWEASPSIHI LTEVLKGVLN RSKRFIFTLI AVIMGLIAVT
AMAAVAGVAL HSFVQSVNFV NDWQKNSTRL WNSQSSIDQK LANQINDLRQ TVIWMGDRLM
SLEHRFQLQC DWNTSDFCIT PQIYNESEHH WDMVRRHLQG REDNLTLDIS KLKEQIFEAS
KAHLNLVPGT EAIAGVADGL ANLNPVTWVK TIGSTTIINL ILILVCLFCL LLVCRCTQQL
RRDSDHRERA MMTMVVLSKR KGGNVGKSKR DQIVTVSV