GLYC_ALLVP
ID GLYC_ALLVP Reviewed; 507 AA.
AC Q9DK03;
DT 20-JAN-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 75.
DE RecName: Full=Pre-glycoprotein polyprotein GP complex {ECO:0000255|HAMAP-Rule:MF_04084};
DE Short=Pre-GP-C {ECO:0000255|HAMAP-Rule:MF_04084};
DE Contains:
DE RecName: Full=Stable signal peptide {ECO:0000255|HAMAP-Rule:MF_04084};
DE Short=SSP {ECO:0000255|HAMAP-Rule:MF_04084};
DE Contains:
DE RecName: Full=Glycoprotein G1 {ECO:0000255|HAMAP-Rule:MF_04084};
DE Short=GP1 {ECO:0000255|HAMAP-Rule:MF_04084};
DE Contains:
DE RecName: Full=Glycoprotein G2 {ECO:0000255|HAMAP-Rule:MF_04084};
DE Short=GP2 {ECO:0000255|HAMAP-Rule:MF_04084};
GN Name=GPC {ECO:0000255|HAMAP-Rule:MF_04084}; Synonyms=GP-C;
OS Allpahuayo mammarenavirus (isolate Rat/Peru/CLHP-2472/1997) (ALLV).
OC Viruses; Riboviria; Orthornavirae; Negarnaviricota; Polyploviricotina;
OC Ellioviricetes; Bunyavirales; Arenaviridae; Mammarenavirus.
OX NCBI_TaxID=144752;
OH NCBI_TaxID=48011; Oecomys bicolor (Bicolored arboreal rice rat).
OH NCBI_TaxID=48012; Oecomys roberti (Robert's arboreal rice rat).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=11384226; DOI=10.1006/viro.2000.0803;
RA Moncayo A.C., Hice C.L., Watts D.M., Travassos de Rosa A.P., Guzman H.,
RA Russell K.L., Calampa C., Gozalo A., Popov V.L., Weaver S.C., Tesh R.B.;
RT "Allpahuayo virus: a newly recognized arenavirus (arenaviridae) from
RT arboreal rice rats (oecomys bicolor and oecomys paricola) in northeastern
RT peru.";
RL Virology 284:277-286(2001).
CC -!- FUNCTION: [Glycoprotein G2]: Class I viral fusion protein that directs
CC fusion of viral and host endosomal membranes, leading to delivery of
CC the nucleocapsid into the cytoplasm. Membrane fusion is mediated by
CC irreversible conformational changes induced upon acidification in the
CC endosome. {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- FUNCTION: Stable signal peptide (SSP): cleaved and functions as a
CC signal peptide. In addition, it is also retained as the third component
CC of the GP complex. The SSP is required for efficient glycoprotein
CC expression, post-translational maturation cleavage of GP1 and GP2,
CC glycoprotein transport to the cell surface plasma membrane, formation
CC of infectious virus particles, and acid pH-dependent glycoprotein-
CC mediated cell fusion. {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- FUNCTION: [Glycoprotein G1]: Interacts with the host receptor.
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- SUBUNIT: [Glycoprotein G1]: Homotetramer; disulfide-linked.
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- SUBUNIT: [Glycoprotein G2]: Homotetramer. GP2 homotetramers bind
CC through ionic interactions with GP1 homotetramers to form the GP
CC complex together with the stable signal peptide. The GP-C polyprotein
CC interacts with the host protease MBTPS1/SKI-1 resulting in the
CC polyprotein processing. {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- SUBCELLULAR LOCATION: [Glycoprotein G1]: Virion membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Peripheral membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host endoplasmic reticulum membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Peripheral membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host Golgi apparatus membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Peripheral membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host cell membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Peripheral membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- SUBCELLULAR LOCATION: [Glycoprotein G2]: Virion membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Single-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host endoplasmic reticulum membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Single-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host Golgi apparatus membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Single-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host cell membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Single-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Note=Binding to the stable signal
CC peptide masks endogenous ER localization signals in the cytoplasmic
CC domain of G2 to ensure that only the fully assembled, tripartite GP
CC complex is transported for virion assembly. {ECO:0000255|HAMAP-
CC Rule:MF_04084}.
CC -!- SUBCELLULAR LOCATION: [Stable signal peptide]: Virion membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Multi-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host endoplasmic reticulum membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Multi-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host Golgi apparatus membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Multi-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}. Host cell membrane
CC {ECO:0000255|HAMAP-Rule:MF_04084}; Multi-pass membrane protein
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- DOMAIN: The cytoplasmic domain of GP2 plays a role in ER location. It
CC also contains a zinc-binding domain that allows SSP retention in the
CC GPC complex by accepting a cysteine from SSP as the fourth ligand.
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- PTM: Specific enzymatic cleavages in vivo yield mature proteins. GP-C
CC polyprotein is cleaved in the endoplasmic reticulum by the host
CC protease MBTPS1. Only cleaved glycoprotein is incorporated into
CC virions. {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- PTM: The SSP remains stably associated with the GP complex following
CC cleavage by signal peptidase and plays crucial roles in the trafficking
CC of GP through the secretory pathway. {ECO:0000255|HAMAP-Rule:MF_04084}.
CC -!- SIMILARITY: Belongs to the arenaviridae GPC protein family.
CC {ECO:0000255|HAMAP-Rule:MF_04084}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY012687; AAG42531.1; -; Genomic_RNA.
DR RefSeq; YP_001649221.1; NC_010253.1.
DR SMR; Q9DK03; -.
DR GeneID; 5848530; -.
DR KEGG; vg:5848530; -.
DR Proteomes; UP000009258; Genome.
DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044178; C:host cell Golgi membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-UniRule.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-UniRule.
DR GO; GO:0019065; P:receptor-mediated endocytosis of virus by host cell; IEA:UniProtKB-UniRule.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-UniRule.
DR Gene3D; 2.20.28.180; -; 1.
DR HAMAP; MF_04084; ARENA_GPC; 1.
DR InterPro; IPR001535; Arena_glycoprot.
DR InterPro; IPR043015; Arena_glycoprot_zinc-bd.
DR Pfam; PF00798; Arena_glycoprot; 1.
DR PIRSF; PIRSF004028; GPC_ArenaV; 1.
PE 3: Inferred from homology;
KW Disulfide bond; Fusion of virus membrane with host endosomal membrane;
KW Fusion of virus membrane with host membrane; Glycoprotein;
KW Host cell membrane; Host endoplasmic reticulum; Host Golgi apparatus;
KW Host membrane; Host-virus interaction; Lipoprotein; Membrane;
KW Metal-binding; Myristate; Transmembrane; Transmembrane helix;
KW Viral attachment to host cell; Viral envelope protein;
KW Viral penetration into host cytoplasm; Virion; Virus endocytosis by host;
KW Virus entry into host cell; Zinc.
FT INIT_MET 1
FT /note="Removed; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CHAIN 2..507
FT /note="Pre-glycoprotein polyprotein GP complex"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT /id="PRO_0000361572"
FT CHAIN 2..58
FT /note="Stable signal peptide"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT /id="PRO_0000361573"
FT CHAIN 59..272
FT /note="Glycoprotein G1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT /id="PRO_0000361574"
FT CHAIN 273..507
FT /note="Glycoprotein G2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT /id="PRO_0000361575"
FT TOPO_DOM 2..17
FT /note="Extracellular"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TRANSMEM 18..32
FT /note="Helical"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TOPO_DOM 33
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TRANSMEM 34..53
FT /note="Helical"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TOPO_DOM 54..58
FT /note="Extracellular"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TOPO_DOM 59..445
FT /note="Extracellular"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TRANSMEM 446..466
FT /note="Helical"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT TOPO_DOM 467..507
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 57
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 468
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 470
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 476
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 480
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 488
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT BINDING 490
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT SITE 58..59
FT /note="Cleavage; by host signal peptidase"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT SITE 272..273
FT /note="Cleavage; by host MBTPS1"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT LIPID 2
FT /note="N-myristoyl glycine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 74
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 89
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 111
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 119
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 130
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 179
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 223
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 240
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 378
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 386
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 403
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT CARBOHYD 408
FT /note="N-linked (GlcNAc...) asparagine; by host"
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT DISULFID 86..247
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT DISULFID 292..305
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT DISULFID 314..323
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
FT DISULFID 377..398
FT /evidence="ECO:0000255|HAMAP-Rule:MF_04084"
SQ SEQUENCE 507 AA; 57515 MW; A5482D146D98113F CRC64;
MGQVVTFLQS LPEVINEAIN IALIAISIIC ILKGLVNFWK CGVVQLAIFL CLAGRKCDGL
MIDRRHELSH VELNLTRMFD NLPQSCSKNN THHYYKGPKG TTWGIELTLT NTSLDSYANM
SRIRSLAFGN ITNCDKTGEA GHTLKWLLNE LHFNVLHVTR HVGARCRVSE GAGLLIQYNL
TIGDHGGEVG RHLIASLAQI IGDNKAAWVG KCDSHCTMDG KCNYTNCEGF THYNYLIIQN
TTWENHCSYS PMSTIRMALN KVAYSSVSRQ LLGFFTWDIS DSSGAHVPGG YCLEQWAIVW
AGIKCFDNAV MAKCNKDHNV EFCDTMRLFD FNQNAIKTLQ LNVENSVNLL KRSINGLISD
SLVIRNSLKQ LAKIPYCNYT KFWYVNDTIT GKHSLPQCWL MRNGSYLNET HFKNEWLWES
QNLYNEMLLK EYEDRQGKTP IALTDICFWS LVFFTSTVFL QLVGIPTHRH LVGEGCPKPH
RITSNSLCAC GYYKIPKRPT RWVRKGK