ID CSG_METVO Reviewed; 576 AA.
AC Q50833;
DT 26-SEP-2001, integrated into UniProtKB/Swiss-Prot.
DT 26-SEP-2001, sequence version 2.
DT 25-MAY-2022, entry version 69.
DE RecName: Full=S-layer protein {ECO:0000303|PubMed:8132478};
DE AltName: Full=Cell surface glycoprotein;
DE AltName: Full=Surface layer protein;
DE Flags: Precursor; Fragment;
GN Name=sla;
OS Methanococcus voltae.
OC Archaea; Euryarchaeota; Methanomada group; Methanococci; Methanococcales;
OC Methanococcaceae; Methanococcus.
OX NCBI_TaxID=2188;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND PROTEIN SEQUENCE OF 24-37.
RX PubMed=1825827; DOI=10.1128/jb.173.6.2131-2133.1991;
RA Dharmavaram R., Gillevet P., Konisky J.;
RT "Nucleotide sequence of the gene encoding the vanadate-sensitive membrane-
RT associated ATPase of Methanococcus voltae.";
RL J. Bacteriol. 173:2131-2133(1991).
RN [2]
RP PROTEIN SEQUENCE OF 98-114, GLYCOSYLATION AT ASN-102, GLYCAN STRUCTURE, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=ATCC 33273 / DSM 1537 / NBRC 100457 / OCM 70 / PS;
RX PubMed=15723834; DOI=10.1074/jbc.m500329200;
RA Voisin S., Houliston R.S., Kelly J., Brisson J.-R., Watson D., Bardy S.L.,
RA Jarrell K.F., Logan S.M.;
RT "Identification and characterization of the unique N-linked glycan common
RT to the flagellins and S-layer glycoprotein of Methanococcus voltae.";
RL J. Biol. Chem. 280:16586-16593(2005).
RN [3]
RP IDENTIFICATION, PROTEIN SEQUENCE OF 24-51, FUNCTION, AND SUBCELLULAR
RP LOCATION.
RC STRAIN=ATCC 33273 / DSM 1537 / NBRC 100457 / OCM 70 / PS;
RX PubMed=8132478; DOI=10.1128/jb.176.6.1790-1792.1994;
RA Konisky J., Lynn D., Hoppert M., Mayer F., Haney P.;
RT "Identification of the Methanococcus voltae S-layer structural gene.";
RL J. Bacteriol. 176:1790-1792(1994).
CC -!- FUNCTION: S-layer protein. The S-layer is a paracrystalline mono-
CC layered assembly of proteins that coats the surface of the cell.
CC {ECO:0000269|PubMed:8132478}.
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall, S-layer
CC {ECO:0000269|PubMed:8132478}.
CC -!- PTM: N-linked glycans consist of the 779 Da trisaccharide beta-
CC ManNAc(Thr)-(1-4)-beta-GlcNAc3NAcA-(1-3)-beta-GlcNAc.
CC {ECO:0000269|PubMed:15723834}.
CC -!- SIMILARITY: Belongs to the Mj S-layer protein family. {ECO:0000305}.
CC -!- CAUTION: Was originally thought to be a P-type ATPase.
CC {ECO:0000305|PubMed:1825827}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA93515.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M59200; AAA93515.1; ALT_INIT; Genomic_DNA.
DR PIR; A38542; A38542.
DR AlphaFoldDB; Q50833; -.
DR iPTMnet; Q50833; -.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0030115; C:S-layer; IEA:UniProtKB-SubCell.
DR GO; GO:0071555; P:cell wall organization; IEA:UniProtKB-KW.
DR InterPro; IPR022651; S_layer_C.
DR InterPro; IPR006454; S_layer_MJ.
DR InterPro; IPR022650; S_layer_N.
DR Pfam; PF05124; S_layer_C; 1.
DR Pfam; PF05123; S_layer_N; 1.
DR TIGRFAMs; TIGR01564; S_layer_MJ; 1.
PE 1: Evidence at protein level;
KW Cell wall; Cell wall biogenesis/degradation; Direct protein sequencing;
KW Glycoprotein; S-layer; Secreted; Signal.
FT SIGNAL <1..23
FT /evidence="ECO:0000269|PubMed:1825827,
FT ECO:0000269|PubMed:8132478"
FT CHAIN 24..576
FT /note="S-layer protein"
FT /id="PRO_0000032621"
FT CARBOHYD 102
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:15723834"
FT CARBOHYD 132
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
FT NON_TER 1
SQ SEQUENCE 576 AA; 60676 MW; 3777E1153BF007B4 CRC64;
     KKIGAIAAGS AMVASALATG VFAVEKIGDV EGFKVIDNGE PTADIVVGST AAAADVVSAA
     NVAAKVGSMM FKEGEAASGS AKLTVKASAE SDDANLKSLL TNGTNDFTEL DAGKEAFVVA
     AADSDYSDAL INATTGFANI ADNVLYDQAK LAAAVSLGDL STLSVVKDID PSDWYADKNK
     AADVATKDYY DQDGDAVEML MATVASNDDG KSLTVDEDGV LYASIAYDDD NEDFQRATQV
     LKEGNRLPFL GEEYALVKLD TDDDIVYLGK EVFDGVLKEG DTYNIGDGYE LKVVAILKSG
     DEYKISLQLM KDGKVVAEKF DKVSATSALK MIYTPGNIGI VVNEAWENVG QDYGYGSTLI
     TKDVIALELG EEYIPDWEVV TIEKDTTTDN TKDSKMTLSD DKITKDNTYG IGLQYVGDEE
     DNFKSGKAIK IAKYAELELD DEDKEDTKLN LFFSMDETKE ATLAAGQKVT VLNSDITLSE
     VMADAKAPVA FKAPLAVLDT EVSLDAANKK LILVGGPVAN ALTKELADAG KIEMTVESPA
     TLAVVAGAAN GNDVLVVAGG DRAATAEAAN ALIEML
//