Y5790_MYCS2
ID Y5790_MYCS2 Reviewed; 90 AA.
AC A0R4D0; I7G8U6;
DT 10-AUG-2010, integrated into UniProtKB/Swiss-Prot.
DT 09-JAN-2007, sequence version 1.
DT 25-MAY-2022, entry version 84.
DE RecName: Full=Uncharacterized protein MSMEG_5790/MSMEI_5637;
GN OrderedLocusNames=MSMEG_5790, MSMEI_5637;
OS Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) (Mycobacterium
OS smegmatis).
OC Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
OC Mycolicibacterium.
OX NCBI_TaxID=246196;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RA Fleischmann R.D., Dodson R.J., Haft D.H., Merkel J.S., Nelson W.C.,
RA Fraser C.M.;
RL Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RX PubMed=17295914; DOI=10.1186/gb-2007-8-2-r20;
RA Deshayes C., Perrodou E., Gallien S., Euphrasie D., Schaeffer C.,
RA Van-Dorsselaer A., Poch O., Lecompte O., Reyrat J.-M.;
RT "Interrupted coding sequences in Mycobacterium smegmatis: authentic
RT mutations or sequencing errors?";
RL Genome Biol. 8:R20.1-R20.9(2007).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RX PubMed=18955433; DOI=10.1101/gr.081901.108;
RA Gallien S., Perrodou E., Carapito C., Deshayes C., Reyrat J.-M.,
RA Van Dorsselaer A., Poch O., Schaeffer C., Lecompte O.;
RT "Ortho-proteogenomics: multiple proteomes investigation through orthology
RT and a new MS-based protocol.";
RL Genome Res. 19:128-135(2009).
RN [4]
RP PUPYLATION AT LYS-88, AND IDENTIFICATION BY MASS SPECTROMETRY.
RX PubMed=20094657; DOI=10.1039/b916104j;
RA Watrous J., Burns K., Liu W.T., Patel A., Hook V., Bafna V.,
RA Barry C.E. III, Bark S., Dorrestein P.C.;
RT "Expansion of the mycobacterial 'PUPylome'.";
RL Mol. Biosyst. 6:376-385(2010).
CC -!- SEQUENCE CAUTION:
CC Sequence=AFP42072.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000480; ABK70882.1; -; Genomic_DNA.
DR EMBL; CP001663; AFP42072.1; ALT_INIT; Genomic_DNA.
DR RefSeq; YP_890018.1; NC_008596.1.
DR AlphaFoldDB; A0R4D0; -.
DR SMR; A0R4D0; -.
DR STRING; 246196.MSMEI_5637; -.
DR EnsemblBacteria; ABK70882; ABK70882; MSMEG_5790.
DR EnsemblBacteria; AFP42072; AFP42072; MSMEI_5637.
DR KEGG; msg:MSMEI_5637; -.
DR KEGG; msm:MSMEG_5790; -.
DR PATRIC; fig|246196.19.peg.5635; -.
DR eggNOG; ENOG5032Y6I; Bacteria.
DR OMA; MCTAPKQ; -.
DR Proteomes; UP000000757; Chromosome.
DR Proteomes; UP000006158; Chromosome.
DR InterPro; IPR008969; CarboxyPept-like_regulatory.
DR InterPro; IPR010814; DUF1416.
DR Pfam; PF07210; DUF1416; 1.
DR SUPFAM; SSF49464; SSF49464; 1.
PE 1: Evidence at protein level;
KW Isopeptide bond; Reference proteome; Ubl conjugation.
FT CHAIN 1..90
FT /note="Uncharacterized protein MSMEG_5790/MSMEI_5637"
FT /id="PRO_0000396089"
FT CROSSLNK 88
FT /note="Isoglutamyl lysine isopeptide (Lys-Gln) (interchain
FT with Q-Cter in protein Pup)"
FT /evidence="ECO:0000269|PubMed:20094657"
SQ SEQUENCE 90 AA; 9050 MW; BC3B8F7C66EBAEEF CRC64;
MPAGVDLEKE TVITGRVVDG SGQAVGGAFV RLLDGSDEFT AEVVASATGD FRFFAAPGTW
TVRALSSAGN GNVTVAPTGA GIHEVDVKVA
//