ESPE_MYCS2
ID ESPE_MYCS2 Reviewed; 525 AA.
AC A0QNI5; I7FCC6;
DT 30-NOV-2016, integrated into UniProtKB/Swiss-Prot.
DT 09-JAN-2007, sequence version 1.
DT 25-MAY-2022, entry version 66.
DE RecName: Full=ESX-1 secretion-associated protein EspE {ECO:0000303|PubMed:22233444};
GN Name=espE {ECO:0000303|PubMed:22233444};
GN OrderedLocusNames=MSMEG_0055, MSMEI_0056;
OS Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) (Mycobacterium
OS smegmatis).
OC Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
OC Mycolicibacterium.
OX NCBI_TaxID=246196;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RA Fleischmann R.D., Dodson R.J., Haft D.H., Merkel J.S., Nelson W.C.,
RA Fraser C.M.;
RL Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RX PubMed=17295914; DOI=10.1186/gb-2007-8-2-r20;
RA Deshayes C., Perrodou E., Gallien S., Euphrasie D., Schaeffer C.,
RA Van-Dorsselaer A., Poch O., Lecompte O., Reyrat J.-M.;
RT "Interrupted coding sequences in Mycobacterium smegmatis: authentic
RT mutations or sequencing errors?";
RL Genome Biol. 8:R20.1-R20.9(2007).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700084 / mc(2)155;
RX PubMed=18955433; DOI=10.1101/gr.081901.108;
RA Gallien S., Perrodou E., Carapito C., Deshayes C., Reyrat J.-M.,
RA Van Dorsselaer A., Poch O., Schaeffer C., Lecompte O.;
RT "Ortho-proteogenomics: multiple proteomes investigation through orthology
RT and a new MS-based protocol.";
RL Genome Res. 19:128-135(2009).
RN [4]
RP SUBCELLULAR LOCATION.
RC STRAIN=ATCC 700084 / mc(2)155;
RX PubMed=22233444; DOI=10.1111/j.1365-2958.2011.07958.x;
RA Wirth S.E., Krywy J.A., Aldridge B.B., Fortune S.M., Fernandez-Suarez M.,
RA Gray T.A., Derbyshire K.M.;
RT "Polar assembly and scaffolding proteins of the virulence-associated ESX-1
RT secretory apparatus in mycobacteria.";
RL Mol. Microbiol. 83:654-664(2012).
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000269|PubMed:22233444}.
CC Note=Localizes at or near the cell pole in (on average) 1 discrete spot
CC that colocalizes with SaeC (PubMed:22233444).
CC {ECO:0000269|PubMed:22233444}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AFP36538.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000480; ABK71412.1; -; Genomic_DNA.
DR EMBL; CP001663; AFP36538.1; ALT_INIT; Genomic_DNA.
DR RefSeq; WP_011726636.1; NZ_SIJM01000058.1.
DR RefSeq; YP_884473.1; NC_008596.1.
DR AlphaFoldDB; A0QNI5; -.
DR STRING; 246196.MSMEI_0056; -.
DR EnsemblBacteria; ABK71412; ABK71412; MSMEG_0055.
DR EnsemblBacteria; AFP36538; AFP36538; MSMEI_0056.
DR GeneID; 66738245; -.
DR KEGG; msg:MSMEI_0056; -.
DR KEGG; msm:MSMEG_0055; -.
DR PATRIC; fig|246196.19.peg.53; -.
DR eggNOG; ENOG503200Z; Bacteria.
DR OMA; WTINIVE; -.
DR OrthoDB; 1790169at2; -.
DR Proteomes; UP000000757; Chromosome.
DR Proteomes; UP000006158; Chromosome.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR InterPro; IPR043796; ESX-1_EspA/EspE-like.
DR Pfam; PF18879; EspA_EspE; 1.
PE 4: Predicted;
KW Cytoplasm; Reference proteome.
FT CHAIN 1..525
FT /note="ESX-1 secretion-associated protein EspE"
FT /id="PRO_0000438361"
FT REGION 244..341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 375..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 270..284
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..454
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 525 AA; 53068 MW; 14209F309EF34943 CRC64;
MGVLSDVADF GKDVIDDRDK WADRFKKVGH FIERNAKGDR LERVAKFGRA LGDFSGKFTD
FFESNLGKRM VKAARSPILA AGQHVITGMK LTTGVGDPEN GRRFGEGADH MSAAGRTLDS
AYPTSDWDSA ASDAYLSQNN GQVTRAEALV HADQVVAAVL SREAEQIATT REVLDSEADW
LGDMSLVTMA TGLIPYVGRA AQTAAEIAMV TKAVGASTDQ FMMMRDKADE NAAEVRDAIG
KYEAVAGDAD DTTADDTAGD APESEPTAAE DSSETSKEDG QSRHENPVAA PSGGGGGATS
GGGGGAPSSA SSAGPAGTPQ VPSPPGFGAA NTPTDAQPGA AAASDAAGML GSVMGAMLGP
LGGIVGGVVQ AAGQALQAAT QAGAQAAQLA GQAAAAPDVD RADTDEDTDK DPDAEGDKDS
DKRDGEGKED GTAPRDREST DAMGADDDDR NAHPQAGSGT DSAGEDDKKP AMTLPPDLQA
ASALDTGAGS APVHVGADFE HSQLRTVAAA TLDHGIPGSA AARGA