ID Y593_MYCLE Reviewed; 869 AA.
AC Q49689; O33141;
DT 15-JUL-1998, integrated into UniProtKB/Swiss-Prot.
DT 27-APR-2001, sequence version 2.
DT 25-MAY-2022, entry version 128.
DE RecName: Full=UPF0051 protein ML0593;
DE Contains:
DE RecName: Full=Mle pps1 intein;
GN OrderedLocusNames=ML0593; ORFNames=B1496_C2_189, MLCL536.28c;
OS Mycobacterium leprae (strain TN).
OC Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
OC Mycobacterium.
OX NCBI_TaxID=272631;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RA Smith D.R., Robison K.;
RL Submitted (NOV-1993) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TN;
RX PubMed=11234002; DOI=10.1038/35059006;
RA Cole S.T., Eiglmeier K., Parkhill J., James K.D., Thomson N.R.,
RA Wheeler P.R., Honore N., Garnier T., Churcher C.M., Harris D.E.,
RA Mungall K.L., Basham D., Brown D., Chillingworth T., Connor R.,
RA Davies R.M., Devlin K., Duthoy S., Feltwell T., Fraser A., Hamlin N.,
RA Holroyd S., Hornsby T., Jagels K., Lacroix C., Maclean J., Moule S.,
RA Murphy L.D., Oliver K., Quail M.A., Rajandream M.A., Rutherford K.M.,
RA Rutter S., Seeger K., Simon S., Simmonds M., Skelton J., Squares R.,
RA Squares S., Stevens K., Taylor K., Whitehead S., Woodward J.R.,
RA Barrell B.G.;
RT "Massive gene decay in the leprosy bacillus.";
RL Nature 409:1007-1011(2001).
CC -!- PTM: This protein undergoes protein self-splicing, which involves
CC post-translational excision of the intervening region (intein) followed
CC by peptide ligation. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the UPF0051 (ycf24) family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U00013; AAA17127.1; -; Genomic_DNA.
DR EMBL; Z99125; CAB16171.1; -; Genomic_DNA.
DR EMBL; Z99125; CAB16172.1; -; Genomic_DNA.
DR EMBL; AL583919; CAC30101.1; -; Genomic_DNA.
DR PIR; A86983; A86983.
DR PIR; S72760; S72760.
DR RefSeq; NP_301502.1; NC_002677.1.
DR RefSeq; WP_010907826.1; NC_002677.1.
DR AlphaFoldDB; Q49689; -.
DR SMR; Q49689; -.
DR STRING; 272631.ML0593; -.
DR EnsemblBacteria; CAC30101; CAC30101; CAC30101.
DR KEGG; mle:ML0593; -.
DR PATRIC; fig|272631.5.peg.1031; -.
DR Leproma; ML0593; -.
DR eggNOG; COG0719; Bacteria.
DR eggNOG; COG1372; Bacteria.
DR HOGENOM; CLU_007402_0_0_11; -.
DR OMA; NERNGWK; -.
DR Proteomes; UP000000806; Chromosome.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR GO; GO:0016226; P:iron-sulfur cluster assembly; IEA:InterPro.
DR Gene3D; 3.10.28.10; -; 1.
DR InterPro; IPR003586; Hint_dom_C.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR006142; INTEIN.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR004860; LAGLIDADG_2.
DR InterPro; IPR010231; SUF_FeS_clus_asmbl_SufB.
DR InterPro; IPR000825; SUF_FeS_clus_asmbl_SufBD.
DR InterPro; IPR037284; SUF_FeS_clus_asmbl_SufBD_sf.
DR Pfam; PF14528; LAGLIDADG_3; 1.
DR Pfam; PF01458; SUFBD; 1.
DR PRINTS; PR00379; INTEIN.
DR SMART; SM00305; HintC; 1.
DR SMART; SM00306; HintN; 1.
DR SUPFAM; SSF101960; SSF101960; 2.
DR SUPFAM; SSF51294; SSF51294; 1.
DR SUPFAM; SSF55608; SSF55608; 1.
DR TIGRFAMs; TIGR01443; intein_Cterm; 1.
DR TIGRFAMs; TIGR01980; sufB; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
PE 3: Inferred from homology;
KW Autocatalytic cleavage; Protein splicing; Reference proteome.
FT CHAIN 1..201
FT /note="UPF0051 protein ML0593, 1st part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036187"
FT CHAIN 202..587
FT /note="Mle pps1 intein"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036188"
FT CHAIN 588..869
FT /note="UPF0051 protein ML0593, 2nd part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036189"
FT DOMAIN 344..477
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00273"
FT CONFLICT 482
FT /note="A -> R (in Ref. 1; AAA17127)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 869 AA; 95573 MW; DB04CF70CB50765A CRC64;
MTRTSETTKS PAPELLTQQQ AIDSLGKYGY GWADSDVAGA SARRGLSEDV VRDISAKKDE
PEWMLQARLK ALRVFERKPM PRWGSNLDGI DFDNIKYFVR STEKQAASWD ELPEDIRNTY
DRLGIPDAEK QRLVAGVAAQ YESEVVYHQI RADLKDQGVV FLDTETGLRE YPDIFKQYLG
TVIPAGDNKF SALNTAVWSG GCLTADARIN VKGKGLVSIA DVQPGDEVFG VNIGCELERG
KVLAKVASGT KPVYEMHVAG RALEATGNHQ FLVARRVEEG KRTRWTAVWA PLEEIESGEP
IAVARVLPDD SGTIFFSESE LDIKNRTRQC LYFPCQNSVD LLWLLGLWLG DGHTAAPHKH
MRQVAFSVPA GDPVHHTAIR VVSEQFGANV TVVNCGFIVS SKAFETWLAE LGFSGDEKTK
RLPAWIYSLP HEHQLALIGG LVDADGWTES SGATMSIAFA SRELLEDVRQ LAIGCGLYPD
GALVERTRSA TCRDGRIVTS TSWRLRIQGS LDRVGTRTPG KRGKPVSNKG RRQRYVAAAG
LNFSSLSTDT VGFARLKSKT LVGEKPTYDI QVVGLENFVA NGIVAHNSFI YVPPGVHVDI
PLQAYFRINT ENMGQFERTL IIADTGSYVH YVEGCTAPIY KSDSLHSAVV EIIVKPHARV
RYTTIQNWSN NVYNLVTKRA RVETGATMEW IDGNIGSKVT MKYPAVWMTG EHAKGEVLSV
AFAGEGQHQD TGAKMLHLAS NTSSNIVSKS VARGGGRTSY RGLVQVNKGA HGSRSSVKCD
ALLVDTISRS DTYPYVDIRE DDVTMGHEAT VSKVSENQLF YLMSRGLAED EAMAMVVRGF
VEPIAKELPM EYALELNRLI ELQMEGAVG
//
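The PTM comment and the three FT CHAIN features above describe the splicing event: residues 202..587 (the Mle pps1 intein) are excised and the flanking parts 1..201 and 588..869 are ligated. A minimal sketch of that coordinate arithmetic follows. The function name `splice` and the placeholder `precursor` string are illustrative assumptions, not part of the entry; only the coordinates and the total length of 869 come from the record. To work with real residues, substitute the SQ block with its spaces and line breaks removed.

```python
from typing import Tuple


def splice(sequence: str, intein_start: int, intein_end: int) -> Tuple[str, str]:
    """Excise an intein given 1-based inclusive coordinates.

    Returns (ligated_exteins, intein). This models only the bookkeeping
    of the feature coordinates, not the chemistry of splicing.
    """
    if not 1 <= intein_start <= intein_end <= len(sequence):
        raise ValueError("intein coordinates fall outside the sequence")
    n_extein = sequence[: intein_start - 1]           # residues 1..start-1
    intein = sequence[intein_start - 1 : intein_end]  # residues start..end
    c_extein = sequence[intein_end:]                  # residues end+1..len
    return n_extein + c_extein, intein


# Placeholder precursor with the segment lengths implied by the FT CHAIN
# lines (hypothetical residues: N = 1st part, I = intein, C = 2nd part).
precursor = "N" * 201 + "I" * 386 + "C" * 282

mature, intein = splice(precursor, 202, 587)
print(len(precursor), len(mature), len(intein))  # prints: 869 483 386
```

Under these coordinates the excised intein is 386 residues long and the ligated product is 201 + 282 = 483 residues.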