Y1461_MYCTU
ID Y1461_MYCTU Reviewed; 846 AA.
AC P9WFP7; L0T8B9; O53152; P67125;
DT 16-APR-2014, integrated into UniProtKB/Swiss-Prot.
DT 16-APR-2014, sequence version 1.
DT 25-MAY-2022, entry version 43.
DE RecName: Full=UPF0051 protein Rv1461;
DE Contains:
DE RecName: Full=Endonuclease PI-MtuHIIP;
DE EC=3.1.-.-;
DE AltName: Full=Mtu pps1 intein;
GN OrderedLocusNames=Rv1461; ORFNames=MTV007.08;
OS Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv).
OC Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
OC Mycobacterium; Mycobacterium tuberculosis complex.
OX NCBI_TaxID=83332;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 25618 / H37Rv;
RX PubMed=9634230; DOI=10.1038/31159;
RA Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.M., Harris D.E.,
RA Gordon S.V., Eiglmeier K., Gas S., Barry C.E. III, Tekaia F., Badcock K.,
RA Basham D., Brown D., Chillingworth T., Connor R., Davies R.M., Devlin K.,
RA Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K.,
RA Krogh A., McLean J., Moule S., Murphy L.D., Oliver S., Osborne J.,
RA Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton S.,
RA Squares S., Squares R., Sulston J.E., Taylor K., Whitehead S.,
RA Barrell B.G.;
RT "Deciphering the biology of Mycobacterium tuberculosis from the complete
RT genome sequence.";
RL Nature 393:537-544(1998).
RN [2]
RP IDENTIFICATION AS A DRUG TARGET [LARGE SCALE ANALYSIS].
RX PubMed=19099550; DOI=10.1186/1752-0509-2-109;
RA Raman K., Yeturu K., Chandra N.;
RT "targetTB: a target identification pipeline for Mycobacterium tuberculosis
RT through an interactome, reactome and genome-scale structural analysis.";
RL BMC Syst. Biol. 2:109-109(2008).
RN [3]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC STRAIN=ATCC 25618 / H37Rv;
RX PubMed=21969609; DOI=10.1074/mcp.m111.011627;
RA Kelkar D.S., Kumar D., Kumar P., Balakrishnan L., Muthusamy B., Yadav A.K.,
RA Shrivastava P., Marimuthu A., Anand S., Sundaram H., Kingsbury R.,
RA Harsha H.C., Nair B., Prasad T.S., Chauhan D.S., Katoch K., Katoch V.M.,
RA Kumar P., Chaerkady R., Ramachandran S., Dash D., Pandey A.;
RT "Proteogenomic analysis of Mycobacterium tuberculosis by high resolution
RT mass spectrometry.";
RL Mol. Cell. Proteomics 10:M111.011627-M111.011627(2011).
CC -!- PTM: This protein undergoes a protein self splicing that involves a
CC post-translational excision of the intervening region (intein) followed
CC by peptide ligation. {ECO:0000305}.
CC -!- MISCELLANEOUS: Was identified as a high-confidence drug target.
CC -!- SIMILARITY: Belongs to the UPF0051 (ycf24) family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL123456; CCP44220.1; -; Genomic_DNA.
DR PIR; H70871; H70871.
DR RefSeq; NP_215977.1; NC_000962.3.
DR RefSeq; WP_003407484.1; NZ_NVQJ01000004.1.
DR AlphaFoldDB; P9WFP7; -.
DR SMR; P9WFP7; -.
DR STRING; 83332.Rv1461; -.
DR PaxDb; P9WFP7; -.
DR GeneID; 886609; -.
DR KEGG; mtu:Rv1461; -.
DR TubercuList; Rv1461; -.
DR eggNOG; COG0719; Bacteria.
DR eggNOG; COG1372; Bacteria.
DR OMA; NERNGWK; -.
DR Proteomes; UP000001584; Chromosome.
DR GO; GO:0005829; C:cytosol; HDA:MTBBASE.
DR GO; GO:0004519; F:endonuclease activity; IDA:MTBBASE.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR GO; GO:0006314; P:intron homing; IEA:UniProtKB-KW.
DR GO; GO:0016226; P:iron-sulfur cluster assembly; IEA:InterPro.
DR Gene3D; 3.10.28.10; -; 1.
DR InterPro; IPR003586; Hint_dom_C.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR006142; INTEIN.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR010231; SUF_FeS_clus_asmbl_SufB.
DR InterPro; IPR000825; SUF_FeS_clus_asmbl_SufBD.
DR InterPro; IPR037284; SUF_FeS_clus_asmbl_SufBD_sf.
DR InterPro; IPR045595; SufBD_N.
DR Pfam; PF01458; SUFBD; 1.
DR Pfam; PF19295; SufBD_N; 1.
DR PRINTS; PR00379; INTEIN.
DR SMART; SM00305; HintC; 1.
DR SMART; SM00306; HintN; 1.
DR SUPFAM; SSF101960; SSF101960; 2.
DR SUPFAM; SSF51294; SSF51294; 1.
DR TIGRFAMs; TIGR01443; intein_Cterm; 1.
DR TIGRFAMs; TIGR01980; sufB; 1.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
PE 1: Evidence at protein level;
KW Autocatalytic cleavage; Endonuclease; Hydrolase; Intron homing; Nuclease;
KW Protein splicing; Reference proteome.
FT CHAIN 1..252
FT /note="UPF0051 protein Rv1461, 1st part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036190"
FT CHAIN 253..611
FT /note="Endonuclease PI-MtuHIIP"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036191"
FT CHAIN 612..846
FT /note="UPF0051 protein Rv1461, 2nd part"
FT /evidence="ECO:0000255"
FT /id="PRO_0000036192"
FT DOMAIN 388..528
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00273"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 846 AA; 94171 MW; 468CEEF979B02222 CRC64;
MTLTPEASKS VAQPPTQAPL TQEEAIASLG RYGYGWADSD VAGANAQRGL SEAVVRDISA
KKNEPDWMLQ SRLKALRIFD RKPIPKWGSN LDGIDFDNIK YFVRSTEKQA ASWDDLPEDI
RNTYDRLGIP EAEKQRLVAG VAAQYESEVV YHQIREDLEA QGVIFLDTDT GLREHPDIFK
EYFGTVIPAG DNKFSALNTA VWSGGSFIYV PPGVHVDIPL QAYFRINTEN MGQFERTLII
ADEGSYVHYV EGCLPAGELI TTADGDLRPI ESIRVGDFVT GHDGRPHRVT AVQVRDLDGE
LFTFTPMSPA NAFSVTAEHP LLAIPRDEVR VMRKERNGWK AEVNSTKLRS AEPRWIAAKD
VAEGDFLIYP KPKPIPHRTV LPLEFARLAG YYLAEGHACL TNGCESLIFS FHSDEFEYVE
DVRQACKSLY EKSGSVLIEE HKHSARVTVY TKAGYAAMRD NVGIGSSNKK LSDLLMRQDE
TFLRELVDAY VNGDGNVTRR NGAVWKRVHT TSRLWAFQLQ SILARLGHYA TVELRRPGGP
GVIMGRNVVR KDIYQVQWTE GGRGPKQARD CGDYFAVPIK KRAVREAHEP VYNLDVENPD
SYLAYGFAVH NCTAPIYKSD SLHSAVVEII VKPHARVRYT TIQNWSNNVY NLVTKRARAE
AGATMEWIDG NIGSKVTMKY PAVWMTGEHA KGEVLSVAFA GEDQHQDTGA KMLHLAPNTS
SNIVSKSVAR GGGRTSYRGL VQVNKGAHGS RSSVKCDALL VDTVSRSDTY PYVDIREDDV
TMGHEATVSK VSENQLFYLM SRGLTEDEAM AMVVRGFVEP IAKELPMEYA LELNRLIELQ
MEGAVG