YEHI_ECOLI
ID YEHI_ECOLI Reviewed; 1210 AA.
AC P33346; P76430; Q2MAW3;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1997, sequence version 2.
DT 03-AUG-2022, entry version 123.
DE RecName: Full=Uncharacterized protein YehI;
GN Name=yehI; OrderedLocusNames=b2118, JW2105;
OS Escherichia coli (strain K12).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=83333;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / BHB2600;
RA Richterich P., Lakey N., Gryan G., Jaehn L., Mintz L., Robison K.,
RA Church G.M.;
RT "Automated multiplex sequencing of the E.coli genome.";
RL Submitted (OCT-1993) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA Shao Y.;
RT "The complete genome sequence of Escherichia coli K-12.";
RL Science 277:1453-1462(1997).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX PubMed=16738553; DOI=10.1038/msb4100049;
RA Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT and W3110.";
RL Mol. Syst. Biol. 2:E1-E5(2006).
CC -!- SIMILARITY: To E.coli molybdate metabolism regulator (MolR).
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA60478.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U00007; AAA60478.1; ALT_FRAME; Genomic_DNA.
DR EMBL; U00096; AAC75179.1; -; Genomic_DNA.
DR EMBL; AP009048; BAE76593.1; -; Genomic_DNA.
DR PIR; E64979; E64979.
DR RefSeq; NP_416621.1; NC_000913.3.
DR RefSeq; WP_000356817.1; NZ_LN832404.1.
DR AlphaFoldDB; P33346; -.
DR BioGRID; 4259176; 37.
DR BioGRID; 850993; 2.
DR IntAct; P33346; 8.
DR STRING; 511145.b2118; -.
DR PaxDb; P33346; -.
DR PRIDE; P33346; -.
DR EnsemblBacteria; AAC75179; AAC75179; b2118.
DR EnsemblBacteria; BAE76593; BAE76593; BAE76593.
DR GeneID; 946649; -.
DR KEGG; ecj:JW2105; -.
DR KEGG; eco:b2118; -.
DR PATRIC; fig|511145.12.peg.2195; -.
DR EchoBASE; EB1936; -.
DR eggNOG; COG3831; Bacteria.
DR HOGENOM; CLU_006807_1_0_6; -.
DR OMA; ESSWQRC; -.
DR BioCyc; EcoCyc:EG11995-MON; -.
DR PRO; PR:P33346; -.
DR Proteomes; UP000000318; Chromosome.
DR Proteomes; UP000000625; Chromosome.
DR InterPro; IPR025406; DUF4132.
DR Pfam; PF13569; DUF4132; 1.
PE 4: Predicted;
KW Reference proteome.
FT CHAIN 1..1210
FT /note="Uncharacterized protein YehI"
FT /id="PRO_0000169131"
SQ SEQUENCE 1210 AA; 138068 MW; 0C2D3412D3CD6574 CRC64;
MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQKAEKKKNI
VWRYENFSLE WQSAITQAIN LIGEHKPSIP ARTMAALACI AQNDSQQLLD EIVQQEGLEY
ATEVVIARQF IARCYESDPL VVTLQYQDED YGYGYRSETY NEFDLRLRKH LSLAEESCWQ
RCADKLIAAL PGINKVRRPF IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA
VRKLEHYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL
VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP PARPGYPIIE
DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA AEVMPWLSTQ PQSVLKSYLS APPKPVIDGT
DNSNLPEILV SPPWRSKKKM TAPRLDLAPL ELTPQVYWQP GEQERLAATE PARYFSTESL
AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LFVQWHYYQG RVEEILNGWN
SPEAQLAEQA LRSGHIEALI NIWENDNYSH YRPEKSVWNL YLLAQLPREM ALTFWLRINE
KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV AHVWRRFAAQ
RDLARQWILQ WPEHTASALI PLVFTKPSDN SEAALLALRL LYEQGHGELL QTVANRWQRT
DVWSALEQLL KQGPMDIYPA RIPKAPDFWH PQMWSRPRLI TNNQTVTNDA LEIIGEMLRF
TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWAFLALS LFGDESTARD
LTTQILAWPQ EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSRPLRD NAAEFLQVVA
ENRGLSQEEL ADRLVPTLGL DDPQALSFDF GPRQFTVRFD ENLNPVIFDQ QNVRQKSVPR
LRADDDQLKA PEALARLKGL KKDATQVSKN LLPRLETALR TTRRWSLADF HSLFVNHPFT
RLVTQRLIWG VYPANEPRCL LKAFRVAAEG EFCNAQDEPI DLPADALIGI AHPLEMTAEM
RSEFAQLFAD YEIMPPFRQL SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES
GYEDAFVYNL GEYRLVLKFS PGFNHYNVDS KALMSFRSLR VYRDNKSVTF AELDVFDLSE
ALSAPDVIFH