YEHQ_ECOLI
ID YEHQ_ECOLI Reviewed; 614 AA.
AC P33353; A0A385XJJ8; Q2MAV8;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 15-JUL-1998, sequence version 2.
DT 03-AUG-2022, entry version 123.
DE RecName: Full=Protein YehQ;
GN Name=yehQ; OrderedLocusNames=b2122, JW2110;
OS Escherichia coli (strain K12).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=83333;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / BHB2600;
RA Richterich P., Lakey N., Gryan G., Jaehn L., Mintz L., Robison K.,
RA Church G.M.;
RT "Automated multiplex sequencing of the E.coli genome.";
RL Submitted (OCT-1993) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA Shao Y.;
RT "The complete genome sequence of Escherichia coli K-12.";
RL Science 277:1453-1462(1997).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX PubMed=16738553; DOI=10.1038/msb4100049;
RA Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT and W3110.";
RL Mol. Syst. Biol. 2:E1-E5(2006).
CC -!- MISCELLANEOUS: Missing up to 60 C-terminal residues compared to
CC orthologs. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA60485.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U00007; AAA60485.1; ALT_INIT; Genomic_DNA.
DR EMBL; U00096; AYC08228.1; -; Genomic_DNA.
DR EMBL; AP009048; BAE76598.1; -; Genomic_DNA.
DR PIR; A64980; A64980.
DR AlphaFoldDB; P33353; -.
DR BioGRID; 4260443; 14.
DR DIP; DIP-11905N; -.
DR IntAct; P33353; 6.
DR STRING; 316407.85675235; -.
DR PRIDE; P33353; -.
DR EnsemblBacteria; AYC08228; AYC08228; b2122.
DR EnsemblBacteria; BAE76598; BAE76598; BAE76598.
DR KEGG; ecj:JW2110; -.
DR PATRIC; fig|83333.103.peg.2987; -.
DR EchoBASE; EB1941; -.
DR eggNOG; COG4715; Bacteria.
DR HOGENOM; CLU_020792_0_1_6; -.
DR InParanoid; P33353; -.
DR OMA; EEGQTCR; -.
DR BioCyc; EcoCyc:EG12003-MON; -.
DR PRO; PR:P33353; -.
DR Proteomes; UP000000318; Chromosome.
DR Proteomes; UP000000625; Chromosome.
DR GO; GO:0005829; C:cytosol; IDA:EcoCyc.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR InterPro; IPR007527; Znf_SWIM.
DR Pfam; PF04434; SWIM; 2.
DR PROSITE; PS50966; ZF_SWIM; 2.
PE 4: Predicted;
KW Metal-binding; Reference proteome; Repeat; Zinc; Zinc-finger.
FT CHAIN 1..614
FT /note="Protein YehQ"
FT /id="PRO_0000169136"
FT ZN_FING 55..89
FT /note="SWIM-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00325"
FT ZN_FING 151..185
FT /note="SWIM-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00325"
SQ SEQUENCE 614 AA; 67730 MW; 8056294BAE3CAE6E CRC64;
MNSLRPELLE LTPQALTALS NAGFVKRSLK ELENGNVPEI SHENDALIAT FSDGVRTQLA
NGQALKEAQC SCGANGMCRH RVMLVLSYQR LCATTQSTEK EEEWDPAIWL EELATLPDAT
RKRAQALVAK GITIELFCAP GEIPSARLPM SDVRFYSRSS IRFARCDCIE GTLCEHVVLA
VQAFVEAKAQ QAEFNHLIWQ MRSEHVTSSD DPFASEEGNA CRQYVQQLSQ TLWLGGISQP
LIHYEAAFNR ALQAAETCNW RWVSESLRQL RASVDAFHAR ASHYNAGECL HQLAALNSRL
NCAQEMARRD SIGEVPPVPW RTVVGSGIAG EAKLDHLRLV SLGMRCWQDI EHYGLRIWFT
DPDTGSILHL SRSWPRSEQE NSPAATRRLF SFQAGALAGG QIVSQAAKRS ADGELLLATR
NRLSSVVPLS PDAWQMLSAP LRQPGIVALR EYLRQRPPAC IRPLNQVDNL FILPVAECIS
LGWDSSRQTL DAQVISGEGE DNVLTLSLPA SASAPYAVER MAALLQQTDD PVCLVSGFVS
FVEGQLTLEP RVMMTKTRAW ALDAETTPVA PLPSASVLPV PSTAHQLLIR CQALLIQLLH
NGWRYQEQSA IGQA