YHGF_ECOLI
ID   YHGF_ECOLI              Reviewed;         773 AA.
AC   P46837; P76689; Q2M772;
DT   01-NOV-1995, integrated into UniProtKB/Swiss-Prot.
DT   15-JUL-1999, sequence version 3.
DT   03-AUG-2022, entry version 150.
DE   RecName: Full=Protein YhgF;
GN   Name=yhgF; OrderedLocusNames=b3407, JW3370;
OS   Escherichia coli (strain K12).
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC   Enterobacteriaceae; Escherichia.
OX   NCBI_TaxID=83333;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=K12 / MG1655 / ATCC 47076;
RX   PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA   Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA   Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA   Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA   Shao Y.;
RT   "The complete genome sequence of Escherichia coli K-12.";
RL   Science 277:1453-1462(1997).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX   PubMed=16738553; DOI=10.1038/msb4100049;
RA   Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA   Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT   "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT   and W3110.";
RL   Mol. Syst. Biol. 2:E1-E5(2006).
RN   [3]
RP   IDENTIFICATION BY MASS SPECTROMETRY.
RC   STRAIN=B / BL21;
RX   PubMed=10493123;
RX   DOI=10.1002/(sici)1522-2683(19990801)20:11<2181::aid-elps2181>3.0.co;2-q;
RA   Fountoulakis M., Takacs M.-F., Berndt P., Langen H., Takacs B.;
RT   "Enrichment of low abundance proteins of Escherichia coli by hydroxyapatite
RT   chromatography.";
RL   Electrophoresis 20:2181-2195(1999).
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAA58204.1; Type=Miscellaneous discrepancy; Note=Wrong choice of frame.; Evidence={ECO:0000305};
CC       Sequence=AAA58205.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC       Sequence=AAA58205.1; Type=Frameshift; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; U18997; AAA58204.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; U18997; AAA58205.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; U00096; AAC76432.2; -; Genomic_DNA.
DR   EMBL; AP009048; BAE77884.1; -; Genomic_DNA.
DR   PIR; B65136; B65136.
DR   RefSeq; NP_417866.4; NC_000913.3.
DR   RefSeq; WP_000980727.1; NZ_LN832404.1.
DR   AlphaFoldDB; P46837; -.
DR   SMR; P46837; -.
DR   BioGRID; 4261725; 54.
DR   BioGRID; 852220; 1.
DR   DIP; DIP-12337N; -.
DR   IntAct; P46837; 10.
DR   STRING; 511145.b3407; -.
DR   jPOST; P46837; -.
DR   PaxDb; P46837; -.
DR   PRIDE; P46837; -.
DR   EnsemblBacteria; AAC76432; AAC76432; b3407.
DR   EnsemblBacteria; BAE77884; BAE77884; BAE77884.
DR   GeneID; 947911; -.
DR   KEGG; ecj:JW3370; -.
DR   KEGG; eco:b3407; -.
DR   PATRIC; fig|1411691.4.peg.3322; -.
DR   EchoBASE; EB2768; -.
DR   eggNOG; COG2183; Bacteria.
DR   HOGENOM; CLU_009833_0_2_6; -.
DR   InParanoid; P46837; -.
DR   OMA; RWAWRTR; -.
DR   PhylomeDB; P46837; -.
DR   BioCyc; EcoCyc:G7746-MON; -.
DR   PRO; PR:P46837; -.
DR   Proteomes; UP000000318; Chromosome.
DR   Proteomes; UP000000625; Chromosome.
DR   GO; GO:0005829; C:cytosol; IDA:EcoCyc.
DR   GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR   GO; GO:0003735; F:structural constituent of ribosome; IBA:GO_Central.
DR   GO; GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro.
DR   GO; GO:0010212; P:response to ionizing radiation; IMP:EcoCyc.
DR   GO; GO:0006412; P:translation; IBA:GO_Central.
DR   CDD; cd05685; S1_Tex; 1.
DR   Gene3D; 1.10.10.650; -; 1.
DR   Gene3D; 1.10.3500.10; -; 1.
DR   Gene3D; 2.40.50.140; -; 1.
DR   Gene3D; 3.30.420.140; -; 1.
DR   InterPro; IPR041692; HHH_9.
DR   InterPro; IPR012340; NA-bd_OB-fold.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR010994; RuvA_2-like.
DR   InterPro; IPR022967; S1_dom.
DR   InterPro; IPR003029; S1_domain.
DR   InterPro; IPR044146; S1_Tex.
DR   InterPro; IPR023323; Tex-like_dom_sf.
DR   InterPro; IPR023319; Tex-like_HTH_dom_sf.
DR   InterPro; IPR018974; Tex-like_N.
DR   InterPro; IPR032639; Tex_YqgF.
DR   InterPro; IPR006641; YqgF/RNaseH-like_dom.
DR   InterPro; IPR037027; YqgF/RNaseH-like_dom_sf.
DR   Pfam; PF17674; HHH_9; 1.
DR   Pfam; PF00575; S1; 1.
DR   Pfam; PF09371; Tex_N; 1.
DR   Pfam; PF16921; Tex_YqgF; 1.
DR   SMART; SM00316; S1; 1.
DR   SMART; SM00732; YqgFc; 1.
DR   SUPFAM; SSF47781; SSF47781; 2.
DR   SUPFAM; SSF50249; SSF50249; 1.
DR   SUPFAM; SSF53098; SSF53098; 1.
DR   PROSITE; PS50126; S1; 1.
PE   1: Evidence at protein level;
KW   Reference proteome; RNA-binding.
FT   CHAIN           1..773
FT                   /note="Protein YhgF"
FT                   /id="PRO_0000215104"
FT   DOMAIN          651..720
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00180"
FT   REGION          721..773
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CONFLICT        754..755
FT                   /note="QP -> HA (in Ref. 1; AAA58205)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   773 AA;  85120 MW;  EA54D9ED952A8229 CRC64;
     MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL
     SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
     AGLEPLADLL WSDPSHTPEV AAAQYVYADK GVADTKAALD GARYILMERF AEDAALLAKV
     RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN
     ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
     ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
     TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
     SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
     RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS
     RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
     KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
     VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
     LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
//
 
 