位置:首页 > 蛋白库 > ENDOR_ECOLX
ENDOR_ECOLX
ID   ENDOR_ECOLX             Reviewed;         865 AA.
AC   P21312; P21313; P21319;
DT   01-MAY-1991, integrated into UniProtKB/Swiss-Prot.
DT   20-FEB-2007, sequence version 2.
DT   03-AUG-2022, entry version 65.
DE   RecName: Full=Probable replication endonuclease from retron Ec67;
DE            EC=3.1.-.-;
DE   AltName: Full=Protein ORF2 in retron Ec67 {ECO:0000303|PubMed:1701261};
OS   Escherichia coli.
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC   Enterobacteriaceae; Escherichia.
OX   NCBI_TaxID=562;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC   STRAIN=O1:NM / CL-1;
RX   PubMed=1701261; DOI=10.1073/pnas.87.23.9454;
RA   Hsu M.-Y., Inouye M., Inouye S.;
RT   "Retron for the 67-base multicopy single-stranded DNA from Escherichia
RT   coli: a potential transposable element encoding both reverse transcriptase
RT   and Dam methylase functions.";
RL   Proc. Natl. Acad. Sci. U.S.A. 87:9454-9458(1990).
CC   -!- FUNCTION: Possible endonuclease which induces a single-strand cut and
CC       initiates DNA replication. {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the phage GPA family. {ECO:0000305}.
CC   -!- CAUTION: Was originally (PubMed:1701261) proposed to code for three
CC       separate adjacent ORFs, ORF2, ORF3 and ORFE. This has been
CC       reconstructed to match proteins in other enterobacteria.
CC       {ECO:0000305|PubMed:1701261}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAA23398.1; Type=Frameshift; Evidence={ECO:0000305};
CC       Sequence=AAA23399.1; Type=Frameshift; Evidence={ECO:0000305};
CC       Sequence=AAA23400.1; Type=Frameshift; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; M55249; AAA23398.1; ALT_FRAME; Genomic_DNA.
DR   EMBL; M55249; AAA23399.1; ALT_FRAME; Genomic_DNA.
DR   EMBL; M55249; AAA23400.1; ALT_FRAME; Genomic_DNA.
DR   PIR; JQ0852; JQ0852.
DR   PIR; JQ0853; JQ0853.
DR   PIR; JQ0860; JQ0860.
DR   AlphaFoldDB; P21312; -.
DR   eggNOG; ENOG502Z7TX; Bacteria.
DR   GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR   GO; GO:0006260; P:DNA replication; IEA:UniProtKB-KW.
DR   InterPro; IPR008766; Replication_gene_A.
DR   Pfam; PF05840; Phage_GPA; 2.
PE   3: Inferred from homology;
KW   DNA replication; Endonuclease; Hydrolase; Nuclease; Transposable element.
FT   CHAIN           1..865
FT                   /note="Probable replication endonuclease from retron Ec67"
FT                   /id="PRO_0000066450"
FT   REGION          749..783
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        751..783
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   ACT_SITE        560
FT                   /note="O-(5'-phospho-DNA)-tyrosine intermediate"
FT                   /evidence="ECO:0000250"
FT   ACT_SITE        564
FT                   /note="O-(5'-phospho-DNA)-tyrosine intermediate"
FT                   /evidence="ECO:0000250"
SQ   SEQUENCE   865 AA;  98857 MW;  4F81E107A8382FFF CRC64;
     MSHADMSDSS GFNEAAAAFS WNGPKKAINP YLDPAEVAPF SALSNLITLY AADNEQEQLR
     REELSEQVWE RYFFNESRDP VQREMEQDKL ISRAKLAHEQ QRFNPDMVIL ADVSAQPTHI
     SKPLMQRIEY FSSLGRPKAY SRYLRETIKP CLERLDCVRD SQLSASFRFM ASHQGLEGLL
     ILPEMSQDQV KRLSTLVAAH MSMCLEAACG DLYATDDVKP EEIRKTWEKV AAETLRLDVI
     PPAFEKLRRK RNRRKPVPYE LIPGSLARML CADWWYRKLW KMRCEWREEQ LRAVCLVSKK
     ASPYVSYEAV THKREQRRKS LEFFRSHELV NEDGDTLDME DVVNASSSNP AHRRNEMMAC
     VKGLELIAEM RGDCAVFYTI TCPSRFHSTL NNGRPNPTWT NATVRQSSDY LVGMFAAFRK
     AMQPKPGCAG MACGWLSRTM TVLCTGISCV SCAKKTAAPL LHCCVSLPSV KTVRNWAITR
     GHALSLSGLR WYGVRVAEPH HDGTVHWHLM CFMRKKDRRA ITALLRKFAI REDREELGNN
     TGPRFKSELI NPRKGTPTSY IAKYISKNID GRGLAGEISK ETGKSLRDNA EYVNAWASLH
     RVQQFRFFGI PGRQAYRELR LLAGQAARQQ GDKKASTPIL DDPRLDAILA AADAGCFATY
     IMKQGVLVPR KYHLIRTAYE INEEPTAYGD HGIRIYGIWS PIVQGKICTH AMKWKMVRKA
     VDVQEAAADQ GACAPWTRGN NCPLAENLNQ QEKDKSADGD TRTDITRMDD KELHDHLHSM
     SKKERRELAA RLRLVKPIRR KDYKQRITDH QRQQLVYELK SRGFDGSEKE VDLLLRGGSI
     PSGAGLRIFY RNQRLQEDDK WRNLY
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024