ID ARPA_ECOLI Reviewed; 728 AA.
AC P23325; P76781; Q2M6T7;
DT 01-NOV-1991, integrated into UniProtKB/Swiss-Prot.
DT 19-JUL-2003, sequence version 3.
DT 03-AUG-2022, entry version 144.
DE RecName: Full=Ankyrin repeat protein A;
DE AltName: Full=Ankyrin-like regulatory protein;
GN Name=arpA; Synonyms=arp, yjaC; OrderedLocusNames=b4017, JW3977;
OS Escherichia coli (strain K12).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=83333;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=8265357; DOI=10.1093/nar/21.23.5408;
RA Blattner F.R., Burland V.D., Plunkett G. III, Sofia H.J., Daniels D.L.;
RT "Analysis of the Escherichia coli genome. IV. DNA sequence of the region
RT from 89.2 to 92.8 minutes.";
RL Nucleic Acids Res. 21:5408-5417(1993).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND SEQUENCE REVISION TO
RP 282.
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA Shao Y.;
RT "The complete genome sequence of Escherichia coli K-12.";
RL Science 277:1453-1462(1997).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX PubMed=16738553; DOI=10.1038/msb4100049;
RA Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT and W3110.";
RL Mol. Syst. Biol. 2:E1-E5(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 108-728.
RX PubMed=1995429; DOI=10.1016/0378-1119(91)90024-6;
RA Galinier A., Bleicher F., Negre D., Perriere G., Duclos B., Cozzone A.J.,
RA Cortay J.-C.;
RT "Primary structure of the intergenic region between aceK and iclR in the
RT Escherichia coli chromosome.";
RL Gene 97:149-150(1991).
RN [5]
RP IDENTIFICATION OF ANKYRIN REPEATS.
RX PubMed=8014990; DOI=10.1006/jmbi.1994.1407;
RA Neuwald A.F., Green P.;
RT "Detecting patterns in protein sequences.";
RL J. Mol. Biol. 239:698-712(1994).
CC -!- SIMILARITY: Belongs to the Toxin_15 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U00006; AAC43111.1; -; Genomic_DNA.
DR EMBL; U00096; AAC76987.1; -; Genomic_DNA.
DR EMBL; AP009048; BAE78019.1; -; Genomic_DNA.
DR EMBL; M63497; AAA73004.1; -; Genomic_DNA.
DR PIR; H65208; H65208.
DR RefSeq; NP_418441.1; NC_000913.3.
DR RefSeq; WP_000632913.1; NZ_LN832404.1.
DR AlphaFoldDB; P23325; -.
DR SMR; P23325; -.
DR BioGRID; 4262004; 4.
DR DIP; DIP-9159N; -.
DR IntAct; P23325; 10.
DR STRING; 511145.b4017; -.
DR PaxDb; P23325; -.
DR PRIDE; P23325; -.
DR EnsemblBacteria; AAC76987; AAC76987; b4017.
DR EnsemblBacteria; BAE78019; BAE78019; BAE78019.
DR GeneID; 944933; -.
DR KEGG; ecj:JW3977; -.
DR KEGG; eco:b4017; -.
DR PATRIC; fig|1411691.4.peg.2696; -.
DR EchoBASE; EB1193; -.
DR eggNOG; ENOG5032YAJ; Bacteria.
DR HOGENOM; CLU_022927_0_0_6; -.
DR InParanoid; P23325; -.
DR OMA; KWSNDHI; -.
DR BioCyc; EcoCyc:EG11208-MON; -.
DR PRO; PR:P23325; -.
DR Proteomes; UP000000318; Chromosome.
DR Proteomes; UP000000625; Chromosome.
DR Gene3D; 1.25.40.20; -; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR012927; Toxin_15_N.
DR Pfam; PF07906; Toxin_15; 1.
DR SMART; SM00248; ANK; 6.
DR SUPFAM; SSF48403; SSF48403; 1.
PE 3: Inferred from homology;
KW ANK repeat; Reference proteome; Repeat.
FT CHAIN 1..728
FT /note="Ankyrin repeat protein A"
FT /id="PRO_0000067228"
FT REPEAT 381..410
FT /note="ANK 1"
FT /evidence="ECO:0000269|PubMed:8014990"
FT REPEAT 429..458
FT /note="ANK 2"
FT /evidence="ECO:0000269|PubMed:8014990"
FT REPEAT 477..506
FT /note="ANK 3"
FT /evidence="ECO:0000269|PubMed:8014990"
FT REPEAT 525..554
FT /note="ANK 4"
FT /evidence="ECO:0000269|PubMed:8014990"
FT REPEAT 573..602
FT /note="ANK 5"
FT /evidence="ECO:0000269|PubMed:8014990"
FT CONFLICT 124
FT /note="N -> D (in Ref. 4; AAA73004)"
FT /evidence="ECO:0000305"
FT CONFLICT 282
FT /note="S -> T (in Ref. 4; AAA73004)"
FT /evidence="ECO:0000305"
FT CONFLICT 701..728
FT /note="GFTDNPRYIAEKNYMEALLKKASPHTVR -> TQKSISPYRTLNLCLRRYA
FT (in Ref. 4; AAA73004)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 728 AA; 82598 MW; 19EE240C4561B3F0 CRC64;
MITRIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRQNGR
HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA PIFEKYLMEI LPQRSDTLDK
QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS
SAEKFLTHFN KKCKNQTLAL VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL
VDNHALTVRL RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKENIKHYS LMDFMNVDYS
LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI GQNHQLTQLM
IQLQKMPELH RTEMLTAYNS INLPGLYLAI NYGNADIVET IFNSLSETGY EGLLSKKNLM
HILEAKDKNG FSGLFLAISR KDKNVVTSIL NALPKLAATH HLDNEQVYKF LSAKNRTSSH
VLYHVMANGD ADMLKIVLNA LPLLIRTCHL TKEQVLDLLK AKDFYGCPGL YLAMQNGHSD
IVKVILEALP SLAQEINISA SDIVDLLTAK SLARDTGLFM AMQRGHMNVI NTIFNALPTL
FNTFKFDKKN MKPLLLANNS NEYPGLFSAI QHKQQNVVET VYLALSDHAR LFGFTAEDIM
DFWQHKAPQK YSAFELAFEF GHRVIAELIL NTLNKMAESF GFTDNPRYIA EKNYMEALLK
KASPHTVR