ID RHSA_ECOLI Reviewed; 1377 AA.
AC P16916; Q2M7Q6;
DT 01-AUG-1990, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1990, sequence version 1.
DT 03-AUG-2022, entry version 154.
DE RecName: Full=Protein RhsA;
GN Name=rhsA; OrderedLocusNames=b3593, JW3566;
OS Escherichia coli (strain K12).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=83333;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=K12;
RX PubMed=2403547; DOI=10.1128/jb.172.1.446-456.1990;
RA Feulner G., Gray J.A., Kirschman J.A., Lehner A.F., Sadosky A.B.,
RA Vlazny D.A., Zhang J., Zhao S., Hill C.W.;
RT "Structure of the rhsA locus from Escherichia coli K-12 and comparison of
RT rhsA with other members of the rhs multigene family.";
RL J. Bacteriol. 172:446-456(1990).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=8041620; DOI=10.1093/nar/22.13.2576;
RA Sofia H.J., Burland V., Daniels D.L., Plunkett G. III, Blattner F.R.;
RT "Analysis of the Escherichia coli genome. V. DNA sequence of the region
RT from 76.0 to 81.5 minutes.";
RL Nucleic Acids Res. 22:2576-2586(1994).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA Shao Y.;
RT "The complete genome sequence of Escherichia coli K-12.";
RL Science 277:1453-1462(1997).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX PubMed=16738553; DOI=10.1038/msb4100049;
RA Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT and W3110.";
RL Mol. Syst. Biol. 2:E1-E5(2006).
RN [5]
RP REVIEW.
RX PubMed=7934896; DOI=10.1111/j.1365-2958.1994.tb01074.x;
RA Hill C.W., Sandt C.H., Vlazny D.A.;
RT "Rhs elements of Escherichia coli: a family of genetic composites each
RT encoding a large mosaic protein.";
RL Mol. Microbiol. 12:865-871(1994).
CC -!- FUNCTION: Rhs elements have a nonessential function. They may play an
CC important role in the natural ecology of the cell.
CC -!- DOMAIN: Each rhs appears to consist of a highly conserved 141 kDa
CC amino-terminal fragment followed by a highly divergent C-terminus.
CC -!- SIMILARITY: Belongs to the RHS family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L19044; AAC95065.1; -; Genomic_DNA.
DR EMBL; U00039; AAB18570.1; -; Genomic_DNA.
DR EMBL; U00096; AAC76617.1; -; Genomic_DNA.
DR EMBL; AP009048; BAE77700.1; -; Genomic_DNA.
DR PIR; C65159; C65159.
DR RefSeq; NP_418050.1; NC_000913.3.
DR RefSeq; WP_000015300.1; NZ_LN832404.1.
DR AlphaFoldDB; P16916; -.
DR SMR; P16916; -.
DR BioGRID; 4262562; 134.
DR DIP; DIP-10699N; -.
DR IntAct; P16916; 27.
DR STRING; 511145.b3593; -.
DR PaxDb; P16916; -.
DR PRIDE; P16916; -.
DR EnsemblBacteria; AAC76617; AAC76617; b3593.
DR EnsemblBacteria; BAE77700; BAE77700; BAE77700.
DR GeneID; 948120; -.
DR KEGG; ecj:JW3566; -.
DR KEGG; eco:b3593; -.
DR PATRIC; fig|511145.12.peg.3710; -.
DR EchoBASE; EB0839; -.
DR eggNOG; COG3209; Bacteria.
DR HOGENOM; CLU_001218_0_1_6; -.
DR InParanoid; P16916; -.
DR OMA; CEDIANS; -.
DR PhylomeDB; P16916; -.
DR BioCyc; EcoCyc:EG10846-MON; -.
DR PRO; PR:P16916; -.
DR Proteomes; UP000000318; Chromosome.
DR Proteomes; UP000000625; Chromosome.
DR InterPro; IPR045351; DUF6531.
DR InterPro; IPR028947; Ntox34.
DR InterPro; IPR001826; RHS.
DR InterPro; IPR022385; Rhs_assc_core.
DR InterPro; IPR031325; RHS_repeat.
DR InterPro; IPR006530; YD.
DR Pfam; PF20148; DUF6531; 1.
DR Pfam; PF15606; Ntox34; 1.
DR Pfam; PF03527; RHS; 1.
DR Pfam; PF05593; RHS_repeat; 7.
DR PRINTS; PR00394; RHSPROTEIN.
DR TIGRFAMs; TIGR03696; Rhs_assc_core; 1.
DR TIGRFAMs; TIGR01643; YD_repeat_2x; 5.
PE 3: Inferred from homology;
KW Reference proteome; Repeat.
FT CHAIN 1..1377
FT /note="Protein RhsA"
FT /id="PRO_0000022225"
FT REPEAT 330..352
FT /note="1"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 353..374
FT /note="2"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 375..417
FT /note="3"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 418..438
FT /note="4"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 439..460
FT /note="5"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 461..481
FT /note="6"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 482..502
FT /note="7"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 503..525
FT /note="8"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 526..546
FT /note="9"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 547..567
FT /note="10"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 568..588
FT /note="11"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 589..609
FT /note="12"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 610..629
FT /note="13"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 630..650
FT /note="14"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 651..671
FT /note="15"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 672..691
FT /note="16"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 692..711
FT /note="17"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 712..734
FT /note="18"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 735..758
FT /note="19"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 808..828
FT /note="20"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 829..850
FT /note="21"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 851..871
FT /note="22"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 872..894
FT /note="23"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 895..930
FT /note="24"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 931..959
FT /note="25"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 960..984
FT /note="26"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 985..1019
FT /note="27"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REPEAT 1162..1186
FT /note="28"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REGION 330..1186
FT /note="28 X approximate tandem repeats"
FT /evidence="ECO:0000305|PubMed:2403547"
FT REGION 1356..1377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1377 AA; 156321 MW; 21ACA989E74200FE CRC64;
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG
AWRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV
TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HNQLTSATGP
DGLELRREYD ELGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL
SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG
DLTAVIAPDG SRNGTQYDAW GKAVRTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV
LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV KGETAERWQY
DERGWLTDIS HISEGHRVAV HYRYDEKGRL TGERQTVHHP QTEALLWQHE TRHAYNAQGL
ANRCIPDSLP AVEWLTYGSG YLAGMKLGDT PLVEYTRDRL HRETLRSFGR YELTTAYTPA
GQLQSQHLNS LLSDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP
YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD
DERTHRYHYD SQHRLVHYTR TQYEEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR
KPQVTWYGWD GDRLTTIQND RTRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ
SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA
RKIHLYHCDH RGLPLALISK EGTTEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES
GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPVT NTDPLGLEVF PRPFPLPIPW
PKSPAQQQAD DNAAKALTKW WNDTASQRIF DSLILNNPGL ALDITMIASR GNVADTGITD
RVNDIINDRF WSDGKKPDRC DVLQELIDCG DISAKDAKST QKAWNCRHSR QSNDKKR