SHU1_ECOLX
ID SHU1_ECOLX Reviewed; 474 AA.
AC P09745; Q9R2H7;
DT 01-JUL-1989, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-1989, sequence version 1.
DT 25-MAY-2022, entry version 72.
DE RecName: Full=Shufflon protein A;
OS Escherichia coli.
OG Plasmid IncI1 R64.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=562;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=3029698; DOI=10.1093/nar/15.3.1165;
RA Komano T., Kubo A., Nisioka T.;
RT "Shufflon: multi-inversion of four contiguous DNA segments of plasmid R64
RT creates seven different open reading frames.";
RL Nucleic Acids Res. 15:1165-1172(1987).
CC -!- MISCELLANEOUS: This protein is expressed by a shufflon (= clustered
CC inversion region that works as a biological switch). The orfs of this
CC region share a constant N-terminus, while the C-terminus is variable.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB027308; BAA77982.1; -; Genomic_DNA.
DR EMBL; AB027308; BAA77983.1; -; Genomic_DNA.
DR PIR; A26421; A26421.
DR RefSeq; WP_001389385.1; NZ_WVVK01000025.1.
DR RefSeq; YP_003108169.1; NC_013120.1.
DR RefSeq; YP_008998818.1; NC_023329.1.
DR RefSeq; YP_009061391.1; NC_024977.1.
DR RefSeq; YP_009069682.1; NC_025147.1.
DR AlphaFoldDB; P09745; -.
DR InterPro; IPR029017; Enolase-like_N.
DR InterPro; IPR007001; Shufflon_N.
DR Pfam; PF04917; Shufflon_N; 1.
DR SUPFAM; SSF54826; SSF54826; 1.
PE 4: Predicted;
KW Plasmid.
FT CHAIN 1..474
FT /note="Shufflon protein A"
FT /id="PRO_0000097741"
FT REGION 1..361
FT /note="Constant region"
FT REGION 362..474
FT /note="Variable region"
SQ SEQUENCE 474 AA; 49530 MW; 12FA84D73EA61104 CRC64;
MKKYDRGWAS LETGAALLIV MLLIAWGAGI WQDYIQTKGW QTEARLVSNW TSAARSYIGK
NYTTLQGSST TTTPAVITTT MLKNTGFLSS GFTETNSEGQ RLQAYVVRNA QNPELLQAMV
VSSGGTPYPV KALIQMAKDI TTGLGGYIQD GKTATGALRS WSVALSNYGA KSGNGHIAVL
LSTDELSGAA EDTDRLYRFQ VNGRPDLNKM HTAIDMGSNN LNNVGAVNAQ TGNFSGNVNG
VNGTFSGQVK GNSGNFDVNV TAGGDIRSNN GWLITRNSKG WLNETHGGGF YMSDGSWVRS
VNNKGIYTGG QVKGGTVRAD GRLYTGEYLQ LERTAVAGAS CSPNGLVGRD NTGAILSCQS
GTWKTSGSLN GSYTNLGSHR GSFSGRNSGG STLFIYASGG NGGSAGGACA NTSRLQGYVG
GTLISVNASN NPAYGKTAFI SFAVPAGTSY QITSYPTENT SCGAGVFSVF GYQT