SHU3_ECOLX
ID SHU3_ECOLX Reviewed; 442 AA.
AC P09747; Q9JMU0;
DT 01-JUL-1989, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-1989, sequence version 1.
DT 25-MAY-2022, entry version 46.
DE RecName: Full=Shufflon protein B;
OS Escherichia coli.
OG Plasmid IncI1 R64.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=562;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=3029698; DOI=10.1093/nar/15.3.1165;
RA Komano T., Kubo A., Nisioka T.;
RT "Shufflon: multi-inversion of four contiguous DNA segments of plasmid R64
RT creates seven different open reading frames.";
RL Nucleic Acids Res. 15:1165-1172(1987).
CC -!- MISCELLANEOUS: This protein is expressed by a shufflon (= clustered
CC inversion region that works as a biological switch). The ORFs of this
CC region share a constant N-terminus, while the C-terminus is variable.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB027308; BAA77988.1; -; Genomic_DNA.
DR PIR; C26421; C26421.
DR AlphaFoldDB; P09747; -.
DR InterPro; IPR029017; Enolase-like_N.
DR InterPro; IPR007001; Shufflon_N.
DR Pfam; PF04917; Shufflon_N; 1.
DR SUPFAM; SSF54826; SSF54826; 1.
PE 4: Predicted;
KW Plasmid.
FT CHAIN 1..442
FT /note="Shufflon protein B"
FT /id="PRO_0000097744"
FT REGION 1..361
FT /note="Constant region"
FT REGION 362..442
FT /note="Variable region"
SQ SEQUENCE 442 AA; 47681 MW; D86E718D5608C63C CRC64;
MKKYDRGWAS LETGAALLIV MLLIAWGAGI WQDYIQTKGW QTEARLVSNW TSAARSYIGK
NYTTLQGSST TTTPAVITTT MLKNTGFLSS GFTETNSEGQ RLQAYVVRNA QNPELLQAMV
VSSGGTPYPV KALIQMAKDI TTGLGGYIQD GKTATGALRS WSVALSNYGA KSGNGHIAVL
LSTDELSGAA EDTDRLYRFQ VNGRPDLNKM HTAIDMGSNN LNNVGAVNAQ TGNFSGNVNG
VNGTFSGQVK GNSGNFDVNV TAGGDIRSNN GWLITRNSKG WLNETHGGGF YMSDGSWVRS
VNNKGIYTGG QVKGGTVRAD GRLYTGEYLQ LERTAVAGAS CSPNGLVGRD NTGAILSCQS
GTWKSSSASI WTNIKTFTLY PKNTQVLGRF KLCINTYRID GREMAETEVV PIDMPDSNGE
MIWQAKNYTQ YSSYFMKITC LK