SETBP_MOUSE
ID SETBP_MOUSE Reviewed; 1582 AA.
AC Q9Z180; Q66JL8;
DT 13-APR-2004, integrated into UniProtKB/Swiss-Prot.
DT 20-APR-2010, sequence version 4.
DT 03-AUG-2022, entry version 133.
DE RecName: Full=SET-binding protein;
DE Short=SEB;
GN Name=Setbp1; Synonyms=Kiaa0437;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 27-1582.
RC STRAIN=C57BL/6J; TISSUE=Brain;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 155-1582.
RC TISSUE=Embryonic tail;
RX PubMed=14621295; DOI=10.1093/dnares/10.4.167;
RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S.,
RA Saga Y., Nagase T., Ohara O., Koga H.;
RT "Prediction of the coding sequences of mouse homologues of KIAA gene: III.
RT The complete nucleotide sequences of 500 mouse KIAA-homologous cDNAs
RT identified by screening of terminal sequences of cDNA clones randomly
RT sampled from size-fractionated libraries.";
RL DNA Res. 10:167-180(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 1281-1477.
RC TISSUE=Embryo;
RX PubMed=11231286; DOI=10.1046/j.1432-1327.2001.02000.x;
RA Minakuchi M., Kakazu N., Gorrin-Rivas M.J., Abe T., Copeland T.D., Ueda K.,
RA Adachi Y.;
RT "Identification and characterization of SEB, a novel protein that binds to
RT the acute undifferentiated leukemia-associated protein SET.";
RL Eur. J. Biochem. 268:1340-1351(2001).
RN [5]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-808, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Embryonic fibroblast;
RX PubMed=23806337; DOI=10.1016/j.molcel.2013.06.001;
RA Park J., Chen Y., Tishkoff D.X., Peng C., Tan M., Dai L., Xie Z., Zhang Y.,
RA Zwaans B.M., Skinner M.E., Lombard D.B., Zhao Y.;
RT "SIRT5-mediated lysine desuccinylation impacts diverse metabolic
RT pathways.";
RL Mol. Cell 50:919-930(2013).
CC -!- SUBUNIT: Interacts with SET. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH80865.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC114924; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC131736; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC140455; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC146613; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC080865; AAH80865.1; ALT_INIT; mRNA.
DR EMBL; AK129143; BAC97953.1; -; mRNA.
DR EMBL; AB015614; BAA36338.1; -; mRNA.
DR CCDS; CCDS29362.2; -.
DR RefSeq; NP_444329.2; NM_053099.2.
DR AlphaFoldDB; Q9Z180; -.
DR BioGRID; 232199; 2.
DR IntAct; Q9Z180; 1.
DR MINT; Q9Z180; -.
DR STRING; 10090.ENSMUSP00000124497; -.
DR iPTMnet; Q9Z180; -.
DR PhosphoSitePlus; Q9Z180; -.
DR MaxQB; Q9Z180; -.
DR PaxDb; Q9Z180; -.
DR PRIDE; Q9Z180; -.
DR ProteomicsDB; 256625; -.
DR Antibodypedia; 22406; 176 antibodies from 25 providers.
DR DNASU; 240427; -.
DR Ensembl; ENSMUST00000025430; ENSMUSP00000025430; ENSMUSG00000024548.
DR GeneID; 240427; -.
DR KEGG; mmu:240427; -.
DR UCSC; uc008fsi.2; mouse.
DR CTD; 26040; -.
DR MGI; MGI:1933199; Setbp1.
DR VEuPathDB; HostDB:ENSMUSG00000024548; -.
DR eggNOG; KOG1083; Eukaryota.
DR GeneTree; ENSGT00940000158784; -.
DR HOGENOM; CLU_005903_0_0_1; -.
DR InParanoid; Q9Z180; -.
DR OMA; CDNLPGR; -.
DR OrthoDB; 208374at2759; -.
DR PhylomeDB; Q9Z180; -.
DR TreeFam; TF106416; -.
DR BioGRID-ORCS; 240427; 3 hits in 76 CRISPR screens.
DR ChiTaRS; Setbp1; mouse.
DR PRO; PR:Q9Z180; -.
DR Proteomes; UP000000589; Chromosome 18.
DR RNAct; Q9Z180; protein.
DR Bgee; ENSMUSG00000024548; Expressed in dorsal root ganglion and 85 other tissues.
DR GO; GO:0005829; C:cytosol; ISO:MGI.
DR GO; GO:0016604; C:nuclear body; ISO:MGI.
DR GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR GO; GO:0005634; C:nucleus; IPI:MGI.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0042800; F:histone methyltransferase activity (H3-K4 specific); IBA:GO_Central.
DR GO; GO:0097676; P:histone H3-K36 dimethylation; IBA:GO_Central.
DR GO; GO:0051568; P:histone H3-K4 methylation; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IBA:GO_Central.
DR InterPro; IPR017956; AT_hook_DNA-bd_motif.
DR SMART; SM00384; AT_hook; 3.
PE 1: Evidence at protein level;
KW Acetylation; DNA-binding; Nucleus; Reference proteome; Repeat.
FT CHAIN 1..1582
FT /note="SET-binding protein"
FT /id="PRO_0000097699"
FT DNA_BIND 575..587
FT /note="A.T hook 1"
FT DNA_BIND 1007..1019
FT /note="A.T hook 2"
FT DNA_BIND 1440..1452
FT /note="A.T hook 3"
FT REGION 1..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 124..246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 278..416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 446..513
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 595..617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 709..787
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 845..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1128..1155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1182..1215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1236..1265
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1429..1461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1470..1489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1507..1582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 12..30
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 138..156
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..238
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 278..304
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 371..403
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 494..508
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 718..738
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..787
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 845..873
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1136..1151
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1182..1214
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1507..1535
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1562..1582
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 808
FT /note="N6-acetyllysine"
FT /evidence="ECO:0007744|PubMed:23806337"
FT CONFLICT 1281
FT /note="D -> L (in Ref. 4; BAA36338)"
FT /evidence="ECO:0000305"
FT CONFLICT 1433
FT /note="K -> N (in Ref. 2; AAH80865)"
FT /evidence="ECO:0000305"
FT CONFLICT 1467
FT /note="C -> W (in Ref. 4; BAA36338)"
FT /evidence="ECO:0000305"
FT CONFLICT 1476..1477
FT /note="QK -> PE (in Ref. 4; BAA36338)"
FT /evidence="ECO:0000305"
FT CONFLICT 1509..1529
FT /note="Missing (in Ref. 3; BAC97953)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1582 AA; 173077 MW; 783C1FEDFB5FE4F7 CRC64;
MEPREMLSSC RQRGSESEFL QGSSSRSPPA PGCSGEPLKG ISVGGERMEP EEEDELGSGR
DVDCNSNADS EKWVAGDGLE EQEFSIKEAN FTEGSLKLKI QTTKRAKKPP KNLENYICPP
EIKITIKQSG DQKVSRTGKN SKATKEDERN HSKKKLLTAG DPTASDLKAF QTQAYERPQK
HSTLQYDPGH SQGFTSDTLK PKHQQKSSSQ SHMEWSSNSD SGPATQNCFI SPEAGRDTAS
TSKVPALEPV ASFAKAQSKK GSTGGAWSQL SSSSKDLLLG SVVPSPSSHN SPATPSSSAE
CNGLQPLGDQ DGGSTKDLPE PPTLSSKKKS SKKDMISQTL PNSDLDWVKS AQKAFETTEG
KREAYSADSA QEASPARQSI SSVSNPENDS SHVRITIPIK TPSLDPSNHK RKKRQSIKAV
VEKIVPEKAL ASGISMSSEV VNRILSNSEG SKKDPRVPKL GKMIENETPS VGLETGGNAE
KIVPGGASKQ RKPPMVMTSP TRTEHAPSGK LSEIQHPKFA AKRRCSKAKP PAMLREAVLA
TAEKLMVEPP SAYPITPSSP LYTNTDSLTV ITPVKKKRGR PKKQPLLTVE TIHEGTSTSP
VSPISREFPG TKKRKRRRNL AKLAQLVPGE DKPMSEMKFH KKVGKLGVLD KKTIKTINKM
KTLKRKNILN QILSCSSSVA LKAKAPPETS PGAASIESKL GKQINVSKRG TIYIGKKRGR
KPRTELPPPS EEPKTAIKHP RPVSSQPDVP AVPSSFQSPV ASSPAAMHPL STQLGGSNGN
LSPASTETNF SELKTMPNLQ PISALPTKTQ KGIHGGTWKL SPPRLMANSP SHLCEIGSLK
EITLSPVSES HSEETIPSDS GIGTDNNSTS DQAEKSSESR RRYSFDFCSL DNPEAIPSDT
STKNRHGHRQ KHLIVDTFLA HESLKKPKHK RKRKSLQNRD DLQFLAELEE LITKFQVFRI
SHRGYTFYHE NPYPSIFRIN FDQYYPVPYI QYDPLLYLRR TSDLKSKKKR GRPAKTNDTM
TKVPFLQGFS YPIPSGSYYA PYGMPYTSMP MMNLGYYGQY PAPLYLSHTL GAASPFMRPT
VPPPQFHASS HVKISGATKH KAKHGVHLQG TVGMGLGDIQ PSLNPPKVGG ATLSSSRLHK
RKHKHKRKHK EDRILGTHDN LSGLFAGKAT GFSSHLLSER LSGSDKELPL VSEKSKHKER
QKHQHGEASH KVSKNNFEVD TLSTLSLSDA QHWTQAKDKG DLSSEPVESC AKRYSGSGGD
STRSEGLDVF SEMNPSSDKW DSDMGGSKRR SFEGFGTYRE KDIQAFKMNR KERGSYESSM
SPGMPSPHLK VDQTAAHSKS EGSISAMMAR KKPTAVDSVA IPSAPVLSLL AASAATSDAA
SSSLKKRFKR REIEAIQCEV RKMCHYTKLL STKKNLDHVN KILKAKRLQR QSKTGNNFVK
KRRGRPRKQP SQFDEDSRDQ MPVLEKCIDL PSKRGQKPSL SPLALEPASG QDAVMATIEA
VIHMAREAPP LPPPPPPPLP PPPPPPPPPP PLPKTARGGK RKHRPQPPAQ PAQPTPQPLP
QEEEVKAKRP RKSRASESDV LP