ID SET1_CHAGB Reviewed; 1076 AA.
AC Q2GWF3;
DT 09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT 21-MAR-2006, sequence version 1.
DT 03-AUG-2022, entry version 99.
DE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE EC=2.1.1.354 {ECO:0000250|UniProtKB:Q9Y7R4};
DE AltName: Full=COMPASS component SET1;
DE AltName: Full=SET domain-containing protein 1;
GN Name=SET1; ORFNames=CHGG_07701;
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970;
RX PubMed=25720678; DOI=10.1128/genomea.00021-15;
RA Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT "Draft genome sequence of the cellulolytic fungus Chaetomium globosum.";
RL Genome Announc. 3:E0002115-E0002115(2015).
CC -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC specifically mono-, di- and trimethylates histone H3 to form
CC H3K4me1/2/3, which subsequently plays a role in telomere length
CC maintenance and transcription elongation regulation.
CC {ECO:0000250|UniProtKB:Q9Y7R4}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(4)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+) +
CC N(6),N(6),N(6)-trimethyl-L-lysyl(4)-[histone H3] + 3 S-adenosyl-L-
CC homocysteine; Xref=Rhea:RHEA:60260, Rhea:RHEA-COMP:15537, Rhea:RHEA-
CC COMP:15547, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.354;
CC Evidence={ECO:0000250|UniProtKB:Q9Y7R4};
CC -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the class V-like SAM-binding methyltransferase
CC superfamily. {ECO:0000255|PROSITE-ProRule:PRU00190}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408033; EAQ86448.1; -; Genomic_DNA.
DR RefSeq; XP_001225357.1; XM_001225356.1.
DR AlphaFoldDB; Q2GWF3; -.
DR SMR; Q2GWF3; -.
DR STRING; 38033.XP_001225357.1; -.
DR EnsemblFungi; EAQ86448; EAQ86448; CHGG_07701.
DR GeneID; 4393302; -.
DR eggNOG; KOG1080; Eukaryota.
DR HOGENOM; CLU_004391_0_0_1; -.
DR InParanoid; Q2GWF3; -.
DR OMA; CHMTALF; -.
DR OrthoDB; 1017537at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0042800; F:histone methyltransferase activity (H3-K4 specific); IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006325; P:chromatin organization; IEA:UniProtKB-KW.
DR GO; GO:0051568; P:histone H3-K4 methylation; IEA:InterPro.
DR Gene3D; 2.170.270.10; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR InterPro; IPR024657; COMPASS_Set1_N-SET.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR017111; Set1_fungi.
DR InterPro; IPR024636; SET_assoc.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR45814; PTHR45814; 1.
DR Pfam; PF11764; N-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF11767; SET_assoc; 1.
DR PIRSF; PIRSF037104; Histone_H3-K4_mtfrase_Set1_fun; 1.
DR SMART; SM01291; N-SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54928; SSF54928; 1.
DR SUPFAM; SSF82199; SSF82199; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS51572; SAM_MT43_1; 1.
DR PROSITE; PS50280; SET; 1.
PE 3: Inferred from homology;
KW Chromatin regulator; Chromosome; Methyltransferase; Nucleus;
KW Reference proteome; S-adenosyl-L-methionine; Transferase.
FT CHAIN 1..1076
FT /note="Histone-lysine N-methyltransferase, H3 lysine-4
FT specific"
FT /id="PRO_0000269769"
FT DOMAIN 934..1051
FT /note="SET"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00190"
FT DOMAIN 1060..1076
FT /note="Post-SET"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00155"
FT REGION 174..210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 325..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 493..599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 623..744
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..206
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..387
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 522..574
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 679..708
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 716..742
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1076 AA; 121756 MW; B10CF6093311D63D CRC64;
MGTIWLISAA SDDQDDAPPS DPRLAKGGRL NYINVDFHLP KARLRHAPYN LKPYKYDPKT
SCGPGPPTQV VVTGFNPLIA FSKVTAVFAS FGDIAESSNK MHPDTGSYLG FATFRYRDSK
PSRSRPISIT GADAAKRAIR AMHGKRIEAN MVRVEYDAEG KKSSRMLVEV LQKGNETTPA
LGEPRIPTGP KPKEVAPGPP PTAPKGPAAH RGGLMNVQGV WVPKPRPDSI IEVEPVIGHL
KHDPYIFVGH EHVPVMPTTV AHMKRRLKTY MFEDIRADRT GYYIVFQDSG YGRAEAERCF
RSADRTAFFT YTMVMVLHLY GTDGKASHAH ASDTRRRTRT PERKHVDEAR PHREHDRSRR
DEERARRDEQ DRRRREDEAD LEEEKRQRAK NYDPVLEATD VVLRGMKEQL IKIIRTKIAA
PALFNFLDPV NHLAKRRRLN LEDPHSARLP PIVLDEFEDR SPVSTPNSRA DPIERRTARL
DVSALPRIRK VKNAGLNTRK HGFNDPFARN RPTARRTAFR SLHYRLRSDS EGESEDEAEN
RTSLGRDTEE PESRPRSRMS SDDEGDKDDY ASWGPGDDDS MTEASFALGD GPGLAKKRKL
DLQVETAIKR QKKTDEELFG VTIDRIGTEF PSREDSLEDV LPPGPGGGEE KDIGSSRLPT
PLLQEGKAKK KAPAKTKRKS KKQLFEEREA LKRQQQEIFE REALQSEDVD EVIPTPEPES
EPKKSKVEKE KEKEEKVEKP ALDENLYPSQ KVSVLELPHD FRLDVGSLEE LALGPNDQPD
LDRLRKRFGR GKIDDPELWV WRRDRIRELN STDGSAKTPV RIEGYYVPNP TGCARAEGVK
KILNSEKSKY LPHHIKVKKA REERQAQNGK NAKDSVLAAA EAARLAAESL VAKGNSRANR
ANNRRFVADL NDQRKTLGQD SDVLRFNQLK KRKKPVKFAR SAIHNWGLYA MENIPKDDMI
IEYVGEEVRQ QIAELRENRY LKSGIGSSYL FRIDDNTVID ATKKGGIARF INHSCMPNCT
AKIIKVEGSK RIVIYALRDI AQNEELTYDY KFERELGSTD RIPCLCGTAA CKGFLN