H31_PICGU
ID H31_PICGU Reviewed; 136 AA.
AC A5DFC5;
DT 21-AUG-2007, integrated into UniProtKB/Swiss-Prot.
DT 12-JUN-2007, sequence version 1.
DT 03-AUG-2022, entry version 68.
DE RecName: Full=Histone H3.1/H3.2;
GN Name=HHT1; ORFNames=PGUG_01976;
GN and
GN Name=HHT2; ORFNames=PGUG_04810;
OS Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539
OS / NBRC 10279 / NRRL Y-324) (Yeast) (Candida guilliermondii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Debaryomycetaceae; Meyerozyma.
OX NCBI_TaxID=294746;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324;
RX PubMed=19465905; DOI=10.1038/nature08064;
RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S.,
RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., Agrafioti I.,
RA Arnaud M.B., Bates S., Brown A.J.P., Brunke S., Costanzo M.C.,
RA Fitzpatrick D.A., de Groot P.W.J., Harris D., Hoyer L.L., Hube B.,
RA Klis F.M., Kodira C., Lennard N., Logue M.E., Martin R., Neiman A.M.,
RA Nikolaou E., Quail M.A., Quinn J., Santos M.C., Schmitzberger F.F.,
RA Sherlock G., Shah P., Silverstein K.A.T., Skrzypek M.S., Soll D.,
RA Staggs R., Stansfield I., Stumpf M.P.H., Sudbery P.E., Srikantha T.,
RA Zeng Q., Berman J., Berriman M., Heitman J., Gow N.A.R., Lorenz M.C.,
RA Birren B.W., Kellis M., Cuomo C.A.;
RT "Evolution of pathogenicity and sexual reproduction in eight Candida
RT genomes.";
RL Nature 459:657-662(2009).
CC -!- FUNCTION: Core component of nucleosome. Nucleosomes wrap and compact
CC DNA into chromatin, limiting DNA accessibility to the cellular
CC machineries which require DNA as a template. Histones thereby play a
CC central role in transcription regulation, DNA repair, DNA replication
CC and chromosomal stability. DNA accessibility is regulated via a complex
CC set of post-translational modifications of histones, also called
CC histone code, and nucleosome remodeling.
CC -!- SUBUNIT: The nucleosome is a histone octamer containing two molecules
CC each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and
CC two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of
CC DNA.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}. Chromosome {ECO:0000250}.
CC -!- PTM: Phosphorylated to form H3S10ph. H3S10ph promotes subsequent
CC H3K14ac formation and is required for transcriptional activation
CC through TBP recruitment to the promoters (By similarity).
CC {ECO:0000250}.
CC -!- PTM: Mono-, di- and trimethylated by the COMPASS complex to form
CC H3K4me1/2/3. H3K4me activates gene expression by regulating
CC transcription elongation and plays a role in telomere length
CC maintenance. H3K4me enrichment correlates with transcription levels,
CC and occurs in a 5' to 3' gradient with H3K4me3 enrichment at the 5'-end
CC of genes, shifting to H3K4me2 and then H3K4me1. Methylated by SET2 to
CC form H3K36me. H3K36me represses gene expression. Methylated by DOT1 to
CC form H3K79me. H3K79me is required for association of SIR proteins with
CC telomeric regions and for telomeric silencing. The COMPASS-mediated
CC formation of H3K4me2/3 and the DOT1-mediated formation of H3K79me
CC require H2BK123ub1 (By similarity). {ECO:0000250}.
CC -!- PTM: Acetylation of histone H3 leads to transcriptional activation.
CC H3K14ac formation by GCN5 is promoted by H3S10ph. H3K14ac can also be
CC formed by ESA1. H3K56ac formation occurs predominantly in newly
CC synthesized H3 molecules during G1, S and G2/M of the cell cycle and
CC may be involved in DNA repair (By similarity). {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the histone H3 family. {ECO:0000305}.
CC -!- CAUTION: To ensure consistency between histone entries, we follow the
CC 'Brno' nomenclature for histone modifications, with positions referring
CC to those used in the literature for the 'closest' model organism. Due
CC to slight variations in histone sequences between organisms and to the
CC presence of initiator methionine in UniProtKB/Swiss-Prot sequences, the
CC actual positions of modified amino acids in the sequence generally
CC differ. In this entry the following conventions are used: H3K4me1/2/3 =
CC mono-, di- and trimethylated Lys-5; H3K9ac = acetylated Lys-10; H3K9me1
CC = monomethylated Lys-10; H3S10ph = phosphorylated Ser-11; H3K14ac =
CC acetylated Lys-15; H3K14me2 = dimethylated Lys-15; H3K18ac = acetylated
CC Lys-19; H3K18me1 = monomethylated Lys-19; H3K23ac = acetylated Lys-24;
CC H3K23me1 = monomethylated Lys-24; H3K27ac = acetylated Lys-28;
CC H3K27me1/2/3 = mono-, di- and trimethylated Lys-28; H3K36ac =
CC acetylated Lys-37; H3K36me1/2/3 = mono-, di- and trimethylated Lys-37;
CC H3K56ac = acetylated Lys-57; H3K64ac = acetylated Lys-65; H3K79me1/2/3
CC = mono-, di- and trimethylated Lys-80. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408156; EDK37878.1; -; Genomic_DNA.
DR EMBL; CH408160; EDK40712.1; -; Genomic_DNA.
DR RefSeq; XP_001482855.1; XM_001482805.1.
DR RefSeq; XP_001486305.1; XM_001486255.1.
DR AlphaFoldDB; A5DFC5; -.
DR SMR; A5DFC5; -.
DR STRING; 4929.XP_001482855.1; -.
DR EnsemblFungi; EDK37878; EDK37878; PGUG_01976.
DR EnsemblFungi; EDK40712; EDK40712; PGUG_04810.
DR GeneID; 5124555; -.
DR GeneID; 5128133; -.
DR KEGG; pgu:PGUG_01976; -.
DR KEGG; pgu:PGUG_04810; -.
DR VEuPathDB; FungiDB:PGUG_01976; -.
DR VEuPathDB; FungiDB:PGUG_04810; -.
DR eggNOG; KOG1745; Eukaryota.
DR HOGENOM; CLU_078295_4_0_1; -.
DR InParanoid; A5DFC5; -.
DR OMA; HIVMART; -.
DR OrthoDB; 1564596at2759; -.
DR Proteomes; UP000001997; Unassembled WGS sequence.
DR GO; GO:0000786; C:nucleosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR GO; GO:0030527; F:structural constituent of chromatin; IEA:InterPro.
DR Gene3D; 1.10.20.10; -; 1.
DR InterPro; IPR009072; Histone-fold.
DR InterPro; IPR007125; Histone_H2A/H2B/H3.
DR InterPro; IPR000164; Histone_H3/CENP-A.
DR PANTHER; PTHR11426; PTHR11426; 1.
DR Pfam; PF00125; Histone; 1.
DR PRINTS; PR00622; HISTONEH3.
DR SMART; SM00428; H3; 1.
DR SUPFAM; SSF47113; SSF47113; 1.
DR PROSITE; PS00322; HISTONE_H3_1; 1.
DR PROSITE; PS00959; HISTONE_H3_2; 1.
PE 3: Inferred from homology;
KW Acetylation; Chromosome; DNA-binding; Methylation; Nucleosome core;
KW Nucleus; Phosphoprotein; Reference proteome.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0000250"
FT CHAIN 2..136
FT /note="Histone H3.1/H3.2"
FT /id="PRO_0000297751"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 5
FT /note="N6,N6,N6-trimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 5
FT /note="N6,N6-dimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 5
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 10
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 10
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 11
FT /note="Phosphoserine"
FT /evidence="ECO:0000250"
FT MOD_RES 15
FT /note="N6,N6-dimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 15
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 19
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 19
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 24
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 24
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 28
FT /note="N6,N6,N6-trimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 28
FT /note="N6,N6-dimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 28
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 28
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 37
FT /note="N6,N6,N6-trimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 37
FT /note="N6,N6-dimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 37
FT /note="N6-acetyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 37
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 57
FT /note="N6-acetyllysine"
FT /evidence="ECO:0000250"
FT MOD_RES 65
FT /note="N6-acetyllysine"
FT /evidence="ECO:0000250"
FT MOD_RES 80
FT /note="N6,N6,N6-trimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 80
FT /note="N6,N6-dimethyllysine; alternate"
FT /evidence="ECO:0000250"
FT MOD_RES 80
FT /note="N6-methyllysine; alternate"
FT /evidence="ECO:0000250"
SQ SEQUENCE 136 AA; 15388 MW; A613633B480AC67A CRC64;
MARTKQTARK STGGKAPRKQ LASKAARKSA PSTGGVKKPH RYKPGTVALR EIRRFQKSTE
LLIRKLPFQR LVREIAQDFK TDLRFQSSAI GALQESVEAY LVSLFEDTNL CAIHAKRVTI
QKKDIQLARR LRGERS