H2A_CAIMO
ID H2A_CAIMO Reviewed; 129 AA.
AC P13912;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 2.
DT 03-AUG-2022, entry version 110.
DE RecName: Full=Histone H2A;
OS Cairina moschata (Muscovy duck).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Cairina.
OX NCBI_TaxID=8855;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=2494350; DOI=10.1007/bf02102477;
RA Toenjes R., Munk K., Doenecke D.;
RT "Conserved organization of an avian histone gene cluster with inverted
RT duplications of H3 and H4 genes.";
RL J. Mol. Evol. 28:200-211(1989).
CC -!- FUNCTION: Core component of nucleosome. Nucleosomes wrap and compact
CC DNA into chromatin, limiting DNA accessibility to the cellular
CC machineries which require DNA as a template. Histones thereby play a
CC central role in transcription regulation, DNA repair, DNA replication
CC and chromosomal stability. DNA accessibility is regulated via a complex
CC set of post-translational modifications of histones, also called
CC histone code, and nucleosome remodeling.
CC -!- SUBUNIT: The nucleosome is a histone octamer containing two molecules
CC each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and
CC two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of
CC DNA.
CC -!- SUBCELLULAR LOCATION: Nucleus. Chromosome.
CC -!- PTM: Monoubiquitination of Lys-120 (H2AK119Ub) gives a specific tag for
CC epigenetic transcriptional repression. Following DNA double-strand
CC breaks (DSBs), it is ubiquitinated through 'Lys-63' linkage of
CC ubiquitin moieties, leading to the recruitment of repair proteins to
CC sites of DNA damage. H2AK119Ub and ionizing radiation-induced 'Lys-63'-
CC linked ubiquitination are distinct events (By similarity).
CC {ECO:0000250}.
CC -!- PTM: Phosphorylation on Ser-2 is enhanced during mitosis.
CC Phosphorylation on Ser-2 directly represses transcription (By
CC similarity). {ECO:0000250}.
CC -!- PTM: Glutamine methylation at Gln-105 (H2AQ104me) by FBL is
CC specifically dedicated to polymerase I. It is present at 35S ribosomal
CC DNA locus and impairs binding of the FACT complex (By similarity).
CC {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the histone H2A family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X14730; CAA32852.1; -; Genomic_DNA.
DR PIR; I50457; I50457.
DR AlphaFoldDB; P13912; -.
DR SMR; P13912; -.
DR GO; GO:0000786; C:nucleosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:InterPro.
DR GO; GO:0030527; F:structural constituent of chromatin; IEA:InterPro.
DR CDD; cd00074; H2A; 1.
DR Gene3D; 1.10.20.10; -; 1.
DR InterPro; IPR009072; Histone-fold.
DR InterPro; IPR002119; Histone_H2A.
DR InterPro; IPR007125; Histone_H2A/H2B/H3.
DR InterPro; IPR032454; Histone_H2A_C.
DR InterPro; IPR032458; Histone_H2A_CS.
DR PANTHER; PTHR23430; PTHR23430; 1.
DR Pfam; PF00125; Histone; 1.
DR Pfam; PF16211; Histone_H2A_C; 1.
DR PRINTS; PR00620; HISTONEH2A.
DR SMART; SM00414; H2A; 1.
DR SUPFAM; SSF47113; SSF47113; 1.
DR PROSITE; PS00046; HISTONE_H2A; 1.
PE 3: Inferred from homology;
KW Acetylation; Chromosome; DNA-binding; Hydroxylation; Isopeptide bond;
KW Methylation; Nucleosome core; Nucleus; Phosphoprotein; Ubl conjugation.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0000250"
FT CHAIN 2..129
FT /note="Histone H2A"
FT /id="PRO_0000055211"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 2
FT /note="N-acetylserine"
FT /evidence="ECO:0000250"
FT MOD_RES 2
FT /note="Phosphoserine"
FT /evidence="ECO:0000250"
FT MOD_RES 6
FT /note="N6-(2-hydroxyisobutyryl)lysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 6
FT /note="N6-acetyllysine"
FT /evidence="ECO:0000250"
FT MOD_RES 10
FT /note="N6-(2-hydroxyisobutyryl)lysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 10
FT /note="N6-lactoyllysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S5"
FT MOD_RES 10
FT /note="N6-succinyllysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 37
FT /note="N6-(2-hydroxyisobutyryl)lysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 75
FT /note="N6-(2-hydroxyisobutyryl)lysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 76
FT /note="N6-(2-hydroxyisobutyryl)lysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 96
FT /note="N6-(2-hydroxyisobutyryl)lysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 96
FT /note="N6-glutaryllysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 96
FT /note="N6-succinyllysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 100
FT /note="N6-glutaryllysine"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 105
FT /note="N5-methylglutamine"
FT /evidence="ECO:0000250"
FT MOD_RES 119
FT /note="N6-(2-hydroxyisobutyryl)lysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 119
FT /note="N6-glutaryllysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT MOD_RES 120
FT /note="N6-glutaryllysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P0C0S8"
FT CROSSLNK 14
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in ubiquitin)"
FT /evidence="ECO:0000250"
FT CROSSLNK 16
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in ubiquitin)"
FT /evidence="ECO:0000250"
FT CROSSLNK 120
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in ubiquitin)"
FT /evidence="ECO:0000250"
SQ SEQUENCE 129 AA; 13956 MW; C150497D1C525360 CRC64;
MSGRGKQGGK ARAKAKSRSS RAGLQFPVGR VHRLLRKGNY AERVGAGAPV YLAAVLEYLT
AEILELAGNA ARDNKKTRII PRHLQLAIRN DEELNKLLGK VTIAQGGVLP NIQAVLLPKK
TDSHKAKSK