HMCES_RAT

ID   HMCES_RAT               Reviewed;         353 AA.
AC   Q5XIJ1;
DT   24-JAN-2006, integrated into UniProtKB/Swiss-Prot.
DT   23-NOV-2004, sequence version 1.
DT   03-AUG-2022, entry version 92.
DE   RecName: Full=Abasic site processing protein HMCES {ECO:0000250|UniProtKB:Q96FZ2};
DE   AltName: Full=Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein;
DE            Short=ES cell-specific 5hmC-binding protein {ECO:0000250|UniProtKB:Q96FZ2};
DE   AltName: Full=Peptidase HMCES {ECO:0000250|UniProtKB:Q8R1M0};
DE            EC=3.4.-.- {ECO:0000250|UniProtKB:Q8R1M0};
DE   AltName: Full=SRAP domain-containing protein 1 {ECO:0000250|UniProtKB:Q96FZ2};
GN   Name=Hmces {ECO:0000312|RGD:1559800};
GN   Synonyms=Srapd1 {ECO:0000250|UniProtKB:Q96FZ2};
OS   Rattus norvegicus (Rat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Rattus.
OX   NCBI_TaxID=10116;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   TISSUE=Heart;
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA project:
RT   the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
CC   -!- FUNCTION: Sensor of abasic sites in single-stranded DNA (ssDNA)
CC       required to preserve genome integrity by promoting error-free repair of
CC       abasic sites. Acts as an enzyme that recognizes and binds abasic sites
CC       in ssDNA at replication forks and chemically modifies the lesion by
CC       forming a covalent cross-link with DNA: forms a stable thiazolidine
CC       linkage between a ring-opened abasic site and the alpha-amino and
CC       sulfhydryl substituents of its N-terminal catalytic cysteine residue.
CC       The HMCES DNA-protein cross-link is then degraded by the proteasome.
CC       Promotes error-free repair of abasic sites by acting as a 'suicide'
CC       enzyme that is degraded, thereby protecting abasic sites from
CC       translesion synthesis (TLS) polymerases and endonucleases that are
CC       error-prone and would generate mutations and double-strand breaks. Has
CC       a preference for ssDNA, but can also accommodate double-stranded DNA
CC       (dsDNA) with a 3' or 5' overhang, and dsDNA-ssDNA 3' junctions (By
CC       similarity).
CC       Also involved in class switch recombination (CSR) in B-cells
CC       independently of the formation of a DNA-protein cross-link: acts by
CC       binding and protecting ssDNA overhangs to promote DNA double-strand
CC       break repair through the microhomology-mediated alternative-end-joining
CC       (Alt-EJ) pathway. Acts as a protease: mediates autocatalytic processing
CC       of its N-terminal methionine in order to expose the catalytic cysteine
CC       (By similarity). {ECO:0000250|UniProtKB:Q8R1M0,
CC       ECO:0000250|UniProtKB:Q96FZ2}.
CC   -!- SUBUNIT: Interacts (via PIP-box motif) with PCNA.
CC       {ECO:0000250|UniProtKB:Q96FZ2}.
CC   -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000250|UniProtKB:Q96FZ2}.
CC       Note=Recruited to chromatin following DNA damage. Localizes to
CC       replication forks. {ECO:0000250|UniProtKB:Q96FZ2}.
CC   -!- DOMAIN: Glu-127 is involved in sensing abasic sites in single-stranded
CC       DNA (ssDNA). His-209 stabilizes the abasic sites by forming a hydrogen
CC       bond with the O4' hydroxyl group. {ECO:0000250|UniProtKB:P76318}.
CC   -!- PTM: Ubiquitinated; the covalent HMCES DNA-protein cross-link is
CC       ubiquitinated, leading to its degradation by the proteasome.
CC       {ECO:0000250|UniProtKB:Q96FZ2}.
CC   -!- SIMILARITY: Belongs to the SOS response-associated peptidase family.
CC       {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BC083690; AAH83690.1; -; mRNA.
DR   RefSeq; NP_001020218.1; NM_001025047.1.
DR   RefSeq; XP_006236913.1; XM_006236851.3.
DR   AlphaFoldDB; Q5XIJ1; -.
DR   SMR; Q5XIJ1; -.
DR   STRING; 10116.ENSRNOP00000054105; -.
DR   PaxDb; Q5XIJ1; -.
DR   Ensembl; ENSRNOT00000057286; ENSRNOP00000054105; ENSRNOG00000010474.
DR   GeneID; 500251; -.
DR   KEGG; rno:500251; -.
DR   UCSC; RGD:1559800; rat.
DR   CTD; 56941; -.
DR   RGD; 1559800; Hmces.
DR   eggNOG; KOG2618; Eukaryota.
DR   GeneTree; ENSGT00390000016313; -.
DR   HOGENOM; CLU_035990_1_0_1; -.
DR   InParanoid; Q5XIJ1; -.
DR   OrthoDB; 1487237at2759; -.
DR   PhylomeDB; Q5XIJ1; -.
DR   PRO; PR:Q5XIJ1; -.
DR   Proteomes; UP000002494; Chromosome 4.
DR   Bgee; ENSRNOG00000010474; Expressed in cerebellum and 19 other tissues.
DR   ExpressionAtlas; Q5XIJ1; baseline and differential.
DR   Genevisible; Q5XIJ1; RN.
DR   GO; GO:0005657; C:replication fork; ISS:UniProtKB.
DR   GO; GO:0008233; F:peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0003697; F:single-stranded DNA binding; ISS:UniProtKB.
DR   GO; GO:0006974; P:cellular response to DNA damage stimulus; ISS:UniProtKB.
DR   GO; GO:0097681; P:double-strand break repair via alternative nonhomologous end joining; ISO:RGD.
DR   GO; GO:0045830; P:positive regulation of isotype switching; ISS:UniProtKB.
DR   GO; GO:0018142; P:protein-DNA covalent cross-linking; ISS:UniProtKB.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   Gene3D; 3.90.1680.10; -; 1.
DR   InterPro; IPR003738; SRAP.
DR   InterPro; IPR036590; SRAP-like.
DR   PANTHER; PTHR13604; PTHR13604; 1.
DR   Pfam; PF02586; SRAP; 1.
DR   SUPFAM; SSF143081; SSF143081; 1.
PE   2: Evidence at transcript level;
KW   Autocatalytic cleavage; Chromosome; Covalent protein-DNA linkage;
KW   DNA damage; DNA-binding; Hydrolase; Isopeptide bond; Phosphoprotein;
KW   Protease; Reference proteome; Ubl conjugation.
FT   INIT_MET        1
FT                   /note="Removed"
FT                   /evidence="ECO:0000250|UniProtKB:Q8R1M0"
FT   CHAIN           2..353
FT                   /note="Abasic site processing protein HMCES"
FT                   /id="PRO_0000164397"
FT   REGION          292..353
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOTIF           332..338
FT                   /note="PIP-box"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   COMPBIAS        308..328
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        336..353
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   ACT_SITE        2
FT                   /note="Nucleophile"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   SITE            127
FT                   /note="Required for sensing abasic sites"
FT                   /evidence="ECO:0000250|UniProtKB:P76318"
FT   SITE            209
FT                   /note="Required to stabilize abasic sites"
FT                   /evidence="ECO:0000250|UniProtKB:P76318"
FT   MOD_RES         2
FT                   /note="Thiazolidine linkage to a ring-opened DNA abasic
FT                   site"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   MOD_RES         160
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   MOD_RES         294
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   MOD_RES         321
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        148
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        151
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        274
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        275
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        305
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        339
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
FT   CROSSLNK        342
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q96FZ2"
SQ   SEQUENCE   353 AA;  40222 MW;  E5B6978DF8A526EA CRC64;
     MCGRTSCHLP RDALTRACAY LDRQGRRQLP QWRDPDKYCP SYNKSPQSSS PVLLSRLHFE
     KDADSSDRII FPMRWGLVPS WFKESDPSKL QFNTSNCRSD TIMEKQSFKA PLGKGRRCVV
     LADGFYEWQR CQGTNQRQPY FIYFPQSKTE KSGENSGSDS LNNKEEVWDN WRLLTMAGIF
     DCWEPPKGER LYSYSIITVD SCRGLSDIHS RMPAILDGEE AVSKWLDFGE VSTQEALKLI
     HPIDNITFHP VSPVVNNSRN NTPECLAPAD LLVKKEPKAS GSSQRMMQWL ATKSPKKEVP
     DSPKKDASGL PQWSSQFLQK SPLPTKRGAS SSLLDRWLKQ EKEDEPVAKR PNS
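
The positional annotations in this entry (FT lines and the DOMAIN comment) can be cross-checked against the SQ block with a few lines of code. The following is a minimal sketch, not part of the UniProt record: the helper names (`clean_sequence`, `find_mismatches`) and the `EXPECTED` table are illustrative assumptions, and the expected residues are read off the annotations above (for example, phosphoserine implies Ser, and a Lys-Gly isopeptide cross-link implies Lys).

```python
# Sequence copied verbatim from the SQ block of this entry.
SQ_BLOCK = """
     MCGRTSCHLP RDALTRACAY LDRQGRRQLP QWRDPDKYCP SYNKSPQSSS PVLLSRLHFE
     KDADSSDRII FPMRWGLVPS WFKESDPSKL QFNTSNCRSD TIMEKQSFKA PLGKGRRCVV
     LADGFYEWQR CQGTNQRQPY FIYFPQSKTE KSGENSGSDS LNNKEEVWDN WRLLTMAGIF
     DCWEPPKGER LYSYSIITVD SCRGLSDIHS RMPAILDGEE AVSKWLDFGE VSTQEALKLI
     HPIDNITFHP VSPVVNNSRN NTPECLAPAD LLVKKEPKAS GSSQRMMQWL ATKSPKKEVP
     DSPKKDASGL PQWSSQFLQK SPLPTKRGAS SSLLDRWLKQ EKEDEPVAKR PNS
"""

# Expected one-letter residue at each annotated 1-based position.
# These pairings are inferred from the FT/CC annotations (an assumption of
# this sketch), not stated as a table anywhere in the entry.
EXPECTED = {
    2: "C",    # ACT_SITE nucleophile / thiazolidine linkage (catalytic cysteine)
    127: "E",  # SITE: Glu-127, sensing abasic sites
    209: "H",  # SITE: His-209, stabilizing abasic sites
    160: "S", 294: "S", 321: "S",          # MOD_RES: phosphoserine
    148: "K", 151: "K", 274: "K", 275: "K",  # CROSSLNK: Lys-Gly isopeptide
    305: "K", 339: "K", 342: "K",
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display in the SQ block."""
    return "".join(block.split())


def find_mismatches(seq: str, expected: dict) -> dict:
    """Return {position: (expected, found)} for every annotation that disagrees."""
    return {
        pos: (aa, seq[pos - 1])  # flat-file positions are 1-based
        for pos, aa in expected.items()
        if seq[pos - 1] != aa
    }


SEQ = clean_sequence(SQ_BLOCK)

if __name__ == "__main__":
    print("length:", len(SEQ))
    print("PIP-box (332..338):", SEQ[331:338])
    print("mismatches:", find_mismatches(SEQ, EXPECTED))
```

An empty mismatch dictionary indicates that the annotated positions are consistent with the sequence as printed; it says nothing about the biological evidence, which in this entry is inferred by similarity.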