THAP1_MOUSE
ID THAP1_MOUSE Reviewed; 210 AA.
AC Q8CHW1;
DT 11-APR-2003, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 136.
DE RecName: Full=THAP domain-containing protein 1;
GN Name=Thap1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Czech II; TISSUE=Lung;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [2]
RP TISSUE SPECIFICITY.
RX PubMed=20200153; DOI=10.1074/jbc.m109.072579;
RA Mazars R., Gonzalez-de-Peredo A., Cayrol C., Lavigne A.C., Vogel J.L.,
RA Ortega N., Lacroix C., Gautier V., Huet G., Ray A., Monsarrat B.,
RA Kristie T.M., Girard J.P.;
RT "The THAP-zinc finger protein THAP1 associates with coactivator HCF-1 and
RT O-GlcNAc transferase: a link between DYT6 and DYT3 dystonias.";
RL J. Biol. Chem. 285:13364-13371(2010).
CC -!- FUNCTION: DNA-binding transcription regulator that regulates
CC endothelial cell proliferation and G1/S cell-cycle progression.
CC Specifically binds the 5'-[AT]NTNN[GT]GGCA[AGT]-3' core DNA sequence
CC and acts by modulating expression of pRB-E2F cell-cycle target genes,
CC including RRM1. Component of a THAP1/THAP3-HCFC1-OGT complex that is
CC required for the regulation of the transcriptional activity of RRM1.
CC May also have pro-apoptotic activity by potentiating both serum-
CC withdrawal and TNF-induced apoptosis (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: Interacts with PAWR. Component of a THAP1/THAP3-HCFC1-OGT
CC complex that contains either THAP1 or THAP3, HCFC1 and OGT. Interacts
CC with OGT. Interacts (via the HBM) with HCFC1 (via the Kelch-repeat
CC domain); the interaction recruits HCFC1 to the RRM1 promoter (By
CC similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm {ECO:0000250}. Nucleus, PML
CC body {ECO:0000250}.
CC -!- TISSUE SPECIFICITY: Highest levels in heart, liver and kidney. Lower
CC levels in brain and lung. {ECO:0000269|PubMed:20200153}.
CC -!- SIMILARITY: Belongs to the THAP1 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC038639; AAH38639.1; -; mRNA.
DR CCDS; CCDS22208.1; -.
DR RefSeq; NP_950243.1; NM_199042.2.
DR AlphaFoldDB; Q8CHW1; -.
DR SMR; Q8CHW1; -.
DR STRING; 10090.ENSMUSP00000042464; -.
DR iPTMnet; Q8CHW1; -.
DR PhosphoSitePlus; Q8CHW1; -.
DR MaxQB; Q8CHW1; -.
DR PaxDb; Q8CHW1; -.
DR PeptideAtlas; Q8CHW1; -.
DR PRIDE; Q8CHW1; -.
DR ProteomicsDB; 262767; -.
DR Antibodypedia; 24159; 134 antibodies from 27 providers.
DR DNASU; 73754; -.
DR Ensembl; ENSMUST00000036807; ENSMUSP00000042464; ENSMUSG00000037214.
DR GeneID; 73754; -.
DR KEGG; mmu:73754; -.
DR UCSC; uc009lhq.2; mouse.
DR CTD; 55145; -.
DR MGI; MGI:1921004; Thap1.
DR VEuPathDB; HostDB:ENSMUSG00000037214; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000159383; -.
DR HOGENOM; CLU_076186_2_1_1; -.
DR InParanoid; Q8CHW1; -.
DR OMA; LMPPLHT; -.
DR OrthoDB; 1382095at2759; -.
DR PhylomeDB; Q8CHW1; -.
DR TreeFam; TF330127; -.
DR BioGRID-ORCS; 73754; 8 hits in 69 CRISPR screens.
DR ChiTaRS; Thap1; mouse.
DR PRO; PR:Q8CHW1; -.
DR Proteomes; UP000000589; Chromosome 8.
DR RNAct; Q8CHW1; protein.
DR Bgee; ENSMUSG00000037214; Expressed in cleaving embryo and 227 other tissues.
DR ExpressionAtlas; Q8CHW1; baseline and differential.
DR Genevisible; Q8CHW1; MM.
DR GO; GO:0001650; C:fibrillar center; ISO:MGI.
DR GO; GO:0043231; C:intracellular membrane-bounded organelle; ISO:MGI.
DR GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0016605; C:PML body; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IBA:GO_Central.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; ISO:MGI.
DR GO; GO:0042802; F:identical protein binding; ISO:MGI.
DR GO; GO:0042803; F:protein homodimerization activity; ISS:UniProtKB.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; ISO:MGI.
DR GO; GO:0043565; F:sequence-specific DNA binding; ISS:UniProtKB.
DR GO; GO:0008270; F:zinc ion binding; ISS:UniProtKB.
DR GO; GO:0007049; P:cell cycle; IEA:UniProtKB-KW.
DR GO; GO:0001935; P:endothelial cell proliferation; ISS:UniProtKB.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; ISO:MGI.
DR GO; GO:0007346; P:regulation of mitotic cell cycle; ISS:UniProtKB.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:UniProtKB.
DR GO; GO:0006351; P:transcription, DNA-templated; ISS:UniProtKB.
DR Gene3D; 6.20.210.20; -; 1.
DR InterPro; IPR026516; THAP1.
DR InterPro; IPR006612; THAP_Znf.
DR InterPro; IPR038441; THAP_Znf_sf.
DR PANTHER; PTHR46600; PTHR46600; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 2: Evidence at transcript level;
KW Cell cycle; Coiled coil; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..210
FT /note="THAP domain-containing protein 1"
FT /id="PRO_0000068639"
FT ZN_FING 5..57
FT /note="THAP-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00309"
FT COILED 137..187
FT /evidence="ECO:0000255"
FT MOTIF 131..134
FT /note="HCFC1-binding motif (HBM)"
FT /evidence="ECO:0000250"
SQ SEQUENCE 210 AA; 24611 MW; 2F8EE0E59FA01B3C CRC64;
MVQSCSAYGC KNRYDKDKPV SFHKFPLTRP SLCKQWEAAV KRKNFKPTKY SSICSEHFTP
DCFKRECNNK LLKENAVPTI FLYIEPHEKK EDLESQEQLP SPSPPASQVD AAIGLLMPPL
QTPDNLSVFC DHNYTVEDTM HQRKRILQLE QQVEKLRKKL KTAQQRCRRQ ERQLEKLKEV
VHFQREKDDA SERGYVILPN DYFEIVEVPA