THAP1_MACFA
ID THAP1_MACFA Reviewed; 212 AA.
AC Q4R3Q6;
DT 22-NOV-2005, integrated into UniProtKB/Swiss-Prot.
DT 19-JUL-2005, sequence version 1.
DT 03-AUG-2022, entry version 67.
DE RecName: Full=THAP domain-containing protein 1;
GN Name=THAP1; ORFNames=QtsA-15008;
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Testis;
RG International consortium for macaque cDNA sequencing and analysis;
RT "DNA sequences of macaque genes expressed in brain or testis and its
RT evolutionary implications.";
RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: DNA-binding transcription regulator that regulates
CC endothelial cell proliferation and G1/S cell-cycle progression.
CC Specifically binds the 5'-[AT]NTNN[GT]GGCA[AGT]-3' core DNA sequence
CC and acts by modulating expression of pRB-E2F cell-cycle target genes,
CC including RRM1. Component of a THAP1/THAP3-HCFC1-OGT complex that is
CC required for the regulation of the transcriptional activity of RRM1.
CC May also have pro-apoptotic activity by potentiating both serum-
CC withdrawal and TNF-induced apoptosis (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: Interacts with PAWR. Component of a THAP1/THAP3-HCFC1-OGT
CC complex that contains either THAP1 or THAP3, plus HCFC1 and OGT. Interacts
CC with OGT. Interacts (via the HBM) with HCFC1 (via the Kelch-repeat
CC domain); the interaction recruits HCFC1 to the RRM1 promoter (By
CC similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm {ECO:0000250}. Nucleus, PML
CC body {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the THAP1 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB179209; BAE02260.1; -; mRNA.
DR RefSeq; NP_001271632.1; NM_001284703.1.
DR AlphaFoldDB; Q4R3Q6; -.
DR BMRB; Q4R3Q6; -.
DR SMR; Q4R3Q6; -.
DR STRING; 9541.XP_005563289.1; -.
DR GeneID; 101926823; -.
DR CTD; 55145; -.
DR eggNOG; KOG1721; Eukaryota.
DR OrthoDB; 1382095at2759; -.
DR Proteomes; UP000233100; Unplaced.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0016605; C:PML body; IEA:UniProtKB-SubCell.
DR GO; GO:0042803; F:protein homodimerization activity; ISS:UniProtKB.
DR GO; GO:0043565; F:sequence-specific DNA binding; ISS:UniProtKB.
DR GO; GO:0008270; F:zinc ion binding; ISS:UniProtKB.
DR GO; GO:0007049; P:cell cycle; IEA:UniProtKB-KW.
DR GO; GO:0001935; P:endothelial cell proliferation; ISS:UniProtKB.
DR GO; GO:0007346; P:regulation of mitotic cell cycle; ISS:UniProtKB.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:UniProtKB.
DR GO; GO:0006351; P:transcription, DNA-templated; ISS:UniProtKB.
DR Gene3D; 6.20.210.20; -; 1.
DR InterPro; IPR026516; THAP1.
DR InterPro; IPR006612; THAP_Znf.
DR InterPro; IPR038441; THAP_Znf_sf.
DR PANTHER; PTHR46600; PTHR46600; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 2: Evidence at transcript level;
KW Cell cycle; Coiled coil; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..212
FT /note="THAP domain-containing protein 1"
FT /id="PRO_0000068638"
FT ZN_FING 5..57
FT /note="THAP-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00309"
FT COILED 138..189
FT /evidence="ECO:0000250"
FT MOTIF 133..136
FT /note="HCFC1-binding motif (HBM)"
FT /evidence="ECO:0000250"
SQ SEQUENCE 212 AA; 24840 MW; CA8216B2068BBE70 CRC64;
MVQPCSAYGC KNRYDKDKPV SFHKFPLTRP SLCKEWEAAV RRKNFKPTKY SSICSEHFTP
DCFKRECNNK LLKENAVPTI FLCTEPHDKK EDLEPQEQLP PPPLPPPVSQ VDAAIGLLMP
PLQTPVNLSV FCDHNYTVED TMHQRKRIHQ LEQQVEKLRK KLKTAQQRCR RQERQLEKLK
EVVHFQKEKD NVSERGYVIL PNDYFEIVEV PA
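
The FT coordinates and the core DNA sequence quoted in the FUNCTION comment can be checked with a short script. The following is a minimal Python sketch and not part of the UniProt record: the helper names `feature` and `find_core_sites` are hypothetical, the sequence is copied by hand from the SQ block, FT ranges are assumed to be 1-based and inclusive, and `N` in the motif is assumed to mean any of A, C, G, T. The search covers only the strand it is given and reports non-overlapping matches.

```python
import re

# Sequence copied from the SQ block above (spaces and line breaks removed).
SEQUENCE = (
    "MVQPCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP"
    "DCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLEPQEQLPPPPLPPPVSQVDAAIGLLMP"
    "PLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLK"
    "EVVHFQKEKDNVSERGYVILPNDYFEIVEVPA"
)

# Core DNA sequence 5'-[AT]NTNN[GT]GGCA[AGT]-3' from the FUNCTION comment,
# written as a regular expression (assumption: N = any of A, C, G, T).
CORE_MOTIF = re.compile(r"[AT][ACGT]T[ACGT]{2}[GT]GGCA[AGT]")


def feature(start: int, end: int) -> str:
    """Return residues start..end, treating FT ranges as 1-based inclusive."""
    return SEQUENCE[start - 1:end]


def find_core_sites(dna: str) -> list[int]:
    """Return 0-based start offsets of core-motif matches on the given strand."""
    return [m.start() for m in CORE_MOTIF.finditer(dna.upper())]


if __name__ == "__main__":
    print(len(SEQUENCE))                       # length stated on the SQ line: 212
    print(feature(133, 136))                   # MOTIF 133..136 (HBM)
    print(len(feature(5, 57)))                 # span of ZN_FING 5..57
    print(find_core_sites("CCTATCAGGGCAGTT"))  # made-up test DNA, not real data
```

The DNA string in the last call is an invented example chosen to contain one match; it is not taken from any promoter. To scan both strands, the same function could be applied to the reverse complement of the input.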