THAP3_HUMAN
ID THAP3_HUMAN Reviewed; 239 AA.
AC Q8WTV1; Q569K1; Q5TH66; Q5TH67; Q8N8T6; Q9BSC7; Q9Y3H2; Q9Y3H3;
DT 11-APR-2003, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2002, sequence version 1.
DT 03-AUG-2022, entry version 159.
DE RecName: Full=THAP domain-containing protein 3;
GN Name=THAP3;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
RC TISSUE=Teratocarcinoma;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 93-239 (ISOFORMS 1/2).
RC TISSUE=Brain, and Uterus;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [4]
RP INTERACTION WITH HCFC1 AND OGT, TISSUE SPECIFICITY, IDENTIFICATION BY MASS
RP SPECTROMETRY, FUNCTION, AND MUTAGENESIS OF 177-ASP--TYR-180.
RX PubMed=20200153; DOI=10.1074/jbc.m109.072579;
RA Mazars R., Gonzalez-de-Peredo A., Cayrol C., Lavigne A.C., Vogel J.L.,
RA Ortega N., Lacroix C., Gautier V., Huet G., Ray A., Monsarrat B.,
RA Kristie T.M., Girard J.P.;
RT "The THAP-zinc finger protein THAP1 associates with coactivator HCF-1 and
RT O-GlcNAc transferase: a link between DYT6 and DYT3 dystonias.";
RL J. Biol. Chem. 285:13364-13371(2010).
RN [5]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-122, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma, and Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
CC -!- FUNCTION: Component of a THAP1/THAP3-HCFC1-OGT complex that is required
CC for the regulation of the transcriptional activity of RRM1.
CC {ECO:0000269|PubMed:20200153}.
CC -!- SUBUNIT: Component of a THAP1/THAP3-HCFC1-OGT complex that contains at
CC least, either THAP1 or THAP3, HCFC1 and OGT. Interacts directly with
CC OGT and HCFC1 (via its HBM). {ECO:0000269|PubMed:20200153}.
CC -!- INTERACTION:
CC Q8WTV1; P46379-2: BAG6; NbExp=3; IntAct=EBI-17438286, EBI-10988864;
CC Q8WTV1; P28329-3: CHAT; NbExp=3; IntAct=EBI-17438286, EBI-25837549;
CC Q8WTV1; O75190-2: DNAJB6; NbExp=3; IntAct=EBI-17438286, EBI-12593112;
CC Q8WTV1; O14645: DNALI1; NbExp=3; IntAct=EBI-17438286, EBI-395638;
CC Q8WTV1; P15311: EZR; NbExp=3; IntAct=EBI-17438286, EBI-1056902;
CC Q8WTV1; P22607: FGFR3; NbExp=3; IntAct=EBI-17438286, EBI-348399;
CC Q8WTV1; Q14957: GRIN2C; NbExp=3; IntAct=EBI-17438286, EBI-8285963;
CC Q8WTV1; O14901: KLF11; NbExp=3; IntAct=EBI-17438286, EBI-948266;
CC Q8WTV1; Q13449: LSAMP; NbExp=3; IntAct=EBI-17438286, EBI-4314821;
CC Q8WTV1; P28331-2: NDUFS1; NbExp=3; IntAct=EBI-17438286, EBI-6190702;
CC Q8WTV1; P61970: NUTF2; NbExp=3; IntAct=EBI-17438286, EBI-591778;
CC Q8WTV1; Q16512: PKN1; NbExp=3; IntAct=EBI-17438286, EBI-602382;
CC Q8WTV1; P24928: POLR2A; NbExp=3; IntAct=EBI-17438286, EBI-295301;
CC Q8WTV1; P63000: RAC1; NbExp=3; IntAct=EBI-17438286, EBI-413628;
CC Q8WTV1; P14678-2: SNRPB; NbExp=3; IntAct=EBI-17438286, EBI-372475;
CC Q8WTV1; Q13148: TARDBP; NbExp=6; IntAct=EBI-17438286, EBI-372899;
CC Q8WTV1; P14679-2: TYR; NbExp=3; IntAct=EBI-17438286, EBI-25894402;
CC Q8WTV1; Q9BZL1: UBL5; NbExp=3; IntAct=EBI-17438286, EBI-607755;
CC Q8WTV1; Q9UMX0: UBQLN1; NbExp=3; IntAct=EBI-17438286, EBI-741480;
CC Q8WTV1; P14927: UQCRB; NbExp=3; IntAct=EBI-17438286, EBI-743128;
CC Q8WTV1; P31930: UQCRC1; NbExp=3; IntAct=EBI-17438286, EBI-1052596;
CC Q8WTV1; P61758: VBP1; NbExp=3; IntAct=EBI-17438286, EBI-357430;
CC Q8WTV1; Q9Y649; NbExp=3; IntAct=EBI-17438286, EBI-25900580;
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q8WTV1-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8WTV1-3; Sequence=VSP_015136;
CC Name=3;
CC IsoId=Q8WTV1-4; Sequence=VSP_015137, VSP_015138, VSP_015139;
CC -!- TISSUE SPECIFICITY: Highly expressed in heart, skeletal muscle and
CC placenta. Weaker expression in brain, kidney and liver.
CC {ECO:0000269|PubMed:20200153}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK096217; BAC04727.1; -; mRNA.
DR EMBL; AL031447; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC005114; AAH05114.1; -; mRNA.
DR EMBL; BC092427; AAH92427.1; -; mRNA.
DR CCDS; CCDS55572.1; -. [Q8WTV1-1]
DR CCDS; CCDS55573.1; -. [Q8WTV1-3]
DR CCDS; CCDS86.1; -. [Q8WTV1-4]
DR RefSeq; NP_001182681.1; NM_001195752.1. [Q8WTV1-3]
DR RefSeq; NP_001182682.1; NM_001195753.1. [Q8WTV1-1]
DR RefSeq; NP_612359.2; NM_138350.3. [Q8WTV1-4]
DR RefSeq; XP_005263589.1; XM_005263532.3. [Q8WTV1-3]
DR AlphaFoldDB; Q8WTV1; -.
DR SMR; Q8WTV1; -.
DR BioGRID; 124694; 81.
DR ELM; Q8WTV1; -.
DR IntAct; Q8WTV1; 24.
DR STRING; 9606.ENSP00000054650; -.
DR iPTMnet; Q8WTV1; -.
DR PhosphoSitePlus; Q8WTV1; -.
DR BioMuta; THAP3; -.
DR DMDM; 29839586; -.
DR EPD; Q8WTV1; -.
DR jPOST; Q8WTV1; -.
DR MassIVE; Q8WTV1; -.
DR MaxQB; Q8WTV1; -.
DR PaxDb; Q8WTV1; -.
DR PeptideAtlas; Q8WTV1; -.
DR PRIDE; Q8WTV1; -.
DR ProteomicsDB; 74606; -. [Q8WTV1-1]
DR ProteomicsDB; 74607; -. [Q8WTV1-3]
DR ProteomicsDB; 74608; -. [Q8WTV1-4]
DR Antibodypedia; 27449; 60 antibodies from 18 providers.
DR DNASU; 90326; -.
DR Ensembl; ENST00000054650.9; ENSP00000054650.4; ENSG00000041988.16. [Q8WTV1-1]
DR Ensembl; ENST00000307896.10; ENSP00000311537.6; ENSG00000041988.16. [Q8WTV1-3]
DR Ensembl; ENST00000377627.7; ENSP00000366854.3; ENSG00000041988.16. [Q8WTV1-4]
DR GeneID; 90326; -.
DR KEGG; hsa:90326; -.
DR MANE-Select; ENST00000054650.9; ENSP00000054650.4; NM_001195753.2; NP_001182682.1.
DR UCSC; uc001aoc.4; human. [Q8WTV1-1]
DR CTD; 90326; -.
DR DisGeNET; 90326; -.
DR GeneCards; THAP3; -.
DR HGNC; HGNC:20855; THAP3.
DR HPA; ENSG00000041988; Low tissue specificity.
DR MIM; 612532; gene.
DR neXtProt; NX_Q8WTV1; -.
DR OpenTargets; ENSG00000041988; -.
DR PharmGKB; PA134987111; -.
DR VEuPathDB; HostDB:ENSG00000041988; -.
DR eggNOG; ENOG502S14P; Eukaryota.
DR GeneTree; ENSGT00940000162344; -.
DR HOGENOM; CLU_076186_1_0_1; -.
DR InParanoid; Q8WTV1; -.
DR OMA; EMPKSCA; -.
DR OrthoDB; 1382095at2759; -.
DR PhylomeDB; Q8WTV1; -.
DR TreeFam; TF330127; -.
DR PathwayCommons; Q8WTV1; -.
DR SignaLink; Q8WTV1; -.
DR BioGRID-ORCS; 90326; 22 hits in 1098 CRISPR screens.
DR ChiTaRS; THAP3; human.
DR GenomeRNAi; 90326; -.
DR Pharos; Q8WTV1; Tdark.
DR PRO; PR:Q8WTV1; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; Q8WTV1; protein.
DR Bgee; ENSG00000041988; Expressed in apex of heart and 101 other tissues.
DR ExpressionAtlas; Q8WTV1; baseline and differential.
DR Genevisible; Q8WTV1; HS.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IDA:ARUK-UCL.
DR InterPro; IPR026520; THAP3.
DR InterPro; IPR006612; THAP_Znf.
DR PANTHER; PTHR47120; PTHR47120; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Metal-binding; Phosphoprotein;
KW Reference proteome; Zinc; Zinc-finger.
FT CHAIN 1..239
FT /note="THAP domain-containing protein 3"
FT /id="PRO_0000068644"
FT ZN_FING 1..82
FT /note="THAP-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00309"
FT REGION 84..177
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 177..180
FT /note="HCFC1-binding motif (HBM)"
FT COMPBIAS 84..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 122
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT VAR_SEQ 89
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000305"
FT /id="VSP_015136"
FT VAR_SEQ 111
FT /note="K -> KTSPCRSQ (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_015137"
FT VAR_SEQ 147..168
FT /note="VSPRRPQATEAVGRPTGPAGLR -> AMLFNVENGTPASREALWLSEE (in
FT isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_015138"
FT VAR_SEQ 169..239
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_015139"
FT MUTAGEN 177..180
FT /note="DHSY->AAAA: Abolishes interaction with HCFC1."
FT /evidence="ECO:0000269|PubMed:20200153"
FT MUTAGEN 178
FT /note="H->A: Abolishes interaction with HCFC1."
FT MUTAGEN 180
FT /note="Y->A: Abolishes interaction with HCFC1."
FT CONFLICT 227
FT /note="Q -> R (in Ref. 3; AAH05114/AAH92427)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 239 AA; 27059 MW; 9904C925DF233397 CRC64;
MPKSCAARQC CNRYSSRRKQ LTFHRFPFSR PELLKEWVLN IGRGNFKPKQ HTVICSEHFR
PECFSAFGNR KNLKHNAVPT VFAFQDPTQQ VRENTDPASE RGNASSSQKE KVLPEAGAGE
DSPGRNMDTA LEELQLPPNA EGHVKQVSPR RPQATEAVGR PTGPAGLRRT PNKQPSDHSY
ALLDLDSLKK KLFLTLKENE KLRKRLQAQR LVMRRMSSRL RACKGHQGLQ ARLGPEQQS