THAP1_DANRE
ID THAP1_DANRE Reviewed; 225 AA.
AC Q1JPT7;
DT 24-MAR-2009, integrated into UniProtKB/Swiss-Prot.
DT 24-MAR-2009, sequence version 2.
DT 03-AUG-2022, entry version 81.
DE RecName: Full=THAP domain-containing protein 1;
GN Name=thap1; ORFNames=zgc:136597;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Intestine;
RA Mathavan S., Yao F., Wong E., Thoreau H., Nayudu M., Govindarajan K.R.,
RA Ruan Y., Wei C.;
RT "Genome institute of Singapore, zebrafish transcriptome characterization.";
RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Heart;
RG NIH - Zebrafish Gene Collection (ZGC) project;
RL Submitted (MAY-2006) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: DNA-binding transcription regulator that regulates
CC endothelial cell proliferation and G1/S cell-cycle progression.
CC Specifically binds the 5'-[AT]NTNN[GT]GGCA[AGT]-3' core DNA sequence
CC and acts by modulating expression of pRB-E2F cell-cycle target genes
CC (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleoplasm {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the THAP1 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAI16603.1; Type=Miscellaneous discrepancy; Note=Contaminating sequence. Sequence of unknown origin.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; EH532428; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BC116602; AAI16603.1; ALT_SEQ; mRNA.
DR RefSeq; NP_001038749.1; NM_001045284.1.
DR AlphaFoldDB; Q1JPT7; -.
DR SMR; Q1JPT7; -.
DR STRING; 7955.ENSDARP00000076434; -.
DR PaxDb; Q1JPT7; -.
DR Ensembl; ENSDART00000181692; ENSDARP00000153042; ENSDARG00000059020.
DR GeneID; 692315; -.
DR KEGG; dre:692315; -.
DR CTD; 55145; -.
DR ZFIN; ZDB-GENE-060519-9; thap1.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000164630; -.
DR HOGENOM; CLU_076186_2_1_1; -.
DR InParanoid; Q1JPT7; -.
DR OMA; LMPPLHT; -.
DR OrthoDB; 1382095at2759; -.
DR PhylomeDB; Q1JPT7; -.
DR TreeFam; TF330127; -.
DR PRO; PR:Q1JPT7; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Chromosome 5.
DR Bgee; ENSDARG00000059020; Expressed in muscle tissue and 21 other tissues.
DR ExpressionAtlas; Q1JPT7; baseline.
DR GO; GO:0005654; C:nucleoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0043565; F:sequence-specific DNA binding; ISS:UniProtKB.
DR GO; GO:0007049; P:cell cycle; IEA:UniProtKB-KW.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR Gene3D; 6.20.210.20; -; 1.
DR InterPro; IPR026516; THAP1.
DR InterPro; IPR006612; THAP_Znf.
DR InterPro; IPR038441; THAP_Znf_sf.
DR PANTHER; PTHR46600; PTHR46600; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 2: Evidence at transcript level;
KW Cell cycle; Coiled coil; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..225
FT /note="THAP domain-containing protein 1"
FT /id="PRO_0000367843"
FT ZN_FING 5..57
FT /note="THAP-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00309"
FT COILED 149..196
FT /evidence="ECO:0000255"
SQ SEQUENCE 225 AA; 25707 MW; 198E73C1265EA99A CRC64;
MVQSCSAYGC KNRYQKDRNI SFHKFPLARP EVCVQWVSAM SRRNFKPTKY SNICSQHFTS
DCFKQECNNR VLKDNAVPSL FTLQTQDPFS ADVCFPLNVC ATAEPLSECF PEQCGLPDGQ
EAGAVSCPEQ CVPPGGQEAG AVSCDHNYTL EDCVQQKRRV QRLQEQMEKL RRRMKTLQQK
CRRQERQLER LRANRGPAPL GDRYVILPRE LYEELQGVET IGAVH