ZN628_HUMAN
ID ZN628_HUMAN Reviewed; 1059 AA.
AC Q5EBL2; Q86X34;
DT 11-JUL-2006, integrated into UniProtKB/Swiss-Prot.
DT 03-APR-2013, sequence version 3.
DT 03-AUG-2022, entry version 143.
DE RecName: Full=Zinc finger protein 628;
GN Name=ZNF628;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15057824; DOI=10.1038/nature02399;
RA Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E.,
RA Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A.,
RA Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S.,
RA Carrano A.V., Caoile C., Chan Y.M., Christensen M., Cleland C.A.,
RA Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J.,
RA Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M.,
RA Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W.,
RA Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V.,
RA Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D.,
RA McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I.,
RA Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L.,
RA Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A.,
RA She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M.,
RA Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J.,
RA Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E.,
RA Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M.,
RA Rubin E.M., Lucas S.M.;
RT "The DNA sequence and biology of human chromosome 19.";
RL Nature 428:529-535(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Chondrosarcoma, and Liver;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-199 AND THR-589, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
CC -!- FUNCTION: Transcriptional activator. Binds DNA on GT-box consensus
CC sequence 5'-TTGGTT-3'. Plays a role in spermiogenesis.
CC {ECO:0000250|UniProtKB:Q8CJ78}.
CC -!- SUBUNIT: Interacts with TAF4B. {ECO:0000250|UniProtKB:Q8CJ78}.
CC -!- INTERACTION:
CC Q5EBL2; Q8IYI6: EXOC8; NbExp=3; IntAct=EBI-13086230, EBI-742102;
CC Q5EBL2; A6NEM1: GOLGA6L9; NbExp=3; IntAct=EBI-13086230, EBI-5916454;
CC Q5EBL2; Q96C03-3: MIEF2; NbExp=3; IntAct=EBI-13086230, EBI-11988931;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- CAUTION: It is uncertain whether Met-1 or Met-5 is the initiator.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH89449.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC008735; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC047332; AAH47332.1; -; mRNA.
DR EMBL; BC089449; AAH89449.1; ALT_INIT; mRNA.
DR CCDS; CCDS33116.3; -.
DR RefSeq; NP_149104.3; NM_033113.2.
DR AlphaFoldDB; Q5EBL2; -.
DR SMR; Q5EBL2; -.
DR BioGRID; 124638; 12.
DR IntAct; Q5EBL2; 4.
DR STRING; 9606.ENSP00000469591; -.
DR iPTMnet; Q5EBL2; -.
DR PhosphoSitePlus; Q5EBL2; -.
DR BioMuta; ZNF628; -.
DR DMDM; 476007835; -.
DR jPOST; Q5EBL2; -.
DR MassIVE; Q5EBL2; -.
DR MaxQB; Q5EBL2; -.
DR PaxDb; Q5EBL2; -.
DR PeptideAtlas; Q5EBL2; -.
DR PRIDE; Q5EBL2; -.
DR ProteomicsDB; 62760; -.
DR Antibodypedia; 50986; 20 antibodies from 10 providers.
DR DNASU; 89887; -.
DR Ensembl; ENST00000598519.2; ENSP00000469591.1; ENSG00000197483.10.
DR GeneID; 89887; -.
DR KEGG; hsa:89887; -.
DR MANE-Select; ENST00000598519.2; ENSP00000469591.1; NM_033113.3; NP_149104.3.
DR UCSC; uc002qld.3; human.
DR CTD; 89887; -.
DR DisGeNET; 89887; -.
DR GeneCards; ZNF628; -.
DR HGNC; HGNC:28054; ZNF628.
DR HPA; ENSG00000197483; Tissue enhanced (testis).
DR MIM; 610671; gene.
DR neXtProt; NX_Q5EBL2; -.
DR OpenTargets; ENSG00000197483; -.
DR PharmGKB; PA142670507; -.
DR VEuPathDB; HostDB:ENSG00000197483; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00910000144307; -.
DR InParanoid; Q5EBL2; -.
DR OMA; RPYLCLD; -.
DR OrthoDB; 1318335at2759; -.
DR PhylomeDB; Q5EBL2; -.
DR TreeFam; TF350841; -.
DR PathwayCommons; Q5EBL2; -.
DR SignaLink; Q5EBL2; -.
DR BioGRID-ORCS; 89887; 21 hits in 1095 CRISPR screens.
DR GenomeRNAi; 89887; -.
DR Pharos; Q5EBL2; Tdark.
DR PRO; PR:Q5EBL2; -.
DR Proteomes; UP000005640; Chromosome 19.
DR RNAct; Q5EBL2; protein.
DR Bgee; ENSG00000197483; Expressed in sperm and 129 other tissues.
DR ExpressionAtlas; Q5EBL2; baseline and differential.
DR Genevisible; Q5EBL2; HS.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IBA:GO_Central.
DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF00096; zf-C2H2; 12.
DR SMART; SM00355; ZnF_C2H2; 17.
DR SUPFAM; SSF57667; SSF57667; 9.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 16.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 17.
PE 1: Evidence at protein level;
KW DNA-binding; Metal-binding; Nucleus; Phosphoprotein; Reference proteome;
KW Repeat; Transcription; Transcription regulation; Zinc; Zinc-finger.
FT CHAIN 1..1059
FT /note="Zinc finger protein 628"
FT /id="PRO_0000246070"
FT REPEAT 818..831
FT /note="1"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT REPEAT 832..842
FT /note="2"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT REPEAT 843..853
FT /note="3"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT REPEAT 854..864
FT /note="4"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT ZN_FING 36..58
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 64..86
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 92..114
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 120..142
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 148..170
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 176..198
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 204..226
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 356..378
FT /note="C2H2-type 8"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 386..408
FT /note="C2H2-type 9"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 454..476
FT /note="C2H2-type 10"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 482..504
FT /note="C2H2-type 11"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 510..532
FT /note="C2H2-type 12"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 538..560
FT /note="C2H2-type 13"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 566..588
FT /note="C2H2-type 14"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 594..616
FT /note="C2H2-type 15"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 622..644
FT /note="C2H2-type 16"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 226..247
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 260..280
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 644..674
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 818..864
FT /note="4 X approximate tandem repeats"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT REGION 943..1059
FT /note="Interaction with TAF4B"
FT /evidence="ECO:0000250|UniProtKB:Q8CJ78"
FT COMPBIAS 262..280
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 318..351
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..669
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 199
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 589
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT CONFLICT 234
FT /note="T -> A (in Ref. 2; AAH89449)"
FT /evidence="ECO:0000305"
FT CONFLICT 439
FT /note="V -> A (in Ref. 2; AAH89449)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1059 AA; 110887 MW; F05F6265E9336D0C CRC64;
MSGVMVGSHA DMAPASTAEG AGEKPGPAAP APAAQYECGE CGKSFRWSSR LLHHQRTHTG
ERPYKCPDCP KAFKGSSALL YHQRGHTGER PYQCPDCPKA FKRSSLLQIH RSVHTGLRAF
ICGQCGLAFK WSSHYQYHLR QHTGERPYPC PDCPKAFKNS SSLRRHRHVH TGERPYTCGV
CGKSFTQSTN LRQHQRVHTG ERPFRCPLCP KTFTHSSNLL LHQRTHGAAP APGTASAAPP
PQSREPGKVF VCDAYLQRHL QPHSPPAPPA PPPPPPPVVP ELFLAAAETT VELVYRCDGC
EQGFSSEELL LEHQPCPGPD AAPQPQEAPA EAPKADQPPS PLPQPPPPAA APAPGFACLP
CGKSFRTVAG LSRHQHSHGA AGGQAFRCGS CDGSFPQLAS LLAHQQCHVE EAAAGRPPPQ
AEAAEVTCPQ EPLAPAAPVP PPPPSAPASA ERPYKCAECG KSFKGSSGLR YHLRDHTGER
PYQCGECGKA FKRSSLLAIH QRVHTGLRAF TCGQCGLTFK WSSHYQYHLR LHSGERPYAC
GECGKAFRNT SCLRRHRHVH TGERPHACGV CGKSFAQTSN LRQHQRVHTG ERPFRCPLCP
KTFTHSSNLL LHQRTHSAER PFTCPICGRG FVMAAYLQRH LRTHAPANTP PSTTAPAAGP
QPPAPLAAAR APPATQDVHV LPHLQATLSL EVAGGTAQAP SLGPAAPNSQ TFLLVQTAQG
LQLIPSSVQP PTPPPPPAPP KLILLPSSSA GAGGGRARQG PRAVGKAGQG AGVVWLPGPG
GLGVQGAASA GASGTGQSLI VLQNVGGGEA GPQEMSGVQL QPLRPAPEVT TVQLQPAQEV
TTVQLQPAQE VTTVQLQPAQ EVTTVQLQPV AGQLSNSSGG AVATEAPNLL VVQSGAAEEL
LTGPGPGEAG DGEASTGVVQ DVLFETLQTD EGLQSVLVLS GADGEQTRLC VQEVETLPPG
LTEPPATGPP GQKLLIIRSA PATELLDSSN TGGGTATLQL LAPPPSGPAS GPAGLPGAPA
SQMVQVVPAG AGPGVMTPQG LPSIQIVQTL PAVQLVHTF