ZN229_HUMAN
ID ZN229_HUMAN Reviewed; 825 AA.
AC Q9UJW7; B2RWN3; Q59FV2; Q86WL9;
DT 01-DEC-2000, integrated into UniProtKB/Swiss-Prot.
DT 23-FEB-2022, sequence version 4.
DT 03-AUG-2022, entry version 180.
DE RecName: Full=Zinc finger protein 229 {ECO:0000305};
GN Name=ZNF229 {ECO:0000312|HGNC:HGNC:13022};
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15057824; DOI=10.1038/nature02399;
RA Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E.,
RA Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A.,
RA Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S.,
RA Carrano A.V., Caoile C., Chan Y.M., Christensen M., Cleland C.A.,
RA Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J.,
RA Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M.,
RA Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W.,
RA Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V.,
RA Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D.,
RA McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I.,
RA Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L.,
RA Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A.,
RA She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M.,
RA Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J.,
RA Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E.,
RA Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M.,
RA Rubin E.M., Lucas S.M.;
RT "The DNA sequence and biology of human chromosome 19.";
RL Nature 428:529-535(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Brain;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 1-420 (ISOFORM 1), AND VARIANT SER-156.
RC TISSUE=Brain;
RX PubMed=12743021; DOI=10.1101/gr.963903;
RA Shannon M., Hamilton A.T., Gordon L., Branscomb E., Stubbs L.;
RT "Differential expansion of zinc-finger transcription factor loci in
RT homologous human and mouse gene clusters.";
RL Genome Res. 13:1097-1110(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 397-825, AND VARIANT ARG-662.
RC TISSUE=Brain;
RA Totoki Y., Toyoda A., Takeda T., Sakaki Y., Tanaka A., Yokoyama S.,
RA Ohara O., Nagase T., Kikuno R.F.;
RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP SUMOYLATION [LARGE SCALE ANALYSIS] AT LYS-543, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=28112733; DOI=10.1038/nsmb.3366;
RA Hendriks I.A., Lyon D., Young C., Jensen L.J., Vertegaal A.C.,
RA Nielsen M.L.;
RT "Site-specific mapping of the human SUMO proteome reveals co-modification
RT with phosphorylation.";
RL Nat. Struct. Mol. Biol. 24:325-336(2017).
CC -!- FUNCTION: May be involved in transcriptional regulation.
CC -!- INTERACTION:
CC Q9UJW7; Q9NP86: CABP5; NbExp=3; IntAct=EBI-12068564, EBI-10311131;
CC Q9UJW7; P07951-2: TPM2; NbExp=3; IntAct=EBI-12068564, EBI-10977815;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q9UJW7-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9UJW7-2; Sequence=VSP_054778;
CC -!- SIMILARITY: Belongs to the krueppel C2H2-type zinc-finger protein
CC family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAG23970.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC084239; AAG23970.1; ALT_SEQ; Genomic_DNA.
DR EMBL; BC150612; AAI50613.1; -; mRNA.
DR EMBL; AF192979; AAF07964.1; -; mRNA.
DR EMBL; AY166791; AAO45840.1; -; mRNA.
DR EMBL; AB209357; BAD92594.1; -; mRNA.
DR CCDS; CCDS42574.1; -. [Q9UJW7-1]
DR CCDS; CCDS62706.1; -. [Q9UJW7-2]
DR RefSeq; NP_055333.3; NM_014518.3.
DR AlphaFoldDB; Q9UJW7; -.
DR BioGRID; 113555; 3.
DR IntAct; Q9UJW7; 4.
DR STRING; 9606.ENSP00000479884; -.
DR iPTMnet; Q9UJW7; -.
DR PhosphoSitePlus; Q9UJW7; -.
DR BioMuta; ZNF229; -.
DR DMDM; 296453070; -.
DR REPRODUCTION-2DPAGE; Q9UJW7; -.
DR jPOST; Q9UJW7; -.
DR MassIVE; Q9UJW7; -.
DR MaxQB; Q9UJW7; -.
DR PaxDb; Q9UJW7; -.
DR PeptideAtlas; Q9UJW7; -.
DR PRIDE; Q9UJW7; -.
DR ProteomicsDB; 84674; -. [Q9UJW7-1]
DR Antibodypedia; 72498; 65 antibodies from 14 providers.
DR DNASU; 7772; -.
DR Ensembl; ENST00000613197.4; ENSP00000479807.1; ENSG00000278318.5. [Q9UJW7-2]
DR Ensembl; ENST00000614049.5; ENSP00000479884.1; ENSG00000278318.5. [Q9UJW7-1]
DR GeneID; 7772; -.
DR KEGG; hsa:7772; -.
DR MANE-Select; ENST00000614049.5; ENSP00000479884.1; NM_014518.4; NP_055333.3.
DR UCSC; uc032hzj.2; human. [Q9UJW7-1]
DR CTD; 7772; -.
DR GeneCards; ZNF229; -.
DR HGNC; HGNC:13022; ZNF229.
DR HPA; ENSG00000278318; Low tissue specificity.
DR neXtProt; NX_Q9UJW7; -.
DR OpenTargets; ENSG00000278318; -.
DR PharmGKB; PA37601; -.
DR VEuPathDB; HostDB:ENSG00000278318; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000163513; -.
DR HOGENOM; CLU_002678_17_1_1; -.
DR InParanoid; Q9UJW7; -.
DR OrthoDB; 1318335at2759; -.
DR PhylomeDB; Q9UJW7; -.
DR PathwayCommons; Q9UJW7; -.
DR BioGRID-ORCS; 7772; 12 hits in 1087 CRISPR screens.
DR GenomeRNAi; 7772; -.
DR Pharos; Q9UJW7; Tdark.
DR PRO; PR:Q9UJW7; -.
DR Proteomes; UP000005640; Chromosome 19.
DR RNAct; Q9UJW7; protein.
DR Bgee; ENSG00000278318; Expressed in adrenal tissue and 115 other tissues.
DR ExpressionAtlas; Q9UJW7; baseline and differential.
DR Genevisible; Q9UJW7; HS.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd07765; KRAB_A-box; 1.
DR InterPro; IPR001909; KRAB.
DR InterPro; IPR036051; KRAB_dom_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF01352; KRAB; 1.
DR Pfam; PF00096; zf-C2H2; 15.
DR SMART; SM00349; KRAB; 1.
DR SMART; SM00355; ZnF_C2H2; 17.
DR SUPFAM; SSF109640; SSF109640; 1.
DR SUPFAM; SSF57667; SSF57667; 10.
DR PROSITE; PS50805; KRAB; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 16.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 17.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Isopeptide bond; Metal-binding; Nucleus;
KW Reference proteome; Repeat; Transcription; Transcription regulation;
KW Ubl conjugation; Zinc; Zinc-finger.
FT CHAIN 1..825
FT /note="Zinc finger protein 229"
FT /id="PRO_0000047470"
FT DOMAIN 34..108
FT /note="KRAB"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00119"
FT ZN_FING 291..315
FT /note="C2H2-type 1; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 349..371
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 377..399
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 405..427
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 433..455
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 461..483
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 489..511
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 517..539
FT /note="C2H2-type 8"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 545..566
FT /note="C2H2-type 9"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 572..594
FT /note="C2H2-type 10"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 600..622
FT /note="C2H2-type 11"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 628..650
FT /note="C2H2-type 12"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 656..678
FT /note="C2H2-type 13"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 684..706
FT /note="C2H2-type 14"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 712..734
FT /note="C2H2-type 15"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 740..762
FT /note="C2H2-type 16"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 768..790
FT /note="C2H2-type 17"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 796..818
FT /note="C2H2-type 18"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CROSSLNK 543
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0007744|PubMed:28112733"
FT VAR_SEQ 75..80
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000305"
FT /id="VSP_054778"
FT VARIANT 156
FT /note="F -> S (in dbSNP:rs2571174)"
FT /evidence="ECO:0000269|PubMed:12743021"
FT /id="VAR_057408"
FT VARIANT 337
FT /note="R -> C (in dbSNP:rs12151338)"
FT /id="VAR_057409"
FT VARIANT 417
FT /note="S -> N (in dbSNP:rs57014690)"
FT /id="VAR_061942"
FT VARIANT 662
FT /note="G -> R (in dbSNP:rs1434579)"
FT /evidence="ECO:0000269|Ref.4"
FT /id="VAR_060426"
FT VARIANT 804
FT /note="G -> R (in dbSNP:rs10409807)"
FT /id="VAR_060427"
SQ SEQUENCE 825 AA; 93767 MW; 01058F57650097BC CRC64;
METLTSRHEK RALHSQASAI SQDREEKIMS QEPLSFKDVA VVFTEEELEL LDSTQRQLYQ
DVMQENFRNL LSVGERNPLG DKNGKDTEYI QDEELRFFSH KELSSCKIWE EVAGELPGSQ
DCRVNLQGKD FQFSEDAAPH QGWEGASTPC FPIENFLDSL QGDGLIGLEN QQFPAWRAIR
PIPIQGSWAK AFVNQLGDVQ ERCKNLDTED TVYKCNWDDD SFCWISCHVD HRFPEIDKPC
GCNKCRKDCI KNSVLHRINP GENGLKSNEY RNGFRDDADL PPHPRVPLKE KLCQYDEFSE
GLRHSAHLNR HQRVPTGEKS VKSLERGRGV RQNTHIRNHP RAPVGDMPYR CDVCGKGFRY
KSVLLIHQGV HTGRRPYKCE ECGKAFGRSS NLLVHQRVHT GEKPYKCSEC GKGFSYSSVL
QVHQRLHTGE KPYTCSECGK GFCAKSALHK HQHIHPGEKP YSCGECGKGF SCSSHLSSHQ
KTHTGERPYQ CDKCGKGFSH NSYLQAHQRV HMGQHLYKCN VCGKSFSYSS GLLMHQRLHT
GEKPYKCECG KSFGRSSDLH IHQRVHTGEK PYKCSECGKG FRRNSDLHSH QRVHTGERPY
VCDVCGKGFI YSSDLLIHQR VHTGEKPYKC AECGKGFSYS SGLLIHQRVH TGEKPYRCQE
CGKGFRCTSS LHKHQRVHTG KKPYTCDQCG KGFSYGSNLR THQRLHTGEK PYTCCECGKG
FRYGSGLLSH KRVHTGEKPY RCHVCGKGYS QSSHLQGHQR VHTGEKPYKC EECGKGFGRN
SCLHVHQRVH TGEKPYTCGV CGKGFSYTSG LRNHQRVHLG ENPYK