ZN809_MOUSE
ID ZN809_MOUSE Reviewed; 402 AA.
AC G3X9G7; Q4KL58; Q8BIJ2;
DT 07-JAN-2015, integrated into UniProtKB/Swiss-Prot.
DT 16-NOV-2011, sequence version 1.
DT 03-AUG-2022, entry version 74.
DE RecName: Full=Zinc finger protein 809 {ECO:0000305};
GN Name=Zfp809 {ECO:0000312|MGI:MGI:2143362};
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC STRAIN=C57BL/6J {ECO:0000312|EMBL:BAC33696.1};
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Placenta {ECO:0000312|EMBL:AAH99418.1};
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [5]
RP FUNCTION.
RX PubMed=19270682; DOI=10.1038/nature07844;
RA Wolf D., Goff S.P.;
RT "Embryonic stem cells use ZFP809 to silence retroviral DNAs.";
RL Nature 458:1201-1204(2009).
CC -!- FUNCTION: Transcription factor specifically required to repress
CC retrotransposons in embryonic stem cells. Recognizes and binds
CC retroviral DNA sequences from a large subset of mammalian retroviruses
CC and retroelements and repress their expression by recruiting a
CC repressive complex containing TRIM28/KAP1 (PubMed:19270682).
CC {ECO:0000269|PubMed:19270682}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=G3X9G7-1; Sequence=Displayed;
CC Name=2;
CC IsoId=G3X9G7-2; Sequence=VSP_057362, VSP_057363;
CC -!- SIMILARITY: Belongs to the krueppel C2H2-type zinc-finger protein
CC family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK049344; BAC33696.1; -; mRNA.
DR EMBL; AC159308; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH466522; EDL25266.1; -; Genomic_DNA.
DR EMBL; BC099418; AAH99418.1; -; mRNA.
DR CCDS; CCDS40560.1; -. [G3X9G7-1]
DR CCDS; CCDS90519.1; -. [G3X9G7-2]
DR RefSeq; NP_001158096.1; NM_001164624.1. [G3X9G7-2]
DR RefSeq; NP_766351.3; NM_172763.3. [G3X9G7-1]
DR AlphaFoldDB; G3X9G7; -.
DR SASBDB; G3X9G7; -.
DR SMR; G3X9G7; -.
DR DIP; DIP-59749N; -.
DR IntAct; G3X9G7; 2.
DR STRING; 10090.ENSMUSP00000072286; -.
DR iPTMnet; G3X9G7; -.
DR PhosphoSitePlus; G3X9G7; -.
DR EPD; G3X9G7; -.
DR MaxQB; G3X9G7; -.
DR PaxDb; G3X9G7; -.
DR PeptideAtlas; G3X9G7; -.
DR PRIDE; G3X9G7; -.
DR ProteomicsDB; 275094; -. [G3X9G7-1]
DR ProteomicsDB; 275095; -. [G3X9G7-2]
DR DNASU; 235047; -.
DR Ensembl; ENSMUST00000072465; ENSMUSP00000072286; ENSMUSG00000057982. [G3X9G7-1]
DR Ensembl; ENSMUST00000215618; ENSMUSP00000151180; ENSMUSG00000057982. [G3X9G7-2]
DR GeneID; 235047; -.
DR KEGG; mmu:235047; -.
DR UCSC; uc009oof.2; mouse. [G3X9G7-2]
DR UCSC; uc009oog.2; mouse. [G3X9G7-1]
DR CTD; 235047; -.
DR MGI; MGI:2143362; Zfp809.
DR VEuPathDB; HostDB:ENSMUSG00000057982; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000153505; -.
DR HOGENOM; CLU_002678_0_7_1; -.
DR InParanoid; G3X9G7; -.
DR OMA; QICLEPF; -.
DR OrthoDB; 1318335at2759; -.
DR PhylomeDB; G3X9G7; -.
DR TreeFam; TF339585; -.
DR Reactome; R-MMU-212436; Generic Transcription Pathway.
DR BioGRID-ORCS; 235047; 1 hit in 73 CRISPR screens.
DR PRO; PR:G3X9G7; -.
DR Proteomes; UP000000589; Chromosome 9.
DR RNAct; G3X9G7; protein.
DR Bgee; ENSMUSG00000057982; Expressed in otolith organ and 221 other tissues.
DR ExpressionAtlas; G3X9G7; baseline and differential.
DR Genevisible; G3X9G7; MM.
DR GO; GO:0005634; C:nucleus; IDA:MGI.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0043565; F:sequence-specific DNA binding; IDA:MGI.
DR GO; GO:0045087; P:innate immune response; IDA:MGI.
DR GO; GO:0045869; P:negative regulation of single stranded viral RNA replication via double stranded DNA intermediate; IDA:MGI.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd07765; KRAB_A-box; 1.
DR InterPro; IPR001909; KRAB.
DR InterPro; IPR036051; KRAB_dom_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF01352; KRAB; 1.
DR Pfam; PF00096; zf-C2H2; 6.
DR SMART; SM00349; KRAB; 1.
DR SMART; SM00355; ZnF_C2H2; 7.
DR SUPFAM; SSF109640; SSF109640; 1.
DR SUPFAM; SSF57667; SSF57667; 4.
DR PROSITE; PS50805; KRAB; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.
PE 2: Evidence at transcript level;
KW Alternative splicing; Metal-binding; Nucleus; Reference proteome; Repeat;
KW Repressor; Transcription; Transcription regulation; Zinc; Zinc-finger.
FT CHAIN 1..402
FT /note="Zinc finger protein 809"
FT /id="PRO_0000431687"
FT DOMAIN 4..75
FT /note="KRAB"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00119"
FT ZN_FING 155..178
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 184..206
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 213..235
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 241..263
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 269..291
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 297..319
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 325..347
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 118..139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 350..354
FT /note="VTYFQ -> QYGDS (in isoform 2)"
FT /id="VSP_057362"
FT VAR_SEQ 355..402
FT /note="Missing (in isoform 2)"
FT /id="VSP_057363"
FT CONFLICT 141
FT /note="Q -> R (in Ref. 4; AAH99418)"
SQ SEQUENCE 402 AA; 46942 MW; D6CCCC6A3D2BD772 CRC64;
MGLVSFEDVA VDFTLEEWQD LDAAQRTLYR DVMLETYSSL VFLDPCIAKP KLIFNLERGF
GPWSLAEASS RSLPGVHNVS TLSDTSKKIP KTRLRQLRKT NQKTPSEDTI EAELKARQEV
SKGTTSRHRR APVKSLCRKS QRTKNQTSYN DGNLYECKDC EKVFCNNSTL IKHYRRTHNV
YKPYECDECS KMYYWKSDLT SHQKTHRQRK RIYECSECGK AFFRKSHLNA HERTHSGEKP
YECTECRKAF YYKSDLTRHK KTHLGEKPFK CEECKKAFSR KSKLAIHQKK HTGEKPYECT
ECKKAFSHQS QLTAHRIAHS SENPYECKEC NKSFHWKCQL TAHQKRHTGV TYFQEVVFQQ
ITVSDWTGNL SENGPHRPTW TWAYGIMDFV KAWSRCIIGG GL