ZSC20_HUMAN
ID ZSC20_HUMAN Reviewed; 1043 AA.
AC P17040; A8K2D0; B1ALI4; B1ALI5; B1ALI6; Q6ZN23; Q96FA9; Q96H84;
DT 01-AUG-1990, integrated into UniProtKB/Swiss-Prot.
DT 24-MAR-2009, sequence version 3.
DT 03-AUG-2022, entry version 203.
DE RecName: Full=Zinc finger and SCAN domain-containing protein 20;
DE AltName: Full=Zinc finger protein 31;
DE AltName: Full=Zinc finger protein 360;
DE AltName: Full=Zinc finger protein KOX29;
GN Name=ZSCAN20; Synonyms=KOX29, ZNF31, ZNF360;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 3), AND VARIANT
RP ASP-432.
RC TISSUE=Thalamus;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT ASP-432.
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 4), AND VARIANT
RP ASP-432.
RC TISSUE=Eye, and Muscle;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [5]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 875-930 (ISOFORMS 1/2/3).
RC TISSUE=Lymphoid tissue;
RX PubMed=2288909;
RA Thiesen H.-J.;
RT "Multiple genes encoding zinc finger domains are expressed in human T
RT cells.";
RL New Biol. 2:363-374(1990).
CC -!- FUNCTION: May be involved in transcriptional regulation.
CC -!- INTERACTION:
CC P17040-4; A0A0S2Z6X0: ZKSCAN4; NbExp=3; IntAct=EBI-16440054, EBI-16431094;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00187}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=1;
CC IsoId=P17040-1; Sequence=Displayed;
CC Name=2;
CC IsoId=P17040-2; Sequence=VSP_036735;
CC Name=3;
CC IsoId=P17040-3; Sequence=VSP_036738;
CC Name=4;
CC IsoId=P17040-4; Sequence=VSP_036736, VSP_036737, VSP_036739;
CC -!- SIMILARITY: Belongs to the krueppel C2H2-type zinc-finger protein
CC family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK131405; BAD18552.1; -; mRNA.
DR EMBL; AK290195; BAF82884.1; -; mRNA.
DR EMBL; AC115285; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL138837; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471059; EAX07454.1; -; Genomic_DNA.
DR EMBL; BC008827; AAH08827.1; -; mRNA.
DR EMBL; BC011404; AAH11404.1; -; mRNA.
DR EMBL; X52360; CAA36586.1; -; mRNA.
DR CCDS; CCDS41300.1; -. [P17040-1]
DR PIR; I37969; I37969.
DR RefSeq; NP_660281.2; NM_145238.3. [P17040-1]
DR RefSeq; XP_005271228.1; XM_005271171.3.
DR RefSeq; XP_006710937.1; XM_006710874.3.
DR RefSeq; XP_016857726.1; XM_017002237.1. [P17040-1]
DR RefSeq; XP_016857727.1; XM_017002238.1. [P17040-3]
DR AlphaFoldDB; P17040; -.
DR SMR; P17040; -.
DR BioGRID; 113408; 63.
DR IntAct; P17040; 55.
DR STRING; 9606.ENSP00000355053; -.
DR iPTMnet; P17040; -.
DR PhosphoSitePlus; P17040; -.
DR BioMuta; ZSCAN20; -.
DR DMDM; 229485383; -.
DR EPD; P17040; -.
DR jPOST; P17040; -.
DR MassIVE; P17040; -.
DR MaxQB; P17040; -.
DR PaxDb; P17040; -.
DR PeptideAtlas; P17040; -.
DR PRIDE; P17040; -.
DR ProteomicsDB; 53445; -. [P17040-1]
DR ProteomicsDB; 53446; -. [P17040-2]
DR ProteomicsDB; 53447; -. [P17040-3]
DR ProteomicsDB; 53448; -. [P17040-4]
DR ABCD; P17040; 4 sequenced antibodies.
DR Antibodypedia; 8548; 148 antibodies from 27 providers.
DR DNASU; 7579; -.
DR Ensembl; ENST00000361328.7; ENSP00000355053.3; ENSG00000121903.15. [P17040-1]
DR Ensembl; ENST00000373413.2; ENSP00000362512.1; ENSG00000121903.15. [P17040-4]
DR Ensembl; ENST00000684572.1; ENSP00000507139.1; ENSG00000121903.15. [P17040-1]
DR GeneID; 7579; -.
DR KEGG; hsa:7579; -.
DR MANE-Select; ENST00000684572.1; ENSP00000507139.1; NM_001377376.1; NP_001364305.1.
DR UCSC; uc001bxj.5; human. [P17040-1]
DR CTD; 7579; -.
DR DisGeNET; 7579; -.
DR GeneCards; ZSCAN20; -.
DR HGNC; HGNC:13093; ZSCAN20.
DR HPA; ENSG00000121903; Low tissue specificity.
DR MIM; 611315; gene.
DR neXtProt; NX_P17040; -.
DR OpenTargets; ENSG00000121903; -.
DR PharmGKB; PA37668; -.
DR VEuPathDB; HostDB:ENSG00000121903; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000161580; -.
DR HOGENOM; CLU_002678_88_0_1; -.
DR InParanoid; P17040; -.
DR OMA; EQEQWDV; -.
DR OrthoDB; 1318335at2759; -.
DR PhylomeDB; P17040; -.
DR TreeFam; TF337082; -.
DR PathwayCommons; P17040; -.
DR SignaLink; P17040; -.
DR BioGRID-ORCS; 7579; 228 hits in 1104 CRISPR screens.
DR GenomeRNAi; 7579; -.
DR Pharos; P17040; Tdark.
DR PRO; PR:P17040; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; P17040; protein.
DR Bgee; ENSG00000121903; Expressed in sperm and 114 other tissues.
DR Genevisible; P17040; HS.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd07936; SCAN; 1.
DR Gene3D; 1.10.4020.10; -; 1.
DR InterPro; IPR044822; Myb_DNA-bind_4.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR003309; SCAN_dom.
DR InterPro; IPR038269; SCAN_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF13837; Myb_DNA-bind_4; 2.
DR Pfam; PF02023; SCAN; 1.
DR Pfam; PF00096; zf-C2H2; 10.
DR SMART; SM00717; SANT; 2.
DR SMART; SM00431; SCAN; 1.
DR SMART; SM00355; ZnF_C2H2; 10.
DR SUPFAM; SSF57667; SSF57667; 6.
DR PROSITE; PS50804; SCAN_BOX; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 10.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 10.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Repeat; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..1043
FT /note="Zinc finger and SCAN domain-containing protein 20"
FT /id="PRO_0000047360"
FT DOMAIN 51..133
FT /note="SCAN box"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00187"
FT ZN_FING 710..732
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 738..760
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 766..788
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 794..816
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 875..897
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 903..925
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 931..953
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 959..981
FT /note="C2H2-type 8"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 987..1009
FT /note="C2H2-type 9"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1015..1037
FT /note="C2H2-type 10"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 30..49
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 661..692
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 835..873
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 30..46
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 25..90
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_036735"
FT VAR_SEQ 202..255
FT /note="Missing (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_036736"
FT VAR_SEQ 482..487
FT /note="AGVHWG -> GKNMGV (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_036737"
FT VAR_SEQ 482
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_036738"
FT VAR_SEQ 488..1043
FT /note="Missing (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_036739"
FT VARIANT 248
FT /note="D -> N (in dbSNP:rs34446695)"
FT /id="VAR_054799"
FT VARIANT 432
FT /note="Y -> D (in dbSNP:rs4403594)"
FT /evidence="ECO:0000269|PubMed:14702039,
FT ECO:0000269|PubMed:15489334, ECO:0000269|Ref.3"
FT /id="VAR_054800"
FT CONFLICT 100
FT /note="L -> P (in Ref. 1; BAF82884)"
FT /evidence="ECO:0000305"
FT CONFLICT 126
FT /note="V -> M (in Ref. 1; BAF82884)"
FT /evidence="ECO:0000305"
FT CONFLICT 740
FT /note="C -> R (in Ref. 1; BAD18552)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1043 AA; 117541 MW; 1598FB5F243773FE CRC64;
MAMALELQAQ ASPQPEPEEL LIVKLEEDSW GSESKLWEKD RGSVSGPEAS RQRFRQFQYR
DAAGPHEAFS QLWALCCRWL RPEIRLKEQI LELLVLEQFL TILPREVQTW VQARHPESGE
EAVALVEDWH RETRTAGQSG LELHTEETRP LKTGEEAQSF QLQPVDPWPE GQSQKKGVKN
TCPDLPNHLN AEVAPQPLKE SAVLTPRVPT LPKMGSVGDW EVTAESQEAL GPGKHAEKEL
CKDPPGDDCG NSVCLGVPVS KPSNTSEKEQ GPEFWGLSLI NSGKRSTADY SLDNEPAQAL
TWRDSRAWEE QYQWDVEDMK VSGVHWGYEE TKTFLAILSE SPFSEKLRTC HQNRQVYRAI
AEQLRARGFL RTLEQCRYRV KNLLRNYRKA KSSHPPGTCP FYEELEALVR ARTAIRATDG
PGEAVALPRL GYSDAEMDEQ EEGGWDPEEM AEDCNGAGLV NVESTQGPRI AGAPALFQSR
IAGVHWGYEE TKAFLAILSE SPFSEKLRTC HQNSQVYRAI AERLCALGFL RTLEQCRYRF
KNLLRSYRKA KSSHPPGTCP FYEELDSLMR ARAAVRAMGT VREAAGLPRC GQSSAETDAQ
EAWGEVANED AVKPSTLCPK APDMGFEMRH EDEDQISEQD IFEGLPGALS KCPTEAVCQP
LDWGEDSENE NEDEGQWGNP SQEQWQESSS EEDLEKLIDH QGLYLAEKPY KCDTCMKSFS
RSSHFIAHQR IHTGEKPYKC LECGKNFSDR SNLNTHQRIH TGEKPYKCLE CGKSFSDHSN
LITHQRIHTG EKPYKCGECW KSFNQSSNLL KHQRIHLGGN PDQCSEPGGN FAQSPSFSAH
WRNSTEETAP EQPQSISKDL NSPGPHSTNS GEKLYECSEC GRSFSKSSAL ISHQRIHTGE
KPYECAECGK SFSKSSTLAN HQRTHTGEKP YKCVDCGKCF SERSKLITHQ RVHTGEKPYK
CLECGKFFRD RSNLITHQRI HTGEKPYKCR ECGKCFNQSS SLIIHQRIHT GEKPYKCTEC
GKDFNNSSHF SAHRRTHAGG KAS