SOX21_HUMAN
ID SOX21_HUMAN Reviewed; 276 AA.
AC Q9Y651; P35715; Q15504; Q5TBS1;
DT 30-MAY-2000, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1999, sequence version 1.
DT 03-AUG-2022, entry version 166.
DE RecName: Full=Transcription factor SOX-21;
DE AltName: Full=SOX-A;
GN Name=SOX21; Synonyms=SOX25, SOXA;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=10441749; DOI=10.1007/s003359901118;
RA Malas S., Duthie S., Deloukas P., Episkopou V.;
RT "The isolation and high-resolution chromosomal mapping of human SOX14 and
RT SOX21; two members of the SOX gene family related to SOX1, SOX2, and
RT SOX3.";
RL Mamm. Genome 10:934-937(1999).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15057823; DOI=10.1038/nature02379;
RA Dunham A., Matthews L.H., Burton J., Ashurst J.L., Howe K.L.,
RA Ashcroft K.J., Beare D.M., Burford D.C., Hunt S.E., Griffiths-Jones S.,
RA Jones M.C., Keenan S.J., Oliver K., Scott C.E., Ainscough R., Almeida J.P.,
RA Ambrose K.D., Andrews D.T., Ashwell R.I.S., Babbage A.K., Bagguley C.L.,
RA Bailey J., Bannerjee R., Barlow K.F., Bates K., Beasley H., Bird C.P.,
RA Bray-Allen S., Brown A.J., Brown J.Y., Burrill W., Carder C., Carter N.P.,
RA Chapman J.C., Clamp M.E., Clark S.Y., Clarke G., Clee C.M., Clegg S.C.,
RA Cobley V., Collins J.E., Corby N., Coville G.J., Deloukas P., Dhami P.,
RA Dunham I., Dunn M., Earthrowl M.E., Ellington A.G., Faulkner L.,
RA Frankish A.G., Frankland J., French L., Garner P., Garnett J.,
RA Gilbert J.G.R., Gilson C.J., Ghori J., Grafham D.V., Gribble S.M.,
RA Griffiths C., Hall R.E., Hammond S., Harley J.L., Hart E.A., Heath P.D.,
RA Howden P.J., Huckle E.J., Hunt P.J., Hunt A.R., Johnson C., Johnson D.,
RA Kay M., Kimberley A.M., King A., Laird G.K., Langford C.J., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Lloyd C., Loveland J.E., Lovell J.,
RA Martin S., Mashreghi-Mohammadi M., McLaren S.J., McMurray A., Milne S.,
RA Moore M.J.F., Nickerson T., Palmer S.A., Pearce A.V., Peck A.I., Pelan S.,
RA Phillimore B., Porter K.M., Rice C.M., Searle S., Sehra H.K., Shownkeen R.,
RA Skuce C.D., Smith M., Steward C.A., Sycamore N., Tester J., Thomas D.W.,
RA Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P.,
RA Whitehead S.L., Willey D.L., Wilming L., Wray P.W., Wright M.W., Young L.,
RA Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Beck S., Bentley D.R.,
RA Rogers J., Ross M.T.;
RT "The DNA sequence and analysis of human chromosome 13.";
RL Nature 428:522-528(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 12-87.
RC TISSUE=Spinal cord;
RA Stevanovic M.;
RL Submitted (APR-1993) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 19-72.
RX PubMed=1614875; DOI=10.1093/nar/20.11.2887;
RA Denny P., Swift S., Brand N., Dabhade N., Barton P., Ashworth A.;
RT "A conserved family of genes related to the testis determining gene, SRY.";
RL Nucleic Acids Res. 20:2887-2887(1992).
CC -!- FUNCTION: May play a role as an activator of transcription of OPRM1.
CC {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00267}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF107044; AAC95381.1; -; Genomic_DNA.
DR EMBL; AL137061; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471085; EAX08946.1; -; Genomic_DNA.
DR EMBL; X71136; CAA50466.1; -; mRNA.
DR EMBL; X65666; CAA46617.1; -; mRNA.
DR CCDS; CCDS9473.1; -.
DR PIR; I38238; I38238.
DR PIR; S22937; S22937.
DR RefSeq; NP_009015.1; NM_007084.3.
DR AlphaFoldDB; Q9Y651; -.
DR SMR; Q9Y651; -.
DR BioGRID; 116337; 18.
DR IntAct; Q9Y651; 2.
DR STRING; 9606.ENSP00000366144; -.
DR iPTMnet; Q9Y651; -.
DR PhosphoSitePlus; Q9Y651; -.
DR BioMuta; SOX21; -.
DR DMDM; 6831690; -.
DR EPD; Q9Y651; -.
DR jPOST; Q9Y651; -.
DR MassIVE; Q9Y651; -.
DR MaxQB; Q9Y651; -.
DR PaxDb; Q9Y651; -.
DR PeptideAtlas; Q9Y651; -.
DR PRIDE; Q9Y651; -.
DR ProteomicsDB; 86600; -.
DR Antibodypedia; 24798; 198 antibodies from 30 providers.
DR DNASU; 11166; -.
DR Ensembl; ENST00000376945.4; ENSP00000366144.2; ENSG00000125285.6.
DR GeneID; 11166; -.
DR KEGG; hsa:11166; -.
DR MANE-Select; ENST00000376945.4; ENSP00000366144.2; NM_007084.4; NP_009015.1.
DR UCSC; uc001vma.4; human.
DR CTD; 11166; -.
DR DisGeNET; 11166; -.
DR GeneCards; SOX21; -.
DR HGNC; HGNC:11197; SOX21.
DR HPA; ENSG00000125285; Group enriched (brain, esophagus, skin, stomach).
DR MIM; 604974; gene.
DR neXtProt; NX_Q9Y651; -.
DR OpenTargets; ENSG00000125285; -.
DR PharmGKB; PA36034; -.
DR VEuPathDB; HostDB:ENSG00000125285; -.
DR eggNOG; KOG0527; Eukaryota.
DR GeneTree; ENSGT00940000162795; -.
DR HOGENOM; CLU_021123_3_1_1; -.
DR InParanoid; Q9Y651; -.
DR OMA; SDSIMGH; -.
DR OrthoDB; 1335452at2759; -.
DR PhylomeDB; Q9Y651; -.
DR TreeFam; TF351735; -.
DR PathwayCommons; Q9Y651; -.
DR SignaLink; Q9Y651; -.
DR BioGRID-ORCS; 11166; 7 hits in 1090 CRISPR screens.
DR ChiTaRS; SOX21; human.
DR GeneWiki; SOX21; -.
DR GenomeRNAi; 11166; -.
DR Pharos; Q9Y651; Tbio.
DR PRO; PR:Q9Y651; -.
DR Proteomes; UP000005640; Chromosome 13.
DR RNAct; Q9Y651; protein.
DR Bgee; ENSG00000125285; Expressed in ventricular zone and 54 other tissues.
DR Genevisible; Q9Y651; HS.
DR GO; GO:0000785; C:chromatin; ISA:NTNU_SB.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; ISS:UniProtKB.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IEA:Ensembl.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; ISS:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISA:NTNU_SB.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR GO; GO:0001942; P:hair follicle development; IEA:Ensembl.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; NAS:UniProtKB.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:UniProtKB.
DR GO; GO:0048863; P:stem cell differentiation; IDA:UniProtKB.
DR Gene3D; 1.10.30.10; -; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022097; SOX_fam.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12336; SOXp; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; SSF47095; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 2: Evidence at transcript level;
KW Activator; DNA-binding; Nucleus; Reference proteome; Transcription;
KW Transcription regulation.
FT CHAIN 1..276
FT /note="Transcription factor SOX-21"
FT /id="PRO_0000048770"
FT DNA_BIND 8..76
FT /note="HMG box"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00267"
FT VARIANT 230
FT /note="G -> R (in dbSNP:rs6492735)"
FT /id="VAR_049562"
FT CONFLICT 41
FT /note="R -> P (in Ref. 4; CAA50466)"
FT /evidence="ECO:0000305"
FT CONFLICT 83
FT /note="P -> T (in Ref. 4; CAA50466)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 276 AA; 28580 MW; 99DC899B7EC9A96B CRC64;
MSKPVDHVKR PMNAFMVWSR AQRRKMAQEN PKMHNSEISK RLGAEWKLLT ESEKRPFIDE
AKRLRAMHMK EHPDYKYRPR RKPKTLLKKD KFAFPVPYGL GGVADAEHPA LKAGAGLHAG
AGGGLVPESL LANPEKAAAA AAAAAARVFF PQSAAAAAAA AAAAAAGSPY SLLDLGSKMA
EISSSSSGLP YASSLGYPTA GAGAFHGAAA AAAAAAAAAG GHTHSHPSPG NPGYMIPCNC
SAWPSPGLQP PLAYILLPGM GKPQLDPYPA AYAAAL