SOX8_XENLA
ID SOX8_XENLA Reviewed; 459 AA.
AC Q6VVD7;
DT 16-JUN-2009, integrated into UniProtKB/Swiss-Prot.
DT 05-JUL-2004, sequence version 1.
DT 03-AUG-2022, entry version 78.
DE RecName: Full=Transcription factor Sox-8 {ECO:0000250|UniProtKB:P57073};
GN Name=sox8;
OS Xenopus laevis (African clawed frog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Xenopus.
OX NCBI_TaxID=8355;
RN [1] {ECO:0000305, ECO:0000312|EMBL:AAQ67212.1}
RP NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, TISSUE SPECIFICITY, AND DISRUPTION
RP PHENOTYPE.
RC TISSUE=Neurula {ECO:0000269|PubMed:16943273};
RX PubMed=16943273; DOI=10.1242/dev.02558;
RA O'Donnell M., Hong C.-S., Huang X., Delnicki R.J., Saint-Jeannet J.-P.;
RT "Functional analysis of Sox8 during neural crest development in Xenopus.";
RL Development 133:3817-3826(2006).
RN [2]
RP ERRATUM OF PUBMED:16943273.
RA O'Donnell M., Hong C.-S., Huang X., Delnicki R.J., Saint-Jeannet J.-P.;
RL Development 133:3950-3950(2006).
RN [3] {ECO:0000312|EMBL:AAI69521.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RG NIH - Xenopus Gene Collection (XGC) project;
RL Submitted (NOV-2008) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Transcription factor. Acts early in neural crest formation,
CC functioning redundantly with the other group E Sox factors sox9 and
CC sox10 to induce neural crest progenitors. Regulates the onset of
CC expression of many neural crest marker genes including sox10, and
CC regulates the development of multiple neural crest derivatives. May be
CC required to regulate neural crest cell migration.
CC {ECO:0000269|PubMed:16943273}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00267}.
CC -!- TISSUE SPECIFICITY: From gastrula to neural stages, expressed in a
CC ventrolateral domain around the blastopore. A second domain of
CC expression appears at mid-gastrula stage (stage 11.5) lateral to the
CC neural plate, in the presumptive neural crest. At neurula stage (stage
CC 15), also expressed in the prospective cement gland. As development
CC proceeds, expression persists in migrating cranial crest cells as they
CC populate the pharyngeal arches, and in trunk neural crest cells. Not
CC expressed early in the otic placode, with otic expression only
CC beginning around stage 30. {ECO:0000269|PubMed:16943273}.
CC -!- DOMAIN: The transactivation domains TAM and TAC (for transactivation
CC domain in the middle and at the C-terminus, respectively) are required
CC to contact transcriptional coactivators and basal transcriptional
CC machinery components and thereby induce gene transactivation.
CC {ECO:0000250|UniProtKB:P48436}.
CC -!- DOMAIN: The 9aaTAD motif is a transactivation domain present in a large
CC number of yeast and animal transcription factors.
CC {ECO:0000250|UniProtKB:P57073}.
CC -!- DISRUPTION PHENOTYPE: Impaired neural crest cell migration and delay in
CC neural crest formation leading to severe defects in multiple lineages
CC of the neural crest. {ECO:0000269|PubMed:16943273}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY324658; AAQ67212.1; -; mRNA.
DR EMBL; BC169521; AAI69521.1; -; mRNA.
DR EMBL; BC169525; AAI69525.1; -; mRNA.
DR RefSeq; NP_001083964.1; NM_001090495.1.
DR AlphaFoldDB; Q6VVD7; -.
DR SMR; Q6VVD7; -.
DR GeneID; 399214; -.
DR KEGG; xla:399214; -.
DR CTD; 399214; -.
DR Xenbase; XB-GENE-480869; sox8.L.
DR OrthoDB; 782373at2759; -.
DR Proteomes; UP000186698; Chromosome 9_10L.
DR Bgee; 399214; Expressed in internal ear and 15 other tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0014029; P:neural crest formation; IMP:UniProtKB.
DR Gene3D; 1.10.30.10; -; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR031265; SOX-8.
DR InterPro; IPR022151; Sox_N.
DR PANTHER; PTHR45803:SF2; PTHR45803:SF2; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12444; Sox_N; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; SSF47095; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..459
FT /note="Transcription factor Sox-8"
FT /id="PRO_0000377414"
FT DNA_BIND 98..166
FT /note="HMG box"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00267"
FT REGION 1..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 56..96
FT /note="Dimerization (DIM)"
FT /evidence="ECO:0000250|UniProtKB:P57073"
FT REGION 152..247
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..297
FT /note="Transactivation domain (TAM)"
FT /evidence="ECO:0000250|UniProtKB:P57073"
FT REGION 293..367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 342..459
FT /note="Transactivation domain (TAC)"
FT /evidence="ECO:0000250|UniProtKB:P57073"
FT REGION 439..459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 413..421
FT /note="9aaTAD"
FT /evidence="ECO:0000250|UniProtKB:P57073"
FT COMPBIAS 1..36
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..178
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..247
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..367
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 459 AA; 50465 MW; 65978F2C198BFE47 CRC64;
MLNMSSDQEP PCSPTGTASS MSHVSDSDSD SPLSPAGSEG RGSHRPPGIS KRDGEEPMDE
RFPACIRDAV SQVLKGYDWS LVPMPVRGSG GLKAKPHVKR PMNAFMVWAQ AARRKLADQY
PHLHNAELSK TLGKLWRLLS ENEKRPFVEE AERLRVQHKK DHPDYKYQPR RRKSVKAGQS
DSDSGAELGH HPGSQMYKSD SGMGSMGENH LHSEHAGQNH GPPTPPTTPK TDLHHGGKQE
LKHEGRRMMD NGRQNIDFSN VDINELSSEV ISNIEAFDVH EFDQYLPLNG HGAIPADHGQ
NTTAAPYGPS YPHAAGATPA PVWSHKSSST SSSSSIESGQ QRPHIKTEQL SPSHYNDQSQ
GSPTHSDYNT YSAQACATTV SSATVPTAFP SSQCDYTDLP SSNYYNPYSG YPSSLYQYPY
FHSSRRPYAT PILNSLSIPP SHSPTSNWDQ PVYTTLTRP