SOX8_TETNG
ID SOX8_TETNG Reviewed; 462 AA.
AC Q6IZ48; Q4SGP2;
DT 22-NOV-2005, integrated into UniProtKB/Swiss-Prot.
DT 22-NOV-2005, sequence version 2.
DT 25-MAY-2022, entry version 84.
DE RecName: Full=Transcription factor Sox-8;
GN Name=sox8; ORFNames=GSTENG00018540001;
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RA MacKenzie M.G., Fernandes J.M.O., Johnston I.A., Kinghorn J.R.;
RT "Cloning and characterization of the transcription factor Sox 8 in the
RT fresh water pufferfish Tetraodon nigroviridis.";
RL Submitted (APR-2004) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
CC -!- FUNCTION: May play a role in central nervous system, limb and facial
CC development. May be involved in male sex determination. Binds the
CC consensus motif 5'-[AT][AT]CAA[AT]G-3' (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00267}.
CC -!- DOMAIN: The 9aaTAD motif is a transactivation domain present in a large
CC number of yeast and animal transcription factors.
CC {ECO:0000250|UniProtKB:P57073}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAG00190.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY612092; AAT42231.1; -; mRNA.
DR EMBL; CAAE01014593; CAG00190.1; ALT_SEQ; Genomic_DNA.
DR AlphaFoldDB; Q6IZ48; -.
DR SMR; Q6IZ48; -.
DR KEGG; tng:GSTEN00018540G001; -.
DR HOGENOM; CLU_031800_0_0_1; -.
DR InParanoid; Q6IZ48; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR Gene3D; 1.10.30.10; -; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR031265; SOX-8.
DR InterPro; IPR022151; Sox_N.
DR PANTHER; PTHR45803:SF2; PTHR45803:SF2; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12444; Sox_N; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; SSF47095; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 2: Evidence at transcript level;
KW DNA-binding; Nucleus; Reference proteome; Transcription;
KW Transcription regulation.
FT CHAIN 1..462
FT /note="Transcription factor Sox-8"
FT /id="PRO_0000048736"
FT DNA_BIND 100..168
FT /note="HMG box"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00267"
FT REGION 1..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 155..243
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 291..375
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 442..462
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 415..423
FT /note="9aaTAD"
FT /evidence="ECO:0000250|UniProtKB:P57073"
FT COMPBIAS 15..52
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 155..180
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 291..316
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 133
FT /note="T -> A (in Ref. 1; AAT42231)"
FT /evidence="ECO:0000305"
FT CONFLICT 274
FT /note="N -> S (in Ref. 1; AAT42231)"
FT /evidence="ECO:0000305"
FT CONFLICT 338
FT /note="S -> P (in Ref. 1; AAT42231)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 462 AA; 50712 MW; A74DA5142A5A006B CRC64;
MLKMTEEHDK CVSDQPCSPS GTNSSMSQDE SDSDAPSSPT GSDGQGSLLT SLGRKVDSED
DERFPACIRD AVSQVLKGYD WSLVPMPVRG NGSLKNKPHV KRPMNAFMVW AQAARRKLAD
QYPHLHNAEL SKTLGKLWRL LSESEKRPFV DEAERLRIQH KKDHPDYKYQ PRRRKNVKPG
QSDSDSGAEL AHHMYKAEPG MGGMGGITDA HHHAEHAGQP HGPPTPPTTP KTDLHHGAKQ
DLKHEGRRLI DSSRQNIDFS NVDISELSTD VISNMETFDV HEFDQYLPLN GHTSSSSSLP
SDQPPAPVSS YASSYGHAGV NGPAWSRKGA MPSSSPSSGE VGQHRLHIKT EQLSPSHYSE
HSHRSPPHSD YGSYSSPACV TSATSAASVP FSGSQCDYSD IQSSNYYNPY SSYSSSLYQY
PYFHSSRRPY GSPILNSLSM APAHSPTGSG WDQPVYTTLS RP
//