SEBOX_HUMAN
ID SEBOX_HUMAN Reviewed; 190 AA.
AC Q9HB31; F6T8T6;
DT 13-NOV-2007, integrated into UniProtKB/Swiss-Prot.
DT 18-SEP-2019, sequence version 3.
DT 03-AUG-2022, entry version 151.
DE RecName: Full=Homeobox protein SEBOX;
DE AltName: Full=Homeobox OG-9;
DE AltName: Full=Skin-, embryo-, brain- and oocyte-specific homeobox;
GN Name=SEBOX; Synonyms=OG9X;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND VARIANT SER-181.
RX PubMed=10922053; DOI=10.1073/pnas.97.16.8904;
RA Cinquanta M., Rovescalli A.C., Kozak C.A., Nirenberg M.;
RT "Mouse Sebox homeobox gene expression in skin, brain, oocytes, and two-cell
RT embryos.";
RL Proc. Natl. Acad. Sci. U.S.A. 97:8904-8909(2000).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16625196; DOI=10.1038/nature04689;
RA Zody M.C., Garber M., Adams D.J., Sharpe T., Harrow J., Lupski J.R.,
RA Nicholson C., Searle S.M., Wilming L., Young S.K., Abouelleil A.,
RA Allen N.R., Bi W., Bloom T., Borowsky M.L., Bugalter B.E., Butler J.,
RA Chang J.L., Chen C.-K., Cook A., Corum B., Cuomo C.A., de Jong P.J.,
RA DeCaprio D., Dewar K., FitzGerald M., Gilbert J., Gibson R., Gnerre S.,
RA Goldstein S., Grafham D.V., Grocock R., Hafez N., Hagopian D.S., Hart E.,
RA Norman C.H., Humphray S., Jaffe D.B., Jones M., Kamal M., Khodiyar V.K.,
RA LaButti K., Laird G., Lehoczky J., Liu X., Lokyitsang T., Loveland J.,
RA Lui A., Macdonald P., Major J.E., Matthews L., Mauceli E., McCarroll S.A.,
RA Mihalev A.H., Mudge J., Nguyen C., Nicol R., O'Leary S.B., Osoegawa K.,
RA Schwartz D.C., Shaw-Smith C., Stankiewicz P., Steward C., Swarbreck D.,
RA Venkataraman V., Whittaker C.A., Yang X., Zimmer A.R., Bradley A.,
RA Hubbard T., Birren B.W., Rogers J., Lander E.S., Nusbaum C.;
RT "DNA sequence of human chromosome 17 and analysis of rearrangement in the
RT human lineage.";
RL Nature 440:1045-1049(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT SER-181.
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Probable transcription factor involved in the control of
CC specification of mesoderm and endoderm. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAG14458.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC Sequence=EAW51081.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF284337; AAG14458.1; ALT_INIT; Genomic_DNA.
DR EMBL; AC002094; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KF573650; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471159; EAW51081.1; ALT_INIT; Genomic_DNA.
DR CCDS; CCDS45634.2; -.
DR RefSeq; NP_001074306.3; NM_001080837.3.
DR AlphaFoldDB; Q9HB31; -.
DR SMR; Q9HB31; -.
DR STRING; 9606.ENSP00000444503; -.
DR BioMuta; SEBOX; -.
DR DMDM; 322510071; -.
DR MassIVE; Q9HB31; -.
DR PaxDb; Q9HB31; -.
DR PRIDE; Q9HB31; -.
DR Antibodypedia; 76862; 11 antibodies from 6 providers.
DR DNASU; 645832; -.
DR Ensembl; ENST00000536498.6; ENSP00000444503.3; ENSG00000274529.6.
DR GeneID; 645832; -.
DR KEGG; hsa:645832; -.
DR MANE-Select; ENST00000536498.6; ENSP00000444503.3; NM_001080837.4; NP_001074306.3.
DR UCSC; uc010wai.1; human.
DR CTD; 645832; -.
DR GeneCards; SEBOX; -.
DR HGNC; HGNC:32942; SEBOX.
DR HPA; ENSG00000274529; Not detected.
DR MIM; 610975; gene.
DR neXtProt; NX_Q9HB31; -.
DR VEuPathDB; HostDB:ENSG00000274529; -.
DR eggNOG; KOG0490; Eukaryota.
DR GeneTree; ENSGT00920000149180; -.
DR HOGENOM; CLU_080455_0_0_1; -.
DR InParanoid; Q9HB31; -.
DR OMA; AFAAWPY; -.
DR OrthoDB; 1270742at2759; -.
DR PhylomeDB; Q9HB31; -.
DR TreeFam; TF315976; -.
DR BioGRID-ORCS; 645832; 13 hits in 1002 CRISPR screens.
DR GenomeRNAi; 645832; -.
DR Pharos; Q9HB31; Tbio.
DR PRO; PR:Q9HB31; -.
DR Proteomes; UP000005640; Chromosome 17.
DR RNAct; Q9HB31; protein.
DR Bgee; ENSG00000274529; Expressed in right lobe of liver and 23 other tissues.
DR Genevisible; Q9HB31; HS.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0009792; P:embryo development ending in birth or egg hatching; IEA:Ensembl.
DR GO; GO:0048477; P:oogenesis; IEA:Ensembl.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR042223; SEBOX.
DR PANTHER; PTHR47777; PTHR47777; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein; Differentiation; DNA-binding; Homeobox; Nucleus;
KW Reference proteome; Transcription; Transcription regulation.
FT CHAIN 1..190
FT /note="Homeobox protein SEBOX"
FT /id="PRO_0000311336"
FT DNA_BIND 19..78
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 82..161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VARIANT 181
FT /note="L -> S (in dbSNP:rs9910163)"
FT /evidence="ECO:0000269|PubMed:10922053, ECO:0000269|Ref.3"
FT /id="VAR_037228"
SQ SEQUENCE 190 AA; 20398 MW; 6C2D5F785A7F7BDF CRC64;
MPSPVDASSA DGGSGLGSHR RKRTTFSKGQ LLELERAFAA WPYPNISTHE HLAWVTCLPE
AKVQVWFQKR WAKIIKNRKS GILSPGSECP QSSCSLPDTL QQPWDPQMPG QPPPSSGTPQ
RTSVCRHSSC PAPGLSPRQG WEGAKAVAPW GSAGASEVHP SLERATPQTS LGSLSDLIYA
LAIVVNVDHS