MSX1_BOVIN
ID MSX1_BOVIN Reviewed; 303 AA.
AC O02786;
DT 15-JUL-1998, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2013, sequence version 2.
DT 25-MAY-2022, entry version 116.
DE RecName: Full=Homeobox protein MSX-1;
DE AltName: Full=Msh homeobox 1-like protein;
GN Name=MSX1;
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=9913;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC TISSUE=Tooth;
RX PubMed=7626784; DOI=10.3109/10425179509030972;
RA Iimura T., Oida S., Takeda K., Maruoka Y., Shimokawa H., Ibaraki K.,
RA Sasaki S.;
RT "Molecular cloning and sequence of bovine Msx-1 homeobox-containing gene
RT cDNA from a bovine odontoblast library.";
RL DNA Seq. 5:233-237(1995).
CC -!- FUNCTION: Acts as a transcriptional repressor. May play a role in limb-
CC pattern formation. Acts in craniofacial development and specifically in
CC odontogenesis (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- PTM: Sumoylated by PIAS1, desumoylated by SENP1. {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the Msh homeobox family. {ECO:0000305}.
CC -!- CAUTION: It is uncertain whether Met-1 or Met-7 is the initiator.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAA20367.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D30750; BAA20367.1; ALT_INIT; mRNA.
DR RefSeq; NP_777223.1; NM_174798.2.
DR AlphaFoldDB; O02786; -.
DR SMR; O02786; -.
DR STRING; 9913.ENSBTAP00000014447; -.
DR PaxDb; O02786; -.
DR GeneID; 286872; -.
DR KEGG; bta:286872; -.
DR CTD; 4487; -.
DR eggNOG; KOG0492; Eukaryota.
DR InParanoid; O02786; -.
DR OrthoDB; 1226077at2759; -.
DR Proteomes; UP000009136; Unplaced.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048598; P:embryonic morphogenesis; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Homeobox; Isopeptide bond; Nucleus;
KW Reference proteome; Repressor; Transcription; Transcription regulation;
KW Ubl conjugation.
FT CHAIN 1..303
FT /note="Homeobox protein MSX-1"
FT /id="PRO_0000049082"
FT DNA_BIND 172..231
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 74..113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 135..161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CROSSLNK 15
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000250"
FT CROSSLNK 133
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000250"
SQ SEQUENCE 303 AA; 31668 MW; 847FA220365E498F CRC64;
MAPAADMTSL PLGVKVEDPP FGKPAGGGGG QTPSTTAATA AAMGADAEGA KPKVSPSLLP
FSVEALMADH RKPGAKKSVL AASEGAQAAG GSAKPLGARP GSLAAPDAPS SPRPLGHFSV
GGLLKLPEDA LVKAESPEKP ERTPWMQNPR FSPPPSRRLS PPACTLRKHK TNRKPRTPFT
TAQLLALERK FRQKQYLSIA ERAEFSSSLS LTETQVKIWF QNRRAKAKRL QEAELEKLKM
AAKPMLPPAT FGLSFPLGGP AAVAAPAGAS LYGASGPFQR AALPVAPVGL YTAHVGYSMY
HLT