HOMEZ_MOUSE
ID HOMEZ_MOUSE Reviewed; 542 AA.
AC Q80W88; Q504Z8;
DT 16-JAN-2004, integrated into UniProtKB/Swiss-Prot.
DT 11-JAN-2011, sequence version 2.
DT 25-MAY-2022, entry version 141.
DE RecName: Full=Homeobox and leucine zipper protein Homez;
DE AltName: Full=Homeodomain leucine zipper-containing factor;
GN Name=Homez; Synonyms=Kiaa1443;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2), DEVELOPMENTAL STAGE, AND
RP TISSUE SPECIFICITY.
RC STRAIN=Swiss Webster / NIH;
RX PubMed=12925734; DOI=10.1073/pnas.1834010100;
RA Bayarsaihan D., Enkhmandakh B., Makeyev A., Greally J.M., Leckman J.F.,
RA Ruddle F.H.;
RT "Homez, a homeobox leucine zipper gene specific to the vertebrate
RT lineage.";
RL Proc. Natl. Acad. Sci. U.S.A. 100:10358-10363(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Embryonic tail;
RX PubMed=14621295; DOI=10.1093/dnares/10.4.167;
RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S.,
RA Saga Y., Nagase T., Ohara O., Koga H.;
RT "Prediction of the coding sequences of mouse homologues of KIAA gene: III.
RT The complete nucleotide sequences of 500 mouse KIAA-homologous cDNAs
RT identified by screening of terminal sequences of cDNA clones randomly
RT sampled from size-fractionated libraries.";
RL DNA Res. 10:167-180(2003).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=C57BL/6J; TISSUE=Brain;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- FUNCTION: May function as a transcriptional regulator.
CC -!- SUBUNIT: Homodimer or heterodimer (Potential). Interacts with HOXC8 (By
CC similarity). {ECO:0000250, ECO:0000305}.
CC -!- INTERACTION:
CC Q80W88; Q9UHL9: GTF2IRD1; Xeno; NbExp=2; IntAct=EBI-12516872, EBI-372530;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q80W88-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q80W88-2; Sequence=VSP_009133;
CC -!- TISSUE SPECIFICITY: Ubiquitous. Strongly expressed in testis.
CC {ECO:0000269|PubMed:12925734}.
CC -!- DEVELOPMENTAL STAGE: First expressed at 7 dpc. At 8.5-9 dpc expressed
CC in all developing organs. Later on during embryogenesis shows a more
CC restricted expression pattern. At 9.5-12.5 dpc it is strongly expressed
CC in the developing brain, optic vesicle and the otic placode.
CC {ECO:0000269|PubMed:12925734}.
CC -!- MISCELLANEOUS: [Isoform 2]: May result from the retention of an intron.
CC {ECO:0000305}.
CC -!- CAUTION: It is uncertain whether Met-1 or Met-25 is the initiator.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH94669.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAC98171.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY258064; AAP32905.1; -; mRNA.
DR EMBL; AK129361; BAC98171.1; ALT_INIT; mRNA.
DR EMBL; BC094669; AAH94669.1; ALT_INIT; mRNA.
DR RefSeq; NP_001171176.1; NM_001177705.1.
DR RefSeq; NP_898997.2; NM_183174.3.
DR AlphaFoldDB; Q80W88; -.
DR BMRB; Q80W88; -.
DR SMR; Q80W88; -.
DR BioGRID; 232046; 1.
DR IntAct; Q80W88; 1.
DR STRING; 10090.ENSMUSP00000079929; -.
DR iPTMnet; Q80W88; -.
DR PhosphoSitePlus; Q80W88; -.
DR SwissPalm; Q80W88; -.
DR jPOST; Q80W88; -.
DR PaxDb; Q80W88; -.
DR PeptideAtlas; Q80W88; -.
DR PRIDE; Q80W88; -.
DR ProteomicsDB; 273375; -. [Q80W88-1]
DR ProteomicsDB; 273376; -. [Q80W88-2]
DR DNASU; 239099; -.
DR GeneID; 239099; -.
DR KEGG; mmu:239099; -.
DR UCSC; uc007txc.1; mouse. [Q80W88-1]
DR CTD; 57594; -.
DR MGI; MGI:2678023; Homez.
DR eggNOG; KOG3986; Eukaryota.
DR InParanoid; Q80W88; -.
DR OrthoDB; 518562at2759; -.
DR PhylomeDB; Q80W88; -.
DR TreeFam; TF333363; -.
DR BioGRID-ORCS; 239099; 6 hits in 70 CRISPR screens.
DR ChiTaRS; Homez; mouse.
DR PRO; PR:Q80W88; -.
DR Proteomes; UP000000589; Unplaced.
DR RNAct; Q80W88; protein.
DR GO; GO:0005829; C:cytosol; ISO:MGI.
DR GO; GO:0005730; C:nucleolus; ISO:MGI.
DR GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR GO; GO:0005634; C:nucleus; IDA:MGI.
DR GO; GO:0003677; F:DNA binding; IDA:MGI.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR024578; Homez_homeobox_dom.
DR Pfam; PF11569; Homez; 1.
DR SMART; SM00389; HOX; 2.
DR SUPFAM; SSF46689; SSF46689; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Homeobox; Isopeptide bond; Nucleus;
KW Phosphoprotein; Reference proteome; Repeat; Transcription;
KW Transcription regulation; Ubl conjugation.
FT CHAIN 1..542
FT /note="Homeobox and leucine zipper protein Homez"
FT /id="PRO_0000049136"
FT DNA_BIND 55..114
FT /note="Homeobox 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT DNA_BIND 349..409
FT /note="Homeobox 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT DNA_BIND 443..502
FT /note="Homeobox 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 165..193
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 250..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 424..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 501..542
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 352..357
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255"
FT COMPBIAS 263..307
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..542
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 345
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8IX15"
FT MOD_RES 443
FT /note="Phosphothreonine"
FT /evidence="ECO:0000250|UniProtKB:Q8IX15"
FT CROSSLNK 181
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q8IX15"
FT CROSSLNK 201
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q8IX15"
FT VAR_SEQ 522..542
FT /note="EEEEEEEEDDDDGDDDVIIWD -> SVWTEGPWSSNSMMF (in isoform
FT 2)"
FT /evidence="ECO:0000303|PubMed:12925734"
FT /id="VSP_009133"
FT CONFLICT 297
FT /note="P -> L (in Ref. 2; BAC98171)"
FT /evidence="ECO:0000305"
FT CONFLICT 314
FT /note="K -> E (in Ref. 2; BAC98171)"
FT /evidence="ECO:0000305"
FT CONFLICT 389
FT /note="L -> S (in Ref. 1; AAP32905)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 542 AA; 60914 MW; 244C833F148A9B62 CRC64;
MLGHRLLPSL DFPAVSEGYK PEHDMSPNKD ASSLNSSAAG LVCLPPVSEE LQLVWTQAIQ
TSELDGNEHL LQAFSYFPYP SLADIALLCL RHGLQMEKVK TWFMAQRLRC GISWSSEEIE
ETRARVVYHR DQLLFKSLLS FTQQSVRPPQ ERPPVLRPEQ VALGLSPLAP SEQPTHMKGL
KVEPEEPSQV SQLPLNHQNA KEPLMMGSRT FSHQSDCQDL QISGLSKEQA GRGPDQSCGK
TASWNHFTAV HQPDKPASVS LLDNSCKEES EPSGIPPSSS TSSPSFQALA NGTTATPKPL
QPLGCISQSL SPSKKALSPQ VEPLWPQRLW NNSEPNSAGP TEYLSPDMQH QRKTKRKTKE
QLAILKSFFL QCQWARREDY HKLEQITGLP RPEIIQWFGD TRYALKHGQL KWFRDNAVLG
TPSFQDPAIP TPSTRSLKEW AKTPPLPAPP PPPDIRPLEK YWAAHQQLQE ADILKLSQAS
RLSTQQVLDW FDSRLPKPAE VVVCLDEEDE EDEEDELPED GEEEEEEEED DDDGDDDVII
WD