HDX_MOUSE
ID   HDX_MOUSE               Reviewed;         692 AA.
AC   Q14B70; Q14B69;
DT   11-SEP-2007, integrated into UniProtKB/Swiss-Prot.
DT   22-AUG-2006, sequence version 1.
DT   03-AUG-2022, entry version 114.
DE   RecName: Full=Highly divergent homeobox;
GN   Name=Hdx;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=C57BL/6J;
RX   PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA   Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA   Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA   Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA   Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA   Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA   Eichler E.E., Ponting C.P.;
RT   "Lineage-specific biology revealed by a finished genome assembly of the
RT   mouse.";
RL   PLoS Biol. 7:E1000112-E1000112(2009).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA project:
RT   the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=2;
CC       Name=1;
CC         IsoId=Q14B70-1; Sequence=Displayed;
CC       Name=2;
CC         IsoId=Q14B70-2; Sequence=VSP_027709;
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BX539333; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; BC116300; AAI16301.1; -; mRNA.
DR   EMBL; BC116301; AAI16302.1; -; mRNA.
DR   CCDS; CCDS41105.1; -. [Q14B70-1]
DR   CCDS; CCDS72422.1; -. [Q14B70-2]
DR   RefSeq; NP_001074018.1; NM_001080549.2. [Q14B70-1]
DR   RefSeq; NP_001277388.1; NM_001290459.1. [Q14B70-2]
DR   AlphaFoldDB; Q14B70; -.
DR   SMR; Q14B70; -.
DR   BioGRID; 232805; 2.
DR   STRING; 10090.ENSMUSP00000109049; -.
DR   iPTMnet; Q14B70; -.
DR   PhosphoSitePlus; Q14B70; -.
DR   jPOST; Q14B70; -.
DR   PaxDb; Q14B70; -.
DR   PeptideAtlas; Q14B70; -.
DR   PRIDE; Q14B70; -.
DR   Antibodypedia; 28380; 57 antibodies from 14 providers.
DR   Ensembl; ENSMUST00000038472; ENSMUSP00000043482; ENSMUSG00000034551. [Q14B70-2]
DR   Ensembl; ENSMUST00000113422; ENSMUSP00000109049; ENSMUSG00000034551. [Q14B70-1]
DR   GeneID; 245596; -.
DR   KEGG; mmu:245596; -.
DR   UCSC; uc009udb.1; mouse. [Q14B70-1]
DR   CTD; 139324; -.
DR   MGI; MGI:2685226; Hdx.
DR   VEuPathDB; HostDB:ENSMUSG00000034551; -.
DR   eggNOG; ENOG502QPZG; Eukaryota.
DR   GeneTree; ENSGT00390000008591; -.
DR   HOGENOM; CLU_025064_0_0_1; -.
DR   InParanoid; Q14B70; -.
DR   OMA; ASMAEIH; -.
DR   OrthoDB; 465472at2759; -.
DR   TreeFam; TF330998; -.
DR   BioGRID-ORCS; 245596; 1 hit in 71 CRISPR screens.
DR   PRO; PR:Q14B70; -.
DR   Proteomes; UP000000589; Chromosome X.
DR   RNAct; Q14B70; protein.
DR   Bgee; ENSMUSG00000034551; Expressed in epiblast (generic) and 37 other tissues.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   CDD; cd00086; homeodomain; 2.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   SMART; SM00389; HOX; 2.
DR   SUPFAM; SSF46689; SSF46689; 2.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   2: Evidence at transcript level;
KW   Alternative splicing; DNA-binding; Homeobox; Isopeptide bond; Nucleus;
KW   Reference proteome; Repeat; Ubl conjugation.
FT   CHAIN           1..692
FT                   /note="Highly divergent homeobox"
FT                   /id="PRO_0000299488"
FT   DNA_BIND        3..63
FT                   /note="Homeobox 1"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT   DNA_BIND        437..500
FT                   /note="Homeobox 2"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT   REGION          117..136
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          505..541
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          647..692
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        672..692
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CROSSLNK        137
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        142
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        146
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        165
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        174
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        196
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        214
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        223
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   CROSSLNK        234
FT                   /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT                   G-Cter in SUMO2)"
FT                   /evidence="ECO:0000250|UniProtKB:Q7Z353"
FT   VAR_SEQ         1..58
FT                   /note="Missing (in isoform 2)"
FT                   /evidence="ECO:0000303|PubMed:15489334"
FT                   /id="VSP_027709"
SQ   SEQUENCE   692 AA;  76861 MW;  66D15D3E4B0E69AA CRC64;
     MNLRSVFTVE QQRILQRYYE NGMTNQSKNC FQLILQCAQE TKLDFSVVRT WVGNKRRKMS
     SKSCESGAAG TVSGTSLAAP DITVRNVVNI ARPSSQQSSW TSANNDVIVT GIYSPVSSSS
     KQGTTKHTNT QITEAHKIPI QKAANKNDTE LQLHIPVQRQ VAHCKNASVL LGEKTIILSR
     QTSVLNAGNS VYNHTKKSYG SSPVQASEMT VPQKPSVCQR PCKIEPVGIQ RSYKPEHAGL
     ASHNLCGQKP TIRDPCCRTQ NLEIREVFSL AVSDYPQRIL GGNSTQKPAS AEGTCLSIAM
     ETGDAEDEYA REEELASMGA QITSYSRFYE SGNSLRAENQ STNLPGPGRN LPNSQMVNIR
     DLSDNVLYQT RDYHLTPRTS LHTASSTMYS NTNPSRSNFS PHFVSSNQLR LSQNQNNYQI
     SGNLSVPWIT GCSRKRALQD RTQFSDRDLA TLKKYWDNGM TSLGSVCREK IEAVAIELNV
     DCEIVRTWIG NRRRKYRLMG IEVPPPRGGP ADFSEQPESG SLSALTPGEE AGPEVGEDND
     RNDEVSICLS EASSQEESNE LIPNETRAHK DEEHQAVSAD NVKIEIIDDE ESDMISNSEV
     EQENSLLDYK NEEVRFIENE LEIQKQKYFK LQSFVRNLIL AMKADDKDQQ QALLSDLPPE
     LEEMDCSHAS PDPDDTSLSV SSLSEKNASD SL
 
 