HDX_CHICK
ID HDX_CHICK Reviewed; 695 AA.
AC Q5ZKW8;
DT 11-SEP-2007, integrated into UniProtKB/Swiss-Prot.
DT 23-NOV-2004, sequence version 1.
DT 03-AUG-2022, entry version 115.
DE RecName: Full=Highly divergent homeobox;
GN Name=HDX; ORFNames=RCJMB04_8n18;
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Phasianinae; Gallus.
OX NCBI_TaxID=9031;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=CB; TISSUE=Bursa of Fabricius;
RX PubMed=15642098; DOI=10.1186/gb-2004-6-1-r6;
RA Caldwell R.B., Kierzek A.M., Arakawa H., Bezzubov Y., Zaim J., Fiedler P.,
RA Kutter S., Blagodatski A., Kostovska D., Koter M., Plachy J., Carninci P.,
RA Hayashizaki Y., Buerstedde J.-M.;
RT "Full-length cDNAs from chicken bursal lymphocytes to facilitate gene
RT function analysis.";
RL Genome Biol. 6:R6.1-R6.9(2005).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ719966; CAG31625.1; -; mRNA.
DR RefSeq; NP_001026291.1; NM_001031120.3.
DR RefSeq; XP_015133765.1; XM_015278279.1.
DR AlphaFoldDB; Q5ZKW8; -.
DR SMR; Q5ZKW8; -.
DR STRING; 9031.ENSGALP00000011393; -.
DR PaxDb; Q5ZKW8; -.
DR Ensembl; ENSGALT00000011407; ENSGALP00000011393; ENSGALG00000007041.
DR GeneID; 422272; -.
DR KEGG; gga:422272; -.
DR CTD; 139324; -.
DR VEuPathDB; HostDB:geneid_422272; -.
DR eggNOG; ENOG502QPZG; Eukaryota.
DR GeneTree; ENSGT00390000008591; -.
DR HOGENOM; CLU_025064_0_0_1; -.
DR InParanoid; Q5ZKW8; -.
DR OMA; ASMAEIH; -.
DR OrthoDB; 465472at2759; -.
DR PhylomeDB; Q5ZKW8; -.
DR TreeFam; TF330998; -.
DR PRO; PR:Q5ZKW8; -.
DR Proteomes; UP000000539; Chromosome 4.
DR Bgee; ENSGALG00000007041; Expressed in spermatid and 13 other tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 2.
DR SUPFAM; SSF46689; SSF46689; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW DNA-binding; Homeobox; Nucleus; Reference proteome; Repeat.
FT CHAIN 1..695
FT /note="Highly divergent homeobox"
FT /id="PRO_0000299489"
FT DNA_BIND 3..63
FT /note="Homeobox 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT DNA_BIND 440..503
FT /note="Homeobox 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 56..81
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 653..695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 674..695
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 695 AA; 77251 MW; 5E5EC479BDE6057B CRC64;
MNLRSVFTVE QQRILQRYYE NGMTNQSKNC FQLILQCAQE TKLDFSVVRT WVGNKRRKMS
SKSALESGGA PPGTAHTAPS VPPEAMVRNV VNIARSQSQQ SSWTSSNNDV IVTGIYSPAS
SSNRQGSTKQ TNASMAEIHK TSIPRLPGKS DADFQQQHIP IGRQIPHCKN ASLLVGEKTI
ILSRQTSVLN SANSIYSHTK KSYGGSSVQT AELVLPQKPM ICHRPCKAEL MGCQRLQKPE
HAALASHGPP GQRANARDPC STQNLEIREV FSLAVTDQPQ RIVGGSTAQK HCSVEGSSLS
IAMETGDVDD EYAREEELAS MGAQIQSYSR YYESSSSIRV ENQSAALSGP GRNVSCSSQM
VNARDVPDSM LYHSRDYHLP ARTSLHTSST LYNSANTSRN TFSPHFTSSN QLRLSQNQNN
YQISGNLTVP WITGCSRKRA LQDRTQFSDR DLATLKKYWD NGMTSLGSVC REKIEAVAAE
LNVDCEIVRT WIGNRRRKYR LMGIEVPPPR GGPADFSDQS EFVSKSALNP GEETATEVGD
DNDRNDEVSI CLSEGSSQEE TNEVLQNEEI HHKDDDRNPV SADNVKIEII DDEESDMISN
SEVDQMSSLL DYKNEEVRFI ENELENHKQK YFELQTFTRS LILAIKSDDK EQQQALLSDL
PPELEEMDFN HTSPEPDDTS FSLSSLSEKN ASDSL