ID HXA1A_DANRE Reviewed; 329 AA.
AC Q98SI1; Q1L968; Q4PRB2; Q8AWZ1; Q98SI0; Q9YGT8;
DT 01-FEB-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 129.
DE RecName: Full=Homeobox protein Hox-A1a;
DE Short=Hox-A1;
GN Name=hoxa1a; Synonyms=hoxa1;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2).
RX PubMed=11493564; DOI=10.1242/dev.128.13.2471;
RA McClintock J.M., Carlson R., Mann D.M., Prince V.E.;
RT "Consequences of Hox gene duplication in the vertebrates: an investigation
RT of the zebrafish Hox paralogue group 1 genes.";
RL Development 128:2471-2484(2001).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=9831563; DOI=10.1126/science.282.5394.1711;
RA Amores A., Force A., Yan Y.-L., Joly L., Amemiya C., Fritz A., Ho R.K.,
RA Langeland J., Prince V.E., Wang Y.-L., Westerfield M., Ekker M.,
RA Postlethwait J.H.;
RT "Zebrafish hox clusters and vertebrate genome evolution.";
RL Science 282:1711-1714(1998).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tuebingen;
RX PubMed=23594743; DOI=10.1038/nature12111;
RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA Clee C., Oliver K., Clark R., Riddle C., Elliot D., Threadgold G.,
RA Harden G., Ware D., Begum S., Mortimore B., Kerry G., Heath P.,
RA Phillimore B., Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S.,
RA Pelan S., Griffiths G., Smith M., Glithero R., Howden P., Barker N.,
RA Lloyd C., Stevens C., Harley J., Holt K., Panagiotidis G., Lovell J.,
RA Beasley H., Henderson C., Gordon D., Auger K., Wright D., Collins J.,
RA Raisen C., Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D.,
RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S.,
RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E.,
RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., Babbage A.,
RA Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., Wray P.,
RA Ellington A., Matthews N., Ellwood M., Woodmansey R., Clark G., Cooper J.,
RA Tromans A., Grafham D., Skuce C., Pandian R., Andrews R., Harrison E.,
RA Kimberley A., Garnett J., Fosker N., Hall R., Garner P., Kelly D., Bird C.,
RA Palmer S., Gehring I., Berger A., Dooley C.M., Ersan-Urun Z., Eser C.,
RA Geiger H., Geisler M., Karotki L., Kirn A., Konantz J., Konantz M.,
RA Oberlander M., Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G.,
RA Osoegawa K., Zhu B., Rapp A., Widaa S., Langford C., Yang F.,
RA Schuster S.C., Carter N.P., Harrow J., Ning Z., Herrero J., Searle S.M.,
RA Enright A., Geisler R., Plasterk R.H., Lee C., Westerfield M.,
RA de Jong P.J., Zon L.I., Postlethwait J.H., Nusslein-Volhard C.,
RA Hubbard T.J., Roest Crollius H., Rogers J., Stemple D.L.;
RT "The zebrafish reference genome sequence and its relationship to the human
RT genome.";
RL Nature 496:498-503(2013).
RN [4]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 136-224.
RC STRAIN=Tuebingen;
RX PubMed=16174031; DOI=10.1111/j.1525-142x.2005.05042.x;
RA Corredor-Adamez M., Welten M.C.M., Spaink H.P., Jeffery J.E., Schoon R.T.,
RA de Bakker M.A.G., Bagowski C.P., Meijer A.H., Verbeek F.J.,
RA Richardson M.K.;
RT "Genomic annotation and transcriptome analysis of the zebrafish (Danio
RT rerio) hox complex with description of a novel member, hoxb13a.";
RL Evol. Dev. 7:362-375(2005).
CC -!- FUNCTION: Sequence-specific transcription factor. Part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis (By similarity).
CC {ECO:0000250|UniProtKB:Q90423}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q98SI1-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q98SI1-2; Sequence=VSP_012678;
CC -!- SIMILARITY: Belongs to the Antp homeobox family. Labial subfamily.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ306430; CAC34565.1; -; mRNA.
DR EMBL; AJ306431; CAC34566.1; -; mRNA.
DR EMBL; AF071243; AAD15937.1; -; Genomic_DNA.
DR EMBL; AL645756; CAD52136.1; -; Genomic_DNA.
DR EMBL; AL645756; CAD52137.1; -; Genomic_DNA.
DR EMBL; CR382300; CAK10852.1; -; Genomic_DNA.
DR EMBL; DQ060531; AAY67909.1; -; mRNA.
DR RefSeq; NP_571611.1; NM_131536.2. [Q98SI1-1]
DR RefSeq; XP_017207752.1; XM_017352263.1. [Q98SI1-2]
DR AlphaFoldDB; Q98SI1; -.
DR SMR; Q98SI1; -.
DR STRING; 7955.ENSDARP00000074910; -.
DR PaxDb; Q98SI1; -.
DR Ensembl; ENSDART00000163546; ENSDARP00000133812; ENSDARG00000104307. [Q98SI1-2]
DR Ensembl; ENSDART00000167757; ENSDARP00000133272; ENSDARG00000104307. [Q98SI1-1]
DR GeneID; 58051; -.
DR KEGG; dre:58051; -.
DR CTD; 58051; -.
DR ZFIN; ZDB-GENE-000823-5; hoxa1a.
DR eggNOG; KOG0489; Eukaryota.
DR GeneTree; ENSGT00940000157315; -.
DR InParanoid; Q98SI1; -.
DR OMA; QCGPLMY; -.
DR PhylomeDB; Q98SI1; -.
DR TreeFam; TF317730; -.
DR PRO; PR:Q98SI1; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Chromosome 19.
DR Bgee; ENSDARG00000104307; Expressed in midbrain tegmentum and 18 other tissues.
DR ExpressionAtlas; Q98SI1; baseline.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR InterPro; IPR046327; HXA1/B1/D1.
DR PANTHER; PTHR45946; PTHR45946; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW Nucleus; Reference proteome; Transcription; Transcription regulation.
FT CHAIN 1..329
FT /note="Homeobox protein Hox-A1a"
FT /id="PRO_0000200033"
FT DNA_BIND 225..284
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 208..227
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 277..329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 200..205
FT /note="Antp-type hexapeptide"
FT COMPBIAS 279..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 309..329
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 158..192
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:11493564"
FT /id="VSP_012678"
FT CONFLICT 1..28
FT /note="MSTFLDFSSISGGGDGGSGGSCSVRAFH -> MEVAGARAQSGRSQ (in
FT Ref. 2)"
FT /evidence="ECO:0000305"
FT CONFLICT 296
FT /note="Q -> E (in Ref. 1; CAC34566)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 329 AA; 35737 MW; CBF2C722F50A85D5 CRC64;
MSTFLDFSSI SGGGDGGSGG SCSVRAFHGD HGLSTFQSSC AVRLNSCSGD ERFMSNISSQ
DVINSQPQQA GSYQSPGTLS ITYSAHPSYG TQSFCTGYNH YALNQDVESS VSFPQCGPLV
YSGNISSTVV QHRHHRHGYS SGNVHLHGQF QYGSATYGNS SDQANLTFVA GCSNPLSPLH
VPHHDACCSP LSDGVPTGQT FDWMKVKRNP PKTGKAGEYG FGGQPNTVRT NFSTKQLTEL
EKEFHFNKYL TRARRVEIAA SLQLNETQVK IWFQNRRMKQ KKREKEGLLP KSLSEQKDGL
EKTEDASEKS PSAPSTPSPS PTVEAYSSN
//