HDG5_ARATH
ID HDG5_ARATH Reviewed; 826 AA.
AC Q9FJS2;
DT 29-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 08-FEB-2011, sequence version 3.
DT 03-AUG-2022, entry version 142.
DE RecName: Full=Homeobox-leucine zipper protein HDG5 {ECO:0000303|PubMed:16778018};
DE AltName: Full=HD-ZIP protein HDG5 {ECO:0000303|PubMed:16778018};
DE AltName: Full=Homeodomain GLABRA 2-like protein 5 {ECO:0000303|PubMed:10809443};
DE AltName: Full=Homeodomain transcription factor HDG5 {ECO:0000303|PubMed:16778018};
DE AltName: Full=Protein HOMEODOMAIN GLABROUS 5 {ECO:0000303|PubMed:16778018};
GN Name=HDG5 {ECO:0000303|PubMed:16778018};
GN Synonyms=HDGL2-5 {ECO:0000303|PubMed:10809443};
GN OrderedLocusNames=At5g46880 {ECO:0000312|Araport:AT5G46880};
GN ORFNames=MQD22.1 {ECO:0000312|EMBL:BAB10227.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=9734815; DOI=10.1093/dnares/5.3.203;
RA Kotani H., Nakamura Y., Sato S., Asamizu E., Kaneko T., Miyajima N.,
RA Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 5. VI. Sequence
RT features of the regions of 1,367,185 bp covered by 19 physically assigned
RT P1 and TAC clones.";
RL DNA Res. 5:203-216(1998).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-302.
RC STRAIN=cv. Columbia;
RX PubMed=14993207; DOI=10.1101/gr.1515604;
RA Castelli V., Aury J.-M., Jaillon O., Wincker P., Clepet C., Menard M.,
RA Cruaud C., Quetier F., Scarpelli C., Schaechter V., Temple G., Caboche M.,
RA Weissenbach J., Salanoubat M.;
RT "Whole genome sequence comparisons and 'full-length' cDNA sequences: a
RT combined approach to evaluate and improve Arabidopsis genome annotation.";
RL Genome Res. 14:406-413(2004).
RN [4]
RP GENE FAMILY.
RX PubMed=10809443; DOI=10.1023/a:1006368316413;
RA Tavares R., Aubourg S., Lecharny A., Kreis M.;
RT "Organization and structural evolution of four multigene families in
RT Arabidopsis thaliana: AtLCAD, AtLGT, AtMYST and AtHD-GL2.";
RL Plant Mol. Biol. 42:703-717(2000).
RN [5]
RP TISSUE SPECIFICITY, GENE FAMILY, AND NOMENCLATURE.
RX PubMed=16778018; DOI=10.1104/pp.106.077388;
RA Nakamura M., Katsumata H., Abe M., Yabe N., Komeda Y., Yamamoto K.T.,
RA Takahashi T.;
RT "Characterization of the class IV homeodomain-leucine zipper gene family in
RT Arabidopsis.";
RL Plant Physiol. 141:1363-1375(2006).
RN [6]
RP FUNCTION, AND DISRUPTION PHENOTYPE.
RC STRAIN=cv. Columbia;
RX PubMed=23590515; DOI=10.1111/tpj.12211;
RA Kamata N., Okada H., Komeda Y., Takahashi T.;
RT "Mutations in epidermis-specific HD-ZIP IV genes affect floral organ
RT identity in Arabidopsis thaliana.";
RL Plant J. 75:430-440(2013).
CC -!- FUNCTION: Probable transcription factor (By similarity). Involved,
CC together with PDF2, in the regulation of floral organ development by
CC promoting the expression of APETALA 3 (AP3) in the epidermis and
CC internal cell layers of developing flowers (PubMed:23590515).
CC {ECO:0000250|UniProtKB:Q0WV12, ECO:0000269|PubMed:23590515}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in the shoot apical meristem (SAM) with
CC higher levels in L1 cells and the epidermal layer of young leaves.
CC Expressed in the L1 of apical inflorescence meristems, early flower
CC primordia, carpel and stamen filament epidermis, ovule primordia,
CC nucellus and chalaza. {ECO:0000269|PubMed:16778018}.
CC -!- DISRUPTION PHENOTYPE: The double mutant pdf2-1 hdg5-1 exhibits abnormal
CC flowers with sepaloid petals and carpelloid stamens, associated with
CC reduced expression of APETALA 3 (AP3) in the epidermis and internal
CC cell layers of developing flowers. {ECO:0000269|PubMed:23590515}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class IV subfamily.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB10227.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=BX841652; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB013394; BAB10227.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002688; AED95443.1; -; Genomic_DNA.
DR EMBL; CP002688; ANM68239.1; -; Genomic_DNA.
DR EMBL; CP002688; ANM68240.1; -; Genomic_DNA.
DR EMBL; BX841652; -; NOT_ANNOTATED_CDS; mRNA.
DR RefSeq; NP_001318750.1; NM_001344703.1.
DR RefSeq; NP_001330010.1; NM_001344704.1.
DR RefSeq; NP_199499.3; NM_124059.4.
DR AlphaFoldDB; Q9FJS2; -.
DR SMR; Q9FJS2; -.
DR STRING; 3702.AT5G46880.1; -.
DR PaxDb; Q9FJS2; -.
DR PRIDE; Q9FJS2; -.
DR ProteomicsDB; 247354; -.
DR EnsemblPlants; AT5G46880.1; AT5G46880.1; AT5G46880.
DR EnsemblPlants; AT5G46880.2; AT5G46880.2; AT5G46880.
DR EnsemblPlants; AT5G46880.3; AT5G46880.3; AT5G46880.
DR GeneID; 834733; -.
DR Gramene; AT5G46880.1; AT5G46880.1; AT5G46880.
DR Gramene; AT5G46880.2; AT5G46880.2; AT5G46880.
DR Gramene; AT5G46880.3; AT5G46880.3; AT5G46880.
DR KEGG; ath:AT5G46880; -.
DR Araport; AT5G46880; -.
DR TAIR; locus:2170957; AT5G46880.
DR eggNOG; ENOG502QU3P; Eukaryota.
DR HOGENOM; CLU_015002_2_1_1; -.
DR InParanoid; Q9FJS2; -.
DR OMA; WVEHMEM; -.
DR OrthoDB; 223056at2759; -.
DR PhylomeDB; Q9FJS2; -.
DR PRO; PR:Q9FJS2; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; Q9FJS2; baseline and differential.
DR Genevisible; Q9FJS2; AT.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; ISS:TAIR.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR GO; GO:0048497; P:maintenance of floral organ identity; IGI:TAIR.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR042160; GLABRA2/ANL2/PDF2/ATML1-like.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR002913; START_lipid-bd_dom.
DR PANTHER; PTHR45654; PTHR45654; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF01852; START; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00234; START; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50848; START; 1.
PE 2: Evidence at transcript level;
KW Coiled coil; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..826
FT /note="Homeobox-leucine zipper protein HDG5"
FT /id="PRO_0000331667"
FT DOMAIN 314..558
FT /note="START"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00197"
FT DNA_BIND 111..170
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 69..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 165..189
FT /evidence="ECO:0000255"
FT COMPBIAS 90..111
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 826 AA; 92120 MW; F890788BB91735A2 CRC64;
MLTMGEGNVM TSNNRFASPP QQPSSSSPGT IQNPNFNFIP FNSYSSIIPK EEHGMMSMMM
MMGDGTVEEM MENGSAGGSF GSGSEQAEDP KFGNESDVNE LHDDEQPPPA KKKRYHRHTN
RQIQEMEALF KENPHPDDKQ RKRLSAELGL KPRQVKFWFQ NRRTQMKAQQ DRNENVMLRA
ENDNLKSENC HLQAELRCLS CPSCGGPTVL GDIPFNEIHI ENCRLREELD RLCCIASRYT
GRPMQSMPPS QPLINPSPML PHHQPSLELD MSVYAGNFPE QSCTDMMMLP PQDTACFFPD
QTANNNNNNN MLLADEEKVI AMEFAVSCVQ ELTKMCDTEE PLWIKKKSDK IGGEILCLNE
EEYMRLFPWP MENQNNKGDF LREASKANAV VIMNSITLVD AFLNADKWSE MFCSIVARAK
TVQIISSGVS GASGSLLLMF AELQVLSPLV PTREAYFLRY VEQNAETGNW AIVDFPIDSF
HDQMQPMNTI THEYKRKPSG CIIQDMPNGY SQVKWVEHVE VDEKHVHETF AEYVKSGMAF
GANRWLDVLQ RQCERIASLM ARNITDLGVI SSAEARRNIM RLSQRLVKTF CVNISTAYGQ
SWTALSETTK DTVRITTRKM CEPGQPTGVV LCAVSTTWLP FSHHQVFDLI RDQHHQSLLE
VLFNGNSPHE VAHIANGSHP GNCISLLRIN VASNSWHNVE LMLQESCIDN SGSLIVYSTV
DVDSIQQAMN GEDSSNIPIL PLGFSIVPVN PPEGISVNSH SPPSCLLTVG IQVLASNVPT
AKPNLSTVTT INNHLCATVN QITSALSNTI TPVIASSADV SNQEVS