HOX33_ORYSJ
ID HOX33_ORYSJ Reviewed; 855 AA.
AC Q2QM96; B9GE92;
DT 29-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 24-JAN-2006, sequence version 1.
DT 03-AUG-2022, entry version 108.
DE RecName: Full=Homeobox-leucine zipper protein HOX33;
DE AltName: Full=HD-ZIP protein HOX33;
DE AltName: Full=Homeodomain transcription factor HOX33;
DE AltName: Full=OsHox33;
GN Name=HOX33; OrderedLocusNames=Os12g0612700, LOC_Os12g41860;
GN ORFNames=OsJ_36852 {ECO:0000312|EMBL:EEE53599.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=16188032; DOI=10.1186/1741-7007-3-20;
RG The rice chromosomes 11 and 12 sequencing consortia;
RT "The sequence of rice chromosomes 11 and 12, rich in disease resistance
RT genes and recent gene duplications.";
RL BMC Biol. 3:20-20(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare;
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare;
RX PubMed=24280374; DOI=10.1186/1939-8433-6-4;
RA Kawahara Y., de la Bastide M., Hamilton J.P., Kanamori H., McCombie W.R.,
RA Ouyang S., Schwartz D.C., Tanaka T., Wu J., Zhou S., Childs K.L.,
RA Davidson R.M., Lin H., Quesada-Ocampo L., Vaillancourt B., Sakai H.,
RA Lee S.S., Kim J., Numa H., Itoh T., Buell C.R., Matsumoto T.;
RT "Improvement of the Oryza sativa Nipponbare reference genome using next
RT generation sequence and optical map data.";
RL Rice 6:4-4(2013).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038;
RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S.,
RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., Cong L.,
RA Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., Wang J.,
RA Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., Wang J., Wang X.,
RA Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., Zhang Z., Bao J., Han Y.,
RA Dong L., Ji J., Chen P., Wu S., Liu J., Xiao Y., Bu D., Tan J., Yang L.,
RA Ye C., Zhang J., Xu J., Zhou Y., Yu Y., Zhang B., Zhuang S., Wei H.,
RA Liu B., Lei M., Yu H., Li Y., Xu H., Wei S., He X., Fang L., Zhang Z.,
RA Zhang Y., Huang X., Su Z., Tong W., Li J., Tong Z., Li S., Ye J., Wang L.,
RA Fang L., Lei T., Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F.,
RA Xu H., Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q.,
RA Li W., Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J.,
RA Gao L., Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M.,
RA McDermott J., Samudrala R., Wang J., Wong G.K.-S., Yang H.;
RT "The genomes of Oryza sativa: a history of duplications.";
RL PLoS Biol. 3:266-281(2005).
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=12869764; DOI=10.1126/science.1081288;
RG The rice full-length cDNA consortium;
RT "Collection, mapping, and annotation of over 28,000 cDNA clones from
RT japonica rice.";
RL Science 301:376-379(2003).
RN [7]
RP TISSUE SPECIFICITY, GENE FAMILY, AND NOMENCLATURE.
RX PubMed=17999151; DOI=10.1007/s11103-007-9255-7;
RA Agalou A., Purwantomo S., Oevernaes E., Johannesson H., Zhu X., Estiati A.,
RA de Kam R.J., Engstroem P., Slamet-Loedin I.H., Zhu Z., Wang M., Xiong L.,
RA Meijer A.H., Ouwerkerk P.B.F.;
RT "A genome-wide survey of HD-Zip genes in rice and analysis of drought-
RT responsive family members.";
RL Plant Mol. Biol. 66:87-103(2008).
CC -!- FUNCTION: Probable transcription factor. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in seedlings, roots, stems, leaf sheaths
CC and blades and panicles. {ECO:0000269|PubMed:17999151}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class III subfamily.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000011; ABA99386.1; -; Genomic_DNA.
DR EMBL; AP008218; BAF30279.1; -; Genomic_DNA.
DR EMBL; AP014968; BAT18052.1; -; Genomic_DNA.
DR EMBL; CM000149; EEE53599.1; -; Genomic_DNA.
DR EMBL; AK102183; -; NOT_ANNOTATED_CDS; mRNA.
DR RefSeq; XP_015620816.1; XM_015765330.1.
DR AlphaFoldDB; Q2QM96; -.
DR SMR; Q2QM96; -.
DR STRING; 4530.OS12T0612700-01; -.
DR PaxDb; Q2QM96; -.
DR PRIDE; Q2QM96; -.
DR EnsemblPlants; Os12t0612700-01; Os12t0612700-01; Os12g0612700.
DR GeneID; 4352777; -.
DR Gramene; Os12t0612700-01; Os12t0612700-01; Os12g0612700.
DR KEGG; osa:4352777; -.
DR eggNOG; ENOG502QPKR; Eukaryota.
DR HOGENOM; CLU_012517_0_0_1; -.
DR InParanoid; Q2QM96; -.
DR OMA; FRDCRCV; -.
DR OrthoDB; 146876at2759; -.
DR Proteomes; UP000000763; Chromosome 12.
DR Proteomes; UP000007752; Chromosome 12.
DR Proteomes; UP000059680; Chromosome 12.
DR Genevisible; Q2QM96; OS.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 3.30.530.20; -; 1.
DR InterPro; IPR044830; HD-Zip_III.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR013978; MEKHLA.
DR InterPro; IPR023393; START-like_dom_sf.
DR InterPro; IPR002913; START_lipid-bd_dom.
DR PANTHER; PTHR45950; PTHR45950; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF08670; MEKHLA; 1.
DR Pfam; PF01852; START; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00234; START; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50848; START; 1.
PE 2: Evidence at transcript level;
KW Coiled coil; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..855
FT /note="Homeobox-leucine zipper protein HOX33"
FT /id="PRO_0000331734"
FT DOMAIN 168..390
FT /note="START"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00197"
FT DNA_BIND 26..89
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 84..126
FT /evidence="ECO:0000255"
FT CONFLICT 214
FT /note="A -> T (in Ref. 6; AK102183)"
FT /evidence="ECO:0000305"
FT CONFLICT 276
FT /note="Y -> H (in Ref. 6; AK102183)"
FT /evidence="ECO:0000305"
FT CONFLICT 321
FT /note="P -> L (in Ref. 6; AK102183)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 855 AA; 93110 MW; F39F46AEA5A085E8 CRC64;
MAAAAVGGRG ERLSSSSPTA AAPQVDAGKY VRYTPEQVEA LERVYTECPK PSSLRRQQLI
RECPILSNIE PKQIKVWFQN RRCREKQRKE ASRLQTVNRK LNAMNKLLME ENDRLQKQVS
RLVYENGYMR TQLHNPSAAT TDTSCESVVT SGQHHQQQNP AVLHPQRDAN NPAGLLAIAE
ETLAEFMSKA TGTAVEWVQM VGMKPGPDSI GIIAVSHNCS GVAARACGLV SLEPTKVAEI
LKDRPSWYRD CRCVDIIHVI PTGNGGTIEL IYMQTYAPTT LAAPRDFWTL RYTSGLEDGS
LVICERSLTQ STGGPSGPNT PNFIRAEVLP SGYLIRPCEG GGSMIYIVDH VDLDAWSVPE
VLRPLYESPK ILAQKMTIAA LRHIRQIAHE SSGEIPYGAG RQPAVFRTFS QRLSRGFNDA
VSGFPDDGWS LLSSDGSEDI TISVNSSPNK LVGSHVSPNP LFSTVGGGIL CAKASMLLQN
VPPALLVRFL REHRSEWADP GVDAYSAASL RASPYAVPGL RTSGFMGSQV ILPLAHTLEH
EEFLEVIRLE GHGFSHDEVL LSRDMYLLQL CSGVDENATS ASAQLVFAPI DESFADDAPL
LPSGFRVIPL DTKMDGPSAT RTLDLASALE VGPGGASRAS VEASGTCNRS VLTIAFQFSY
ENHLRESVAA MARSYVRAVM ASVQRVAVAI APSRLGPQIG MKHPPASPEA LTLASWIGRS
YRAHTGADIR WSDTEDADSP LALLWKHSDA ILCCSLKPAP MFTFANNAGL DILETTLVNL
QDISLEMILD DEGRKALCSE FPKIMQQGFT YLPGGVCKSS MGRQASYEQA VAWKVLSDDD
APHCLAFMLV NWTFM
//