ID SHH2_ARATH Reviewed; 348 AA.
AC Q8RWJ7; F4J7N6; F4J7N8; Q9LS52;
DT 18-SEP-2013, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2002, sequence version 1.
DT 03-AUG-2022, entry version 133.
DE RecName: Full=Protein SAWADEE HOMEODOMAIN HOMOLOG 2;
DE AltName: Full=Probable DNA-binding transcription factor 2;
GN Name=SHH2; OrderedLocusNames=At3g18380; ORFNames=MYF24.10;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10819329; DOI=10.1093/dnares/7.2.131;
RA Sato S., Nakamura Y., Kaneko T., Katoh T., Asamizu E., Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 3. I. Sequence
RT features of the regions of 4,504,864 bp covered by sixty P1 and TAC
RT clones.";
RL DNA Res. 7:131-135(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP IDENTIFICATION.
RX PubMed=21811420; DOI=10.1371/journal.pgen.1002195;
RA Law J.A., Vashisht A.A., Wohlschlegel J.A., Jacobsen S.E.;
RT "SHH1, a homeodomain protein required for DNA methylation, as well as RDR2,
RT RDM4, and chromatin remodeling factors, associate with RNA polymerase IV.";
RL PLoS Genet. 7:E1002195-E1002195(2011).
RN [5]
RP DOMAIN.
RX PubMed=23637343; DOI=10.1073/pnas.1300585110;
RA Zhang H., Ma Z.Y., Zeng L., Tanaka K., Zhang C.J., Ma J., Bai G., Wang P.,
RA Zhang S.W., Liu Z.W., Cai T., Tang K., Liu R., Shi X., He X.J., Zhu J.K.;
RT "DTF1 is a core component of RNA-directed DNA methylation and may assist in
RT the recruitment of Pol IV.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:8290-8295(2013).
CC -!- FUNCTION: May play a role in recruiting RNA polymerase IV (Pol IV) to
CC genomic regions associated with K9-methylated histone H3 that are
CC targets of RNA-directed DNA methylation (RdDM).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q8RWJ7-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8RWJ7-2; Sequence=VSP_047677;
CC Name=3;
CC IsoId=Q8RWJ7-3; Sequence=VSP_047676, VSP_047677;
CC -!- DOMAIN: The SAWADEE domain (154-257) binds to mono-, di-, or
CC trimethylated H3K9 histone peptides, but this interaction is blocked if
CC H3K4 methylation is present. {ECO:0000269|PubMed:23637343}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB01104.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB026658; BAB01104.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002686; AEE76088.1; -; Genomic_DNA.
DR EMBL; CP002686; AEE76089.1; -; Genomic_DNA.
DR EMBL; CP002686; AEE76090.1; -; Genomic_DNA.
DR EMBL; AY093042; AAM13041.1; -; mRNA.
DR EMBL; BT003428; AAO30091.1; -; mRNA.
DR RefSeq; NP_001189923.1; NM_001202994.1. [Q8RWJ7-3]
DR RefSeq; NP_188467.2; NM_112723.3. [Q8RWJ7-1]
DR RefSeq; NP_974333.1; NM_202604.3. [Q8RWJ7-2]
DR AlphaFoldDB; Q8RWJ7; -.
DR SMR; Q8RWJ7; -.
DR BioGRID; 6700; 12.
DR IntAct; Q8RWJ7; 12.
DR STRING; 3702.AT3G18380.2; -.
DR PaxDb; Q8RWJ7; -.
DR PRIDE; Q8RWJ7; -.
DR ProteomicsDB; 234504; -. [Q8RWJ7-1]
DR EnsemblPlants; AT3G18380.1; AT3G18380.1; AT3G18380. [Q8RWJ7-1]
DR EnsemblPlants; AT3G18380.2; AT3G18380.2; AT3G18380. [Q8RWJ7-2]
DR EnsemblPlants; AT3G18380.3; AT3G18380.3; AT3G18380. [Q8RWJ7-3]
DR GeneID; 821367; -.
DR Gramene; AT3G18380.1; AT3G18380.1; AT3G18380. [Q8RWJ7-1]
DR Gramene; AT3G18380.2; AT3G18380.2; AT3G18380. [Q8RWJ7-2]
DR Gramene; AT3G18380.3; AT3G18380.3; AT3G18380. [Q8RWJ7-3]
DR KEGG; ath:AT3G18380; -.
DR Araport; AT3G18380; -.
DR TAIR; locus:2095077; AT3G18380.
DR eggNOG; ENOG502QSY0; Eukaryota.
DR OMA; NAMPARD; -.
DR OrthoDB; 938556at2759; -.
DR PhylomeDB; Q8RWJ7; -.
DR PRO; PR:Q8RWJ7; -.
DR Proteomes; UP000006548; Chromosome 3.
DR ExpressionAtlas; Q8RWJ7; baseline and differential.
DR Genevisible; Q8RWJ7; AT.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR032001; SAWADEE_dom.
DR InterPro; IPR039276; SHH1/2.
DR PANTHER; PTHR33827; PTHR33827; 2.
DR Pfam; PF16719; SAWADEE; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Nucleus; Reference proteome.
FT CHAIN 1..348
FT /note="Protein SAWADEE HOMEODOMAIN HOMOLOG 2"
FT /id="PRO_0000423318"
FT REGION 154..260
FT /note="SAWADEE domain"
FT /evidence="ECO:0000250"
FT REGION 318..348
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..348
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 127..129
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000305"
FT /id="VSP_047676"
FT VAR_SEQ 260
FT /note="E -> EQ (in isoform 2 and isoform 3)"
FT /evidence="ECO:0000305"
FT /id="VSP_047677"
SQ SEQUENCE 348 AA; 38521 MW; 54E203645B18E7BC CRC64;
MGRPPSNGGP AFRFILPEVT EMEAILLQHN TAMPGRHILE ALADKFSESP ERKGKVVVQF
KQIWNWFQNR RYALRARGNK APGKLNVSSM PRMDLPNQMR SVIQPLSVPK TTHMTGNLPG
MTPAPSGSLV PGVMRSGSDN SYLEFEAKSA RDGAWYDVQA FLAHRNLEIG DPEVQVRFAG
FEVEEDEWIN VKKHVRQRSL PCEASECVAV LAGDLVLCFQ EGKDQALYFD AIVLDAQRRR
HDVRGCRCRF LVRYSHDQSE EIVPLRKICR RPETDYRLQQ LHNAVNDLAN SNQHQIPALD
AAAKTPLSLP GATVPIVAPE SKDPSLSATP ATLVQPSSNA ATVPAGSA