LEA47_ARATH
ID LEA47_ARATH Reviewed; 192 AA.
AC Q8GWT7;
DT 13-APR-2016, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 102.
DE RecName: Full=Late embryogenesis abundant protein 47 {ECO:0000305};
DE Short=LEA 47 {ECO:0000305};
GN OrderedLocusNames=At5g27980 {ECO:0000312|Araport:AT5G27980};
GN ORFNames=F15F15.50 {ECO:0000305};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000312|EMBL:BAC43241.1};
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130714; DOI=10.1038/35048507;
RA Tabata S., Kaneko T., Nakamura Y., Kotani H., Kato T., Asamizu E.,
RA Miyajima N., Sasamoto S., Kimura T., Hosouchi T., Kawashima K., Kohara M.,
RA Matsumoto M., Matsuno A., Muraki A., Nakayama S., Nakazaki N., Naruo K.,
RA Okumura S., Shinpo S., Takeuchi C., Wada T., Watanabe A., Yamada M.,
RA Yasuda M., Sato S., de la Bastide M., Huang E., Spiegel L., Gnoj L.,
RA O'Shaughnessy A., Preston R., Habermann K., Murray J., Johnson D.,
RA Rohlfing T., Nelson J., Stoneking T., Pepin K., Spieth J., Sekhon M.,
RA Armstrong J., Becker M., Belter E., Cordum H., Cordes M., Courtney L.,
RA Courtney W., Dante M., Du H., Edwards J., Fryman J., Haakensen B.,
RA Lamar E., Latreille P., Leonard S., Meyer R., Mulvaney E., Ozersky P.,
RA Riley A., Strowmatt C., Wagner-McPherson C., Wollam A., Yoakum M., Bell M.,
RA Dedhia N., Parnell L., Shah R., Rodriguez M., Hoon See L., Vil D.,
RA Baker J., Kirchoff K., Toth K., King L., Bahret A., Miller B., Marra M.A.,
RA Martienssen R., McCombie W.R., Wilson R.K., Murphy G., Bancroft I.,
RA Volckaert G., Wambutt R., Duesterhoeft A., Stiekema W., Pohl T.,
RA Entian K.-D., Terryn N., Hartley N., Bent E., Johnson S., Langham S.-A.,
RA McCullagh B., Robben J., Grymonprez B., Zimmermann W., Ramsperger U.,
RA Wedler H., Balke K., Wedler E., Peters S., van Staveren M., Dirkse W.,
RA Mooijman P., Klein Lankhorst R., Weitzenegger T., Bothe G., Rose M.,
RA Hauf J., Berneiser S., Hempel S., Feldpausch M., Lamberth S.,
RA Villarroel R., Gielen J., Ardiles W., Bents O., Lemcke K., Kolesov G.,
RA Mayer K.F.X., Rudd S., Schoof H., Schueller C., Zaccaria P., Mewes H.-W.,
RA Bevan M., Fransz P.F.;
RT "Sequence and analysis of chromosome 5 of the plant Arabidopsis thaliana.";
RL Nature 408:823-826(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=11910074; DOI=10.1126/science.1071006;
RA Seki M., Narusaka M., Kamiya A., Ishida J., Satou M., Sakurai T.,
RA Nakajima M., Enju A., Akiyama K., Oono Y., Muramatsu M., Hayashizaki Y.,
RA Kawai J., Carninci P., Itoh M., Ishii Y., Arakawa T., Shibata K.,
RA Shinagawa A., Shinozaki K.;
RT "Functional annotation of a full-length Arabidopsis cDNA collection.";
RL Science 296:141-145(2002).
RN [4]
RP GENE FAMILY, AND NOMENCLATURE.
RX PubMed=18318901; DOI=10.1186/1471-2164-9-118;
RA Hundertmark M., Hincha D.K.;
RT "LEA (late embryogenesis abundant) proteins and their encoding genes in
RT Arabidopsis thaliana.";
RL BMC Genomics 9:118-118(2008).
RN [5]
RP SUBCELLULAR LOCATION, GENE FAMILY, AND NOMENCLATURE.
RX PubMed=25005920; DOI=10.1105/tpc.114.127316;
RA Candat A., Paszkiewicz G., Neveu M., Gautier R., Logan D.C.,
RA Avelange-Macherel M.-H., Macherel D.;
RT "The ubiquitous distribution of late embryogenesis abundant proteins across
RT cell compartments in Arabidopsis offers tailored protection against abiotic
RT stress.";
RL Plant Cell 26:3148-3166(2014).
CC -!- FUNCTION: LEA proteins are late embryonic proteins abundant in higher
CC plant seed embryos. The function of those proteins is not known.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000269|PubMed:25005920}. Nucleus
CC {ECO:0000269|PubMed:25005920}.
CC -!- SIMILARITY: Belongs to the LEA type SMP family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC007627; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CP002688; AED93755.1; -; Genomic_DNA.
DR EMBL; AK118645; BAC43241.1; -; mRNA.
DR RefSeq; NP_198150.1; NM_122681.4.
DR AlphaFoldDB; Q8GWT7; -.
DR STRING; 3702.AT5G27980.1; -.
DR iPTMnet; Q8GWT7; -.
DR PaxDb; Q8GWT7; -.
DR PRIDE; Q8GWT7; -.
DR ProteomicsDB; 230199; -.
DR EnsemblPlants; AT5G27980.1; AT5G27980.1; AT5G27980.
DR GeneID; 832868; -.
DR Gramene; AT5G27980.1; AT5G27980.1; AT5G27980.
DR KEGG; ath:AT5G27980; -.
DR Araport; AT5G27980; -.
DR TAIR; locus:2143789; AT5G27980.
DR eggNOG; ENOG502R41N; Eukaryota.
DR HOGENOM; CLU_075678_1_0_1; -.
DR InParanoid; Q8GWT7; -.
DR OMA; LQKPIDC; -.
DR OrthoDB; 1200778at2759; -.
DR PhylomeDB; Q8GWT7; -.
DR PRO; PR:Q8GWT7; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; Q8GWT7; baseline and differential.
DR GO; GO:0005829; C:cytosol; HDA:TAIR.
DR GO; GO:0005634; C:nucleus; IDA:UniProtKB.
DR InterPro; IPR042971; LEA_SMP.
DR InterPro; IPR007011; LEA_SMP_dom.
DR PANTHER; PTHR31174; PTHR31174; 1.
DR Pfam; PF04927; SMP; 2.
PE 2: Evidence at transcript level;
KW Cytoplasm; Nucleus; Reference proteome; Repeat.
FT CHAIN 1..192
FT /note="Late embryogenesis abundant protein 47"
FT /id="PRO_0000436061"
FT DOMAIN 68..125
FT /note="SMP 1"
FT /evidence="ECO:0000255"
FT DOMAIN 133..190
FT /note="SMP 2"
FT /evidence="ECO:0000255"
FT REGION 146..174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 5..9
FT /note="Nuclear localization signal (NLS)"
FT /evidence="ECO:0000250|UniProtKB:Q9LJ97"
FT COMPBIAS 149..163
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 192 AA; 19516 MW; D08ADA9BBE27CB7E CRC64;
MSEEQLQKPI DCADVKGEAE KISTTEGGIK AAEDKEKGVV AEASGEQAEG EVNQKKVVAN
PLKSEGTITI GEALEAAVLT AGNKPVEWSD AAAIQAAEVR ATGRTNIMPG GVAASAQSAA
TLNARIGSDD TKTTLADVLT GASSKLPSDK AATRKDAEGV TGAEMRNDPH LTTYPTGVAA
SVAAAARINQ SK