EXTN1_ARATH
ID EXTN1_ARATH Reviewed; 373 AA.
AC Q38913; F4I433; F4I434; Q9FS15; Q9SAW2;
DT 27-MAY-2002, integrated into UniProtKB/Swiss-Prot.
DT 27-MAY-2002, sequence version 2.
DT 03-AUG-2022, entry version 123.
DE RecName: Full=Extensin-1 {ECO:0000303|PubMed:11475326};
DE Short=AtExt1 {ECO:0000303|PubMed:11475326};
DE Short=AtExt4 {ECO:0000303|PubMed:11475326};
DE AltName: Full=Extensin-1/4 {ECO:0000303|PubMed:20395450};
DE Flags: Precursor;
GN Name=EXT1 {ECO:0000303|PubMed:11475326};
GN Synonyms=EXT1/4 {ECO:0000303|PubMed:20395450},
GN EXT4 {ECO:0000303|PubMed:11475326}; OrderedLocusNames=At1g76930;
GN ORFNames=F22K20.3;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=cv. Landsberg erecta;
RX PubMed=10333585; DOI=10.1007/s004250050552;
RA Merkouropoulos G., Barnett D.C., Shirsat A.H.;
RT "The Arabidopsis extensin gene is developmentally regulated, is induced by
RT wounding, methyl jasmonate, abscisic and salicylic acid, and codes for a
RT protein with unusual motifs.";
RL Planta 208:212-219(1999).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2).
RC STRAIN=cv. Columbia;
RX PubMed=11475326; DOI=10.1093/dnares/8.3.115;
RA Yoshiba Y., Aoki C., Iuchi S., Nanjo T., Seki M., Sekiguchi F.,
RA Yamaguchi-Shinozaki K., Shinozaki K.;
RT "Characterization of four extensin genes in Arabidopsis thaliana by
RT differential gene expression under stress and non-stress conditions.";
RL DNA Res. 8:115-122(2001).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [5]
RP GENE FAMILY, AND NOMENCLATURE.
RX PubMed=20395450; DOI=10.1104/pp.110.156554;
RA Showalter A.M., Keppler B., Lichtenberg J., Gu D., Welch L.R.;
RT "A bioinformatics approach to the identification, classification, and
RT analysis of hydroxyproline-rich glycoproteins.";
RL Plant Physiol. 153:485-513(2010).
CC -!- FUNCTION: Structural component which strengthens the primary cell wall.
CC -!- SUBCELLULAR LOCATION: Secreted, primary cell wall.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Comment=Experimental confirmation may be lacking for some isoforms.;
CC Name=1;
CC IsoId=Q38913-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q38913-2; Sequence=VSP_008897;
CC -!- TISSUE SPECIFICITY: Predominantly expressed in the roots. Not detected
CC in leaves, flowers or flower buds. Wounding reverses this pattern,
CC turning on the gene in the leaves and repressing it in the roots.
CC -!- DEVELOPMENTAL STAGE: Expressed early in the whole plant. Detected in
CC the leaves of 2- and 4-week-old rosettes, but not in 6-week-old
CC rosettes. Detected specifically in roots of the mature plant (6 weeks
CC old).
CC -!- INDUCTION: By wounding, water and cold stresses; in response to plant
CC hormones 2,4-D, BAP, GA3, SA, MeJA and ABA treatment; in response to L-
CC Ser, Hyp and L-Pro treatment.
CC -!- PTM: Extensins contain a characteristic repeat of the pentapeptide Ser-
CC Pro(4). For this particular extensin, a typical repeat of Ser-Pro(3) is
CC found. In both cases, the proline residues are hydroxylated and then O-
CC glycosylated (arabinosylation).
CC -!- PTM: Synthesized as soluble proteins which become insolubilized in the
CC cell wall through the intermolecular cross-linking of Tyr on adjacent
CC monomers. Isodityrosine (IDT) stabilizes and rigidifies the part of the
CC polypeptide where IDT functional sites are present.
CC -!- SIMILARITY: Belongs to the extensin family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA85899.1; Type=Miscellaneous discrepancy; Note=In cv. Landsberg erecta, absence of several repeats.; Evidence={ECO:0000305};
CC Sequence=AEE35904.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=AEE35905.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U43627; AAA85899.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AB031820; BAB20085.1; -; mRNA.
DR EMBL; AC002291; AAC00630.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE35904.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE35905.1; ALT_SEQ; Genomic_DNA.
DR PIR; B96798; B96798.
DR RefSeq; NP_565143.1; NM_106344.3.
DR RefSeq; NP_849895.1; NM_179564.1.
DR AlphaFoldDB; Q38913; -.
DR STRING; 3702.AT1G76930.1; -.
DR PaxDb; Q38913; -.
DR PRIDE; Q38913; -.
DR ProteomicsDB; 222311; -. [Q38913-1]
DR GeneID; 844028; -.
DR KEGG; ath:AT1G76930; -.
DR Araport; AT1G76930; -.
DR TAIR; locus:2025262; AT1G76930.
DR InParanoid; Q38913; -.
DR PRO; PR:Q38913; -.
DR Proteomes; UP000006548; Chromosome 1.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0009530; C:primary cell wall; IEA:UniProtKB-SubCell.
DR GO; GO:0005199; F:structural constituent of cell wall; TAS:TAIR.
DR GO; GO:0009664; P:plant-type cell wall organization; IEA:InterPro.
DR GO; GO:0009737; P:response to abscisic acid; IEP:TAIR.
DR GO; GO:0009753; P:response to jasmonic acid; IEP:TAIR.
DR GO; GO:0009751; P:response to salicylic acid; IEP:TAIR.
DR GO; GO:0009611; P:response to wounding; IEP:TAIR.
DR InterPro; IPR006706; Extensin_dom.
DR Pfam; PF04554; Extensin_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Cell wall; Cell wall biogenesis/degradation;
KW Glycoprotein; Hydroxylation; Reference proteome; Repeat; Secreted; Signal.
FT SIGNAL 1..19
FT /evidence="ECO:0000255"
FT CHAIN 20..373
FT /note="Extensin-1"
FT /id="PRO_0000008725"
FT REPEAT 25..33
FT /note="1-1"
FT REPEAT 34..40
FT /note="2-1"
FT REPEAT 41..49
FT /note="1-2"
FT REPEAT 50..56
FT /note="2-2"
FT REPEAT 57..65
FT /note="1-3"
FT REPEAT 66..72
FT /note="2-3"
FT REPEAT 73..81
FT /note="1-4"
FT REPEAT 82..88
FT /note="2-4"
FT REPEAT 97..105
FT /note="1-5"
FT REPEAT 106..112
FT /note="2-5"
FT REPEAT 113..121
FT /note="1-6"
FT REPEAT 122..128
FT /note="2-6"
FT REPEAT 129..137
FT /note="1-7"
FT REPEAT 138..144
FT /note="2-7"
FT REPEAT 145..153
FT /note="1-8"
FT REPEAT 154..160
FT /note="2-8"
FT REPEAT 161..169
FT /note="1-9"
FT REPEAT 170..176
FT /note="2-9"
FT REPEAT 177..185
FT /note="1-10"
FT REPEAT 186..192
FT /note="2-10"
FT REPEAT 193..201
FT /note="1-11"
FT REPEAT 202..208
FT /note="2-11"
FT REPEAT 209..217
FT /note="1-12"
FT REPEAT 218..224
FT /note="2-12"
FT REPEAT 225..233
FT /note="1-13"
FT REPEAT 234..240
FT /note="2-13"
FT REPEAT 241..248
FT /note="3-1"
FT REPEAT 249..256
FT /note="4-1"
FT REPEAT 257..264
FT /note="3-2"
FT REPEAT 265..272
FT /note="4-2"
FT REPEAT 273..280
FT /note="3-3"
FT REPEAT 281..288
FT /note="4-3"
FT REPEAT 289..296
FT /note="3-4"
FT REPEAT 297..304
FT /note="4-4"
FT REPEAT 305..312
FT /note="3-5"
FT REPEAT 313..320
FT /note="4-5"
FT REGION 25..233
FT /note="13 X 9 AA repeats of S-P-P-P-P-V-K-[HY]-Y"
FT REGION 34..240
FT /note="13 X 7 AA repeats of S-P-P-P-V-Y-K"
FT REGION 241..312
FT /note="5 X 8 AA repeats of S-P-P-P-P-V-H-Y"
FT REGION 249..320
FT /note="5 X 8 AA repeats of S-P-P-P-V-V-Y-H"
FT REGION 329..332
FT /note="Isodityrosine cross-linking"
FT /evidence="ECO:0000255"
FT REGION 349..373
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 363..366
FT /note="Isodityrosine cross-linking"
FT /evidence="ECO:0000255"
FT VAR_SEQ 152..278
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:11475326"
FT /id="VSP_008897"
FT CONFLICT 95..182
FT /note="Missing (in Ref. 1; AAA85899)"
FT /evidence="ECO:0000305"
FT CONFLICT 334
FT /note="Missing (in Ref. 1; AAA85899)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 373 AA; 41555 MW; 0446F20942734A9A CRC64;
MASFLVLAFS LAFVSQTTAN YFYSSPPPPV KHYSPPPVYK SPPPPVKHYS PPPVYKSPPP
PVKHYSPPPV YKSPPPPVKY YSPPPVYKSP PPPVYKSPPP PVKHYSPPPV YKSPPPPVKH
YSPPPVYKSP PPPVKHYSPP PVYKSPPPPV KHYSPPPVYK SPPPPVKYYS PPPVYKSPPP
PVKHYSPPPV YKSPPPPVKY YSPPPVYKSP PPPVKHYSPP PVYKSPPPPV KYYSPPPVYK
SPPPPVHYSP PPVVYHSPPP PVHYSPPPVV YHSPPPPVHY SPPPVVYHSP PPPVHYSPPP
VVYHSPPPPV HYSPPPVVYH SPPPPKKHYE YKSPPPPVHY SPPTVYHSPP PPVHHYSPPH
QPYLYKSPPP PHY