PEN1_ARATH
ID PEN1_ARATH Reviewed; 766 AA.
AC Q9FR95; O23388; Q08J21;
DT 03-MAR-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 121.
DE RecName: Full=Arabidiol synthase;
DE EC=4.2.1.124;
DE AltName: Full=Pentacyclic triterpene synthase 1;
DE Short=AtPEN1;
GN Name=PEN1; Synonyms=04C11; OrderedLocusNames=At4g15340;
GN ORFNames=dl3715c, FCAALL.158;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], AND NOMENCLATURE.
RC STRAIN=cv. Columbia; TISSUE=Hypocotyl;
RX PubMed=11247608; DOI=10.1023/a:1006476123930;
RA Husselstein-Muller T., Schaller H., Benveniste P.;
RT "Molecular cloning and expression in yeast of 2,3-oxidosqualene-
RT triterpenoid cyclases from Arabidopsis thaliana.";
RL Plant Mol. Biol. 45:75-92(2001).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, AND CATALYTIC ACTIVITY.
RX PubMed=16774269; DOI=10.1021/ol060973p;
RA Xiang T., Shibuya M., Katsube Y., Tsutsumi T., Otsuka M., Zhang H.,
RA Masuda K., Ebizuka Y.;
RT "A new triterpene synthase from Arabidopsis thaliana produces a tricyclic
RT triterpene with two hydroxyl groups.";
RL Org. Lett. 8:2835-2838(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=9461215; DOI=10.1038/35140;
RA Bevan M., Bancroft I., Bent E., Love K., Goodman H.M., Dean C.,
RA Bergkamp R., Dirkse W., van Staveren M., Stiekema W., Drost L., Ridley P.,
RA Hudson S.-A., Patel K., Murphy G., Piffanelli P., Wedler H., Wedler E.,
RA Wambutt R., Weitzenegger T., Pohl T., Terryn N., Gielen J., Villarroel R.,
RA De Clercq R., van Montagu M., Lecharny A., Aubourg S., Gy I., Kreis M.,
RA Lao N., Kavanagh T., Hempel S., Kotter P., Entian K.-D., Rieger M.,
RA Schaefer M., Funk B., Mueller-Auer S., Silvey M., James R., Monfort A.,
RA Pons A., Puigdomenech P., Douka A., Voukelatou E., Milioni D.,
RA Hatzopoulos P., Piravandi E., Obermaier B., Hilbert H., Duesterhoeft A.,
RA Moores T., Jones J.D.G., Eneva T., Palme K., Benes V., Rechmann S.,
RA Ansorge W., Cooke R., Berger C., Delseny M., Voet M., Volckaert G.,
RA Mewes H.-W., Klosterman S., Schueller C., Chalwatzis N.;
RT "Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis
RT thaliana.";
RL Nature 391:485-488(1998).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [5]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [6]
RP FUNCTION.
RX PubMed=17474751; DOI=10.1021/ol070709b;
RA Kolesnikova M.D., Obermeyer A.C., Wilson W.K., Lynch D.A., Xiong Q.,
RA Matsuda S.P.T.;
RT "Stereochemistry of water addition in triterpene synthesis: the structure
RT of arabidiol.";
RL Org. Lett. 9:2183-2186(2007).
CC -!- FUNCTION: Converts oxidosqualene to arabidiol. Minor production of
CC arabidiol 20,21-epoxide. {ECO:0000269|PubMed:16774269,
CC ECO:0000269|PubMed:17474751}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=arabidiol = (S)-2,3-epoxysqualene + H2O; Xref=Rhea:RHEA:31035,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15441, ChEBI:CHEBI:62417;
CC EC=4.2.1.124; Evidence={ECO:0000269|PubMed:16774269};
CC -!- SIMILARITY: Belongs to the terpene cyclase/mutase family.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAB10313.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB78576.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF062513; AAF21768.1; -; mRNA.
DR EMBL; AB257562; BAF33292.1; -; mRNA.
DR EMBL; Z97338; CAB10313.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161541; CAB78576.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE83586.1; -; Genomic_DNA.
DR PIR; G71417; G71417.
DR RefSeq; NP_567462.1; NM_117622.3.
DR AlphaFoldDB; Q9FR95; -.
DR SMR; Q9FR95; -.
DR STRING; 3702.AT4G15340.1; -.
DR PaxDb; Q9FR95; -.
DR PRIDE; Q9FR95; -.
DR ProteomicsDB; 236683; -.
DR EnsemblPlants; AT4G15340.1; AT4G15340.1; AT4G15340.
DR GeneID; 827200; -.
DR Gramene; AT4G15340.1; AT4G15340.1; AT4G15340.
DR KEGG; ath:AT4G15340; -.
DR Araport; AT4G15340; -.
DR TAIR; locus:2129995; AT4G15340.
DR eggNOG; KOG0497; Eukaryota.
DR HOGENOM; CLU_009074_2_0_1; -.
DR InParanoid; Q9FR95; -.
DR OrthoDB; 365003at2759; -.
DR PhylomeDB; Q9FR95; -.
DR BioCyc; ARA:AT4G15340-MON; -.
DR BioCyc; MetaCyc:AT4G15340-MON; -.
DR PRO; PR:Q9FR95; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q9FR95; baseline and differential.
DR Genevisible; Q9FR95; AT.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0034075; F:arabidiol synthase activity; IDA:TAIR.
DR GO; GO:0042300; F:beta-amyrin synthase activity; IBA:GO_Central.
DR GO; GO:0016829; F:lyase activity; IEA:UniProtKB-KW.
DR GO; GO:0010263; P:tricyclic triterpenoid biosynthetic process; IDA:TAIR.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IDA:CACAO.
DR CDD; cd02892; SQCY_1; 1.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR002365; Terpene_synthase_CS.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11764; PTHR11764; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; SSF48239; 2.
DR TIGRFAMs; TIGR01787; squalene_cyclas; 1.
DR PROSITE; PS01074; TERPENE_SYNTHASES; 1.
PE 1: Evidence at protein level;
KW Lyase; Reference proteome; Repeat.
FT CHAIN 1..766
FT /note="Arabidiol synthase"
FT /id="PRO_0000366136"
FT REPEAT 149..190
FT /note="PFTB 1"
FT REPEAT 520..561
FT /note="PFTB 2"
FT REPEAT 597..637
FT /note="PFTB 3"
FT REPEAT 646..687
FT /note="PFTB 4"
FT ACT_SITE 491
FT /note="Proton donor"
FT /evidence="ECO:0000250|UniProtKB:P48449"
FT CONFLICT 169
FT /note="H -> Y (in Ref. 2; BAF33292)"
FT /evidence="ECO:0000305"
FT CONFLICT 496
FT /note="S -> G (in Ref. 2; BAF33292)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 766 AA; 88074 MW; 2B5E07D1C2C7E91E CRC64;
MWRLRIGAKA GNDTHLFTTN NYVGRQIWEF DANAGSPQEL AEVEEARRNF SNNRSHYKAS
ADLLWRMQFL REKGFEQKIP RVRVEDAAKI RYEDAKTALK RGLHYFTALQ ADDGHWPADN
SGPNFFIAPL VICLYITGHL EKIFTVEHRI ELIRYMYNHQ NEDGGWGLHV ESPSIMFCTV
INYICLRIVG VEAGHDDDQG STCTKARKWI LDHGGATYTP LIGKACLSVL GVYDWSGCKP
MPPEFWFLPS SFPINGGTLW IYLRDIFMGL SYLYGKKFVA TPTPLILQLQ EELYPEPYTK
INWRLTRNRC AKEDLCYPSS FLQDLFWKGV HIFSESILNR WPFNKLIRQA ALRTTMKLLH
YQDEANRYIT GGSVPKAFHM LACWVEDPEG EYFKKHLARV SDFIWIGEDG LKIQSFGSQL
WDTVMSLHFL LDGVEDDVDD EIRSTLVKGY DYLKKSQVTE NPPSDHIKMF RHISKGGWTF
SDKDQGWPVS DCTAESLKCC LLFERMPSEF VGQKMDVEKL FDAVDFLLYL QSDNGGITAW
EPADGKTWLE WFSPVEFVQD TVIEHEYVEC TGSAIVALTQ FSKQFPEFRK KEVERFITNG
VKYIEDLQMK DGSWCGNWGV CFIYGTLFAV RGLVAAGKTF HNCEPIRRAV RFLLDTQNQE
GGWGESYLSC LRKKYTPLAG NKTNIVSTGQ ALMVLIMGGQ MERDPLPVHR AAKVVINLQL
DNGDFPQQEV MGVFNMNVLL HYPTYRNIYS LWALTLYTQA LRRLQP