GUN18_ARATH
ID GUN18_ARATH Reviewed; 478 AA.
AC Q9SZ90; F4JKS2;
DT 05-SEP-2006, integrated into UniProtKB/Swiss-Prot.
DT 25-JAN-2012, sequence version 2.
DT 03-AUG-2022, entry version 120.
DE RecName: Full=Endoglucanase 18;
DE EC=3.2.1.4;
DE AltName: Full=Endo-1,4-beta glucanase 18;
DE Flags: Precursor;
GN OrderedLocusNames=At4g09740; ORFNames=F17A8.90;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP GENE FAMILY.
RX PubMed=15170254; DOI=10.1007/s00239-003-2571-x;
RA Libertini E., Li Y., McQueen-Mason S.J.;
RT "Phylogenetic analysis of the plant endo-beta-1,4-glucanase gene family.";
RL J. Mol. Evol. 58:506-515(2004).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000255|PROSITE-ProRule:PRU10140, ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAB39641.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB78097.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL049482; CAB39641.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161515; CAB78097.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE82789.1; -; Genomic_DNA.
DR PIR; T04021; T04021.
DR RefSeq; NP_849349.1; NM_179018.1.
DR AlphaFoldDB; Q9SZ90; -.
DR SMR; Q9SZ90; -.
DR STRING; 3702.AT4G09740.1; -.
DR CAZy; GH9; Glycoside Hydrolase Family 9.
DR PaxDb; Q9SZ90; -.
DR PRIDE; Q9SZ90; -.
DR EnsemblPlants; AT4G09740.1; AT4G09740.1; AT4G09740.
DR GeneID; 826560; -.
DR Gramene; AT4G09740.1; AT4G09740.1; AT4G09740.
DR KEGG; ath:AT4G09740; -.
DR Araport; AT4G09740; -.
DR TAIR; locus:2118519; AT4G09740.
DR eggNOG; ENOG502QRF6; Eukaryota.
DR HOGENOM; CLU_008926_1_2_1; -.
DR InParanoid; Q9SZ90; -.
DR OMA; KRTDYSH; -.
DR OrthoDB; 1195424at2759; -.
DR BioCyc; ARA:AT4G09740-MON; -.
DR PRO; PR:Q9SZ90; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q9SZ90; baseline.
DR Genevisible; Q9SZ90; AT.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0071555; P:cell wall organization; IEA:UniProtKB-KW.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; SSF48208; 1.
DR PROSITE; PS60032; GH9_1; 1.
DR PROSITE; PS00592; GH9_2; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism; Cell wall biogenesis/degradation;
KW Cellulose degradation; Glycoprotein; Glycosidase; Hydrolase;
KW Polysaccharide degradation; Reference proteome; Secreted; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT CHAIN 22..478
FT /note="Endoglucanase 18"
FT /id="PRO_0000249270"
FT REGION 433..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..448
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 76
FT /note="Nucleophile"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10140"
FT ACT_SITE 398
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10059"
FT ACT_SITE 449
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10060"
FT ACT_SITE 458
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10060"
FT CARBOHYD 29
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 442
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 478 AA; 52514 MW; 594B26EAD9E82FD3 CRC64;
MGKLLVVMLI GMFLAFESLE ALDYGDALNK SILFFEGQRS GKLPTNQRVK WRADSGLSDG
ASANVNLIGG YYDAGDNVKF VWPMSFTTTL LSWAALEYQN EITFVNQLGY LRSTIKWGTN
FILRAHTSTN MLYTQVGDGN SDHSCWERPE DMDTPRTLYS ISSSSPGSEA AGEAAAALAA
ASLVFKLVDS TYSSKLLNNA KSLFEFADKY RGSYQASCPF YCSHSGYQDE LLWAAAWLYK
ATGEKSYLNY VISNKDWSKA INEFSWDNKF AGVQALLASE FYNGANDLEK FKTDVESFVC
ALMPGSSSQQ IKPTPGGILF IRDSSNLQYV TTATTILFYY SKTLTKAGVG SIQCGSTQFT
VSQIRNFAKS QVDYILGNNP LKMSYMVGFG TKYPTQPHHR GSSLPSIQSK PEKIDCNGGF
SYYNFDTPNP NVHTGAIVGG PNSSDQYSDK RTDYSHAEPT TYINAAFIGS VAALISSS