GUN4_ARATH
ID GUN4_ARATH Reviewed; 489 AA.
AC O49296;
DT 19-SEP-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-1998, sequence version 1.
DT 03-AUG-2022, entry version 140.
DE RecName: Full=Endoglucanase 4;
DE EC=3.2.1.4;
DE AltName: Full=Endo-1,4-beta glucanase 4;
DE Flags: Precursor;
GN OrderedLocusNames=At1g23210; ORFNames=F26F24.6, T26J12.2;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Underwood B.A., Xiao Y.-L., Moskal W.A. Jr., Monaghan E.L., Wang W.,
RA Redman J.C., Wu H.C., Utterback T., Town C.D.;
RL Submitted (MAY-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Quinitio C., Chen H., Kim C.J., Shinn P., Ecker J.R.;
RT "Arabidopsis ORF clones.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP GENE FAMILY.
RX PubMed=15170254; DOI=10.1007/s00239-003-2571-x;
RA Libertini E., Li Y., McQueen-Mason S.J.;
RT "Phylogenetic analysis of the plant endo-beta-1,4-glucanase gene family.";
RL J. Mol. Evol. 58:506-515(2004).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000255|PROSITE-ProRule:PRU10140, ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC002311; AAC00616.1; -; Genomic_DNA.
DR EMBL; AC005292; AAF86995.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30357.1; -; Genomic_DNA.
DR EMBL; DQ056459; AAY78616.1; -; mRNA.
DR EMBL; BT026379; ABH04486.1; -; mRNA.
DR PIR; E86366; E86366.
DR RefSeq; NP_173735.1; NM_102170.2.
DR AlphaFoldDB; O49296; -.
DR SMR; O49296; -.
DR STRING; 3702.AT1G23210.1; -.
DR CAZy; GH9; Glycoside Hydrolase Family 9.
DR PaxDb; O49296; -.
DR PRIDE; O49296; -.
DR ProteomicsDB; 247244; -.
DR EnsemblPlants; AT1G23210.1; AT1G23210.1; AT1G23210.
DR GeneID; 838930; -.
DR Gramene; AT1G23210.1; AT1G23210.1; AT1G23210.
DR KEGG; ath:AT1G23210; -.
DR Araport; AT1G23210; -.
DR TAIR; locus:2028015; AT1G23210.
DR eggNOG; ENOG502QR9R; Eukaryota.
DR HOGENOM; CLU_008926_1_2_1; -.
DR InParanoid; O49296; -.
DR OMA; QKYRGAY; -.
DR OrthoDB; 1195424at2759; -.
DR PhylomeDB; O49296; -.
DR BioCyc; ARA:AT1G23210-MON; -.
DR PRO; PR:O49296; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; O49296; baseline and differential.
DR Genevisible; O49296; AT.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0071555; P:cell wall organization; IEA:UniProtKB-KW.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; SSF48208; 1.
DR PROSITE; PS60032; GH9_1; 1.
DR PROSITE; PS00592; GH9_2; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 2: Evidence at transcript level;
KW Carbohydrate metabolism; Cell wall biogenesis/degradation;
KW Cellulose degradation; Glycoprotein; Glycosidase; Hydrolase;
KW Polysaccharide degradation; Reference proteome; Secreted; Signal.
FT SIGNAL 1..25
FT /evidence="ECO:0000255"
FT CHAIN 26..489
FT /note="Endoglucanase 4"
FT /id="PRO_0000249257"
FT ACT_SITE 81
FT /note="Nucleophile"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10140"
FT ACT_SITE 409
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10059"
FT ACT_SITE 460
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10060"
FT ACT_SITE 469
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU10060"
FT CARBOHYD 453
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
SQ SEQUENCE 489 AA; 54630 MW; 9B46F1C3111C4D0B CRC64;
MAGKSFMTPA IMLAMLLLIS PETYAGHDYR DALRKSILFF EGQRSGKLPP DQRLKWRRDS
ALRDGSSAGV DLTGGYYDAG DNVKFGFPMA FTTTMMSWSV IDFGKTMGPE LENAVKAIKW
GTDYLMKATQ IPDVVFVQVG DAYSDHNCWE RPEDMDTLRT VYKIDKDHSG SEVAGETAAA
LAAASIVFEK RDPVYSKMLL DRATRVFAFA QKYRGAYSDS LYQAVCPFYC DFNGYEDELL
WGAAWLHKAS KKRVYREFIV KNQVILRAGD TIHEFGWDNK HAGINVLVSK MVLMGKAEYF
QSFKQNADEF ICSLLPGISH PQVQYSQGGL LVKSGGSNMQ HVTSLSFLLL TYSNYLSHAN
KVVPCGEFTA SPALLRQVAK RQVDYILGDN PMKMSYMVGY GSRFPQKIHH RGSSVPSVVD
HPDRIGCKDG SRYFFSNNPN PNLLIGAVVG GPNITDDFPD SRPYFQLTEP TTYINAPLLG
LLGYFSAHY