ID   CGEP_ARATH              Reviewed;         960 AA.
AC   Q8VZF3; O22913; Q8L635;
DT   05-OCT-2010, integrated into UniProtKB/Swiss-Prot.
DT   03-MAY-2011, sequence version 2.
DT   03-AUG-2022, entry version 105.
DE   RecName: Full=Probable glutamyl endopeptidase, chloroplastic;
DE            EC=3.4.21.-;
DE   Flags: Precursor;
GN   Name=GEP; OrderedLocusNames=At2g47390; ORFNames=T8I13.23;
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia;
RX   PubMed=10617197; DOI=10.1038/45471;
RA   Lin X., Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town C.D.,
RA   Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V.,
RA   Buell C.R., Ketchum K.A., Lee J.J., Ronning C.M., Koo H.L., Moffat K.S.,
RA   Cronin L.A., Shen M., Pai G., Van Aken S., Umayam L., Tallon L.J.,
RA   Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., Goodman H.M.,
RA   Somerville C.R., Copenhaver G.P., Preuss D., Nierman W.C., White O.,
RA   Eisen J.A., Salzberg S.L., Fraser C.M., Venter J.C.;
RT   "Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.";
RL   Nature 402:761-768(1999).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia;
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC   STRAIN=cv. Columbia;
RX   PubMed=14593172; DOI=10.1126/science.1088305;
RA   Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA   Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA   Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA   Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA   Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA   Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA   Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA   Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA   Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA   Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA   Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA   Ecker J.R.;
RT   "Empirical analysis of transcriptional activity in the Arabidopsis
RT   genome.";
RL   Science 302:842-846(2003).
RN   [4]
RP   FUNCTION.
RX   DOI=10.1111/j.1399-3054.2005.00441.x;
RA   Forsberg J., Stroem J., Kieselbach T., Larsson H., Alexciev K.,
RA   Engstroem A., Aekerlund H.-E.;
RT   "Protease activities in the chloroplast capable of cleaving an LHCII N-
RT   terminal peptide.";
RL   Physiol. Plantarum 123:21-29(2005).
CC   -!- FUNCTION: Serine-type protease active in vitro against the LHCII N-
CC       terminal. Cleaves its substrate on the carboxy-side of Glu residues (By
CC       similarity). {ECO:0000250, ECO:0000269|Ref.4}.
CC   -!- SUBCELLULAR LOCATION: Plastid, chloroplast stroma {ECO:0000250}.
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=2;
CC       Name=1;
CC         IsoId=Q8VZF3-1; Sequence=Displayed;
CC       Name=2;
CC         IsoId=Q8VZF3-2; Sequence=VSP_039719;
CC   -!- SIMILARITY: Belongs to the peptidase S9D family. {ECO:0000305}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=AAB63841.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AC002337; AAB63841.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; CP002685; AEC10835.1; -; Genomic_DNA.
DR   EMBL; AY064997; AAL57645.1; -; mRNA.
DR   EMBL; AY099560; AAM20412.1; -; mRNA.
DR   EMBL; BT002650; AAO11566.1; -; mRNA.
DR   PIR; F84914; F84914.
DR   RefSeq; NP_850473.1; NM_180142.2. [Q8VZF3-2]
DR   AlphaFoldDB; Q8VZF3; -.
DR   BioGRID; 4687; 4.
DR   STRING; 3702.AT2G47390.1; -.
DR   ESTHER; arath-CGEP; Glutamyl_Peptidase_S9.
DR   MEROPS; S09.021; -.
DR   iPTMnet; Q8VZF3; -.
DR   PaxDb; Q8VZF3; -.
DR   PeptideAtlas; Q8VZF3; -.
DR   PRIDE; Q8VZF3; -.
DR   ProteomicsDB; 224477; -. [Q8VZF3-1]
DR   EnsemblPlants; AT2G47390.1; AT2G47390.1; AT2G47390. [Q8VZF3-2]
DR   GeneID; 819352; -.
DR   Gramene; AT2G47390.1; AT2G47390.1; AT2G47390. [Q8VZF3-2]
DR   KEGG; ath:AT2G47390; -.
DR   Araport; AT2G47390; -.
DR   TAIR; locus:2065200; AT2G47390.
DR   eggNOG; KOG2100; Eukaryota.
DR   HOGENOM; CLU_017120_0_0_1; -.
DR   InParanoid; Q8VZF3; -.
DR   OMA; FPIQSER; -.
DR   OrthoDB; 265965at2759; -.
DR   PRO; PR:Q8VZF3; -.
DR   Proteomes; UP000006548; Chromosome 2.
DR   ExpressionAtlas; Q8VZF3; baseline and differential.
DR   Genevisible; Q8VZF3; AT.
DR   GO; GO:0009507; C:chloroplast; HDA:TAIR.
DR   GO; GO:0009570; C:chloroplast stroma; IDA:TAIR.
DR   GO; GO:0005829; C:cytosol; HDA:TAIR.
DR   GO; GO:0043621; F:protein self-association; IPI:TAIR.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IDA:TAIR.
DR   GO; GO:0006508; P:proteolysis; IDA:TAIR.
DR   Gene3D; 2.120.10.30; -; 1.
DR   Gene3D; 3.40.50.1820; -; 1.
DR   InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR   InterPro; IPR029058; AB_hydrolase.
DR   InterPro; IPR001375; Peptidase_S9.
DR   Pfam; PF00326; Peptidase_S9; 1.
DR   SUPFAM; SSF53474; SSF53474; 1.
PE   2: Evidence at transcript level;
KW   Alternative splicing; Chloroplast; Hydrolase; Plastid; Protease;
KW   Reference proteome; Serine protease; Transit peptide.
FT   TRANSIT         1..62
FT                   /note="Chloroplast"
FT                   /evidence="ECO:0000255"
FT   CHAIN           63..960
FT                   /note="Probable glutamyl endopeptidase, chloroplastic"
FT                   /id="PRO_0000397884"
FT   REGION          78..98
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          915..960
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        82..98
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   ACT_SITE        780
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000250"
FT   ACT_SITE        854
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000250"
FT   ACT_SITE        888
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000250"
FT   VAR_SEQ         428
FT                   /note="L -> LY (in isoform 2)"
FT                   /evidence="ECO:0000303|PubMed:14593172"
FT                   /id="VSP_039719"
FT   CONFLICT        685
FT                   /note="C -> Y (in Ref. 3; AAL57645/AAM20412/AAO11566)"
FT                   /evidence="ECO:0000305"
SQ   SEQUENCE   960 AA;  106103 MW;  C35EBC0B3B1DDF77 CRC64;
     MMRFHKACHR FSLSPLCHLS PPSPSPASSL LLLPKLSGFS TLSTRRCVRV RRFSENPLTT
     VMASRSASRL RSLASACSGG AEDGGGTSNG SLSASATATE DDELAIGTGY RLPPPEIRDI
     VDAPPVPALS FSPHRDKILF LKRRALPPLA DLARPEEKLA GVRIDGYCNT RSRMSFYTGL
     GIHQLLPDGT LSPEKEITGI PDGGKINFVT WSNDGKHLAF SIRVDENGNS SKPVVWVADV
     ETGVARPLFN SQDIFLNAIF ESFVWIDNST LLVSTIPSSR GEPPKKPLVP SGPKTLSNET
     KTVVQVRTFQ DLLKDEYDAD LFDYYASSQL VLASLDGTVK EVGVPAVYTS LDPSTDHKYL
     LVSSLHRPYS FIVPCGRFPK KVEVWTTDGR FVRQLCDLPL AEDIPIASNS VRKGMRSINW
     RADKPSTLWA ETQDGGDAKM EVSPRDIVYM QSAEPLAGEE PEVLHKLDLR YGGISWCDDT
     LALVYESWYK TRRTRTWVIS PGSNDVSPRI LFDRSSEDVY SDPGSTMLRR TDAGTYVIAK
     IKKENDEGTY VLLNGSGATP QGNVPFLDLF DINTGNKERI WESDKEKYFE TVVALMSDQK
     EGDLKMEELK ILTSKESKTE NTQYSLQLWP DRKVQQITNF PHPYPQLASL QKEMIRYQRK
     DGVQLTATLY LPPGYDPSKD GPLPCLFWSY PGEFKSKDAA GQVRGSPNEF AGIGSTSALL
     WLARRFAILS GPTIPIIGEG DEEANDRYVE QLVASAEAAV EEVVRRGVAD RSKIAVGGHS
     YGAFMTANLL AHAPHLFACG IARSGAYNRT LTPFGFQNED RTLWEATNVY VEMSPFMSAN
     KIKKPILLIH GEEDNNPGTL TMQSDRFFNA LKGHGALCRL VVLPHESHGY SARESIMHVL
     WETDRWLQKY CVPNTSDADT SPDQSKEGSD SADKVSTGTG GGNPEFGEHE VHSKLRRSLL
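
The feature table (FT lines) in the record above gives residue coordinates that are easy to read programmatically. The following is a minimal Python sketch, not an official UniProt parser: the names `FT_HEADER` and `parse_features` are hypothetical, and the pattern assumes plain numeric positions as they appear in this entry (it does not handle uncertain positions or isoform-prefixed ranges). It extracts the feature keys and coordinates, then derives the length of the mature chain once the chloroplast transit peptide (1..62) is removed.

```python
import re

# Matches the first line of a feature: "FT   KEY   start..end" or "FT   KEY   pos".
# Qualifier lines ("FT" followed by many spaces and "/note=...") do not match,
# because the key must start immediately after "FT" plus three spaces.
FT_HEADER = re.compile(r"^FT   (\S+)\s+(\d+)(?:\.\.(\d+))?\s*$")


def parse_features(flat_text):
    """Return a list of (key, start, end) tuples from FT header lines."""
    features = []
    for line in flat_text.splitlines():
        match = FT_HEADER.match(line)
        if match:
            key, start, end = match.groups()
            start = int(start)
            # Single-residue features (e.g. ACT_SITE) have no explicit end.
            end = int(end) if end else start
            features.append((key, start, end))
    return features


# A few FT lines copied from the entry above, used as sample input.
SAMPLE = """\
FT   TRANSIT         1..62
FT                   /note="Chloroplast"
FT   CHAIN           63..960
FT   ACT_SITE        780
FT   VAR_SEQ         428
"""

features = parse_features(SAMPLE)
chain = next(f for f in features if f[0] == "CHAIN")
mature_length = chain[2] - chain[1] + 1  # inclusive coordinates

print(features)
print(mature_length)
```

Coordinates in the feature table are 1-based and inclusive, so the CHAIN feature 63..960 corresponds to 898 residues of the 960-residue precursor.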
 
 