VCL_THECC
ID   VCL_THECC               Reviewed;         525 AA.
AC   Q43358; Q9S7V9; Q9SQ35;
DT   21-FEB-2001, integrated into UniProtKB/Swiss-Prot.
DT   01-NOV-1996, sequence version 1.
DT   25-MAY-2022, entry version 75.
DE   RecName: Full=Vicilin;
DE   Flags: Precursor;
GN   Name=CSV; Synonyms=VIC;
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RC   TISSUE=Leaf;
RX   PubMed=1600151; DOI=10.1007/bf00047720;
RA   McHenry L., Fritz P.J.;
RT   "Comparison of the structure and nucleotide sequences of vicilin genes of
RT   cocoa and cotton raise questions about vicilin evolution.";
RL   Plant Mol. Biol. 18:1173-1176(1992).
RN   [2]
RP   NUCLEOTIDE SEQUENCE OF 159-397.
RX   AGRICOLA=IND22012241; DOI=10.2307/2419544;
RA   Whitlock B.A., Baum D.A.;
RT   "Phylogenetic relationships of Theobroma and Herrania (Sterculiaceae) based
RT   on sequences of the nuclear gene vicilin.";
RL   Syst. Bot. 24:128-138(1999).
CC   -!- SIMILARITY: Belongs to the 7S seed storage protein family.
CC       {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X62625; CAA44493.1; -; Genomic_DNA.
DR   EMBL; X62626; CAA44494.1; -; mRNA.
DR   EMBL; AF113046; AAF13477.1; -; Genomic_DNA.
DR   EMBL; AF113047; AAF13478.1; -; Genomic_DNA.
DR   EMBL; AF113048; AAF13479.1; -; Genomic_DNA.
DR   AlphaFoldDB; Q43358; -.
DR   SMR; Q43358; -.
DR   STRING; 3641.EOY05738; -.
DR   eggNOG; ENOG502QQEP; Eukaryota.
DR   GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.120.10; -; 2.
DR   InterPro; IPR006045; Cupin_1.
DR   InterPro; IPR014710; RmlC-like_jellyroll.
DR   InterPro; IPR011051; RmlC_Cupin_sf.
DR   InterPro; IPR006792; Vicilin_N.
DR   Pfam; PF00190; Cupin_1; 2.
DR   Pfam; PF04702; Vicilin_N; 2.
DR   SMART; SM00835; Cupin_1; 2.
DR   SUPFAM; SSF51182; SSF51182; 2.
PE   2: Evidence at transcript level;
KW   Glycoprotein; Seed storage protein; Signal; Storage protein.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000255"
FT   CHAIN           25..525
FT                   /note="Vicilin"
FT                   /id="PRO_0000032180"
FT   DOMAIN          154..303
FT                   /note="Cupin type-1 1"
FT                   /evidence="ECO:0000255"
FT   DOMAIN          350..513
FT                   /note="Cupin type-1 2"
FT                   /evidence="ECO:0000255"
FT   REGION          119..145
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          308..345
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          424..452
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        119..138
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   CARBOHYD        130
FT                   /note="N-linked (GlcNAc...) asparagine"
FT                   /evidence="ECO:0000255"
FT   VARIANT         313
FT                   /note="K -> M"
SQ   SEQUENCE   525 AA;  60798 MW;  19114CD5C248905D CRC64;
     MVISKSPFIV LIFSLLLSFA LLCSGVSAYG RKQYERDPRQ QYEQCQRRCE SEATEEREQE
     QCEQRCEREY KEQQRQQEEE LQRQYQQCQG RCQEQQQGQR EQQQCQRKCW EQYKEQERGE
     HENYHNHKKN RSEEEEGQQR NNPYYFPKRR SFQTRFRDEE GNFKILQRFA ENSPPLKGIN
     DYRLAMFEAN PNTFILPHHC DAEAIYFVTN GKGTITFVTH ENKESYNVQR GTVVSVPAGS
     TVYVVSQDNQ EKLTIAVLAL PVNSPGKYEL FFPAGNNKPE SYYGAFSYEV LETVFNTQRE
     KLEEILEEQR GQKRQQGQQG MFRKAKPEQI RAISQQATSP RHRGGERLAI NLLSQSPVYS
     NQNGRFFEAC PEDFSQFQNM DVAVSAFKLN QGAIFVPHYN SKATFVVFVT DGYGYAQMAC
     PHLSRQSQGS QSGRQDRREQ EEESEEETFG EFQQVKAPLS PGDVFVAPAG HAVTFFASKD
     QPLNAVAFGL NAQNNQRIFL AGRPFFLNHK QNTNVIKFTV KASAY
 
 