LEGA_GOSHI
ID LEGA_GOSHI Reviewed; 509 AA.
AC P09802; Q39790;
DT 01-JUL-1989, integrated into UniProtKB/Swiss-Prot.
DT 15-DEC-1998, sequence version 2.
DT 03-AUG-2022, entry version 98.
DE RecName: Full=Legumin A;
DE AltName: Full=Beta-globulin;
DE AltName: Full=LEGA-C94;
DE Contains:
DE RecName: Full=Legumin A acidic chain;
DE Contains:
DE RecName: Full=Legumin A basic chain;
DE Flags: Precursor;
GN Name=LEGA;
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
RX PubMed=16668521; DOI=10.1104/pp.97.3.1268;
RA Galau G.A., Wang H.Y., Hughes D.W.;
RT "Sequence of the Gossypium hirsutum D-genome alloallele of Legumin A and
RT its mRNA.";
RL Plant Physiol. 97:1268-1270(1991).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 3-509.
RX AGRICOLA=IND87019922; DOI=10.1007/BF00020331;
RA Chlan C.A., Pyle J.B., Legocki A.B., Dure L. III;
RT "Developmental biochemistry of cottonseed embryogenesis and germination.
RT XVIII. cDNA and amino acid sequences of the members of the storage protein
RT families.";
RL Plant Mol. Biol. 7:475-489(1986).
CC -!- FUNCTION: This is a seed storage protein.
CC -!- SUBUNIT: Hexamer; each subunit is composed of an acidic and a basic
CC chain derived from a single precursor and linked by a disulfide bond.
CC -!- SIMILARITY: Belongs to the 11S seed storage protein (globulins) family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M69188; AAA33053.1; -; Genomic_DNA.
DR EMBL; M73072; AAA33065.1; -; mRNA.
DR EMBL; M16905; AAA33072.1; -; mRNA.
DR PIR; B30838; FWCNBA.
DR RefSeq; XP_016701249.1; XM_016845760.1.
DR AlphaFoldDB; P09802; -.
DR SMR; P09802; -.
DR STRING; 3635.P09802; -.
DR PRIDE; P09802; -.
DR GeneID; 107916455; -.
DR KEGG; ghi:107916455; -.
DR OMA; KFGRGQE; -.
DR Proteomes; UP000189702; Chromosome 3.
DR GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR GO; GO:0010431; P:seed maturation; IEA:UniProt.
DR Gene3D; 2.60.120.10; -; 3.
DR InterPro; IPR022379; 11S_seedstore_CS.
DR InterPro; IPR006044; 11S_seedstore_pln.
DR InterPro; IPR006045; Cupin_1.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR Pfam; PF00190; Cupin_1; 2.
DR PRINTS; PR00439; 11SGLOBULIN.
DR SMART; SM00835; Cupin_1; 2.
DR SUPFAM; SSF51182; SSF51182; 1.
DR PROSITE; PS00305; 11S_SEED_STORAGE; 1.
PE 2: Evidence at transcript level;
KW Disulfide bond; Reference proteome; Seed storage protein; Signal;
KW Storage protein.
FT SIGNAL 1..21
FT CHAIN 22..509
FT /note="Legumin A"
FT /id="PRO_0000032060"
FT CHAIN 22..324
FT /note="Legumin A acidic chain"
FT /id="PRO_0000032061"
FT CHAIN 325..509
FT /note="Legumin A basic chain"
FT /id="PRO_0000032062"
FT DOMAIN 39..273
FT /note="Cupin type-1 1"
FT /evidence="ECO:0000255"
FT DOMAIN 337..486
FT /note="Cupin type-1 2"
FT /evidence="ECO:0000255"
FT REGION 187..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 298..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..227
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..245
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 298..322
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 34..67
FT /evidence="ECO:0000250"
FT DISULFID 110..331
FT /note="Interchain (between acidic and basic chains)"
FT /evidence="ECO:0000255"
FT CONFLICT 3
FT /note="I -> V (in Ref. 2; AAA33072)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 509 AA; 58425 MW; E9235CCC9F946F37 CRC64;
MAINPSLLFL SLLFLFNGCL ARQTFSSQQS QNECQINRLR ASAPQTRIRS EAGTTEWWNP
NCQQLRCAGV SVMRQTIEPN GLVLPSFTNA PQLLYIVQGR GIQGIVMPGC AETFQDSQQW
QHQSRGRFQD QHQKVRRFRQ GDIIALPQGV VHWSYNDGNE RVVTINLLDT GNSANQLDNI
PRRFHLAGNP EEEQRQLRRL AQQMQGRSER GEESEEEEGE GEEEEEEDNP SRRSRHQEEE
EQGRESSSCN NLLCAFDRNF LAQAFNVDHD IIRKIQRVRG NRGTIIRVRD RLQVVTPPRM
EEEEREERQQ EQRYRHTRGG SQDNGLEETF CSMRIKENLA DPERADIFNP QAGRISTLNR
FNLPILQRLE LSAERGVLYN RAGLIPQWNV NAHKILYMLR GCARVQVVNH NGDAVFDDNV
EQGQLLTVPQ NFAFMKQAGN EGAEWISFFT NSEATNTPMA GSVSFMRALP EEVVAASYQV
SREDARRIKF NNKNTFFFTP SQSERRADA