SPD1_TRICX
ID SPD1_TRICX Reviewed; 748 AA.
AC P19837;
DT 01-FEB-1991, integrated into UniProtKB/Swiss-Prot.
DT 05-OCT-2010, sequence version 3.
DT 03-AUG-2022, entry version 79.
DE RecName: Full=Spidroin-1;
DE AltName: Full=Dragline silk fibroin 1;
DE Flags: Fragment;
OS Trichonephila clavipes (Golden silk orbweaver) (Nephila clavipes).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Araneae;
OC Araneomorphae; Entelegynae; Araneoidea; Nephilidae; Trichonephila.
OX NCBI_TaxID=2585209;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], AND PARTIAL PROTEIN SEQUENCE.
RX PubMed=2402494; DOI=10.1073/pnas.87.18.7120;
RA Xu M., Lewis R.V.;
RT "Structure of a protein superfiber: spider dragline silk.";
RL Proc. Natl. Acad. Sci. U.S.A. 87:7120-7124(1990).
RN [2]
RP SEQUENCE REVISION TO C-TERMINUS.
RA Xu M., Lewis R.V.;
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE OF 653-747.
RX PubMed=8120021; DOI=10.1016/s0021-9258(17)37425-2;
RA Beckwitt R., Arcidiacono S.;
RT "Sequence conservation in the C-terminal region of spider silk proteins
RT (Spidroin) from Nephila clavipes (Tetragnathidae) and Araneus bicentenarius
RT (Araneidae).";
RL J. Biol. Chem. 269:6661-6663(1994).
CC -!- FUNCTION: Spiders' major ampullate silk possesses unique
CC characteristics of strength and elasticity. Fibroin consists of
CC pseudocrystalline regions of antiparallel beta-sheet interspersed with
CC elastic amorphous segments.
CC -!- SUBUNIT: Major subunit, with spidroin 2, of the dragline silk.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space.
CC -!- DOMAIN: Highly repetitive protein characterized by regions of
CC polyalanine and glycine-rich repeating units.
CC -!- SIMILARITY: Belongs to the silk fibroin family. {ECO:0000305}.
CC -!- WEB RESOURCE: Name=Protein Spotlight; Note=The tiptoe of an airbus
CC - Issue 24 of July 2002;
CC URL="https://web.expasy.org/spotlight/back_issues/024";
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M37137; AAA29380.2; -; mRNA.
DR EMBL; U03848; AAB60212.1; -; Unassigned_DNA.
DR PIR; A36068; A36068.
DR AlphaFoldDB; P19837; -.
DR PCDDB; P19837; -.
DR SMR; P19837; -.
DR PRIDE; P19837; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR Gene3D; 1.10.10.1350; -; 1.
DR InterPro; IPR021001; Spidroin_C.
DR InterPro; IPR038542; Spidroin_C_sf.
DR Pfam; PF11260; Spidroin_MaSp; 1.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Repeat; Secreted; Silk protein.
FT CHAIN <1..748
FT /note="Spidroin-1"
FT /id="PRO_0000221446"
FT REPEAT 1..25
FT /note="1"
FT REPEAT 26..38
FT /note="2"
FT REPEAT 39..66
FT /note="3"
FT REPEAT 67..96
FT /note="4"
FT REPEAT 97..130
FT /note="5"
FT REPEAT 131..158
FT /note="6"
FT REPEAT 159..191
FT /note="7"
FT REPEAT 192..204
FT /note="8"
FT REPEAT 205..235
FT /note="9"
FT REPEAT 236..262
FT /note="10"
FT REPEAT 263..292
FT /note="11"
FT REPEAT 293..305
FT /note="12"
FT REPEAT 306..333
FT /note="13"
FT REPEAT 334..360
FT /note="14"
FT REPEAT 361..394
FT /note="15"
FT REPEAT 395..424
FT /note="16"
FT REPEAT 425..458
FT /note="17"
FT REPEAT 459..485
FT /note="18"
FT REPEAT 486..512
FT /note="19"
FT REPEAT 513..525
FT /note="20"
FT REPEAT 526..555
FT /note="21"
FT REPEAT 556..582
FT /note="22"
FT REPEAT 583..612
FT /note="23"
FT REPEAT 613..642
FT /note="24"
FT REPEAT 643..655
FT /note="25"
FT REGION 1..655
FT /note="25 X approximate tandem repeats"
FT CONFLICT 662
FT /note="V -> L (in Ref. 1; AAA29380)"
FT /evidence="ECO:0000305"
FT CONFLICT 672
FT /note="S -> T (in Ref. 1; AAA29380)"
FT /evidence="ECO:0000305"
FT NON_TER 1
SQ SEQUENCE 748 AA; 60585 MW; 70F50E44B0D649E0 CRC64;
QGAGAAAAAA GGAGQGGYGG LGGQGAGQGG YGGLGGQGAG QGAGAAAAAA AGGAGQGGYG
GLGSQGAGRG GQGAGAAAAA AGGAGQGGYG GLGSQGAGRG GLGGQGAGAA AAAAAGGAGQ
GGYGGLGNQG AGRGGQGAAA AAAGGAGQGG YGGLGSQGAG RGGLGGQGAG AAAAAAGGAG
QGGYGGLGGQ GAGQGGYGGL GSQGAGRGGL GGQGAGAAAA AAAGGAGQGG LGGQGAGQGA
GASAAAAGGA GQGGYGGLGS QGAGRGGEGA GAAAAAAGGA GQGGYGGLGG QGAGQGGYGG
LGSQGAGRGG LGGQGAGAAA AGGAGQGGLG GQGAGQGAGA AAAAAGGAGQ GGYGGLGSQG
AGRGGLGGQG AGAVAAAAAG GAGQGGYGGL GSQGAGRGGQ GAGAAAAAAG GAGQRGYGGL
GNQGAGRGGL GGQGAGAAAA AAAGGAGQGG YGGLGNQGAG RGGQGAAAAA GGAGQGGYGG
LGSQGAGRGG QGAGAAAAAA VGAGQEGIRG QGAGQGGYGG LGSQGSGRGG LGGQGAGAAA
AAAGGAGQGG LGGQGAGQGA GAAAAAAGGV RQGGYGGLGS QGAGRGGQGA GAAAAAAGGA
GQGGYGGLGG QGVGRGGLGG QGAGAAAAGG AGQGGYGGVG SGASAASAAA SRLSSPQASS
RVSSAVSNLV ASGPTNSAAL SSTISNVVSQ IGASNPGLSG CDVLIQALLE VVSALIQILG
SSSIGQVNYG SAGQATQIVG QSVYQALG