位置:首页 > 蛋白库 > CO1A1_TAPTE
CO1A1_TAPTE
ID   CO1A1_TAPTE             Reviewed;         963 AA.
AC   C0HJN7;
DT   22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT   22-JUL-2015, sequence version 1.
DT   03-AUG-2022, entry version 14.
DE   RecName: Full=Collagen alpha-1(I) chain {ECO:0000303|PubMed:25799987};
DE   AltName: Full=Alpha-1 type I collagen {ECO:0000250|UniProtKB:P02452};
DE   Flags: Fragments;
GN   Name=COL1A1 {ECO:0000250|UniProtKB:P02452};
OS   Tapirus terrestris (Lowland tapir) (Brazilian tapir).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Perissodactyla; Tapiridae; Tapirus.
OX   NCBI_TaxID=9801 {ECO:0000303|PubMed:25799987};
RN   [1] {ECO:0000305}
RP   PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC   TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX   PubMed=25799987; DOI=10.1038/nature14249;
RA   Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA   Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA   Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA   Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA   Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA   Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT   "Ancient proteins resolve the evolutionary history of Darwin's South
RT   American ungulates.";
RL   Nature 522:81-84(2015).
CC   -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC       forming collagen). {ECO:0000305}.
CC   -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC       {ECO:0000305}.
CC   -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC       Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC   -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC       In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC       {ECO:0000305}.
CC   -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC       (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; C0HJN7; -.
DR   PRIDE; C0HJN7; -.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR   InterPro; IPR008160; Collagen.
DR   Pfam; PF01391; Collagen; 12.
PE   1: Evidence at protein level;
KW   Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW   Hydroxylation; Phosphoprotein; Repeat; Secreted.
FT   CHAIN           1..963
FT                   /note="Collagen alpha-1(I) chain"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT                   /id="PRO_0000433498"
FT   REGION          1..963
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..37
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        230..244
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        360..383
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        613..627
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        659..673
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        947..963
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOD_RES         82
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:P02454"
FT   MOD_RES         558
FT                   /note="Phosphoserine"
FT                   /evidence="ECO:0000250|UniProtKB:P02454"
FT   UNSURE          11
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          66
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          72
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          84
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          117
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          216
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          267
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          291
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          345
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          351
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          453
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          497
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          509
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          536
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          540
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          624
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          725
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          734
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          746
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          776
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          849
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          878
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          887
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          926
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          929
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   UNSURE          933
FT                   /note="I or L"
FT                   /evidence="ECO:0000269|PubMed:25799987"
FT   NON_CONS        23..24
FT                   /evidence="ECO:0000303|PubMed:25799987"
FT   NON_CONS        452..453
FT                   /evidence="ECO:0000303|PubMed:25799987"
FT   NON_CONS        496..497
FT                   /evidence="ECO:0000303|PubMed:25799987"
SQ   SEQUENCE   963 AA;  85634 MW;  CF3777DD55E203E0 CRC64;
     GPMGPSGPRG IPGPPGAPGP QGFASGPMGP RGPPGPPGKN GDDGEAGKPG RPGERGPPGP
     QGARGIPGTA GIPGMKGHRG FSGIDGAKGD AGPAGPKGEP GSPGENGAPG QMGPRGIPGE
     RGRPGAPGPA GARGNDGATG AAGPPGPTGP AGPPGFPGAV GAKGEAGPQG ARGSEGPQGV
     RGEPGPPGPA GAAGPAGNPG ADGQPGAKGA NGAPGIAGAP GFPGARGPSG PQGPSGPPGP
     KGNSGEPGAP GSKGDTGAKG EPGPTGIQGP PGPAGEEGKR GARGEPGPTG IPGPPGERGG
     PGARGFPGSD GVAGPKGPAG ERGAPGPAGP KGSPGEAGRP GEAGIPGAKG ITGSPGSPGP
     DGKTGPPGPA GQDGRPGPPG PPGARGQAGV MGFPGPKGAA GEPGKAGERG VPGPPGAVGP
     AGKDGEAGAQ GPPGPAGPAG ERGEQGPAGS PGIGAPGPSG ARGERGFPGE RGVQGPPGPA
     GPRGANGAPG NDGAKGIQGM PGERGAAGIP GPKGDRGDAG PKGADGSPGK DGVRGITGPI
     GPPGPAGAPG DKGESGPSGP AGPTGARGAP GDRGEPGPPG PAGFAGPPGA DGQPGAKGEP
     GDAGAKGDAG PPGPAGPTGP PGPIGNVGAP GPKGARGSAG PPGATGFPGA AGRVGPPGPS
     GNAGPPGPPG PVGKEGGKGP RGETGPAGRP GEAGPPGPPG PAGEKGSPGA DGPAGAPGTP
     GPQGIAGQRG VVGIPGQRGE RGFPGIPGPS GEPGKQGPSG ASGERGPPGP VGPPGIAGPP
     GESGREGAPG AEGSPGRDGS PGPKGDRGET GPAGPPGAPG APGAPGPVGP AGKSGDRGET
     GPAGPAGPIG PVGARGPAGP QGPRGDKGET GEQGDRGIKG HRGFSGIQGP PGPPGSPGEQ
     GPSGASGPAG PRGPPGSAGA PGKDGINGIP GPIGPPGPRG RTGDAGPVGP PGPPGPPGPP
     GPP
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024