CO1A1_TAPTE
ID CO1A1_TAPTE Reviewed; 963 AA.
AC C0HJN7;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-1(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-1 type I collagen {ECO:0000250|UniProtKB:P02452};
DE Flags: Fragments;
GN Name=COL1A1 {ECO:0000250|UniProtKB:P02452};
OS Tapirus terrestris (Lowland tapir) (Brazilian tapir).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Tapiridae; Tapirus.
OX NCBI_TaxID=9801 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN7; -.
DR PRIDE; C0HJN7; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 12.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Phosphoprotein; Repeat; Secreted.
FT CHAIN 1..963
FT /note="Collagen alpha-1(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433498"
FT REGION 1..963
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..37
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..244
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 360..383
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 613..627
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 659..673
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 947..963
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 82
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT MOD_RES 558
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT UNSURE 11
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 66
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 72
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 84
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 117
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 216
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 267
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 291
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 345
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 351
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 453
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 497
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 509
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 536
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 540
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 624
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 725
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 734
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 746
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 776
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 849
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 878
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 887
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 926
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 929
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 933
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 23..24
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 452..453
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 496..497
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 963 AA; 85634 MW; CF3777DD55E203E0 CRC64;
GPMGPSGPRG IPGPPGAPGP QGFASGPMGP RGPPGPPGKN GDDGEAGKPG RPGERGPPGP
QGARGIPGTA GIPGMKGHRG FSGIDGAKGD AGPAGPKGEP GSPGENGAPG QMGPRGIPGE
RGRPGAPGPA GARGNDGATG AAGPPGPTGP AGPPGFPGAV GAKGEAGPQG ARGSEGPQGV
RGEPGPPGPA GAAGPAGNPG ADGQPGAKGA NGAPGIAGAP GFPGARGPSG PQGPSGPPGP
KGNSGEPGAP GSKGDTGAKG EPGPTGIQGP PGPAGEEGKR GARGEPGPTG IPGPPGERGG
PGARGFPGSD GVAGPKGPAG ERGAPGPAGP KGSPGEAGRP GEAGIPGAKG ITGSPGSPGP
DGKTGPPGPA GQDGRPGPPG PPGARGQAGV MGFPGPKGAA GEPGKAGERG VPGPPGAVGP
AGKDGEAGAQ GPPGPAGPAG ERGEQGPAGS PGIGAPGPSG ARGERGFPGE RGVQGPPGPA
GPRGANGAPG NDGAKGIQGM PGERGAAGIP GPKGDRGDAG PKGADGSPGK DGVRGITGPI
GPPGPAGAPG DKGESGPSGP AGPTGARGAP GDRGEPGPPG PAGFAGPPGA DGQPGAKGEP
GDAGAKGDAG PPGPAGPTGP PGPIGNVGAP GPKGARGSAG PPGATGFPGA AGRVGPPGPS
GNAGPPGPPG PVGKEGGKGP RGETGPAGRP GEAGPPGPPG PAGEKGSPGA DGPAGAPGTP
GPQGIAGQRG VVGIPGQRGE RGFPGIPGPS GEPGKQGPSG ASGERGPPGP VGPPGIAGPP
GESGREGAPG AEGSPGRDGS PGPKGDRGET GPAGPPGAPG APGAPGPVGP AGKSGDRGET
GPAGPAGPIG PVGARGPAGP QGPRGDKGET GEQGDRGIKG HRGFSGIQGP PGPPGSPGEQ
GPSGASGPAG PRGPPGSAGA PGKDGINGIP GPIGPPGPRG RTGDAGPVGP PGPPGPPGPP
GPP