CO1A1_EQUSP
ID CO1A1_EQUSP Reviewed; 895 AA.
AC C0HJN9;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-1(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-1 type I collagen {ECO:0000250|UniProtKB:P02452};
DE Flags: Fragments;
GN Name=COL1A1 {ECO:0000250|UniProtKB:P02452};
OS Equus sp.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=46122 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibril-forming
CC collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from fossils. The
CC tryptic peptides required multiple purification steps in order to
CC eliminate contaminants and to increase the concentration of peptidic
CC material. {ECO:0000305|PubMed:25799987}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN9; -.
DR PRIDE; C0HJN9; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 12.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Phosphoprotein; Repeat; Secreted.
FT CHAIN 1..895
FT /note="Collagen alpha-1(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433493"
FT REGION 1..895
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..48
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..252
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..382
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..611
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 640..656
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 90
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT MOD_RES 556
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT UNSURE 11
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 77
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 83
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 92
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 125
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 224
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 275
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 294
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 296
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 344
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 350
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 451
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 473
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 507
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 519
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 534
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 538
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 608
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 702
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 711
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 720
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 750
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 823
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 843
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 882
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 885
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 889
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 87..88
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 288..289
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 303..304
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 404..405
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 482..483
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 500..501
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 532..533
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 565..566
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 590..591
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 617..618
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 655..656
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 715..716
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 838..839
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 895 AA; 79214 MW; FBA66215183F694F CRC64;
GPMGPSGPRG IPGPPGAPGP QGFQGPPGEP GEPGASGPMG PRGPPGPPGK NGDDGEAGKP
GRPGERGPPG PQGARGIPGT AGIPGMKGFS GIDGAKGDAG PAGPKGEPGS PGENGAPGQM
GPRGIPGERG RPGAPGPAGA RGNDGATGAA GPPGPTGPAG PPGFPGAVGA KGEAGPQGAR
GSEGPQGVRG EPGPPGPAGA AGPAGNPGAD GQPGAKGANG APGIAGAPGF PGARGPSGPQ
GPSGPPGPKG NSGEPGAPGN KGDTGAKGEP GPTGIQGPPG PAGEEGKRGE PGPIGIPGPP
GERGFPGADG VAGPKGPAGE RGAPGPAGPK GSPGEAGRPG EAGIPGAKGI TGSPGSPGPD
GKTGPPGPAG QDGRPGPPGP PGARGQAGVM GFPGPKGAAG EPGKGVPGPP GAVGPAGKDG
EAGAQGPPGP AGPAGERGEQ GPAGSPGFQG IPGPAGPPGE SGKPGEQGVP GDIGAPGPSG
ARGFPGERGV QGPPGPAGPR SQGAPGIQGM PGERGAAGIP GPKGDRGDAG PKGITGPIGP
PGPAGAPGDK GETGPSGPAG PTGARRGEPG PPGPAGFAGP PGADGQPGAK GDAGPPGPAG
PAGPPGPIGS VGAPGPKGSA GPPGATGFPG AAGRVGPPGP SGNAGPPGPP GPVGKGPRGE
TGPAGRPGEA GPPGPPGPAG EKGSPGADGP AGAPGTPGPQ GIAGQRGVVG IPGQRGFPGI
PGPSGEPGKQ GPSGASGERG PPGPVGPPGI AGPPGESGRE GSPGAEGSAG RDGSPGPKGD
RGETGPAGPP GAPGAPGAPG PVGPAGKSGD RGEAGPAGPA GPIGPVGARG PAGPQGPRGF
SGIQGPPGPP GSPGEQGPSG ASGPAGPRGP PGSAGAPGKD GINGIPGPIG PPGPR