CO1A2_MACSX
ID CO1A2_MACSX Reviewed; 907 AA.
AC C0HJP6;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Macrauchenia sp.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Litopterna; Macraucheniidae; Macrauchenia;
OC unclassified Macrauchenia.
OX NCBI_TaxID=1563127 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, HYDROXYLATION AT PRO-272, DEAMIDATION AT ASN-260, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from fossils. The
CC tryptic peptides required multiple purification steps in order to
CC eliminate contaminants and to increase the concentration of peptidic
CC material. {ECO:0000305|PubMed:25799987}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJP6; -.
DR PRIDE; C0HJP6; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extinct organism protein;
KW Extracellular matrix; Hydroxylation; Repeat; Secreted.
FT CHAIN 1..907
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433503"
FT REGION 1..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 199..907
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..138
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 891..907
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 260
FT /note="Deamidated asparagine"
FT /evidence="ECO:0000269|PubMed:25799987"
FT MOD_RES 272
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 5
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 49
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 55
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 61
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 64
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 94
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 125
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 143
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 163
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 181
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 190
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 199
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 205
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 220
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 274
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 280
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 319
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 343
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 346
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 365
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 387
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 405
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 411
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 436
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 471
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 480
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 492
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 619
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 634
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 673
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 679
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 680
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 685
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 686
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 688
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 697
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 710
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 712
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 807
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 822
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 825
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 831
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 834
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 837
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 871
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 882
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 9..10
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 30..31
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 276..277
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 379..380
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 493..494
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 559..560
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 635..636
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 751..752
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 907 AA; 80939 MW; 003B53B58D014453 CRC64;
GPMGIMGPRR GPPGPPGKAG EDGHPGKPGR ERGVVGPQGA RGFPGTPGIP GFKGIRGHNG
IDGIKGQPGA PGVKGEPGAP GENGTPGQAG ARGIPGERGR VGAPGPAGAR GSDGSVGPVG
PAGPIGSAGP PGFPGAPGPK GEIGPVGNPG PAGPAGPRGE VGIPGVSGPV GPPGNPGANG
IPGAKGAAGI PGVAGAPGIP GPRGIPGPVG AAGATGARGI VGEPGPAGTK GESGNKGEPG
SAGPQGPPGP SGEEGKRGPN GEAGSAGPTG PPGIRGSRGI PGADGRAGVM GPPGSRGASG
PAGVRGPNGD SGRPGEPGIM GPRGFPGSPG NVGPAGKEGP AGIPGIDGRP GPAGPAGARG
EPGNIGFPGP KGPTGDPGKG PPGFQGIPGP AGTAGEAGKP GERGIPGEFG IPGPAGARGE
RGPPGESGAA GPAGPIGSRG PSGPPGPDGA KGEPGVVGAP GTAGPSGPSG IPGERGAAGI
PGPKGEKGET GIRGAPGAVG APGPAGANGD RGEAGPAGPA GPAGPRGSPG ERGEVGPAGP
NGFAGPAGAA GQPGAKGERK GPKGENGPVG PTGPVGSAGP AGPNGPPGPA GSRGDGGPPG
ATGFPGAAGR TGPPGPAGIT GPPGPPGAAG KEGIRGDQGP VGRAGETGAS GPPGFAGEKG
PNGEAGTAGA PGIPGPQGII GAPGIIGIPG SRGERGIPGV AGSVGEPGPI GIAGPPGARG
PPGAVGSPGV NGAPGEAGRD GNPGNDGPPG RGYPGNSGPV GAAGAPGSPG PVGPTGKHGN
RGEPGPVGSV GPAGAVGPRG PSGPQGIRGD KGEPGDKGPR GIPGIKGHNG IQGIPGIAGQ
HGDQGAPGSV GPAGPRGPAG PTGPAGKDGR IGHPGSVGPA GIRGSQGSQG PAGPPGPPGP
PGPPGPS