CO1A2_TOXSP
ID CO1A2_TOXSP Reviewed; 908 AA.
AC C0HJP8;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Toxodon sp.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Notoungulata; Toxodontidae; Toxodon; unclassified Toxodon.
OX NCBI_TaxID=1563122 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from fossils. The
CC tryptic peptides required multiple purification steps in order to
CC eliminate contaminants and to increase the concentration of peptidic
CC material. {ECO:0000305|PubMed:25799987}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJP8; -.
DR PRIDE; C0HJP8; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 10.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extinct organism protein;
KW Extracellular matrix; Hydroxylation; Repeat; Secreted.
FT CHAIN 1..908
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433507"
FT REGION 1..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 227..908
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..166
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 892..908
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 5
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 77
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 83
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 89
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 92
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 122
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 153
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 171
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 191
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 209
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 218
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 227
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 233
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 248
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 302
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 308
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 336
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 349
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 360
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 363
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 382
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 405
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 420
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 438
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 444
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 469
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 490
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 493
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 504
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 513
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 525
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 533
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 655
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 670
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 691
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 709
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 718
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 719
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 724
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 725
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 727
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 736
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 743
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 749
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 751
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 808
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 823
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 826
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 832
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 835
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 838
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 872
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 883
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 20..21
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 304..305
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 313..314
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 326..327
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 409..410
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 526..527
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 778..779
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 908 AA; 81149 MW; AE9D24955BF57A74 CRC64;
GPMGIMGPRG PPGASGAPGP AGEPGEPGQT GPAGARGPPG PPGKAGEDGH PGKPGRPGER
GVVGPQGARG FPGTPGIPGF KGIRGHNGID GIKGQPGAPG VKGEPGAPGE NGTPGQAGAR
GIPGERGRVG APGPAGARGS DGSVGPVGPA GPIGSAGPPG FPGAPGPKGE IGPVGNPGPA
GPAGPRGEVG IPGVSGPVGP PGNPGANGIT GAKGAAGIPG VAGAPGIPGP RGIPGPVGAA
GATGARGIVG EPGPAGSKGE SGNKGEPGSA GPQGPPGPAG EEGKRGPNGE AGSTGPTGPP
GIRGSRGIPG ADGGSRGATG PAGVRGDSGR PGEPGIMGPR GFPGSPGNIG PAGKEGPVGI
PGIDGRPGPT GPAGARGEPG NIGFPGPKGP TGDPGKNGDK GHAGIAGARG PAGPPGFQGI
PGPAGTAGEV GKPGERGIPG EFGIPGPAGA RGERGPPGES GAVGPAGPIG SRGPSGPPGP
DGNKGEPGNI GAIGTAGPSG PSGIPGERGA AGIPGGKGEK GETGIRRGAP GAIGAPGPAG
ANGDRGEAGP AGPAGPAGPR GSPGERGEVG PAGPNGFAGP AGAAGQPGAK GERGTKGPKG
ENGPVGPTGP VGAAGPAGPN GPPGPAGSRG DGGPPGATGF PGAAGRTGPP GPAGITGPPG
PPGAAGKEGI RGPRGDQGPV GRSGETGASG IPGFAGEKGP AGEPGTAGIP GTPGPQGIIG
APGIIGIPGS RGERGIPGVA GSIGEPGPIG IAGPPGARGP PGAVGNPGVN GAPGEAGRHG
NRGEPGPAGS VGPAGAVGPR GPSGPQGIRG DKGEPGDKGP RGIPGIKGHN GIQGIPGIAG
QHGDQGAPGA VGPAGPRGPA GPSGPAGKDG RIGHPGTVGP AGIRGSQGSQ GPAGPPGPPG
PPGPPGPS
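
The PTM comment above says that prolines at the third position of the G-X-Y repeating unit may be hydroxylated. As a minimal sketch of what that means for this sequence, the Python snippet below lists the Y-position prolines in the first contiguous fragment (residues 1..20, which end at the first NON_CONS junction). The function name `y_position_prolines` is hypothetical, and the code assumes the fragment starts in frame, with its first residue being the G of a triplet. The record states only that hydroxylation occurs "in some or all of the chains", so the positions returned are candidates, not confirmed modification sites.

```python
def y_position_prolines(fragment: str) -> list[int]:
    """Return 1-based positions of prolines in the Y slot of G-X-Y triplets.

    Assumption: the fragment is in frame, i.e. its first residue is the
    glycine of a G-X-Y triplet. Trailing residues that do not form a
    complete triplet are ignored.
    """
    sites = []
    for start in range(0, len(fragment) - 2, 3):
        g, _x, y = fragment[start:start + 3]
        if g == "G" and y == "P":
            sites.append(start + 3)  # 1-based index of the Y residue
    return sites


# Residues 1..20 of the SQ block, up to the first NON_CONS junction (20..21).
fragment = "GPMGIMGPRGPPGASGAPGP"
print(y_position_prolines(fragment))  # [12, 18]
```

Positions are counted within the fragment. Because NON_CONS features mark junctions between non-consecutive peptides, the reading frame should be re-established separately for each fragment and not carried across a junction.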