CO1A2_MYLDA
ID CO1A2_MYLDA Reviewed; 1033 AA.
AC C0HJP4; C0HLH8;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 2.
DT 03-AUG-2022, entry version 15.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Mylodon darwinii (Giant ground sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; Mylodon.
OX NCBI_TaxID=48784 {ECO:0000303|PubMed:25799987};
RN [1]
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
RN [2] {ECO:0000305}
RP PROTEIN SEQUENCE OF 37-70; 79-99; 103-111; 121-137; 145-186; 205-264;
RP 282-320; 335-354; 373-420; 433-490; 495-539; 594-626; 633-656; 696-732;
RP 748-763; 784-796; 800-853 AND 938-978, TISSUE SPECIFICITY, AND
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000250|UniProtKB:P08123}.
CC Secreted, extracellular space {ECO:0000250|UniProtKB:P08123}. Secreted,
CC extracellular space, extracellular matrix
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- TISSUE SPECIFICITY: Expressed in bone. {ECO:0000269|PubMed:25799987,
CC ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from ancient bone
CC material (PubMed:25799987, PubMed:31171860). The displayed protein
CC fragments were extracted from an ancient caudal vertebra bone collected
CC in Ultima Esperanza, Chile and around 13045 years old
CC (PubMed:31171860). The tryptic peptides required multiple purification
CC steps in order to eliminate contaminants and to increase the
CC concentration of peptidic material (PubMed:25799987).
CC {ECO:0000269|PubMed:25799987, ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJP4; -.
DR PRIDE; C0HJP4; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extinct organism protein;
KW Extracellular matrix; Hydroxylation; Repeat; Secreted.
FT CHAIN 1..1033
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433504"
FT REGION 1..1033
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 169..183
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1002..1016
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 25
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 32
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 94
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 100
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 106
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 109
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 139
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 170
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 188
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 208
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 226
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 235
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 244
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 250
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 265
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 319
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 328
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 338
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 367
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 373
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 391
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 394
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 401
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 413
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 436
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 457
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 478
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 496
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 502
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 523
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 558
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 567
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 579
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 669
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 720
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 735
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 783
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 784
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 789
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 790
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 792
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 801
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 814
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 816
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 856
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 907
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 918
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 933
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 936
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:31171860"
FT UNSURE 942
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 945
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 948
FT /note="L or I"
FT /evidence="ECO:0000269|PubMed:25799987,
FT ECO:0000269|PubMed:31171860"
FT UNSURE 993
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:31171860"
FT CONFLICT 41..42
FT /note="AS -> ST (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 44
FT /note="A -> V (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 47
FT /note="P -> A (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 146
FT /note="V -> I (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 152..154
FT /note="AGS -> SGA (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 433
FT /note="H -> N (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 511..513
FT /note="ERG -> APGE (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 519..523
FT /note="PSGAI -> TVGAP (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 613
FT /note="A -> S (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 615..616
FT /note="AA -> PS (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 621
FT /note="P -> T (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 748
FT /note="T -> A (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 816
FT /note="I -> IH (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 818..819
FT /note="PP -> TT (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 829
FT /note="G -> GA (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 957
FT /note="S -> A (in Ref. 2; AA sequence)"
FT /evidence="ECO:0000305"
FT NON_CONS 24..25
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 77..78
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 513..514
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 816..817
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 829..830
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 868..869
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1033
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1033 AA; 92769 MW; 2F139BF5618484D8 CRC64;
SGGFDFSFLP QPPQEKAHDG GRYYLGPGPM GLMGPRGPPG ASGAPGPQGF QGPAGEPGEP
GQTGPAGARG PAGPPGKGVV GPQGARGFPG TPGLPGFKGI RGHNGLDGLK GQPGAPGVKG
EPGAPGENGT PGQTGARGLP GERGRVGAPG PAGSRGSDGS VGPVGPAGPI GSAGPPGFPG
APGPKGELGP VGNTGPSGPA GPRGEQGLPG VSGPVGPPGN PGANGLTGAK GAAGLPGVAG
APGLPGPRGI PGPVGASGAT GARGLVGEPG PAGSKGESGG KGEPGSAGPQ GPPGSSGEEG
KRGPSGESGS TGPTGPPGLR GGPGSRGLPG ADGRAGVIGP AGARGASGPA GVRGPSGDTG
RPGEPGLMGA RGLPGSPGNV GPAGKEGPAG LPGIDGRPGP IGPAGARGEA GNIGFPGPKG
PAGDPGKAGE KGHAGLAGNR GAPGPDGNNG AQGPPGLQGV QGGKGEQGPA GPPGFQGLPG
PAGTTGEAGK PGERGIPGEF GLPGPAGPRG ERGSGAVGPS GAIGSRGPSG PPGPDGNKGE
PGVVGAPGTA GPAGSGGLPG ERGAAGIPGG KGEKGETGLR GEVGTTGRDG ARGAPGAVGA
PGPAGATGDR GEAGAAGPAG PAGPRGSPGE RGEVGPAGPN GFAGPAGAAG QPGAKGERGT
KGPKGENGIV GPTGPVGSAG PAGPNGPAGP AGSRGDGGPP GVTGFPGAAG RTGPPGPSGI
TGPPGPPGAA GKEGLRGPRG DQGPVGRTGE TGAGGPPGFT GEKGPSGEPG TAGPPGTAGP
QGLLGAPGIL GLPGSRGERG LPGVAGAVGE PGPLGIGPPG ARGPSGGVGP GVNGAPGEAG
RDGNPGSDGP PGRDGLPGHK GERGYAGNGP VGAAGAPGPH GAVGPAGKHG NRGEPGPVGS
AGPVGALGPR GPSGPQGIRG DKGEAGDKGP RGLPGLKGHN GLQGLPGLAG QHGDQGSPGP
VGPAGPRGPA GPSGPPGKDG RTGHPGAVGP AGIRGSQGSQ GPSGPPGPPG PPGPPGASGG
GYDFGYEGDF YRA