CO1A2_DOESX
ID CO1A2_DOESX Reviewed; 1015 AA.
AC C0HLI2;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Doedicurus sp. (South American giant glyptodont).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Cingulata; Chlamyphoridae; Doedicurus;
OC unclassified Doedicurus.
OX NCBI_TaxID=1849957 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC scute bone collected at Camet Norte in Argentina.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLI2; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 9.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..1015
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448463"
FT REGION 1..1015
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..177
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 38
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 106
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 346
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 349
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 106
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 24
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 31
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 102
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 108
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 114
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 117
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 133
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 164
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 182
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 202
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 219
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 228
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 234
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 249
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 303
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 312
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 351
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 356
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 374
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 377
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 396
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 418
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 439
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 460
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 478
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 484
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 509
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 543
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 552
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 564
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 654
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 687
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 705
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 720
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 768
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 769
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 774
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 775
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 777
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 786
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 799
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 801
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 837
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 915
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 918
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 924
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 927
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 930
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 17..18
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 41..42
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 127..128
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 213..214
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 353..354
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 410..411
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 538..539
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 815..816
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 827..828
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1015
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1015 AA; 91340 MW; 0376F42BEF5398C1 CRC64;
SGGFDFSFLP QPPQEKGDGK GVGLGPGPMG LMGPRGPPGA SFQGPAGEPG EPGQTGPAGA
RGPAGPPGKA GEDGHPGKPG RPGERGVVGP QGARGFPGTP GLPGFKGIRG HNGLDGLKGQ
AGAPGVKTGA RGLPGERGRV GAPGPAGARG SDGSVGPVGP AGPIGSAGPP GFPGAPGPKG
ELGPVGNPGP AGPAGPRGEQ GLPGVSGPVG PPGKGAAGLP GVAGAPGLPG PRGIPGPVGA
VGATGARGLV GEPGPAGSKG ESGNKGEPGS AGPQGPPGPS GEEGKRGPSG ESGSTGPTGP
PGLRGGPGSR GLPGADGRAG VMGPAGSRGA SGPAGVRGPS GDTGRPGEPG LMGRGLPGSP
GNTGPAGKEG PVGLPGIDGR PGPVGPAGPR GEAGNIGFPG PKGPTGDPGK GEKGHAGLAG
NRGAPGPDGN NGAQGPPGLQ GVQGGKGEQG PAGPPGFQGL PGPSGTTGEA GKPGERGIHG
EFGLPGPAGP RGERGPPGES GAAGPVGPIG SRGPSGPPGP DGNKGEPGVV GAPGTAGPGS
GGLPGERGAA GIPGGKGEKG ETGLRGEVGT TGRDGARGAP GAVGAPGPAG ATGDRGEAGA
AGPAGPAGPR GSPGERGEVG PAGPNGFAGP AGAAGQPGAK GERGTKGPKG ENGIVGPTGP
VGAAGPSGPN GAPGPAGGRG DGGPPGLTGF PGAAGRTGPP GPSGITGPPG PPGAAGKEGL
RGPRGDQGPV GRTGETGAGG PPGFAGEKGP SGEPGTAGPP GTAGPQGLLG APGILGLPGS
RGERGLPGVA GAVGEPGPLG ISGPPGARGP PGAVGPGVNG APGEAGRSDG PPGRDGLPGH
KGERGYAGNA GPVGAAGAPG PHGTVGPAGK HGNRGEPGPA GSVGPVGAVG PRGPSGPQGV
RGDKGEAGDK GPRGLPGLKG HNGLQGLPGL AGQHGDQGSP GPVGPAGPRG PAGPSGPAGK
DGRTGHPGAV GPAGVRGSQG SQGPSGPAGP PGPPGPPGAS GGGYDFGYEG DFYRA