CO1A2_PARSU
ID CO1A2_PARSU Reviewed; 913 AA.
AC C0HLJ2;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Parocnus serus (Greater Haitian ground sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Megalonychidae; Parocnus.
OX NCBI_TaxID=2546659 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC humerus epiphysis bone collected in Haiti.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLJ2; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 8.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Hydroxylation; Secreted.
FT CHAIN 1..913
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448455"
FT REGION 1..913
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..162
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 882..896
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 35
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 41
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 338
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 341
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 21
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 28
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 102
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 108
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 118
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 149
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 167
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 186
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 204
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 213
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 222
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 228
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 243
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 295
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 304
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 314
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 343
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 349
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 367
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 370
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 377
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 389
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 420
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 446
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 471
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 514
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 526
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 616
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 667
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 682
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 730
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 731
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 736
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 737
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 739
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 748
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 761
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 763
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 795
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 806
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 821
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 824
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 826
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 829
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 873
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 17..18
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 75..76
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 110..111
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 175..176
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 284..285
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 403..404
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 427..428
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 440..441
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 505..506
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 763..764
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 774..775
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 824..825
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 837..838
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 913
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 913 AA; 81633 MW; EB54DBB1240061F1 CRC64;
SGGFDFSFLP QPPQEKAGVG LGPGPMGLMG PRGPPGASGA PGPQGFQGPA GEPGEPGQTG
PAGARGPAGP PGKAGPGKPG RPGERGVVGP QGARGFPGTP GLPGFKGIRG GQTGARGLPG
ERGRVGAPGP AGARGSDGSV GPVGPAGPIG SAGPPGFPGA PGPKGELGPV GNTGPGPAGP
RGEQGLPGVS GPVGPPGNPG ANGLTGAKGA AGLPGVAGAP GLPGPRGIPG PVGASGATGA
RGLVGEPGPA GSKGESGGKG EPGSAGPQGP PGSSGEEGKR GPNGGSTGPT GPPGLRGGPG
SRGLPGADGR AGVIGPAGAR GASGPAGVRG PSGDTGRPGE PGLMGARGLP GSPGNVGPAG
KEGPAGLPGI DGRPGPIGPA GARGEAGNIG FPGPKGPAGD PGKGAPGPDG NNGAQGPPGL
QGVQGGKGTT GEAGKPGERG PGEFGLPGPA GPRGERGPPG ESGAVGPSGA IGSRGPSGPP
GPDGNKGEPG VVGAPGTAGP AGSGGPGERG AAGIPGGKGE KGETGLRGEV GTTGRDGARG
APGAVGAPGP AGATGDRGEA GAAGPAGPAG PRGSPGERGE VGPAGPNGFA GPAGAAGQPG
AKGERGTKGP KGENGIVGPT GPVGSAGPAG PNGPAGPAGS RGDGGPPGVT GFPGAAGRTG
PPGPSGITGP PGPPGAAGKE GLRGPRGDQG PVGRTGETGA GGPPGFTGEK GPSGEPGTAG
PPGTAGPQGL LGAPGILGLP GSRGERGLPG VAGAVGEPGP LGIGPPGARG PSGAGKHGNR
GEPGPVGSVG PVGALGPRGP SGPQGIRGDK GEPGEKGPRG LPGLGLPGLA GQHGDQGPGP
VGPAGPRGPA GPSGPPGKDG RTGHPGAVGP AGIRGSQGSQ GPSGPPGPPG PPGPPGASGG
GYDFGYEGDF YRA