ID CO1A2_PARHA Reviewed; 1041 AA.
AC C0HLK0;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Paramylodon harlani (Harlan's ground sloth) (Glossotherium harlani).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; Paramylodon.
OX NCBI_TaxID=2546661 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibril-forming
CC collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC femur bone collected in Roseburg, Oregon, USA.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLK0; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 4.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..1041
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448467"
FT REGION 1..1041
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 183..197
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 39
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 45
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 113
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 376
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 379
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 113
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 25
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 32
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 109
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 115
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 121
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 124
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 154
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 184
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 202
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 222
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 240
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 249
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 258
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 264
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 279
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 333
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 342
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 352
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 381
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 387
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 405
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 408
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 415
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 427
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 450
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 464
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 485
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 503
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 509
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 534
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 569
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 578
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 590
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 680
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 731
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 746
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 793
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 794
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 799
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 800
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 802
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 811
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 824
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 826
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 865
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 916
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 927
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 942
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 945
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 951
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 954
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 957
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 1001
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 24..25
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 50..51
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 168..169
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 456..457
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 758..759
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 826..827
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 838..839
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 891..892
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 965..966
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1041
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1041 AA; 93673 MW; F37118C5425CBA27 CRC64;
SGGFDFSFLP QPPQEKAHDG GRYYLGPGPM GLMGPRGPPG ASGAPGPQGF GPAGEPGEPG
QTGPAGARGP AGPPGKAGED GHPGKPGRPG ERGVVGPQGA RGFPGTPGLP GFKGIRGHNG
LDGLKGQPGA PGVKGEPGAP GENGTPGQTG ARGLPGERGR VGAPGPAGRG SDGSVGPVGP
AGPIGSAGPP GFPGAPGPKG ELGPVGNTGP SGPAGPRGEQ GLPGVSGPVG PPGNPGANGL
TGAKGAAGLP GVAGAPGLPG PRGIPGPVGA SGATGARGLV GEPGPAGSKG ESGGKGEPGS
AGPQGPPGSS GEEGKRGPSG ESGSTGPTGP PGLRGGPGSR GLPGADGRAG VIGPAGARGA
SGPAGVRGPS GDTGRPGEPG LMGARGLPGS PGNVGPAGKE GPAGLPGIDG RPGPIGPAGA
RGEAGNIGFP GPKGPAGDPG KAGEKGHAGL AGNRGAGAQG PPGLQGVQGG KGEQGPAGPP
GFQGLPGPAG TTGEVGKPGE RGIPGEFGLP GPAGPRGERG PSGESGAVGP SGAIGSRGPS
GPPGPDGNKG EPGVVGAPGT AGPAGSGGLP GERGAAGIPG GKGEKGETGL RGEVGTTGRD
GARGAPGAVG APGPAGATGD RGEAGAAGPA GPAGPRGSPG ERGEVGPAGP NGFAGPAGAA
GQPGAKGERG TKGPKGENGI VGPTGPVGSA GPAGPNGPAG PAGSRGDGGP PGVTGFPGAA
GRTGPPGPSG ITGPPGPPGA AGKEGLRGPR GDQGPVGRGE TGAGGPPGFT GEKGPSGEPG
TAGPPGTAGP QGLLGAPGIL GLPGSRGERG LPGVAGAVGE PGPLGIGPPG ARGPSGGVPG
VNGAPGEAGR DGNPGSDGPP GRDGLPGHKG ERGYAGNPGP VGAAGAPGPQ GVGPVGKHGN
RGEPGPVGSA GPVGALGPRG PSGPQGIRGD KGEAGDKGPR GLPGLKGHNG LQGLPGLAGQ
HGDQGPGPVG PAGPRGPAGP SGPPGKDGRT GHPGAVGPAG IRGSQGSQGP SGPAGPPGPP
GPPGASGGGY DFGYEGDFYR A