CO1A2_MEGJE
ID CO1A2_MEGJE Reviewed; 874 AA.
AC C0HLJ8;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Megalonyx jeffersonii (Jefferson's ground sloth) (Megatherium jeffersonii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Megalonychidae; Megalonyx.
OX NCBI_TaxID=2576014 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC pelvis bone collected in Newburgh, New York, USA and estimated to be
CC around 11255 years old. {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLJ8; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 5.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Pyrrolidone carboxylic acid; Secreted.
FT CHAIN 1..874
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448475"
FT REGION 1..874
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..151
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 16
FT /note="Allysine"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 34
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 40
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 93
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 317
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 320
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 93
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 20
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 27
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 89
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 138
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 156
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 176
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 193
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 202
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 211
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 217
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 226
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 280
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 284
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 322
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 328
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 346
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 349
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 356
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 368
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 379
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 397
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 415
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 421
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 446
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 481
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 490
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 580
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 613
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 631
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 643
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 687
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 688
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 693
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 694
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 696
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 705
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 718
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 720
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 776
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 787
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 790
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 793
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 796
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 841
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 16..17
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 72..73
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 93..94
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 111..112
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 178..179
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 223..224
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 282..283
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 298..299
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 374..375
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 383..384
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 485..486
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 637..638
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 644..645
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 652..653
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 728..729
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 740..741
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 788..789
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 858..859
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 874
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 874 AA; 78113 MW; 00A001CDFEFDB497 CRC64;
SGGFDFSFLP QPPQEKGVGL GPGPMGLMGP RGPPGASGAP GPQGFQGPAG EPGEPGQTGP
AGARGPAGPP GKGVVGPQGA RGFPGTPGLP GFKGEPGAPG ENGTPGQTGA RGRVGAPGPA
GARGSDGSVG PVGPAGPIGS AGPPGFPGAP GPKGELGPVG NTGPSGPAGP RGEQGLPGSG
PVGPPGNPGA NGLTGAKGAA GLPGVAGAPG LPGPRGIPGP VGARGLVGEP GPAGSKGESG
GKGEPGSAGP QGPPGSSGEE GKRGPSGESG STGPTGPPGL RGGLPGADGR AGVMGPAGRG
ASGPAGVRGP SGDTGRPGEP GLMGARGLPG SPGNVGPAGK EGPAGLPGID GRPGPIGPAG
ARGEAGNIGF PGPKGHAGLA GNRGEQGPAG PPGFQGLPGP AGTTGEAGKP GERGIPGEFG
LPGPAGPRGE RGPPGESGAV GPSGAIGSRG PSGPPGPDGN KGEPGVVGAP GTAGPAGSGG
LPGERGETGL RGEVGTTGRD GARGAPGAVG APGPAGATGD RGEAGAAGPA GPAGPRGSPG
ERGEVGPAGP NGFAGPAGAA GQPGAKGERG TKGPKGENGI VGPTGPVGSA GPAGPNGPAG
PAGSRGDGGP PGLTGFPGAA GRTGPPGPSG ITGPPGPAGK EGLRGDQGPV GRGETGAGGP
PGFTGEKGPS GEPGTAGPPG TAGPQGLLGA PGILGLPGSR GERGLPGVAG AVGEPGPLGI
AGPPGARGDG NPGSDGPPGR GAAGAPGPHG TVGPAGKHGN RGEPGPVGSV GPVGALGPRG
PSGPQGIRGL QGLPGLAGQH GDQGSPGPVG PAGPRGPAGP SGPAGKDGRT GHPGAVGPAG
IRGSQGSQGP SGPAGPPGSG GGYDFGYEGD FYRA