CO1A2_MEGAE
ID CO1A2_MEGAE Reviewed; 1050 AA.
AC C0HLJ6;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Megatherium americanum (Giant ground sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Megatheriidae; Megatherium.
OX NCBI_TaxID=2546660 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC rib bone collected at Bariloche in Argentina and estimated to be around
CC 19050 years old. {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLJ6; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 9.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..1050
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448469"
FT REGION 1..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..198
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1019..1033
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 40
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 46
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 114
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 377
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 380
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 114
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 26
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 33
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 110
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 116
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 122
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 125
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 154
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 185
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 203
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 223
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 241
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 250
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 259
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 265
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 280
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 334
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 343
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 353
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 382
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 388
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 406
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 409
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 416
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 428
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 451
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 493
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 511
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 517
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 542
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 573
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 582
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 594
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 684
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 735
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 750
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 798
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 799
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 804
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 805
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 807
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 816
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 829
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 831
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 872
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 924
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 935
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 950
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 953
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 959
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 962
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 965
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 1010
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 16..17
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 22..23
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 51..52
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 131..132
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 563..564
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 845..846
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1050
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1050 AA; 94631 MW; DB631B50506460E4 CRC64;
SGGFDFSFLP QPPQEKDGGR YYGVGLGPGP MGLMGPRGPP GASGAPGPQG FGPAGEPGEP
GQTGPAGARG PPGAPGKAGE DGHPGKPGRP GERGVVGPQG ARGFPGTPGL PGFKGIRGHN
GLDGLKGQPG AGVKGEPGAP GENGTPGQTG ARGLPGERGR VGAPGPAGAR GSDGSVGPVG
PAGPIGSAGP PGFPGAPGPK GELGPVGNTG PSGPAGPRGE QGLPGVSGPV GPPGNPGANG
LTGAKGAAGL PGVAGAPGLP GPRGIPGPVG ASGATGARGL VGEPGPAGSK GESGNKGEPG
SAGPQGPPGS SGEEGKRGPN GESGSTGPTG PPGLRGNPGS RGLPGADGRA GVIGPAGARG
ASGPAGVRGP SGDTGRPGEP GLMGARGLPG SPGNVGPAGK EGPAGLPGID GRPGPIGPAG
ARGEAGNIGF PGPKGPAGDP GKAGEKGHAG LAGNRGAPGP DGNNGAQGPP GPQGVQGGKG
EQGPAGPPGF QGLPGPAGTT GEAGKPGERG IPGEFGLPGP AGPRGERGPP GESGAVGPSG
AIGSRGPSGP PGPDGNKGEP GVVTAGPAGS GGLPGERGAA GIPGGKGEKG ETGLRGEVGT
TGRDGARGAP GAVGAPGPAG ATGDRGEAGA AGPAGPAGPR GSPGERGEVG PAGPNGFAGP
AGAAGQPGAK GERGTKGPKG ENGIVGPTGP VGSAGPAGPN GPAGPAGSRG DGGPPGMTGF
PGAAGRTGPP GPSGITGPPG PPGAAGKEGL RGPRGDQGPV GRTGETGAGG PPGFTGEKGP
SGEPGTAGPP GTAGPQGLLG APGILGLPGS RGERGLPGVA GAVGEPGPLG IAGPPGARGP
SGAVGPGVNG APGETGRDGN PGSDGPPGRD GLPGHKGERG YAGNAGPVGA AGAPGPHGTV
GPAGKHGNRG EPGPVGSVGP VGALGPRGPS GPQGIRGDKG EPGDKGPRGL PGLKGHNGLQ
GLPGLAGQHG DQGSPGPVGP AGPRGPAGPS GPAGKDGRTG HPGAVGPAGI RGSQGSQGPS
GPPGPPGPPG PPGASGGGYD FGYEGDFYRA