CO1A2_GLORB
ID CO1A2_GLORB Reviewed; 1002 AA.
AC C0HLI4;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Glossotherium robustum (Ground sloth) (Mylodon robustus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; Glossotherium.
OX NCBI_TaxID=2591764 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC skull bone collected in Buenos Aires in Argentina.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLI4; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 3.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Pyrrolidone carboxylic acid; Secreted.
FT CHAIN 1..1002
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448473"
FT REGION 1..1002
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 160..174
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 986..1002
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 28
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 34
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 89
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 352
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 355
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 89
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 21
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 85
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 91
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 97
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 100
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 130
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 161
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 179
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 198
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 216
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 225
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 234
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 240
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 255
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 309
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 318
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 328
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 357
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 363
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 381
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 384
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 391
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 403
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 426
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 447
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 468
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 486
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 492
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 517
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 552
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 561
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 573
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 663
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 714
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 729
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 777
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 778
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 783
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 784
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 786
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 795
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 808
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 810
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 844
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 896
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 907
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 922
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 926
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 929
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 932
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 977
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 16..17
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 68..69
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 187..188
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 810..811
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 823..824
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 835..836
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 924..925
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1002
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1002 AA; 89326 MW; D81CA55AF8194FA3 CRC64;
SGGFDFSFLP QPPQEKGPMG LMGPRGPPGA SGAPGPQGFQ GPAGEPGEPG QTGPAGARGP
AGPPGKAGGV VGPQGARGFP GTPGLPGFKG IRGHNGLDGL KGQPGAPGVK GEPGAPGENG
TPGQTGARGL PGERGRVGAP GPAGSRGSDG SVGPVGPAGP IGSAGPPGFP GAPGPKGELG
PVGNTGPGPA GPRGEQGLPG VSGPVGPPGN PGANGLTGAK GAAGLPGVAG APGLPGPRGI
PGPVGASGAT GARGLVGEPG PAGSKGESGG KGEPGSAGPQ GPPGSSGEEG KRGPSGESGS
TGPTGPPGLR GGPGSRGLPG ADGRAGVIGP AGARGASGPA GVRGPSGDTG RPGEPGLMGA
RGLPGSPGNV GPAGKEGPAG LPGIDGRPGP IGPAGARGEA GNIGFPGPKG PAGDPGKAGE
KGHAGLAGNR GAPGPDGNNG AQGPPGLQGV QGGKGEQGPA GPPGFQGLPG PAGTTGEAGK
PGERGIPGEF GLPGPAGPRG ERGPPGESGA VGPSGAIGSR GPSGPPGPDG NKGEPGVVGA
PGTAGPAGSG GLPGERGAAG IPGGKGEKGE TGLRGEVGTT GRDGARGAPG AVGAPGPAGA
TGDRGEAGAA GPAGPAGPRG SPGERGEVGP AGPNGFAGPA GAAGQPGAKG ERGTKGPKGE
NGIVGPTGPV GSAGPAGPNG PAGPAGSRGD GGPPGVTGFP GAAGRTGPPG PSGITGPPGP
PGAAGKEGLR GPRGDQGPVG RTGETGAGGP PGFTGEKGPS GEPGTAGPPG TAGPQGLLGA
PGILGLPGSR GERGLPGVAG AVGEPGPLGI GPPGARGPSG GVGPGVNGAP GEAGRDGPPG
RDGLPGHKGE RGYAGNAGPV GAAGAPGPHG AVGPAGKHGN RGEPGPVGSA GPVGALGPRG
PSGPQGIRGD KGEAGDKGPR GLPGGLQGLP GLAGQHGDQG APGPVGPAGP RGPAGPSGPP
GKDGRTGHPG AVGPAGIRGS QGSQGPSGPP GPPGPPGPPG AS