CO1A2_GLYSX
ID CO1A2_GLYSX Reviewed; 933 AA.
AC C0HLI0;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 6.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Glyptodon sp. (strain SLP-2019) (Giant armadillo).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Cingulata; Chlamyphoridae; Glyptodon;
OC unclassified Glyptodon.
OX NCBI_TaxID=2546663 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC femur bone collected in Buenos Aires in Argentina.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLI0; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..933
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448465"
FT REGION 1..191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 206..933
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 131..145
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 24
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 30
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 94
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 317
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 320
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 94
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 10
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 17
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 90
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 101
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 150
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 170
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 188
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 197
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 206
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 227
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 281
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 290
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 322
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 328
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 346
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 390
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 411
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 432
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 455
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 515
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 536
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 657
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 690
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 738
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 739
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 745
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 747
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 756
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 769
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 800
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 877
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 881
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 884
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 887
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 6..7
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 54..55
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 96..97
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 304..305
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 314..315
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 360..361
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 435..436
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 615..616
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 785..786
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 797..798
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 869..870
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 879..880
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 933
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 933 AA; 83328 MW; 512387D9E758D095 CRC64;
SGGFDFGVGL GPGPMGLMGP RGPPGASGAP GPQGFQGPAG EPGEPGQTGP AGARPGKAGE
DGHPGKPGRP GERGVVGPQG ARGFPGTPGL PGFKGIGARG LPGERGRVGA PGPAGARGSD
GSVGPVGPAG PIGSAGPPGF PGAPGPKGEL GPVGNPGPAG PAGPRGEQGL PGVSGPVGPP
GNPGANGLTG AKGAAGLPGV AGAPGLPGPR GIPGPVGAVG ATGARGLVGE PGPAGSKGES
GNKGEPGSAG PQGPPGPSGE EGKRGPNGES GSTGPTGPPG LRGGPGSRGL PGADGRAGVM
GPAGRGASGP AGVRGRPGEP GLMGPRGLPG SPGNTGPAGK EGPVGLPGID GRPGPVGPAG
RGEAGNIGFP GPKGPTGDPG KAGEKGHAGL AGNRGAPGPD GNNGAQGPPG LQGVQGGKGE
QGPAGPPGFQ GLPGPGTTGE AGKPGERGIH GEFGLPGPAG PRGERGPPGE SGAAGPVGPI
GSRGPSGPPG PDGNKGEPGV VGAPGTAGPA GSGGLPGERG AAGIPGGKGE KGETGLRGEV
GTTGRDGARG APGAVGAPGP AGATGDRGEA GAAGPAGPAG PRGSPGERGE VGPAGPNGFA
GPAGAAGQPG AKGERKGPKG ENGIVGPTGP VGAAGPSGPN GAPGPAGGRG DGGPPGLTGF
PGAAGRTGPP GPSGITGPPG PPGAAGKEGL RGPRGDQGPV GRTGETGAGG PPGFAGEKGP
SGEPGTAGPP GTAGPQGLLG APGILGLPGS RGERGLPGVA GAVGEPGPLG ISGPPGARGP
SGAVGPGVNG APGEAGRDGL PGHKGERGYA GNAGPVGAAG APGPHGSVGP AGKHGNRGEP
GPAGSVGPVG AVGPRGPSGP QGVRGDKGEG DKGPRGLPGG LQGLPGLAGQ HGDQGSPGPV
GPAGPRGPAG PSGPAGKDGR TGHPGAVGPA GVR