CO1A2_CHOHO
ID CO1A2_CHOHO Reviewed; 1006 AA.
AC C0HLG8;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 6.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Choloepus hoffmanni (Hoffmann's two-fingered sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Megalonychidae; Choloepus.
OX NCBI_TaxID=9358 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLG8; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extracellular matrix; Glycoprotein;
KW Hydroxylation; Secreted.
FT CHAIN 1..1006
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448479"
FT REGION 1..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 99..1006
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 157..171
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 975..989
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 35
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 41
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 86
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 350
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 353
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 86
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 21
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 28
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 82
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 94
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 97
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 127
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 176
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 196
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 214
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 223
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 232
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 253
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 307
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 316
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 355
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 361
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 379
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 424
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 445
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 466
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 490
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 550
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 571
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 723
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 771
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 772
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 778
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 780
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 789
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 802
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 828
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 880
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 906
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 909
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 915
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 918
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 921
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 17..18
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 65..66
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 580..581
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 804..805
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 811..812
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1006
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1006 AA; 90206 MW; EC7A9488782B2721 CRC64;
SGGFDFSFLP QPPQEKAGVG LGPGPMGLMG PRGPPGASGA PGPQGFQGPA GEPGEPGQTG
PAGARGVVGP QGARGFPGTP GLPGFKGIRG YNGLDGLKGQ PGAAGVKGEP GAPGENGTPG
QTGARGLPGE RGRVGAPGPA GSRGSDGSVG PVGPAGPIGS AGPPGFPGAP GPKGELGPVG
NTGPSGPAGP RGEQGLPGVS GPVGPPGNPG ANGLTGAKGA AGLPGVAGAP GLPGPRGIPG
PVGASGATGA RGLVGEPGPA GSKGESGGKG EPGSAGPQGP PGSSGEEGKR GPSGESGSTG
PTGPPGLRGG PGSRGLPGAD GRAGVIGPAG ARGASGPAGV RGPSGDTGRP GEPGLMGARG
LPGSPGNVGP AGKEGPAGLP GIDGRPGPIG PAGARGEAGN IGFPGPKGPA GDPGKGGEKG
HAGLAGNRGA PGPDGNNGAQ GPPGLQGVQG GKGEQGPAGP PGFQGLPGPA GTTGEAGKPG
ERGIPGEFGL PGPAGPRGER GPPGESGAVG PSGAIGSRGP SGPPGPDGNK GEPGVVGAPG
TAGPAGSGGL PGERGAAGIP GGKGEKGETG LRGEVGTTGR GAPGAVGAPG PAGATGDRGE
AGAAGPAGPA GPRGSPGERG EVGPAGPNGF AGPAGAAGQP GAKGERGTKG PKGENGIVGP
TGPVGSAGPA GPNGPAGPAG SRGDGGPPGV TGFPGAAGRT GPPGPSGITG PPGPPGAAGK
EGLRGPRGDQ GPVGRTGETG AGGPPGFTGE KGPSGEPGTA GPPGTAGPQG LLGAPGILGL
PGSRGERGLP GVAGAVGEPG PLGIGPPGAR GGRDGNPGSD GPPGRDGLPG HKGERGYAGN
PGPVGAAGAP GPHGAVGPAG KHGNRGEPGP VGSAGPVGAL GPRGPSGPQG IRGDKGEAGD
KGPRGLPGLK GHNGLQGLPG LAGQHGDQGA PGAVGPAGPR GPSGPSGPPG KDGRTGHPGA
VGPAGIRGSQ GSQGPSGPPG PPGPPGPPGA SGGGYDFGYE GDFYRA