ID CO1A2_BRAVA Reviewed; 979 AA.
AC C0HLH0;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Bradypus variegatus (Brown-throated three-fingered sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Bradypodidae; Bradypus.
OX NCBI_TaxID=9355 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen
CC (fibril-forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha-2(I) and two alpha-1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLH0; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 11.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extracellular matrix; Glycoprotein;
KW Hydroxylation; Secreted.
FT CHAIN 1..979
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448477"
FT REGION 1..979
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..184
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 948..962
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 38
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 44
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 99
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 334
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 337
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 99
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 24
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 31
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 95
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 107
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 110
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 140
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 189
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 209
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 227
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 236
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 245
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 253
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 305
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 339
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 345
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 363
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 425
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 446
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 470
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 525
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 546
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 702
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 749
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 750
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 756
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 758
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 767
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 780
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 806
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 853
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 879
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 882
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 888
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 891
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 894
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 16..17
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 23..24
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 78..79
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 249..250
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 267..268
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 312..313
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 331..332
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 404..405
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 470..471
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 520..521
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 714..715
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 782..783
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 789..790
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 810..811
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 815..816
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 828..829
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 979
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 979 AA; 88144 MW; 2353663F0B87A01E CRC64;
SGGFDFSFLP QPPQEKHDGG RYYLGPGPMG LMGPRGPPGA SGAPGPQGFQ GPAGEPGEPG
QTGPAGARGP AGPPGKAGGV VGPQGARGFP GTPGLPGFKG IRGHNGLDGL KGQPGAPGVK
GEPGAPGENG TPGQTGARGL PGERGRVGAP GPAGARGSDG SVGPVGPAGP IGSAGPPGFP
GAPGPKGELG PVGSTGPSGP AGPRGEQGLP GVSGPVGPPG NPGANGLTGA KGAAGLPGVA
GAPGLPGPRA RGLVGEPGPA GSKGESGGEP GSAGPQGPPG SSGEEGKRGP SGESGSTGPT
GPPGLRGGPG SRAGVIGPAG ARGASGPAGV RGRPGEPGLM GARGLPGSPG NVGPAGKEGP
VGLPGIDGRP GPIGPAGARG EAGNIGFPGP KGPAGDPGKA GEKGAGNRGA PGPDGNNGAQ
GPPGLQGVQG GKGEQGPAGP PGFQGLPGPA GTTGEAGKPG ERGIPGEFGL PGPAGPRGER
GPPGESGAVG PSGAIGSRGP SGPPGPDGNK GEPGVVGAPG GSGGLPGERG AAGIPGGKGE
KGETGLRGEV GTTGRDGARG APGAVGAPGP AGATGDRGEA GAAGPAGPAG PRGSPGERGE
VGPAGPNGFA GPAGAAGQPG AKGERGTKGP KGENGIVGPT GPVGSAGPAG PNGPAGPAGS
RGDGGPPGAT GFPGAAGRTG PPGPSGITGP PGPPGAAGKE GLRGPRGDQG PVGRGETGAG
GPPGFTGEKG PSGEPGTAGP PGTAGPQGLL GAPGILGLPG SRGERGLPGV AGAVGEPGPL
GIGPPGARGG RDGNPGSDGP PGRDGLPGHK GYAGNGPVGA AGAPGPHGVG PAGKHGNRGE
PGPVGSVGPV GALGPRGPSG PQGIRGDKGE PGDKGPRGLP GLKGHNGLQG LPGLAGHHGD
QGAPGPVGPA GPRGPAGPSG PAGKDGRTGH PGAVGPAGIR GSQGSQGPSG PPGPPGPPGP
PGASGGGYDF GYEGDFYRA