CO1A2_NEOCO
ID CO1A2_NEOCO Reviewed; 1011 AA.
AC C0HLJ0;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 7.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Neocnus comes (Miller's Hispaniolan ground sloth) (Synocnus comes).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Megalonychidae; Neocnus.
OX NCBI_TaxID=2546658 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC bone collected in Haiti. {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLJ0; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..1011
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448457"
FT REGION 1..1011
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 980..994
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 35
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 41
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 106
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 367
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 370
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 106
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 21
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 28
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 102
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 108
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 114
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 117
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 147
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 177
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 195
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 215
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 233
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 241
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 250
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 256
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 271
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 324
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 333
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 343
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 372
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 378
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 396
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 399
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 406
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 418
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 441
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 462
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 483
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 501
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 507
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 532
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 575
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 587
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 677
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 728
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 743
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 791
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 792
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 797
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 798
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 800
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 809
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 822
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 824
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 834
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 885
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 896
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 911
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 914
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 920
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 923
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 926
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 971
FT /note="I or L"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 17..18
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 75..76
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 161..162
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 235..236
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 309..310
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 566..567
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 824..825
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 831..832
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 860..861
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1011
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 1011 AA; 90758 MW; 967442B5F4D2566A CRC64;
SGGFDFSFLP QPPQEKAGVG LGPGPMGLMG PRGPPGASGA PGPQGFQGPA GEPGEPGQTG
PAGARGPAGP PGKAGPGKPG RPGERGVVGP QGARGFPGTP GLPGFKGIRG HNGLDGLKGQ
PGAPGVKGEP GAPGENGTPG QTGARGLPGE RGRVGAPGPA GRGSDGSVGP VGPAGPIGSA
GPPGFPGAPG PKGELGPVGN TGPAGPAGPR GEQGLPGVSG PVGPPGNPGA NGLTGKGAAG
LPGVAGAPGL PGPRGIPGPV GASGATGARG LVGEPGPAGS KGESGGKGEP GSAGPQGPPG
SSGEEGKRGN GEAGSTGPTG PPGLRGGPGS RGLPGADGRA GVIGPAGARG ASGPAGVRGP
SGDTGRPGEP GLMGARGLPG SPGNVGPAGK EGPVGLPGID GRPGPIGPAG ARGEAGNIGF
PGPKGPAGDP GKGGEKGHAG LAGNRGAPGP DGNNGAQGPP GLQGVQGGKG EQGPAGPPGF
QGLPGPAGTT GEAGKPGERG IPGEFGLPGP AGPRGERGPP GESGAVGPSG AIGSRGPSGP
PGPDGNKGEP GVVGAPGTAG PAGSGGPGER GAAGIPGGKG EKGETGLRGE VGTTGRDGAR
GAPGAVGAPG PAGATGDRGE AGAAGPAGPA GPRGSPGERG EVGPAGPNGF AGPAGAAGQP
GAKGERGTKG PKGENGIVGP TGPVGSAGPA GPNGPAGPAG SRGDGGPPGV TGFPGAAGRT
GPPGPSGITG PPGPPGAAGK EGLRGPRGDQ GPVGRTGETG AGGPPGFTGE KGPSGEPGTA
GPPGTAGPQG LLGAPGILGL PGSRGERGLP GVAGAVGEPG PLGIGPPGAR GDGLPGHKGE
RGYAGNAGPV GAAGAPGPHG VGPAGKHGNR GEPGPVGSVG PVGALGPRGP SGPQGIRGDK
GEPGEKGPRG LPGLKGHNGL QGLPGLAGQH GDQGSPGPVG PAGPRGPAGP SGPPGKDGRT
GHPGAVGPAG IRGSQGSQGP SGPPGPPGPP GPPGASGGGY DFGYEGDFYR A
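
The SQ header above declares 1011 residues, and the sequence is laid out in space-separated groups of 10 residues, 60 per full line. The following is a minimal sketch, not part of the UniProt record, of how that declared length could be checked against the sequence lines. The helper names (`parse_sq_header`, `join_sequence`, `glycine_fraction`) are hypothetical and written only for this illustration. The glycine fraction is included because the PTM comment refers to the G-X-Y repeating unit, so glycine should be frequent in a collagen chain.

```python
import re


def parse_sq_header(header: str) -> int:
    """Return the declared residue count from an 'SQ SEQUENCE <n> AA; ...' line."""
    match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", header)
    if match is None:
        raise ValueError("not an SQ header line")
    return int(match.group(1))


def join_sequence(lines):
    """Concatenate sequence lines, dropping the spaces between residue groups."""
    return "".join("".join(line.split()) for line in lines)


def glycine_fraction(seq: str) -> float:
    """Fraction of residues that are glycine (0.0 for an empty sequence)."""
    return seq.count("G") / len(seq) if seq else 0.0


# Header and final sequence line copied from the entry above.
header = "SQ SEQUENCE 1011 AA; 90758 MW; 967442B5F4D2566A CRC64;"
last_line = "GHPGAVGPAG IRGSQGSQGP SGPPGPPGPP GPPGASGGGY DFGYEGDFYR A"

declared = parse_sq_header(header)
tail = join_sequence([last_line])

# Assumption: the 16 sequence lines before the last one are full (60 residues each).
reconstructed_length = 16 * 60 + len(tail)
print(declared, reconstructed_length, round(glycine_fraction(tail), 2))
```

To check the whole entry rather than the last line, pass all sequence lines to `join_sequence` and compare `len()` of the result with `declared`.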