CO1A1_CYCDI
ID CO1A1_CYCDI Reviewed; 847 AA.
AC C0HJP1;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-1(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-1 type I collagen {ECO:0000250|UniProtKB:P02452};
DE Flags: Fragments;
GN Name=COL1A1 {ECO:0000250|UniProtKB:P02452};
OS Cyclopes didactylus (Silky anteater) (Myrmecophaga didactyla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Vermilingua; Cyclopedidae; Cyclopes.
OX NCBI_TaxID=84074 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJP1; -.
DR PRIDE; C0HJP1; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 11.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Phosphoprotein; Repeat; Secreted.
FT CHAIN 1..847
FT /note="Collagen alpha-1(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433492"
FT REGION 1..847
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..48
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 85
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT MOD_RES 501
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT UNSURE 11
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 87
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 120
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 248
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 271
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 301
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 407
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 450
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 462
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 479
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 483
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 558
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 561
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 646
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 655
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 664
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 694
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 795
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 834
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 837
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 841
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 75..76
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 213..214
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 240..241
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 263..264
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 278..279
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 290..291
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 299..300
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 325..326
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 394..395
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 405..406
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 449..450
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 477..478
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 510..511
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 623..624
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 659..660
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 751..752
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 790..791
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 847 AA; 75274 MW; 9D921C0BF39CECCF CRC64;
GPMGPSGPRG IPGPPGSPGP QGFQGPPGEP GEPGSSGPMG PRGPPGPPGK NGDDGEAGKP
GRPGERGPSG PQGPRPGMKG HRGFSGIDGA KGDAGPAGPK GEPGSPGENG APGQMGPRGI
PGERGRPGAS GPAGARGNDG ATGAAGPPGP TGPAGPPGFP GAVGAKGEAG PQGARGSEGP
QGVRGEPGPP GPAGAAGPAG NPGADGQPGA KGAGPSGPQG PSGAPGPKGN SGEPGAPGNK
GEPGPTGIQG PPGPAGEEGK RGAGEPGPTG IPGPPGERGF PGADGVAGPK GSPGPAGPKG
ITGSPGSPGP DGKTGPPGPA GQDGRGQAGV MGFPGPKGAA GEPGKAGERG VPGPPGAAGP
AGKDGEAGAQ GPPGPAGPAG ERGEQGPAGS PGFQGPAGPP GEAGKDIGAP GPSGARGERG
FPGERGVQGP PGPAGPRGSN GAPGNDGAKI QGMPGERGAA GIPGPKGDRG DSGPKGAGIT
GPIGPPGPAG ATGDKGETGP SGPAGPTGAR GPPGPAGFAG PPGADGQPGA KGEPGDAGAK
GDAGPPGPAG PTGAPGPIGN IGAPGPKGAR GSAGPPGATG FPGAAGRVGP PGPSGNAGAP
GPPGPAGKEG GKGPRGETGP AGRGEKGSPG ADGPAGAPGT PGPQGISGQR GVVGIPGQRG
FPGIPGPSGE PGKQGPSGSS GERGPPGPMG PPGIAGPPGE SGREGSPGAE GSPGRDGSPG
PKGDRGETGP SGPPGAPGAP GAPGPVGPAG KGETGPAGPA GPAGPAGARG PSGPQGPRGD
KGETGEQGDR GFSGIQGPPG APGSPGEQGP SGASGPAGPR GPPGSAGSPG KDGINGIPGP
IGPPGPR