CO1A1_HIPAM
ID CO1A1_HIPAM Reviewed; 957 AA.
AC C0HJN5;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 15.
DE RecName: Full=Collagen alpha-1(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-1 type I collagen {ECO:0000250|UniProtKB:P02452};
DE Flags: Fragments;
GN Name=COL1A1 {ECO:0000250|UniProtKB:P02452};
OS Hippopotamus amphibius (Hippopotamus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Ancodonta;
OC Hippopotamidae; Hippopotamus.
OX NCBI_TaxID=9833 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN5; -.
DR PRIDE; C0HJN5; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 9.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Phosphoprotein; Repeat; Secreted.
FT CHAIN 1..957
FT /note="Collagen alpha-1(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433494"
FT REGION 1..957
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..22
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..364
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 670..684
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 936..957
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 87
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT MOD_RES 579
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P02454"
FT UNSURE 11
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 71
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 77
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 89
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 113
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 197
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 248
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 272
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 326
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 332
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 437
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 459
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 518
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 530
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 557
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 561
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 635
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 730
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 739
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 751
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 781
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 854
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 883
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 892
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 931
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 934
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 938
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 23..24
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 102..103
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 189..190
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 628..629
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 685..686
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 944..945
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 957 AA; 85142 MW; 553D91E113D46144 CRC64;
GPMGPSGPRG IPGPPGAPGP QGFPGEPGAS GPMGPRGPPG PPGKNGDDGE AGKPGRPGER
GPSGPQGARG IPGTAGIPGM KGHRGFSGID GAKGDAGPAG PKGAPGQMGP RGIPGERGRP
GAPGPAGARG NDGATGAAGP PGPTGPAGPP GFPGAVGAKG EAGPQGARGS EGPQGVRGEP
GPPGPAGAAG ANGAPGIAGA PGFPGARGPS GPQGPSGPSG PKGNSGEPGA PGSKGDTGAK
GEPGPTGIQG PPGPAGEEGK RGARGEPGPA GIPGPPGERG GPGSRGFPGA DGVAGPKGPA
GERGSPGPAG PKGSPGEAGR PGEAGIPGAK GITGSPGSPG PDGKTGPPGP AGQDGRPGPP
GPPGARGQAG VMGFPGPKGA AGEPGKAGER GVPGPPGAVG PAGKDGEAGA QGPPGPAGPA
GERGEQGPAG SPGFQGIPGP AGPPGEAGKP GEQGVPGDIG APGPSGARGE RGFPGERGVQ
GPPGPAGPRG ANGAPGNDGA KGDAGAPGAP GSQGAPGIQG MPGERGAAGI PGPKGDRGDA
GPKGADGSPG KDGVRGITGP IGPPGPAGAP GDKGEAGPSG PAGPTGARGA PGDRGEPGPP
GPAGFAGPPG ADGQPGAKGE PGDAGAKGTG PPGPIGNVGA PGPKGARGSA GPPGATGFPG
AAGRVGPPGP SGNAGPPGPP GPVGKRGETG PAGRPGEVGP PGPPGPAGEK GAPGADGPAG
APGTPGPQGI AGQRGVVGIP GQRGERGFPG IPGPSGEPGK QGPSGTSGER GPPGPMGPPG
IAGPPGESGR EGAPGAEGSP GRDGSPGAKG DRGETGPAGP PGAPGAPGAP GPVGPAGKSG
DRGETGPAGP AGPIGPVGAR GAAGPQGPRG DKGETGEQGD RGIKGHRGFS GIQGPPGPPG
SPGEQGPSGA SGPAGPRGPP GSAGSPGKDG INGIPGPIGP PGPRPGPPGP PGPPGPP