CO1A2_HIPAM
ID CO1A2_HIPAM Reviewed; 887 AA.
AC C0HJN6;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 13.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Hippopotamus amphibius (Hippopotamus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Ancodonta;
OC Hippopotamidae; Hippopotamus.
OX NCBI_TaxID=9833 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN6; -.
DR PRIDE; C0HJN6; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 14.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Repeat; Secreted.
FT CHAIN 1..887
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433502"
FT REGION 1..887
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 149..163
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 610..624
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 871..887
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 5
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 83
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 89
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 95
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 98
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 119
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 150
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 168
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 188
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 205
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 214
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 220
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 235
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 251
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 267
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 278
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 287
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 299
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 321
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 332
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 335
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 350
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 370
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 407
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 425
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 431
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 456
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 491
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 500
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 512
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 613
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 628
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 663
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 664
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 670
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 672
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 681
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 694
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 696
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 790
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 805
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 808
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 814
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 817
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 820
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 865
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 114..115
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 200..201
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 245..246
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 312..313
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 348..349
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 361..362
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 389..390
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 513..514
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 586..587
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 656..657
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 735..736
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 745..746
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 866..867
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 887 AA; 79760 MW; 647120E002F4FDC7 CRC64;
GPMGIMGPRG PPGASGAPGP QGFQGPPGEP GEPGQTGPAG ARGPPGPPGK AGEDGHPGKP
GRSGERGVVG PQGARGFPGT PGIPGFKGIR GHNGIDGIKG QPGAPGVKGE PGAPGARGIP
GERGRVGAPG PAGARGSDGS VGPVGPAGPI GSAGPPGFPG APGPKGEIGP VGSPGASGPA
GPRGEVGIPG VSGPVGPPGN GAAGIPGVAG APGIPGPRGI PGPVGAAGAT GARGIVGEPG
PAGSKAGPQG IPGPSGEEGK RGSTGEIGPA GPPGPPGIRG SPGSRGIPGA DGRAGVMGIP
GSRGATGPAG VRGFPGSPGN IGPAGKEGPV GIPGIDGRPG PTGPAGARNI GFPGPKGPTG
DNGDKGHAGI AGARGAPGPD GNNGAQGPPQ GGKGEQGPAG PPGFQGIPGP AGTAGEAGKP
GERGIPGEFG IPGPAGPRGE RGPPGESGAA GPTGPIGNRG PSGPAGPDGN KGEPGVVGAP
GTAGPSGPSG IPGERGAAGI PGPKGEKGEP GIRRDGARGA PGAVGAPGPA GANGDRGEAG
PAGPAGPAGP RGSPGERGEV GPAGPNGFAG PAGAAGQPGA KGERGTRGDG GPPGATGFPG
AAGRTGPPGP SGISGPPGPP GPAGKEGIRG PRGDQGPVGR SGETGASGPP GFAGEKTPGP
QGIIGAPGFI GIPGSRGERG IPGVAGSVGE PGPIGIAGPP GARGPPGAVG NPGVNGAPGE
AGRDGNPGSD GPPGRGHKGE RGYPGAGAPG PQGPVGPTGK HGNRGEPGPA GVVGPTGAVG
PRGPSGPQGI RGDKGEPGDK GPRGIPGIKG HNGIQGIPGI AGHHGDQGAP GSVGPAGPRG
PAGPSGPVGK DGRTGHPGAV GPAGIRGSQG PAGPPGPPGP PGPPGPS