CO1A2_TAPTE
ID CO1A2_TAPTE Reviewed; 910 AA.
AC C0HJN8;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 13.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Tapirus terrestris (Lowland tapir) (Brazilian tapir).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Tapiridae; Tapirus.
OX NCBI_TaxID=9801 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN8; -.
DR PRIDE; C0HJN8; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 8.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Repeat; Secreted.
FT CHAIN 1..910
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433506"
FT REGION 1..184
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 200..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 138..152
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..910
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 3
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 72
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 78
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 84
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 87
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 108
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 139
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 157
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 177
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 180
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 192
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 201
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 207
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 222
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 268
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 277
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 308
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 328
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 331
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 350
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 364
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 390
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 408
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 414
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 439
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 460
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 474
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 483
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 495
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 499
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 628
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 677
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 678
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 683
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 684
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 686
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 695
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 702
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 708
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 710
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 810
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 825
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 828
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 834
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 837
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 840
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 23..24
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 97..98
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 186..187
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 259..260
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 302..303
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 326..327
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 356..357
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 376..377
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 610..611
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 656..657
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 669..670
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 749..750
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 910 AA; 81708 MW; 74046D54D7841B54 CRC64;
MGIMGPRGPP GASGAPGPQG FQGQTGPAGA RGPPGPPGKA GEDGHPGKPG RPGERGVVGP
QGARGFPGTP GIPGFKGIRG HNGIDGIKGQ PGAPGVKGTP GQAGARGIPG ERGRVGAPGP
AGARGSDGSV GPVGPAGPIG SAGPPGFPGA PGPKGEIGPV GNPGPAGPAG PRGEVGIPGI
SGPVGPKGAA GIPGVAGAPG IPGPRGIPGP AGAAGATGAR GIVGEPGPAG SKGESGNKGE
PGSVGAQGPP GPSGEEGKRT GPAGPPGIRG SPGSRGIPGA DGRAGVMGPA GSRGASGPAG
VRPGEPGIMG PRGFPGSPGN VGPAGKGIPG IDGRPGPVGP AGARGEPGNI GFPGPKGDKG
HAGIAGARGA PGPDGNGEQG PAGPPGFQGI PGPAGTAGEA GKPGERGIPG EFGIPGPAGP
RGERGPPGES GAAGPAGPIG SRGPSGPPGP DGNKGEPGVI GAPGTAGPSG PSGIPGERGA
AGIPGGKGEK GETGIRGEIG NPGRDGARGA PGAVGAPGPA GANGDRGEAG PAGPAGPAGP
RGSPGERGEV GPAGPNGFAG PAGAAGQPGA KGERGAKGPK GENGPVGPTG PVGSAGPSGP
NGPPGPAGGR TGFPGAAGRT GPPGPSGITG PPGPPGAAGK EGVRGPRGDQ GPVGRAGFAG
EKGPSGEPGG TPGPQGIIGA PGIIGIPGSR GERGIPGVAG SIGEPGPIGI AGPPGARGPP
GAVGAPGVNG APGETGRDGN PGNDGPPGRH KGERGYPGNA GPVGAVGAPG PHGPVGPTGK
HGNRGEPGPV GSVGPVGAVG PRGPSGPQGI RGDKGEPGDK GPRGIPGIKG HNGIQGIPGI
AGQHGDQGAP GSVGPAGPRG PAGPTGPVGK DGRSGQPGTV GPAGVRGSQG SQGPAGPPGP
PGPPGPPGPS