CO1A2_ORYAF
ID CO1A2_ORYAF Reviewed; 916 AA.
AC C0HJN4;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 14.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Orycteropus afer (Aardvark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Afrotheria; Tubulidentata; Orycteropodidae; Orycteropus.
OX NCBI_TaxID=9818 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJN4; -.
DR PRIDE; C0HJN4; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 8.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Repeat; Secreted.
FT CHAIN 1..916
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433505"
FT REGION 1..167
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..711
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 732..916
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 108..122
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 5
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 42
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 51
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 54
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 109
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 127
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 147
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 154
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 165
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 174
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 183
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 189
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 204
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 258
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 292
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 298
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 316
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 319
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 338
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 353
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 361
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 385
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 403
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 421
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 427
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 439
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 452
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 487
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 496
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 508
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 527
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 547
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 596
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 640
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 655
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 703
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 704
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 709
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 710
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 712
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 718
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 731
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 733
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 775
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 836
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 853
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 856
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 862
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 865
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 868
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 9..10
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 46..47
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 82..83
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 259..260
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 554..555
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 581..582
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 716..717
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 914..915
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 916 AA; 82128 MW; 4F541FD5FB880C67 CRC64;
GPMGIMGPRA GEDGHPGKPG RPGERGVVGP QGARGFPGTP GIPGFKGHNG IDGIKGQPGA
PGVKGEPGAP GENGTPGQAG ARGRVGGTGP AGARGSDGSV GPVGPAGPIG SAGPPGFPGA
PGPKGEIGPV GNPGPSGPAG PRGEMGIPGV TGPIGPPGNP GANGISGAKG AAGIPGVAGA
PGIPGPRGIP GPVGAAGATG ARGIVGEPGP AGSKGESGNK GEPGSAGPQG PPGPSGEEGK
RGPNGEPGSA GPVGPPGIRA GVMGPPGSRG SSGPAGVRGP SGDSGRPGEP GIMGPRGIPG
SPGNVGPAGK EGPVGIPGID GRPGPVGPAG ARGEPGNIGF PGPKGPTGDP GKIGEKGHVG
IAGPRGAPGP DGNNGAQGPP GPQGIQGGKG EQGPAGPPGF QGIPGPSGTA GEVGKPGERG
IHGDFGIPGP AGPRGERGIP GQSGAAGPTG PIGSRGPSGP PGPDGNKGEP GVVGAPGTAG
PSGPSGIPGE RGAAGIPGGK GEKGETGIRG DTGNTGRDGA RGAPGAIGAP GPAGATGDRG
EAGPAGIAGP AGPRGEVGPA GPNGFAGPAG AAGQPGAKGE RGPKGENGPV GPTGPIGSAG
PAGPNGPPGP AGSRGDGGPP GVTGFPGAAG RTGPPGPSGI TGPPGPPGAA GKEGIRGPRG
DQGPVGRTGE TGASGPPGFT GEKGPSGEPG TAGPPGSPGP QGIIGAPGII GIPGSRGIPG
VAGAVGEPGP IGIAGPSGAR GPPGGVGSPG VNGAPGEAGR DGNPGSDGPP GRDGIPGHKG
ERGYPGNAGP VGTAGAPGPQ GSVGPAGKHG NRGEPGPAGS VGPVGPVGPR GPNGPIGPRG
DKGEPGDKGP RGIPGIKGHN GIQGIPGIAG QHGDQGSPGT VGPAGPRGPA GPSGPAGKDG
RSGHPGAVGP AGVRVS