COLA1_XENLA
ID COLA1_XENLA Reviewed; 957 AA.
AC Q641F3;
DT 05-FEB-2008, integrated into UniProtKB/Swiss-Prot.
DT 25-OCT-2004, sequence version 1.
DT 03-AUG-2022, entry version 90.
DE RecName: Full=Collagen alpha-1(XXI) chain;
DE Flags: Precursor;
GN Name=col21a1;
OS Xenopus laevis (African clawed frog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Xenopus.
OX NCBI_TaxID=8355;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Kidney;
RG NIH - Xenopus Gene Collection (XGC) project;
RL Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix. Cytoplasm {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the fibril-associated collagens with interrupted
CC helices (FACIT) family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC082384; AAH82384.1; -; mRNA.
DR RefSeq; NP_001087858.1; NM_001094389.1.
DR AlphaFoldDB; Q641F3; -.
DR SMR; Q641F3; -.
DR DNASU; 447719; -.
DR GeneID; 447719; -.
DR KEGG; xla:447719; -.
DR CTD; 447719; -.
DR Xenbase; XB-GENE-952330; col21a1.L.
DR OMA; CMNGPSD; -.
DR OrthoDB; 1295141at2759; -.
DR Proteomes; UP000186698; Chromosome 5L.
DR Bgee; 447719; Expressed in pancreas and 5 other tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 3.40.50.410; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF00092; VWA; 1.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; SSF49899; 1.
DR SUPFAM; SSF53300; SSF53300; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 2: Evidence at transcript level;
KW Collagen; Cytoplasm; Extracellular matrix; Reference proteome; Repeat;
KW Secreted; Signal.
FT SIGNAL 1..16
FT /evidence="ECO:0000255"
FT CHAIN 17..957
FT /note="Collagen alpha-1(XXI) chain"
FT /id="PRO_0000317614"
FT DOMAIN 37..211
FT /note="VWFA"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00219"
FT DOMAIN 230..412
FT /note="Laminin G-like"
FT DOMAIN 448..501
FT /note="Collagen-like 1"
FT DOMAIN 502..543
FT /note="Collagen-like 2"
FT DOMAIN 544..591
FT /note="Collagen-like 3"
FT DOMAIN 592..642
FT /note="Collagen-like 4"
FT DOMAIN 643..684
FT /note="Collagen-like 5"
FT DOMAIN 685..741
FT /note="Collagen-like 6"
FT DOMAIN 742..786
FT /note="Collagen-like 7"
FT DOMAIN 825..882
FT /note="Collagen-like 8"
FT REGION 441..788
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 820..935
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 536..554
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 729..748
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 957 AA; 99759 MW; 765F4A9AA5BBDC0D CRC64;
MPGIIYILCS ILLIESQYYV ASENAEIRSS CRTAPNDLVF ILDGSWSVGP ENFEILKKWV
VNITSNFNIG PKFTQVGVVQ YSDYPILEIP LGSYESIDDL SRRTQSIQYL GGNTQTGNAI
QFAIDNLFAR SLRPLTKIAI VLTDGKSQDD VKHIAEEARK NKITLFAIGV GSEIEESELR
AIANKPSSTY VFYVEDYIAI SRIREIMKQK LCEESVCPTR IPVAARDEKG FDILLGLGIN
KKKAKRVEGS RPRNKAYEIT SQIDLTEFTG NVFPEGLPPS YVFISTLRFK IKKKWDLWRI
LALDGTIQTA VSLNGEEKTL SFTTTNEENG TQAITFTTPG VKKLFDEEWH QIRLLVTEED
ITLYVDDQEI ETRKLLPVIG IYISGQTQIG KYPAREESVQ FTLQKLRIYC DPEQNKRETA
CEIPGTNGEC LNGPSDVGGT PAPCICPPGE KGDPGPKGDS GQPGNPGSPG QPGPDGKHGF
QGTSGSPGIP GSPGVQGPRG FAGLKGNTGQ DGEKGDRGMP GFPGLHGQPG IKGEMGPKGD
KGDIGIDGKK GTKGDKGGNG ATGKPGRHGE PGSYGKDGIP GYPGQKGEEG KPGPPGMEGL
RGLPGIPGIP GNDGANGLKG ETGLSGEPGA RGPTGTPGIS GPEGISGPQG PVGPKGNKGE
TGPPGKASPA GMKGEKGEMG IPGQQGYTGI PGLMGPKGDK GNLGERGMQG HKGEHGSSGM
PGLKGEHGVT GSKGEKGEIG EHGHRGITGP RGEPGNMGLI GAPGPRGMSG ERGTQGLPGP
KGQQGKEQSE AFIRQICLDV LKAQLPSLIQ NDIRQNCNQC KTQEGSPGLP GPPGPVGPEG
TRGHPGLPGR NGFSGLVGQP GLPGIPGTKG LPGKPGAKGN KGDIEPGPQG SPGVPGPAGL
PGVGKDGRTG PLGPPGREGD RGPPGTQGPP GDAGICDPSL CYGAVMRRDP FRKGPNY
//