CAS4_EPHMU
ID CAS4_EPHMU Reviewed; 366 AA.
AC P18503;
DT 01-NOV-1990, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1990, sequence version 1.
DT 25-MAY-2022, entry version 68.
DE RecName: Full=Short-chain collagen C4;
DE Flags: Fragment;
OS Ephydatia muelleri (Mueller's freshwater sponge) (Spongilla muelleri).
OC Eukaryota; Metazoa; Porifera; Demospongiae; Heteroscleromorpha;
OC Spongillida; Spongillidae; Ephydatia.
OX NCBI_TaxID=6052;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RX PubMed=2163843; DOI=10.1111/j.1432-1033.1990.tb15589.x;
RA Exposito J.-Y., Ouazana R., Garrone R.;
RT "Cloning and sequencing of a Porifera partial cDNA coding for a short-chain
RT collagen.";
RL Eur. J. Biochem. 190:401-406(1990).
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X52598; CAA36831.1; -; mRNA.
DR PIR; S11449; S11449.
DR AlphaFoldDB; P18503; -.
DR PRIDE; P18503; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 2.
PE 2: Evidence at transcript level;
KW Collagen; Extracellular matrix; Hydroxylation; Repeat; Secreted.
FT CHAIN <1..366
FT /note="Short-chain collagen C4"
FT /id="PRO_0000059393"
FT REGION 1..207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1..23
FT /note="Triple-helical region"
FT REGION 40..210
FT /note="Triple-helical region"
FT COMPBIAS 28..64
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 172..186
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
SQ SEQUENCE 366 AA; 35783 MW; E29A96D3C49576A7 CRC64;
DTGPQGPQGV AGPPGIDGAK GDKGECFYPP PPTCPTCPAG PPGAPGPQGA PGAPGAPGLP
GPAGPQGPKG DKGLPGNDGQ PGAPGAPGYD GAKGDKGDTG APGPQGPKGD QGPKGDQGYK
GDAGLPGQPG QTGAPGKDGQ DGAKGDKGDQ GPAGTPGAPG KDGAQGPAGP AGPAGPAGPV
GPTGPQGPQG PKGDVGPQGP QGAPGSNGAV VYIRWGNNVC PAGETNVYSG HIVESSNAND
ANGDYLCLPD THNAYPPQTQ NPLLNLKDVT DSYGKTVPCV ACLASGRSTV FTFPDNTVCP
YGWTTEYVGY EAANPKWPGQ NLCVDTYFGD KLSQTPCNNL AVIAKGPLNA YSYQPQDVVS
CVVCSI