COLL1_DROME
ID COLL1_DROME Reviewed; 822 AA.
AC B7Z0K8; B7Z0K6; B7Z0K7; B7Z0K9; Q86NZ7;
DT 03-NOV-2009, integrated into UniProtKB/Swiss-Prot.
DT 03-MAR-2009, sequence version 1.
DT 03-AUG-2022, entry version 87.
DE RecName: Full=Collagen alpha chain CG42342;
GN ORFNames=CG42342;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1] {ECO:0000312|EMBL:ACL83519.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000305, ECO:0000312|EMBL:ACL83519.1}
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [3] {ECO:0000305, ECO:0000312|EMBL:AAO39561.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 467-822 (ISOFORM F).
RC STRAIN=Berkeley {ECO:0000312|EMBL:AAO39561.1}; TISSUE=Larva, and Pupae;
RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.W., Champe M.,
RA Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., George R.A.,
RA Gonzalez M., Guarin H., Kronmiller B., Li P.W., Liao G., Miranda A.,
RA Mungall C.J., Nunoo J., Pacleb J.M., Paragas V., Park S., Patel S.,
RA Phouanenavong S., Wan K.H., Yu C., Lewis S.E., Rubin G.M., Celniker S.E.;
RL Submitted (FEB-2003) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000255}; Single-pass type II
CC membrane protein {ECO:0000255}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=D {ECO:0000269|PubMed:10731132};
CC IsoId=B7Z0K8-1; Sequence=Displayed;
CC Name=C {ECO:0000269|PubMed:10731132};
CC IsoId=B7Z0K8-2; Sequence=VSP_053159, VSP_053160, VSP_053163;
CC Name=E {ECO:0000269|PubMed:10731132};
CC IsoId=B7Z0K8-3; Sequence=VSP_053161, VSP_053162;
CC Name=F {ECO:0000269|PubMed:10731132};
CC IsoId=B7Z0K8-4; Sequence=VSP_053163;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; ACL83519.1; -; Genomic_DNA.
DR EMBL; AE014297; ACL83520.1; -; Genomic_DNA.
DR EMBL; AE014297; ACL83521.1; -; Genomic_DNA.
DR EMBL; AE014297; ACL83522.1; -; Genomic_DNA.
DR EMBL; BT003557; AAO39561.1; -; mRNA.
DR RefSeq; NP_001138061.1; NM_001144589.3. [B7Z0K8-1]
DR RefSeq; NP_001138062.2; NM_001144590.3.
DR RefSeq; NP_001138063.2; NM_001144591.3.
DR RefSeq; NP_001138064.2; NM_001144592.3.
DR AlphaFoldDB; B7Z0K8; -.
DR SMR; B7Z0K8; -.
DR BioGRID; 928290; 1.
DR STRING; 7227.FBpp0292667; -.
DR PaxDb; B7Z0K8; -.
DR PRIDE; B7Z0K8; -.
DR EnsemblMetazoa; FBtr0299894; FBpp0289172; FBgn0259244. [B7Z0K8-1]
DR GeneID; 7354466; -.
DR KEGG; dme:Dmel_CG42342; -.
DR UCSC; CG42342-RC; d. melanogaster.
DR FlyBase; FBgn0259244; CG42342.
DR VEuPathDB; VectorBase:FBgn0259244; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; B7Z0K8; -.
DR BioGRID-ORCS; 7354466; 0 hits in 3 CRISPR screens.
DR ChiTaRS; CG42342; fly.
DR GenomeRNAi; 7354466; -.
DR PRO; PR:B7Z0K8; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0259244; Expressed in embryonic epidermis (Drosophila) and 51 other tissues.
DR ExpressionAtlas; B7Z0K8; baseline and differential.
DR Genevisible; B7Z0K8; DM.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IBA:GO_Central.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 7.
PE 2: Evidence at transcript level;
KW Alternative splicing; Cell membrane; Coiled coil; Collagen; Membrane;
KW Reference proteome; Repeat; Signal-anchor; Transmembrane;
KW Transmembrane helix.
FT CHAIN 1..822
FT /note="Collagen alpha chain CG42342"
FT /id="PRO_0000388711"
FT TOPO_DOM 1..104
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT TRANSMEM 105..125
FT /note="Helical; Signal-anchor for type II membrane protein"
FT /evidence="ECO:0000255"
FT TOPO_DOM 126..822
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT DOMAIN 241..299
FT /note="Collagen-like 1"
FT /evidence="ECO:0000255"
FT DOMAIN 350..409
FT /note="Collagen-like 2"
FT /evidence="ECO:0000255"
FT DOMAIN 430..469
FT /note="Collagen-like 3"
FT /evidence="ECO:0000255"
FT DOMAIN 493..526
FT /note="Collagen-like 4"
FT /evidence="ECO:0000255"
FT DOMAIN 527..586
FT /note="Collagen-like 5"
FT /evidence="ECO:0000255"
FT DOMAIN 621..680
FT /note="Collagen-like 6"
FT /evidence="ECO:0000255"
FT DOMAIN 681..740
FT /note="Collagen-like 7"
FT /evidence="ECO:0000255"
FT REGION 1..45
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 66..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 169..188
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 205..297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 345..822
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 131..162
FT /evidence="ECO:0000255"
FT COILED 194..222
FT /evidence="ECO:0000255"
FT COILED 790..822
FT /evidence="ECO:0000255"
FT COMPBIAS 12..43
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 430..459
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..488
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 584..605
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 615..629
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 637..653
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 724..740
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..812
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 472..480
FT /note="IFGPGGTKI -> GPPGLDGMK (in isoform C)"
FT /evidence="ECO:0000303|PubMed:10731132"
FT /id="VSP_053159"
FT VAR_SEQ 481..683
FT /note="Missing (in isoform C)"
FT /evidence="ECO:0000303|PubMed:10731132"
FT /id="VSP_053160"
FT VAR_SEQ 684..688
FT /note="GAQGE -> VWGDY (in isoform E)"
FT /evidence="ECO:0000303|PubMed:10731132"
FT /id="VSP_053161"
FT VAR_SEQ 689..822
FT /note="Missing (in isoform E)"
FT /evidence="ECO:0000303|PubMed:10731132"
FT /id="VSP_053162"
FT VAR_SEQ 769..822
FT /note="PIISTPVHKDYLPDVTQPESNTSDYEQEEEEDDEQAEDNENEYDEYQDNLHN
FT NE -> NWQAWLKDELNALQHPRSKN (in isoform C and isoform F)"
FT /evidence="ECO:0000303|PubMed:10731132, ECO:0000303|Ref.3"
FT /id="VSP_053163"
SQ SEQUENCE 822 AA; 84074 MW; 61B2302533CD4E63 CRC64;
MRKHKAPPSG SPRTMAQDNS QSEPSGGNGE SPAATTAAAA SVEAPQQSLL LGHNAADASA
AAVASRLAPP PCQHPINNSN NNSNISNNSS NSSSSKERPR PTVRFISLLH VASYVLCLCA
FSFALYGNVR QTRLEQRMQR LQQLDARIVE LELRLEQQQL LHWPAEQTQV LASHPSDRDS
SNSNNGSQHL ELHVRRELHR LRRDVSHLQL TRRQQRRQAA EAAAAAASGE GGSGGGQCQC
QPGPPGPPGP PGKRGKRGKK GDSGEKGDPG LNGISGEKGA AGKPGDKGQK GDVGHPGMDV
FQTVKGLKRS VTTLHGGTLG YAEIVAVKDL QEAGVNVSAS TVIKLKGEPG EPGPPGPPGE
AGQPGAPGER GPPGEIGAQG PQGEAGQPGV AGPPGVAGAP GTKGDKGDRG DRGLTTTIKG
DEFPTGIIEG PPGPAGPPGP PGEPGARGEP GPIGPAGPPG EKGPRGKRGK RIFGPGGTKI
DEDYDDPPVT LLRGPPGPPG IAGKDGRDGR DGSKGEPGEP GEPGSLGPRG LDGLPGEPGI
EGPPGLPGYQ GPPGEKGDRG DIGPPGLMGP PGLPGPPGYP GVKGDKGDRG DSYRKMRRRQ
DDGMSDAPHM PTIEYLYGPP GPPGPMGPPG HTGSQGERGL DGRKGDPGEK GHKGDQGPMG
LPGPMGMRGE SGPSGPSGKA GIPGAQGETG HKGERGDPGL PGTDGIPGQE GPRGEQGSRG
DAGPPGKRGR KGDRGDKGEQ GVPGLDAPCP LGADGLPLPG CGWRPPKEPI ISTPVHKDYL
PDVTQPESNT SDYEQEEEED DEQAEDNENE YDEYQDNLHN NE