ID COLL_DROME Reviewed; 575 AA.
AC P56721; Q8MS49; Q9V758;
DT 30-MAY-2000, integrated into UniProtKB/Swiss-Prot.
DT 11-OCT-2004, sequence version 2.
DT 03-AUG-2022, entry version 167.
DE RecName: Full=Transcription factor collier;
DE AltName: Full=Transcription factor knot;
GN Name=kn; Synonyms=col; ORFNames=CG10197;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE (ISOFORMS COL1 AND COL2), FUNCTION, TISSUE SPECIFICITY,
RP AND DEVELOPMENTAL STAGE.
RC TISSUE=Embryo;
RX PubMed=8793297; DOI=10.1016/s0960-9822(09)00452-7;
RA Crozatier M., Valle D., Dubois L., Ibnsouda S., Vincent A.;
RT "Collier, a novel regulator of Drosophila head development, is expressed in
RT a single mitotic domain.";
RL Curr. Biol. 6:707-718(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM COL2).
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP FUNCTION.
RX PubMed=10375526; DOI=10.1016/s0960-9822(99)80285-1;
RA Vervoort M., Crozatier M., Valle D., Vincent A.;
RT "The COE transcription factor Collier is a mediator of short-range
RT Hedgehog-induced patterning of the Drosophila wing.";
RL Curr. Biol. 9:632-639(1999).
RN [6]
RP FUNCTION.
RX PubMed=10068642; DOI=10.1242/dev.126.7.1495;
RA Crozatier M., Vincent A.;
RT "Requirement for the Drosophila COE transcription factor Collier in
RT formation of an embryonic muscle: transcriptional response to notch
RT signalling.";
RL Development 126:1495-1504(1999).
RN [7]
RP FUNCTION.
RX PubMed=10477305; DOI=10.1242/dev.126.19.4385;
RA Crozatier M., Valle D., Dubois L., Ibnsouda S., Vincent A.;
RT "Head versus trunk patterning in the Drosophila embryo; collier requirement
RT for formation of the intercalary segment.";
RL Development 126:4385-4394(1999).
CC -!- FUNCTION: May act as a 'second-level regulator' of head patterning.
CC Required for establishment of the PS(-1)/PS0 parasegmental border and
CC formation of the intercalary segment. Required for expression of the
CC segment polarity genes hedgehog, engrailed and wingless, and the
CC segment-identity gene cap'n'collar in the intercalary segment.
CC Required at the onset of gastrulation for the correct formation of
CC the mandibular segment. {ECO:0000269|PubMed:10068642,
CC ECO:0000269|PubMed:10375526, ECO:0000269|PubMed:10477305,
CC ECO:0000269|PubMed:8793297}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=COL1;
CC IsoId=P56721-1; Sequence=Displayed;
CC Name=COL2;
CC IsoId=P56721-2; Sequence=VSP_001111;
CC -!- TISSUE SPECIFICITY: Its expression at the blastoderm stage is
CC restricted to a single stripe of cells corresponding to part of the
CC intercalary and mandibular segment primordia, possibly parasegment 0.
CC {ECO:0000269|PubMed:8793297}.
CC -!- DEVELOPMENTAL STAGE: Isoform COL1 is expressed from 3 hours of
CC embryogenesis, with a peak of accumulation between 8 and 16 hours
CC post-fertilization. Expression persists at a very low level in first
CC instar larvae and accumulates again in third instar larvae and pupae.
CC Isoform COL2 is expressed after 8 hours of embryogenesis, peaks in first
CC instar larvae and is present at low levels in third instar larvae and
CC pupae.
CC {ECO:0000269|PubMed:8793297}.
CC -!- SIMILARITY: Belongs to the COE family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X97803; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; AE013599; AAF58204.2; -; Genomic_DNA.
DR EMBL; AY119102; AAM50962.1; -; mRNA.
DR RefSeq; NP_524813.2; NM_080074.4. [P56721-2]
DR RefSeq; NP_725419.2; NM_166070.3. [P56721-1]
DR AlphaFoldDB; P56721; -.
DR SMR; P56721; -.
DR BioGRID; 69588; 77.
DR IntAct; P56721; 22.
DR STRING; 7227.FBpp0111722; -.
DR PaxDb; P56721; -.
DR EnsemblMetazoa; FBtr0087465; FBpp0086595; FBgn0001319. [P56721-2]
DR EnsemblMetazoa; FBtr0112809; FBpp0111721; FBgn0001319. [P56721-1]
DR GeneID; 45318; -.
DR KEGG; dme:Dmel_CG10197; -.
DR UCSC; CG10197-RA; d. melanogaster. [P56721-1]
DR CTD; 45318; -.
DR FlyBase; FBgn0001319; kn.
DR VEuPathDB; VectorBase:FBgn0001319; -.
DR eggNOG; KOG3836; Eukaryota.
DR GeneTree; ENSGT00950000182859; -.
DR HOGENOM; CLU_016320_3_1_1; -.
DR InParanoid; P56721; -.
DR PhylomeDB; P56721; -.
DR SignaLink; P56721; -.
DR BioGRID-ORCS; 45318; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 45318; -.
DR PRO; PR:P56721; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0001319; Expressed in mandibular segment (Drosophila) and 60 other tissues.
DR ExpressionAtlas; P56721; baseline and differential.
DR Genevisible; P56721; DM.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0035288; P:anterior head segmentation; TAS:FlyBase.
DR GO; GO:0007350; P:blastoderm segmentation; TAS:FlyBase.
DR GO; GO:0048813; P:dendrite morphogenesis; IMP:FlyBase.
DR GO; GO:0016204; P:determination of muscle attachment site; IMP:FlyBase.
DR GO; GO:0001700; P:embryonic development via the syncytial blastoderm; IMP:FlyBase.
DR GO; GO:0035287; P:head segmentation; IMP:FlyBase.
DR GO; GO:0007476; P:imaginal disc-derived wing morphogenesis; TAS:FlyBase.
DR GO; GO:0007474; P:imaginal disc-derived wing vein specification; IMP:FlyBase.
DR GO; GO:0045087; P:innate immune response; IMP:FlyBase.
DR GO; GO:0007526; P:larval somatic muscle development; IMP:FlyBase.
DR GO; GO:0042694; P:muscle cell fate specification; IMP:FlyBase.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IMP:FlyBase.
DR GO; GO:0035289; P:posterior head segmentation; TAS:FlyBase.
DR GO; GO:0048814; P:regulation of dendrite morphogenesis; IMP:FlyBase.
DR GO; GO:0010468; P:regulation of gene expression; IDA:FlyBase.
DR GO; GO:0035203; P:regulation of lamellocyte differentiation; IMP:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IMP:FlyBase.
DR GO; GO:0009608; P:response to symbiont; IMP:FlyBase.
DR GO; GO:0007367; P:segment polarity determination; IDA:FlyBase.
DR GO; GO:0035291; P:specification of segmental identity, intercalary segment; IMP:FlyBase.
DR GO; GO:0007419; P:ventral cord development; HMP:FlyBase.
DR CDD; cd11606; COE_DBD; 1.
DR CDD; cd01175; IPT_COE; 1.
DR Gene3D; 2.60.40.10; -; 1.
DR Gene3D; 2.60.40.3180; -; 1.
DR InterPro; IPR032200; COE_DBD.
DR InterPro; IPR038173; COE_DBD_sf.
DR InterPro; IPR032201; COE_HLH.
DR InterPro; IPR038006; COE_IPT.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR002909; IPT_dom.
DR InterPro; IPR003523; Transcription_factor_COE.
DR InterPro; IPR018350; Transcription_factor_COE_CS.
DR PANTHER; PTHR10747; PTHR10747; 1.
DR Pfam; PF16422; COE1_DBD; 1.
DR Pfam; PF16423; COE1_HLH; 1.
DR Pfam; PF01833; TIG; 1.
DR SMART; SM00429; IPT; 1.
DR SUPFAM; SSF81296; SSF81296; 1.
DR PROSITE; PS01345; COE; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Metal-binding;
KW Nucleus; Reference proteome; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..575
FT /note="Transcription factor collier"
FT /id="PRO_0000107823"
FT DOMAIN 299..382
FT /note="IPT/TIG"
FT ZN_FING 167..186
FT /note="C5-type"
FT /evidence="ECO:0000255"
FT REGION 79..82
FT /note="Interaction with DNA"
FT /evidence="ECO:0000250"
FT REGION 213..220
FT /note="Interaction with DNA"
FT /evidence="ECO:0000250"
FT REGION 252..255
FT /note="Interaction with DNA"
FT /evidence="ECO:0000250"
FT REGION 255..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 456..492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 546..575
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT SITE 179
FT /note="Interaction with DNA"
FT /evidence="ECO:0000250"
FT SITE 188
FT /note="Interaction with DNA"
FT /evidence="ECO:0000250"
FT VAR_SEQ 529..575
FT /note="MSAVSSTWHQAFVQHHHAATAHPHHHYPHPHQPWHNPAVSAATAAAV -> R
FT VSSLSFNPFALPTCNTQGYSTQLVTSTK (in isoform COL2)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_001111"
FT CONFLICT 354..355
FT /note="RH -> SD (in Ref. 1; X97803)"
FT /evidence="ECO:0000305"
FT CONFLICT 357
FT /note="P -> R (in Ref. 4; AAM50962)"
FT /evidence="ECO:0000305"
FT CONFLICT 384
FT /note="Missing (in Ref. 4; AAM50962)"
FT /evidence="ECO:0000305"
FT CONFLICT 435
FT /note="G -> D (in Ref. 1; X97803)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 575 AA; 62494 MW; D15144D95BDCDFBC CRC64;
MEWGRKLYPS AVSGPRSAGG LMFGLPPTAA VDMNQPRGPM TSLKEEPLGS RWAMQPVVDQ
SNLGIGRAHF EKQPPSNLRK SNFFHFVIAL YDRAGQPIEI ERTAFIGFIE KDSESDATKT
NNGIQYRLQL LYANGARQEQ DIFVRLIDSV TKQAIIYEGQ DKNPEMCRVL LTHEVMCSRC
CDKKSCGNRN ETPSDPVIID RFFLKFFLKC NQNCLKNAGN PRDMRRFQVV ISTQVAVDGP
LLAISDNMFV HNNSKHGRRA KRLDTTEGTG NTSLSISGHP LAPDSTYDGL YPPLPVATPC
IKAISPSEGW TTGGATVIIV GDNFFDGLQV VFGTMLVWSE LITSHAIRVQ TPPRHIPGVV
EVTLSYKSKQ FCKGSPGRFV YVSALNEPTI DYGFQRLQKL IPRHPGDPEK LQKEIILKRA
ADLVEALYSM PRSPGGSTGF NSYAGQLAVS VQDGSGQWTE DDYQRAQSSS VSPRGGYCSS
ASTPHSSGGS YGATAASAAV AATANGYAPA PNMGTLSSSP GSVFNSTSMS AVSSTWHQAF
VQHHHAATAH PHHHYPHPHQ PWHNPAVSAA TAAAV