WC1_NEUCR
ID WC1_NEUCR Reviewed; 1167 AA.
AC Q01371; Q7RVA7; V5IKL6;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 11-JUL-2001, sequence version 2.
DT 03-AUG-2022, entry version 144.
DE RecName: Full=White collar 1 protein;
DE Short=WC1;
GN Name=wc-1; ORFNames=NCU02356;
OS Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 /
OS FGSC 987).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Neurospora.
OX NCBI_TaxID=367110;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987;
RX PubMed=8612589; DOI=10.1002/j.1460-2075.1996.tb00510.x;
RA Ballario P., Vittorioso P., Magrelli A., Talora C., Cabibbo A., Macino G.;
RT "White collar-1, a central regulator of blue light responses in Neurospora,
RT is a zinc finger protein.";
RL EMBO J. 15:1650-1657(1996).
RN [2]
RP SEQUENCE REVISION TO C-TERMINUS.
RA Ballario P.;
RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987;
RX PubMed=12712197; DOI=10.1038/nature01554;
RA Galagan J.E., Calvo S.E., Borkovich K.A., Selker E.U., Read N.D.,
RA Jaffe D.B., FitzHugh W., Ma L.-J., Smirnov S., Purcell S., Rehman B.,
RA Elkins T., Engels R., Wang S., Nielsen C.B., Butler J., Endrizzi M.,
RA Qui D., Ianakiev P., Bell-Pedersen D., Nelson M.A., Werner-Washburne M.,
RA Selitrennikoff C.P., Kinsey J.A., Braun E.L., Zelter A., Schulte U.,
RA Kothe G.O., Jedd G., Mewes H.-W., Staben C., Marcotte E., Greenberg D.,
RA Roy A., Foley K., Naylor J., Stange-Thomann N., Barrett R., Gnerre S.,
RA Kamal M., Kamvysselis M., Mauceli E.W., Bielke C., Rudd S., Frishman D.,
RA Krystofova S., Rasmussen C., Metzenberg R.L., Perkins D.D., Kroken S.,
RA Cogoni C., Macino G., Catcheside D.E.A., Li W., Pratt R.J., Osmani S.A.,
RA DeSouza C.P.C., Glass N.L., Orbach M.J., Berglund J.A., Voelker R.,
RA Yarden O., Plamann M., Seiler S., Dunlap J.C., Radford A., Aramayo R.,
RA Natvig D.O., Alex L.A., Mannhaupt G., Ebbole D.J., Freitag M., Paulsen I.,
RA Sachs M.S., Lander E.S., Nusbaum C., Birren B.W.;
RT "The genome sequence of the filamentous fungus Neurospora crassa.";
RL Nature 422:859-868(2003).
CC -!- FUNCTION: May function as a transcription factor involved in light
CC regulation. Binds and affects blue light regulation of the al-3 gene.
CC Wc-1 and wc-2 proteins interact via homologous PAS domains, bind to
CC promoters of light regulated genes such as frq, and activate
CC transcription.
CC -!- SUBUNIT: Heterodimer of wc-1 and wc-2. {ECO:0000305}.
CC -!- INTERACTION:
CC Q01371; P78714: wc-2; NbExp=5; IntAct=EBI-2922603, EBI-2924174;
CC Q01371; Q9C3Y6: vvd; Xeno; NbExp=5; IntAct=EBI-2922603, EBI-2922644;
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- INDUCTION: By blue light.
CC -!- DOMAIN: The glutamine-rich domain might function in activating gene
CC expression.
CC -!- PTM: FMN binds covalently to cysteine after exposure to blue light and
CC is reversed in the dark. {ECO:0000250}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X94300; CAA63964.2; -; Genomic_DNA.
DR EMBL; CM002242; ESA41977.1; -; Genomic_DNA.
DR EMBL; CM002242; ESA41978.1; -; Genomic_DNA.
DR EMBL; CM002242; ESA41979.1; -; Genomic_DNA.
DR EMBL; CM002242; ESA41980.1; -; Genomic_DNA.
DR RefSeq; XP_011395150.1; XM_011396848.1.
DR RefSeq; XP_011395151.1; XM_011396849.1.
DR RefSeq; XP_011395152.1; XM_011396850.1.
DR RefSeq; XP_011395153.1; XM_011396851.1.
DR AlphaFoldDB; Q01371; -.
DR SMR; Q01371; -.
DR DIP; DIP-1155N; -.
DR IntAct; Q01371; 2.
DR MINT; Q01371; -.
DR STRING; 367110.Q01371; -.
DR EnsemblFungi; ESA41977; ESA41977; NCU02356.
DR EnsemblFungi; ESA41978; ESA41978; NCU02356.
DR EnsemblFungi; ESA41979; ESA41979; NCU02356.
DR EnsemblFungi; ESA41980; ESA41980; NCU02356.
DR GeneID; 3875924; -.
DR KEGG; ncr:NCU02356; -.
DR VEuPathDB; FungiDB:NCU02356; -.
DR HOGENOM; CLU_007918_2_0_1; -.
DR InParanoid; Q01371; -.
DR Proteomes; UP000001805; Chromosome 7, Linkage Group VII.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0009881; F:photoreceptor activity; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IEA:InterPro.
DR GO; GO:0050896; P:response to stimulus; IEA:UniProtKB-KW.
DR CDD; cd00130; PAS; 3.
DR CDD; cd00202; ZnF_GATA; 1.
DR Gene3D; 3.30.50.10; -; 1.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR000679; Znf_GATA.
DR InterPro; IPR013088; Znf_NHR/GATA.
DR Pfam; PF00320; GATA; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF13426; PAS_9; 1.
DR SMART; SM00086; PAC; 2.
DR SMART; SM00091; PAS; 3.
DR SMART; SM00401; ZnF_GATA; 1.
DR SUPFAM; SSF55785; SSF55785; 3.
DR TIGRFAMs; TIGR00229; sensory_box; 2.
DR PROSITE; PS00344; GATA_ZN_FINGER_1; 1.
DR PROSITE; PS50114; GATA_ZN_FINGER_2; 1.
DR PROSITE; PS50112; PAS; 3.
PE 1: Evidence at protein level;
KW Activator; Chromophore; DNA-binding; Flavoprotein; FMN; Metal-binding;
KW Nucleus; Photoreceptor protein; Receptor; Reference proteome; Repeat;
KW Sensory transduction; Transcription; Transcription regulation; Zinc;
KW Zinc-finger.
FT CHAIN 1..1167
FT /note="White collar 1 protein"
FT /id="PRO_0000083489"
FT DOMAIN 381..452
FT /note="PAS 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 469..508
FT /note="PAC 1"
FT DOMAIN 574..644
FT /note="PAS 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT DOMAIN 650..691
FT /note="PAC 2"
FT DOMAIN 693..763
FT /note="PAS 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00140"
FT ZN_FING 934..959
FT /note="GATA-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00094"
FT REGION 1..91
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 307..355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 849..872
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 918..952
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 966..1047
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1060..1167
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 307..325
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..355
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 849..865
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 968..1047
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1107..1137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 428
FT /note="S-4a-FMN cysteine"
FT /evidence="ECO:0000250"
SQ SEQUENCE 1167 AA; 127454 MW; 6489D04DAB50EE38 CRC64;
MNNNYYGSPL SPEELQHQMH QHQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ HQHQQQQKTN
QHRNAGMMNT PPTTNQGNST IHASDVTMSG GSDSLDEIIQ QNLDEMHRRR SVPQPYGGQT
RRLSMFDYAN PNDGFSDYQL DNMSGNYGDM TGGMGMSGHS SPYAGQNIMA MSDHSGGYSH
MSPNVMGNMM TYPNLNMYHS PPIENPYSSA GLDTIRTDFS MDMNMDSGSV SAASVHPTPG
LNKQDDEMMT MEQGFGGGDD ANASHQAQQN MGGLTPAMTP AMTPAMTPGV SNFAQGMATP
VSQDAASTPA TTFQSPSLSA TTQTIRIGPP PPPSVTNAPT PAPFTSTPSG GGASQTKSIY
SKSGFDMLRA LWYVASRKDP KLKLGAVDMS CAFVVCDVTL NDCPIIYVSD NFQNLTGYSR
HEIVGRNCRF LQAPDGNVEA GTKREFVENN AVYTLKKTIA EGQEIQQSLI NYRKGGKPFL
NLLTMIPIPW DTEEIRYFIG FQIDLVECPD AIIGQEGNGP MQVNYTHSDI GQYIWTPPTQ
KQLEPADGQT LGVDDVSTLL QQCNSKGVAS DWHKQSWDKM LLENADDVVH VLSLKGLFLY
LSPACKKVLE YDASDLVGTS LSSICHPSDI VPVTRELKEA QQHTPVNIVF RIRRKNSGYT
WFESHGTLFN EQGKGRKCII LVGRKRPVFA LHRKDLELNG GIGDSEIWTK VSTSGMFLFV
SSNVRSLLDL LPENLQGTSM QDLMRKESRA EFGRTIEKAR KGKIASCKHE VQNKRGQVLQ
AYTTFYPGDG GEGQRPTFLL AQTKLLKASS RTLAPATVTV KNMSPGGVPL SPMKGIQTDS
DSNTLMGGMS KSGSSDSTGA MVSARSSAGP GQDAALDADN IFDELKTTRC TSWQYELRQM
EKVNRMLAEE LAQLLSNKKK RKRRKGGGNM VRDCANCHTR NTPEWRRGPS GNRDLCNSCG
LRWAKQTGRV SPRTSSRGGN GDSMSKKSNS PSHSSPLHRE VGNDSPSTTT ATKNSPSLRG
SSTTAPGTIT TDSGPAVASS ASGTGSTTIA TSANSAASTV NALGPPATGP SGGSPAQHLP
PHLQGTHLNA QAMQRVHQHK QHQQHQQQHQ QQHQQQHQQQ HQQLQQHQFN PPQSQPLLEG
GSGFRGSGME MTSIREEMGE HQQGLSV