CLF1_CRYNH
ID CLF1_CRYNH Reviewed; 724 AA.
AC Q9HF03; J9VFF4;
DT 13-SEP-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 25-MAY-2022, entry version 76.
DE RecName: Full=Pre-mRNA-splicing factor CLF1;
DE AltName: Full=crooked-neck-like protein 1;
GN Name=CLF1; Synonyms=CCN1; ORFNames=CNAG_00694;
OS Cryptococcus neoformans var. grubii serotype A (strain H99 / ATCC 208821 /
OS CBS 10515 / FGSC 9487) (Filobasidiella neoformans var. grubii).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Tremellales; Cryptococcaceae; Cryptococcus;
OC Cryptococcus neoformans species complex.
OX NCBI_TaxID=235443;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND VARIANT LYS-217 DEL.
RC STRAIN=B-4551, and H99 / ATCC 208821 / CBS 10515 / FGSC 9487;
RX PubMed=12654817; DOI=10.1128/iai.71.4.1988-1994.2003;
RA Chung S., Mondon P., Chang Y.C., Kwon-Chung K.J.;
RT "Cryptococcus neoformans with a mutation in the tetratricopeptide repeat-
RT containing gene, CCN1, causes subcutaneous lesions but fails to cause
RT systemic infection.";
RL Infect. Immun. 71:1988-1994(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=H99 / ATCC 208821 / CBS 10515 / FGSC 9487;
RX PubMed=24743168; DOI=10.1371/journal.pgen.1004261;
RA Janbon G., Ormerod K.L., Paulet D., Byrnes E.J. III, Yadav V.,
RA Chatterjee G., Mullapudi N., Hon C.-C., Billmyre R.B., Brunel F.,
RA Bahn Y.-S., Chen W., Chen Y., Chow E.W.L., Coppee J.-Y., Floyd-Averette A.,
RA Gaillardin C., Gerik K.J., Goldberg J., Gonzalez-Hilarion S., Gujja S.,
RA Hamlin J.L., Hsueh Y.-P., Ianiri G., Jones S., Kodira C.D., Kozubowski L.,
RA Lam W., Marra M., Mesner L.D., Mieczkowski P.A., Moyrand F., Nielsen K.,
RA Proux C., Rossignol T., Schein J.E., Sun S., Wollschlaeger C., Wood I.A.,
RA Zeng Q., Neuveglise C., Newlon C.S., Perfect J.R., Lodge J.K., Idnurm A.,
RA Stajich J.E., Kronstad J.W., Sanyal K., Heitman J., Fraser J.A.,
RA Cuomo C.A., Dietrich F.S.;
RT "Analysis of the genome and transcriptome of Cryptococcus neoformans var.
RT grubii reveals complex RNA expression and microevolution leading to
RT virulence attenuation.";
RL PLoS Genet. 10:E1004261-E1004261(2014).
CC -!- FUNCTION: Involved in pre-mRNA splicing and cell cycle progression.
CC Required for the spliceosome assembly and initiation of the DNA
CC replication (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: Associated with the spliceosome. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the crooked-neck family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF265234; AAG36938.1; -; Genomic_DNA.
DR EMBL; CP003820; AFR92823.1; -; Genomic_DNA.
DR RefSeq; XP_012046896.1; XM_012191506.1.
DR AlphaFoldDB; Q9HF03; -.
DR SMR; Q9HF03; -.
DR EnsemblFungi; AFR92823; AFR92823; CNAG_00694.
DR GeneID; 23884476; -.
DR VEuPathDB; FungiDB:CNAG_00694; -.
DR HOGENOM; CLU_011554_1_0_1; -.
DR PHI-base; PHI:280; -.
DR Proteomes; UP000010091; Chromosome 1.
DR GO; GO:0000785; C:chromatin; IEA:EnsemblFungi.
DR GO; GO:0000974; C:Prp19 complex; IEA:EnsemblFungi.
DR GO; GO:0071006; C:U2-type catalytic step 1 spliceosome; IEA:EnsemblFungi.
DR GO; GO:0071007; C:U2-type catalytic step 2 spliceosome; IEA:EnsemblFungi.
DR GO; GO:0071008; C:U2-type post-mRNA release spliceosomal complex; IEA:EnsemblFungi.
DR GO; GO:0071004; C:U2-type prespliceosome; IEA:EnsemblFungi.
DR GO; GO:0003682; F:chromatin binding; IEA:EnsemblFungi.
DR GO; GO:0003688; F:DNA replication origin binding; IEA:EnsemblFungi.
DR GO; GO:0000354; P:cis assembly of pre-catalytic spliceosome; IEA:EnsemblFungi.
DR GO; GO:0006270; P:DNA replication initiation; IEA:EnsemblFungi.
DR Gene3D; 1.25.40.10; -; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR045075; Syf1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR11246; PTHR11246; 1.
DR Pfam; PF02184; HAT; 1.
DR SMART; SM00386; HAT; 14.
DR SUPFAM; SSF48452; SSF48452; 1.
PE 3: Inferred from homology;
KW mRNA processing; mRNA splicing; Nucleus; Repeat; Spliceosome.
FT CHAIN 1..724
FT /note="Pre-mRNA-splicing factor CLF1"
FT /id="PRO_0000205743"
FT REPEAT 55..87
FT /note="HAT 1"
FT REPEAT 89..121
FT /note="HAT 2"
FT REPEAT 123..155
FT /note="HAT 3"
FT REPEAT 157..188
FT /note="HAT 4"
FT REPEAT 190..221
FT /note="HAT 5"
FT REPEAT 223..262
FT /note="HAT 6"
FT REPEAT 264..298
FT /note="HAT 7"
FT REPEAT 308..340
FT /note="HAT 8"
FT REPEAT 352..386
FT /note="HAT 9"
FT REPEAT 396..432
FT /note="HAT 10"
FT REPEAT 434..465
FT /note="HAT 11"
FT REPEAT 467..499
FT /note="HAT 12"
FT REPEAT 501..534
FT /note="HAT 13"
FT REPEAT 536..567
FT /note="HAT 14"
FT REPEAT 585..626
FT /note="HAT 15"
FT REPEAT 635..667
FT /note="HAT 16"
FT REGION 681..724
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 703..724
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VARIANT 217
FT /note="Missing (in strain: B-4551; no growth at 37 degrees
FT Celsius and fails to cause systemic infection in mice)"
FT /evidence="ECO:0000269|PubMed:12654817"
SQ SEQUENCE 724 AA; 85483 MW; 853CF3B5E5373628 CRC64;
MAGRDPRDRA PRVRNRAPAA VQITAEQLLR EAQERQEPTI QAPKQRVQDL EELSEFQARK
RTEFESRIRY SRDSILAWTK YAQWEASQNE YERSRSVFER ALDVDPRSVD LWIKYTDMEL
KARNINHARN LFDRAITLLP RVDALWYKYV YLEELLLNVS GARQIFERWM QWEPNDKAWQ
SYIKLEERYN ELDRASAIYE RWIACRPIPK NWVAWAKFEE DRGQPDKARE VFQTALEFFG
DEEEQVEKAQ SVFAAFARME TRLKEFERAR VIYKFALARL PRSKSASLYA QYTKFEKQHG
DRAGVELTVL GKRRIQYEEE LAYDPTNYDA WFSLARLEED AYRADREDGE DVEPMRVREV
YERAVANVPP ALEKRYWRRY IYLWLQYAAF EEIDTKDYDR ARDVYKAAVK LVPHKTFTFA
KLWLAYAYFE IRRLDVSAAR KVLGAGIGMC PKPKLFTGYI ELEMRLREFD RVRTLYEKFL
TYDPSLSSAW IQWTQVESAV EDFERVRAIF ELAVQQSLDM PEIVWKAYID FEAGEGERER
ARNLYERLLE RTSHVKVWIS YALMEIATLG GGEDEDGNEI EGEAGDADLA RKVFERGYKD
LRAKGEKEDR AVLLESWKSF EQEHGDEEML AKVEDMLPTT RKRWRKAEDG SGELEEYWDL
VFPDDEKEAN PTSFKFFQAA QAWAQQRAGQ GEEGGLSYDL PSDSESENED EDGDNREEEG
MDQD