ID CFT1_COCIM Reviewed; 1387 AA.
AC Q1E5B0; J3KLB0;
DT 12-JUN-2007, integrated into UniProtKB/Swiss-Prot.
DT 11-JUL-2006, sequence version 1.
DT 25-MAY-2022, entry version 70.
DE RecName: Full=Protein CFT1;
DE AltName: Full=Cleavage factor two protein 1;
GN Name=CFT1; ORFNames=CIMG_02253;
OS Coccidioides immitis (strain RS) (Valley fever fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenaceae; Coccidioides.
OX NCBI_TaxID=246410;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RS;
RX PubMed=19717792; DOI=10.1101/gr.087551.108;
RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., Wortman J.R.,
RA Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., Zeng Q., Hung C.-Y.,
RA McMahan C., Muszewska A., Grynberg M., Mandel M.A., Kellner E.M.,
RA Barker B.M., Galgiani J.N., Orbach M.J., Kirkland T.N., Cole G.T.,
RA Henn M.R., Birren B.W., Taylor J.W.;
RT "Comparative genomic analyses of the human fungal pathogens Coccidioides
RT and their relatives.";
RL Genome Res. 19:1722-1731(2009).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=RS;
RX PubMed=20516208; DOI=10.1101/gr.103911.109;
RA Neafsey D.E., Barker B.M., Sharpton T.J., Stajich J.E., Park D.J.,
RA Whiston E., Hung C.-Y., McMahan C., White J., Sykes S., Heiman D.,
RA Young S., Zeng Q., Abouelleil A., Aftuck L., Bessette D., Brown A.,
RA FitzGerald M., Lui A., Macdonald J.P., Priest M., Orbach M.J.,
RA Galgiani J.N., Kirkland T.N., Cole G.T., Birren B.W., Henn M.R.,
RA Taylor J.W., Rounsley S.D.;
RT "Population genomic sequencing of Coccidioides fungi reveals recent
RT hybridization and transposon control.";
RL Genome Res. 20:938-946(2010).
CC -!- FUNCTION: RNA-binding component of the cleavage and polyadenylation
CC factor (CPF) complex, which plays a key role in polyadenylation-
CC dependent pre-mRNA 3'-end formation and cooperates with cleavage
CC factors including the CFIA complex and NAB4/CFIB. Involved in poly(A)
CC site recognition. May be involved in coupling transcription termination
CC and mRNA 3'-end formation (By similarity). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the CFT1 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG704911; EAS36899.3; -; Genomic_DNA.
DR RefSeq; XP_001248482.1; XM_001248481.2.
DR AlphaFoldDB; Q1E5B0; -.
DR SMR; Q1E5B0; -.
DR STRING; 246410.Q1E5B0; -.
DR EnsemblFungi; EAS36899; EAS36899; CIMG_02253.
DR GeneID; 4568108; -.
DR KEGG; cim:CIMG_02253; -.
DR VEuPathDB; FungiDB:CIMG_02253; -.
DR InParanoid; Q1E5B0; -.
DR OMA; PMTKFKL; -.
DR OrthoDB; 360328at2759; -.
DR Proteomes; UP000001261; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; -; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR Pfam; PF03178; CPSF_A; 1.
PE 3: Inferred from homology;
KW mRNA processing; Nucleus; Reference proteome; RNA-binding.
FT CHAIN 1..1387
FT /note="Protein CFT1"
FT /id="PRO_0000290627"
FT REGION 440..493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 509..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 454..470
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..493
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1387 AA; 152937 MW; 9F8DD6B411094DB1 CRC64;
MQCYTELLPP SGVTHAISLP FLSATSNNLI VAKTSILQVF SLVNVAYGTS APPNADDKGR
VERQQYTKLI LVAEYDLSGT ITGLGRVKIL DSRSGGEALL VSTRNAKLSL VEWDHERHGI
STISIHYYER EDVHSSPWTP DLRLCPSLLA VDPSSRCAIL NFGIHSVAIL PFHQTGDDLV
MDEFDEDLDE KPEGASNIPA QAAVANDTTM YKTPYASSFV LPLTALDPAL VHPIHLAFLY
EYREPTFGIL YSHLTTSSAL LHDRKDIVSY AVFTLDIQQR ASTTLITVSR LPSDLWKVVP
LPPPIGGALL IGSNELIHVD QAGKTNAVGI NEFARQASAF SMVDQSDLGL RLEGCVVEQL
GTDSGDILLV LADGKMAILR LKVDGRSVSG ISAQLVSEKA GGSILKARPS CSASLGRGKV
FFGSEETDSL LIGWSRPSQS MRKPKVESAD DVFGDHSETE DDEDDIYEDD LYSTPVNQTT
LSKTTSQTNG LNKDDFVFRS HDRLWNLGPM SDVTLGRPPG SHDKNRKQSS SRTSADLELV
VTQGKGNAGG LAVLQRELDP YVIDSMKMDN VDGVWSIQVG APDSTNTRTS SRNYDKYLVF
SKSTEPGKEQ SVVYSVGGSG IEEMKAPEFN PNEDSTVDIG TLAGGTRVVQ VLKSEVRSYD
TNLELAQIYP IWDEDTSDEL SVVSASFAEP YVLIVRDDQS LLLLQADKSG DLDEVNIDGI
LSSHRWLSGC LYLDKYHTFV PTKGQDQPLS DNILLVLLRA DHTLFIFSLP TLTEPLCSVD
GVDLLPLILS CEPPPKRVTY RETLSEVLIA DLGDSISRQP YMILRTANDD LILYQPYHPK
TSLDKPELRF VKIIDHFLPR FDPSPKAYMP HSKFLRAYSD ICGYKTVFMS GSNPCFVMKS
STSSPHVLRL RGEAVSSLSS FHIPACEKGF AYVDASNMVR MCRLPSNTRF DNSWVTRKVH
VGDQIDCVEY FAHSEIYALG SSHKVDFKLP EDDEIHPEWR SEVISFMPQL ERGCIKLLSP
RTWSVVDSYE LGDAERVMCM KTINMEISEI THEMKDMLVV GTATVRGEDI TPRGSIYVFE
IIEVAPDPDR PETNRKLKIF AKDDVKGAVT AVSGIGGQGF LIMAQGQKCM VRGLKEDGSL
LPVAFMDMQC YVKVLKELQG TGLCIMGDAL KGIWFAGYSE EPYRLTLFGK DNEYLQVIAA
DFLPDGKRLY ILVADDDCTI HVLEYDPEDP TSSKGDRLLH RSSFHTGHFT STMTLLPEHS
SSPSADDPEE DDMDVDYVPK SYQVLVTSQE GSIGVVTPLT EDSYRRLSAL QSQLVTSMEH
PCGLNPKAYR AVESDGFGGR GIVDGNLLLR WLDMGVQRKA EIAGRVGADI ESIRVDLETI
SGGLDFL
//