CTF50_ARATH
ID CTF50_ARATH Reviewed; 429 AA.
AC Q8L4J2; F4K0J5; Q9FME6;
DT 26-NOV-2014, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2002, sequence version 1.
DT 25-MAY-2022, entry version 138.
DE RecName: Full=Cleavage stimulation factor subunit 50 {ECO:0000303|PubMed:12379796};
DE Short=AtCstF-50 {ECO:0000303|PubMed:12379796};
DE Short=AtCstF50 {ECO:0000303|PubMed:16282318};
DE AltName: Full=CF-1 50 kDa subunit {ECO:0000305};
DE AltName: Full=Cleavage stimulation factor 50 kDa subunit {ECO:0000305};
DE Short=CSTF 50 kDa subunit {ECO:0000305};
GN Name=CSTF50 {ECO:0000303|PubMed:12379796};
GN OrderedLocusNames=At5g60940 {ECO:0000312|Araport:AT5G60940};
GN ORFNames=MSL3.60 {ECO:0000312|EMBL:BAB10643.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000312|EMBL:AAM61334.1};
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
RC STRAIN=cv. Columbia;
RX PubMed=12379796; DOI=10.1093/jxb/erf073;
RA Yao Y., Song L., Katz Y., Galili G.;
RT "Cloning and characterization of Arabidopsis homologues of the animal CstF
RT complex that regulates 3' mRNA cleavage and polyadenylation.";
RL J. Exp. Bot. 53:2277-2278(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=9501997; DOI=10.1093/dnares/4.6.401;
RA Nakamura Y., Sato S., Kaneko T., Kotani H., Asamizu E., Miyajima N.,
RA Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 5. III. Sequence
RT features of the regions of 1,191,918 bp covered by seventeen physically
RT assigned P1 clones.";
RL DNA Res. 4:401-414(1997).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RA Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B.,
RA Feldmann K.A.;
RT "Full-length cDNA from Arabidopsis thaliana.";
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP GENE FAMILY.
RX PubMed=16282318; DOI=10.1074/jbc.m510964200;
RA Forbes K.P., Addepalli B., Hunt A.G.;
RT "An Arabidopsis Fip1 homolog interacts with RNA and provides conceptual
RT links with a number of other polyadenylation factor subunits.";
RL J. Biol. Chem. 281:176-186(2006).
RN [7]
RP INTERACTION WITH CSTF64; CPSF100; CPSF30; FIPS5 AND PABN3, GENE FAMILY, AND
RP NOMENCLATURE.
RX PubMed=18479511; DOI=10.1186/1471-2164-9-220;
RA Hunt A.G., Xu R., Addepalli B., Rao S., Forbes K.P., Meeks L.R., Xing D.,
RA Mo M., Zhao H., Bandyopadhyay A., Dampanaboina L., Marion A.,
RA Von Lanken C., Li Q.Q.;
RT "Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of
RT protein-protein interactions and gene expression profiling.";
RL BMC Genomics 9:220-220(2008).
CC -!- FUNCTION: One of the multiple factors required for polyadenylation and
CC 3'-end cleavage of pre-mRNAs. May be responsible for the interaction of
CC CSTF with other factors to form a stable complex on the pre-mRNA.
CC {ECO:0000250|UniProtKB:Q05048}.
CC -!- SUBUNIT: Homodimer. Belongs to the CSTF complex (By similarity).
CC Interacts with the polyadenylation factor subunits CSTF64, PABN3, CPSF30,
CC FIPS5 and CPSF100 (PubMed:18479511).
CC {ECO:0000250|UniProtKB:Q05048, ECO:0000269|PubMed:18479511}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q05048}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q8L4J2-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8L4J2-2; Sequence=VSP_057234;
CC -!- DOMAIN: The N-terminus mediates homodimerization.
CC {ECO:0000250|UniProtKB:Q05048}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB10643.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF515696; AAM64165.1; -; mRNA.
DR EMBL; AB008269; BAB10643.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002688; AED97399.1; -; Genomic_DNA.
DR EMBL; CP002688; AED97400.1; -; Genomic_DNA.
DR EMBL; AY136384; AAM97050.1; -; mRNA.
DR EMBL; BT000184; AAN15503.1; -; mRNA.
DR EMBL; AY084766; AAM61334.1; -; mRNA.
DR RefSeq; NP_200902.1; NM_125487.3. [Q8L4J2-1]
DR RefSeq; NP_974972.1; NM_203243.2. [Q8L4J2-2]
DR AlphaFoldDB; Q8L4J2; -.
DR SMR; Q8L4J2; -.
DR BioGRID; 21459; 3.
DR IntAct; Q8L4J2; 3.
DR STRING; 3702.AT5G60940.1; -.
DR iPTMnet; Q8L4J2; -.
DR PaxDb; Q8L4J2; -.
DR PRIDE; Q8L4J2; -.
DR ProteomicsDB; 222632; -. [Q8L4J2-1]
DR EnsemblPlants; AT5G60940.1; AT5G60940.1; AT5G60940. [Q8L4J2-1]
DR EnsemblPlants; AT5G60940.2; AT5G60940.2; AT5G60940. [Q8L4J2-2]
DR GeneID; 836215; -.
DR Gramene; AT5G60940.1; AT5G60940.1; AT5G60940. [Q8L4J2-1]
DR Gramene; AT5G60940.2; AT5G60940.2; AT5G60940. [Q8L4J2-2]
DR KEGG; ath:AT5G60940; -.
DR Araport; AT5G60940; -.
DR TAIR; locus:2173542; AT5G60940.
DR eggNOG; KOG0640; Eukaryota.
DR InParanoid; Q8L4J2; -.
DR OMA; SNRCINT; -.
DR PhylomeDB; Q8L4J2; -.
DR PRO; PR:Q8L4J2; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; Q8L4J2; baseline and differential.
DR Genevisible; Q8L4J2; AT.
DR GO; GO:0005848; C:mRNA cleavage stimulating factor complex; IBA:GO_Central.
DR GO; GO:0031124; P:mRNA 3'-end processing; IEA:InterPro.
DR Gene3D; 2.130.10.10; -; 1.
DR InterPro; IPR044633; CstF1-like.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR44133; PTHR44133; 1.
DR Pfam; PF00400; WD40; 5.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF50978; SSF50978; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 4.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; mRNA processing; Nucleus; Reference proteome; Repeat;
KW WD repeat.
FT CHAIN 1..429
FT /note="Cleavage stimulation factor subunit 50"
FT /id="PRO_0000431325"
FT REPEAT 121..160
FT /note="WD 1"
FT /evidence="ECO:0000255"
FT REPEAT 174..213
FT /note="WD 2"
FT /evidence="ECO:0000255"
FT REPEAT 218..257
FT /note="WD 3"
FT /evidence="ECO:0000255"
FT REPEAT 264..303
FT /note="WD 4"
FT /evidence="ECO:0000255"
FT REPEAT 308..347
FT /note="WD 5"
FT /evidence="ECO:0000255"
FT REPEAT 351..392
FT /note="WD 6"
FT /evidence="ECO:0000255"
FT REPEAT 396..429
FT /note="WD 7"
FT /evidence="ECO:0000255"
FT REGION 20..41
FT /note="Hydrophobic"
FT /evidence="ECO:0000250|UniProtKB:Q05048"
FT VAR_SEQ 1..99
FT /note="MGNSGDLEQALQDGNIFRQLNALIVAHLRHHNLSQVASAVASATMTPLNIEV
FT PPNRLLELVAKGLAAENNGTLRGVSSSVLLPSSYGSITTPRTASIDF -> MFGIVRT
FT (in isoform 2)"
FT /id="VSP_057234"
SQ SEQUENCE 429 AA; 47065 MW; 7512E37B8CA1BF66 CRC64;
MGNSGDLEQA LQDGNIFRQL NALIVAHLRH HNLSQVASAV ASATMTPLNI EVPPNRLLEL
VAKGLAAENN GTLRGVSSSV LLPSSYGSIT TPRTASIDFS VNHAKGSSKT IPKHESKTLS
EHKSVVRCAR FSPDGMFFAT GGADTSIKLF EVPKVKQMIS GDTQARPLIR TFYDHAEPIN
DLDFHPRSTI LISSAKDNCI KFFDFSKTTA KRAFKVFQDT HNVRSISFHP SGEFLLAGTD
HPIPHLYDVN TYQCFLPSNF PDSGVSGAIN QVRYSSTGSI YITASKDGAI RLFDGVSAKC
VRSIGNAHGK SEVTSAVFTK DQRFVLSSGK DSTVKLWEIG SGRMVKEYLG AKRVKLRSQA
IFNDTEEFVI SIDEASNEVV TWDARTADKV AKWPSNHNGA PRWIEHSPVE SVFVTCGIDR
SIRFWKESV