PCFS5_ARATH
ID PCFS5_ARATH Reviewed; 410 AA.
AC Q9FIX8;
DT 26-NOV-2014, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 117.
DE RecName: Full=Polyadenylation and cleavage factor homolog 5 {ECO:0000305};
GN Name=PCFS5 {ECO:0000303|PubMed:18479511};
GN OrderedLocusNames=At5g43620 {ECO:0000312|Araport:AT5G43620};
GN ORFNames=K9D7.12 {ECO:0000312|EMBL:BAB11625.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000312|Proteomes:UP000006548};
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10048488; DOI=10.1093/dnares/5.6.379;
RA Asamizu E., Sato S., Kaneko T., Nakamura Y., Kotani H., Miyajima N.,
RA Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 5. VIII. Sequence
RT features of the regions of 1,081,958 bp covered by seventeen physically
RT assigned P1 and TAC clones.";
RL DNA Res. 5:379-391(1998).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Cheuk R.F., Chen H., Kim C.J., Shinn P., Ecker J.R.;
RT "Arabidopsis ORF clones.";
RL Submitted (MAY-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP GENE FAMILY.
RX PubMed=15236668; DOI=10.1186/1471-2164-5-39;
RA Englbrecht C.C., Schoof H., Boehm S.;
RT "Conservation, diversification and expansion of C2H2 zinc finger proteins
RT in the Arabidopsis thaliana genome.";
RL BMC Genomics 5:39-39(2004).
RN [6]
RP INTERACTION WITH CSTF77; CLPS3; PCFS4 AND PCFS1, GENE FAMILY, AND
RP NOMENCLATURE.
RX PubMed=18479511; DOI=10.1186/1471-2164-9-220;
RA Hunt A.G., Xu R., Addepalli B., Rao S., Forbes K.P., Meeks L.R., Xing D.,
RA Mo M., Zhao H., Bandyopadhyay A., Dampanaboina L., Marion A.,
RA Von Lanken C., Li Q.Q.;
RT "Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of
RT protein-protein interactions and gene expression profiling.";
RL BMC Genomics 9:220-220(2008).
CC -!- SUBUNIT: Forms a complex with cleavage and polyadenylation specificity
CC factor (CPSF) subunits CSTF77, CLPS3, PCFS4 and PCFS1.
CC {ECO:0000269|PubMed:18479511}.
CC -!- INTERACTION:
CC Q9FIX8; Q93Z00: TCP14; NbExp=4; IntAct=EBI-1775691, EBI-4424563;
CC Q9FIX8; Q8LPR5: TCP4; NbExp=3; IntAct=EBI-1775691, EBI-15192325;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB016875; BAB11625.1; -; Genomic_DNA.
DR EMBL; CP002688; AED94988.1; -; Genomic_DNA.
DR EMBL; AK175583; BAD43346.1; -; mRNA.
DR EMBL; BT023453; AAY56444.1; -; mRNA.
DR EMBL; BT025626; ABF59044.1; -; mRNA.
DR RefSeq; NP_199175.1; NM_123728.4.
DR AlphaFoldDB; Q9FIX8; -.
DR BioGRID; 19632; 10.
DR IntAct; Q9FIX8; 10.
DR STRING; 3702.AT5G43620.1; -.
DR PaxDb; Q9FIX8; -.
DR PRIDE; Q9FIX8; -.
DR EnsemblPlants; AT5G43620.1; AT5G43620.1; AT5G43620.
DR GeneID; 834382; -.
DR Gramene; AT5G43620.1; AT5G43620.1; AT5G43620.
DR KEGG; ath:AT5G43620; -.
DR Araport; AT5G43620; -.
DR TAIR; locus:2158362; AT5G43620.
DR eggNOG; KOG2071; Eukaryota.
DR HOGENOM; CLU_046922_0_0_1; -.
DR InParanoid; Q9FIX8; -.
DR OMA; NNNEKEC; -.
DR OrthoDB; 264854at2759; -.
DR PhylomeDB; Q9FIX8; -.
DR PRO; PR:Q9FIX8; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; Q9FIX8; baseline and differential.
DR Genevisible; Q9FIX8; AT.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005849; C:mRNA cleavage factor complex; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0000993; F:RNA polymerase II complex binding; IBA:GO_Central.
DR GO; GO:0006379; P:mRNA cleavage; IEA:InterPro.
DR GO; GO:0006378; P:mRNA polyadenylation; IBA:GO_Central.
DR GO; GO:0006369; P:termination of RNA polymerase II transcription; IBA:GO_Central.
DR InterPro; IPR045154; PCF11-like.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15921; PTHR15921; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 1: Evidence at protein level;
KW Coiled coil; Metal-binding; Nucleus; Reference proteome; Zinc; Zinc-finger.
FT CHAIN 1..410
FT /note="Polyadenylation and cleavage factor homolog 5"
FT /id="PRO_0000431351"
FT ZN_FING 247..269
FT /note="C2H2-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 191..214
FT /evidence="ECO:0000255"
FT COMPBIAS 1..21
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 410 AA; 46049 MW; 059D0FA238B0E064 CRC64;
MASNGSFSAQ RNANAGTTMK RRNDNRGYGG GIGCYQEERN RYAPPQKRFR SQLQQQFRSG
HNPLYHYGSN TNNNVSRVSS QSYNNYGVDV IASNSSFALP NNDSNTNNYQ KPFVVYGNPN
PQIVPLPLPY RKLDPLDSLP QWVPNSTPNY PVRSSNFVPN TPDFTNVQNP MNHSNMVSVV
SQSMHQPIVL SKELTDLLSL LNNEKEKKTS EASNNDSLPV GLSFDNPSSL NVRHESVIKS
LYSDMPRQCT SCGVRFKCQE EHSKHMDWHV RKNRSVKTTT RLGQQPKKSR GWLASASLWL
CAPTGGGTVE VASFGGGEMQ KKNEKDQVQK QHMVPADEDQ KNCALCVEPF EEFFSHEADD
WMYKDAVYLT KNGRIVHVKC MPEPRPAKDL REPSRVMSVT VPSVAKAILC