CPSF1_ORYSJ
ID CPSF1_ORYSJ Reviewed; 1441 AA.
AC Q7XWP1;
DT 25-JUL-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2004, sequence version 2.
DT 03-AUG-2022, entry version 70.
DE RecName: Full=Probable cleavage and polyadenylation specificity factor subunit 1;
DE AltName: Full=Cleavage and polyadenylation specificity factor 160 kDa subunit;
DE Short=CPSF 160 kDa subunit;
GN OrderedLocusNames=Os04g0252200, LOC_Os04g18010; ORFNames=OSJNBa0032B23.5;
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=12447439; DOI=10.1038/nature01183;
RA Feng Q., Zhang Y., Hao P., Wang S., Fu G., Huang Y., Li Y., Zhu J., Liu Y.,
RA Hu X., Jia P., Zhang Y., Zhao Q., Ying K., Yu S., Tang Y., Weng Q.,
RA Zhang L., Lu Y., Mu J., Lu Y., Zhang L.S., Yu Z., Fan D., Liu X., Lu T.,
RA Li C., Wu Y., Sun T., Lei H., Li T., Hu H., Guan J., Wu M., Zhang R.,
RA Zhou B., Chen Z., Chen L., Jin Z., Wang R., Yin H., Cai Z., Ren S., Lv G.,
RA Gu W., Zhu G., Tu Y., Jia J., Zhang Y., Chen J., Kang H., Chen X., Shao C.,
RA Sun Y., Hu Q., Zhang X., Zhang W., Wang L., Ding C., Sheng H., Gu J.,
RA Chen S., Ni L., Zhu F., Chen W., Lan L., Lai Y., Cheng Z., Gu M., Jiang J.,
RA Li J., Hong G., Xue Y., Han B.;
RT "Sequence and analysis of rice chromosome 4.";
RL Nature 420:316-320(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International Rice Genome Sequencing Project (IRGSP);
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare;
RX PubMed=24280374; DOI=10.1186/1939-8433-6-4;
RA Kawahara Y., de la Bastide M., Hamilton J.P., Kanamori H., McCombie W.R.,
RA Ouyang S., Schwartz D.C., Tanaka T., Wu J., Zhou S., Childs K.L.,
RA Davidson R.M., Lin H., Quesada-Ocampo L., Vaillancourt B., Sakai H.,
RA Lee S.S., Kim J., Numa H., Itoh T., Buell C.R., Matsumoto T.;
RT "Improvement of the Oryza sativa Nipponbare reference genome using next
RT generation sequence and optical map data.";
RL Rice 6:4-4(2013).
CC -!- FUNCTION: CPSF plays a key role in pre-mRNA 3'-end formation,
CC recognizing the AAUAAA signal sequence and interacting with
CC poly(A) polymerase and other factors to bring about cleavage and poly(A)
CC addition. This subunit is involved in the RNA recognition step of the
CC polyadenylation reaction (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: CPSF is a heterotetramer composed of four distinct subunits
CC of 160, 100, 70 and 30 kDa. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the CPSF1 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL606991; CAD39979.2; -; Genomic_DNA.
DR EMBL; AP014960; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; Q7XWP1; -.
DR SMR; Q7XWP1; -.
DR STRING; 4530.OS04T0252200-01; -.
DR PaxDb; Q7XWP1; -.
DR PRIDE; Q7XWP1; -.
DR eggNOG; KOG1896; Eukaryota.
DR InParanoid; Q7XWP1; -.
DR Proteomes; UP000000763; Chromosome 4.
DR Proteomes; UP000059680; Chromosome 4.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006378; P:mRNA polyadenylation; IBA:GO_Central.
DR Gene3D; 2.130.10.10; -; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW mRNA processing; Nucleus; Reference proteome; RNA-binding.
FT CHAIN 1..1441
FT /note="Probable cleavage and polyadenylation specificity
FT factor subunit 1"
FT /id="PRO_0000247466"
SQ SEQUENCE 1441 AA; 158262 MW; F9359CF34AB317DF CRC64;
MSYAAYKMMH WPTGVDHCAA GFVTHSPSDA AAFFTAATVG PGPEGDIDSA AAASRPRRLG
PSPNLVVAAA NVLEVYAVRA ETAAEDGGGG TQPSSSSGAV LDGISGARLE LVCYYRLHGN
IESMTVLSDG AENRRATIAL AFKDAKITCL EFDDAIHGLR TSSMHCFEGP EWQHLKRGRE
SFAWGPVIKA DPLGRCGAAL AYGLQMIILK AAQVGHSLVG EDEPTCALSS TAVCIESSYL
IDLRALDMNH VKDFAFVHGY IEPVLVILHE QEPTWAGRIL SKHHTCMISA FSISMTLKQH
PVIWSAANLP HDAYQLLAVP PPISGVLVIC ANSIHYHSQS TSCSLDLNNF SSHPDGSPEI
SKSNFQVELD AAKATWLSND IVMFSTKAGE MLLLTVVYDG RVVQRLDLMK SKASVLSSAV
TSIGNSFFFL GSRLGDSLLV QFSYCASKSV LQDLTNERSA DIEGDLPFSK RLKRIPSDVL
QDVTSVEELS FQNIIAPNSL ESAQKISYIV RDALINVGPL KDFSYGLRAN ADPNAMGNAK
QSNYELVCCS GHGKNGSLSV LQQSIRPDLI TEVELPSCRG IWTVYYKSYR GQMAEDNEYH
AYLIISLENR TMVLETGDDL GEVTETVDYF VQASTIAAGN LFGRRRVIQV YGKGARVLDG
SFMTQELNFT THASESSSSE ALGVACASIA DPYVLLKMVD GSVQLLIGDY CTCTLSVNAP
SIFISSSERI AACTLYRDRG PEPWLTKTRS DAWLSTGIAE AIDGNGTSSH DQSDIYCIIC
YESGKLEIFE VPSFRCVFSV ENFISGEALL VDKFSQLIYE DSTKERYDCT KASLKKEAGD
SIRIVELAMH RWSGQFSRPF LFGLLNDGTL LCYHAFSYEA SESNVKRVPL SPQGSADHHN
ASDSRLRNLR FHRVSIDITS REDIPTLGRP RITTFNNVGG YEGLFLSGTR PAWVMVCRQR
LRVHPQLCDG PIEAFTVLHN VNCSHGFIYV TSQGFLKICQ LPSAYNYDSY WPVQKVPLHG
TPHQVTYYAE QSLYPLIVSV PVVRPLNQVL SSMADQESVH HMDNDVTSTD ALHKTYTVDE
FEVRILELEK PGGHWETKST IPMQLFENAL TVRIVTLHNT TTKENETLLA IGTAYVLGED
VAARGRVLLF SFTKSENSQN LVTEVYSKES KGAVSAVASL QGHLLIASGP KITLNKWTGA
ELTAVAFYDA PLHVVSLNIV KNFVLFGDIH KSIYFLSWKE QGSQLSLLAK DFGSLDCFAT
EFLIDGSTLS LVASDSDKNV QIFYYAPKMV ESWKGQKLLS RAEFHVGAHI TKFLRLQMLP
TQGLSSEKTN RFALLFGNLD GGIGCIAPID ELTFRRLQSL QRKLVDAVPH VCGLNPRSFR
QFHSNGKGHR PGPDNIIDFE LLCSYEMLSL DEQLDVAQQI GTTRSQILSN FSDISLGTSF
L