PRP31_DANRE
ID PRP31_DANRE Reviewed; 508 AA.
AC Q7SXM7;
DT 21-MAR-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2003, sequence version 1.
DT 03-AUG-2022, entry version 112.
DE RecName: Full=U4/U6 small nuclear ribonucleoprotein Prp31;
DE AltName: Full=Pre-mRNA-processing factor 31;
GN Name=prpf31;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RG NIH - Zebrafish Gene Collection (ZGC) project;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Involved in pre-mRNA splicing as component of the
CC spliceosome. Required for the assembly of the U4/U5/U6 tri-snRNP
CC complex, one of the building blocks of the spliceosome.
CC {ECO:0000250|UniProtKB:Q8WWY3}.
CC -!- SUBUNIT: Identified in the spliceosome B complex. Component of the
CC U4/U6-U5 tri-snRNP complex. Component of some MLL1/MLL complex.
CC {ECO:0000250|UniProtKB:Q8WWY3}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q8WWY3}. Nucleus
CC speckle {ECO:0000250|UniProtKB:Q8WWY3}. Nucleus, Cajal body
CC {ECO:0000250|UniProtKB:Q8WWY3}. Note=Predominantly found in speckles
CC and in Cajal bodies. {ECO:0000250|UniProtKB:Q8WWY3}.
CC -!- DOMAIN: Interacts with the snRNP via the Nop domain.
CC {ECO:0000250|UniProtKB:Q8WWY3}.
CC -!- DOMAIN: The coiled coil domain is formed by two non-contiguous helices.
CC {ECO:0000250|UniProtKB:Q8WWY3}.
CC -!- SIMILARITY: Belongs to the PRP31 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC055531; AAH55531.1; -; mRNA.
DR RefSeq; NP_956798.1; NM_200504.1.
DR AlphaFoldDB; Q7SXM7; -.
DR SMR; Q7SXM7; -.
DR STRING; 7955.ENSDARP00000120708; -.
DR PaxDb; Q7SXM7; -.
DR Ensembl; ENSDART00000137029; ENSDARP00000120708; ENSDARG00000095904.
DR Ensembl; ENSDART00000190122; ENSDARP00000157055; ENSDARG00000095904.
DR GeneID; 393476; -.
DR KEGG; dre:393476; -.
DR CTD; 26121; -.
DR ZFIN; ZDB-GENE-040426-1561; prpf31.
DR eggNOG; KOG2574; Eukaryota.
DR GeneTree; ENSGT00550000075069; -.
DR HOGENOM; CLU_026337_2_0_1; -.
DR InParanoid; Q7SXM7; -.
DR OMA; IIGNGPM; -.
DR OrthoDB; 791296at2759; -.
DR PhylomeDB; Q7SXM7; -.
DR TreeFam; TF300677; -.
DR Reactome; R-DRE-72163; mRNA Splicing - Major Pathway.
DR PRO; PR:Q7SXM7; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Chromosome 16.
DR Bgee; ENSDARG00000095904; Expressed in early embryo and 29 other tissues.
DR ExpressionAtlas; Q7SXM7; baseline and differential.
DR GO; GO:0015030; C:Cajal body; IEA:UniProtKB-SubCell.
DR GO; GO:0071339; C:MLL1 complex; ISS:UniProtKB.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0071011; C:precatalytic spliceosome; IBA:GO_Central.
DR GO; GO:0097526; C:spliceosomal tri-snRNP complex; IBA:GO_Central.
DR GO; GO:0071005; C:U2-type precatalytic spliceosome; ISS:UniProtKB.
DR GO; GO:0005687; C:U4 snRNP; IBA:GO_Central.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; ISS:UniProtKB.
DR GO; GO:0005690; C:U4atac snRNP; ISS:UniProtKB.
DR GO; GO:0030622; F:U4atac snRNA binding; ISS:UniProtKB.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IMP:ZFIN.
DR GO; GO:0060041; P:retina development in camera-type eye; IMP:ZFIN.
DR GO; GO:0000244; P:spliceosomal tri-snRNP complex assembly; IEA:InterPro.
DR Gene3D; 1.10.246.90; -; 1.
DR InterPro; IPR042239; Nop_C.
DR InterPro; IPR002687; Nop_dom.
DR InterPro; IPR036070; Nop_dom_sf.
DR InterPro; IPR012976; NOSIC.
DR InterPro; IPR027105; Prp31.
DR InterPro; IPR019175; Prp31_C.
DR PANTHER; PTHR13904; PTHR13904; 1.
DR Pfam; PF01798; Nop; 1.
DR Pfam; PF09785; Prp31_C; 1.
DR SMART; SM00931; NOSIC; 1.
DR SUPFAM; SSF89124; SSF89124; 1.
DR PROSITE; PS51358; NOP; 1.
PE 2: Evidence at transcript level;
KW Coiled coil; mRNA processing; mRNA splicing; Nucleus; Reference proteome;
KW Ribonucleoprotein; RNA-binding; Spliceosome.
FT CHAIN 1..508
FT /note="U4/U6 small nuclear ribonucleoprotein Prp31"
FT /id="PRO_0000227801"
FT DOMAIN 226..344
FT /note="Nop"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00690"
FT REGION 1..45
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 345..368
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 442..461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 96..131
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT COILED 192..226
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT MOTIF 362..375
FT /note="Nuclear localization signal (NLS)"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT COMPBIAS 9..25
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..43
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT SITE 258
FT /note="Interaction with U4 snRNA"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT SITE 281
FT /note="Interaction with U4 snRNA and U4atac snRNA"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT SITE 300
FT /note="Interaction with U4atac snRNA"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT SITE 304
FT /note="Interaction with U4 snRNA and U4atac snRNA"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
FT SITE 309
FT /note="Interaction with U4 snRNA and U4atac snRNA"
FT /evidence="ECO:0000250|UniProtKB:Q8WWY3"
SQ SEQUENCE 508 AA; 56474 MW; 2CB8CFF09606DE5F CRC64;
MSLADELLAD LEEAGEEDGL YPGGEEGESD GEPGERQVDG GLEDIPEEME VDYSSTESVT
SIAKLRHSKP FAEIMDKISH YVGNQRKNSE VSGPVEADPE YRLIVAANNL TVEIDNELNI
IHKFVRDKYS KRFPELESLV PNALDYIRTV KELGNNLEKC KNNETLQQIL TNATIMVVSV
TASTTQGTML GDDELQRLEE ACDMALELNQ SKHRIYEYVE SRMSFIAPNL SIIVGASTAA
KIMGVAGGLT NLSKMPACNL MLLGAQRRTL SGFSSTSLLP HTGYIYHCDV VQTLPPDLRR
KAARLVSAKC TLASRVDSFH ESADGKVGYD LKEEIERKFD KWQEPPPVKQ VKPLPAPLDG
QRKKRGGRRY RKMKERLGLT EIRKHANRMT FAEIEDDAYQ EDLGFSLGQL GKSGSGRVRQ
AQVNDSTKAR ISKSLQRTLQ KQSMTYGGKS TVRDRSSGTS SSVAFTPLQG LEIVNPQAAE
KKVAEANQKY FSNMAEFLKV KREKEDKV