SNR40_BOVIN
ID SNR40_BOVIN Reviewed; 358 AA.
AC Q2HJH6;
DT 11-JUL-2006, integrated into UniProtKB/Swiss-Prot.
DT 21-MAR-2006, sequence version 1.
DT 25-MAY-2022, entry version 85.
DE RecName: Full=U5 small nuclear ribonucleoprotein 40 kDa protein;
DE Short=U5 snRNP 40 kDa protein;
DE AltName: Full=WD repeat-containing protein 57;
GN Name=SNRNP40; Synonyms=WDR57;
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=9913;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Hereford; TISSUE=Testis;
RG NIH - Mammalian Gene Collection (MGC) project;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Required for pre-mRNA splicing as component of the activated
CC spliceosome. Component of the U5 small nuclear ribonucleoprotein
CC (snRNP) complex and the U4/U6-U5 tri-snRNP complex, building blocks of
CC the spliceosome. {ECO:0000250|UniProtKB:Q96DI7}.
CC -!- SUBUNIT: Component of the pre-catalytic and catalytic spliceosome
CC complexes. Component of the postcatalytic spliceosome P complex. Part
CC of the U5 snRNP complex. Interacts with PRPF8. Component of the U4/U6-
CC U5 tri-snRNP complex composed of the U4, U6 and U5 snRNAs and at least
CC PRPF3, PRPF4, PRPF6, PRPF8, PRPF31, SNRNP200, TXNL4A, SNRNP40, DDX23,
CC CD2BP2, PPIH, SNU13, EFTUD2, SART1 and USP39.
CC {ECO:0000250|UniProtKB:Q96DI7}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q96DI7}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC105383; AAI05384.1; -; mRNA.
DR RefSeq; NP_001039847.1; NM_001046382.2.
DR AlphaFoldDB; Q2HJH6; -.
DR SMR; Q2HJH6; -.
DR STRING; 9913.ENSBTAP00000022372; -.
DR PaxDb; Q2HJH6; -.
DR GeneID; 534645; -.
DR KEGG; bta:534645; -.
DR CTD; 9410; -.
DR eggNOG; KOG0265; Eukaryota.
DR InParanoid; Q2HJH6; -.
DR OrthoDB; 1133270at2759; -.
DR Proteomes; UP000009136; Unplaced.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; -; 1.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR Pfam; PF00400; WD40; 7.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; SSF50978; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 5.
DR PROSITE; PS50082; WD_REPEATS_2; 7.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 2: Evidence at transcript level;
KW Isopeptide bond; Methylation; mRNA processing; mRNA splicing; Nucleus;
KW Reference proteome; Repeat; Spliceosome; Ubl conjugation; WD repeat.
FT CHAIN 1..358
FT /note="U5 small nuclear ribonucleoprotein 40 kDa protein"
FT /id="PRO_0000246075"
FT REPEAT 65..104
FT /note="WD 1"
FT REPEAT 108..147
FT /note="WD 2"
FT REPEAT 150..190
FT /note="WD 3"
FT REPEAT 192..231
FT /note="WD 4"
FT REPEAT 234..273
FT /note="WD 5"
FT REPEAT 284..323
FT /note="WD 6"
FT REPEAT 326..358
FT /note="WD 7"
FT MOD_RES 21
FT /note="Asymmetric dimethylarginine"
FT /evidence="ECO:0000250|UniProtKB:Q6PE01"
FT CROSSLNK 18
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q96DI7"
FT CROSSLNK 271
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q96DI7"
SQ SEQUENCE 358 AA; 39380 MW; ADF79EFE0B0A5F26 CRC64;
MIEQQKRKGP ELPLVPVKRQ RHELLLGAAG SGPGAGQQQA APGALLQAGP PRCSSLQAPI
MLLSGHEGEV YCCKFHPNGS TLASAGFDRL ILLWNVYGDC DNYATLKGHS GAVMELHYNT
DGSMLFSAST DKTVAVWDSE TGERVKRLKG HTSFVNSCYP ARRGPQLVCT GSDDGTVKLW
DIRKKAAIQT FQNTYQVLAV TFNDTSDQII SGGIDNDIKV WDLRQNKLTY TMRGHADSVT
GLSLSSEGSY LLSNAMDNTV RVWDVRPFAP KERCVRIFQG NVHNFEKNLL RCSWSPDGSK
IAAGSADRFV YVWDTTSRRI LYKLPGHAGS INEVAFHPDE PIILSASSDK RLYMGEIQ