ID SNR40_PONAB Reviewed; 357 AA.
AC Q5RF51;
DT 07-JUN-2005, integrated into UniProtKB/Swiss-Prot.
DT 21-DEC-2004, sequence version 1.
DT 25-MAY-2022, entry version 77.
DE RecName: Full=U5 small nuclear ribonucleoprotein 40 kDa protein;
DE Short=U5 snRNP 40 kDa protein;
DE AltName: Full=WD repeat-containing protein 57;
GN Name=SNRNP40; Synonyms=WDR57;
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Kidney;
RG The German cDNA consortium;
RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Required for pre-mRNA splicing as component of the activated
CC spliceosome. Component of the U5 small nuclear ribonucleoprotein
CC (snRNP) complex and the U4/U6-U5 tri-snRNP complex, building blocks of
CC the spliceosome. {ECO:0000250|UniProtKB:Q96DI7}.
CC -!- SUBUNIT: Component of the pre-catalytic and catalytic spliceosome
CC complexes. Component of the postcatalytic spliceosome P complex. Part
CC of the U5 snRNP complex. Interacts with PRPF8. Component of the U4/U6-
CC U5 tri-snRNP complex composed of the U4, U6 and U5 snRNAs and at least
CC PRPF3, PRPF4, PRPF6, PRPF8, PRPF31, SNRNP200, TXNL4A, WDR57, SNRNP40,
CC DDX23, CD2BP2, PPIH, SNU13, EFTUD2, SART1 and USP39.
CC {ECO:0000250|UniProtKB:Q96DI7}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q96DI7}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CR857310; CAH89606.1; -; mRNA.
DR RefSeq; NP_001124715.1; NM_001131243.1.
DR AlphaFoldDB; Q5RF51; -.
DR SMR; Q5RF51; -.
DR STRING; 9601.ENSPPYP00000001868; -.
DR GeneID; 100171563; -.
DR KEGG; pon:100171563; -.
DR CTD; 9410; -.
DR eggNOG; KOG0265; Eukaryota.
DR InParanoid; Q5RF51; -.
DR OrthoDB; 1133270at2759; -.
DR Proteomes; UP000001595; Unplaced.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; -; 1.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR Pfam; PF00400; WD40; 7.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; SSF50978; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 5.
DR PROSITE; PS50082; WD_REPEATS_2; 7.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 2: Evidence at transcript level;
KW Isopeptide bond; Methylation; mRNA processing; mRNA splicing; Nucleus;
KW Reference proteome; Repeat; Spliceosome; Ubl conjugation; WD repeat.
FT CHAIN 1..357
FT /note="U5 small nuclear ribonucleoprotein 40 kDa protein"
FT /id="PRO_0000051419"
FT REPEAT 64..103
FT /note="WD 1"
FT REPEAT 107..146
FT /note="WD 2"
FT REPEAT 149..189
FT /note="WD 3"
FT REPEAT 191..230
FT /note="WD 4"
FT REPEAT 233..272
FT /note="WD 5"
FT REPEAT 283..322
FT /note="WD 6"
FT REPEAT 325..357
FT /note="WD 7"
FT MOD_RES 21
FT /note="Asymmetric dimethylarginine"
FT /evidence="ECO:0000250|UniProtKB:Q6PE01"
FT CROSSLNK 18
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q96DI7"
FT CROSSLNK 270
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q96DI7"
SQ SEQUENCE 357 AA; 39158 MW; F849C91B1334F911 CRC64;
MIEQQKRKGP ELPLVPVKRQ RHELLLGAGS GPGAGQQQAT PGALLQAGPP RCSSLQAPIM
LLSGHEGEVY CCKFHPNGST LASAGFDRLI LLWNVYGDCG NYATLKGYSG AVMELHYNTD
GSMLFSASTD KTVAVWDSET GERVKRLKGH TSFVNSCYPA RRGPQLVCTG SDDGTVKLWD
IRKKAAIQTF QNTYQVLAVT FNDTSDQIIS GGIDNDIKVW DLRQNKLTYT MRGHADSVTG
LSLSSEGSYL LSNAMDNTVR VWDVRPFAPK ERCVKIFQGN VHNFEKNLLR CSWSPDGSKI
AAGSADRSVC VWDTTSRRIL YKLPGHAGSI NEVAFHPDEP IIISASSDKR LYMGEIQ
//