KHDC4_XENTR
ID KHDC4_XENTR Reviewed; 612 AA.
AC A0JM64;
DT 24-JUL-2007, integrated into UniProtKB/Swiss-Prot.
DT 12-DEC-2006, sequence version 1.
DT 03-AUG-2022, entry version 50.
DE RecName: Full=KH homology domain-containing protein 4;
DE AltName: Full=Brings lots of money 7 {ECO:0000250|UniProtKB:Q7Z7F0};
DE AltName: Full=Pre-mRNA splicing factor protein khdc4;
GN Name=khdc4; Synonyms=blom7;
OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Silurana.
OX NCBI_TaxID=8364;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Testis;
RG NIH - Xenopus Gene Collection (XGC) project;
RL Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: RNA-binding protein involved in pre-mRNA splicing. Interacts
CC with the PRP19C/Prp19 complex/NTC/Nineteen complex, which is part of the
CC spliceosome. Involved in regulating splice site selection. Preferentially
CC binds RNA with A/C-rich sequences and poly-C stretches.
CC {ECO:0000250|UniProtKB:Q7Z7F0}.
CC -!- SUBUNIT: Interacts with PRPF19. {ECO:0000250|UniProtKB:Q7Z7F0}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q7Z7F0}. Cytoplasm
CC {ECO:0000250|UniProtKB:Q7Z7F0}.
CC -!- DOMAIN: The C-terminal part is necessary for the interaction with the
CC PRP19C/Prp19 complex/NTC/Nineteen complex.
CC {ECO:0000250|UniProtKB:Q7Z7F0}.
CC -!- DOMAIN: The KH domains mediate RNA binding.
CC {ECO:0000250|UniProtKB:Q7Z7F0}.
CC -!- SIMILARITY: Belongs to the KHDC4 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC125755; AAI25756.1; -; mRNA.
DR RefSeq; NP_001072751.1; NM_001079283.1.
DR AlphaFoldDB; A0JM64; -.
DR SMR; A0JM64; -.
DR STRING; 8364.ENSXETP00000027314; -.
DR DNASU; 780208; -.
DR Ensembl; ENSXETT00000092027; ENSXETP00000098835; ENSXETG00000037253.
DR GeneID; 780208; -.
DR KEGG; xtr:780208; -.
DR CTD; 22889; -.
DR Xenbase; XB-GENE-5957982; khdc4.
DR InParanoid; A0JM64; -.
DR OrthoDB; 633868at2759; -.
DR Proteomes; UP000008143; Chromosome 8.
DR Proteomes; UP000790000; Unplaced.
DR Bgee; ENSXETG00000037253; Expressed in neurula embryo and 13 other tissues.
DR GO; GO:0005737; C:cytoplasm; ISS:UniProtKB.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0003723; F:RNA binding; ISS:UniProtKB.
DR GO; GO:0006376; P:mRNA splice site selection; ISS:UniProtKB.
DR Gene3D; 3.30.1370.10; -; 2.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR InterPro; IPR031121; RIK/BLOM7.
DR PANTHER; PTHR15744; PTHR15744; 1.
DR SUPFAM; SSF54791; SSF54791; 2.
PE 2: Evidence at transcript level;
KW Cytoplasm; mRNA processing; mRNA splicing; Nucleus; Phosphoprotein;
KW Reference proteome; RNA-binding.
FT CHAIN 1..612
FT /note="KH homology domain-containing protein 4"
FT /id="PRO_0000296674"
FT DOMAIN 105..185
FT /note="KH 1"
FT /evidence="ECO:0000250|UniProtKB:Q7Z7F0"
FT DOMAIN 237..319
FT /note="KH 2"
FT /evidence="ECO:0000250|UniProtKB:Q7Z7F0"
FT REGION 477..545
FT /note="Required for nuclear retention"
FT /evidence="ECO:0000250|UniProtKB:Q7Z7F0"
FT REGION 510..542
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 565..612
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 578..604
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 612 AA; 65049 MW; 274421FF37273945 CRC64;
MSAGSGRRSK WDQPGPPSTT LLLPGVLPAI VPFTAGAFLS PPEMPPVTAT CESVSAPSGA
LDAAAAVAAK INAMLMAKGK IKPTQNAPEK VQVPGKAPSA AKSKDDLVVA EVEINDVPLT
CRNLLTRGQT QDEISRMSGA AVSTRGRYMT AEEKAKIGPG DRPLYLHVQG QTRELVDRAV
NRIKEIITNG VVKAATGSSP TFNGATVTVY HQPAPVAPVA PPKPQFQSGM HYVQDKLFVG
LEHAVATFNV KEKVEGPGCS YLQHIQLETG AKVFLRGKGS GCIEPASGRE AFEPMYIYIS
HPKPEGLAAA KKLCENLLQT VHSEYNRFVN QIATTAPLTG YAQPPQIGAV PMQPQYYPPN
GYQTGFPVVQ QPAPQPAVQV PYVVSTPIGS PVPPLPGVVP TMAAPVPPVP AVPTRYPIPQ
VQPPGSTVPA QLTAPYMPAP HGKHAAAVVT QAPLQGQKRR FTEELPEERD SGLLGYQHGP
IHMTNLGTGF PAQSKVDGAT IKSDTMVVKE RERDRQLMPP PGMPVSAQKE PEEKSSPGTV
GVADDYPVKK LKSSGKTFGL VAYAGDSSDE EEDHGVLKSS GSLSQGWNAG YQYPASQQQQ
RAKPQMPFWM AP