CWC22_MUSDO
ID CWC22_MUSDO Reviewed; 1292 AA.
AC A0A1I8M2I8;
DT 30-AUG-2017, integrated into UniProtKB/Swiss-Prot.
DT 18-JAN-2017, sequence version 1.
DT 25-MAY-2022, entry version 22.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000305};
DE AltName: Full=Nucampholin {ECO:0000303|PubMed:28495751};
GN Name=ncm {ECO:0000303|PubMed:28495751};
OS Musca domestica (House fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Muscoidea;
OC Muscidae; Musca.
OX NCBI_TaxID=7370;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=aabys;
RX PubMed=25315136; DOI=10.1186/s13059-014-0466-3;
RA Scott J.G., Warren W.C., Beukeboom L.W., Bopp D., Clark A.G., Giers S.D.,
RA Hediger M., Jones A.K., Kasai S., Leichter C.A., Li M., Meisel R.P.,
RA Minx P., Murphy T.D., Nelson D.R., Reid W.R., Rinkevich F.D.,
RA Robertson H.M., Sackton T.B., Sattelle D.B., Thibaud-Nissen F.,
RA Tomlinson C., van de Zande L., Walden K.K., Wilson R.K., Liu N.;
RT "Genome of the house fly, Musca domestica L., a global vector of diseases
RT with adaptations to a septic environment.";
RL Genome Biol. 15:RESEARCH0466.1-RESEARCH0466.16(2014).
RN [2]
RP DISRUPTION PHENOTYPE.
RX PubMed=28495751; DOI=10.1126/science.aam5498;
RA Sharma A., Heinze S.D., Wu Y., Kohlbrenner T., Morilla I., Brunner C.,
RA Wimmer E.A., van de Zande L., Robinson M.D., Beukeboom L.W., Bopp D.;
RT "Male sex in houseflies is determined by Mdmd, a paralog of the generic
RT splice factor gene CWC22.";
RL Science 356:642-645(2017).
CC -!- FUNCTION: Required for pre-mRNA splicing and for exon-junction complex
CC (EJC) assembly. Hinders eIF4AIII from non-specifically binding RNA and
CC escorts it to the splicing machinery to promote EJC assembly on mature
CC mRNAs. {ECO:0000250|UniProtKB:Q9HCG8}.
CC -!- SUBUNIT: Component of the spliceosome C complex.
CC {ECO:0000250|UniProtKB:Q9HCG8}.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000250|UniProtKB:Q9HCG8}.
CC -!- DISRUPTION PHENOTYPE: Early lethality in both males and females.
CC {ECO:0000269|PubMed:28495751}.
CC -!- SIMILARITY: Belongs to the CWC22 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A1I8M2I8; -.
DR SMR; A0A1I8M2I8; -.
DR STRING; 7370.XP_005185085.1; -.
DR VEuPathDB; VectorBase:MDOA000598; -.
DR eggNOG; KOG2140; Eukaryota.
DR Proteomes; UP000095301; Unplaced.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR Pfam; PF02847; MA3; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; SSF48371; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW mRNA processing; mRNA splicing; Nucleus.
FT CHAIN 1..1292
FT /note="Pre-mRNA-splicing factor CWC22 homolog"
FT /id="PRO_0000441320"
FT DOMAIN 483..666
FT /note="MIF4G"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00698"
FT DOMAIN 776..892
FT /note="MI"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00698"
FT REGION 1..425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 726..767
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 989..1292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..21
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..53
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..89
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 90..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 135..149
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 150..175
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..200
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..287
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..315
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 316..355
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 356..425
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 729..744
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 989..1031
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1042..1082
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1083..1097
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1133..1292
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1292 AA; 147084 MW; 158618C7CB06D797 CRC64;
MTGSQSENNT STSSNSSEDT NNDSRNESET NADKNVVETK TTSDNNNKMM NAAETAANEE
KNGRQKKEKS KTPSKEDKKS RKKKSESSSE SSSSSDSDSS ESSSSSSDGE VSTSSGSSSD
SEKVKSKVKN KSKSPSAQRE VVQKETVTDT PDNISKETQE KLVEDIPKEN ELNVNLAKED
SVNAKQNDTE ASNENVAEEI TKPPQKQPDT EEGEITENNE KNSSARSPSK EKQLSANRSR
SRERRSHSGD SRGAHSGDKG SPSRLKRSPS RSKKSPSRSK RSVSRNRSSS RHGRDSSRNR
RSVSGDKENQ LRRKSRSRSS RSRSRSRSRS RSRRPERKHR SRTPRRSRSR ERRHERRRSV
SSDYDARRRS RRSESIERRR EERKRRHAER DEREKSKRSR RDEDDSFKTN KVSAEMEKDK
ENDNATVTDP KAKITERQRK TVDILTSRTG GAYIPPAKLR MMQAEITDKA SAAYQRIAWE
ALKKSIHGYI NKVNVDNIAI ITRELLKENI VRGRGLLCRS IIQAQAASPT FTHVYAALVA
IINSKFPNIG ELLLKRLVIQ FKRAFRRNDK TVCLSSSRFI AHLVNQRVAH EILALEILTL
LVESPTDDSV EVAIAFLKEC GMKLTEVSSK GIGAIFEMLK NILHEGKLDK RVQYMIEVVF
QVRKDGFKDH QSVIESLELV EEDDQFTHLL MLDDATSPED VLNAFKFDEQ YEANEEKYKG
LSKEILGSDA SDSDGSSGSG SDSDSESSDS DEDKNEGDEK PTAGDIIDNT ETNLIALRRT
IYLTINSSLD YEECAHKLMK MQLKPGQEIE LCHMFLDCCA EQRTYEKFYG LLAQRFCNIN
KSYIEPFEEI FKDTYQTTHR LDTNRLRNVS KFFAHLLFTD AISWDVLDCI KLNEDDTTSS
SRIFIKILFQ ELAEYMGLGQ LNKKLKDEVL AESLAGLFPK DNPRNTRFSI NFFTSIGLGG
LTDELRQFLK NAPKSVPAIN AEILANKPVD VSNSSSSSSS SSSSTSSSSS SSSSSSESSA
SSSSSDSSSD SSSDSEVDKK KRKGKVKKTK KKKTSNKKEK TKKSKSKNKK DAKKKAKKRK
SKKGSTSSED DSSKSESSSS ESKSSDSSDS EQSGNDKSKK KTKSKSKTKP NKKSKRSDSE
DVENERDIKR KKREDFGNYI HEDRRRDEEY SKNGGRDRKK DNHGNNKEKR EIPQRRDSEI
ERRREEREKR HRERERNFSR SRSRSRGGSG RKDRDRRETN GRSERKDRER DRDSDHRRYR
KDYDRSRSRD RNEKREREYS RSKLRNTSKE RG