SYMPK_DROME
ID SYMPK_DROME Reviewed; 1165 AA.
AC Q8MSU4; Q9VNH4;
DT 03-APR-2013, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2002, sequence version 1.
DT 03-AUG-2022, entry version 147.
DE RecName: Full=Symplekin {ECO:0000312|EMBL:AAF51962.2};
GN Name=Sym; ORFNames=CG2097;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1] {ECO:0000312|EMBL:AAF51962.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000312|EMBL:AAF51962.2}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [3] {ECO:0000312|EMBL:AAM49961.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley {ECO:0000269|PubMed:12537569};
RC TISSUE=Embryo {ECO:0000269|PubMed:12537569};
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [4] {ECO:0000305}
RP FUNCTION, AND SUBCELLULAR LOCATION.
RX PubMed=18042462; DOI=10.1016/j.molcel.2007.10.009;
RA Wagner E.J., Burch B.D., Godfrey A.C., Salzler H.R., Duronio R.J.,
RA Marzluff W.F.;
RT "A genome-wide RNA interference screen reveals that variant histones are
RT necessary for replication-dependent histone pre-mRNA processing.";
RL Mol. Cell 28:692-699(2007).
RN [5] {ECO:0000305}
RP FUNCTION, AND INTERACTION WITH CPSF73; CPSF100; SLBP AND LSM11.
RX PubMed=19450530; DOI=10.1016/j.molcel.2009.04.024;
RA Sullivan K.D., Steiniger M., Marzluff W.F.;
RT "A core complex of CPSF73, CPSF100, and Symplekin may form two different
RT cleavage factors for processing of poly(A) and histone mRNAs.";
RL Mol. Cell 34:322-332(2009).
RN [6] {ECO:0000305, ECO:0000312|PDB:3GS3}
RP X-RAY CRYSTALLOGRAPHY (2.40 ANGSTROMS) OF 19-270, AND HEAT REPEATS.
RX PubMed=19576221; DOI=10.1016/j.jmb.2009.06.062;
RA Kennedy S.A., Frazier M.L., Steiniger M., Mast A.M., Marzluff W.F.,
RA Redinbo M.R.;
RT "Crystal structure of the HEAT domain from the Pre-mRNA processing factor
RT Symplekin.";
RL J. Mol. Biol. 392:115-128(2009).
CC -!- FUNCTION: Component of a protein complex required for cotranscriptional
CC processing of 3'-ends of polyadenylated and histone pre-mRNA.
CC {ECO:0000269|PubMed:18042462, ECO:0000269|PubMed:19450530}.
CC -!- SUBUNIT: Interacts with Cpsf73 and Cpsf100 forming a core cleavage
CC factor required for both polyadenylated and histone mRNA processing.
CC Interacts with Slbp and Lsm11. {ECO:0000269|PubMed:19450530}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:18042462}.
CC Note=Concentrates in the histone locus body.
CC {ECO:0000269|PubMed:18042462}.
CC -!- DOMAIN: The HEAT repeats have been determined based on 3D-structure
CC analysis and are not detected by sequence-based prediction programs.
CC {ECO:0000269|PubMed:19576221}.
CC -!- SIMILARITY: Belongs to the Symplekin family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; AAF51962.2; -; Genomic_DNA.
DR EMBL; AY118592; AAM49961.1; -; mRNA.
DR RefSeq; NP_649580.1; NM_141323.2.
DR PDB; 3GS3; X-ray; 2.40 A; A=19-270.
DR PDB; 4IMI; X-ray; 2.35 A; A/C=19-351.
DR PDB; 4IMJ; X-ray; 2.58 A; A/C=19-351.
DR PDB; 4YGX; X-ray; 2.95 A; A/C=19-351.
DR PDB; 6NPW; X-ray; 2.49 A; A/C=19-351.
DR PDBsum; 3GS3; -.
DR PDBsum; 4IMI; -.
DR PDBsum; 4IMJ; -.
DR PDBsum; 4YGX; -.
DR PDBsum; 6NPW; -.
DR AlphaFoldDB; Q8MSU4; -.
DR SMR; Q8MSU4; -.
DR BioGRID; 65914; 13.
DR IntAct; Q8MSU4; 12.
DR STRING; 7227.FBpp0078372; -.
DR PaxDb; Q8MSU4; -.
DR PRIDE; Q8MSU4; -.
DR DNASU; 40709; -.
DR EnsemblMetazoa; FBtr0078723; FBpp0078372; FBgn0037371.
DR GeneID; 40709; -.
DR KEGG; dme:Dmel_CG2097; -.
DR UCSC; CG2097-RA; d. melanogaster.
DR CTD; 40709; -.
DR FlyBase; FBgn0037371; Sym.
DR VEuPathDB; VectorBase:FBgn0037371; -.
DR eggNOG; KOG1895; Eukaryota.
DR GeneTree; ENSGT00390000017045; -.
DR HOGENOM; CLU_004756_0_0_1; -.
DR InParanoid; Q8MSU4; -.
DR OMA; QVCKVKV; -.
DR OrthoDB; 386749at2759; -.
DR PhylomeDB; Q8MSU4; -.
DR Reactome; R-DME-159231; Transport of Mature mRNA Derived from an Intronless Transcript.
DR Reactome; R-DME-72163; mRNA Splicing - Major Pathway.
DR Reactome; R-DME-72187; mRNA 3'-end processing.
DR Reactome; R-DME-73856; RNA Polymerase II Transcription Termination.
DR Reactome; R-DME-77595; Processing of Intronless Pre-mRNAs.
DR SignaLink; Q8MSU4; -.
DR BioGRID-ORCS; 40709; 0 hits in 1 CRISPR screen.
DR EvolutionaryTrace; Q8MSU4; -.
DR GenomeRNAi; 40709; -.
DR PRO; PR:Q8MSU4; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0037371; Expressed in egg chamber and 24 other tissues.
DR Genevisible; Q8MSU4; DM.
DR GO; GO:0035363; C:histone locus body; IDA:UniProtKB.
DR GO; GO:0005847; C:mRNA cleavage and polyadenylation specificity factor complex; IDA:FlyBase.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0061689; C:tricellular tight junction; IDA:FlyBase.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006398; P:mRNA 3'-end processing by stem-loop binding and cleavage; IMP:FlyBase.
DR GO; GO:0006378; P:mRNA polyadenylation; IMP:FlyBase.
DR Gene3D; 1.25.10.10; -; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR021850; Symplekin/Pta1.
DR InterPro; IPR032460; Symplekin/Pta1_N.
DR InterPro; IPR022075; Symplekin_C.
DR PANTHER; PTHR15245:SF20; PTHR15245:SF20; 1.
DR Pfam; PF11935; SYMPK_PTA1_N; 1.
DR Pfam; PF12295; Symplekin_C; 1.
DR SUPFAM; SSF48371; SSF48371; 2.
PE 1: Evidence at protein level;
KW 3D-structure; Coiled coil; mRNA processing; Nucleus; Reference proteome;
KW Repeat; RNA-binding.
FT CHAIN 1..1165
FT /note="Symplekin"
FT /id="PRO_0000421978"
FT REPEAT 23..58
FT /note="HEAT 1"
FT /evidence="ECO:0000269|PubMed:19576221"
FT REPEAT 61..95
FT /note="HEAT 2"
FT /evidence="ECO:0000269|PubMed:19576221"
FT REPEAT 98..140
FT /note="HEAT 3"
FT /evidence="ECO:0000269|PubMed:19576221"
FT REPEAT 147..186
FT /note="HEAT 4"
FT /evidence="ECO:0000269|PubMed:19576221"
FT REPEAT 218..257
FT /note="HEAT 5"
FT /evidence="ECO:0000269|PubMed:19576221"
FT REGION 365..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT HELIX 22..37
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 42..56
FT /evidence="ECO:0007829|PDB:4IMI"
FT TURN 57..60
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 61..67
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 68..72
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 73..75
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 80..96
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 98..103
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 105..111
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 117..140
FT /evidence="ECO:0007829|PDB:4IMI"
FT STRAND 141..143
FT /evidence="ECO:0007829|PDB:3GS3"
FT HELIX 146..164
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 165..167
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 171..187
FT /evidence="ECO:0007829|PDB:4IMI"
FT STRAND 193..195
FT /evidence="ECO:0007829|PDB:4IMJ"
FT HELIX 204..206
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 216..234
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 241..257
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 259..261
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 262..274
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 282..300
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 303..308
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 309..318
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 323..327
FT /evidence="ECO:0007829|PDB:4IMI"
FT HELIX 335..348
FT /evidence="ECO:0007829|PDB:4IMI"
SQ SEQUENCE 1165 AA; 132077 MW; CFA818C50B2CC847 CRC64;
MDSIIGRSQF VSETANLFTD EKTATARAKV VDWCNELVIA SPSTKCELLA KVQETVLGSC
AELAEEFLES VLSLAHDSNM EVRKQVVAFV EQVCKVKVEL LPHVINVVSM LLRDNSAQVI
KRVIQACGSI YKNGLQYLCS LMEPGDSAEQ AWNILSLIKA QILDMIDNEN DGIRTNAIKF
LEGVVVLQSF ADEDSLKRDG DFSLADVPDH CTLFRREKLQ EEGNNILDIL LQFHGTTHIS
SVNLIACTSS LCTIAKMRPI FMGAVVEAFK QLNANLPPTL TDSQVSSVRK SLKMQLQTLL
KNRGAFEFAS TIRGMLVDLG SSTNEIQKLI PKMDKQEMAR RQKRILENAA QSLAKRARLA
CEQQDQQQRE MELDTEELER QKQKSTRVNE KFLAEHFRNP ETVVTLVLEF LPSLPTEVPQ
KFLQEYTPIR EMSIQQQVTN ISRFFGEQLS EKRLGPGAAT FSREPPMRVK KVQAIESTLT
AMEVDEDAVQ KLSEEEFQRK EEATKKLRET MERAKGEQTV IEKMKERAKT LKLQEITKPL
PRNLKEKFLT DAVRRILNSE RQCIKGGVSS KRRKLVTVIA ATFPDNVRYG IMEFILEDIK
QRIDLAFSWL FEEYSLLQGF TRHTYVKTEN RPDHAYNELL NKLIFGIGER CDHKDKIILI
RRVYLEAPIL PEVSIGHLVQ LSLDDEFSQH GLELIKDLAV LRPPRKNRFV RVLLNFSVHE
RLDLRDLAQA HLVSLYHVHK ILPARIDEFA LEWLKFIEQE SPPAAVFSQD FGRPTEEPDW
REDTTKVCFG LAFTLLPYKP EVYLQQICQV FVSTSAELKR TILRSLDIPI KKMGVESPTL
LQLIEDCPKG METLVIRIIY ILTERVPSPH EELVRRVRDL YQNKVKDVRV MIPVLSGLTR
SELISVLPKL IKLNPAVVKE VFNRLLGIGA EFAHQTMAMT PTDILVALHT IDTSVCDIKA
IVKATSLCLA ERDLYTQEVL MAVLQQLVEV TPLPTLMMRT TIQSLTLYPR LANFVMNLLQ
RLIIKQVWRQ KVIWEGFLKT VQRLKPQSMP ILLHLPPAQL VDALQQCPDL RPALSEYAES
MQDEPMNGSG ITQQVLDIIS GKSVDVFVTD ESGGYISAEH IKKEAPDPSE ISVISTVPVL
TSLVPLPVPP PIGSDLNQPL PPGED