PEP_DROME
ID PEP_DROME Reviewed; 716 AA.
AC P41073; Q8IGB1; Q8MSV5; Q9VVJ2; Q9VVJ3; Q9VVJ4;
DT 01-FEB-1995, integrated into UniProtKB/Swiss-Prot.
DT 01-FEB-1995, sequence version 1.
DT 03-AUG-2022, entry version 171.
DE RecName: Full=Zinc finger protein on ecdysone puffs;
GN Name=Pep; ORFNames=CG6143;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM B), FUNCTION, SUBCELLULAR LOCATION, AND
RP DEVELOPMENTAL STAGE.
RC STRAIN=Oregon-R; TISSUE=Embryo;
RX PubMed=1899840; DOI=10.1101/gad.5.2.188;
RA Amero S.A., Elgin S.C.R., Beyer A.L.;
RT "A unique zinc finger protein is associated preferentially with active
RT ecdysone-responsive loci in Drosophila.";
RL Genes Dev. 5:188-200(1991).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM C), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 202-716 (ISOFORMS B/C).
RC STRAIN=Berkeley; TISSUE=Embryo;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-201; THR-203; SER-206;
RP SER-673; SER-684; SER-686 AND THR-692, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: May play a role in the process of early and late gene
CC activation, or possibly in RNA processing, for a defined set of
CC developmentally regulated loci. {ECO:0000269|PubMed:1899840}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:1899840}. Chromosome
CC {ECO:0000269|PubMed:1899840}. Note=It is associated with the active
CC ecdysone-regulated loci on polytene chromosomes and with some heat
CC shock-induced puffs. Its distribution pattern follows the changes in
CC puffing patterns during the developmental program or after heat shock.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=B;
CC IsoId=P41073-1; Sequence=Displayed;
CC Name=A;
CC IsoId=P41073-2; Sequence=VSP_009605, VSP_037472, VSP_037473;
CC Name=C;
CC IsoId=P41073-3; Sequence=VSP_009605;
CC -!- DEVELOPMENTAL STAGE: Expressed both maternally and zygotically; zygotic
CC expression is at a low and constant level thereafter.
CC {ECO:0000269|PubMed:1899840}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAN71637.1; Type=Miscellaneous discrepancy; Note=Intron retention.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X56689; CAA40017.1; -; mRNA.
DR EMBL; AE014296; AAF49317.4; -; Genomic_DNA.
DR EMBL; AE014296; AAF49319.3; -; Genomic_DNA.
DR EMBL; AE014296; AAS64975.1; -; Genomic_DNA.
DR EMBL; AY118552; AAM49921.1; -; mRNA.
DR EMBL; BT001867; AAN71637.1; ALT_SEQ; mRNA.
DR PIR; S26759; S26759.
DR RefSeq; NP_001246817.1; NM_001259888.3. [P41073-3]
DR RefSeq; NP_524858.1; NM_080119.6. [P41073-1]
DR RefSeq; NP_730290.3; NM_168742.4. [P41073-2]
DR RefSeq; NP_996118.1; NM_206396.4. [P41073-3]
DR AlphaFoldDB; P41073; -.
DR BioGRID; 70020; 18.
DR DIP; DIP-20217N; -.
DR IntAct; P41073; 36.
DR MINT; P41073; -.
DR STRING; 7227.FBpp0074963; -.
DR iPTMnet; P41073; -.
DR PaxDb; P41073; -.
DR PRIDE; P41073; -.
DR DNASU; 45961; -.
DR EnsemblMetazoa; FBtr0075198; FBpp0074962; FBgn0004401. [P41073-2]
DR EnsemblMetazoa; FBtr0075199; FBpp0074963; FBgn0004401. [P41073-1]
DR EnsemblMetazoa; FBtr0075200; FBpp0089253; FBgn0004401. [P41073-3]
DR EnsemblMetazoa; FBtr0304977; FBpp0293516; FBgn0004401. [P41073-3]
DR GeneID; 45961; -.
DR KEGG; dme:Dmel_CG6143; -.
DR CTD; 45961; -.
DR FlyBase; FBgn0004401; Pep.
DR VEuPathDB; VectorBase:FBgn0004401; -.
DR eggNOG; ENOG502RYYN; Eukaryota.
DR GeneTree; ENSGT00440000039084; -.
DR InParanoid; P41073; -.
DR OMA; KATPQRN; -.
DR PhylomeDB; P41073; -.
DR SignaLink; P41073; -.
DR BioGRID-ORCS; 45961; 1 hit in 1 CRISPR screen.
DR ChiTaRS; Pep; fly.
DR GenomeRNAi; 45961; -.
DR PRO; PR:P41073; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0004401; Expressed in wing disc and 25 other tissues.
DR ExpressionAtlas; P41073; baseline and differential.
DR Genevisible; P41073; DM.
DR GO; GO:0071013; C:catalytic step 2 spliceosome; HDA:FlyBase.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0071011; C:precatalytic spliceosome; HDA:FlyBase.
DR GO; GO:0003677; F:DNA binding; IDA:FlyBase.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003697; F:single-stranded DNA binding; IDA:FlyBase.
DR GO; GO:0003727; F:single-stranded RNA binding; IDA:FlyBase.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IC:FlyBase.
DR GO; GO:0048024; P:regulation of mRNA splicing, via spliceosome; IMP:FlyBase.
DR InterPro; IPR026811; CIZ1.
DR InterPro; IPR022755; Znf_C2H2_jaz.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15491; PTHR15491; 1.
DR Pfam; PF12171; zf-C2H2_jaz; 1.
DR SMART; SM00355; ZnF_C2H2; 3.
DR SUPFAM; SSF57667; SSF57667; 3.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 3.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Chromosome; DNA-binding; Metal-binding; Nucleus;
KW Phosphoprotein; Reference proteome; Repeat; Zinc; Zinc-finger.
FT CHAIN 1..716
FT /note="Zinc finger protein on ecdysone puffs"
FT /id="PRO_0000047016"
FT ZN_FING 216..240
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 288..310
FT /note="C2H2-type 2; atypical"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 319..343
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 489..513
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 103..168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 182..208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 350..447
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 534..716
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 379..383
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255"
FT MOTIF 544..548
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255"
FT COMPBIAS 189..203
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..405
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..447
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 534..555
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..635
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 639..658
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 201
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 203
FT /note="Phosphothreonine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 206
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 673
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 684
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 686
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 692
FT /note="Phosphothreonine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 1..23
FT /note="Missing (in isoform A and isoform C)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_009605"
FT VAR_SEQ 125..132
FT /note="MVSRGGGA -> PYQGVSIR (in isoform A)"
FT /evidence="ECO:0000305"
FT /id="VSP_037472"
FT VAR_SEQ 133..716
FT /note="Missing (in isoform A)"
FT /evidence="ECO:0000305"
FT /id="VSP_037473"
SQ SEQUENCE 716 AA; 78048 MW; 256D7D765C5F3050 CRC64;
MVSVKVNGNP QNRLVNNAKV NGNMAFRGNQ NRNRNFGGGN NNYGGPMGAN RMGGMNMSPW
ESQNPGGGQF GNNMRQGGGQ MNAQAINLAN NLLNNLFRNQ NPPSLLDLPR GGGGMGNRNQ
RGGPMVSRGG GAGNRLNNRR GQGGGFQNRG ATGSGPKPPP KQGGGGIRKQ NAFDRAKKLL
AKNANQNKKK EPTPGEKKIE SPTKESPYAS VPNDMFYCHL CKKHMWDANS FENHIKGRTH
LMMREGIEES YRLKANMIRQ EAKIAEQLKS IEFDRLKRMG KSKQRQLDYC TMCDLNFHGH
ISTHRKSEGH LQLKKFLHPK CIECNKEFAT RIDYDTHLLS AEHLKKAAEN NTKVGERKRQ
TLPISTEEEE TRDLRLPQKR KKKPVKKEGE AADGEAKKEG AGDGEGAEGD EAEGEEAKEG
EEAADETKEG DELNESQEEE EVALPVDPED CILDFNDGDE IPSEVDTRLP KYNWQRAVGP
GLISKLECYE CSVCSKFFDT EVTAEIHSRT ATHHRNFLKF INEKSSDTKI AQKRAAAALE
ENERKKRKVE EAEAPAAEGA AEETTEGAEG ELYDPSEATG DDEDVEMVDD NAEGEGEGEG
DEEAEAEVEE DGAGQDNGEE EMEAQEEEGQ EGEQEPEPEP APVQTPAPAE PAPPAKTPAK
TPTKAAAPAA VASPAAAATS ADASPSPAKK ATPARAAAGA KATPQRQRAR GRYNRY