SOB_DROME
ID SOB_DROME Reviewed; 578 AA.
AC Q9VQS7; Q24571;
DT 16-AUG-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-2000, sequence version 1.
DT 03-AUG-2022, entry version 151.
DE RecName: Full=Protein sister of odd and bowel;
GN Name=sob {ECO:0000312|FlyBase:FBgn0004892}; ORFNames=CG3242;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1] {ECO:0000305, ECO:0000312|EMBL:AAC47282.1}
RP NUCLEOTIDE SEQUENCE [MRNA], AND TISSUE SPECIFICITY.
RC STRAIN=Canton-S {ECO:0000269|PubMed:8878683};
RC TISSUE=Embryo {ECO:0000269|PubMed:8878683};
RX PubMed=8878683; DOI=10.1093/genetics/144.1.171;
RA Hart M.C., Wang L., Coulter D.E.;
RT "Comparison of the structure and expression of odd-skipped and two related
RT genes that encode a new family of zinc finger proteins in Drosophila.";
RL Genetics 144:171-182(1996).
RN [2] {ECO:0000312|EMBL:AAF51087.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000269|PubMed:10731132};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3] {ECO:0000305, ECO:0000312|EMBL:AAF51087.1}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000312|EMBL:AAO24960.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=Berkeley {ECO:0000312|EMBL:AAO24960.1}; TISSUE=Embryo;
RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.W., Champe M.,
RA Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., George R.A.,
RA Gonzalez M., Guarin H., Kronmiller B., Li P.W., Liao G., Miranda A.,
RA Mungall C.J., Nunoo J., Pacleb J.M., Paragas V., Park S., Patel S.,
RA Phouanenavong S., Wan K.H., Yu C., Lewis S.E., Rubin G.M., Celniker S.E.;
RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.
RN [5] {ECO:0000305}
RP POSSIBLE FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=14597202; DOI=10.1016/j.ydbio.2003.07.011;
RA Hao I., Green R.B., Dunaevsky O., Lengyel J.A., Rauskolb C.;
RT "The odd-skipped family of zinc finger genes promotes Drosophila leg
RT segmentation.";
RL Dev. Biol. 263:282-295(2003).
RN [6] {ECO:0000305}
RP TISSUE SPECIFICITY.
RX PubMed=14568103; DOI=10.1016/j.mod.2003.08.001;
RA Johansen K.A., Green R.B., Iwaki D.D., Hernandez J.B., Lengyel J.A.;
RT "The Drm-Bowl-Lin relief-of-repression hierarchy controls fore- and hindgut
RT patterning and morphogenesis.";
RL Mech. Dev. 120:1139-1151(2003).
CC -!- FUNCTION: Pair-rule protein that determines both the size and polarity
CC of even-numbered as well as odd-numbered parasegments during
CC embryogenesis. DNA-binding transcription factor that acts primarily as
CC a transcriptional repressor but can also function as a transcriptional
CC activator, depending on the stage of development and spatial
CC restrictions (By similarity). May function redundantly with odd and drm
CC in leg joint formation during the larval stages, acting downstream of
CC Notch activation. {ECO:0000250|UniProtKB:P23803,
CC ECO:0000269|PubMed:14597202}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Has two temporally distinct modes of expression
CC during early embryogenesis; expressed in seven stripes at the
CC blastoderm stage. Also expressed in a non-periodic domain at the
CC anterior of the embryo. During gastrulation, the seven primary stripes
CC are supplemented by seven secondary stripes that appear in alternate
CC segments. This results in the labelling of each of the 14 segments in
CC the extended germ band. Expression is relatively weak at the blastoderm
CC stage, gaining in intensity at gastrulation. Expressed in the
CC invaginating stomodeum and proctodeum of the embryonic gut. By stage
CC 13, expressed in the region that will form the proventriculus and in a
CC wide ring at the most posterior portion of the midgut. Expression
CC continues in the gut through the remainder of embryogenesis. Expressed
CC in the proximal Malpighian tubules, brain and pharyngeal muscles during
CC late embryogenesis. Expressed weakly in a segmentally repeated pattern
CC in the leg disk at the distal edge of each presumptive leg segment
CC except in tarsal segments 1 to 4. {ECO:0000269|PubMed:14568103,
CC ECO:0000269|PubMed:14597202, ECO:0000269|PubMed:8878683}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U62004; AAC47282.1; -; mRNA.
DR EMBL; AE014134; AAF51087.1; -; Genomic_DNA.
DR EMBL; BT003205; AAO24960.1; -; mRNA.
DR PIR; S72227; S72227.
DR RefSeq; NP_476882.1; NM_057534.4.
DR AlphaFoldDB; Q9VQS7; -.
DR SMR; Q9VQS7; -.
DR BioGRID; 59790; 15.
DR IntAct; Q9VQS7; 10.
DR STRING; 7227.FBpp0077247; -.
DR PaxDb; Q9VQS7; -.
DR DNASU; 33581; -.
DR EnsemblMetazoa; FBtr0077558; FBpp0077247; FBgn0004892.
DR GeneID; 33581; -.
DR KEGG; dme:Dmel_CG3242; -.
DR CTD; 33581; -.
DR FlyBase; FBgn0004892; sob.
DR VEuPathDB; VectorBase:FBgn0004892; -.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000168461; -.
DR HOGENOM; CLU_027599_1_1_1; -.
DR InParanoid; Q9VQS7; -.
DR OMA; HKKSESC; -.
DR OrthoDB; 1318335at2759; -.
DR PhylomeDB; Q9VQS7; -.
DR BioGRID-ORCS; 33581; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 33581; -.
DR PRO; PR:Q9VQS7; -.
DR Proteomes; UP000000803; Chromosome 2L.
DR Bgee; FBgn0004892; Expressed in ectoderm and 60 other tissues.
DR Genevisible; Q9VQS7; DM.
DR GO; GO:0005634; C:nucleus; ISS:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISS:FlyBase.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0007350; P:blastoderm segmentation; ISS:UniProtKB.
DR GO; GO:0048619; P:embryonic hindgut morphogenesis; IBA:GO_Central.
DR GO; GO:0009880; P:embryonic pattern specification; IBA:GO_Central.
DR GO; GO:0016348; P:imaginal disc-derived leg joint morphogenesis; IMP:FlyBase.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; ISS:UniProtKB.
DR GO; GO:0045892; P:negative regulation of transcription, DNA-templated; ISS:UniProtKB.
DR GO; GO:0007366; P:periodic partitioning by pair rule gene; ISS:UniProtKB.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; ISS:UniProtKB.
DR GO; GO:0045893; P:positive regulation of transcription, DNA-templated; ISS:UniProtKB.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:FlyBase.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF00096; zf-C2H2; 5.
DR SMART; SM00355; ZnF_C2H2; 5.
DR SUPFAM; SSF57667; SSF57667; 3.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 5.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 5.
PE 2: Evidence at transcript level;
KW Activator; Developmental protein; DNA-binding; Metal-binding; Nucleus;
KW Pair-rule protein; Reference proteome; Repeat; Repressor; Transcription;
KW Transcription regulation; Zinc; Zinc-finger.
FT CHAIN 1..578
FT /note="Protein sister of odd and bowel"
FT /id="PRO_0000046928"
FT ZN_FING 395..417
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 423..445
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 451..473
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 479..501
FT /note="C2H2-type 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 507..529
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 55..92
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 224..262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 547..578
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 176
FT /note="S -> P (in Ref. 1; AAC47282)"
FT /evidence="ECO:0000305"
FT CONFLICT 188..189
FT /note="SS -> G (in Ref. 1; AAC47282)"
FT /evidence="ECO:0000305"
FT CONFLICT 546
FT /note="P -> L (in Ref. 1; AAC47282)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 578 AA; 58455 MW; 0F600954CFA7D8D0 CRC64;
MEAVKHLSAA AAAAAAAATC SDSPAKAAAS PAASSDIAEA LGELKASATA AASSASKAAT
SKHHSNNNHK PSAAATATAA HKKSESCNSN GNKCTAATSP IGSKTSNAAM AAATATAAAA
TNDLAAAAAV VLSLQGTMVS SLQQAALLPA NSAAAAALNL QALESYLALQ RLTGKSDVFR
FSNSNTGSSN SNNATTCNSS SSEADNNALP SLIDIANIEL KSSCSSSSSG EPPLTAATAS
AAATSSPSSN NSNSTSTPTT SKCVPLPSIG TVSAAVAAAA AAAAAAASQQ AALDCATAAE
LAAECDLPLL DGEDALSFEA GDLDSSYGSF MFNPSAFSQA ETDSALHSLQ ATMYQDKMSV
ISGAAGGVGA GAVGGLEEAG SSAAAAAAQR SKKQFICKFC NRQFTKSYNL LIHERTHTDE
RPYSCDICGK AFRRQDHLRD HRYIHSKEKP FKCAECGKGF CQSRTLAVHK ILHMEESPHK
CPVCNRSFNQ RSNLKTHLLT HTDIKPYNCA SCGKVFRRNC DLRRHSLTHN LSAGVGGVVG
GNLSDPFGSG SSSSSELGLP TGSASTAASS RDLVAVSD
//