HMPB_DROME
ID HMPB_DROME Reviewed; 782 AA.
AC P31264; O97058; Q4JFI5; Q9VI44;
DT 01-JUL-1993, integrated into UniProtKB/Swiss-Prot.
DT 03-JUL-2003, sequence version 2.
DT 03-AUG-2022, entry version 176.
DE RecName: Full=Homeotic protein proboscipedia;
GN Name=pb; ORFNames=CG31481;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORMS A; B; C AND D).
RC STRAIN=Canton-S;
RX PubMed=1348688; DOI=10.1002/j.1460-2075.1992.tb05188.x;
RA Cribbs D.L., Pultz M.A., Johnson D., Mazzulla M., Kaufman T.C.;
RT "Structural complexity and evolutionary conservation of the Drosophila
RT homeotic gene proboscipedia.";
RL EMBO J. 11:1437-1449(1992).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=Berkeley;
RA Celniker S.E., Pfeiffer B.D., Knafels J., Martin C.H., Mayeda C.A.,
RA Palazzolo M.J.;
RT "Complete sequence of the Antennapedia complex of Drosophila.";
RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [4]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
CC -!- FUNCTION: Sequence-specific transcription factor which is part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis. Controls
CC development of mouthparts, and labial and maxillary palps.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Comment=Additional isoforms seem to exist.;
CC Name=A;
CC IsoId=P31264-1; Sequence=Displayed;
CC Name=C;
CC IsoId=P31264-2; Sequence=VSP_002398;
CC Name=B;
CC IsoId=P31264-3; Sequence=VSP_002399;
CC Name=D;
CC IsoId=P31264-4; Sequence=VSP_002400;
CC -!- SIMILARITY: Belongs to the Antp homeobox family. Proboscipedia
CC subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; S94723; AAA08526.1; -; Genomic_DNA.
DR EMBL; S94709; AAA08526.1; JOINED; Genomic_DNA.
DR EMBL; S94714; AAA08526.1; JOINED; Genomic_DNA.
DR EMBL; S94717; AAA08526.1; JOINED; Genomic_DNA.
DR EMBL; S94719; AAA08526.1; JOINED; Genomic_DNA.
DR EMBL; X63729; CAA45272.1; -; mRNA.
DR EMBL; X63728; CAA45271.1; -; mRNA.
DR EMBL; AE001572; AAD19802.1; -; Genomic_DNA.
DR EMBL; AE014297; AAF54089.3; -; Genomic_DNA.
DR EMBL; AE014297; AAS65118.1; -; Genomic_DNA.
DR EMBL; AE014297; AAS65119.1; -; Genomic_DNA.
DR EMBL; AE014297; AAS65120.1; -; Genomic_DNA.
DR PIR; S20881; S20881.
DR RefSeq; NP_476669.3; NM_057321.5. [P31264-1]
DR RefSeq; NP_996161.1; NM_206439.2. [P31264-4]
DR RefSeq; NP_996162.1; NM_206440.2. [P31264-2]
DR RefSeq; NP_996163.1; NM_206441.2. [P31264-3]
DR AlphaFoldDB; P31264; -.
DR SMR; P31264; -.
DR BioGRID; 66024; 18.
DR IntAct; P31264; 2.
DR STRING; 7227.FBpp0088333; -.
DR PaxDb; P31264; -.
DR DNASU; 40826; -.
DR EnsemblMetazoa; FBtr0089276; FBpp0088333; FBgn0051481. [P31264-1]
DR EnsemblMetazoa; FBtr0089277; FBpp0088334; FBgn0051481. [P31264-3]
DR EnsemblMetazoa; FBtr0089278; FBpp0088335; FBgn0051481. [P31264-2]
DR EnsemblMetazoa; FBtr0089279; FBpp0088336; FBgn0051481. [P31264-4]
DR GeneID; 40826; -.
DR KEGG; dme:Dmel_CG31481; -.
DR UCSC; CG31481-RA; d. melanogaster.
DR CTD; 40826; -.
DR FlyBase; FBgn0051481; pb.
DR VEuPathDB; VectorBase:FBgn0051481; -.
DR eggNOG; KOG0489; Eukaryota.
DR GeneTree; ENSGT00940000155029; -.
DR HOGENOM; CLU_359131_0_0_1; -.
DR InParanoid; P31264; -.
DR OMA; HEQQFYY; -.
DR PhylomeDB; P31264; -.
DR BioGRID-ORCS; 40826; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 40826; -.
DR PRO; PR:P31264; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0051481; Expressed in mouthpart and 3 other tissues.
DR Genevisible; P31264; DM.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048728; P:proboscis development; IMP:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0007381; P:specification of segmental identity, labial segment; IMP:FlyBase.
DR GO; GO:0007382; P:specification of segmental identity, maxillary segment; IMP:FlyBase.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001827; Homeobox_Antennapedia_CS.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00032; ANTENNAPEDIA; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW Nucleus; Reference proteome.
FT CHAIN 1..782
FT /note="Homeotic protein proboscipedia"
FT /id="PRO_0000200263"
FT DNA_BIND 198..257
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 153..195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 251..336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 358..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 506..586
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 164..169
FT /note="Antp-type hexapeptide"
FT COMPBIAS 256..280
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..336
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..380
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..455
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 468..493
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 528..563
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 184..193
FT /note="Missing (in isoform D)"
FT /evidence="ECO:0000303|PubMed:1348688"
FT /id="VSP_002400"
FT VAR_SEQ 184..189
FT /note="GDNSIT -> A (in isoform C)"
FT /evidence="ECO:0000303|PubMed:1348688"
FT /id="VSP_002398"
FT VAR_SEQ 189..194
FT /note="TEFVPE -> K (in isoform B)"
FT /evidence="ECO:0000303|PubMed:1348688"
FT /id="VSP_002399"
FT CONFLICT 520
FT /note="H -> D (in Ref. 1; AAA08526/CAA45272/CAA45271)"
FT /evidence="ECO:0000305"
FT CONFLICT 651
FT /note="Q -> E (in Ref. 1; AAA08526/CAA45272/CAA45271)"
FT /evidence="ECO:0000305"
FT CONFLICT 685
FT /note="H -> D (in Ref. 1; AAA08526/CAA45272/CAA45271)"
FT /evidence="ECO:0000305"
FT CONFLICT 701
FT /note="P -> H (in Ref. 1; AAA08526/CAA45272/CAA45271)"
FT /evidence="ECO:0000305"
FT CONFLICT 768..782
FT /note="SNLANDFAPEYYQLS -> ATWPTTLRRNTTNSVSSCRKYARGQCKYLLA
FT (in Ref. 1; AAA08526/CAA45272/CAA45271)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 782 AA; 83747 MW; BAB876E2B1429F00 CRC64;
MQEVCSSLDT TSMGTQIKSE SPLNPLQVQT GQTSLPVGGC GGAGVVGGVG GVGVSVGQPG
IGQQGVPPVP SVLMVNKMTP NCDKRSADTA YWMTASEGGF INSQPSMAEF LNHLSPESPK
IGTPVGSGAI GGVGVNVNVN VGVGVGYPVG VVPQTPDGMD SVPEYPWMKE KKTSRKSSNN
NNQGDNSITE FVPENGLPRR LRTAYTNTQL LELEKEFHFN KYLCRPRRIE IAASLDLTER
QVKVWFQNRR MKHKRQTLSK TDDEDNKDSL KGDDDQSDSN SNSKKSCQGC ELPSDDIPDS
TSNSRGHNNN TPSATNNNPS AGNLTPNSSL ETGISSNLMG STTVSASNVI SADSSVASSV
SLDEDIEESS PIKVKKKDDG QVIKKEAVST SSKASPFGYE NSTPSLVSFR RDSDASAVGN
APTSKAVGKK RFQSAANAIA TPTPLSDSNS GNGSGGGPAG GYFPGYYPSP KQQQQVQQQQ
LHPQQQQLPQ QQPQDYYGKY DIEFAASPHH NPHNKQQALH GEYLSPKPSS ANFHQNSQQQ
QQNDHFYYNY NDTNGTPYLN HQQQHHHHAQ HHQQQQHHQN HVADFEGPVN GPSNFNNGAY
YDNMSFQQQA QAHQHQTVVF QQQQPHQPAA INHQHMHHLG NGETYSALGL QMENCEGYNN
FGAAGTGGGY YEAGQQPPIP ATHGHGHHPH HVQVPAQAHA PIHAHHNSAA IPGGVGVGPP
PSHIHGFAIN GGPAVQGQAF GNNGSTAAGT AAISGLENSN SSDFNFLSNL ANDFAPEYYQ
LS