DEFI_ANOGA
ID DEFI_ANOGA Reviewed; 102 AA.
AC Q17027; O61721; Q38L95; Q38L99; Q38LA3; Q38LA4; Q38LA6; Q38LB0; Q38LB2;
AC Q38LB3; Q38LC5; Q38LD1; Q38LD3; Q38LD5; Q7QHQ1;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1997, sequence version 1.
DT 03-AUG-2022, entry version 135.
DE RecName: Full=Defensin;
DE Flags: Precursor;
GN Name=Def1; ORFNames=AGAP011294;
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], PROTEIN SEQUENCE OF 63-76, INDUCTION, AND
RP DEVELOPMENTAL STAGE.
RC STRAIN=G3;
RX PubMed=8799739; DOI=10.1111/j.1365-2583.1996.tb00055.x;
RA Richman A.M., Bulet P., Hetru C., Barillas-Mury C., Hoffmann J.A.,
RA Kafatos F.C.;
RT "Inducible immune factors of the vector mosquito Anopheles gambiae:
RT biochemical purification of a defensin antibacterial peptide and molecular
RT cloning of preprodefensin cDNA.";
RL Insect Mol. Biol. 5:203-210(1996).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=KWA;
RX PubMed=11029666; DOI=10.1046/j.1365-2583.2000.00212.x;
RA Eggleston P., Lu W., Zhao Y.;
RT "Genomic organization and immune regulation of the defensin gene from the
RT mosquito, Anopheles gambiae.";
RL Insect Mol. Biol. 9:481-490(2000).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=Gbab1062c, Gbab1063g, Gbab1064e, Gbab1068c, Gbab5002a, Gbab5010bs,
RC Gbab5011f, Gbab5020c, Gbab5024h, Gbab5029as, Gbbkj16015a, Gbbkj16024b,
RC Gbbkj16025b, Gbbkj16030c, Gbbkj16032b, Gbbkj16034a, Gbbkj16037a, Gbbkj6a,
RC Gbbkj7a, Gbjo11cs, Gbjo12g, Gbjo14c, Gbjo15as, Gbjo22d, Gbjo3a, Gbjo4i,
RC Gbjo6e, Gbjo7h, Gbjo8h, Gbjo9a, Gbng3334as, Gbng3335ds, Gbng3336g,
RC Gbng3337bs, Gbng3339b, Gbng3340f, Gbng3341as, Gbng3343c, Gbng3347f,
RC Gbng3349a, Gbng3363f, and Gbng3365b;
RA Simard F., Licht M., Lehmann T.;
RT "Molecular evidence for selection in the Defensin gene within the Anopheles
RT gambiae complex.";
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST;
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [5]
RP STRUCTURE BY NMR OF 63-102, AND DISULFIDE BONDS.
RX PubMed=18214975; DOI=10.1002/prot.21912;
RA Landon C., Barbault F., Legrain M., Guenneugues M., Vovelle F.;
RT "Rational design of peptides active against the gram positive bacteria
RT Staphylococcus aureus.";
RL Proteins 72:229-239(2008).
CC -!- FUNCTION: Responsible for the activity of immune hemolymph against
CC Gram-positive bacteria. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Secreted.
CC -!- DEVELOPMENTAL STAGE: In larvae, transcript levels are elevated 4 hours
CC after bacterial infection, increase up to 12 hours post-inoculation,
CC decline by 24 hours, and return to background levels by 30 hours. In
CC adult females, expression is evident by 18 hours after infection.
CC Constitutively expressed in both early and late pupae.
CC {ECO:0000269|PubMed:8799739}.
CC -!- INDUCTION: By bacterial infection. {ECO:0000269|PubMed:8799739}.
CC -!- SIMILARITY: Belongs to the invertebrate defensin family. Type 1
CC subfamily. {ECO:0000255|PROSITE-ProRule:PRU00710}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA63775.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X93562; CAA63775.1; ALT_INIT; mRNA.
DR EMBL; AF063402; AAC18575.1; -; Genomic_DNA.
DR EMBL; DQ212001; ABB00946.1; -; Genomic_DNA.
DR EMBL; DQ212002; ABB00947.1; -; Genomic_DNA.
DR EMBL; DQ212003; ABB00948.1; -; Genomic_DNA.
DR EMBL; DQ212004; ABB00949.1; -; Genomic_DNA.
DR EMBL; DQ212005; ABB00950.1; -; Genomic_DNA.
DR EMBL; DQ212006; ABB00951.1; -; Genomic_DNA.
DR EMBL; DQ212007; ABB00952.1; -; Genomic_DNA.
DR EMBL; DQ212008; ABB00953.1; -; Genomic_DNA.
DR EMBL; DQ212009; ABB00954.1; -; Genomic_DNA.
DR EMBL; DQ212010; ABB00955.1; -; Genomic_DNA.
DR EMBL; DQ212011; ABB00956.1; -; Genomic_DNA.
DR EMBL; DQ212012; ABB00957.1; -; Genomic_DNA.
DR EMBL; DQ212013; ABB00958.1; -; Genomic_DNA.
DR EMBL; DQ212014; ABB00959.1; -; Genomic_DNA.
DR EMBL; DQ212015; ABB00960.1; -; Genomic_DNA.
DR EMBL; DQ212016; ABB00961.1; -; Genomic_DNA.
DR EMBL; DQ212017; ABB00962.1; -; Genomic_DNA.
DR EMBL; DQ212018; ABB00963.1; -; Genomic_DNA.
DR EMBL; DQ212019; ABB00964.1; -; Genomic_DNA.
DR EMBL; DQ212020; ABB00965.1; -; Genomic_DNA.
DR EMBL; DQ212021; ABB00966.1; -; Genomic_DNA.
DR EMBL; DQ212022; ABB00967.1; -; Genomic_DNA.
DR EMBL; DQ212023; ABB00968.1; -; Genomic_DNA.
DR EMBL; DQ212024; ABB00969.1; -; Genomic_DNA.
DR EMBL; DQ212025; ABB00970.1; -; Genomic_DNA.
DR EMBL; DQ212026; ABB00971.1; -; Genomic_DNA.
DR EMBL; DQ212027; ABB00972.1; -; Genomic_DNA.
DR EMBL; DQ212028; ABB00973.1; -; Genomic_DNA.
DR EMBL; DQ212029; ABB00974.1; -; Genomic_DNA.
DR EMBL; DQ212030; ABB00975.1; -; Genomic_DNA.
DR EMBL; DQ212031; ABB00976.1; -; Genomic_DNA.
DR EMBL; DQ212032; ABB00977.1; -; Genomic_DNA.
DR EMBL; DQ212033; ABB00978.1; -; Genomic_DNA.
DR EMBL; DQ212034; ABB00979.1; -; Genomic_DNA.
DR EMBL; DQ212035; ABB00980.1; -; Genomic_DNA.
DR EMBL; DQ212036; ABB00981.1; -; Genomic_DNA.
DR EMBL; DQ212037; ABB00982.1; -; Genomic_DNA.
DR EMBL; DQ212038; ABB00983.1; -; Genomic_DNA.
DR EMBL; DQ212039; ABB00984.1; -; Genomic_DNA.
DR EMBL; DQ212040; ABB00985.1; -; Genomic_DNA.
DR EMBL; DQ212041; ABB00986.1; -; Genomic_DNA.
DR EMBL; DQ212042; ABB00987.1; -; Genomic_DNA.
DR EMBL; AAAB01008816; EAA05234.4; -; Genomic_DNA.
DR PDB; 2E3E; NMR; -; A=63-102.
DR PDB; 2E3F; NMR; -; A=63-101.
DR PDB; 2E3G; NMR; -; A=63-102.
DR PDB; 2NY8; NMR; -; X=63-102.
DR PDB; 2NY9; NMR; -; X=63-102.
DR PDB; 2NZ3; NMR; -; A=63-102.
DR PDBsum; 2E3E; -.
DR PDBsum; 2E3F; -.
DR PDBsum; 2E3G; -.
DR PDBsum; 2NY8; -.
DR PDBsum; 2NY9; -.
DR PDBsum; 2NZ3; -.
DR AlphaFoldDB; Q17027; -.
DR SMR; Q17027; -.
DR STRING; 7165.AGAP011294-PA; -.
DR PaxDb; Q17027; -.
DR VEuPathDB; VectorBase:AGAP011294; -.
DR eggNOG; ENOG502SD3P; Eukaryota.
DR HOGENOM; CLU_174272_0_0_1; -.
DR InParanoid; Q17027; -.
DR OMA; PEETHHA; -.
DR OrthoDB; 1600872at2759; -.
DR PhylomeDB; Q17027; -.
DR EvolutionaryTrace; Q17027; -.
DR Proteomes; UP000007062; Chromosome 3L.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0042742; P:defense response to bacterium; IBA:GO_Central.
DR GO; GO:0050830; P:defense response to Gram-positive bacterium; IEA:UniProt.
DR GO; GO:0006959; P:humoral immune response; IBA:GO_Central.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR Gene3D; 3.30.30.10; -; 1.
DR InterPro; IPR001542; Defensin_invertebrate/fungal.
DR InterPro; IPR036574; Scorpion_toxin-like_sf.
DR Pfam; PF01097; Defensin_2; 1.
DR SUPFAM; SSF57095; SSF57095; 1.
DR PROSITE; PS51378; INVERT_DEFENSINS; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Antibiotic; Antimicrobial;
KW Cleavage on pair of basic residues; Defensin; Direct protein sequencing;
KW Disulfide bond; Immunity; Innate immunity; Reference proteome; Secreted;
KW Signal.
FT SIGNAL 1..25
FT /evidence="ECO:0000255"
FT PROPEP 26..62
FT /evidence="ECO:0000255"
FT /id="PRO_0000006738"
FT CHAIN 63..102
FT /note="Defensin"
FT /id="PRO_0000006739"
FT DISULFID 65..92
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00710,
FT ECO:0000269|PubMed:18214975"
FT DISULFID 78..98
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00710,
FT ECO:0000269|PubMed:18214975"
FT DISULFID 82..100
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00710,
FT ECO:0000269|PubMed:18214975"
FT VARIANT 9
FT /note="T -> A (in strain: Gbab1064e, Gbab1068c, Gbab5010bs,
FT Gbab5011f, Gbab5024h, Gbbkj16034a, Gbjo4i, Gbjo8h, Gbjo12g,
FT Gbjo15as, Gbng3334as and Gbng3365b)"
FT VARIANT 15
FT /note="A -> S (in strain: Gbab1068c, Gbab5011f and
FT Gbab5024h)"
FT VARIANT 18
FT /note="L -> F (in strain: Gbbkj16015a, Gbbkj16024b,
FT Gbbkj16025b, Gbbkj16032b and Gbng3347f)"
FT VARIANT 34
FT /note="S -> G (in strain: Gbab1068c, Gbab5010bs, Gbab5011f,
FT Gbab5024h, Gbbkj16015a, Gbbkj16024b, Gbbkj16025b,
FT Gbbkj16032b, Gbbkj16034a, Gbjo4i, Gbjo6e, Gbjo7h, Gbjo8h,
FT Gbjo9a, Gbjo12g, Gbjo14c, Gbjo15as, Gbjo22d, Gbng3336g,
FT Gbng3339b, Gbng3340f and Gbng3347f)"
FT VARIANT 37..38
FT /note="AN -> VS (in strain: Gbng3343c)"
FT VARIANT 37
FT /note="A -> G (in strain: Gbng3337bs)"
FT VARIANT 37
FT /note="A -> V (in strain: Gbab1068c, Gbab5011f, Gbab5024h
FT and Gbng3336g)"
FT VARIANT 51
FT /note="H -> L (in strain: Gbjo6e)"
FT VARIANT 69
FT /note="S -> R (in strain: Gbng3365b)"
FT VARIANT 75..76
FT /note="SS -> NN (in strain: KWA)"
FT VARIANT 95
FT /note="K -> R (in strain: Gbjo3a)"
FT STRAND 68..71
FT /evidence="ECO:0007829|PDB:2E3F"
FT HELIX 74..76
FT /evidence="ECO:0007829|PDB:2E3G"
FT STRAND 77..79
FT /evidence="ECO:0007829|PDB:2E3E"
FT HELIX 80..85
FT /evidence="ECO:0007829|PDB:2E3E"
FT STRAND 90..93
FT /evidence="ECO:0007829|PDB:2E3E"
FT STRAND 94..96
FT /evidence="ECO:0007829|PDB:2E3F"
FT STRAND 97..100
FT /evidence="ECO:0007829|PDB:2E3E"
SQ SEQUENCE 102 AA; 10627 MW; 628834560AEDCD0C CRC64;
MKCATIVCTI AVVLAATLLN GSVQAAPQEE AALSGGANLN TLLDELPEET HHAALENYRA
KRATCDLASG FGVGSSLCAA HCIARRYRGG YCNSKAVCVC RN
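
The FT coordinates in this entry are 1-based and inclusive, so they can be checked directly against the SQ block. The following is a minimal sketch, not part of the UniProt record: the helper name `region` and the variable names are illustrative assumptions, and the sequence string is copied from the SQ lines above with the spacing removed. It slices the precursor into the annotated SIGNAL, PROPEP, and CHAIN regions and looks up the residues at the annotated DISULFID positions.

```python
# Sequence copied from the SQ block of this entry (102 residues).
SEQ = (
    "MKCATIVCTIAVVLAATLLNGSVQAAPQEEAALSGGANLNTLLDELPEETHHAALENYRA"
    "KRATCDLASGFGVGSSLCAAHCIARRYRGGYCNSKAVCVCRN"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as used in FT lines."""
    return seq[start - 1:end]


# Coordinates taken from the FT SIGNAL, PROPEP, and CHAIN lines.
signal = region(SEQ, 1, 25)
propep = region(SEQ, 26, 62)
chain = region(SEQ, 63, 102)

# Residue pairs taken from the three FT DISULFID lines.
DISULFIDES = [(65, 92), (78, 98), (82, 100)]
bonded = [(SEQ[a - 1], SEQ[b - 1]) for a, b in DISULFIDES]

if __name__ == "__main__":
    print("signal :", signal)
    print("propep :", propep)
    print("chain  :", chain)
    print("bonded :", bonded)
```

In this sketch the three regions tile the full precursor, the propeptide ends in the dibasic pair that matches the "Cleavage on pair of basic residues" keyword, and every annotated disulfide position holds a cysteine.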