NEP4_NEMVE
ID NEP4_NEMVE Reviewed; 370 AA.
AC A7RNZ0;
DT 29-SEP-2021, integrated into UniProtKB/Swiss-Prot.
DT 02-OCT-2007, sequence version 1.
DT 03-AUG-2022, entry version 47.
DE RecName: Full=Nematocyst expressed protein 4 {ECO:0000303|PubMed:23151943};
DE Short=NEP-4 {ECO:0000303|PubMed:23151943};
DE Short=NEP4 {ECO:0000305};
DE Flags: Precursor;
GN ORFNames=v1g239676 {ECO:0000312|EMBL:EDO46764.1};
OS Nematostella vectensis (Starlet sea anemone).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Actiniaria;
OC Edwardsiidae; Nematostella.
OX NCBI_TaxID=45351;
RN [1] {ECO:0000312|EMBL:EDO46764.1, ECO:0000312|Proteomes:UP000001593}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CH2 X CH6 {ECO:0000312|Proteomes:UP000001593};
RX PubMed=17615350; DOI=10.1126/science.1139158;
RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., Salamov A.,
RA Terry A., Shapiro H., Lindquist E., Kapitonov V.V., Jurka J.,
RA Genikhovich G., Grigoriev I.V., Lucas S.M., Steele R.E., Finnerty J.R.,
RA Technau U., Martindale M.Q., Rokhsar D.S.;
RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire and
RT genomic organization.";
RL Science 317:86-94(2007).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], IDENTIFICATION BY MASS SPECTROMETRY,
RP SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY.
RX PubMed=23151943; DOI=10.1007/s10126-012-9491-y;
RA Moran Y., Praher D., Schlesinger A., Ayalon A., Tal Y., Technau U.;
RT "Analysis of soluble protein contents from the nematocysts of a model sea
RT anemone sheds light on venom evolution.";
RL Mar. Biotechnol. 15:329-339(2013).
RN [3]
RP DEVELOPMENTAL STAGE, TISSUE SPECIFICITY, AND SUBCELLULAR LOCATION.
RX PubMed=29424690; DOI=10.7554/elife.35014;
RA Columbus-Shenkar Y.Y., Sachkova M.Y., Macrander J., Fridrich A.,
RA Modepalli V., Reitzel A.M., Sunagar K., Moran Y.;
RT "Dynamics of venom composition across a complex life cycle.";
RL Elife 7:0-0(2018).
CC -!- SUBCELLULAR LOCATION: Nematocyst {ECO:0000269|PubMed:23151943}.
CC Secreted {ECO:0000269|PubMed:23151943}.
CC -!- TISSUE SPECIFICITY: Nematocytes (PubMed:23151943, PubMed:29424690). In
CC late planulae, transcripts are found throughout the ectoderm in
CC nematocytes, with a high concentration of expressing cells at the oral
CC pole (PubMed:29424690). In primary polyps, it is expressed in nematocytes
CC in the body wall and physa ectoderm and in the upper and lower pharynx
CC (PubMed:29424690). {ECO:0000269|PubMed:23151943,
CC ECO:0000269|PubMed:29424690}.
CC -!- DEVELOPMENTAL STAGE: Transcripts are expressed in gastrulae, early and
CC late planulae, metamorphosing planulae, and primary polyps.
CC {ECO:0000269|PubMed:29424690}.
CC -!- SIMILARITY: Belongs to the NEP3 family. {ECO:0000305}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000255|PROSITE-ProRule:PRU01005}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS469524; EDO46764.1; -; Genomic_DNA.
DR RefSeq; XP_001638827.1; XM_001638777.1.
DR SMR; A7RNZ0; -.
DR EnsemblMetazoa; EDO46764; EDO46764; NEMVEDRAFT_v1g239676.
DR GeneID; 5518902; -.
DR KEGG; nve:5518902; -.
DR HOGENOM; CLU_748635_0_0_1; -.
DR OMA; FMSTECT; -.
DR OrthoDB; 1905541at2759; -.
DR Proteomes; UP000001593; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0042151; C:nematocyst; IEA:UniProtKB-SubCell.
DR GO; GO:0090729; F:toxin activity; IEA:UniProtKB-KW.
DR InterPro; IPR003582; ShKT_dom.
DR Pfam; PF01549; ShK; 3.
DR SMART; SM00254; ShKT; 2.
DR PROSITE; PS51670; SHKT; 2.
PE 1: Evidence at protein level;
KW Disulfide bond; Nematocyst; Reference proteome; Secreted; Signal; Toxin.
FT SIGNAL 1..19
FT /evidence="ECO:0000255"
FT CHAIN 20..370
FT /note="Nematocyst expressed protein 4"
FT /evidence="ECO:0000305"
FT /id="PRO_5002713691"
FT DOMAIN 70..102
FT /note="ShKT 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DOMAIN 113..149
FT /note="ShKT 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DOMAIN 155..190
FT /note="ShKT 3"
FT /evidence="ECO:0000305|PubMed:29424690"
FT REGION 34..55
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 306..370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 306..340
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..370
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 70..102
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 77..95
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 84..99
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 113..149
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 121..142
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 131..146
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01005"
FT DISULFID 164..183
FT /evidence="ECO:0000305"
FT DISULFID 173..187
FT /evidence="ECO:0000305"
SQ SEQUENCE 370 AA; 41553 MW; 316E790BAF80DBEB CRC64;
MAWTLVLLVL LGTSSCLDAK ELHKKIDISA LLESGSGSGE EGSSGSGSAP EPVRDQDVDW
KQFKIKPEQC LDKGENCTGD PEQCQENWQE MVVQCPFSCR FCSQRSIDDV TECTDARGAA
CKDWADNRND CLRFPQFMST ECTKSCKLCG KETNGKKFDK DVRCIEWAKN GYCNEGELYK
EKCPHNCEVH KSIKKEPKGP YPYPTESLYP YQYWQLYPAP GPAPYPPYPY PTAAPYPYQY
SPYPYTPYPP PPYPNPYPQP PYPPPPPPYP NPYPQPPYPP PPAPCSGPGP CPYPGPPPPP
YPAPTPYPPP PPPYPEQVPP PPPPPPPPPP PPPYPYPYPY PDESENTKHK SKKHAKHHEK
HHKENHSKKS