ID SPHR_AMEPV Reviewed; 1003 AA.
AC P29815;
DT 01-APR-1993, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 2.
DT 23-FEB-2022, entry version 62.
DE RecName: Full=Spheroidin;
GN OrderedLocusNames=AMV187; ORFNames=G5;
OS Amsacta moorei entomopoxvirus (AmEPV).
OC Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota; Pokkesviricetes;
OC Chitovirales; Poxviridae; Entomopoxvirinae; Betaentomopoxvirus.
OX NCBI_TaxID=28321;
OH NCBI_TaxID=340055; Amsacta.
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], PARTIAL PROTEIN SEQUENCE, AND
RP ACETYLATION AT SER-2.
RX PubMed=1545219; DOI=10.1099/0022-1317-73-3-559;
RA Banville M., Dumas F., Trifiro S., Arif B., Richardson C.;
RT "The predicted amino acid sequence of the spheroidin protein from Amsacta
RT moorei entomopoxvirus: lack of homology between major occlusion body
RT proteins of different poxviruses.";
RL J. Gen. Virol. 73:559-566(1992).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND PARTIAL PROTEIN SEQUENCE.
RX PubMed=1942245; DOI=10.1128/jvi.65.12.6516-6527.1991;
RA Hall R.L., Moyer R.W.;
RT "Identification, cloning, and sequencing of a fragment of Amsacta moorei
RT entomopoxvirus DNA containing the spheroidin gene and three vaccinia virus-
RT related open reading frames.";
RL J. Virol. 65:6516-6527(1991).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=10936094; DOI=10.1006/viro.2000.0449;
RA Bawden A.L., Glassberg K.J., Diggans J., Shaw R., Farmerie W., Moyer R.W.;
RT "Complete genomic sequence of the Amsacta moorei entomopoxvirus: analysis
RT and comparison with other poxviruses.";
RL Virology 274:120-139(2000).
CC -!- FUNCTION: Major component of viral occlusion bodies, the protective
CC complexes in which the virions are embedded in the cytoplasm of their
CC insect hosts.
CC -!- SUBUNIT: May form disulfide-bond-linked aggregates.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M75889; AAA42378.1; -; Genomic_DNA.
DR EMBL; M77182; AAA42383.1; -; Genomic_DNA.
DR EMBL; AF250284; AAG02893.1; -; Genomic_DNA.
DR PIR; JQ1436; PYVZAM.
DR RefSeq; NP_064969.1; NC_002520.1.
DR iPTMnet; P29815; -.
DR GeneID; 1494777; -.
DR KEGG; vg:1494777; -.
DR Proteomes; UP000000872; Genome.
DR GO; GO:0039679; C:viral occlusion body; IEA:UniProtKB-KW.
DR InterPro; IPR008843; Spheroidin.
DR Pfam; PF05541; Spheroidin; 1.
PE 1: Evidence at protein level;
KW Acetylation; Direct protein sequencing; Disulfide bond; Late protein;
KW Reference proteome; Viral occlusion body.
FT INIT_MET 1
FT /note="Removed; by host"
FT CHAIN 2..1003
FT /note="Spheroidin"
FT /id="PRO_0000099759"
FT REGION 953..979
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 953..974
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 2
FT /note="N-acetylserine; by host"
FT /evidence="ECO:0000269|PubMed:1545219"
SQ SEQUENCE 1003 AA; 114870 MW; 6529F7767B058308 CRC64;
MSNVPLATKT IRKLSNRKYE IKIYLKDENT CFERVVDMVV PLYDVCNETS GVTLESCSPN
IEVIELDNTH VRIKVHGDTL KEMCFELLFP CNVNEAQVWK YVSRLLLDNV SHNDVKYKLA
NFRLTLNGKH LKLKEIDQPL FIYFVDDLGN YGLITKENIQ NNNLQVNKDA SFITIFPQYA
YICLGRKVYL NEKVTFDVTT DATNITLDFN KSVNIAVSFL DIYYEVNNNE QKDLLKDLLK
RYGEFEVYNA DTGLIYAKNL SIKNYDTVIQ VERLPVNLKV RAYTKDENGR NLCLMKITSS
TEVDPEYVTS NNALLGTLRV YKKFDKSHLK IVMHNRGSGN VFPLRSLYLE LSNVKGYPVK
ASDTSRLDVG IYKLNKIYVD NDENKIILEE IEAEYRCGRQ VFHERVKLNK HQCKYTPKCP
FQFVVNSPDT TIHLYGISNV CLKPKVPKNL RLWGWILDCD TSRFIKHMAD GSDDLDLDVR
LNRNDICLKQ AIKQHYTNVI ILEYANTYPN CTLSLGNNRF NNVFDMNDNK TISEYTNFTK
SRQDLNNMSC ILGINIGNSV NISSLPGWVT PHEAKILRSG CARVREFCKS FCDLSNKRFY
AMARDLVSLL FMCNYVNIEI NEAVCEYPGY VILFARAIKV INDLLLINGV DNLAGYSISL
PIHYGSTEKT LPNEKYGGVD KKFKYLFLKN KLKDLMRDAD FVQPPLYIST YFRTLLDAPP
TDNYEKYLVD SSVQSQDVLQ GLLNTCNTID TNARVASSVI GYVYEPCGTS EHKIGSEALC
KMAKEASRLG NLGLVNRINE SNYNKCNKYG YRGVYENNKL KTKYYREIFD CNPNNNNELI
SRYGYRIMDL HKIGEIFANY DESESPCERR CHYLEDRGLL YGPEYVHHRY QESCTPNTFG
NNTNCVTRNG EQHVYENSCG DNATCGRRTG YGRRSRDEWN DYRKPHVYDN CADANSSSSD
SCSDSSSSSE SESDSDGCCD TDASLDSDIE NCYQNPSKCD AGC