CAPSD_CDDV1
ID CAPSD_CDDV1 Reviewed; 395 AA.
AC P0DOK2;
DT 07-NOV-2018, integrated into UniProtKB/Swiss-Prot.
DT 07-NOV-2018, sequence version 1.
DT 29-SEP-2021, entry version 8.
DE RecName: Full=Capsid protein;
OS Chaetoceros diatodnavirus 1 (Chaetoceros setoense DNA virus).
OC Viruses; Monodnaviria; Shotokuvirae; Cressdnaviricota; Arfiviricetes;
OC Baphyvirales; Bacilladnaviridae; Diatodnavirus.
OX NCBI_TaxID=2169869;
OH NCBI_TaxID=1290580; Chaetoceros setoense.
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24275766; DOI=10.1038/srep03337;
RA Tomaru Y., Toyoda K., Suzuki H., Nagumo T., Kimura K., Takao Y.;
RT "New single-stranded DNA virus with a unique genomic structure that infects
RT marine diatom Chaetoceros setoensis.";
RL Sci. Rep. 3:3337-3337(2013).
CC -!- FUNCTION: Self-assembles to form the virion icosahedral capsid.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Host nucleus {ECO:0000305}. Virion {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB781089; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell.
PE 4: Predicted;
KW Host nucleus; Virion.
FT CHAIN 1..395
FT /note="Capsid protein"
FT /id="PRO_0000445650"
FT REGION 1..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 2..9
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00768"
FT COMPBIAS 1..39
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 395 AA; 44042 MW; CF0E3CFB620DD1FF CRC64;
MARKYAKRSK SRPRTARRSP KSRSRPRSRA PRRKAPSRPR IQRVNPVRRP MNSTAAQSLA
IYRNPFSHSP GQPKIPDGKA IMSIGSKVQV SAQLLNKASG DDILHVFLYP GLTQGMVVFG
DSKEQGTRGF TAYGYNDHMT YDASSVYNAG TGADGNIESN DNINEWRLVS QGLKLSLLNT
DEENDGWFEC VRYKDALRAN EFAFYSGDNL EQTTATVFGP DITFGSTLLT KNLVNSPTYV
SGALEDIDKY EFKLQAQSEQ HDFKRIPDRW YTEHGVDTVT VSGVNDYVTL QADTAQAHSI
HNSLVDDSFD AVYIRIHCRT NSGAGATTGS KLLAHLVSNQ ELVYDEDQNE HKFMTQAAMA
KAEFMKANEM ARKSQVGADM IGNVRSGVRS QRRPR