CAPSD_SOCMV
ID CAPSD_SOCMV Reviewed; 441 AA.
AC P15627;
DT 01-APR-1990, integrated into UniProtKB/Swiss-Prot.
DT 29-AUG-2001, sequence version 2.
DT 02-JUN-2021, entry version 93.
DE RecName: Full=Capsid protein;
DE Short=CP;
DE AltName: Full=Coat protein;
GN ORFNames=ORF IV;
OS Soybean chlorotic mottle virus.
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Caulimoviridae; Soymovirus.
OX NCBI_TaxID=10651;
OH NCBI_TaxID=3847; Glycine max (Soybean) (Glycine hispida).
OH NCBI_TaxID=35936; Lablab purpureus (Hyacinth bean) (Dolichos lablab).
OH NCBI_TaxID=3885; Phaseolus vulgaris (Kidney bean) (French bean).
OH NCBI_TaxID=3917; Vigna unguiculata (Cowpea).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=2602148; DOI=10.1093/nar/17.23.9993;
RA Hasegawa A., Verver J., Shimada A., Saito M., Goldbach R., van Kammen A.,
RA Miki K., Kameya-Iwaki M., Hibi T.;
RT "The complete sequence of soybean chlorotic mottle virus DNA and the
RT identification of a novel promoter.";
RL Nucleic Acids Res. 17:9993-10013(1989).
RN [2]
RP SEQUENCE REVISION.
RA Hibi T.;
RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Self assembles to form an icosahedral capsid, about 50 nm in
CC diameter, nm, composed of 420 subunits of the viral capsid protein. The
CC capsid encapsulates the genomic dsDNA. Following virus entry into host
CC cell, provides nuclear import of the viral genome. Virus particles do
CC not enter the nucleus, but dock at the nuclear membrane through the
CC interaction with host importins (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: Interacts (via nuclear localization signal) with host importin
CC alpha. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000305}. Host nucleus {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the caulimoviridae capsid protein family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X15828; CAC16944.1; -; Genomic_DNA.
DR PIR; JS0374; JS0374.
DR RefSeq; NP_068728.1; NC_001739.2.
DR SMR; P15627; -.
DR GeneID; 912259; -.
DR KEGG; vg:912259; -.
DR Proteomes; UP000001065; Genome.
DR GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0039620; C:T=7 icosahedral viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0075732; P:viral penetration into host nucleus; IEA:UniProtKB-KW.
DR InterPro; IPR001988; Caulimo_coat.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PRINTS; PR00221; CAULIMOCOAT.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; SSF57756; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 3: Inferred from homology;
KW Capsid protein; Host nucleus; Metal-binding; Reference proteome;
KW T=7 icosahedral capsid protein; Viral penetration into host nucleus;
KW Virion; Virus entry into host cell; Zinc; Zinc-finger.
FT CHAIN 1..441
FT /note="Capsid protein"
FT /id="PRO_0000222036"
FT ZN_FING 381..398
FT /note="CCHC-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00047"
FT REGION 26..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 77..79
FT /note="Nuclear localization signal"
FT /evidence="ECO:0000250"
FT COMPBIAS 41..63
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 441 AA; 52102 MW; 07244CD2181CFAFF CRC64;
MEETQQELTQ QLKELETLMA AINLDDSKKK QPIYQNSSES EESETENKNF IYDFSSEEDF
EEPVKVKIEE EAETSNKRKF DKNPEFTRFK YQKIPKEYVP AHQTTSTIGV LDIDCVANTE
KIIKEWFNHH SILITINEEL KNLSSLDTFY YLVYKTRGIA HAYLSNLPSE VLSRIPADRK
QVDDWVYNLL LREFVGRLER PESEEAFSQN NYYKLINLEI CNMCYLENFL CEFQSRYYGI
NPIDRENLKV DLLLYAKLPE YVRTQVEAYF NASITSNKLD NTLGGRITAL KLWQTEQCNQ
KLAKRQASVG LCCSKIEDKI GKYGCRKSNP RAKKPKKKFR KIKKYPKKNF WKWNNQRKKK
TFRKKRPFRK QQTCPTGKKK CQCWLCHEEG HYANECPKKD NKKAQTLKLI FDLGFEPVES
DIETDEELFE LTSEDSSEDE Y