CAPS3_MIMIV
ID CAPS3_MIMIV Reviewed; 2156 AA.
AC Q5UQN7;
DT 13-SEP-2005, integrated into UniProtKB/Swiss-Prot.
DT 07-DEC-2004, sequence version 1.
DT 23-FEB-2022, entry version 62.
DE RecName: Full=Probable capsid protein 3;
GN OrderedLocusNames=MIMI_R440;
OS Acanthamoeba polyphaga mimivirus (APMV).
OC Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota; Megaviricetes;
OC Imitervirales; Mimiviridae; Mimivirus.
OX NCBI_TaxID=212035;
OH NCBI_TaxID=5757; Acanthamoeba polyphaga (Amoeba).
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Rowbotham-Bradford;
RX PubMed=15486256; DOI=10.1126/science.1101485;
RA Raoult D., Audic S., Robert C., Abergel C., Renesto P., Ogata H.,
RA La Scola B., Susan M., Claverie J.-M.;
RT "The 1.2-megabase genome sequence of Mimivirus.";
RL Science 306:1344-1350(2004).
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the NCLDV major capsid protein family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY653733; AAV50707.1; -; Genomic_DNA.
DR RefSeq; YP_003986947.1; NC_014649.1.
DR GeneID; 9925064; -.
DR KEGG; vg:9925064; -.
DR Proteomes; UP000001134; Genome.
DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR InterPro; IPR031654; Capsid_N.
DR InterPro; IPR007542; MCP_C.
DR InterPro; IPR016112; VP_dsDNA_II.
DR Pfam; PF16903; Capsid_N; 2.
DR Pfam; PF04451; Capsid_NCLDV; 1.
DR SUPFAM; SSF49749; SSF49749; 3.
PE 3: Inferred from homology;
KW Capsid protein; Reference proteome; Virion.
FT CHAIN 1..2156
FT /note="Probable capsid protein 3"
FT /id="PRO_0000071170"
FT REGION 1319..1345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1325..1345
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2156 AA; 253104 MW; CF2029A36EE4FCC6 CRC64;
MAGGILQLVC NATAAENMWI NNDPEITFFK KLFRRHTPFA SEYIPLYFKS NLDFGQSSSA
TILSNGDLVH KIFFVCDIPS ISAEFINSKN EDVIKTIKNL NSQYIDNDFM RKLNNCINGT
IIEYDNICNI IDENIQYYDY YKPIIISIIN TLSKYQNKDV LINYQQKIFG KLLNKSNDNL
SQNESFQNDS FKLEILDSFF STNVKYNLVF ELIKLIDLEQ QFYSQKIPII PIQNTGKYLA
NNFVNSVLPA VNNLGNNFGK DFYHRLNTHN AIIGAIESLS ISVPTVIIKP FVLNDCYDIY
RPNNQNVYFD STYYSSIIDP NFKLNFMLKS NQVEKYIQSN EFIPIDIIGT NDFIYPEIDN
NYNLLFNTQA NIMFDNIQSF TDLLFEHYRN LLFDSTENLF FNKSPTPSNI YSYILPTKAY
DTKSNNLSDE KINDGPNVFN LNIWFFYFFK YLDQFDVDKF IVYVKDNVHK NLSNACYSFL
RNMMVLLKIN VDYYMHEISY YMNDMCSKSP SHDLSDTLKN YVPNISESTN QFNKQSGSNL
LMVTLIFHRN IIPSIEDMFN FIYEFIENIN CNDIENYLEV EIGFVDRETL TELKSTAKQL
YQGFYNYFMN KYNKMHFEAE YHCEEINPLV QEYVNHFLTG TSIHDKFYQK RNLNDIIHQL
EFYFISETIH IRELQKFYYN IFSNNNLIKN YVDDYGCELI DYFNSVLKTD LEQKYYKVNN
IHRYNGESYK MTPYNTRFYG LNDNLPLVYP FDHPPVNPYG VNPDYYSHYR SVTDYVTIPT
SNNQILCTEI PINLNNTEPI NNRYNDSEYQ RFEIDYFRLK HEIFHGTIDN PISDLYKIFI
SEYDFNVLKL YHLLQQLLDQ SNTKPMLSDD LLYKLYITTI YLMNYINEDA GNDTVMSKSI
MHQFLEMIED YLNNQTENIS ENTINEMVEL SDILLKNNQH NYTTDDLIKC NQYINLIKNT
DKYDNGIIDR LQIIKYNFLS QYFIYSLESQ NISMLKNLDN KFFFKNTGQI INEILQLVDN
NPVLDDLSNM EYLYSGEFET EHNHLLCTDS SNLSRYFMNT FNIFTAHELN PIKKLSVRDI
YDIIDTTFMS VREIYNICMN NGSINNILVK LDKYQDKLLN KLVLTNKIGQ YFYDFLQNKP
IEQQNKKDLI KIIDSFGIDL ENYMCNKIIP LYNQLVLNNN KNSTIIILAI ISKDLDSFFL
PNSSASSFKQ VIIEDLTNGN KEDPLYLYLK LIGNEYYSYL KFFLDYCCEN DLDYDSITNP
LESLDITVLK NDNNTNRILS VKDCLNYFMD YLWDHSYPSM KQFGFNEKIS DVINKYSSNK
SNKSNKSNES DKSSESDKSS ESSNHDDKIS KIINLLSGSF DTNGFVLNRY LEEFTKNDEI
SEIQNLSEKS NIIYGLNLYL GKLLILLEQQ YNSGLEIKNK ISNILYRNKK ACTAWIKKLA
HYLVDEISIS TNSETIDSHK SDWFEIYSQT NLSESRIDGY YKMIGNIDEL TIFSNNDKSS
RTIILPLCFY FNRNIALSLP LNASINLTYT INIKLKELSD VLFKEEFSEY IDNTGSIVKP
KLSNVHLMCE YIYLSNEERQ AFVTKHLQYL VDEFQYSTTN ITDNNLTPVY KIGTNKISCV
VKKNGKKTTE TYFNKGIYVD QNNVNVDDNE LILRKDLEVK PSKNKSGISS TMIQHTIIDT
DPKIHFKRVS IENHFSNPTK MMAILIRPDL HIITDKRDYS SDYFFGEKQW SNYGVHSWYD
FSQVRKIRED YYTKFRQKIN NLEDPVYGFL NIINHTIENM KIPEIPDKLI KYCEQIKSMY
IKHSDEIFDQ SNIQKIRDNL HILKINFDIT DKQLLLQMIY DICYNMDVSL PTNSIIIDEF
RRLDSNFTLD DDNLYVNYEF FKDCICSILK LSILQGRSDE LYQLIQQIYD KHNENQINLL
INKLSEIIPI SNHTYSFTNI MYKAKNLCLL HKCDNHIVNL INQIQDKIDF MNSSELSCVN
NMKTVSSTYK DIINQIIVNT NYLDHIPSYI IDTISNKMLE TLNIIIDKQY IKIIPYNKLI
KPNPKINPLI SGHLTFNNIS TMPENSDHTF WTACQTYQHF KHDTETGINL YSWAIKPFNE
QNSGSANLSR IDRFMSILNI HPIISSKNSA SIITMTLSIN IINYLSGLCG KAWEKY