SCA4_RICPR
ID SCA4_RICPR Reviewed; 1022 AA.
AC Q9ZD49; Q9AJ36; Q9ZD48;
DT 11-JUL-2001, integrated into UniProtKB/Swiss-Prot.
DT 11-JUL-2001, sequence version 2.
DT 25-MAY-2022, entry version 89.
DE RecName: Full=Antigenic heat-stable 120 kDa protein;
DE AltName: Full=120 kDa antigen;
DE AltName: Full=Protein PS 120;
DE Short=PS120;
GN Name=sca4; OrderedLocusNames=RP498/RP499;
OS Rickettsia prowazekii (strain Madrid E).
OC Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsiales;
OC Rickettsiaceae; Rickettsieae; Rickettsia; typhus group.
OX NCBI_TaxID=272947;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Madrid E;
RX PubMed=9823893; DOI=10.1038/24094;
RA Andersson S.G.E., Zomorodipour A., Andersson J.O., Sicheritz-Ponten T.,
RA Alsmark U.C.M., Podowski R.M., Naeslund A.K., Eriksson A.-S., Winkler H.H.,
RA Kurland C.G.;
RT "The genome sequence of Rickettsia prowazekii and the origin of
RT mitochondria.";
RL Nature 396:133-140(1998).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 11-1016.
RX PubMed=11491333; DOI=10.1099/00207713-51-4-1353;
RA Sekeyova Z., Roux V., Raoult D.;
RT "Phylogeny of Rickettsia spp. inferred by comparing sequences of 'gene D',
RT which encodes an intracytoplasmic protein.";
RL Int. J. Syst. Evol. Microbiol. 51:1353-1360(2001).
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA14950.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ235272; CAA14951.1; ALT_FRAME; Genomic_DNA.
DR EMBL; AJ235272; CAA14950.1; ALT_FRAME; Genomic_DNA.
DR EMBL; AF200340; AAK31305.1; -; Genomic_DNA.
DR PIR; D71653; D71653.
DR PIR; E71653; E71653.
DR RefSeq; NP_220874.1; NC_000963.1.
DR RefSeq; NP_220875.1; NC_000963.1.
DR AlphaFoldDB; Q9ZD49; -.
DR SMR; Q9ZD49; -.
DR STRING; 272947.RP498; -.
DR EnsemblBacteria; CAA14950; CAA14950; CAA14950.
DR EnsemblBacteria; CAA14951; CAA14951; CAA14951.
DR KEGG; rpr:RP498; -.
DR KEGG; rpr:RP499; -.
DR PATRIC; fig|272947.5.peg.507; -.
DR eggNOG; COG5183; Bacteria.
DR HOGENOM; CLU_009206_0_0_5; -.
DR Proteomes; UP000002480; Chromosome.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR InterPro; IPR020954; Rickettsia_antigen_120kDa.
DR Pfam; PF12574; 120_Rick_ant; 1.
PE 4: Predicted;
KW Cytoplasm; Reference proteome.
FT CHAIN 1..1022
FT /note="Antigenic heat-stable 120 kDa protein"
FT /id="PRO_0000097615"
FT REGION 1..33
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 11..15
FT /note="EFDPL -> RPGLV (in Ref. 2; AAK31305)"
FT /evidence="ECO:0000305"
FT CONFLICT 365
FT /note="H -> Y (in Ref. 2; AAK31305)"
FT /evidence="ECO:0000305"
FT CONFLICT 413
FT /note="Missing (in Ref. 2; AAK31305)"
FT /evidence="ECO:0000305"
FT CONFLICT 957
FT /note="G -> R (in Ref. 2; AAK31305)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1022 AA; 114410 MW; 03230E3A663A9622 CRC64;
MSKNGNQDIS EFDPLNREFT EAEKQQQMQQ EQEFFSQTIL DIADDGFMVA SSSQATPSIS
FLSNNRPHGD HKSDPITEAI RKEILEKQRD ILREYFVNTN PELAEQIAKE EDDRKFRAFL
SNQDNYALIN KAFEDTKTKK NLEKAEIVGY KNVLSTYSVA NGYQGGFQPV QWENQVSASD
LRSTVVKNDE GEELCTLNET TVKTKDLIVA KQDGTQVQIN SYREINFPIK LDKANGSMHL
SMVALKADGT KPAKDKAVYF TAHYEEGPNG KPQLKEISSP QPLKFVGTGD DAVAYIEHGG
EIYTLAVTRG KYKEMMKEVA LNHGQSVALS QTIAEDLTHV QGPSHETHKP IIIPNQELES
SIEQHTSQQV PPITTFNKSL QPKISQIHQL QPQQAQSSGI PNPVLNAANA LSTSMQDLLN
NINSYLTKNQ DINKQSDLIK EAAIAILNNK KSDFAEKQYN IIDLAKNIFS NKDIIADAKV
NVVNTLLETI QNDQNTLDIK KSKILEDTVA ITLNSENIEL KQKQQILEKV VDIGLSIKDD
ISRVVAVDSI MDTVIKSNIA NEDKEKIFIT VFDQINSYEF SNVAKQKLLD SILKKTAETQ
VLSPEQQQLM NQNLDNITTE HTKRDTIEKV NNILLEPLSN TALKTTNIQV MTSNVLDSPV
QIEMKSKLIQ VVTKTVAESA LVEPKDKTEI VKGIGKTIVT HSDTSLPLHD KVVIMGSVAK
GIVESKNDLL DRELIIAGLV DGIYEAKGDN AVVHAISSMI ANSNINQSEK EALKRSQDVV
SEKVLDKEIQ NLDRELKAQN INESKLHDDI YNKTQDVANA LKNVITTVLD DNSGQRGVSE
EAPKKVSSLL NDISKRTIEK INNLRAMLSQ DGNLKTFEEK KDEATKKVDE LVKAFDNKSS
TEEQQNFIKS NLIDNKTLSR EIRLQIIDNL LKAQAQKRAE TIENLSAKTE DVRVISGKSE
LKPISQDEPY IQKAKMVVER DRVDIKDNIK IMSALINARD SIQSENFNKS IHIKKESSFP
QR