CGA1_HELPX
ID CGA1_HELPX Reviewed; 1147 AA.
AC P80200;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-1996, sequence version 1.
DT 25-MAY-2022, entry version 67.
DE RecName: Full=Cytotoxicity-associated immunodominant antigen;
DE AltName: Full=120 kDa protein;
GN Name=cagA; Synonyms=cai;
OS Helicobacter pylori (Campylobacter pylori).
OC Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales;
OC Helicobacteraceae; Helicobacter.
OX NCBI_TaxID=210;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=DSM 4867 / CCUG 17874 / NCTC 11638;
RX PubMed=8516329; DOI=10.1073/pnas.90.12.5791;
RA Covacci A., Censini S., Bugnoli M., Petracca R., Burroni D., Macchia G.,
RA Massone A., Papini E., Xiang Z., Figura N., Rappuoli R.;
RT "Molecular characterization of the 128-kDa immunodominant antigen of
RT Helicobacter pylori associated with cytotoxicity and duodenal ulcer.";
RL Proc. Natl. Acad. Sci. U.S.A. 90:5791-5795(1993).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=DSM 4867 / CCUG 17874 / NCTC 11638;
RX PubMed=8962108; DOI=10.1073/pnas.93.25.14648;
RA Censini S., Lange C., Xiang Z., Crabtree J., Ghiara P., Borodovsky M.,
RA Rappuoli R., Covacci A.;
RT "cag, a pathogenicity island of Helicobacter pylori, encodes type I-
RT specific and disease-associated virulence factors.";
RL Proc. Natl. Acad. Sci. U.S.A. 93:14648-14653(1996).
RN [3]
RP PROTEIN SEQUENCE OF 181-190; 314-328; 366-377; 420-430; 494-506; 661-677;
RP 900-914 AND 1062-1077.
RA Herrmann V., Herrmann J., Kist M.;
RL Submitted (APR-1993) to UniProtKB.
CC -!- FUNCTION: May be necessary for the transcription, folding, export, or
CC function of the cytotoxin.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X70039; CAA49633.1; -; Genomic_DNA.
DR EMBL; AF282853; AAC44706.1; -; Genomic_DNA.
DR PIR; B48281; B48281.
DR RefSeq; WP_000180734.1; NZ_JAFCIE010000014.1.
DR AlphaFoldDB; P80200; -.
DR SMR; P80200; -.
DR MINT; P80200; -.
DR GO; GO:0019534; F:toxin transmembrane transporter activity; IEA:InterPro.
DR InterPro; IPR005169; CagA_C.
DR InterPro; IPR045157; CagA_N.
DR InterPro; IPR004355; IVSec_CagA.
DR Pfam; PF03507; CagA; 1.
DR Pfam; PF18971; CagA_N; 1.
DR PRINTS; PR01553; TYPE4SSCAGA.
PE 1: Evidence at protein level;
KW Direct protein sequencing.
FT CHAIN 1..1147
FT /note="Cytotoxicity-associated immunodominant antigen"
FT /id="PRO_0000089593"
FT REGION 625..647
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..640
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 320
FT /note="G -> A (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 325
FT /note="P -> F (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 328
FT /note="R -> K (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 426
FT /note="K -> E (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 429
FT /note="Q -> E (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 673..675
FT /note="AQN -> TED (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 901
FT /note="A -> T (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 903
FT /note="Q -> E (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 907
FT /note="L -> P (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 910
FT /note="P -> R (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 914
FT /note="Q -> E (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 1072
FT /note="P -> S (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
FT CONFLICT 1074
FT /note="S -> D (in Ref. 3; AA sequence)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1147 AA; 128014 MW; AB92770835F68490 CRC64;
MTNETIDQQP QTEAAFNPQQ FINNLQVAFL KVDNAVASYD PDQKPIVDKN DRDNRQAFEG
ISQLREEYSN KAIKNPTKKN QYFSDFINKS NDLINKDNLI DVESSTKSFQ KFGDQRYRIF
TSWVSHQNDP SKINTRSIRN FMENIIQPPI LDDKEKAEFL KSAKQSFAGI IIGNQIRTDQ
KFMGVFDESL KERQEAEKNG EPTGGDWLDI FLSFIFDKKQ SSDVKEAINQ EPVPHVQPDI
ATTTTDIQGL PPEARDLLDE RGNFSKFTLG DMEMLDVEGV ADIDPNYKFN QLLIHNNALS
SVLMGSHNGI EPEKVSLLYG GNGGPGARHD WNATVGYKDQ QGNNVATIIN VHMKNGSGLV
IAGGEKGINN PSFYLYKEDQ LTGSQRALSQ EEIQNKIDFM EFLAQNNAKL DNLSEKEKEK
FRTEIKDFQK DSKAYLDALG NDRIAFVSKK DTKHSALITE FGNGDLSYTL KDYGKKADKA
LDREKNVTLQ GSLKHDGVMF VDYSNFKYTN ASKNPNKGVG VTNGVSHLEV GFNKVAIFNL
PDLNNLAITS FVRRNLEDKL TTKGLSPQEA NKLIKDFLSS NKELVGKTLN FNKAVADAKN
TGNYDEVKKA QKDLEKSLRK REHLEKEVEK KLESKSGNKN KMEAKAQANS QKDEIFALIN
KEANRDARAI AYAQNLKGIK RELSDKLENV NKNLKDFDKS FDEFKNGKNK DFSKAEETLK
ALKGSVKDLG INPEWISKVE NLNAALNEFK NGKNKDFSKV TQAKSDLENS VKDVIINQKV
TDKVDNLNQA VSVAKATGDF SRVEQALADL KNFSKEQLAQ QAQKNESLNA RKKSEIYQSV
KNGVNGTLVG NGLSQAEATT LSKNFSDIKK ELNAKLGNFN NNNNNGLKNE PIYAKVNKKK
AGQAASLEEP IYAQVAKKVN AKIDRLNQIA SGLGVVGQAA GFPLKRHDKV DDLSKVGLSR
NQELAQKIDN LNQAVSEAKA GFFGNLEQTI DKLKDSTKHN PMNLWVESAK KVPASLSAKL
DNYATNSHIR INSNIKNGAI NEKATGMLTQ KNPEWLKLVN DKIVAHNVGS VPLSEYDKIG
FNQKNMKDYS DSFKFSTKLN NAVKDTNSGF TQFLTNAFST ASYYCLAREN AEHGIKNVNT
KGGFQKS