HGPC_HAEIF
ID HGPC_HAEIF Reviewed; 1066 AA.
AC Q9X442;
DT 29-AUG-2001, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1999, sequence version 1.
DT 03-AUG-2022, entry version 85.
DE RecName: Full=Hemoglobin and hemoglobin-haptoglobin-binding protein C;
DE Flags: Precursor;
GN Name=hgpC;
OS Haemophilus influenzae.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=727;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=HI689 / Serotype B;
RX PubMed=10338475; DOI=10.1128/iai.67.6.2729-2739.1999;
RA Morton D.J., Whitby P.W., Stull T.L.;
RT "Effect of multiple mutations in the hemoglobin- and hemoglobin-
RT haptoglobin-binding proteins, HgpA, HgpB, and HgpC, of Haemophilus
RT influenzae type b.";
RL Infect. Immun. 67:2729-2739(1999).
CC -!- FUNCTION: Acts as a receptor for hemoglobin or the
CC hemoglobin/haptoglobin complex of the human host and is required for
CC heme uptake.
CC -!- SUBCELLULAR LOCATION: Cell outer membrane.
CC -!- MISCELLANEOUS: This protein is subject to phase-variable expression
CC associated with alteration in the length of the CCAA repeat region.
CC This mechanism is called slipped-strand mispairing. Addition or loss of
CC CCAA repeat units would change the reading frame and result in
CC introduction of stop codons downstream of the repeat region. This may
CC be a mechanism of regulation and a way to avoid the immunological
CC response of the host.
CC -!- SIMILARITY: Belongs to the TonB-dependent receptor family.
CC Hemoglobin/haptoglobin binding protein subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF094574; AAD33112.1; -; Genomic_DNA.
DR AlphaFoldDB; Q9X442; -.
DR SMR; Q9X442; -.
DR GO; GO:0009279; C:cell outer membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0006811; P:ion transport; IEA:UniProt.
DR GO; GO:0071702; P:organic substance transport; IEA:UniProt.
DR Gene3D; 2.170.130.10; -; 1.
DR Gene3D; 2.40.170.20; -; 2.
DR InterPro; IPR039426; BtuB-like.
DR InterPro; IPR012910; Plug_dom.
DR InterPro; IPR037066; Plug_dom_sf.
DR InterPro; IPR006970; PT.
DR InterPro; IPR000531; TonB-dep_rcpt_b-brl.
DR InterPro; IPR036942; TonB_rcpt_b-brl_sf.
DR InterPro; IPR010917; TonB_rcpt_CS.
DR PANTHER; PTHR30069; PTHR30069; 1.
DR Pfam; PF07715; Plug; 1.
DR Pfam; PF04886; PT; 1.
DR Pfam; PF00593; TonB_dep_Rec; 1.
DR PROSITE; PS01156; TONB_DEPENDENT_REC_2; 1.
PE 3: Inferred from homology;
KW Cell outer membrane; Membrane; Receptor; Repeat; Signal; TonB box;
KW Transmembrane; Transmembrane beta strand; Transport.
FT SIGNAL 1..24
FT /evidence="ECO:0000255"
FT CHAIN 25..1066
FT /note="Hemoglobin and hemoglobin-haptoglobin-binding
FT protein C"
FT /id="PRO_0000034780"
FT REPEAT 26..29
FT /note="1"
FT REPEAT 30..33
FT /note="2"
FT REPEAT 34..37
FT /note="3"
FT REPEAT 38..41
FT /note="4"
FT REPEAT 42..45
FT /note="5"
FT REPEAT 46..49
FT /note="6"
FT REPEAT 50..53
FT /note="7"
FT REGION 26..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 26..53
FT /note="7 X 4 AA tandem repeats of Q-P-T-N"
FT MOTIF 63..70
FT /note="TonB box"
FT MOTIF 1049..1066
FT /note="TonB C-terminal box"
SQ SEQUENCE 1066 AA; 122594 MW; EFB88D5CE4247583 CRC64;
MTNFKFTLLA RSIAFALNAS TAYAAQPTNQ PTNQPTNQPT NQPTNQPTNQ PTNQDSNLSE
QLEQINVSGS TETINVKEKK VGETQISAKK LAKQQASDSR DLVRYETGIT VVETGRTGAS
GYAVRGVDEN RVGIMVDGLR QAETLSSQGF KELFEGYGNF NNTRNNIEIE NVKTATITKG
ADSLKSGSGA LGGSVIFETK DARDYLIDKD YYVSYKRGYQ TMNNQNLKTL TLAGRSKKFD
ILVVDTTRDG HEIENYDYKI YPNKQADLSA VGPTREKADP YQITRQSTLI KLGFQPNENH
RLSVALDDST LETKGMDLSY AFRPYSQADK EIYGERIIND QSKRKNIQFS YENFSQTPFW
DHIKLSYSSQ KITNKARSDE YCHQSTCNGV SNPQGLHLVE EKGVYKIVDK DNKDFNYQED
KNNPWSYGKE LYNSKNEKIS NDVDTEGGAL DSVLINCEKL NCEKKKFPIY KEKDEEWKDK
YEHEDRDITI KELNGKKYGE ISLKKSDSSG FTKYESARFL FPKSFGYSTD FVNDRDLNTN
TQQIKLDLDK EFHLWHAQHQ LKYGGLYEKT LKSMVNHQYN TAANVQWWAD YFFCKKPVNG
NRIPAPDHSA YRCKLMNSDI GKDTYLIPVT TKNNVLYFGD NVQLTSWLGL DLNYRYDHVK
YLPSYDKNIP VPNGLITGLF KKFKSTDYVY GNKYLVPKGY TNCTYTTDCY KQNFEENLAL
LLRKTDYKHH SYNLGLNLDP TNWLRVQLKY ANGFRAPTSD EIYMTFKHPQ FSIQPNTDLK
AETSKTKEVA FTFYKNSSYI TLNAFQNDYR NFIDLVEVGE RPIEEGSVVR YPFHQNQNRD
RARVRGIEIA SRLEMGDLLE KLQRFPLGYK FTYQKGRIKD NGLHPKYKEF LELNKDEHPE
YEAIARKPQP MNALQPTTSV YNIGYDAPSQ KWGVDMYITN VAAKKAKDSF NSQWTSMVAR
KEKIYDTEST VPAKKANGKE VKDSRGLWRN NRYTVIDTIA YWKPIKNLTF TAGVYNLTNK
KYLTWDSARS VRHLGTINRV ETATGKGLNR LYAPGRNYRM SVQFEF