HGPA_HAEIF
ID HGPA_HAEIF Reviewed; 1077 AA.
AC Q9ZA21; Q9R649;
DT 29-AUG-2001, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-1999, sequence version 1.
DT 03-AUG-2022, entry version 85.
DE RecName: Full=Hemoglobin and hemoglobin-haptoglobin-binding protein A;
DE AltName: Full=Heme-repressible hemoglobin-binding protein;
DE Short=Hgb;
DE Flags: Precursor;
GN Name=hgpA;
OS Haemophilus influenzae.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=727;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=HI689 / Serotype B;
RX PubMed=10220170; DOI=10.1099/13500872-145-4-905;
RA Jin H., Ren Z., Whitby P.W., Morton D.J., Stull T.L.;
RT "Characterization of hgpA, a gene encoding a haemoglobin/haemoglobin-
RT haptoglobin-binding protein of Haemophilus influenzae.";
RL Microbiology 145:905-914(1999).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-145 AND 988-1077, AND PARTIAL
RP PROTEIN SEQUENCE.
RC STRAIN=HI689 / Serotype B;
RX PubMed=8757844; DOI=10.1128/iai.64.8.3134-3141.1996;
RA Jin H., Ren Z., Pozsgay J.M., Elkins C., Whitby P.W., Morton D.J.,
RA Stull T.L.;
RT "Cloning of a DNA fragment encoding a heme-repressible hemoglobin-binding
RT outer membrane protein from Haemophilus influenzae.";
RL Infect. Immun. 64:3134-3141(1996).
RN [3]
RP ROLE OF CCAA NUCLEOTIDE REPEATS.
RX PubMed=10482534; DOI=10.1128/jb.181.18.5865-5870.1999;
RA Ren Z., Jin H., Whitby P.W., Morton D.J., Stull T.L.;
RT "Role of CCAA nucleotide repeats in regulation of hemoglobin and
RT hemoglobin-haptoglobin binding protein genes of Haemophilus influenzae.";
RL J. Bacteriol. 181:5865-5870(1999).
CC -!- FUNCTION: Acts as a receptor for hemoglobin or the
CC hemoglobin/haptoglobin complex of the human host and is required for
CC heme uptake.
CC -!- SUBCELLULAR LOCATION: Cell outer membrane.
CC -!- MISCELLANEOUS: This protein is subject to phase-variable expression
CC associated with alteration in the length of the CCAA repeat region.
CC This mechanism is called slipped-strand mispairing. Addition or loss of
CC CCAA repeat units would change the reading frame and result in
CC introduction of stop codons downstream of the repeat region. This may
CC be a mechanism of regulation and a way to avoid the immunological
CC response of the host.
CC -!- SIMILARITY: Belongs to the TonB-dependent receptor family.
CC Hemoglobin/haptoglobin binding protein subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U51922; AAD10835.1; -; Genomic_DNA.
DR AlphaFoldDB; Q9ZA21; -.
DR GO; GO:0009279; C:cell outer membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0022857; F:transmembrane transporter activity; IEA:InterPro.
DR GO; GO:0006811; P:ion transport; IEA:UniProt.
DR GO; GO:0071702; P:organic substance transport; IEA:UniProt.
DR Gene3D; 2.170.130.10; -; 1.
DR Gene3D; 2.40.170.20; -; 2.
DR InterPro; IPR039426; BtuB-like.
DR InterPro; IPR012910; Plug_dom.
DR InterPro; IPR037066; Plug_dom_sf.
DR InterPro; IPR006970; PT.
DR InterPro; IPR000531; TonB-dep_rcpt_b-brl.
DR InterPro; IPR010949; TonB_Hb/transfer/lactofer_rcpt.
DR InterPro; IPR036942; TonB_rcpt_b-brl_sf.
DR InterPro; IPR010917; TonB_rcpt_CS.
DR PANTHER; PTHR30069; PTHR30069; 2.
DR Pfam; PF07715; Plug; 1.
DR Pfam; PF04886; PT; 2.
DR Pfam; PF00593; TonB_dep_Rec; 1.
DR TIGRFAMs; TIGR01786; TonB-hemlactrns; 1.
DR PROSITE; PS01156; TONB_DEPENDENT_REC_2; 1.
PE 1: Evidence at protein level;
KW Cell outer membrane; Direct protein sequencing; Membrane; Receptor; Repeat;
KW Signal; TonB box; Transmembrane; Transmembrane beta strand; Transport.
FT SIGNAL 1..24
FT CHAIN 25..1077
FT /note="Hemoglobin and hemoglobin-haptoglobin-binding
FT protein A"
FT /id="PRO_0000034778"
FT REPEAT 26..29
FT /note="1"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 30..33
FT /note="2"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 34..37
FT /note="3"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 38..41
FT /note="4"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 42..45
FT /note="5"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 46..49
FT /note="6"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 50..53
FT /note="7"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 54..57
FT /note="8"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 58..61
FT /note="9"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 62..65
FT /note="10"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REPEAT 66..69
FT /note="11"
FT /evidence="ECO:0000269|PubMed:10482534"
FT REGION 25..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 26..69
FT /note="11 X 4 AA tandem repeats of P-T-N-Q"
FT MOTIF 78..85
FT /note="TonB box"
FT MOTIF 1060..1077
FT /note="TonB C-terminal box"
SQ SEQUENCE 1077 AA; 122814 MW; 693F673BB5AC59F1 CRC64;
MTNFRLNVLA YSVMLGLTAS VAYAEPTNQP TNQPTNQPTN QPTNQPTNQP TNQPTNQPTN
QPTNQPTNQN SNASEQLEQI NVSGSTENTD TKAPPKIAET VKTAKKLEKE QAQDVKDLVR
YETGITVVEA GRFGNSGFAV RGVEENRVAV QIDGLHQAET ISSQGFKELF EGYGNFNNTR
NSAEIETLKQ VTIRKGADSL KSGSGALGGS VSLDTKDARD YLLNKNYYAS YKRGYNTADN
QNLNTLTLGG RYKYFDAIAV LTSRKGHELE NFGYKNYNDK IQGKTREKAD PYRRTQDSAL
LKIGFQPTEN HRFSVVADLY KQTSKGHDFS YTLKPNTQYM TYDEKELRHT NDKVERKNIA
FVYENFTETP FWDTLKITYS HQKITTSART DDYCDGNDKC ALAGNPLGMK YNQDNQLVGK
DGKSAKYQDI NKTQVIKERL PFTKPNGRWR FHKVDWDALK KKYPGVPIYA SCLEEDNDPS
EFCTYEVKTT KKENTFEING KRYDLLSEAD KNVISDEQRL PTNVSYLFSC DGLNCDKKTI
LGFKKRRNLL KIFLFEVIEK RCQKYGKTKV KANDQLSGPY LFMPNKKGYQ ANLWSQRDLT
SETKQINLDL TKHLELGKTQ HDLSYGGLWS EMEKSMTNLA GDTPLNVKWW AQYPHNCATF
LPPSTMTPNA KPTLNPERTS TLCNNVNVFS FLIPVKTKTG ALYFINDFRV NNYVAFNLGY
RYDRVKYEPE YIPGKTPKIP DDMVTNLYIK TPEFDASKAD SDPDELSKKE ANAAANIKEI
AQPKKFSASS YSFGTTLDPL NWLRLQAKYS KGFRAPTSDE IYFTFKHPDF SIQPNRDLQP
ETAKTKELSL TVHNDMGYIT TSVFDTRYQN FIDLSYQGRR DVHGHSKLIP FHFYQNVNRP
NAKVTGFEIA SQISLGNITK LFNGFSLSYK YTYQKGRING NIPMNAIQPR TAVYGVSYVH
PDDKYGLDLY ISHASAKNAE DTYNMFYKEE GKTDSTIKWR SKSYTTIDLL GYIKPIKNLT
LRAGVYNLTN RKYITWDSAR SIRPFGTSNM INQDTGLGIN RFYAPERNYR MSVQFEF