HGP4_HAEIN
ID HGP4_HAEIN Reviewed; 999 AA.
AC Q57408; O86244; P96344;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 29-AUG-2001, sequence version 3.
DT 03-AUG-2022, entry version 119.
DE RecName: Full=Probable hemoglobin and hemoglobin-haptoglobin-binding protein 4;
DE Flags: Precursor;
GN OrderedLocusNames=HI_1565/HI_1567;
OS Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=71421;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 51907 / DSM 11121 / KW20 / Rd;
RX PubMed=7542800; DOI=10.1126/science.7542800;
RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M.,
RA McKenney K., Sutton G.G., FitzHugh W., Fields C.A., Gocayne J.D.,
RA Scott J.D., Shirley R., Liu L.-I., Glodek A., Kelley J.M., Weidman J.F.,
RA Phillips C.A., Spriggs T., Hedblom E., Cotton M.D., Utterback T.R.,
RA Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C., Fine L.D.,
RA Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M., Gnehm C.L., McDonald L.A.,
RA Small K.V., Fraser C.M., Smith H.O., Venter J.C.;
RT "Whole-genome random sequencing and assembly of Haemophilus influenzae
RT Rd.";
RL Science 269:496-512(1995).
RN [2]
RP SEQUENCE REVISION.
RA White O., Clayton R.A., Kerlavage A.R., Fleischmann R.D., Peterson J.,
RA Hickey E., Dodson R., Gwinn M.;
RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Acts as a receptor for hemoglobin or the
CC hemoglobin/haptoglobin complex of the human host and is required for
CC heme uptake. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Cell outer membrane {ECO:0000250}; Peripheral
CC membrane protein {ECO:0000250}.
CC -!- MISCELLANEOUS: This protein is subject to phase-variable expression
CC associated with alteration in the length of the CCAA repeat region.
CC This mechanism is called slipped-strand mispairing. Addition or loss of
CC CCAA repeat units would change the reading frame and result in
CC introduction of stop codons downstream of the repeat region. This may
CC be a mechanism of regulation and a way to avoid the immunological
CC response of the host (By similarity). {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the TonB-dependent receptor family.
CC Hemoglobin/haptoglobin binding protein subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC23213.1; Type=Frameshift; Note=The first frameshift is found in the repeats region.; Evidence={ECO:0000305};
CC Sequence=AAC23214.1; Type=Frameshift; Note=The first frameshift is found in the repeats region.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L42023; AAC23213.1; ALT_SEQ; Genomic_DNA.
DR EMBL; L42023; AAC23214.1; ALT_SEQ; Genomic_DNA.
DR PIR; A64130; A64130.
DR AlphaFoldDB; Q57408; -.
DR SMR; Q57408; -.
DR STRING; 71421.HI_1567; -.
DR PRIDE; Q57408; -.
DR EnsemblBacteria; AAC23213; AAC23213; HI_1565.
DR EnsemblBacteria; AAC23214; AAC23214; HI_1567.
DR KEGG; hin:HI_1565; -.
DR KEGG; hin:HI_1567; -.
DR eggNOG; COG1629; Bacteria.
DR eggNOG; COG4771; Bacteria.
DR HOGENOM; CLU_078218_0_0_6; -.
DR PhylomeDB; Q57408; -.
DR Proteomes; UP000000579; Chromosome.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0031230; C:intrinsic component of cell outer membrane; IBA:GO_Central.
DR GO; GO:0015344; F:siderophore uptake transmembrane transporter activity; IBA:GO_Central.
DR GO; GO:0071702; P:organic substance transport; IEA:UniProt.
DR GO; GO:0044718; P:siderophore transmembrane transport; IBA:GO_Central.
DR Gene3D; 2.170.130.10; -; 1.
DR Gene3D; 2.40.170.20; -; 2.
DR InterPro; IPR039426; BtuB-like.
DR InterPro; IPR012910; Plug_dom.
DR InterPro; IPR037066; Plug_dom_sf.
DR InterPro; IPR006970; PT.
DR InterPro; IPR000531; TonB-dep_rcpt_b-brl.
DR InterPro; IPR010949; TonB_Hb/transfer/lactofer_rcpt.
DR InterPro; IPR036942; TonB_rcpt_b-brl_sf.
DR InterPro; IPR010917; TonB_rcpt_CS.
DR PANTHER; PTHR30069; PTHR30069; 1.
DR Pfam; PF07715; Plug; 1.
DR Pfam; PF04886; PT; 1.
DR Pfam; PF00593; TonB_dep_Rec; 1.
DR TIGRFAMs; TIGR01786; TonB-hemlactrns; 1.
DR PROSITE; PS01156; TONB_DEPENDENT_REC_2; 1.
PE 3: Inferred from homology;
KW Cell outer membrane; Membrane; Receptor; Reference proteome; Repeat;
KW Signal; TonB box; Transmembrane; Transmembrane beta strand; Transport.
FT SIGNAL 1..24
FT /evidence="ECO:0000255"
FT CHAIN 25..999
FT /note="Probable hemoglobin and hemoglobin-haptoglobin-
FT binding protein 4"
FT /id="PRO_0000034788"
FT REPEAT 26..29
FT /note="1"
FT REPEAT 30..33
FT /note="2"
FT REPEAT 34..37
FT /note="3"
FT REPEAT 38..41
FT /note="4"
FT REPEAT 42..45
FT /note="5"
FT REPEAT 46..49
FT /note="6"
FT REGION 25..52
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 26..49
FT /note="6 X 4 AA tandem repeats of P-T-N-Q"
FT MOTIF 58..65
FT /note="TonB box"
FT MOTIF 982..999
FT /note="TonB C-terminal box"
SQ SEQUENCE 999 AA; 114315 MW; DAFCD4EB7000A876 CRC64;
MTNFRLNVLA YSVMLGLTAS VAYAEPTNQP TNQPTNQPTN QPTNQPTNQN SNASEQLEQI
NVLGSDNNND NTPPKIAETV KTASQLKRQQ VQDSRDLVRY ETGVTVVEAG RFGSSGYAIR
GVDENRVAIT VDGLHQAETL SSQGFKELFE GYGNFNNTRN SVEIETLKVA KIAKGADSVK
VGSGSLGGAV LFETKDARDF LTEKDWHIGY KAGYSTADNQ GLNAVTLAGR YQMFDALIMH
SKRHGHELEN YDYKNGRDIQ GKEREKADPY TITKESTLVK FSFSPTENHR FTVASDTYLQ
HSRGHDFSYN LVKTTYINKD EEELRHTNDL TKRKNVSFTY ENYTVTPFWD TLKLSYSQQR
ITTRARTEDY CDGNEKCDSY KNPLGLQLKE GKVVDRNGDP VELKLVEDEQ GQKRHQVVDK
YNNPFSVASG TNNDAFVGKQ LSPSEFWLDC SIFNCDKPVR VYKYQYSNQE PESKEVELNR
TMEINGKKFA TYESNNYRDR YHMILPNSKG YLPLDYKERD LNTKTKQINL DLTKAFTLFE
IENELSYGGV YAKTTKEMVN KAGYYGRNPT WWAERTLGKS LLNGLRTCKE DSSYNGLLCP
RHEPKTSFLI PVETTTKSLY FADNIKLHNM LSVDLGYRYD DIKYQPEYIP GVTPKIADDM
VRELFVPLPP ANGKDWQGNP VYTPEQIRKN AEENIAYIAQ EKRFKKHSYS LGATFDPLNF
LRVQVKYSKG FRTPTSDELY FTFKHPDFTI LPNPNMKPEE AKNQEIALTF HHDWGFFSTN
VFQTKYRQFI DLAYLGSRNL SNSVGGQAQA RDFQVYQNVN VDRAKVKGVE INSRLNIGYF
FEKLDGFNVS YKFTYQRGRL DGNRPMNAIQ PKTSVIGLGY DHKEQRFGAD LYVTHVSAKK
AKDTYNMFYK EQGYKDSAVR WRSDDYTLVD FVTYIKPVKN VTLQFGVYNL TDRKYLTWES
ARSIKPFGTS NLINQGTGAG INRFYSPGRN YKLSAEITF