HGP1_HAEIN
ID HGP1_HAEIN Reviewed; 1063 AA.
AC P44795;
DT 01-NOV-1995, integrated into UniProtKB/Swiss-Prot.
DT 29-AUG-2001, sequence version 2.
DT 03-AUG-2022, entry version 137.
DE RecName: Full=Probable hemoglobin and hemoglobin-haptoglobin-binding protein 1;
DE Flags: Precursor;
GN OrderedLocusNames=HI_0635;
OS Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=71421;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 51907 / DSM 11121 / KW20 / Rd;
RX PubMed=7542800; DOI=10.1126/science.7542800;
RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M.,
RA McKenney K., Sutton G.G., FitzHugh W., Fields C.A., Gocayne J.D.,
RA Scott J.D., Shirley R., Liu L.-I., Glodek A., Kelley J.M., Weidman J.F.,
RA Phillips C.A., Spriggs T., Hedblom E., Cotton M.D., Utterback T.R.,
RA Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C., Fine L.D.,
RA Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M., Gnehm C.L., McDonald L.A.,
RA Small K.V., Fraser C.M., Smith H.O., Venter J.C.;
RT "Whole-genome random sequencing and assembly of Haemophilus influenzae
RT Rd.";
RL Science 269:496-512(1995).
RN [2]
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=ATCC 51907 / DSM 11121 / KW20 / Rd;
RX PubMed=10675023;
RX DOI=10.1002/(sici)1522-2683(20000101)21:2<411::aid-elps411>3.0.co;2-4;
RA Langen H., Takacs B., Evers S., Berndt P., Lahm H.W., Wipf B., Gray C.,
RA Fountoulakis M.;
RT "Two-dimensional map of the proteome of Haemophilus influenzae.";
RL Electrophoresis 21:411-429(2000).
CC -!- FUNCTION: Acts as a receptor for hemoglobin or the
CC hemoglobin/haptoglobin complex of the human host and is required for
CC heme uptake. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Cell outer membrane {ECO:0000250}; Peripheral
CC membrane protein {ECO:0000250}.
CC -!- MISCELLANEOUS: This protein is subject to phase-variable expression
CC associated with alteration in the length of the CCAA repeat region.
CC This mechanism is called slipped-strand mispairing. Addition or loss of
CC CCAA repeat units would change the reading frame and result in
CC introduction of stop codons downstream of the repeat region. This may
CC be a mechanism of regulation and a way to avoid the immunological
CC response of the host (By similarity). {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the TonB-dependent receptor family.
CC Hemoglobin/haptoglobin binding protein subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC22294.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L42023; AAC22294.1; ALT_SEQ; Genomic_DNA.
DR PIR; B64083; B64083.
DR RefSeq; NP_438795.2; NC_000907.1.
DR AlphaFoldDB; P44795; -.
DR SMR; P44795; -.
DR STRING; 71421.HI_0635; -.
DR EnsemblBacteria; AAC22294; AAC22294; HI_0635.
DR KEGG; hin:HI_0635; -.
DR PATRIC; fig|71421.8.peg.663; -.
DR eggNOG; COG1629; Bacteria.
DR eggNOG; COG4771; Bacteria.
DR HOGENOM; CLU_008287_19_0_6; -.
DR PhylomeDB; P44795; -.
DR Proteomes; UP000000579; Chromosome.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0031230; C:intrinsic component of cell outer membrane; IBA:GO_Central.
DR GO; GO:0015344; F:siderophore uptake transmembrane transporter activity; IBA:GO_Central.
DR GO; GO:0071702; P:organic substance transport; IEA:UniProt.
DR GO; GO:0044718; P:siderophore transmembrane transport; IBA:GO_Central.
DR Gene3D; 2.170.130.10; -; 1.
DR Gene3D; 2.40.170.20; -; 1.
DR InterPro; IPR039426; BtuB-like.
DR InterPro; IPR012910; Plug_dom.
DR InterPro; IPR037066; Plug_dom_sf.
DR InterPro; IPR006970; PT.
DR InterPro; IPR000531; TonB-dep_rcpt_b-brl.
DR InterPro; IPR036942; TonB_rcpt_b-brl_sf.
DR InterPro; IPR010917; TonB_rcpt_CS.
DR PANTHER; PTHR30069; PTHR30069; 1.
DR Pfam; PF07715; Plug; 1.
DR Pfam; PF04886; PT; 1.
DR Pfam; PF00593; TonB_dep_Rec; 1.
DR PROSITE; PS01156; TONB_DEPENDENT_REC_2; 1.
PE 1: Evidence at protein level;
KW Cell outer membrane; Membrane; Receptor; Reference proteome; Repeat;
KW Signal; TonB box; Transmembrane; Transmembrane beta strand; Transport.
FT SIGNAL 1..24
FT /evidence="ECO:0000255"
FT CHAIN 25..1063
FT /note="Probable hemoglobin and hemoglobin-haptoglobin-
FT binding protein 1"
FT /id="PRO_0000034785"
FT REPEAT 26..29
FT /note="1"
FT REPEAT 30..33
FT /note="2"
FT REPEAT 34..37
FT /note="3"
FT REPEAT 38..41
FT /note="4"
FT REPEAT 42..45
FT /note="5"
FT REPEAT 46..49
FT /note="6"
FT REPEAT 50..53
FT /note="7"
FT REGION 26..53
FT /note="7 X 4 AA tandem repeats of Q-P-T-N"
FT REGION 28..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 63..70
FT /note="TonB box"
FT MOTIF 1046..1063
FT /note="TonB C-terminal box"
SQ SEQUENCE 1063 AA; 121161 MW; 370CB515523F2788 CRC64;
MTNFKFSLLA CSIAFALNAS IAYAAQPTNQ PTNQPTNQPT NQPTNQPTNQ PTNQNSNVSE
QLEQINVSGS SENINVKEKK VGETQISAKK LAKQQASDSR DLVRYETGIT VVETGRTGAS
GYAVRGVDEN RVGIMVDGLR QAETLSSQGF KELFEGYGNF NNTRNSIEIE NVKTATITKG
ADSLKSGSGA LGGSVIFETK DARDYLIDKD YYLSYKRGYQ TMNNQNLKTL TLAGRSKKFD
ILIIDTTRDG HEIENYDYKI YPNKQADLRA VGPTREKADP YQITRQSTLI KLGFQPNENH
RLSVALDDST LETKGIDLSY ALRPYSTANN EKYGERIIND QSKRKNIQFS YENFSQTPFW
DHIKLSYSSQ KITNKARSDE YCHQSTCNGV SNPQGLHLVE EKGVYKIKDK YGGELESKEI
GWSHEFKNSK GEDADKDISQ RSSLDSVLIN CEKLDCSKKF RIYQEYDENS SEKYTYDDRE
IEVGTLPNGK KYGKIPLKKG KTPSWNGFPQ ETARFLFPKS YGYSTDFVND RDLNTHTQQI
KLDLDKEFHL WHTQHQLKYG GLYEKTLKSM VNHQYNTAAN VQWWADYFFC ARAKGGNLGE
KKTPHPNVSV AGCVNGTPLH SDIGKDTYLI PVTTKNNVLY FGDNVQLTSW LGLDLNYRYD
HVKYLPGYDE KTPVPGGLIA GIFVPFNEKD VVYGAYVPSG YKDCRYNTEC YKKNFEENLA
LLLRKTDYKH HSYNLGLNLD PTDWLRVQLK YANAFRAPTS DEIYMTFKHP DFSIGPNTNL
KAETAKTKEV AFTFYKENSY LTLSAFQSDY RNFIDLVFEK NKQIDKGSAI EYPFYQNQNR
DQARVRGIEI ASRLEMGDLF EKLQGFHLGY KLTYQKGRIK DNKLRSGYAE FLKLNPQYTA
IASQDQPMNA LQPTTSVYNI GYDAPSKKWG MDVYITDVAA KKAKDSFNSQ WTSMVKRKEN
IYGTERTVPA TQANGKDVKD SRGLWRNNRY TVIDTIAYWK PIKNLTFTAG VYNLTNKKYL
TWDSARSVRH LGTINRVETA TGKGLNRFYA PGRNYRMSVQ FEF
//