HGP2_HAEIN
ID HGP2_HAEIN Reviewed; 999 AA.
AC P44809;
DT 01-NOV-1995, integrated into UniProtKB/Swiss-Prot.
DT 29-AUG-2001, sequence version 3.
DT 03-AUG-2022, entry version 143.
DE RecName: Full=Probable hemoglobin and hemoglobin-haptoglobin-binding protein 2;
DE Flags: Precursor;
GN OrderedLocusNames=HI_0661;
OS Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=71421;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 51907 / DSM 11121 / KW20 / Rd;
RX PubMed=7542800; DOI=10.1126/science.7542800;
RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M.,
RA McKenney K., Sutton G.G., FitzHugh W., Fields C.A., Gocayne J.D.,
RA Scott J.D., Shirley R., Liu L.-I., Glodek A., Kelley J.M., Weidman J.F.,
RA Phillips C.A., Spriggs T., Hedblom E., Cotton M.D., Utterback T.R.,
RA Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C., Fine L.D.,
RA Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M., Gnehm C.L., McDonald L.A.,
RA Small K.V., Fraser C.M., Smith H.O., Venter J.C.;
RT "Whole-genome random sequencing and assembly of Haemophilus influenzae
RT Rd.";
RL Science 269:496-512(1995).
RN [2]
RP SEQUENCE REVISION.
RA White O., Clayton R.A., Kerlavage A.R., Fleischmann R.D., Peterson J.,
RA Hickey E., Dodson R., Gwinn M.;
RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP IDENTIFICATION BY MASS SPECTROMETRY.
RC STRAIN=ATCC 51907 / DSM 11121 / KW20 / Rd;
RX PubMed=10675023;
RX DOI=10.1002/(sici)1522-2683(20000101)21:2<411::aid-elps411>3.0.co;2-4;
RA Langen H., Takacs B., Evers S., Berndt P., Lahm H.W., Wipf B., Gray C.,
RA Fountoulakis M.;
RT "Two-dimensional map of the proteome of Haemophilus influenzae.";
RL Electrophoresis 21:411-429(2000).
CC -!- FUNCTION: Acts as a receptor for hemoglobin or the
CC hemoglobin/haptoglobin complex of the human host and is required for
CC heme uptake. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Cell outer membrane {ECO:0000250}; Peripheral
CC membrane protein {ECO:0000250}.
CC -!- MISCELLANEOUS: This protein is subject to phase-variable expression
CC associated with alteration in the length of the CCAA repeat region.
CC This mechanism is called slipped-strand mispairing. Addition or loss of
CC CCAA repeat units would change the reading frame and result in
CC introduction of stop codons downstream of the repeat region. This may
CC be a mechanism of regulation and a way to avoid the immunological
CC response of the host (By similarity). {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the TonB-dependent receptor family.
CC Hemoglobin/haptoglobin binding protein subfamily. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAC22319.1; Type=Frameshift; Note=In the leader peptide and the repeats region.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; L42023; AAC22319.1; ALT_SEQ; Genomic_DNA.
DR PIR; I64084; I64084.
DR RefSeq; NP_438821.1; NC_000907.1.
DR AlphaFoldDB; P44809; -.
DR SMR; P44809; -.
DR STRING; 71421.HI_0661; -.
DR EnsemblBacteria; AAC22319; AAC22319; HI_0661.
DR KEGG; hin:HI_0661; -.
DR PATRIC; fig|71421.8.peg.690; -.
DR eggNOG; COG1629; Bacteria.
DR eggNOG; COG4771; Bacteria.
DR HOGENOM; CLU_008287_19_0_6; -.
DR PhylomeDB; P44809; -.
DR Proteomes; UP000000579; Chromosome.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0031230; C:intrinsic component of cell outer membrane; IBA:GO_Central.
DR GO; GO:0015344; F:siderophore uptake transmembrane transporter activity; IBA:GO_Central.
DR GO; GO:0071702; P:organic substance transport; IEA:UniProt.
DR GO; GO:0044718; P:siderophore transmembrane transport; IBA:GO_Central.
DR Gene3D; 2.170.130.10; -; 1.
DR Gene3D; 2.40.170.20; -; 1.
DR InterPro; IPR039426; BtuB-like.
DR InterPro; IPR012910; Plug_dom.
DR InterPro; IPR037066; Plug_dom_sf.
DR InterPro; IPR006970; PT.
DR InterPro; IPR000531; TonB-dep_rcpt_b-brl.
DR InterPro; IPR010949; TonB_Hb/transfer/lactofer_rcpt.
DR InterPro; IPR036942; TonB_rcpt_b-brl_sf.
DR InterPro; IPR010917; TonB_rcpt_CS.
DR PANTHER; PTHR30069; PTHR30069; 1.
DR Pfam; PF07715; Plug; 1.
DR Pfam; PF04886; PT; 1.
DR Pfam; PF00593; TonB_dep_Rec; 1.
DR TIGRFAMs; TIGR01786; TonB-hemlactrns; 1.
DR PROSITE; PS01156; TONB_DEPENDENT_REC_2; 1.
PE 1: Evidence at protein level;
KW Cell outer membrane; Membrane; Receptor; Reference proteome; Repeat;
KW Signal; TonB box; Transmembrane; Transmembrane beta strand; Transport.
FT SIGNAL 1..24
FT /evidence="ECO:0000255"
FT CHAIN 25..999
FT /note="Probable hemoglobin and hemoglobin-haptoglobin-
FT binding protein 2"
FT /id="PRO_0000034786"
FT REPEAT 26..29
FT /note="1"
FT REPEAT 30..33
FT /note="2"
FT REPEAT 34..37
FT /note="3"
FT REPEAT 38..41
FT /note="4"
FT REPEAT 42..45
FT /note="5"
FT REPEAT 46..49
FT /note="6"
FT REPEAT 50..53
FT /note="7"
FT REGION 26..56
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 26..53
FT /note="7 X 4 AA tandem repeats of Q-P-T-N"
FT MOTIF 63..70
FT /note="TonB box"
FT MOTIF 982..999
FT /note="TonB C-terminal box"
SQ SEQUENCE 999 AA; 114690 MW; 1A17AAB220092B7D CRC64;
MTNFRLNVLA YSVMLGLTAG VAYAAQPTNQ PTNQPTNQPT NQPTNQPTNQ PTNQNSNVSE
QLEQINVSGS TENSDTKTPP KIAETVKTAK TLEREQANNI KDIVKYETGV TVVEAGRFGQ
SGFAIRGVDE NRVAINIDGL RQAETLSSQG FKELFEGYGN FNNTRNGAEI ETLKEVNITK
GADSIKNGSG SLGGSVIYKT KDARDYLINK DYYVSYKKGY ATENNQSFDT LTLAGRYKKF
DVLVVTTSRN GHELENYGYK NYNDKIQGKK REKADPYKIE QDSTLLKLSF NPTENHRFTF
AADLYEHRSR GQDLSYTLKY QRSGNETPEV DSRHTNDKTK RRNISFSYEN FSQTPFWDTL
KLTYSDQRIK TRARTDEYCD AGVRHCEGTD NPTGLKVTNG KITRRDGSDL QFEEKNNTAK
SSDKTYDFKK FIDTDKRVID DKLVLNNPSD TWYDCSIFNC ENNAKIKVFK GNNYYGYDGK
WKEVDLEIKE LNGKKFAKIK DNDRKIKSIL PSSPGYLERL WQERDLDTNT QQLNLDLTKD
FKIWHIEHNL QYGGSYNTAM KRMVNRAGND ASDVQWWATP TLGEDSWTGK PHTCATTYEW
NANLCPRVDP EFSYLLPIKT TGKSVYLFDN FVITDYLSFD LGYRYDNIHY QPKYKHGITP
KLPDDIVKGL FIPLPNNSNS DPNKVKENVQ QNIDYIAKQN KKYKAHSYSF VSTIDPTSFL
RLQLKYSKGF RTPTSDEMYF TFKHPDFTIL PNTDLKPEIA KTKEIAFTLH NDDWGFISTS
LFKTNYKNFI DLIFKKQETF KVGGSGRGET LPFSLYQNIN RDNASLKGIE INSKVFLGKM
AKFMDGFNLS YKYTYQKGRM NGNIPMNAIQ PRTMVYGLGY DHPNHKFGFD FYTTHVASKN
PEDTYNMFYK EENKKDSTIK WRSKSYTILD LIGYVQPIKN LTIRAGVYNL TNRKYITWDS
ARSIRSFGTS NVIDQSTGLG INRFYAPGRN YKMSVQFEF