ADGB_HUMAN
ID ADGB_HUMAN Reviewed; 1667 AA.
AC Q8N7X0; Q5T402; Q5T904; Q5T905;
DT 18-APR-2006, integrated into UniProtKB/Swiss-Prot.
DT 15-JUN-2010, sequence version 3.
DT 03-AUG-2022, entry version 140.
DE RecName: Full=Androglobin;
DE AltName: Full=Calpain-7-like protein;
GN Name=ADGB; Synonyms=C6orf103, CAPN7L;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RX PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D.,
RA Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A.,
RA Wiemann S., Schupp I.;
RT "The full-ORF clone resource of the German cDNA consortium.";
RL BMC Genomics 8:399-399(2007).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT ALA-1637.
RX PubMed=14574404; DOI=10.1038/nature02055;
RA Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L.,
RA Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R.,
RA Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D.,
RA Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J.,
RA Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H.,
RA Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J.,
RA Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P.,
RA Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V.,
RA Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J.,
RA Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E.,
RA Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J.,
RA French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J.,
RA Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C.,
RA Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A.,
RA Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R.,
RA Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M.,
RA Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K.,
RA Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R.,
RA Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M.,
RA Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A.,
RA Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L.,
RA Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I.,
RA Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y.,
RA Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E.,
RA Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A.,
RA Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W.,
RA Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M.,
RA West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J.,
RA Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M.,
RA Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I.,
RA Rogers J., Beck S.;
RT "The DNA sequence and analysis of human chromosome 6.";
RL Nature 425:805-811(2003).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-754 (ISOFORM 1), AND VARIANT
RP THR-310.
RC TISSUE=Testis;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 697-964.
RA Stevens M., Wei C., Gross S.S., McPherson J., Brent M.R.;
RT "Exhaustive RT-PCR and sequencing of all novel TWINSCAN predictions in
RT human.";
RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q8N7X0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8N7X0-2; Sequence=VSP_039243, VSP_039244;
CC -!- SIMILARITY: Belongs to the peptidase C2 family. {ECO:0000305}.
CC -!- CAUTION: Lacks the conserved active site residues. Probably
CC catalytically inactive. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AL832192; Type=Frameshift; Evidence={ECO:0000305};
CC Sequence=AL832192; Type=Miscellaneous discrepancy; Note=Intron retention.; Evidence={ECO:0000305};
CC Sequence=CAI16490.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAI20488.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=DN831198; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL832192; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; AL138916; CAI14697.1; -; Genomic_DNA.
DR EMBL; AL158199; CAI14697.1; JOINED; Genomic_DNA.
DR EMBL; AL138916; CAI14698.1; -; Genomic_DNA.
DR EMBL; AL158199; CAI14698.1; JOINED; Genomic_DNA.
DR EMBL; AL158199; CAI20488.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL359547; CAI20488.1; JOINED; Genomic_DNA.
DR EMBL; AL158199; CAI20489.1; -; Genomic_DNA.
DR EMBL; AL138916; CAI20489.1; JOINED; Genomic_DNA.
DR EMBL; AL158199; CAI20490.1; -; Genomic_DNA.
DR EMBL; AL138916; CAI20490.1; JOINED; Genomic_DNA.
DR EMBL; AL359547; CAI16490.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL158199; CAI16490.1; JOINED; Genomic_DNA.
DR EMBL; AK097570; BAC05106.1; -; mRNA.
DR EMBL; DN831198; -; NOT_ANNOTATED_CDS; mRNA.
DR RefSeq; NP_078970.3; NM_024694.3. [Q8N7X0-1]
DR AlphaFoldDB; Q8N7X0; -.
DR SMR; Q8N7X0; -.
DR BioGRID; 122859; 5.
DR IntAct; Q8N7X0; 6.
DR STRING; 9606.ENSP00000381036; -.
DR MEROPS; C02.972; -.
DR iPTMnet; Q8N7X0; -.
DR PhosphoSitePlus; Q8N7X0; -.
DR BioMuta; ADGB; -.
DR DMDM; 298286920; -.
DR EPD; Q8N7X0; -.
DR jPOST; Q8N7X0; -.
DR MassIVE; Q8N7X0; -.
DR MaxQB; Q8N7X0; -.
DR PaxDb; Q8N7X0; -.
DR PeptideAtlas; Q8N7X0; -.
DR PRIDE; Q8N7X0; -.
DR ProteomicsDB; 72338; -. [Q8N7X0-1]
DR ProteomicsDB; 72339; -. [Q8N7X0-2]
DR Antibodypedia; 50988; 17 antibodies from 9 providers.
DR DNASU; 79747; -.
DR Ensembl; ENST00000397944.8; ENSP00000381036.3; ENSG00000118492.18. [Q8N7X0-1]
DR GeneID; 79747; -.
DR KEGG; hsa:79747; -.
DR MANE-Select; ENST00000397944.8; ENSP00000381036.3; NM_024694.4; NP_078970.3.
DR UCSC; uc010khx.4; human. [Q8N7X0-1]
DR CTD; 79747; -.
DR DisGeNET; 79747; -.
DR GeneCards; ADGB; -.
DR HGNC; HGNC:21212; ADGB.
DR HPA; ENSG00000118492; Tissue enhanced (fallopian tube, testis).
DR MIM; 614630; gene.
DR neXtProt; NX_Q8N7X0; -.
DR OpenTargets; ENSG00000118492; -.
DR PharmGKB; PA134944476; -.
DR VEuPathDB; HostDB:ENSG00000118492; -.
DR eggNOG; KOG0045; Eukaryota.
DR GeneTree; ENSGT00390000014904; -.
DR HOGENOM; CLU_003228_0_0_1; -.
DR InParanoid; Q8N7X0; -.
DR OMA; GHAIHIC; -.
DR PhylomeDB; Q8N7X0; -.
DR TreeFam; TF329120; -.
DR PathwayCommons; Q8N7X0; -.
DR SignaLink; Q8N7X0; -.
DR BioGRID-ORCS; 79747; 2 hits in 247 CRISPR screens.
DR ChiTaRS; ADGB; human.
DR GenomeRNAi; 79747; -.
DR Pharos; Q8N7X0; Tdark.
DR PRO; PR:Q8N7X0; -.
DR Proteomes; UP000005640; Chromosome 6.
DR RNAct; Q8N7X0; protein.
DR Bgee; ENSG00000118492; Expressed in right uterine tube and 92 other tissues.
DR ExpressionAtlas; Q8N7X0; baseline and differential.
DR Genevisible; Q8N7X0; HS.
DR GO; GO:0004198; F:calcium-dependent cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0020037; F:heme binding; IEA:InterPro.
DR GO; GO:0019825; F:oxygen binding; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 1.10.490.10; -; 1.
DR InterPro; IPR012292; Globin/Proto.
DR InterPro; IPR000048; IQ_motif_EF-hand-BS.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001300; Peptidase_C2_calpain_cat.
DR Pfam; PF00648; Peptidase_C2; 1.
DR SMART; SM00230; CysPc; 1.
DR SUPFAM; SSF54001; SSF54001; 1.
DR PROSITE; PS50203; CALPAIN_CAT; 1.
DR PROSITE; PS50096; IQ; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Coiled coil; Reference proteome.
FT CHAIN 1..1667
FT /note="Androglobin"
FT /id="PRO_0000232525"
FT DOMAIN 70..411
FT /note="Calpain catalytic"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00239"
FT DOMAIN 906..935
FT /note="IQ"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00116"
FT REGION 1..45
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 347..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 540..566
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1297..1355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1420..1522
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1646..1667
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1588..1629
FT /evidence="ECO:0000255"
FT COMPBIAS 1..20
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..41
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..387
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..566
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1301..1317
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1318..1339
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1431..1445
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1447..1472
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1482..1496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..951
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000305"
FT /id="VSP_039243"
FT VAR_SEQ 1185..1288
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000305"
FT /id="VSP_039244"
FT VARIANT 310
FT /note="I -> T (in dbSNP:rs9497606)"
FT /evidence="ECO:0000269|PubMed:14702039"
FT /id="VAR_025948"
FT VARIANT 1637
FT /note="T -> A (in dbSNP:rs1052445)"
FT /evidence="ECO:0000269|PubMed:14574404"
FT /id="VAR_063158"
FT CONFLICT 205..280
FT /note="Missing (in Ref. 1; AL832192)"
FT /evidence="ECO:0000305"
FT CONFLICT 570..580
FT /note="Missing (in Ref. 1; AL832192)"
FT /evidence="ECO:0000305"
FT CONFLICT 716
FT /note="T -> R (in Ref. 4; DN831198)"
FT /evidence="ECO:0000305"
FT CONFLICT 747..754
FT /note="RHMLLFNA -> YEVASFFP (in Ref. 3; BAC05106)"
FT /evidence="ECO:0000305"
FT CONFLICT 810
FT /note="K -> M (in Ref. 1; AL832192)"
FT /evidence="ECO:0000305"
FT CONFLICT 931
FT /note="P -> R (in Ref. 4; DN831198)"
FT /evidence="ECO:0000305"
FT CONFLICT 944
FT /note="Q -> L (in Ref. 4; DN831198)"
FT /evidence="ECO:0000305"
FT CONFLICT 1639
FT /note="A -> S (in Ref. 2; CAI14697/CAI20489)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1667 AA; 189713 MW; 46732C56A6C63B3F CRC64;
MASKQTKKKE VHRINSAHGS DKSKDFYPFG SNVQSGSTEQ KKGKFPLWPE WSEADINSEK
WDAGKGAKEK DKTGKSPVFH FFEDPEGKIE LPPSLKIYSW KRPQDILFSQ TPVVVKNEIT
FDLFSANEHL LCSELMRWII SEIYAVWKIF NGGILSNYFK GTSGEPPLLP WKPWEHIYSL
CKAVKGHMPL FNSYGKYVVK LYWMGCWRKI TIDDFLPFDE DNNLLLPATT YEFELWPMLL
SKAIIKLANI DIHVADRREL GEFTVIHALT GWLPEVISLH PGYMDKVWEL LKEILPEFKL
SDEASSESKI AVLDSKLKEP GKEGKEGKEI KDGKEVKDVK EFKPESSLTT LKAPEKSDKV
PKEKADARDI GKKRSKDGEK EKFKFSLHGS RPSSEVQYSV QSLSDCSSAI QTSHMVVYAT
FTPLYLFENK IFSLEKMADS AEKLREYGLS HICSHPVLVT RSRSCPLVAP PKPPPLPPWK
LIRQKKETVI TDEAQELIVK KPERFLEISS PFLNYRMTPF TIPTEMHFVR SLIKKGIPPG
SDLPSVSETD ETATHSQTDL SQITKATSQG NTASQVILGK GTDEQTDFGL GDAHQSDGLN
LEREIVSQTT ATQEKSQEEL PTTNNSVSKE IWLDFEDFCV CFQNIYIFHK PSSYCLNFQK
SEFKFSEERV SYYLFVDSLK PIELLVCFSA LVRWGEYGAL TKDSPPIEPG LLTAETFSWK
SLKPGSLVLK IHTYATKATV VRLPVGRHML LFNAYSPVGH SIHICSMVSF VIGDEHVVLP
NFEPESCRFT EQSLLIMKAI GNVIANFKDK GKLSAALKDL QTAHYPVPFH DKELTAQHFR
VFHLSLWRLM KKVQITKPPP NFKFAFRAMV LDLELLNSSL EEVSLVEWLD VKYCMPTSDK
EYSAEEVAAA IKIQAMWRGT YVRLLMKARI PDTKENISVA DTLQKVWAVL EMNLEQYAVS
LLRLMFKSKC KSLESYPCYQ DEETKIAFAD YTVTYQEQPP NSWFIVFRET FLVHQDMILV
PKVYTTLPIC ILHIVNNDTM EQVPKVFQKV VPYLYTKNKK GYTFVAEAFT GDTYVAASRW
KLRLIGSSAP LPCLSRDSPC NSFAIKEIRD YYIPNDKKIL FRYSVKVLTP QPATIQVRTS
KPDAFIKLQV LENEETMVSS TGKGQAIIPA FHFLKSEKGL SSQSSKHILS FHSASKKEQE
VYVKKKAAQG IQKSPKGRAV SAIQDIGLPL VEEETTSTPT REDSSSTPLQ NYKYIIQCSV
LYNSWPLTES QLTFVQALKD LKKSNTKAYG ERHEELINLG SPDSHTISEG QKSSVTSKTT
RKGKEKSSEK EKTAKEKQAP RFEPQISTVH PQQEDPNKPY WILRLVTEHN ESELFEVKKD
TERADEIRAM KQAWETTEPG RAIKASQARL HYLSGFIKKT SDAESPPISE SQTKPKEEVE
TAARGVKEPN SKNSAGSESK EMTQTGSGSA VWKKWQLTKG LRDVAKSTSS ESGGVSSPGK
EEREQSTRKE NIQTGPRTRS PTILETSPRL IRKALEFMDL SQYVRKTDTD PLLQTDELNQ
QQAMQKAEEI HQFRQHRTRV LSIRNIDQEE RLKLKDEVLD MYKEMQDSLD EARQKIFDIR
EEYRNKLLEA EHLKLETLAA QEAAMKLETE KMTPAPDTQK KKKGKKK