EGFEM_MOUSE
ID EGFEM_MOUSE Reviewed; 590 AA.
AC Q8C088; B7ZCE8; Q149G1;
DT 10-JUN-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 134.
DE RecName: Full=EGF-like and EMI domain-containing protein 1;
DE Flags: Precursor;
GN Name=Egfem1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=C57BL/6J; TISSUE=Medulla oblongata;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q8C088-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8C088-2; Sequence=VSP_034213;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK032017; BAC27650.1; -; mRNA.
DR EMBL; AC110222; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC119240; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL691427; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC117810; AAI17811.1; -; mRNA.
DR CCDS; CCDS50881.1; -. [Q8C088-1]
DR CCDS; CCDS50882.1; -. [Q8C088-2]
DR RefSeq; NP_001161220.1; NM_001167748.1. [Q8C088-2]
DR AlphaFoldDB; Q8C088; -.
DR BioGRID; 217705; 2.
DR STRING; 10090.ENSMUSP00000112943; -.
DR MaxQB; Q8C088; -.
DR PaxDb; Q8C088; -.
DR PRIDE; Q8C088; -.
DR Ensembl; ENSMUST00000118531; ENSMUSP00000112907; ENSMUSG00000063600. [Q8C088-2]
DR GeneID; 75740; -.
DR KEGG; mmu:75740; -.
DR UCSC; uc008ouk.2; mouse. [Q8C088-2]
DR CTD; 75740; -.
DR MGI; MGI:1922990; Egfem1.
DR VEuPathDB; HostDB:ENSMUSG00000063600; -.
DR eggNOG; KOG1218; Eukaryota.
DR GeneTree; ENSGT00940000164694; -.
DR InParanoid; Q8C088; -.
DR OrthoDB; 25795at2759; -.
DR BioGRID-ORCS; 75740; 0 hits in 72 CRISPR screens.
DR ChiTaRS; Egfem1; mouse.
DR PRO; PR:Q8C088; -.
DR Proteomes; UP000000589; Chromosome 3.
DR RNAct; Q8C088; protein.
DR Bgee; ENSMUSG00000063600; Expressed in dentate gyrus of hippocampal formation granule cell and 59 other tissues.
DR ExpressionAtlas; Q8C088; baseline and differential.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR011489; EMI_domain.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF07546; EMI; 1.
DR SMART; SM00181; EGF; 8.
DR SMART; SM00179; EGF_CA; 5.
DR SUPFAM; SSF57184; SSF57184; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 6.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS51041; EMI; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Calcium; Disulfide bond; EGF-like domain;
KW Reference proteome; Repeat; Signal.
FT SIGNAL 1..23
FT /evidence="ECO:0000255"
FT CHAIN 24..590
FT /note="EGF-like and EMI domain-containing protein 1"
FT /id="PRO_0000340647"
FT DOMAIN 44..104
FT /note="EMI"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00384"
FT DOMAIN 105..145
FT /note="EGF-like 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00076"
FT DOMAIN 164..204
FT /note="EGF-like 2; calcium-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00076"
FT DOMAIN 205..244
FT /note="EGF-like 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00076"
FT DOMAIN 245..285
FT /note="EGF-like 4; calcium-binding"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00076"
FT DOMAIN 445..481
FT /note="EGF-like 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00076"
FT REGION 393..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 109..125
FT /evidence="ECO:0000250"
FT DISULFID 135..144
FT /evidence="ECO:0000250"
FT DISULFID 168..179
FT /evidence="ECO:0000250"
FT DISULFID 175..188
FT /evidence="ECO:0000250"
FT DISULFID 190..203
FT /evidence="ECO:0000250"
FT DISULFID 209..219
FT /evidence="ECO:0000250"
FT DISULFID 215..228
FT /evidence="ECO:0000250"
FT DISULFID 230..243
FT /evidence="ECO:0000250"
FT DISULFID 249..260
FT /evidence="ECO:0000250"
FT DISULFID 256..269
FT /evidence="ECO:0000250"
FT DISULFID 271..284
FT /evidence="ECO:0000250"
FT DISULFID 449..462
FT /evidence="ECO:0000250"
FT DISULFID 456..469
FT /evidence="ECO:0000250"
FT DISULFID 471..480
FT /evidence="ECO:0000250"
FT VAR_SEQ 147..285
FT /note="EKNKHLESELTPGFLQKNVDECAVVNGGCQQRCINTLGTFHCECDTGYRRHA
FT DERTCIKTDPCAGANGCAHLCQTENGMARCACHAGYQLSEDKKACEDINECAGELAPCA
FT HHCVNSKGSFTCTCHPGFELGADRKHCY -> DINECAVDNGGCRDRCCNTIGSYYCRC
FT QAGQKLEEDGRGCEDVDECAVVNGGCQQRCINTLGTFHCECDTGYRRHADERTCI (in
FT isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_034213"
FT CONFLICT 517
FT /note="V -> A (in Ref. 3; AAI17811)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 590 AA; 65021 MW; 71EAB323CF34C4CF CRC64;
MTSPLCFWCF CVWAAANWPP GSALQLQPGM PNVCREEQLT LVRLSRPCAQ AFIDTIQFWK
QGCSGPRWCV GYERRIRYYI IYRHVYATEH QTVFRCCPGW IQWDDEPGCF SSLSSLGTHF
SGRECSYQDT RQCLCSQGFH GPHCQYEKNK HLESELTPGF LQKNVDECAV VNGGCQQRCI
NTLGTFHCEC DTGYRRHADE RTCIKTDPCA GANGCAHLCQ TENGMARCAC HAGYQLSEDK
KACEDINECA GELAPCAHHC VNSKGSFTCT CHPGFELGAD RKHCYRIELE IVNICEKNNG
GCSHHCEPAI GGAHCSCNHG HQLDTDGKTC IDFDECESGE ACCAQLCINY LGGYECSCEE
GFQISSDGCG CDALDEQLEE EEEEIDILRF PGRLAQNPPQ PFPYLDPSLT ASYEDEDNDD
ADSEAEGEVQ GLTALYRVVC LDGTFGLDCS LSCEDCMNGG RCQEGKSGCL CPAEWTGLIC
NESSVLRTGE DQQAPAGCLK GFFGKNCKRK CHCANNVHCH RVYGACMCDL GRYGRFCHLS
CPRGAYGASC SLECQCVEEN TLECSAKNGS CTCKSGYQGN RCQEELPLPA