THEGL_MOUSE
ID THEGL_MOUSE Reviewed; 459 AA.
AC Q9DA15;
DT 18-APR-2012, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2001, sequence version 1.
DT 03-AUG-2022, entry version 111.
DE RecName: Full=Testicular haploid expressed gene protein-like;
GN Name=Thegl;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=C57BL/6J; TISSUE=Testis;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Testis;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK006270; BAB24494.1; -; mRNA.
DR EMBL; AC114666; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC165975; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH466524; EDL37925.1; -; Genomic_DNA.
DR EMBL; BC053422; AAH53422.1; -; mRNA.
DR CCDS; CCDS19369.1; -.
DR RefSeq; NP_082246.1; NM_027970.1.
DR AlphaFoldDB; Q9DA15; -.
DR BioGRID; 214992; 1.
DR STRING; 10090.ENSMUSP00000031161; -.
DR iPTMnet; Q9DA15; -.
DR PhosphoSitePlus; Q9DA15; -.
DR PaxDb; Q9DA15; -.
DR PRIDE; Q9DA15; -.
DR ProteomicsDB; 262915; -.
DR Antibodypedia; 76754; 4 antibodies from 4 providers.
DR DNASU; 71868; -.
DR Ensembl; ENSMUST00000031161; ENSMUSP00000031161; ENSMUSG00000029248.
DR Ensembl; ENSMUST00000117880; ENSMUSP00000112814; ENSMUSG00000029248.
DR GeneID; 71868; -.
DR KEGG; mmu:71868; -.
DR UCSC; uc008xvt.1; mouse.
DR CTD; 100506564; -.
DR MGI; MGI:1919118; Thegl.
DR VEuPathDB; HostDB:ENSMUSG00000029248; -.
DR eggNOG; ENOG502S0I9; Eukaryota.
DR GeneTree; ENSGT00940000154630; -.
DR HOGENOM; CLU_587870_0_0_1; -.
DR InParanoid; Q9DA15; -.
DR OMA; HIVYYDP; -.
DR OrthoDB; 1209550at2759; -.
DR PhylomeDB; Q9DA15; -.
DR TreeFam; TF329290; -.
DR BioGRID-ORCS; 71868; 1 hit in 41 CRISPR screens.
DR PRO; PR:Q9DA15; -.
DR Proteomes; UP000000589; Chromosome 5.
DR RNAct; Q9DA15; protein.
DR Bgee; ENSMUSG00000029248; Expressed in seminiferous tubule of testis and 51 other tissues.
DR ExpressionAtlas; Q9DA15; baseline and differential.
DR Genevisible; Q9DA15; MM.
DR InterPro; IPR006623; THEG.
DR InterPro; IPR042401; THEG-like.
DR PANTHER; PTHR15901; PTHR15901; 1.
DR Pfam; PF14912; THEG; 6.
DR SMART; SM00705; THEG; 8.
PE 2: Evidence at transcript level;
KW Reference proteome; Repeat.
FT CHAIN 1..459
FT /note="Testicular haploid expressed gene protein-like"
FT /id="PRO_0000416827"
FT REPEAT 172..190
FT /note="THEG 1"
FT REPEAT 212..231
FT /note="THEG 2"
FT REPEAT 258..277
FT /note="THEG 3"
FT REPEAT 291..310
FT /note="THEG 4"
FT REPEAT 327..346
FT /note="THEG 5"
FT REPEAT 367..386
FT /note="THEG 6"
FT REPEAT 403..422
FT /note="THEG 7"
FT REPEAT 440..459
FT /note="THEG 8"
FT REGION 1..138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..34
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 60..76
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 77..91
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 92..110
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 459 AA; 51801 MW; 92BA3C17B988A6C1 CRC64;
MEEGDFSGSS VRSEVTDGRN TTTTTETRTT SELQPKPLVL RLLEVQNGDE AEAVGEEGQE
EDYEGSKTHK SHEVSASFRS HNSSDPPQSR KASDSLRSRK GIEPLEPRKT SDSFRSLMGS
DPLQSSERQE DGKDDLFPNA VIMTSPSLIA RYLPRLQLAS LRAHPVTRDL VKKCFYSRKR
VQDLSKPKKQ WGTPDRRLFW GNQDPIRPVS EAALKAKLSK RIEDLAQPRL VSRHYVPNRI
QYYYSCGRES VIWEISPPAL VTRPSKRIQK LAKPNKFKAQ SLIKRETVPG TTRYSDPSPR
ILRLSIAKGT NPSYLPPKTL ETKISFSTLS AVATPRIVDL AHPRIKIEGL CYERERSELP
IRPVAPAALL ANPSKRTIFL AKSKRVHEDY LPIRDARWPV SYAATHSQVS ERVQELANPH
TRGPANLVYY DPNVFKVKPS ALKAHCSDRV KELAEPIVR