GSG1_HUMAN
ID GSG1_HUMAN Reviewed; 349 AA.
AC Q2KHT4; Q8N4M3; Q8NBR4; Q8NBS0; Q8NBT1; Q96LP9; Q96SI6; Q9BUY4;
DT 29-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 29-APR-2008, sequence version 2.
DT 03-AUG-2022, entry version 110.
DE RecName: Full=Germ cell-specific gene 1 protein;
GN Name=GSG1; ORFNames=UNQ709/PRO1360;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
RX PubMed=12975309; DOI=10.1101/gr.1293003;
RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J.,
RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,
RA Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E., Heldens S., Huang A.,
RA Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D.,
RA Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L.,
RA Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C.,
RA Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J.,
RA Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.;
RT "The secreted protein discovery initiative (SPDI), a large-scale effort to
RT identify novel human secreted and transmembrane proteins: a bioinformatics
RT assessment.";
RL Genome Res. 13:2265-2270(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 3; 4; 5; 7 AND 8).
RC TISSUE=Retinoblastoma, and Testis;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16541075; DOI=10.1038/nature04569;
RA Scherer S.E., Muzny D.M., Buhay C.J., Chen R., Cree A., Ding Y.,
RA Dugan-Rocha S., Gill R., Gunaratne P., Harris R.A., Hawes A.C.,
RA Hernandez J., Hodgson A.V., Hume J., Jackson A., Khan Z.M., Kovar-Smith C.,
RA Lewis L.R., Lozado R.J., Metzker M.L., Milosavljevic A., Miner G.R.,
RA Montgomery K.T., Morgan M.B., Nazareth L.V., Scott G., Sodergren E.,
RA Song X.-Z., Steffen D., Lovering R.C., Wheeler D.A., Worley K.C., Yuan Y.,
RA Zhang Z., Adams C.Q., Ansari-Lari M.A., Ayele M., Brown M.J., Chen G.,
RA Chen Z., Clerc-Blankenburg K.P., Davis C., Delgado O., Dinh H.H.,
RA Draper H., Gonzalez-Garay M.L., Havlak P., Jackson L.R., Jacob L.S.,
RA Kelly S.H., Li L., Li Z., Liu J., Liu W., Lu J., Maheshwari M.,
RA Nguyen B.-V., Okwuonu G.O., Pasternak S., Perez L.M., Plopper F.J.H.,
RA Santibanez J., Shen H., Tabor P.E., Verduzco D., Waldron L., Wang Q.,
RA Williams G.A., Zhang J., Zhou J., Allen C.C., Amin A.G., Anyalebechi V.,
RA Bailey M., Barbaria J.A., Bimage K.E., Bryant N.P., Burch P.E.,
RA Burkett C.E., Burrell K.L., Calderon E., Cardenas V., Carter K., Casias K.,
RA Cavazos I., Cavazos S.R., Ceasar H., Chacko J., Chan S.N., Chavez D.,
RA Christopoulos C., Chu J., Cockrell R., Cox C.D., Dang M., Dathorne S.R.,
RA David R., Davis C.M., Davy-Carroll L., Deshazo D.R., Donlin J.E.,
RA D'Souza L., Eaves K.A., Egan A., Emery-Cohen A.J., Escotto M., Flagg N.,
RA Forbes L.D., Gabisi A.M., Garza M., Hamilton C., Henderson N.,
RA Hernandez O., Hines S., Hogues M.E., Huang M., Idlebird D.G., Johnson R.,
RA Jolivet A., Jones S., Kagan R., King L.M., Leal B., Lebow H., Lee S.,
RA LeVan J.M., Lewis L.C., London P., Lorensuhewa L.M., Loulseged H.,
RA Lovett D.A., Lucier A., Lucier R.L., Ma J., Madu R.C., Mapua P.,
RA Martindale A.D., Martinez E., Massey E., Mawhiney S., Meador M.G.,
RA Mendez S., Mercado C., Mercado I.C., Merritt C.E., Miner Z.L., Minja E.,
RA Mitchell T., Mohabbat F., Mohabbat K., Montgomery B., Moore N., Morris S.,
RA Munidasa M., Ngo R.N., Nguyen N.B., Nickerson E., Nwaokelemeh O.O.,
RA Nwokenkwo S., Obregon M., Oguh M., Oragunye N., Oviedo R.J., Parish B.J.,
RA Parker D.N., Parrish J., Parks K.L., Paul H.A., Payton B.A., Perez A.,
RA Perrin W., Pickens A., Primus E.L., Pu L.-L., Puazo M., Quiles M.M.,
RA Quiroz J.B., Rabata D., Reeves K., Ruiz S.J., Shao H., Sisson I.,
RA Sonaike T., Sorelle R.P., Sutton A.E., Svatek A.F., Svetz L.A.,
RA Tamerisa K.S., Taylor T.R., Teague B., Thomas N., Thorn R.D., Trejos Z.Y.,
RA Trevino B.K., Ukegbu O.N., Urban J.B., Vasquez L.I., Vera V.A.,
RA Villasana D.M., Wang L., Ward-Moore S., Warren J.T., Wei X., White F.,
RA Williamson A.L., Wleczyk R., Wooden H.S., Wooden S.H., Yen J., Yoon L.,
RA Yoon V., Zorrilla S.E., Nelson D., Kucherlapati R., Weinstock G.,
RA Gibbs R.A.;
RT "The finished DNA sequence of human chromosome 12.";
RL Nature 440:346-351(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2; 4 AND 6), AND VARIANT
RP LEU-39.
RC TISSUE=Brain, and Eye;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- FUNCTION: May cause the redistribution of PAPOLB from the cytosol to
CC the endoplasmic reticulum. {ECO:0000250}.
CC -!- SUBUNIT: Interacts with PAPOLB. {ECO:0000250}.
CC -!- INTERACTION:
CC Q2KHT4; Q12933: TRAF2; NbExp=3; IntAct=EBI-10239244, EBI-355744;
CC Q2KHT4-3; P53365: ARFIP2; NbExp=3; IntAct=EBI-12951679, EBI-638194;
CC Q2KHT4-3; Q9BXU8: FTHL17; NbExp=3; IntAct=EBI-12951679, EBI-12156897;
CC Q2KHT4-3; Q96C03-3: MIEF2; NbExp=3; IntAct=EBI-12951679, EBI-11988931;
CC Q2KHT4-3; Q9UKF7-2: PITPNC1; NbExp=3; IntAct=EBI-12951679, EBI-14223623;
CC Q2KHT4-3; O00560: SDCBP; NbExp=3; IntAct=EBI-12951679, EBI-727004;
CC Q2KHT4-3; Q9H0W8: SMG9; NbExp=3; IntAct=EBI-12951679, EBI-2872322;
CC Q2KHT4-3; Q17RD7: SYT16; NbExp=3; IntAct=EBI-12951679, EBI-10238936;
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane {ECO:0000250};
CC Multi-pass membrane protein {ECO:0000250}. Note=Colocalizes with PAPOLB
CC in the endoplasmic reticulum. {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=8;
CC Name=1;
CC IsoId=Q2KHT4-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q2KHT4-2; Sequence=VSP_032997;
CC Name=3;
CC IsoId=Q2KHT4-3; Sequence=VSP_032993;
CC Name=4;
CC IsoId=Q2KHT4-4; Sequence=VSP_032993, VSP_032994;
CC Name=5;
CC IsoId=Q2KHT4-5; Sequence=VSP_032993, VSP_032997;
CC Name=6;
CC IsoId=Q2KHT4-6; Sequence=VSP_032992, VSP_032993;
CC Name=7;
CC IsoId=Q2KHT4-7; Sequence=VSP_032992, VSP_032995;
CC Name=8;
CC IsoId=Q2KHT4-8; Sequence=VSP_032993, VSP_032996;
CC -!- SIMILARITY: Belongs to the GSG1 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH01796.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC Sequence=AAH33854.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY358513; AAQ88877.1; -; mRNA.
DR EMBL; AK027894; BAB55437.1; -; mRNA.
DR EMBL; AK058045; BAB71638.1; -; mRNA.
DR EMBL; AK075288; BAC11524.1; -; mRNA.
DR EMBL; AK075311; BAC11540.1; -; mRNA.
DR EMBL; AK075322; BAC11548.1; -; mRNA.
DR EMBL; AC023790; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC079628; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471094; EAW96299.1; -; Genomic_DNA.
DR EMBL; BC001796; AAH01796.1; ALT_INIT; mRNA.
DR EMBL; BC033854; AAH33854.1; ALT_INIT; mRNA.
DR EMBL; BC112896; AAI12897.1; -; mRNA.
DR CCDS; CCDS44835.1; -. [Q2KHT4-3]
DR CCDS; CCDS55806.1; -. [Q2KHT4-5]
DR CCDS; CCDS55807.1; -. [Q2KHT4-2]
DR CCDS; CCDS55808.1; -. [Q2KHT4-6]
DR CCDS; CCDS8659.2; -. [Q2KHT4-4]
DR RefSeq; NP_001074023.1; NM_001080554.2.
DR RefSeq; NP_001193771.1; NM_001206842.1. [Q2KHT4-6]
DR RefSeq; NP_001193772.1; NM_001206843.1. [Q2KHT4-2]
DR RefSeq; NP_001193774.1; NM_001206845.1. [Q2KHT4-5]
DR RefSeq; NP_112579.2; NM_031289.3. [Q2KHT4-4]
DR AlphaFoldDB; Q2KHT4; -.
DR SMR; Q2KHT4; -.
DR BioGRID; 123647; 14.
DR IntAct; Q2KHT4; 10.
DR STRING; 9606.ENSP00000405032; -.
DR TCDB; 8.A.16.5.1; the ca(+) channel auxiliary subunit Gama1-Gama8 (ccaGama) family.
DR PhosphoSitePlus; Q2KHT4; -.
DR BioMuta; GSG1; -.
DR DMDM; 187471163; -.
DR MassIVE; Q2KHT4; -.
DR PeptideAtlas; Q2KHT4; -.
DR PRIDE; Q2KHT4; -.
DR Antibodypedia; 23587; 131 antibodies from 18 providers.
DR DNASU; 83445; -.
DR Ensembl; ENST00000337630.10; ENSP00000336816.6; ENSG00000111305.19. [Q2KHT4-3]
DR Ensembl; ENST00000396302.7; ENSP00000379596.3; ENSG00000111305.19. [Q2KHT4-4]
DR Ensembl; ENST00000432710.6; ENSP00000405032.2; ENSG00000111305.19. [Q2KHT4-6]
DR Ensembl; ENST00000457134.6; ENSP00000398384.2; ENSG00000111305.19. [Q2KHT4-5]
DR Ensembl; ENST00000537302.5; ENSP00000441718.1; ENSG00000111305.19. [Q2KHT4-2]
DR GeneID; 83445; -.
DR KEGG; hsa:83445; -.
DR UCSC; uc001rbj.4; human. [Q2KHT4-1]
DR CTD; 83445; -.
DR GeneCards; GSG1; -.
DR HGNC; HGNC:19716; GSG1.
DR HPA; ENSG00000111305; Tissue enriched (testis).
DR neXtProt; NX_Q2KHT4; -.
DR OpenTargets; ENSG00000111305; -.
DR PharmGKB; PA134868096; -.
DR VEuPathDB; HostDB:ENSG00000111305; -.
DR eggNOG; ENOG502QRSH; Eukaryota.
DR GeneTree; ENSGT01050000244814; -.
DR HOGENOM; CLU_063057_0_0_1; -.
DR InParanoid; Q2KHT4; -.
DR OrthoDB; 957556at2759; -.
DR PhylomeDB; Q2KHT4; -.
DR TreeFam; TF331388; -.
DR PathwayCommons; Q2KHT4; -.
DR SignaLink; Q2KHT4; -.
DR BioGRID-ORCS; 83445; 8 hits in 1069 CRISPR screens.
DR ChiTaRS; GSG1; human.
DR GenomeRNAi; 83445; -.
DR Pharos; Q2KHT4; Tdark.
DR PRO; PR:Q2KHT4; -.
DR Proteomes; UP000005640; Chromosome 12.
DR RNAct; Q2KHT4; protein.
DR Bgee; ENSG00000111305; Expressed in left testis and 110 other tissues.
DR ExpressionAtlas; Q2KHT4; baseline and differential.
DR Genevisible; Q2KHT4; HS.
DR GO; GO:0005789; C:endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IBA:GO_Central.
DR InterPro; IPR012478; GSG-1.
DR Pfam; PF07803; GSG-1; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Endoplasmic reticulum; Membrane; Reference proteome;
KW Transmembrane; Transmembrane helix.
FT CHAIN 1..349
FT /note="Germ cell-specific gene 1 protein"
FT /id="PRO_0000329461"
FT TRANSMEM 20..40
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 158..178
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 189..209
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TRANSMEM 233..253
FT /note="Helical"
FT /evidence="ECO:0000255"
FT VAR_SEQ 1..3
FT /note="MAK -> MSDPSQLTQNVCLTQE (in isoform 6 and isoform
FT 7)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_032992"
FT VAR_SEQ 109..131
FT /note="Missing (in isoform 3, isoform 4, isoform 5, isoform
FT 6 and isoform 8)"
FT /evidence="ECO:0000303|PubMed:12975309,
FT ECO:0000303|PubMed:14702039, ECO:0000303|PubMed:15489334"
FT /id="VSP_032993"
FT VAR_SEQ 148..349
FT /note="EILWLSLGTQITYIGLQFISFLLLLTDLLLTGNPACGLKLSAFAAVSSVLSG
FT LLGMVAHMMYSQVFQATVNLGPEDWRPHVWNYGWAFYMAWLSFTCCMASAVTTFNTYTR
FT MVLEFKCKHSKSFKENPNCLPHHHQCFPRRLSSAAPTVGPLTSYHQYHNQPIHSVSEGV
FT DFYSELRNKGFQRGASQELKEAVRSSVEEEQC -> GEKGLLEFATLQGPCHPTLRFGG
FT KRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQLPPATNRLATHWEPCLWAQTE
FT RLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLELWLGLLHGLALLHLLHG
FT VGCHHLQHVHQDGAGVQVQA (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:12975309,
FT ECO:0000303|PubMed:14702039, ECO:0000303|PubMed:15489334"
FT /id="VSP_032994"
FT VAR_SEQ 148..349
FT /note="EILWLSLGTQITYIGLQFISFLLLLTDLLLTGNPACGLKLSAFAAVSSVLSG
FT LLGMVAHMMYSQVFQATVNLGPEDWRPHVWNYGWAFYMAWLSFTCCMASAVTTFNTYTR
FT MVLEFKCKHSKSFKENPNCLPHHHQCFPRRLSSAAPTVGPLTSYHQYHNQPIHSVSEGV
FT DFYSELRNKGFQRGASQELKEAVRSSVEEEQC -> GEKGLLEFATLQGPCHPTLRFGG
FT KRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQLPPATNRLATHWEPCLWAQTE
FT RLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLELWLGLLHGLALLHLLHG
FT VGCHHLQHVHQDGAGVQVQPPPWVL (in isoform 7)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_032995"
FT VAR_SEQ 148..280
FT /note="EILWLSLGTQITYIGLQFISFLLLLTDLLLTGNPACGLKLSAFAAVSSVLSG
FT LLGMVAHMMYSQVFQATVNLGPEDWRPHVWNYGWAFYMAWLSFTCCMASAVTTFNTYTR
FT MVLEFKCKHSKSFKENPNCLPH -> GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLP
FT SPPLGLCGKNPMVIPGNADHLHRTSTHQLPPATNRLATHWEPCLWAQTERLCCCFLCPV
FT RSPGDGGPHDVFTSLPSDCQLGSRRLETTCLELWLGLLHGLALLHLLHGVGCHHLQHV
FT (in isoform 8)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_032996"
FT VAR_SEQ 148..198
FT /note="Missing (in isoform 2 and isoform 5)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_032997"
FT VARIANT 39
FT /note="F -> L (in dbSNP:rs2306765)"
FT /evidence="ECO:0000269|PubMed:15489334"
FT /id="VAR_042684"
FT VARIANT 67
FT /note="G -> V (in dbSNP:rs11546332)"
FT /id="VAR_042685"
FT CONFLICT 32
FT /note="S -> P (in Ref. 2; BAC11548)"
FT /evidence="ECO:0000305"
FT CONFLICT 259
FT /note="M -> V (in Ref. 2; BAC11540)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 349 AA; 39248 MW; 72AB9D98344023B5 CRC64;
MAKMELSKAF SGQRTLLSAI LSMLSLSFST TSLLSNYWFV GTQKVPKPLC EKGLAAKCFD
MPVSLDGDTN TSTQEVVQYN WETGDDRFSF RSFRSGMWLS CEETVEEPAL LHPQSWKQFR
ALRSSGTAAA KGERCRSFIE LTPPAKREIL WLSLGTQITY IGLQFISFLL LLTDLLLTGN
PACGLKLSAF AAVSSVLSGL LGMVAHMMYS QVFQATVNLG PEDWRPHVWN YGWAFYMAWL
SFTCCMASAV TTFNTYTRMV LEFKCKHSKS FKENPNCLPH HHQCFPRRLS SAAPTVGPLT
SYHQYHNQPI HSVSEGVDFY SELRNKGFQR GASQELKEAV RSSVEEEQC