ELF1_DROME
ID ELF1_DROME Reviewed; 1333 AA.
AC P13002; A0JQ61; A1ZB05; Q6NN37; Q86PE1; Q8MMA8; Q8MMB0; Q8MMB1; Q960H2;
AC Q9TYG4; Q9V889; Q9V890; Q9V891;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 24-OCT-2003, sequence version 3.
DT 03-AUG-2022, entry version 189.
DE RecName: Full=Protein grainyhead;
DE AltName: Full=DNA-binding protein ELF-1;
DE AltName: Full=Element I-binding activity;
DE AltName: Full=Protein grainy-head;
DE AltName: Full=Transcription factor NTF-1;
GN Name=grh; Synonyms=elf1; ORFNames=CG5058;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM I), FUNCTION, SUBCELLULAR LOCATION, AND
RP TISSUE SPECIFICITY.
RC STRAIN=Oregon-R; TISSUE=Embryo;
RX PubMed=2792757; DOI=10.1101/gad.3.8.1130;
RA Bray S.J., Burke B., Brown N.H., Hirsh J.;
RT "Embryonic expression pattern of a family of Drosophila proteins that
RT interact with a central nervous system regulatory element.";
RL Genes Dev. 3:1130-1145(1989).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM N).
RC STRAIN=Berkeley; TISSUE=Head, Larva, and Pupae;
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS H; K AND O).
RC STRAIN=Berkeley; TISSUE=Embryo;
RA Stapleton M., Carlson J.W., Frise E., Kapadia B., Park S., Wan K.H., Yu C.,
RA Celniker S.E.;
RL Submitted (NOV-2006) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] OF 1-415.
RC STRAIN=Oregon-R;
RX PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA Glover D.M.;
RT "From sequence to chromosome: the tip of the X chromosome of D.
RT melanogaster.";
RL Science 287:2220-2222(2000).
RN [7]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 198-1333 (ISOFORM I), AND FUNCTION.
RC TISSUE=Embryo;
RX PubMed=2606344; DOI=10.1101/gad.3.11.1677;
RA Dynlacht B.D., Attardi L.D., Admon A., Freeman M., Tjian R.;
RT "Functional analysis of NTF-1, a developmentally regulated Drosophila
RT transcription factor that binds neuronal cis elements.";
RL Genes Dev. 3:1677-1688(1989).
CC -!- FUNCTION: Binds a CNS-specific regulatory element of the Dopa
CC decarboxylase (Ddc) gene. Also interacts with sequences adjacent to
CC other transcription units, including Ultrabithorax (Ubx) and engrailed
CC (en). Activity in vivo may be required only at high levels transiently
CC to activate the expression of Ddc in the CNS.
CC {ECO:0000269|PubMed:2606344, ECO:0000269|PubMed:2792757}.
CC -!- INTERACTION:
CC P13002; Q94527: Rel; NbExp=3; IntAct=EBI-497032, EBI-869024;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:2792757}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=7;
CC Comment=Additional isoforms seem to exist.;
CC Name=J {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-6; Sequence=Displayed;
CC Name=I; Synonyms=A {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-1; Sequence=VSP_008611;
CC Name=H {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-2; Sequence=VSP_008611, VSP_008612;
CC Name=L {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-3; Sequence=VSP_008612;
CC Name=O {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-5; Sequence=VSP_008609, VSP_008612;
CC Name=N {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-4; Sequence=VSP_008609;
CC Name=K {ECO:0000312|FlyBase:FBgn0259211};
CC IsoId=P13002-8; Sequence=VSP_058160, VSP_008611;
CC -!- TISSUE SPECIFICITY: Restricted, during embryogenesis, to tissues
CC derived from ectoderm, predominantly the central nervous system (CNS)
CC and the epidermis. {ECO:0000269|PubMed:2792757}.
CC -!- SIMILARITY: Belongs to the grh/CP2 family. Grainyhead subfamily.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X15657; CAA33692.1; -; mRNA.
DR EMBL; AE013599; AAF57782.3; -; Genomic_DNA.
DR EMBL; AE013599; AAF57784.3; -; Genomic_DNA.
DR EMBL; AE013599; AAM68467.2; -; Genomic_DNA.
DR EMBL; AE013599; AAM68468.2; -; Genomic_DNA.
DR EMBL; AE013599; AAM68470.2; -; Genomic_DNA.
DR EMBL; AE013599; AAM68472.2; -; Genomic_DNA.
DR EMBL; AY052066; AAK93490.1; -; mRNA.
DR EMBL; BT001414; AAN71169.1; -; mRNA.
DR EMBL; BT003182; AAO24937.1; -; mRNA.
DR EMBL; BT029431; ABK57088.1; -; mRNA.
DR EMBL; AL035312; CAA22954.1; -; Genomic_DNA.
DR EMBL; BT011460; AAR99118.1; -; mRNA.
DR PIR; S06206; S06206.
DR RefSeq; NP_001286551.1; NM_001299622.1. [P13002-1]
DR RefSeq; NP_476842.2; NM_057494.4. [P13002-1]
DR RefSeq; NP_476843.2; NM_057495.4. [P13002-2]
DR RefSeq; NP_476844.2; NM_057496.5. [P13002-3]
DR RefSeq; NP_476845.2; NM_057497.4. [P13002-6]
DR RefSeq; NP_725723.2; NM_166250.4. [P13002-8]
DR RefSeq; NP_725725.1; NM_166252.3. [P13002-4]
DR AlphaFoldDB; P13002; -.
DR SMR; P13002; -.
DR BioGRID; 62723; 38.
DR DIP; DIP-59591N; -.
DR IntAct; P13002; 12.
DR STRING; 7227.FBpp0288984; -.
DR PaxDb; P13002; -.
DR DNASU; 37038; -.
DR EnsemblMetazoa; FBtr0299705; FBpp0288983; FBgn0259211. [P13002-1]
DR EnsemblMetazoa; FBtr0299706; FBpp0288984; FBgn0259211. [P13002-6]
DR EnsemblMetazoa; FBtr0299707; FBpp0288985; FBgn0259211. [P13002-8]
DR EnsemblMetazoa; FBtr0300538; FBpp0289765; FBgn0259211. [P13002-2]
DR EnsemblMetazoa; FBtr0300539; FBpp0289766; FBgn0259211. [P13002-3]
DR EnsemblMetazoa; FBtr0300541; FBpp0289768; FBgn0259211. [P13002-4]
DR EnsemblMetazoa; FBtr0306625; FBpp0297580; FBgn0259211. [P13002-5]
DR EnsemblMetazoa; FBtr0345791; FBpp0311781; FBgn0259211. [P13002-1]
DR GeneID; 37038; -.
DR KEGG; dme:Dmel_CG42311; -.
DR CTD; 37038; -.
DR FlyBase; FBgn0259211; grh.
DR VEuPathDB; VectorBase:FBgn0259211; -.
DR eggNOG; KOG4091; Eukaryota.
DR InParanoid; P13002; -.
DR OMA; QGVYQTS; -.
DR PhylomeDB; P13002; -.
DR SignaLink; P13002; -.
DR BioGRID-ORCS; 37038; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 37038; -.
DR PRO; PR:P13002; -.
DR Proteomes; UP000000803; Chromosome 2R.
DR Bgee; FBgn0259211; Expressed in wing disc and 39 other tissues.
DR ExpressionAtlas; P13002; baseline and differential.
DR Genevisible; P13002; DM.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0003677; F:DNA binding; IDA:FlyBase.
DR GO; GO:0001228; F:DNA-binding transcription activator activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IDA:FlyBase.
DR GO; GO:0042803; F:protein homodimerization activity; IPI:FlyBase.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0043565; F:sequence-specific DNA binding; IDA:FlyBase.
DR GO; GO:0040003; P:chitin-based cuticle development; IMP:FlyBase.
DR GO; GO:0008362; P:chitin-based embryonic cuticle biosynthetic process; IMP:FlyBase.
DR GO; GO:0003382; P:epithelial cell morphogenesis; IMP:FlyBase.
DR GO; GO:0008543; P:fibroblast growth factor receptor signaling pathway; IGI:FlyBase.
DR GO; GO:0007402; P:ganglion mother cell fate determination; TAS:FlyBase.
DR GO; GO:0061024; P:membrane organization; TAS:FlyBase.
DR GO; GO:0007399; P:nervous system development; IMP:FlyBase.
DR GO; GO:0007424; P:open tracheal system development; IEP:FlyBase.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEP:FlyBase.
DR GO; GO:0007464; P:R3/R4 cell fate commitment; IMP:FlyBase.
DR GO; GO:0042127; P:regulation of cell population proliferation; IMP:FlyBase.
DR GO; GO:0050767; P:regulation of neurogenesis; IMP:FlyBase.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0035159; P:regulation of tube length, open tracheal system; IMP:FlyBase.
DR GO; GO:0061041; P:regulation of wound healing; IMP:FlyBase.
DR GO; GO:0042052; P:rhabdomere development; IMP:FlyBase.
DR GO; GO:0007426; P:tracheal outgrowth, open tracheal system; IMP:FlyBase.
DR GO; GO:0007419; P:ventral cord development; HMP:FlyBase.
DR InterPro; IPR007604; CP2.
DR Pfam; PF04516; CP2; 1.
DR PROSITE; PS51968; GRH_CP2_DB; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..1333
FT /note="Protein grainyhead"
FT /id="PRO_0000086952"
FT DOMAIN 899..1125
FT /note="Grh/CP2 DB"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01313"
FT REGION 52..93
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..598
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 617..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 727..784
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 853..885
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..491
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 499..520
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..598
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 735..781
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 853..880
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..604
FT /note="Missing (in isoform O and isoform N)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_008609"
FT VAR_SEQ 2..414
FT /note="STSTATTSVITSNELSLSGHAHGHGHAHQLHQHTHSRLGVGVGVGILSDASL
FT SPIQQGSGGHSGGGNTNSSPLAPNGVPLLTTMHRSPDSPQPELATMTNVNVLDLHTDNS
FT KLYDKEAVFIYETPKVVMPADGGGGNNSDEGHAIDARIAAQMGNQAQQQQQQQQQTEHQ
FT PLAKIEFDENQIIRVVGPNGEQQQIISREIINGEHHILSRNEAGEHILTRIVSDPSKLM
FT PNDNAVATAMYNQAQKMNNDHGQAVYQTSPLPLDASVLHYSGGNDSNVIKTEADIYEDH
FT KKHAAAAAAAAGGGSIIYTTSDPNGVNVKQLPHLTVPQKLDPDLYQADKHIDLIYNDGS
FT KTVIYSTTDQKSLEIYSGGDIGSLVSDGQVVVQAGLPYATTTGAGGQPVYIVADGALPA
FT GVEEHLQ -> QEATSTTNYALSYTDWMSEPRRGFSLDSLHAGHVESDPQQQQHHQQLL
FT HHYYSPGDQVKDQGLVLPLNHQQDHHGHHLTAAGLMQAVAAEIQLNEDQDQLHPNANQN
FT SPYPIYSYYRSEGERANNGSPGADLPW (in isoform K)"
FT /evidence="ECO:0000305"
FT /id="VSP_058160"
FT VAR_SEQ 553..822
FT /note="Missing (in isoform H, isoform I and isoform K)"
FT /evidence="ECO:0000303|PubMed:2606344,
FT ECO:0000303|PubMed:2792757, ECO:0000303|Ref.5"
FT /id="VSP_008611"
FT VAR_SEQ 1133..1163
FT /note="Missing (in isoform H, isoform L and isoform O)"
FT /evidence="ECO:0000303|PubMed:12537569, ECO:0000303|Ref.5"
FT /id="VSP_008612"
FT CONFLICT 262
FT /note="P -> R (in Ref. 1; CAA33692)"
FT /evidence="ECO:0000305"
FT CONFLICT 415
FT /note="S -> R (in Ref. 6; CAA22954)"
FT /evidence="ECO:0000305"
FT CONFLICT 504
FT /note="I -> V (in Ref. 5; AAR99118)"
FT /evidence="ECO:0000305"
FT CONFLICT 610
FT /note="G -> C (in Ref. 4; AAK93490)"
FT /evidence="ECO:0000305"
FT CONFLICT 969
FT /note="K -> R (in Ref. 5; AAR99118)"
FT /evidence="ECO:0000305"
FT CONFLICT 999
FT /note="C -> V (in Ref. 1; CAA33692)"
FT /evidence="ECO:0000305"
FT CONFLICT 1006..1007
FT /note="NA -> KS (in Ref. 1; CAA33692)"
FT /evidence="ECO:0000305"
FT CONFLICT 1089
FT /note="K -> E (in Ref. 5; AAR99118)"
FT /evidence="ECO:0000305"
FT CONFLICT 1233
FT /note="R -> S (in Ref. 4; AAK93490)"
FT /evidence="ECO:0000305"
FT CONFLICT 1322
FT /note="L -> R (in Ref. 4; AAO24937)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1333 AA; 143915 MW; 71EBE69DAE6D071C CRC64;
MSTSTATTSV ITSNELSLSG HAHGHGHAHQ LHQHTHSRLG VGVGVGILSD ASLSPIQQGS
GGHSGGGNTN SSPLAPNGVP LLTTMHRSPD SPQPELATMT NVNVLDLHTD NSKLYDKEAV
FIYETPKVVM PADGGGGNNS DEGHAIDARI AAQMGNQAQQ QQQQQQQTEH QPLAKIEFDE
NQIIRVVGPN GEQQQIISRE IINGEHHILS RNEAGEHILT RIVSDPSKLM PNDNAVATAM
YNQAQKMNND HGQAVYQTSP LPLDASVLHY SGGNDSNVIK TEADIYEDHK KHAAAAAAAA
GGGSIIYTTS DPNGVNVKQL PHLTVPQKLD PDLYQADKHI DLIYNDGSKT VIYSTTDQKS
LEIYSGGDIG SLVSDGQVVV QAGLPYATTT GAGGQPVYIV ADGALPAGVE EHLQSGKLNG
QTTPIDVSGL SQNEIQGFLL GSHPSSSATV STTGVVSTTT ISHHQQQQQQ QQQQQQQQQQ
QHQQQQQHPG DIVSAAGVGS TGSIVSSAAQ QQQQQQLISI KREPEDLRKD PKNGNIAGAA
TANGPGSVIT QKILHVDAPT ASEADRPSTP SSSINSTENT ESDSQSVSGS ESGSPGARTT
ATLEMYATTG GTQIYLQTSH PSTASGAGGG AGPAGAAGGG GVSMQAQSPS PGPYITANDY
GMYTASRLPP GPPPTSTTTF IAEPSYYREY FAPDGQGGYV PASTRSLYGD VDVSVSQPGG
VVTYEGRFAG SVPPPATTTV LTSVHHHQQQ QQQQQQHQQQ QQQQQHHQQQ QHHSQDGKSN
GGATPLYAKA ITAAGLTVDL PSPDSGIGTD AITPRDQTNI QQSFDYTELC QPGTLIDANG
SIPVSVNSIQ QRTAVHGSQN SPTTSLVDTS TNGSTRSRPW HDFGRQNDAD KIQIPKIFTN
VGFRYHLESP ISSSQRREDD RITYINKGQF YGITLEYVHD AEKPIKNTTV KSVIMLMFRE
EKSPEDEIKA WQFWHSRQHS VKQRILDADT KNSVGLVGCI EEVSHNAIAV YWNPLESSAK
INIAVQCLST DFSSQKGVKG LPLHVQIDTF EDPRDTAVFH RGYCQIKVFC DKGAERKTRD
EERRAAKRKM TATGRKKLDE LYHPVTDRSE FYGMQDFAKP PVLFSPAEDM EKVGQLGIGA
ATGMTFNPLS NGNSNSNSHS SLQSFYGHET DSPDLKGASP FLLHGQKVAT PTLKFHNHFP
PDMQTDKKDH ILDQNMLTST PLTDFGPPMK RGRMTPPTSE RVMLYVRQEN EEVYTPLHVV
PPTTIGLLNA IENKYKISTT SINNIYRTNK KGITAKIDDD MISFYCNEDI FLLEVQQIED
DLYDVTLTEL PNQ