7LESS_DROME
ID 7LESS_DROME Reviewed; 2554 AA.
AC P13368; Q9TYI0; Q9U5V7; Q9VZ36;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 20-JUN-2001, sequence version 2.
DT 03-AUG-2022, entry version 196.
DE RecName: Full=Protein sevenless;
DE EC=2.7.10.1;
GN Name=sev; Synonyms=HD-265; ORFNames=CG18085;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], FUNCTION, AND MUTAGENESIS OF LYS-2242.
RC STRAIN=Canton-S;
RX PubMed=2840202; DOI=10.1016/0092-8674(88)90193-6;
RA Basler K., Hafen E.;
RT "Control of photoreceptor cell fate by the sevenless protein requires a
RT functional tyrosine kinase domain.";
RL Cell 54:299-311(1988).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=Oregon-R;
RX PubMed=3138161; DOI=10.1101/gad.2.6.620;
RA Bowtell D.L.L., Simon M.A., Rubin G.M.;
RT "Nucleotide sequence and structure of the sevenless gene of Drosophila
RT melanogaster.";
RL Genes Dev. 2:620-634(1988).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [5]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 2349-2408.
RX PubMed=9731193; DOI=10.1006/bbrc.1998.9003;
RA Oates A.C., Wollberg P., Achen M.G., Wilks A.F.;
RT "Sampling the genomic pool of protein tyrosine kinase genes using the
RT polymerase chain reaction with genomic DNA.";
RL Biochem. Biophys. Res. Commun. 249:660-667(1998).
RN [6]
RP IDENTIFICATION OF FN-III REPEATS.
RX PubMed=2317871; DOI=10.1016/0092-8674(90)90209-w;
RA Norton P.A., Hynes R.O., Ress D.J.G.;
RT "Sevenless: seven found?";
RL Cell 61:15-16(1990).
RN [7]
RP INTERACTION WITH DAB.
RX PubMed=9671493; DOI=10.1128/mcb.18.8.4844;
RA Le N., Simon M.A.;
RT "Disabled is a putative adaptor protein that functions during signaling by
RT the sevenless receptor tyrosine kinase.";
RL Mol. Cell. Biol. 18:4844-4854(1998).
CC -!- FUNCTION: Receptor for an extracellular signal required to instruct a
CC cell to differentiate into an R7 photoreceptor. The ligand for sev is
CC the boss (bride of sevenless) protein on the surface of the neighboring
CC R8 cell. {ECO:0000269|PubMed:2840202}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + L-tyrosyl-[protein] = ADP + H(+) + O-phospho-L-tyrosyl-
CC [protein]; Xref=Rhea:RHEA:10596, Rhea:RHEA-COMP:10136, Rhea:RHEA-
CC COMP:10137, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616, ChEBI:CHEBI:46858,
CC ChEBI:CHEBI:82620, ChEBI:CHEBI:456216; EC=2.7.10.1;
CC Evidence={ECO:0000255|PROSITE-ProRule:PRU10028};
CC -!- SUBUNIT: May form a complex with drk and Sos. Binds the phosphotyrosine
CC interaction domain (PID) of Dab.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000305}; Single-pass membrane
CC protein {ECO:0000305}.
CC -!- DOMAIN: It is unclear whether the potential membrane spanning region
CC near the N-terminus is present as a transmembrane domain in the native
CC protein or serves as a cleaved signal sequence.
CC -!- SIMILARITY: Belongs to the protein kinase superfamily. Tyr protein
CC kinase family. Insulin receptor subfamily. {ECO:0000255|PROSITE-
CC ProRule:PRU00159}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA31960.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; J03158; AAA28882.1; -; Genomic_DNA.
DR EMBL; X13666; CAA31960.1; ALT_INIT; mRNA.
DR EMBL; X13666; CAB55310.1; -; mRNA.
DR EMBL; AE014298; AAF47992.2; -; Genomic_DNA.
DR EMBL; AJ002917; CAA05752.1; -; Genomic_DNA.
DR PIR; A28912; TVFF7L.
DR RefSeq; NP_511114.2; NM_078559.3.
DR AlphaFoldDB; P13368; -.
DR SMR; P13368; -.
DR BioGRID; 58454; 45.
DR IntAct; P13368; 6.
DR STRING; 7227.FBpp0073249; -.
DR GlyGen; P13368; 19 sites.
DR PaxDb; P13368; -.
DR PRIDE; P13368; -.
DR EnsemblMetazoa; FBtr0073393; FBpp0073249; FBgn0003366.
DR GeneID; 32039; -.
DR KEGG; dme:Dmel_CG18085; -.
DR UCSC; CG18085-RA; d. melanogaster.
DR CTD; 32039; -.
DR FlyBase; FBgn0003366; sev.
DR VEuPathDB; VectorBase:FBgn0003366; -.
DR eggNOG; KOG1095; Eukaryota.
DR HOGENOM; CLU_000762_0_0_1; -.
DR InParanoid; P13368; -.
DR PhylomeDB; P13368; -.
DR BRENDA; 2.7.10.1; 1994.
DR Reactome; R-DME-9009391; Extra-nuclear estrogen signaling.
DR SignaLink; P13368; -.
DR BioGRID-ORCS; 32039; 0 hits in 3 CRISPR screens.
DR ChiTaRS; sev; fly.
DR GenomeRNAi; 32039; -.
DR PRO; PR:P13368; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0003366; Expressed in seminal fluid secreting gland and 10 other tissues.
DR ExpressionAtlas; P13368; baseline and differential.
DR Genevisible; P13368; DM.
DR GO; GO:0005887; C:integral component of plasma membrane; IDA:FlyBase.
DR GO; GO:0043235; C:receptor complex; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0008288; F:boss receptor activity; IDA:FlyBase.
DR GO; GO:0019199; F:transmembrane receptor protein kinase activity; IDA:FlyBase.
DR GO; GO:0004714; F:transmembrane receptor protein tyrosine kinase activity; IBA:GO_Central.
DR GO; GO:0060250; P:germ-line stem-cell niche homeostasis; IMP:FlyBase.
DR GO; GO:0038083; P:peptidyl-tyrosine autophosphorylation; IDA:FlyBase.
DR GO; GO:0033674; P:positive regulation of kinase activity; IBA:GO_Central.
DR GO; GO:0007465; P:R7 cell fate commitment; IMP:FlyBase.
DR GO; GO:0032006; P:regulation of TOR signaling; IBA:GO_Central.
DR GO; GO:0045500; P:sevenless signaling pathway; IDA:FlyBase.
DR GO; GO:0007169; P:transmembrane receptor protein tyrosine kinase signaling pathway; IBA:GO_Central.
DR GO; GO:0007601; P:visual perception; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 5.
DR Gene3D; 2.120.10.30; -; 1.
DR Gene3D; 2.60.40.10; -; 5.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR011009; Kinase-like_dom_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR000719; Prot_kinase_dom.
DR InterPro; IPR017441; Protein_kinase_ATP_BS.
DR InterPro; IPR001245; Ser-Thr/Tyr_kinase_cat_dom.
DR InterPro; IPR008266; Tyr_kinase_AS.
DR InterPro; IPR020635; Tyr_kinase_cat_dom.
DR InterPro; IPR002011; Tyr_kinase_rcpt_2_CS.
DR Pfam; PF00041; fn3; 2.
DR Pfam; PF07714; PK_Tyr_Ser-Thr; 1.
DR PRINTS; PR00109; TYRKINASE.
DR SMART; SM00060; FN3; 7.
DR SMART; SM00135; LY; 2.
DR SMART; SM00219; TyrKc; 1.
DR SUPFAM; SSF49265; SSF49265; 5.
DR SUPFAM; SSF56112; SSF56112; 1.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS00107; PROTEIN_KINASE_ATP; 1.
DR PROSITE; PS50011; PROTEIN_KINASE_DOM; 1.
DR PROSITE; PS00109; PROTEIN_KINASE_TYR; 1.
DR PROSITE; PS00239; RECEPTOR_TYR_KIN_II; 1.
PE 1: Evidence at protein level;
KW ATP-binding; Cell membrane; Glycoprotein; Kinase; Membrane;
KW Nucleotide-binding; Phosphoprotein; Receptor; Reference proteome; Repeat;
KW Sensory transduction; Transferase; Transmembrane; Transmembrane helix;
KW Tyrosine-protein kinase; Vision.
FT CHAIN 1..2554
FT /note="Protein sevenless"
FT /id="PRO_0000058928"
FT TOPO_DOM 1..2123
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 2124..2147
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 2148..2554
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT DOMAIN 440..533
FT /note="Fibronectin type-III 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 824..924
FT /note="Fibronectin type-III 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT REPEAT 1010..1053
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000269|PubMed:2317871"
FT DOMAIN 1202..1290
FT /note="Fibronectin type-III 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 1294..1397
FT /note="Fibronectin type-III 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 1801..1901
FT /note="Fibronectin type-III 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 1902..1988
FT /note="Fibronectin type-III 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 1995..2117
FT /note="Fibronectin type-III 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00316"
FT DOMAIN 2209..2485
FT /note="Protein kinase"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00159"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 51..75
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 181..208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2515..2534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..208
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 2343
FT /note="Proton acceptor"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00159,
FT ECO:0000255|PROSITE-ProRule:PRU10028"
FT BINDING 2215..2223
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00159"
FT BINDING 2242
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00159"
FT MOD_RES 2380
FT /note="Phosphotyrosine; by autocatalysis"
FT /evidence="ECO:0000250"
FT CARBOHYD 30
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 129
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 481
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 505
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 617
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 647
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 966
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1228
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1313
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1353
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1550
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1557
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1639
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1725
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1756
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1804
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1889
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1947
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 2073
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT MUTAGEN 2242
FT /note="K->M: Inactivates the protein."
FT /evidence="ECO:0000269|PubMed:2840202"
FT CONFLICT 392
FT /note="V -> M (in Ref. 1; AAA28882)"
FT /evidence="ECO:0000305"
FT CONFLICT 663
FT /note="A -> T (in Ref. 3; AAF47992)"
FT /evidence="ECO:0000305"
FT CONFLICT 1703
FT /note="N -> H (in Ref. 3; AAF47992)"
FT /evidence="ECO:0000305"
FT CONFLICT 1730..1731
FT /note="RG -> KE (in Ref. 3; AAF47992)"
FT /evidence="ECO:0000305"
FT CONFLICT 1741
FT /note="V -> M (in Ref. 3; AAF47992)"
FT /evidence="ECO:0000305"
FT CONFLICT 1823
FT /note="E -> Q (in Ref. 2; CAA31960/CAB55310)"
FT /evidence="ECO:0000305"
FT CONFLICT 2271
FT /note="C -> R (in Ref. 1; AAA28882)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 2554 AA; 287025 MW; 09E238A0F27684F8 CRC64;
MTMFWQQNVD HQSDEQDKQA KGAAPTKRLN ISFNVKIAVN VNTKMTTTHI NQQAPGTSSS
SSNSQNASPS KIVVRQQSSS FDLRQQLARL GRQLASGQDG HGGISTILII NLLLLILLSI
CCDVCRSHNY TVHQSPEPVS KDQMRLLRPK LDSDVVEKVA IWHKHAAAAP PSIVEGIAIS
SRPQSTMAHH PDDRDRDRDP SEEQHGVDER MVLERVTRDC VQRCIVEEDL FLDEFGIQCE
KADNGEKCYK TRCTKGCAQW YRALKELESC QEACLSLQFY PYDMPCIGAC EMAQRDYWHL
QRLAISHLVE RTQPQLERAP RADGQSTPLT IRWAMHFPEH YLASRPFNIQ YQFVDHHGEE
LDLEQEDQDA SGETGSSAWF NLADYDCDEY YVCEILEALI PYTQYRFRFE LPFGENRDEV
LYSPATPAYQ TPPEGAPISA PVIEHLMGLD DSHLAVHWHP GRFTNGPIEG YRLRLSSSEG
NATSEQLVPA GRGSYIFSQL QAGTNYTLAL SMINKQGEGP VAKGFVQTHS ARNEKPAKDL
TESVLLVGRR AVMWQSLEPA GENSMIYQSQ EELADIAWSK REQQLWLLNV HGELRSLKFE
SGQMVSPAQQ LKLDLGNISS GRWVPRRLSF DWLHHRLYFA MESPERNQSS FQIISTDLLG
ESAQKVGESF DLPVEQLEVD ALNGWIFWRN EESLWRQDLH GRMIHRLLRI RQPGWFLVQP
QHFIIHLMLP QEGKFLEISY DGGFKHPLPL PPPSNGAGNG PASSHWQSFA LLGRSLLLPD
SGQLILVEQQ GQAASPSASW PLKNLPDCWA VILLVPESQP LTSAGGKPHS LKALLGAQAA
KISWKEPERN PYQSADAARS WSYELEVLDV ASQSAFSIRN IRGPIFGLQR LQPDNLYQLR
VRAINVDGEP GEWTEPLAAR TWPLGPHRLR WASRQGSVIH TNELGEGLEV QQEQLERLPG
PMTMVNESVG YYVTGDGLLH CINLVHSQWG CPISEPLQHV GSVTYDWRGG RVYWTDLARN
CVVRMDPWSG SRELLPVFEA NFLALDPRQG HLYYATSSQL SRHGSTPDEA VTYYRVNGLE
GSIASFVLDT QQDQLFWLVK GSGALRLYRA PLTAGGDSLQ MIQQIKGVFQ AVPDSLQLLR
PLGALLWLER SGRRARLVRL AAPLDVMELP TPDQASPASA LQLLDPQPLP PRDEGVIPMT
VLPDSVRLDD GHWDDFHVRW QPSTSGGNHS VSYRLLLEFG QRLQTLDLST PFARLTQLPQ
AQLQLKISIT PRTAWRSGDT TRVQLTTPPV APSQPRRLRV FVERLATALQ EANVSAVLRW
DAPEQGQEAP MQALEYHISC WVGSELHEEL RLNQSALEAR VEHLQPDQTY HFQVEARVAA
TGAAAGAASH ALHVAPEVQA VPRVLYANAE FIGELDLDTR NRRRLVHTAS PVEHLVGIEG
EQRLLWVNEH VELLTHVPGS APAKLARMRA EVLALAVDWI QRIVYWAELD ATAPQAAIIY
RLDLCNFEGK ILQGERVWST PRGRLLKDLV ALPQAQSLIW LEYEQGSPRN GSLRGRNLTD
GSELEWATVQ PLIRLHAGSL EPGSETLNLV DNQGKLCVYD VARQLCTASA LRAQLNLLGE
DSIAGQLAQD SGYLYAVKNW SIRAYGRRRQ QLEYTVELEP EEVRLLQAHN YQAYPPKNCL
LLPSSGGSLL KATDCEEQRC LLNLPMITAS EDCPLPIPGV RYQLNLTLAR GPGSEEHDHG
VEPLGQWLLG AGESLNLTDL LPFTRYRVSG ILSSFYQKKL ALPTLVLAPL ELLTASATPS
PPRNFSVRVL SPRELEVSWL PPEQLRSESV YYTLHWQQEL DGENVQDRRE WEAHERRLET
AGTHRLTGIK PGSGYSLWVQ AHATPTKSNS SERLHVRSFA ELPELQLLEL GPYSLSLTWA
GTPDPLGSLQ LECRSSAEQL RRNVAGNHTK MVVEPLQPRT RYQCRLLLGY AATPGAPLYH
GTAEVYETLG DAPSQPGKPQ LEHIAEEVFR VTWTAARGNG APIALYNLEA LQARSDIRRR
RRRRRRNSGG SLEQLPWAEE PVVVEDQWLD FCNTTELSCI VKSLHSSRLL LFRVRARSLE
HGWGPYSEES ERVAEPFVSP EKRGSLVLAI IAPAAIVSSC VLALVLVRKV QKRRLRAKKL
LQQSRPSIWS NLSTLQTQQQ LMAVRNRAFS TTLSDADIAL LPQINWSQLK LLRFLGSGAF
GEVYEGQLKT EDSEEPQRVA IKSLRKGASE FAELLQEAQL MSNFKHENIV CLVGICFDTE
SISLIMEHME AGDLLSYLRA ARATSTQEPQ PTAGLSLSEL LAMCIDVANG CSYLEDMHFV
HRDLACRNCL VTESTGSTDR RRTVKIGDFG LARDIYKSDY YRKEGEGLLP VRWMSPESLV
DGLFTTQSDV WAFGVLCWEI LTLGQQPYAA RNNFEVLAHV KEGGRLQQPP MCTEKLYSLL
LLCWRTDPWE RPSFRRCYNT LHAISTDLRR TQMASATADT VVSCSRPEFK VRFDGQPLEE
HREHNERPED ENLTLREVPL KDKQLYANEG VSRL