Y34F_DROME
ID Y34F_DROME Reviewed; 1820 AA.
AC Q9W5D0; A8JUT7; A8JUT8; D8FT41; M9PDF0; M9PG59; M9PGC2; M9PGC6; M9PGJ1;
AC M9PIN4; O77432; O77433; Q1RKT6;
DT 15-NOV-2002, integrated into UniProtKB/Swiss-Prot.
DT 07-JUL-2009, sequence version 4.
DT 03-AUG-2022, entry version 162.
DE RecName: Full=Uncharacterized protein CG43867;
GN ORFNames=CG43867;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2]
RP GENOME REANNOTATION, AND ALTERNATIVE SPLICING.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [3] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Oregon-R;
RX PubMed=10731137; DOI=10.1126/science.287.5461.2220;
RA Benos P.V., Gatt M.K., Ashburner M., Murphy L., Harris D., Barrell B.G.,
RA Ferraz C., Vidal S., Brun C., Demailles J., Cadieu E., Dreano S., Gloux S.,
RA Lelaure V., Mottier S., Galibert F., Borkova D., Minana B., Kafatos F.C.,
RA Louis C., Siden-Kiamos I., Bolshakov S., Papagiannakis G., Spanos L.,
RA Cox S., Madueno E., de Pablos B., Modolell J., Peter A., Schoettler P.,
RA Werner M., Mourkioti F., Beinert N., Dowe G., Schaefer U., Jaeckle H.,
RA Bucheton A., Callister D.M., Campbell L.A., Darlamitsou A., Henderson N.S.,
RA McMillan P.J., Salles C., Tait E.A., Valenti P., Saunders R.D.C.,
RA Glover D.M.;
RT "From sequence to chromosome: the tip of the X chromosome of D.
RT melanogaster.";
RL Science 287:2220-2222(2000).
RN [4] {ECO:0000305}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-492 (ISOFORMS D/H), AND
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 373-1539 (ISOFORM C).
RA Stapleton M., Booth B., Carlson J., Chavez C., Frise E., George R.,
RA Pacleb J., Park S., Wan K., Yu C., Celniker S.;
RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-542; SER-543; SER-1073;
RP SER-1075 AND SER-1077, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=7;
CC Name=D {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-1; Sequence=Displayed;
CC Name=B {ECO:0000312|FlyBase:FBgn0264449}; Synonyms=J
CC {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-3; Sequence=VSP_053572, VSP_053573, VSP_053574,
CC VSP_053575, VSP_053578;
CC Name=I {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-4; Sequence=VSP_035867, VSP_053575, VSP_053577;
CC Name=A {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-5; Sequence=VSP_053572, VSP_053573, VSP_053574,
CC VSP_053575;
CC Name=H {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-6; Sequence=VSP_053575, VSP_053576;
CC Name=C {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-7; Sequence=VSP_053572, VSP_053573, VSP_053574,
CC VSP_053578;
CC Name=F {ECO:0000312|FlyBase:FBgn0264449};
CC IsoId=Q9W5D0-2; Sequence=VSP_035867;
CC -!- SEQUENCE CAUTION:
CC Sequence=ABE73295.1; Type=Miscellaneous discrepancy; Note=Contaminating sequence. Potential poly-A sequence.; Evidence={ECO:0000305};
CC Sequence=CAA20900.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAA20901.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014298; ABW09321.2; -; Genomic_DNA.
DR EMBL; AE014298; ABW09322.1; -; Genomic_DNA.
DR EMBL; AE014298; AGB94950.1; -; Genomic_DNA.
DR EMBL; AE014298; AGB94951.1; -; Genomic_DNA.
DR EMBL; AE014298; AGB94952.1; -; Genomic_DNA.
DR EMBL; AE014298; AGB94953.2; -; Genomic_DNA.
DR EMBL; AE014298; AGB94954.1; -; Genomic_DNA.
DR EMBL; AE014298; AGB94955.1; -; Genomic_DNA.
DR EMBL; AL031583; CAA20900.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL031583; CAA20901.1; ALT_SEQ; Genomic_DNA.
DR EMBL; BT025124; ABE73295.1; ALT_SEQ; mRNA.
DR EMBL; BT125063; ADK27790.1; -; mRNA.
DR PIR; T13475; T13475.
DR PIR; T13476; T13476.
DR RefSeq; NP_001096860.2; NM_001103390.2. [Q9W5D0-1]
DR RefSeq; NP_001096861.1; NM_001103391.1. [Q9W5D0-2]
DR RefSeq; NP_001259104.1; NM_001272175.1. [Q9W5D0-7]
DR RefSeq; NP_001259105.1; NM_001272176.1. [Q9W5D0-4]
DR RefSeq; NP_001259106.1; NM_001272177.1. [Q9W5D0-6]
DR RefSeq; NP_001259107.2; NM_001272178.2. [Q9W5D0-3]
DR RefSeq; NP_001259108.1; NM_001272179.1. [Q9W5D0-3]
DR RefSeq; NP_001259109.1; NM_001272180.1. [Q9W5D0-5]
DR AlphaFoldDB; Q9W5D0; -.
DR SMR; Q9W5D0; -.
DR BioGRID; 57599; 6.
DR IntAct; Q9W5D0; 5.
DR STRING; 7227.FBpp0304853; -.
DR iPTMnet; Q9W5D0; -.
DR PaxDb; Q9W5D0; -.
DR EnsemblMetazoa; FBtr0332604; FBpp0304853; FBgn0264449. [Q9W5D0-7]
DR EnsemblMetazoa; FBtr0332605; FBpp0304854; FBgn0264449. [Q9W5D0-4]
DR EnsemblMetazoa; FBtr0332606; FBpp0304855; FBgn0264449. [Q9W5D0-6]
DR EnsemblMetazoa; FBtr0332609; FBpp0304858; FBgn0264449. [Q9W5D0-1]
DR EnsemblMetazoa; FBtr0332610; FBpp0304859; FBgn0264449. [Q9W5D0-2]
DR EnsemblMetazoa; FBtr0332611; FBpp0304860; FBgn0264449. [Q9W5D0-3]
DR EnsemblMetazoa; FBtr0332612; FBpp0304861; FBgn0264449. [Q9W5D0-5]
DR EnsemblMetazoa; FBtr0343358; FBpp0310015; FBgn0264449. [Q9W5D0-3]
DR GeneID; 31031; -.
DR KEGG; dme:Dmel_CG43867; -.
DR UCSC; CG42248-RD; d. melanogaster. [Q9W5D0-1]
DR FlyBase; FBgn0264449; CG43867.
DR VEuPathDB; VectorBase:FBgn0264449; -.
DR eggNOG; KOG0248; Eukaryota.
DR GeneTree; ENSGT00940000167791; -.
DR InParanoid; Q9W5D0; -.
DR OMA; NRKASMP; -.
DR PhylomeDB; Q9W5D0; -.
DR BioGRID-ORCS; 31031; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 31031; -.
DR PRO; PR:Q9W5D0; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0264449; Expressed in brain and 15 other tissues.
DR ExpressionAtlas; Q9W5D0; baseline and differential.
DR Genevisible; Q9W5D0; DM.
DR GO; GO:0071944; C:cell periphery; IEA:UniProt.
DR GO; GO:0005856; C:cytoskeleton; IEA:InterPro.
DR GO; GO:0009887; P:animal organ morphogenesis; IEA:UniProt.
DR GO; GO:0030182; P:neuron differentiation; IEA:UniProt.
DR CDD; cd14473; FERM_B-lobe; 1.
DR Gene3D; 1.20.80.10; -; 1.
DR Gene3D; 1.25.40.530; -; 1.
DR Gene3D; 2.30.29.30; -; 3.
DR InterPro; IPR019749; Band_41_domain.
DR InterPro; IPR014352; FERM/acyl-CoA-bd_prot_sf.
DR InterPro; IPR035963; FERM_2.
DR InterPro; IPR019748; FERM_central.
DR InterPro; IPR000299; FERM_domain.
DR InterPro; IPR000857; MyTH4_dom.
DR InterPro; IPR038185; MyTH4_dom_sf.
DR InterPro; IPR011993; PH-like_dom_sf.
DR InterPro; IPR001849; PH_domain.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR Pfam; PF00373; FERM_M; 1.
DR Pfam; PF00784; MyTH4; 2.
DR Pfam; PF00169; PH; 2.
DR SMART; SM00295; B41; 1.
DR SMART; SM00139; MyTH4; 1.
DR SMART; SM00233; PH; 2.
DR SUPFAM; SSF47031; SSF47031; 1.
DR SUPFAM; SSF54236; SSF54236; 1.
DR PROSITE; PS50057; FERM_3; 1.
DR PROSITE; PS51016; MYTH4; 1.
DR PROSITE; PS50003; PH_DOMAIN; 2.
PE 1: Evidence at protein level;
KW Alternative splicing; Coiled coil; Phosphoprotein; Reference proteome;
KW Repeat.
FT CHAIN 1..1820
FT /note="Uncharacterized protein CG43867"
FT /id="PRO_0000219451"
FT DOMAIN 909..1003
FT /note="PH 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00145"
FT DOMAIN 1017..1124
FT /note="PH 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00145"
FT DOMAIN 1159..1378
FT /note="MyTH4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00359"
FT DOMAIN 1389..1712
FT /note="FERM"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00084"
FT REGION 1..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 86..158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 226..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..366
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 453..497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 519..548
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 577..597
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 619..689
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 735..830
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1713..1748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1764..1820
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 265..308
FT /evidence="ECO:0000255"
FT COILED 362..438
FT /evidence="ECO:0000255"
FT COMPBIAS 86..125
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..302
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 317..350
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 619..645
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 674..689
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 735..774
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 781..830
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1764..1796
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1803..1820
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 542
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 543
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 1073
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 1075
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT MOD_RES 1077
FT /note="Phosphoserine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 1..298
FT /note="Missing (in isoform F and isoform I)"
FT /evidence="ECO:0000305"
FT /id="VSP_035867"
FT VAR_SEQ 2..45
FT /note="QWTAASCDQHDQHDPPAVNRNSIEQRTPANNCEVDPMDATGSTK -> SDEV
FT PLGRLSHIFDTLTNLQQQQHLRSQEQLHSQQ (in isoform A, isoform B
FT and isoform C)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_053572"
FT VAR_SEQ 49..269
FT /note="LPKQREADRPVSGSGGGSSSLTLSGAVTTSMEVTVSTTTTTTIIESSSSTNT
FT TLEKNSPSPAGGSCSSGSGSLSPAYLQHHLQHHGSPLHHLQVHHHTAPPSPLAAVRSAQ
FT MGSSVANGAGPAAACLAVCCSPGSSHHHLGHVGHLATGHPLPHQLPHQLPHPSLYPLMA
FT AAQLGYAGSSGPSSLVNSPALGRRKRYTSNSSNCSSQFNNNYAGLDVDSLD -> SQLQ
FT PEPQQSSAEIRRRSASSSPSPSASASASTSGRATPSLGVASPLNSLQYYSHAHNYFLRP
FT QEVAGSGYLHTFPSHFYHHQVHHLQQHSQPPSLPTQLGAARGSQSLQGSPLLAKRATSF
FT SGQIPLAQGRFTASGTTAASGAIGLPASTPNSPRLLPRRAPRPPPIPAKPNQVKADQQS
FT KDAQARNSTTTTVQATVNPVLAALDAPDAPWPHFSTLTEHLDVHQVNNYGQALPQINWQ
FT ERCLELQLELHRSKNQAGRIR (in isoform A, isoform B and isoform
FT C)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_053573"
FT VAR_SEQ 273
FT /note="R -> RE (in isoform A, isoform B and isoform C)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_053574"
FT VAR_SEQ 784..867
FT /note="Missing (in isoform A, isoform B, isoform H and
FT isoform I)"
FT /evidence="ECO:0000305"
FT /id="VSP_053575"
FT VAR_SEQ 1246..1287
FT /note="Missing (in isoform H)"
FT /evidence="ECO:0000305"
FT /id="VSP_053576"
FT VAR_SEQ 1246..1255
FT /note="Missing (in isoform I)"
FT /evidence="ECO:0000305"
FT /id="VSP_053577"
FT VAR_SEQ 1256..1287
FT /note="Missing (in isoform B and isoform C)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_053578"
SQ SEQUENCE 1820 AA; 198641 MW; 3C24A19192464053 CRC64;
MQWTAASCDQ HDQHDPPAVN RNSIEQRTPA NNCEVDPMDA TGSTKHPHLP KQREADRPVS
GSGGGSSSLT LSGAVTTSME VTVSTTTTTT IIESSSSTNT TLEKNSPSPA GGSCSSGSGS
LSPAYLQHHL QHHGSPLHHL QVHHHTAPPS PLAAVRSAQM GSSVANGAGP AAACLAVCCS
PGSSHHHLGH VGHLATGHPL PHQLPHQLPH PSLYPLMAAA QLGYAGSSGP SSLVNSPALG
RRKRYTSNSS NCSSQFNNNY AGLDVDSLDD MLRKLTELEQ RVIEAEERAE EAEDKVRAME
QRLSEWPKPP PQQAQHPHSH SHPHQPIPSH PQEQQAKNHC SPSHQASGGA TAGAAGSGLP
PTQETEKTIT SLEIQVEEQR QLRLHDARQI EAKAAKIKEW VNNKLRDLEE QNQLLREQNV
KCNQQLELLK NHIANQSQRH SIVGPVRNSL SLDVQDFTGS GSNPEHRRRS ESLDPQEIIG
RPLTSSYPHH QHRRNLSMEP QELERNLVAA VDGLTLAPLS SISNKAPGGV PTESGVVTRP
DSSDTDTAHD YAEIYTPSCE KLPAWMKNNP ALMASGGNSS TTTTTTSELG VPRPPTPPLH
RFPSWEAKIY QVANDGLAGA GTGTSTAEST ASQEPDIQDG MGTNLSNGRR HGHGHGSGTG
IGTGDGHGTL GSTPGTPLPP SRQQQTASGG FCDISVPVYA TVKGRASQIR SMPFTGDSSD
DSSDGEDHAV MLTHHSHNSS STDNTETSTS GSASSPSKSL KTSSSLSPAK RSGSESPKNA
KARVHIQSRT STTPSSRINQ HLQPSQHQHH TLSNQNHGHQ LGAYTVTPSS GQLSLPRYHA
NALQPGSLPS PLQHMRGTVI SDLSFESGLS DDYALPPDAV SESTCMDASM PSLLMRQSYV
DSPSKKIESL EKMGHLAKLG GKLKTWRKRW FVLKNGSLNY WKSQHDVQRK PQGQIQLDEV
CRINRAEGAS TFEIDTGKKV YYLTADSHAT MDDWIRVLQN VQRRNATKLL LSRDDQKPTV
QGWVTKVKNG HPKKCWCVLL GKMFLYFKAP AETNPLGQIN MRDARVEEVE HVSDSDSEER
EDAAQDQARL TVAIYPAHQG PTYLILSGKP ERDNWLYHLT VVSGGGPSAG TQYEQLVQKL
METDGDPNCV LWRHPILLHT KDTITAPLSS MHTETMQPEA IKLFKSIQLF MSVAVNQPGI
DYHVVLAQNA LQHALDMPEL QTEMICILIK QTSRHLGQKL SVGVQVNKKL GKQTRQLLLC
ATQSLFTCDT QQAGHAQANG SSPTSIQAPS ATPIIDCKSN PPVYSFVQGW QLLALAVSLF
VPRSSRLLWY LKLHLSRNAD TKTETGKYAA YCERALERTL KNGGRETKPS RMEVLSILLK
NPYHHSLPHA IPVHMMNSTY QVVSFDGSTT IEEFQATLAH ELGTRDATNG FCLFSDDPIE
KDLEHYLEPL AKLCDVISKW ETALREKGSG KFENSRVIQL SYKNRLYWKH TIKCETDKER
LLLCYQTNSQ IVQGRFPLSR ELALELASLM SQIDMGDYSL EKSRDVGVGL KGLDKFYPYR
YRDALGAEQL KDVQELLVSK WMLLKGRSTL DCVRIYLTCC RKWPYFGACL FQAKPRQSPE
SNTASGATPV AWLAVAEDAL NVLELSTMAP VARYPYSSVM TFGGCQDDFM LVVSHDDGGG
GEQKLLFAMS KPKILEITLL IADYMNALGH TVPGTPQMNS LTRNGSHRSL RTSQRPNLGG
GSAVATGFST NATTTAHNTL NSHATHTLNS NHSHTLSSSH HAGGGSQPGT LSSGHHQHHH
IQQHHQPDIL KSTPDHQRIK
//