WDR47_HUMAN
ID WDR47_HUMAN Reviewed; 919 AA.
AC O94967; A8MX09; Q5TYV7; Q5TYV8; Q5TYV9; Q8IXT7; Q8IYU9;
DT 20-JUN-2001, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-1999, sequence version 1.
DT 03-AUG-2022, entry version 173.
DE RecName: Full=WD repeat-containing protein 47;
DE AltName: Full=Neuronal enriched MAP-interacting protein;
DE Short=Nemitin;
GN Name=WDR47; Synonyms=KIAA0893;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Brain;
RX PubMed=10048485; DOI=10.1093/dnares/5.6.355;
RA Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N.,
RA Tanaka A., Kotani H., Nomura N., Ohara O.;
RT "Prediction of the coding sequences of unidentified human genes. XII. The
RT complete sequences of 100 new cDNA clones from brain which code for large
RT proteins in vitro.";
RL DNA Res. 5:355-364(1998).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
RC TISSUE=Brain;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
RC TISSUE=Brain;
RA Totoki Y., Toyoda A., Takeda T., Sakaki Y., Tanaka A., Yokoyama S.;
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3).
RC TISSUE=Brain, and Testis;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18220336; DOI=10.1021/pr0705441;
RA Cantin G.T., Yi W., Lu B., Park S.K., Xu T., Lee J.-D., Yates J.R. III;
RT "Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient
RT phosphoproteomic analysis.";
RL J. Proteome Res. 7:1346-1351(2008).
RN [8]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-285; SER-292 AND SER-297, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=18669648; DOI=10.1073/pnas.0805139105;
RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E.,
RA Elledge S.J., Gygi S.P.;
RT "A quantitative atlas of mitotic phosphorylation.";
RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008).
RN [9]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-292, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma;
RX PubMed=20068231; DOI=10.1126/scisignal.2000475;
RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L.,
RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.;
RT "Quantitative phosphoproteomics reveals widespread full phosphorylation
RT site occupancy during mitosis.";
RL Sci. Signal. 3:RA3-RA3(2010).
RN [10]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-312; SER-422 AND THR-542, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Cervix carcinoma, and Erythroleukemia;
RX PubMed=23186163; DOI=10.1021/pr300630k;
RA Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J.,
RA Mohammed S.;
RT "Toward a comprehensive characterization of a human cancer cell
RT phosphoproteome.";
RL J. Proteome Res. 12:260-271(2013).
CC -!- SUBUNIT: Interacts with MAP1S (via WD repeats). {ECO:0000250}.
CC -!- INTERACTION:
CC O94967; P56279: TCL1A; NbExp=3; IntAct=EBI-723239, EBI-749995;
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytoskeleton {ECO:0000250}.
CC Note=Localization along microtubules is mediated by MAP1S.
CC {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=1;
CC IsoId=O94967-1; Sequence=Displayed;
CC Name=2;
CC IsoId=O94967-2; Sequence=VSP_012093;
CC Name=3;
CC IsoId=O94967-3; Sequence=VSP_035045;
CC Name=4;
CC IsoId=O94967-4; Sequence=VSP_046727, VSP_035045;
CC -!- SEQUENCE CAUTION:
CC Sequence=AK225781; Type=Frameshift; Evidence={ECO:0000305};
CC Sequence=BAA74916.2; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB020700; BAA74916.2; ALT_INIT; mRNA.
DR EMBL; AK289789; BAF82478.1; -; mRNA.
DR EMBL; AK225781; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BX679664; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL449266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471122; EAW56350.1; -; Genomic_DNA.
DR EMBL; CH471122; EAW56351.1; -; Genomic_DNA.
DR EMBL; BC034964; AAH34964.1; -; mRNA.
DR EMBL; BC039254; AAH39254.1; -; mRNA.
DR CCDS; CCDS30787.1; -. [O94967-3]
DR CCDS; CCDS44186.1; -. [O94967-4]
DR CCDS; CCDS44187.1; -. [O94967-1]
DR RefSeq; NP_001136022.1; NM_001142550.1. [O94967-4]
DR RefSeq; NP_001136023.1; NM_001142551.1. [O94967-1]
DR RefSeq; NP_055784.3; NM_014969.5. [O94967-3]
DR RefSeq; XP_016856186.1; XM_017000697.1. [O94967-1]
DR AlphaFoldDB; O94967; -.
DR SMR; O94967; -.
DR BioGRID; 116574; 33.
DR IntAct; O94967; 15.
DR MINT; O94967; -.
DR STRING; 9606.ENSP00000383599; -.
DR GlyGen; O94967; 1 site, 1 O-linked glycan (1 site).
DR iPTMnet; O94967; -.
DR PhosphoSitePlus; O94967; -.
DR BioMuta; WDR47; -.
DR EPD; O94967; -.
DR jPOST; O94967; -.
DR MassIVE; O94967; -.
DR MaxQB; O94967; -.
DR PaxDb; O94967; -.
DR PeptideAtlas; O94967; -.
DR PRIDE; O94967; -.
DR ProteomicsDB; 2284; -.
DR ProteomicsDB; 50587; -. [O94967-1]
DR ProteomicsDB; 50588; -. [O94967-2]
DR ProteomicsDB; 50589; -. [O94967-3]
DR Antibodypedia; 33741; 50 antibodies from 14 providers.
DR DNASU; 22911; -.
DR Ensembl; ENST00000361054.7; ENSP00000354339.3; ENSG00000085433.17. [O94967-2]
DR Ensembl; ENST00000369962.8; ENSP00000358979.3; ENSG00000085433.17. [O94967-1]
DR Ensembl; ENST00000369965.8; ENSP00000358982.4; ENSG00000085433.17. [O94967-3]
DR Ensembl; ENST00000400794.7; ENSP00000383599.3; ENSG00000085433.17. [O94967-4]
DR GeneID; 22911; -.
DR KEGG; hsa:22911; -.
DR MANE-Select; ENST00000369962.8; ENSP00000358979.3; NM_001142551.2; NP_001136023.1.
DR UCSC; uc001dwi.4; human. [O94967-1]
DR CTD; 22911; -.
DR DisGeNET; 22911; -.
DR GeneCards; WDR47; -.
DR HGNC; HGNC:29141; WDR47.
DR HPA; ENSG00000085433; Low tissue specificity.
DR MIM; 615734; gene.
DR neXtProt; NX_O94967; -.
DR OpenTargets; ENSG00000085433; -.
DR PharmGKB; PA134937302; -.
DR VEuPathDB; HostDB:ENSG00000085433; -.
DR eggNOG; KOG0641; Eukaryota.
DR GeneTree; ENSGT00940000155561; -.
DR HOGENOM; CLU_014985_0_0_1; -.
DR InParanoid; O94967; -.
DR OMA; NVGMENI; -.
DR OrthoDB; 245263at2759; -.
DR PhylomeDB; O94967; -.
DR TreeFam; TF312810; -.
DR PathwayCommons; O94967; -.
DR SignaLink; O94967; -.
DR BioGRID-ORCS; 22911; 10 hits in 1078 CRISPR screens.
DR ChiTaRS; WDR47; human.
DR GeneWiki; WDR47; -.
DR GenomeRNAi; 22911; -.
DR Pharos; O94967; Tdark.
DR PRO; PR:O94967; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; O94967; protein.
DR Bgee; ENSG00000085433; Expressed in cortical plate and 200 other tissues.
DR ExpressionAtlas; O94967; baseline and differential.
DR Genevisible; O94967; HS.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
DR GO; GO:0005874; C:microtubule; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; -; 2.
DR InterPro; IPR024977; Apc4_WD40_dom.
DR InterPro; IPR006595; CTLH_C.
DR InterPro; IPR006594; LisH.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR040067; WDR47.
DR PANTHER; PTHR19863; PTHR19863; 1.
DR Pfam; PF12894; ANAPC4_WD40; 1.
DR Pfam; PF00400; WD40; 2.
DR SMART; SM00668; CTLH; 1.
DR SMART; SM00667; LisH; 1.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; SSF50978; 1.
DR PROSITE; PS50897; CTLH; 1.
DR PROSITE; PS50896; LISH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 5.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Cytoplasm; Cytoskeleton; Developmental protein;
KW Microtubule; Phosphoprotein; Reference proteome; Repeat; WD repeat.
FT CHAIN 1..919
FT /note="WD repeat-containing protein 47"
FT /id="PRO_0000051397"
FT DOMAIN 10..42
FT /note="LisH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00126"
FT DOMAIN 45..102
FT /note="CTLH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00058"
FT REPEAT 604..643
FT /note="WD 1"
FT REPEAT 659..698
FT /note="WD 2"
FT REPEAT 706..748
FT /note="WD 3"
FT REPEAT 753..791
FT /note="WD 4"
FT REPEAT 798..837
FT /note="WD 5"
FT REPEAT 840..879
FT /note="WD 6"
FT REPEAT 886..918
FT /note="WD 7"
FT REGION 393..421
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 500..590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..569
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 285
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:18669648"
FT MOD_RES 289
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8CGF6"
FT MOD_RES 292
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648,
FT ECO:0007744|PubMed:20068231"
FT MOD_RES 297
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:18669648"
FT MOD_RES 312
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 422
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT MOD_RES 542
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:23186163"
FT VAR_SEQ 54..81
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_012093"
FT VAR_SEQ 109
FT /note="H -> HVRFLFLK (in isoform 4)"
FT /evidence="ECO:0000303|Ref.3"
FT /id="VSP_046727"
FT VAR_SEQ 377
FT /note="R -> RS (in isoform 3 and isoform 4)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334, ECO:0000303|Ref.3"
FT /id="VSP_035045"
FT CONFLICT 298
FT /note="P -> Q (in Ref. 6; AAH39254)"
FT /evidence="ECO:0000305"
FT CONFLICT 369
FT /note="S -> C (in Ref. 6; AAH39254)"
FT /evidence="ECO:0000305"
FT CONFLICT 698
FT /note="T -> TVS (in Ref. 6; AAH39254)"
FT /evidence="ECO:0000305"
FT CONFLICT 779
FT /note="R -> G (in Ref. 6; AAH34964)"
FT /evidence="ECO:0000305"
FT CONFLICT 789
FT /note="V -> A (in Ref. 6; AAH34964)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 919 AA; 101949 MW; 2BC571E17DB67166 CRC64;
MTAEETVNVK EVEIIKLILD FLNSKKLHIS MLALEKESGV INGLFSDDML FLRQLILDGQ
WDEVLQFIQP LECMEKFDKK RFRYIILKQK FLEALCVNNA MSAEDEPQHL EFTMQEAVQC
LHALEEYCPS KDDYSKLCLL LTLPRLTNHA EFKDWNPSTA RVHCFEEACV MVAEFIPADR
KLSEAGFKAS NNRLFQLVMK GLLYECCVEF CQSKATGEEI TESEVLLGID LLCGNGCDDL
DLSLLSWLQN LPSSVFSCAF EQKMLNIHVD KLLKPTKAAY ADLLTPLISK LSPYPSSPMR
RPQSADAYMT RSLNPALDGL TCGLTSHDKR ISDLGNKTSP MSHSFANFHY PGVQNLSRSL
MLENTECHSI YEESPERDTP VDAQRPIGSE ILGQSSVSEK EPANGAQNPG PAKQEKNELR
DSTEQFQEYY RQRLRYQQHL EQKEQQRQIY QQMLLEGGVN QEDGPDQQQN LTEQFLNRSI
QKLGELNIGM DGLGNEVSAL NQQCNGSKGN GSNGSSVTSF TTPPQDSSQR LTHDASNIHT
STPRNPGSTN HIPFLEESPC GSQISSEHSV IKPPLGDSPG SLSRSKGEED DKSKKQFVCI
NILEDTQAVR AVAFHPAGGL YAVGSNSKTL RVCAYPDVID PSAHETPKQP VVRFKRNKHH
KGSIYCVAWS PCGQLLATGS NDKYVKVLPF NAETCNATGP DLEFSMHDGT IRDLAFMEGP
ESGGAILISA GAGDCNIYTT DCQRGQGLHA LSGHTGHILA LYTWSGWMIA SGSQDKTVRF
WDLRVPSCVR VVGTTFHGTG SAVASVAVDP SGRLLATGQE DSSCMLYDIR GGRMVQSYHP
HSSDVRSVRF SPGAHYLLTG SYDMKIKVTD LQGDLTKQLP IMVVGEHKDK VIQCRWHTQD
LSFLSSSADR TVTLWTYNG