ID WDFY4_HUMAN Reviewed; 3184 AA.
AC Q6ZS81; B9ZVP2; Q86WZ4; Q8N4A3; Q8TEN7; Q96BE1; Q9H7H8; Q9HCG5;
DT 03-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 25-NOV-2008, sequence version 3.
DT 03-AUG-2022, entry version 145.
DE RecName: Full=WD repeat- and FYVE domain-containing protein 4;
GN Name=WDFY4; Synonyms=C10orf64, KIAA1607;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 2587-3184 (ISOFORM 5), AND VARIANT PRO-214.
RC TISSUE=Spleen;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15164054; DOI=10.1038/nature02462;
RA Deloukas P., Earthrowl M.E., Grafham D.V., Rubenfield M., French L.,
RA Steward C.A., Sims S.K., Jones M.C., Searle S., Scott C., Howe K.,
RA Hunt S.E., Andrews T.D., Gilbert J.G.R., Swarbreck D., Ashurst J.L.,
RA Taylor A., Battles J., Bird C.P., Ainscough R., Almeida J.P.,
RA Ashwell R.I.S., Ambrose K.D., Babbage A.K., Bagguley C.L., Bailey J.,
RA Banerjee R., Bates K., Beasley H., Bray-Allen S., Brown A.J., Brown J.Y.,
RA Burford D.C., Burrill W., Burton J., Cahill P., Camire D., Carter N.P.,
RA Chapman J.C., Clark S.Y., Clarke G., Clee C.M., Clegg S., Corby N.,
RA Coulson A., Dhami P., Dutta I., Dunn M., Faulkner L., Frankish A.,
RA Frankland J.A., Garner P., Garnett J., Gribble S., Griffiths C.,
RA Grocock R., Gustafson E., Hammond S., Harley J.L., Hart E., Heath P.D.,
RA Ho T.P., Hopkins B., Horne J., Howden P.J., Huckle E., Hynds C.,
RA Johnson C., Johnson D., Kana A., Kay M., Kimberley A.M., Kershaw J.K.,
RA Kokkinaki M., Laird G.K., Lawlor S., Lee H.M., Leongamornlert D.A.,
RA Laird G., Lloyd C., Lloyd D.M., Loveland J., Lovell J., McLaren S.,
RA McLay K.E., McMurray A., Mashreghi-Mohammadi M., Matthews L., Milne S.,
RA Nickerson T., Nguyen M., Overton-Larty E., Palmer S.A., Pearce A.V.,
RA Peck A.I., Pelan S., Phillimore B., Porter K., Rice C.M., Rogosin A.,
RA Ross M.T., Sarafidou T., Sehra H.K., Shownkeen R., Skuce C.D., Smith M.,
RA Standring L., Sycamore N., Tester J., Thorpe A., Torcasso W., Tracey A.,
RA Tromans A., Tsolas J., Wall M., Walsh J., Wang H., Weinstock K., West A.P.,
RA Willey D.L., Whitehead S.L., Wilming L., Wray P.W., Young L., Chen Y.,
RA Lovering R.C., Moschonas N.K., Siebert R., Fechtel K., Bentley D.,
RA Durbin R.M., Hubbard T., Doucette-Stamm L., Beck S., Smith D.R., Rogers J.;
RT "The DNA sequence and comparative analysis of human chromosome 10.";
RL Nature 429:375-381(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 2286-3184 (ISOFORM 1), AND VARIANT ASN-2527.
RC TISSUE=B-cell, Brain, and Lymph;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1290-3184 (ISOFORM 4), AND
RP VARIANT ASN-2527.
RC TISSUE=Spleen;
RA Jikuya H., Takano J., Nomura N., Kikuno R., Nagase T., Ohara O.;
RT "The nucleotide sequence of a long cDNA clone isolated from human spleen.";
RL Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1915-3184 (ISOFORM 1), AND
RP VARIANT ASN-2527.
RC TISSUE=Brain;
RX PubMed=10997877; DOI=10.1093/dnares/7.4.271;
RA Nagase T., Kikuno R., Nakayama M., Hirosawa M., Ohara O.;
RT "Prediction of the coding sequences of unidentified human genes. XVIII. The
RT complete sequences of 100 new cDNA clones from brain which code for large
RT proteins in vitro.";
RL DNA Res. 7:273-281(2000).
RN [6]
RP IDENTIFICATION.
RX PubMed=15254788;
RA Katoh M., Katoh M.;
RT "Identification and characterization of ARHGAP24 and ARHGAP25 genes in
RT silico.";
RL Int. J. Mol. Med. 14:333-338(2004).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=21269460; DOI=10.1186/1752-0509-5-17;
RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T.,
RA Bennett K.L., Superti-Furga G., Colinge J.;
RT "Initial characterization of the human central proteome.";
RL BMC Syst. Biol. 5:17-17(2011).
CC -!- FUNCTION: Plays a critical role in the regulation of cDC1-mediated
CC cross-presentation of viral and tumor antigens in dendritic cells.
CC Mechanistically, acts near the plasma membrane and interacts with
CC endosomal membranes to promote endosomal-to-cytosol antigen
CC trafficking. Also plays a role in B-cell survival through regulation of
CC autophagy. {ECO:0000250|UniProtKB:E9Q2M9}.
CC -!- SUBUNIT: Interacts with HSP90AB1. {ECO:0000250|UniProtKB:E9Q2M9}.
CC -!- INTERACTION:
CC Q6ZS81-2; Q96CV9: OPTN; NbExp=3; IntAct=EBI-25911158, EBI-748974;
CC -!- SUBCELLULAR LOCATION: Early endosome {ECO:0000250|UniProtKB:E9Q2M9}.
CC Endoplasmic reticulum {ECO:0000250|UniProtKB:E9Q2M9}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=5;
CC Name=1;
CC IsoId=Q6ZS81-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q6ZS81-2; Sequence=VSP_020750, VSP_020751;
CC Name=3;
CC IsoId=Q6ZS81-3; Sequence=VSP_035683, VSP_035684;
CC Name=4;
CC IsoId=Q6ZS81-4; Sequence=VSP_035687;
CC Name=5;
CC IsoId=Q6ZS81-5; Sequence=VSP_035685, VSP_035686;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK024502; BAB15792.1; -; mRNA.
DR EMBL; AK127650; BAC87073.1; -; mRNA.
DR EMBL; AC035139; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC060234; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC068898; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC015694; AAH15694.2; -; mRNA.
DR EMBL; BC034937; AAH34937.1; -; mRNA.
DR EMBL; BC047574; AAH47574.1; -; mRNA.
DR EMBL; AK074085; BAB84911.1; -; mRNA.
DR EMBL; AB046827; BAB13433.1; -; mRNA.
DR CCDS; CCDS44385.1; -. [Q6ZS81-1]
DR RefSeq; NP_065996.1; NM_020945.1. [Q6ZS81-1]
DR RefSeq; XP_005270061.1; XM_005270004.3. [Q6ZS81-1]
DR RefSeq; XP_011538290.1; XM_011539988.2. [Q6ZS81-1]
DR RefSeq; XP_016871952.1; XM_017016463.1. [Q6ZS81-1]
DR SMR; Q6ZS81; -.
DR BioGRID; 121729; 2.
DR IntAct; Q6ZS81; 5.
DR STRING; 9606.ENSP00000320563; -.
DR iPTMnet; Q6ZS81; -.
DR PhosphoSitePlus; Q6ZS81; -.
DR BioMuta; WDFY4; -.
DR DMDM; 215274123; -.
DR EPD; Q6ZS81; -.
DR jPOST; Q6ZS81; -.
DR MassIVE; Q6ZS81; -.
DR MaxQB; Q6ZS81; -.
DR PaxDb; Q6ZS81; -.
DR PeptideAtlas; Q6ZS81; -.
DR PRIDE; Q6ZS81; -.
DR ProteomicsDB; 68196; -. [Q6ZS81-1]
DR ProteomicsDB; 68197; -. [Q6ZS81-2]
DR ProteomicsDB; 68198; -. [Q6ZS81-3]
DR ProteomicsDB; 68199; -. [Q6ZS81-4]
DR ProteomicsDB; 68200; -. [Q6ZS81-5]
DR Antibodypedia; 44930; 70 antibodies from 17 providers.
DR DNASU; 57705; -.
DR Ensembl; ENST00000325239.12; ENSP00000320563.5; ENSG00000128815.20. [Q6ZS81-1]
DR Ensembl; ENST00000360890.6; ENSP00000354141.2; ENSG00000128815.20. [Q6ZS81-2]
DR GeneID; 57705; -.
DR KEGG; hsa:57705; -.
DR MANE-Select; ENST00000325239.12; ENSP00000320563.5; NM_001394531.1; NP_001381460.1.
DR UCSC; uc001jgy.3; human. [Q6ZS81-1]
DR CTD; 57705; -.
DR DisGeNET; 57705; -.
DR GeneCards; WDFY4; -.
DR HGNC; HGNC:29323; WDFY4.
DR HPA; ENSG00000128815; Tissue enhanced (bone marrow, lymphoid tissue).
DR MIM; 613316; gene.
DR neXtProt; NX_Q6ZS81; -.
DR OpenTargets; ENSG00000128815; -.
DR PharmGKB; PA134967634; -.
DR VEuPathDB; HostDB:ENSG00000128815; -.
DR eggNOG; KOG1786; Eukaryota.
DR eggNOG; KOG1788; Eukaryota.
DR GeneTree; ENSGT00940000155684; -.
DR HOGENOM; CLU_006536_1_0_1; -.
DR InParanoid; Q6ZS81; -.
DR OMA; CKSEGFV; -.
DR OrthoDB; 101142at2759; -.
DR PhylomeDB; Q6ZS81; -.
DR TreeFam; TF313658; -.
DR PathwayCommons; Q6ZS81; -.
DR SignaLink; Q6ZS81; -.
DR BioGRID-ORCS; 57705; 9 hits in 1030 CRISPR screens.
DR ChiTaRS; WDFY4; human.
DR GenomeRNAi; 57705; -.
DR Pharos; Q6ZS81; Tbio.
DR PRO; PR:Q6ZS81; -.
DR Proteomes; UP000005640; Chromosome 10.
DR RNAct; Q6ZS81; protein.
DR Bgee; ENSG00000128815; Expressed in superficial temporal artery and 132 other tissues.
DR ExpressionAtlas; Q6ZS81; baseline and differential.
DR Genevisible; Q6ZS81; HS.
DR GO; GO:0005769; C:early endosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA:UniProtKB-SubCell.
DR GO; GO:0019882; P:antigen processing and presentation; IBA:GO_Central.
DR GO; GO:0006914; P:autophagy; IEA:UniProtKB-KW.
DR GO; GO:0036037; P:CD8-positive, alpha-beta T cell activation; IEA:Ensembl.
DR GO; GO:0098586; P:cellular response to virus; IEA:Ensembl.
DR CDD; cd06071; Beach; 1.
DR CDD; cd01201; PH_BEACH; 1.
DR Gene3D; 1.10.1540.10; -; 1.
DR Gene3D; 1.25.10.10; -; 1.
DR Gene3D; 2.130.10.10; -; 1.
DR Gene3D; 2.30.29.30; -; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000409; BEACH_dom.
DR InterPro; IPR036372; BEACH_dom_sf.
DR InterPro; IPR023362; PH-BEACH_dom.
DR InterPro; IPR011993; PH-like_dom_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR Pfam; PF02138; Beach; 1.
DR Pfam; PF14844; PH_BEACH; 1.
DR Pfam; PF00400; WD40; 2.
DR SMART; SM01026; Beach; 1.
DR SMART; SM00320; WD40; 5.
DR SUPFAM; SSF48371; SSF48371; 2.
DR SUPFAM; SSF50978; SSF50978; 1.
DR SUPFAM; SSF81837; SSF81837; 1.
DR PROSITE; PS50197; BEACH; 1.
DR PROSITE; PS51783; PH_BEACH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Autophagy; Endoplasmic reticulum; Endosome;
KW Reference proteome; Repeat; WD repeat.
FT CHAIN 1..3184
FT /note="WD repeat- and FYVE domain-containing protein 4"
FT /id="PRO_0000251254"
FT DOMAIN 2385..2510
FT /note="BEACH-type PH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01119"
FT DOMAIN 2527..2821
FT /note="BEACH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00026"
FT REPEAT 2863..2922
FT /note="WD 1"
FT REPEAT 2923..2972
FT /note="WD 2"
FT REPEAT 2973..3014
FT /note="WD 3"
FT REPEAT 3015..3057
FT /note="WD 4"
FT REPEAT 3058..3141
FT /note="WD 5"
FT REPEAT 3142..3184
FT /note="WD 6"
FT REGION 1..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 944..993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1837..1869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2309..2335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3107..3128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 944..961
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1838..1852
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2309..2328
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 627..654
FT /note="SLLRILVTPKGRAAFRVSSGFNGLLSLL -> VISSPPLRLASLWICTKRST
FT FAQAFVFM (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_020750"
FT VAR_SEQ 655..3184
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_020751"
FT VAR_SEQ 1007..1042
FT /note="SPRNLQPQRAALAPSFVEFDMSVEGYGCLFIPTLST -> FPACWKPNIWKD
FT NLAQKPVAGDAAQCNIFPPASSVL (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_035683"
FT VAR_SEQ 1043..3184
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_035684"
FT VAR_SEQ 2862..2920
FT /note="DMYLFSLGSESPKGAIGHIVSTEKTILAVERNKVLLPPLWNRTFSWGFDDFS
FT CCLGSYG -> GDSLAMHCLVSCPRVVPSVAGFLWRSSTTYLGKVLGTDFEWLHLQSQP
FT DSRGVSSLGSV (in isoform 5)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_035685"
FT VAR_SEQ 2921..3184
FT /note="Missing (in isoform 5)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_035686"
FT VAR_SEQ 3111..3184
FT /note="RPAGEEPPAQPPSPRGHKWEKNLALSRELDVSIALTGKPSKTSPAVTALAVS
FT RNHTKLLVGDERGRIFCWSADG -> LQMGRKREAAEALAQQCQAEGGRGDWGLSSAYR
FT RNPQGLLPHSSQGRASGNHSSAAQPSPWPMGLL (in isoform 4)"
FT /evidence="ECO:0000303|Ref.4"
FT /id="VSP_035687"
FT VARIANT 214
FT /note="S -> P (in dbSNP:rs7072606)"
FT /evidence="ECO:0000269|PubMed:14702039"
FT /id="VAR_027684"
FT VARIANT 944
FT /note="S -> F (in dbSNP:rs12242384)"
FT /id="VAR_027685"
FT VARIANT 2527
FT /note="S -> N (in dbSNP:rs2663046)"
FT /evidence="ECO:0000269|PubMed:10997877,
FT ECO:0000269|PubMed:15489334, ECO:0000269|Ref.4"
FT /id="VAR_047261"
FT CONFLICT 1337
FT /note="A -> G (in Ref. 4; BAB84911)"
FT /evidence="ECO:0000305"
FT CONFLICT 2595
FT /note="D -> G (in Ref. 3; AAH47574)"
FT /evidence="ECO:0000305"
FT CONFLICT 2683
FT /note="E -> K (in Ref. 3; AAH47574)"
FT /evidence="ECO:0000305"
FT CONFLICT 3118
FT /note="P -> L (in Ref. 3; AAH47574 and 5; BAB13433)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 3184 AA; 353610 MW; 6743F924EE534E44 CRC64;
MEAEDLSKAE DRNEDPGSKN EGQLAAVQPD VPHGGQSSSP TALWDMLERK FLEYQQLTHK
SPIERQKSLL SLLPLFLKAW EHSVGIICFP SLQRLAEDVS DQLAQQLQKA LVGKPAEQAR
LAAGQLLWWK GDVDQDGYLL LKSVYVLTGT DSETLGRVAE SGLPALLLQC LYLFFVFPLD
KDELLESDLQ VQKMFVQMLL NICSDSQGLE GLLSGSELQS LLIATTCLRE HSCCFWKEPT
FCVLRAISKA QNLSIIQYLQ ATDCVRLSLQ NLSRLTDTLP APEVSEAVSL ILGFVKDSYP
VSSALFLEFE NSEGYPLLLK VLLRYDGLTQ SEVDPHLEEL LGLVVWLTTC GRSELKVFDS
ITYPQLEGFK FHHEASGVTV KNLQAFQVLQ NVFHKASDSV LCIQVLSVIR TMWAWNARNF
FLLEWTLQPI SQFVEIMPLK PAPVQEHFFQ LLEALVFELH YVPHEILRKV QHLIKESPGP
SCTLMALQSI LSIAGGDPLF TDIFRDSGLL GLLLAQLRKQ AKIMRKSGNK VSTPGVQDPE
RELTCVMLRI VVTLLKGSVR NAVVLKDHGM VPFIKIFLDD ECYREASLSI LEQLSAINAE
EYMSIIVGAL CSSTQGELQL KLDLLKSLLR ILVTPKGRAA FRVSSGFNGL LSLLSDLEGS
LQEPPLQAWG AVSPRQTLEL VLYTLCAVSA ALHWDPVNGY FFRRNGLFEK LAEDLCLLGC
FGALEEEGNL LRSWVDTKAR PFADLLGTAF SSSGSLPPRI QSCLQILGFL DSMASGTLHL
RGDLKESLRT KQGPVVDVQK GETGSDPQRN FKQWPDLEER MDEGDAAIMH PGVVCIMVRL
LPRLYHEDHP QLSEEIQCSL ASHIQSLVKS EKNRQVMCEA GLLGTLMASC HRALVTSGSP
LHSRLIRIFE KLASQAIEPD VLRQFLGLGI PSSLSATTKI LDSSHTHRGN PGCSGSQTAQ
GLAEGPWPAA PDAGLHPGVT QAPQPLGESQ DSTTALQTAL SLISMTSPRN LQPQRAALAP
SFVEFDMSVE GYGCLFIPTL STVMGTSTEY SVSGGIGTGA TRPFPPPGGL TFSCWFLISR
HGAATEGHPL RFLTLVRHLA RTEQPFVCFS VSLCPDDLSL VVSTEEKEFQ PLDVMEPEDD
SEPSAGCQLQ VRCGQLLACG QWHHLAVVVT KEMKRHCTVS TCLDGQVIGS AKMLYIQALP
GPFLSMDPSA FVDVYGYIAT PRVWKQKSSL IWRLGPTYLF EEAISMETLE VINKLGPRYC
GNFQAVHVQG EDLDSEATPF VAEERVSFGL HIASSSITSV ADIRNAYNEV DSRLIAKEMN
ISSRDNAMPV FLLRNCAGHL SGSLRTIGAV AVGQLGVRVF HSSPAASSLD FIGGPAILLG
LISLATDDHT MYAAVKVLHS VLTSNAMCDF LMQHICGYQI MAFLLRKKAS LLNHRIFQLI
LSVAGTVELG FRSSAITNTG VFQHILCNFE LWMNTADNLE LSLFSHLLEI LQSPREGPRN
AEAAHQAQLI PKLIFLFNEP SLIPSKISTI IGILACQLRG HFSTQDLLRI GLFVVYTLKP
SSVNERQICM DGALDPSLPA GSQTSGKTIW LRNQLLEMLL SVISSPQLHL SSESKEEMFL
KLGPDWFLLL LQGHLHASTT VLALKLLLYF LASPSLRTRF RDGLCAGSWV ERSTEGVDIV
MDNLKSQSPL PEQSPCLLPG FRVLNDFLAH HVHIPEVYLI VSTFFLQTPL TELMDGPKDS
LDAMLQWLLQ RHHQEEVLQA GLCTEGALLL LEMLKATMSQ PLAGSEDGAW AQTFPASVLQ
FLSLVHRTYP QDPAWRAPEF LQTLAIAAFP LGAQKGVGAE STRNTSSPEA AAEGDSTVEG
LQAPTKAHPA RRKLREFTQL LLRELLLGAS SPKQWLPLEV LLEASPDHAT SQQKRDFQSE
VLLSAMELFH MTSGGDAAMF RDGKEPQPSA EAAAAPSLAN ISCFTQKLVE KLYSGMFSAD
PRHILLFILE HIMVVIETAS SQRDTVLSTL YSSLNKVILY CLSKPQQSLS ECLGLLSILG
FLQEHWDVVF ATYNSNISFL LCLMHCLLLL NERSYPEGFG LEPKPRMSTY HQVFLSPNED
VKEKREDLPS LSDVQHNIQK TVQTLWQQLV AQRQQTLEDA FKIDLSVKPG EREVKIEEVT
PLWEETMLKA WQHYLASEKK SLASRSNVAH HSKVTLWSGS LSSAMKLMPG RQAKDPECKT
EDFVSCIENY RRRGQELYAS LYKDHVQRRK CGNIKAANAW ARIQEQLFGE LGLWSQGEET
KPCSPWELDW REGPARMRKR IKRLSPLEAL SSGRHKESQD KNDHISQTNA ENQDELTLRE
AEGEPDEVGV DCTQLTFFPA LHESLHSEDF LELCRERQVI LQELLDKEKV TQKFSLVIVQ
GHLVSEGVLL FGHQHFYICE NFTLSPTGDV YCTRHCLSNI SDPFIFNLCS KDRSTDHYSC
QCHSYADMRE LRQARFLLQD IALEIFFHNG YSKFLVFYNN DRSKAFKSFC SFQPSLKGKA
TSEDTLSLRR YPGSDRIMLQ KWQKRDISNF EYLMYLNTAA GRTCNDYMQY PVFPWVLADY
TSETLNLANP KIFRDLSKPM GAQTKERKLK FIQRFKEVEK TEGDMTVQCH YYTHYSSAII
VASYLVRMPP FTQAFCALQG GSFDVADRMF HSVKSTWESA SRENMSDVRE LTPEFFYLPE
FLTNCNGVEF GCMQDGTVLG DVQLPPWADG DPRKFISLHR KALESDFVSA NLHHWIDLIF
GYKQQGPAAV DAVNIFHPYF YGDRMDLSSI TDPLIKSTIL GFVSNFGQVP KQLFTKPHPA
RTAAGKPLPG KDVSTPVSLP GHPQPFFYSL QSLRPSQVTV KDMYLFSLGS ESPKGAIGHI
VSTEKTILAV ERNKVLLPPL WNRTFSWGFD DFSCCLGSYG SDKVLMTFEN LAAWGRCLCA
VCPSPTTIVT SGTSTVVCVW ELSMTKGRPR GLRLRQALYG HTQAVTCLAA SVTFSLLVSG
SQDCTCILWD LDHLTHVTRL PAHREGISAI TISDVSGTIV SCAGAHLSLW NVNGQPLASI
TTAWGPEGAI TCCCLMEGPA WDTSQIIITG SQDGMVRVWK TEDVKMSVPG RPAGEEPPAQ
PPSPRGHKWE KNLALSRELD VSIALTGKPS KTSPAVTALA VSRNHTKLLV GDERGRIFCW
SADG