CFA47_HUMAN
ID CFA47_HUMAN Reviewed; 3187 AA.
AC Q6ZTR5; A0A140T8X2; A6PW82; B1ARL5; Q5JRM8; Q8N6X8; Q8N9S7;
DT 05-JUL-2005, integrated into UniProtKB/Swiss-Prot.
DT 28-MAR-2018, sequence version 5.
DT 03-AUG-2022, entry version 138.
DE RecName: Full=Cilia- and flagella-associated protein 47 {ECO:0000305};
GN Name=CFAP47 {ECO:0000312|HGNC:HGNC:26708};
GN Synonyms=CHDC2 {ECO:0000312|HGNC:HGNC:26708},
GN CXorf22 {ECO:0000312|HGNC:HGNC:26708},
GN CXorf30 {ECO:0000312|HGNC:HGNC:26708},
GN CXorf59 {ECO:0000312|HGNC:HGNC:26708};
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 1364-3187 (ISOFORM 6), AND VARIANT MET-236.
RC TISSUE=Trachea;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15772651; DOI=10.1038/nature03440;
RA Ross M.T., Grafham D.V., Coffey A.J., Scherer S., McLay K., Muzny D.,
RA Platzer M., Howell G.R., Burrows C., Bird C.P., Frankish A., Lovell F.L.,
RA Howe K.L., Ashurst J.L., Fulton R.S., Sudbrak R., Wen G., Jones M.C.,
RA Hurles M.E., Andrews T.D., Scott C.E., Searle S., Ramser J., Whittaker A.,
RA Deadman R., Carter N.P., Hunt S.E., Chen R., Cree A., Gunaratne P.,
RA Havlak P., Hodgson A., Metzker M.L., Richards S., Scott G., Steffen D.,
RA Sodergren E., Wheeler D.A., Worley K.C., Ainscough R., Ambrose K.D.,
RA Ansari-Lari M.A., Aradhya S., Ashwell R.I., Babbage A.K., Bagguley C.L.,
RA Ballabio A., Banerjee R., Barker G.E., Barlow K.F., Barrett I.P.,
RA Bates K.N., Beare D.M., Beasley H., Beasley O., Beck A., Bethel G.,
RA Blechschmidt K., Brady N., Bray-Allen S., Bridgeman A.M., Brown A.J.,
RA Brown M.J., Bonnin D., Bruford E.A., Buhay C., Burch P., Burford D.,
RA Burgess J., Burrill W., Burton J., Bye J.M., Carder C., Carrel L.,
RA Chako J., Chapman J.C., Chavez D., Chen E., Chen G., Chen Y., Chen Z.,
RA Chinault C., Ciccodicola A., Clark S.Y., Clarke G., Clee C.M., Clegg S.,
RA Clerc-Blankenburg K., Clifford K., Cobley V., Cole C.G., Conquer J.S.,
RA Corby N., Connor R.E., David R., Davies J., Davis C., Davis J., Delgado O.,
RA Deshazo D., Dhami P., Ding Y., Dinh H., Dodsworth S., Draper H.,
RA Dugan-Rocha S., Dunham A., Dunn M., Durbin K.J., Dutta I., Eades T.,
RA Ellwood M., Emery-Cohen A., Errington H., Evans K.L., Faulkner L.,
RA Francis F., Frankland J., Fraser A.E., Galgoczy P., Gilbert J., Gill R.,
RA Gloeckner G., Gregory S.G., Gribble S., Griffiths C., Grocock R., Gu Y.,
RA Gwilliam R., Hamilton C., Hart E.A., Hawes A., Heath P.D., Heitmann K.,
RA Hennig S., Hernandez J., Hinzmann B., Ho S., Hoffs M., Howden P.J.,
RA Huckle E.J., Hume J., Hunt P.J., Hunt A.R., Isherwood J., Jacob L.,
RA Johnson D., Jones S., de Jong P.J., Joseph S.S., Keenan S., Kelly S.,
RA Kershaw J.K., Khan Z., Kioschis P., Klages S., Knights A.J., Kosiura A.,
RA Kovar-Smith C., Laird G.K., Langford C., Lawlor S., Leversha M., Lewis L.,
RA Liu W., Lloyd C., Lloyd D.M., Loulseged H., Loveland J.E., Lovell J.D.,
RA Lozado R., Lu J., Lyne R., Ma J., Maheshwari M., Matthews L.H.,
RA McDowall J., McLaren S., McMurray A., Meidl P., Meitinger T., Milne S.,
RA Miner G., Mistry S.L., Morgan M., Morris S., Mueller I., Mullikin J.C.,
RA Nguyen N., Nordsiek G., Nyakatura G., O'dell C.N., Okwuonu G., Palmer S.,
RA Pandian R., Parker D., Parrish J., Pasternak S., Patel D., Pearce A.V.,
RA Pearson D.M., Pelan S.E., Perez L., Porter K.M., Ramsey Y., Reichwald K.,
RA Rhodes S., Ridler K.A., Schlessinger D., Schueler M.G., Sehra H.K.,
RA Shaw-Smith C., Shen H., Sheridan E.M., Shownkeen R., Skuce C.D.,
RA Smith M.L., Sotheran E.C., Steingruber H.E., Steward C.A., Storey R.,
RA Swann R.M., Swarbreck D., Tabor P.E., Taudien S., Taylor T., Teague B.,
RA Thomas K., Thorpe A., Timms K., Tracey A., Trevanion S., Tromans A.C.,
RA d'Urso M., Verduzco D., Villasana D., Waldron L., Wall M., Wang Q.,
RA Warren J., Warry G.L., Wei X., West A., Whitehead S.L., Whiteley M.N.,
RA Wilkinson J.E., Willey D.L., Williams G., Williams L., Williamson A.,
RA Williamson H., Wilming L., Woodmansey R.L., Wray P.W., Yen J., Zhang J.,
RA Zhou J., Zoghbi H., Zorilla S., Buck D., Reinhardt R., Poustka A.,
RA Rosenthal A., Lehrach H., Meindl A., Minx P.J., Hillier L.W., Willard H.F.,
RA Wilson R.K., Waterston R.H., Rice C.M., Vaudin M., Coulson A., Nelson D.L.,
RA Weinstock G., Sulston J.E., Durbin R.M., Hubbard T., Gibbs R.A., Beck S.,
RA Rogers J., Bentley D.R.;
RT "The DNA sequence of the human X chromosome.";
RL Nature 434:325-337(2005).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 1413-3187 (ISOFORM 6), AND VARIANT TYR-634.
RC TISSUE=Cerebellum, and Lung;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [4]
RP VARIANTS SPGFX3 GLY-1742; ASN-2385 AND THR-2890, FUNCTION, TISSUE
RP SPECIFICITY, SUBCELLULAR LOCATION, CHARACTERIZATION OF VARIANTS SPGFX3
RP GLY-1742; ASN-2385 AND THR-2890, AND INTERACTION WITH CFAP65.
RX PubMed=33472045; DOI=10.1016/j.ajhg.2021.01.002;
RA Liu C., Tu C., Wang L., Wu H., Houston B.J., Mastrorosa F.K., Zhang W.,
RA Shen Y., Wang J., Tian S., Meng L., Cong J., Yang S., Jiang Y., Tang S.,
RA Zeng Y., Lv M., Lin G., Li J., Saiyin H., He X., Jin L., Toure A.,
RA Ray P.F., Veltman J.A., Shi Q., O'Bryan M.K., Cao Y., Tan Y.Q., Zhang F.;
RT "Deleterious variants in X-linked CFAP47 induce asthenoteratozoospermia and
RT primary male infertility.";
RL Am. J. Hum. Genet. 108:309-323(2021).
CC -!- FUNCTION: Plays a role in flagellar formation and sperm motility.
CC {ECO:0000269|PubMed:33472045}.
CC -!- SUBUNIT: Interacts with CFAP65. {ECO:0000269|PubMed:33472045}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytoskeleton, flagellum basal body
CC {ECO:0000269|PubMed:33472045}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=5;
CC IsoId=Q6ZTR5-5; Sequence=Displayed;
CC Name=1;
CC IsoId=Q6ZTR5-1; Sequence=VSP_057885, VSP_057886;
CC Name=2;
CC IsoId=Q6ZTR5-2; Sequence=VSP_014372, VSP_014373;
CC Name=6;
CC IsoId=Q6ZTR5-6; Sequence=VSP_059490, VSP_059491;
CC -!- TISSUE SPECIFICITY: Highly expressed in spermatzoa (at protein level).
CC {ECO:0000269|PubMed:33472045}.
CC -!- DISEASE: Spermatogenic failure, X-linked, 3 (SPGFX3) [MIM:301059]: An
CC infertility disorder characterized by asthenoteratozoospermia.
CC Spermatozoa exhibit multiple morphologic abnormalities, including
CC absent, short, coiled, and irregular-caliber flagella.
CC {ECO:0000269|PubMed:33472045}. Note=The disease is caused by variants
CC affecting the gene represented in this entry.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAI01699.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=AAI01701.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAC04251.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK093920; BAC04251.1; ALT_INIT; mRNA.
DR EMBL; AK126295; BAC86517.1; -; mRNA.
DR EMBL; AC233304; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL590065; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL603753; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL606467; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL606516; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KF459051; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC027936; AAH27936.1; -; mRNA.
DR EMBL; BC101698; AAI01699.1; ALT_INIT; mRNA.
DR EMBL; BC101700; AAI01701.1; ALT_INIT; mRNA.
DR CCDS; CCDS14237.2; -. [Q6ZTR5-1]
DR CCDS; CCDS83464.1; -. [Q6ZTR5-5]
DR RefSeq; NP_001291477.1; NM_001304548.1. [Q6ZTR5-5]
DR RefSeq; NP_689845.2; NM_152632.3. [Q6ZTR5-1]
DR RefSeq; XP_016884942.1; XM_017029453.1. [Q6ZTR5-6]
DR BioGRID; 127994; 3.
DR BioGRID; 130387; 4.
DR IntAct; Q6ZTR5; 5.
DR STRING; 9606.ENSP00000297866; -.
DR iPTMnet; Q6ZTR5; -.
DR PhosphoSitePlus; Q6ZTR5; -.
DR BioMuta; CFAP47; -.
DR DMDM; 74751127; -.
DR jPOST; Q6ZTR5; -.
DR MassIVE; Q6ZTR5; -.
DR PaxDb; Q6ZTR5; -.
DR PeptideAtlas; Q6ZTR5; -.
DR PRIDE; Q6ZTR5; -.
DR ProteomicsDB; 1739; -.
DR ProteomicsDB; 68288; -. [Q6ZTR5-1]
DR ProteomicsDB; 68289; -. [Q6ZTR5-2]
DR ProteomicsDB; 72578; -.
DR Antibodypedia; 42875; 78 antibodies from 18 providers.
DR DNASU; 286464; -.
DR Ensembl; ENST00000297866.9; ENSP00000297866.5; ENSG00000165164.14. [Q6ZTR5-1]
DR Ensembl; ENST00000378653.8; ENSP00000367922.5; ENSG00000165164.14. [Q6ZTR5-5]
DR Ensembl; ENST00000493930.1; ENSP00000433564.1; ENSG00000165164.14. [Q6ZTR5-2]
DR GeneID; 286464; -.
DR KEGG; hsa:286464; -.
DR MANE-Select; ENST00000378653.8; ENSP00000367922.5; NM_001304548.2; NP_001291477.1.
DR UCSC; uc004ddj.4; human. [Q6ZTR5-5]
DR UCSC; uc011mkc.4; human.
DR UCSC; uc064ymy.1; human.
DR CTD; 286464; -.
DR GeneCards; CFAP47; -.
DR HGNC; HGNC:26708; CFAP47.
DR HPA; ENSG00000165164; Tissue enhanced (brain, choroid plexus, fallopian tube).
DR MalaCards; CFAP47; -.
DR MIM; 301057; gene.
DR MIM; 301059; phenotype.
DR neXtProt; NX_Q6ZTR5; -.
DR OpenTargets; ENSG00000165164; -.
DR Orphanet; 137893; Male infertility due to large-headed multiflagellar polyploid spermatozoa.
DR PharmGKB; PA145149062; -.
DR VEuPathDB; HostDB:ENSG00000165164; -.
DR eggNOG; ENOG502QQ4Q; Eukaryota.
DR GeneTree; ENSGT00940000163202; -.
DR HOGENOM; CLU_008435_0_0_1; -.
DR InParanoid; Q6ZTR5; -.
DR OMA; RDCKREL; -.
DR OrthoDB; 37218at2759; -.
DR TreeFam; TF328359; -.
DR TreeFam; TF337362; -.
DR TreeFam; TF343741; -.
DR PathwayCommons; Q6ZTR5; -.
DR SignaLink; Q6ZTR5; -.
DR BioGRID-ORCS; 286464; 49 hits in 709 CRISPR screens.
DR ChiTaRS; CFAP47; human.
DR GenomeRNAi; 286464; -.
DR Pharos; Q6ZTR5; Tdark.
DR PRO; PR:Q6ZTR5; -.
DR Proteomes; UP000005640; Chromosome X.
DR RNAct; Q6ZTR5; protein.
DR Bgee; ENSG00000165164; Expressed in bronchial epithelial cell and 64 other tissues.
DR ExpressionAtlas; Q6ZTR5; baseline and differential.
DR Genevisible; Q6ZTR5; HS.
DR GO; GO:0005929; C:cilium; IBA:GO_Central.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW.
DR GO; GO:0097224; C:sperm connecting piece; IDA:UniProtKB.
DR GO; GO:0060271; P:cilium assembly; IBA:GO_Central.
DR GO; GO:0007288; P:sperm axoneme assembly; IMP:UniProtKB.
DR Gene3D; 1.10.418.10; -; 1.
DR Gene3D; 2.60.40.10; -; 6.
DR InterPro; IPR001715; CH-domain.
DR InterPro; IPR036872; CH_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR SUPFAM; SSF47576; SSF47576; 1.
DR PROSITE; PS50021; CH; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Cell projection; Cilium; Cytoplasm; Cytoskeleton;
KW Flagellum; Reference proteome.
FT CHAIN 1..3187
FT /note="Cilia- and flagella-associated protein 47"
FT /id="PRO_0000079731"
FT DOMAIN 1746..1869
FT /note="Calponin-homology (CH)"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00044"
FT REGION 2024..2052
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 752..754
FT /note="GPS -> VRY (in isoform 2)"
FT /evidence="ECO:0000305|PubMed:15489334"
FT /id="VSP_014372"
FT VAR_SEQ 755..3187
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000305|PubMed:15489334"
FT /id="VSP_014373"
FT VAR_SEQ 949..976
FT /note="LGRTKVLLLQPRILFSNCPQGLTTWRKA -> VIIFLEHGFCFEGYEFVGYT
FT LVYIVTYI (in isoform 1)"
FT /id="VSP_057885"
FT VAR_SEQ 977..3187
FT /note="Missing (in isoform 1)"
FT /evidence="ECO:0000305|PubMed:14702039"
FT /id="VSP_057886"
FT VAR_SEQ 1846..1927
FT /note="ATDICDPNPILMLMLCVYMYERLPTYLPKKVVSFECTLHDTVLNKILLKNSS
FT SRNLVYNARIVGRDAADFSLSQKGNVVTIS -> VCLRVFPKEMDAGVSGLRKEDMPSV
FT WVVTIRLAVAVARTKRQKKGDIQFACFFVCLFFFIFVVISFSLWSRMHFFLLLPLDI
FT (in isoform 6)"
FT /id="VSP_059490"
FT VAR_SEQ 1928..3187
FT /note="Missing (in isoform 6)"
FT /id="VSP_059491"
FT VARIANT 236
FT /note="V -> M (in dbSNP:rs2336029)"
FT /evidence="ECO:0000269|PubMed:14702039"
FT /id="VAR_056856"
FT VARIANT 345
FT /note="C -> R (in dbSNP:rs6632427)"
FT /id="VAR_060280"
FT VARIANT 561
FT /note="A -> T (in dbSNP:rs11795910)"
FT /id="VAR_060281"
FT VARIANT 634
FT /note="H -> Y (in dbSNP:rs17852470)"
FT /evidence="ECO:0000269|PubMed:15489334"
FT /id="VAR_060282"
FT VARIANT 1742
FT /note="S -> G (in SPGFX3; unknown pathological
FT significance; loss of protein expression; loss of CFAP65
FT flagellar location; unknown pathological significance;
FT dbSNP:rs1214810484)"
FT /evidence="ECO:0000269|PubMed:33472045"
FT /id="VAR_085272"
FT VARIANT 2385
FT /note="I -> N (in SPGFX3; unknown pathological
FT significance; loss of protein expression; loss of CFAP65
FT flagellar location)"
FT /evidence="ECO:0000269|PubMed:33472045"
FT /id="VAR_085273"
FT VARIANT 2890
FT /note="P -> T (in SPGFX3; unknown pathological
FT significance; strongly decreased protein expression;
FT dbSNP:rs1317932026)"
FT /evidence="ECO:0000269|PubMed:33472045"
FT /id="VAR_085274"
FT CONFLICT 522
FT /note="A -> T (in Ref. 1; BAC86517)"
FT /evidence="ECO:0000305"
FT CONFLICT 903
FT /note="A -> S (in Ref. 1; BAC86517)"
FT /evidence="ECO:0000305"
FT VARIANT Q6ZTR5-1:964
FT /note="F -> L (in dbSNP:rs6629027)"
FT /evidence="ECO:0000305"
FT /id="VAR_082879"
SQ SEQUENCE 3187 AA; 361625 MW; 63384F4597AAB067 CRC64;
MNTQKGSLTI NVHRGSLAMS IQRGSLVPRD MDSSGRDMQL RVIPAEVKFL DTMAGRVYRL
PITVHNICRW NQKIRFKEPV KPQFKLMLTS LDKELASGLQ MTAMVEYHPD KDEDTFDRLL
ISIENKTTEI PLIGLIPSCQ LEIESVVNFG TLVANSKVYS KEITITNHGK APGIFKAEYH
GQLPILIFPT SGIVDAKSSM VIKVDFCADQ PRIVDEEAIV ILQGQPEMLL SIKAHVVEQI
IELLSMSSDR RLECIHFGPV FFGSSKIKHA RVYNNSPEPI NWVAIIQDDA VGEELGTDIQ
QRTDIALNNL TYIRKIKNID TTIIISCLPN EGTLQPYQKT VITFCFTPKL MAVGKKDIGP
SYRQDYALFL RFESVGSKDG FLRDDDYKTI KSERFQKVEL ALTGTGLPVL LQFDPGPVLN
FKPCFMGERS EIQCIIKNQC ELLPVTYHFK KTANFEIDPE KGKITGGGMV DVMCSFVPHQ
LGVFKVKQMI EIIGLVAEED LQSLSVKSFH HVYLAFNSIC KASTKKVVMK FDPGILPSIR
NPTGKFVVKD LAKRKNYAPV AMLQSAMTRT HNHRSCEEPV KDMLLAFPND RAATIRSKDH
HKHFRPIFTK VPRFNYVNHD FAYTTFEKQQ KKLHENYYAM YLKYLRSVRL QKKQAERERM
YSYDDTDIGL EPGSGLKSPS LSEAEIEEEL SSAANSIRAN RLLTTRGIAS QEEESVRRKV
LKGLKSEPST PQEKHDCSLM LTPKQIHQVI VGPSVLNFGN ICVNSPNTHL LHVINMLPMH
VLLQLDTDLE ELQKTNQFSY VILPTSSTYI SMVFDSPTIG KFWKSFTFTV NNVPSGHILV
VAVVQPVTLE LSSNELVLRP RGFFMKTCFR GTVRLYNRQN CCAQFQWQPV NTGRGIAFSI
CPAKGTVEAY SSLECEVTWQ QGFSSPEEGE FILHVFQGNA LKLKCVAHLG RTKVLLLQPR
ILFSNCPQGL TTWRKAILQN VGQNHAYFKV CSQSLLPIIN IIPSQGIVPF GGITVLNISC
KPTVAEKFDT RAKVSIRHAN VIDLRIGGSA EIADVEINPD VFNFSGAYIG GTQIIPFVIK
NKGITRARVE FNLKDFPDFS MDLKDKSEEF KDPAVPYIYS LELEENTSLE CSITFSPKEV
TVVEFIIQVQ INFFESSKLY TKYLSSSPSN PKTVPLIRPC YVQATALQSP LNLSSTKFVF
EIPLHEMNPN NKVTKTQNLV LYNITKHHVT WTLDLSNTGK LFKDGTFKFS VLNGILRPNE
KYNVSISFCP NRPGTYTADI PMLLNYIPVC YKILHLTGEV KSPELLFDPP FIFFTPVPLD
ITTVMDINIL PQNYFRNSTL CVQIPTVRLL DGEEIHPLSV KFPKGRVIPG SHSGINNKLT
CHLSFKSSKP VSFFTNLLFC DDRKNWFSLP VTATAENCIL TIYPYMAIHL DKQNIILKND
KDEYLKKTRD GVLPPYQDAK PPSPASIKKT YTTSKFNDAE PAKGNLFIGV EVLPENLHLD
ESETSEEDHG SLEKEKYEQF LSLEEGTKAH YFFEKVVNAA QTWFSLFGWP EGPHSFSIPE
TIRRDVYKMQ FYSSTSPPQK FSRQNDFSKY NKTIYDVLLH LSGKMPPGIN SSQSLPVDNH
EKRVIQLHLQ HSSLLDFLNA QGGCISHVLP EFLLEPEDYK RWIEIMSSTN TMPVSSCTPK
KKCSIVIEMS KFEAWSKRAW TDVFLQIYKV LVLSRVVPYC SNNMPPICVQ NTPKVNPCFA
SSNIYSDSER ILLSWMNINY ENTRHVIWKN CHKDVIPSER WIVNFDKDLS DGLVFATQLG
AYCPFLIESH FINMYTRPKS PEEYLHNCLI IVNTLYEIDF DVEIQATDIC DPNPILMLML
CVYMYERLPT YLPKKVVSFE CTLHDTVLNK ILLKNSSSRN LVYNARIVGR DAADFSLSQK
GNVVTISPRN EINVTLKFTS RFIRPAEASL LLISKPKNAV RGITMTFALK GKVLDFKAID
IIKCESPCYQ FQEVTVNVKN PFHTAGDFSV ILVESSTFVS SPTKLTESRQ YPKHDDDMSS
SGSDTDQGCS DSPNVLHTSI KSTFIREFFC SMHTVHLGVK GTSSLELRFL PFNMHVRYCV
IILSNKKIGQ LIYVAEGKGM TPLPSSCLPM NTSSSPVYYS TTREEGPNKK YPVLYLKCKP
YQILYVDLKL PMTNEAKEKA LAFAAQQQMS SIEYERRLIT GTLESSSIRV AIALLGLTKI
ETLMLFRISK LRKPKTVSYT TEVSLPKYFY IPEKISIPWI PEPQVIKLSK AKASDGSVPL
PLQFLPLQSG RYPCKILLKS RYDVRAYYVE GIVNEEQPEA KFEFETPAFE ALTQNIPIKN
QTNDKWTFQV TIEGEWFYGP VDLHVGPDEI VEYPLTFKPI FECVITGKLI LQNEVDGREH
IFDIKGVGKK PSALEHITVE CQVGNVTQKH ITLPHFTNTA LTFKVTADLP IVWGNPQITV
YPYKEILYLI HVRPWKRGIL KGTITFSTTR RCTTRRKHDD YEEDTDQDQA LSCLDSITEQ
SSILDDADTY GNFNNLRFWY NLEIHSTPGP PIEIMEMTCI ALDSTCIEIP LSNPKDRGLH
LEVQLTSAAL NGDNEIILSP LQCTKYIVWY SPATTGYSDE SIIFQPEMAE EFWYLLKLTI
ELPKPTTMPE IQCDLGKHVT QIIPLVNCTH ETLKLQVTNS NPENFVLDIN RKSQLIISPH
STTELPVLFY PSALGRADHQ ACINFYCTQF TEWKFYLSGV GLFPQPLDTE RITTRIGLQS
TIVIPFKNPT MEDVLIDIIL TSVEHPRNLV MDHCWDSFIY ESSAFRFSSP SEIQGIALPP
KGNIDISLLF IPQIMKLHKT MVIIEMTKAN GKYWPIDNFD ELDIKFKSIV GIDSEEIQAI
HWIYPIVGLP QAPPPKSPPV VIQCQSRKRA EEKVEIILNA GFFGFSLTPD LTEVLVIPKR
NSHNFCEDPN EIPKIHEFEY EIQFESEAMK SKLESCVALY MIEKSYDIMA KRITFIFNLV
FTPKKPLRSH ITLKIECVTE GIWKFPIMLI ATEPDTDAVI DIEGVGLFKE SVFELRLKSQ
TRNPEPFTAH FLPGSDLEFF VKPQAGELLP FNTNGTLITV GFKPKMYCRK YKATLVIQTE
EMYWKYEING LTPTTVPPKN AKAKIDATHK THDNMPVRPH NFVRENTKLI RTGVSSTIKG
APLVKNQ