DNHD1_HUMAN
ID DNHD1_HUMAN Reviewed; 4753 AA.
AC Q96M86; Q2NKK8; Q6UWI9; Q8NAA2; Q8TEE6; Q9NSZ9;
DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot.
DT 13-JUL-2010, sequence version 2.
DT 03-AUG-2022, entry version 148.
DE RecName: Full=Dynein heavy chain domain-containing protein 1;
DE AltName: Full=Dynein heavy chain domain 1-like protein;
DE AltName: Full=Protein CCDC35;
GN Name=DNHD1; Synonyms=C11orf47, CCDC35, DHCD1, DNHD1L;
GN ORFNames=UNQ5781/PRO12970;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 3733-4753 (ISOFORM 1).
RC TISSUE=Spleen, and Testis;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), AND NUCLEOTIDE SEQUENCE
RP [LARGE SCALE MRNA] OF 3699-4753 (ISOFORM 1).
RC TISSUE=Colon;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16554811; DOI=10.1038/nature04632;
RA Taylor T.D., Noguchi H., Totoki Y., Toyoda A., Kuroki Y., Dewar K.,
RA Lloyd C., Itoh T., Takeda T., Kim D.-W., She X., Barlow K.F., Bloom T.,
RA Bruford E., Chang J.L., Cuomo C.A., Eichler E., FitzGerald M.G.,
RA Jaffe D.B., LaButti K., Nicol R., Park H.-S., Seaman C., Sougnez C.,
RA Yang X., Zimmer A.R., Zody M.C., Birren B.W., Nusbaum C., Fujiyama A.,
RA Hattori M., Rogers J., Lander E.S., Sakaki Y.;
RT "Human chromosome 11 DNA sequence and analysis including novel gene
RT identification.";
RL Nature 440:497-500(2006).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1312-2302 (ISOFORM 1).
RC TISSUE=Spleen;
RX PubMed=12693554; DOI=10.1093/dnares/10.1.49;
RA Jikuya H., Takano J., Kikuno R., Hirosawa M., Nagase T., Nomura N.,
RA Ohara O.;
RT "Characterization of long cDNA clones from human adult spleen. II. The
RT complete sequences of 81 cDNA clones.";
RL DNA Res. 10:49-57(2003).
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1702-2302 (ISOFORM 1).
RC TISSUE=Testis;
RX PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D.,
RA Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A.,
RA Wiemann S., Schupp I.;
RT "The full-ORF clone resource of the German cDNA consortium.";
RL BMC Genomics 8:399-399(2007).
RN [7]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 4175-4753 (ISOFORM 1).
RX PubMed=12975309; DOI=10.1101/gr.1293003;
RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J.,
RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,
RA Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E., Heldens S., Huang A.,
RA Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D.,
RA Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L.,
RA Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C.,
RA Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J.,
RA Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.;
RT "The secreted protein discovery initiative (SPDI), a large-scale effort to
RT identify novel human secreted and transmembrane proteins: a bioinformatics
RT assessment.";
RL Genome Res. 13:2265-2270(2003).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q96M86-3; Sequence=Displayed;
CC Name=2;
CC IsoId=Q96M86-4; Sequence=VSP_040683, VSP_040684;
CC Name=3;
CC IsoId=Q96M86-5; Sequence=VSP_040682, VSP_040683, VSP_040684;
CC -!- SIMILARITY: Belongs to the dynein heavy chain family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAQ89130.1; Type=Frameshift; Evidence={ECO:0000305};
CC Sequence=BAB85004.1; Type=Miscellaneous discrepancy; Note=Intron retention.; Evidence={ECO:0000305};
CC Sequence=CAB70845.1; Type=Miscellaneous discrepancy; Note=Intron retention.; Evidence={ECO:0000305};
CC Sequence=EAW68692.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK057314; BAB71423.1; -; mRNA.
DR EMBL; AK093028; BAC04023.1; -; mRNA.
DR EMBL; BC111765; AAI11766.1; -; mRNA.
DR EMBL; BC117301; AAI17302.1; -; mRNA.
DR EMBL; BC117303; AAI17304.1; -; mRNA.
DR EMBL; AC009796; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC084337; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471064; EAW68692.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CH471064; EAW68702.1; -; Genomic_DNA.
DR EMBL; AK074178; BAB85004.1; ALT_SEQ; mRNA.
DR EMBL; AL137619; CAB70845.1; ALT_SEQ; mRNA.
DR EMBL; AY358770; AAQ89130.1; ALT_FRAME; mRNA.
DR CCDS; CCDS44532.1; -. [Q96M86-3]
DR CCDS; CCDS7767.1; -. [Q96M86-4]
DR PIR; T46462; T46462.
DR RefSeq; NP_653267.2; NM_144666.2. [Q96M86-3]
DR RefSeq; NP_775860.3; NM_173589.3. [Q96M86-4]
DR SMR; Q96M86; -.
DR BioGRID; 126834; 16.
DR IntAct; Q96M86; 4.
DR STRING; 9606.ENSP00000254579; -.
DR iPTMnet; Q96M86; -.
DR PhosphoSitePlus; Q96M86; -.
DR BioMuta; DNHD1; -.
DR DMDM; 300669633; -.
DR EPD; Q96M86; -.
DR MassIVE; Q96M86; -.
DR PaxDb; Q96M86; -.
DR PeptideAtlas; Q96M86; -.
DR PRIDE; Q96M86; -.
DR ProteomicsDB; 77309; -. [Q96M86-3]
DR ProteomicsDB; 77310; -. [Q96M86-4]
DR Antibodypedia; 48761; 23 antibodies from 8 providers.
DR DNASU; 144132; -.
DR Ensembl; ENST00000254579.11; ENSP00000254579.6; ENSG00000179532.13. [Q96M86-3]
DR Ensembl; ENST00000354685.7; ENSP00000346716.3; ENSG00000179532.13. [Q96M86-4]
DR GeneID; 144132; -.
DR KEGG; hsa:144132; -.
DR MANE-Select; ENST00000254579.11; ENSP00000254579.6; NM_144666.3; NP_653267.2.
DR UCSC; uc001mdp.4; human. [Q96M86-3]
DR CTD; 144132; -.
DR DisGeNET; 144132; -.
DR GeneCards; DNHD1; -.
DR HGNC; HGNC:26532; DNHD1.
DR HPA; ENSG00000179532; Low tissue specificity.
DR MIM; 617277; gene.
DR neXtProt; NX_Q96M86; -.
DR OpenTargets; ENSG00000179532; -.
DR PharmGKB; PA142671968; -.
DR VEuPathDB; HostDB:ENSG00000179532; -.
DR eggNOG; KOG3595; Eukaryota.
DR GeneTree; ENSGT00940000155523; -.
DR HOGENOM; CLU_000038_0_3_1; -.
DR InParanoid; Q96M86; -.
DR OMA; PNLYLER; -.
DR PhylomeDB; Q96M86; -.
DR TreeFam; TF337443; -.
DR PathwayCommons; Q96M86; -.
DR SignaLink; Q96M86; -.
DR BioGRID-ORCS; 144132; 8 hits in 1078 CRISPR screens.
DR ChiTaRS; DNHD1; human.
DR GenomeRNAi; 144132; -.
DR Pharos; Q96M86; Tdark.
DR PRO; PR:Q96M86; -.
DR Proteomes; UP000005640; Chromosome 11.
DR RNAct; Q96M86; protein.
DR Bgee; ENSG00000179532; Expressed in right testis and 111 other tissues.
DR ExpressionAtlas; Q96M86; baseline and differential.
DR Genevisible; Q96M86; HS.
DR GO; GO:0097729; C:9+2 motile cilium; IEA:UniProt.
DR GO; GO:0030286; C:dynein complex; IBA:GO_Central.
DR GO; GO:0070062; C:extracellular exosome; HDA:UniProtKB.
DR GO; GO:0036156; C:inner dynein arm; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0045505; F:dynein intermediate chain binding; IBA:GO_Central.
DR GO; GO:0051959; F:dynein light intermediate chain binding; IBA:GO_Central.
DR GO; GO:0008569; F:minus-end-directed microtubule motor activity; IBA:GO_Central.
DR GO; GO:0003341; P:cilium movement; IBA:GO_Central.
DR GO; GO:0007018; P:microtubule-based movement; IBA:GO_Central.
DR Gene3D; 1.20.140.100; -; 1.
DR Gene3D; 3.20.180.20; -; 1.
DR Gene3D; 3.40.50.300; -; 6.
DR InterPro; IPR035699; AAA_6.
DR InterPro; IPR026983; DHC_fam.
DR InterPro; IPR042222; Dynein_2_N.
DR InterPro; IPR041466; Dynein_AAA5_ext.
DR InterPro; IPR041228; Dynein_C.
DR InterPro; IPR024743; Dynein_HC_stalk.
DR InterPro; IPR024317; Dynein_heavy_chain_D4_dom.
DR InterPro; IPR004273; Dynein_heavy_D6_P-loop.
DR InterPro; IPR013602; Dynein_heavy_linker.
DR InterPro; IPR042228; Dynein_linker_3.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR10676; PTHR10676; 1.
DR Pfam; PF12774; AAA_6; 1.
DR Pfam; PF12780; AAA_8; 1.
DR Pfam; PF08393; DHC_N2; 1.
DR Pfam; PF17852; Dynein_AAA_lid; 1.
DR Pfam; PF18199; Dynein_C; 1.
DR Pfam; PF03028; Dynein_heavy; 1.
DR Pfam; PF12777; MT; 1.
DR SUPFAM; SSF52540; SSF52540; 3.
PE 2: Evidence at transcript level;
KW Alternative splicing; Coiled coil; Reference proteome.
FT CHAIN 1..4753
FT /note="Dynein heavy chain domain-containing protein 1"
FT /id="PRO_0000311985"
FT REGION 2688..2766
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3580..3657
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4669..4697
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 826..858
FT /evidence="ECO:0000255"
FT COILED 936..991
FT /evidence="ECO:0000255"
FT COILED 3125..3227
FT /evidence="ECO:0000255"
FT COILED 3590..3651
FT /evidence="ECO:0000255"
FT COILED 4431..4460
FT /evidence="ECO:0000255"
FT COMPBIAS 2696..2714
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3588..3602
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3615..3634
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..311
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039"
FT /id="VSP_040682"
FT VAR_SEQ 597
FT /note="V -> F (in isoform 2 and isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040683"
FT VAR_SEQ 598..4753
FT /note="Missing (in isoform 2 and isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040684"
FT VARIANT 240
FT /note="V -> E (in dbSNP:rs2555158)"
FT /id="VAR_039308"
FT VARIANT 279
FT /note="Q -> P (in dbSNP:rs11605196)"
FT /id="VAR_056829"
FT VARIANT 317
FT /note="D -> N (in dbSNP:rs2555152)"
FT /id="VAR_039309"
FT VARIANT 403
FT /note="F -> L (in dbSNP:rs11040904)"
FT /id="VAR_056830"
FT VARIANT 418
FT /note="H -> Y (in dbSNP:rs4758423)"
FT /id="VAR_039310"
FT VARIANT 560
FT /note="Q -> E (in dbSNP:rs11603869)"
FT /id="VAR_039311"
FT VARIANT 1358
FT /note="R -> C (in dbSNP:rs12574381)"
FT /id="VAR_033353"
FT VARIANT 1896
FT /note="K -> N (in dbSNP:rs16915277)"
FT /id="VAR_033354"
FT VARIANT 2041
FT /note="F -> L (in dbSNP:rs11825154)"
FT /id="VAR_033355"
FT VARIANT 3830
FT /note="R -> H (in dbSNP:rs10769699)"
FT /id="VAR_037388"
FT VARIANT 4666
FT /note="I -> T (in dbSNP:rs11604362)"
FT /id="VAR_037389"
FT CONFLICT 1312
FT /note="V -> P (in Ref. 5; BAB85004)"
FT /evidence="ECO:0000305"
FT CONFLICT 1909
FT /note="A -> AALLH (in Ref. 5; BAB85004 and 6; CAB70845)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 4753 AA; 533644 MW; ADAB572B9861759B CRC64;
MVPEERRVGL SSDETSSDSL KSWHSICVLD SKEQPLACQQ KQRQFVKPVT ESEQPTVLEL
LLAELRTLFS AVLQDSSPAA WRYLHAVLGL LPPYRELLVG HLDLLPFLEQ LYCWAPWVQT
HLHLDLLGAI VQAFPPDSSL LDSASHADCC PQKRRLHHRP PCPACPFVQA QWSRQQVKEE
LATWLRPLTL PELQRCLGIV GAQVALEEAV WLDGLSLLPL ALAADIPVRY ESSDTDNAEV
EPVGRKETRS QLDYEVPREK AFQKSSTGFS PETSFLDSQV MTALKMERYL KKIHFLYLNV
APSRYFRPYS LMVVPPDKVN PEHYIFSPFG ILHVHPVEGS ETMTLGTWHH HCVLWQQLQF
IPFFKYCLLR KSFTCWKKNV RLQGLHRLQK FLENHLLLAV PHFGAGLLHI SRLLQELHSV
SWLPQELDRC YELLDLQTAL AEEKHKALRL LHRCLNLCTS ILRLVHEDTY HMQQCLQERV
QNCDRIRTGQ GSIYLQRVQH KQLEQKLKQA EAWWLQLGKF ARLVDYMICQ SLISVLEEQI
TSFVANILQA PRQKPFLSSQ LVFDDHGQLS HVPCVENMIQ TLTGGLQSVK TSALQVVQSA
DLKTSSDSLY SEEEDEEEDS KDEFLMPKFQ GQPSDAVSIF CGPNVGLVWP WKSHPIAGIL
EVRGCRLRGQ YFPHNYKQLE EDLDNNPKIQ QALNIQQVLL EGVLCKVQEF CREHHWITGI
YEFLQSWGPQ KLEDMRGGPI KNYVTLVSRL NVWQARVSSM PIELLTKGGL LLLSCHDVQA
EMESKLNSIR KDILAHVQNE CWNLSQQLMT ELTDFMHIFR TINSDIHAIA QCTQKLNEAN
EQYVELEERM EYVRALHELI RNHFSLFSAE NEALDISVRR QFGESPIPPC PPPPQPHLLH
CPLLAPQLLD MWEAFQFEKS QASEFLLSKR HAIMPKLQQL MAAALAELEG LLAKALSGPF
MDPTQDQRST EHQLVSLERQ FQNTVSDLSE LHHAYAIFTE DETPVPLPIC GTRPIVQQQR
IWHLYRVISE NISEWKCMAF AKFSPAMAQE KTEGWLTEAA RMSTTLELHS PVLQHCMRIL
GEFRSYLPLL TKLGSLHPQS LNCQCLLRAL GLGSLQTIEL LTLGQLLTYP LLEFADRINQ
VWQNENERIH AQETIRRLQR YWEARQLRLL NFILHVPYEP PASERSKRQV LRSPQWEVVD
KDSGTFILSD YSNLQDSIQE SLQVLSKILA IEKSGDLNKI ALEWVAIMHG LGALLEVWLT
FQQKWIFLNK VLHEMKIQFP NADLNSRFKV MDDQYRTLMR ISVADPMVLS LVVPSAERSP
YFQGQQLQQL LQAGSVELEG IIMSLESVLY GVCAHFPRLF FLSDSELVAL LAARLESCEA
QLWVRRCFPH VHAVSFRSCP TGEKNTDDWE SSPNTQTQVE ALAVLGAGGE EVKLQGPLPL
HPDLPKWLAS LEKCLRLALV HMLQGCVAAR LARGPSLGEA LKQLPKQNKL YLQLYVQHWI
DLVQAFPWQC VLVAEEVVWR AEMEEALLEW GTLAMVSMHM RKLEVLVNFM RAQRASQGGQ
SLPSVRQTSL LSALLVMAVT HRDIAQLLEQ HQVSDLTDFH WVRQLKYHLG SPHIIPKSPL
QSLKTIASSE PSLSPAACWI DVLGRSFLYN YEYLGPRLGP LPSLLPERPA LVLLLALEEV
ACGTVLGPNG VGKRAIVNSL AQALGRQLVM LPCSPQIEAQ CLSNYLNGAL QGGAWLLLEK
VHQLPPGLLS ALGQRLGELH HLYAPLYQEA SRNTSTIDPT QPQLLGSSFF EKHHVSVRLG
YGCLLVLRAL SSAVPANLHL LLRPVALALP DLRQVAELTL LGAGMRDAFQ MATRLSKFFS
LERELVSGPL PCRLPLLKQI LEDTIRTLNV TKEEPKCQKP RSLAAIEEAA LLRSPLFSIL
NGLHLHNLRG LLCALFPSAS QVLAEPMTYK LMKPLVVEEL QQVGLDPSPD ILGSLEQLSQ
ALSRASGILL LGPAGSGKTT CWHSLFKIQN RLAAMEDTST QGCQPVEITH LYPSGLSPQE
FLGWLEGSCW HHGIFPKVLR AAGQCNNMGQ KRQTEESIGI QHWIICDGAS NGAWLDSITC
LLSELPQLSL PSGQQIARPP GTFLLMEVAD TTGISPTVVG CCALVWCGGE QTWQCILSAL
MASLPYEYRL QHRTVAELNH MAEVLVPATL RFLTCQGVSS LLQVHGQQAV CAGVAEVTSM
ARILHSLLDL HLRLKEEKAP GPEDLSYSDP VAQSFRSSKS SFLNRSQVDS DDVPDKCREH
LLAVSSFLFA LIWGFGAHLP SRFWPIFDTF IRDSISRLSN YPEPPPSALV FDLHVSPEDG
TLVPFTGQYL SSHIKGTLGT FHPSIQTERL LYVVDLLLSG GQPVLLAGEA ATGKSAFVEV
LVEPHHPYIY SPIHPAFSSS HLRLLLSRGI QGQTQASPQP GHHQDSKPSL LFLLEDLHLA
TSDPEKSCQP VLETLRQAMD GTVYAHSTLE LQTLQPTVNF LATVTVPGYC ERPLCPRLFR
LFTVLALESM TQATLLERHV PIIQAWLERF PSVERERALA RGLVRASVEA WEAVCNCFMP
SPLHPHYHFS LHSVSHLLSS LQLLPNRTGS RGFVDYPNHQ EHLRRVSGLR GTCLTVMMAT
RNVVRLWLHE AQRTFCDRLD SPRERSYCAK LLLVVAQSVF CCGPGPQHLG KDHQESEEEE
EEERVPEVES EGELAQWEDF SNSNSETEEE EEPYGLQVAR VSNSRDPSLT PSIGPVSRGM
KESISHKIRQ EKGTRASNYR LQVRRSFKTW WQKKPQMDLI SPLLLPVLLL HPQEKPSDLV
FSQELILGPN SETPNLYLER QWEKLEEQLA TSAAQLKLSP HLARCHSMAQ HVARLVRVLA
RPRQHGLLLS GALGTGRHTA ITLASSICQA HFFHLPSGSE EAILQCLRDA SWHAGMLSQP
VALLVPSGVD LTTLHRLLAL ATSGSFPGQY TEADLDRIGE HLPRENLGVK QNIKKEMVLQ
RFHQQVCSHL HLFFLIGDKQ AHKQLPSTLF LRLLQLATAS IDRYEPWDQA ALAKVAQHHL
EGAQSVPLDD GSWKYPDLQA SIPSVAKAMA LIHLSATHYH EHLCPALPLV TPKTFLDFLD
TFLMLQQQTI LKIKNKAQRV QNALENLRML IKEHGTHANL IFDLEQQLKD SGKSLSMFQQ
QLEQSKLLYK QQLEECRHQE NLIENLARQR DALQAQREAF LEQMSKAFLE PLSQLQVADF
EEIRSYRAPP ESVVRVTDAM CDLFHHETGW ASAKQLLCTE DFYQELVFFP KEKITDSELI
KLHLILKAPG MDDAALRAVS RPAASLAAWL WAVLHYGLAH CRGLPTDLLL QQVEATLTRE
QARLGYYQFQ AQETLEHNLA LAKMVEDAQA SHNCVAKTLS QAQCGQYHKW PMKAALLTPM
RAWTTQLQKL KGRCMTVFGD TLLCSAAIIY LGPFPPLRRQ ELLDEWLALC RGFQEALGPD
DVAQALKRKQ KSVSIPPKNP LLATHSPFSI LSLLSSESEQ YQWDGNLKPQ AKSAHLAGLL
LRSPTHYSSC RWPLLLDPSN EALIWLDPLP LEENRSFAPA LTEGRGKGLM RNQKRESKTD
MKEEDDESEE SNEAEDQTKE QKAEERKNEQ EKEQEENEEK EEEKTESQGS KPAYETQLPS
LPYLSVLSGA DPELGSQLQE AAACGLPVLL TNVELGLGCE ELQWLLQREQ LSPPQVQPGF
CLYLSTTLSL CAMEKVLGCE LLKGLNVLDL GLNMEILEEQ MLHEILCREY PELETRWQDL
KIRALDTCKA VEAAEERLLT MLLFQNPKRQ KPAKFLRNIV RAQGKLCQLR AHCEELEGQK
LQEMVLWAPY RPVVWHGMAM VKALSQLQNL LPLFCMSPEN WLAVTKQALD SMKPREINHG
EDLASHLLQL RAHLTRQLLG STVTALGLTQ VPLVGALGAL ALLQATGKAS ELERLALWPG
LAASPSTVHS KPVSDVARPA WLGPKAWHEC EMLELLPPFV GLCASLAGHS SAWQAYLSLS
STVLGPAPGP GPEPLSLLQK LILWRVLRPE CLAGALADFT TSLLGRPLDE NTYAPTMPFK
HSQATQPMLI LLPPPGHPSA TLHPLTVIQK LAAKYQQGQK QLQVIALGSE AWDPVSVVVS
TLSQAMYEGH WLVLDNCHLM PHWPKELLQL LLELLGRAKV VADLESEQLL DQPESRNVST
VHRDFRLWLI VPAESSASLP AVLTQHSMPV FWNQSLELGH VLIDSVELAQ QVLYMQPPTQ
ALPLLLLHGL LLHRQLYGTR LQAHRGRWSQ VTLTQVLQTQ DQLWASLSNP RAAMQELAAS
VFYGGPLGDT EDREALISLT QACLSPSSGS WVQPHTPQSL LATLMPLPEL RELDAMAECK
AQMHLLPSPP EPRLCGLSEG PQAWLLRRQS RALLSALQRS SPVWVPESRR GAQLAERRLR
QRLVQVNRRL ESLQDLLTHV IRQDESDAPW SVLGPNARRP LEGVLETEAL ELSQLVGTLQ
RDLDCLLQQL KGAPPCPSRR CAAVAHALWT GRLPLPWRPH APAGPQPPWH WLRQLSRRGQ
LLVRYLGVGA DASSDVPERV FHLSAFRHPR RLLLALRGEA ALDQNVPSSN FPGSRGSVSS
QLQYKRLEMN SNPLHFRVEN GPNPTVPERG LLLIGLQVLH AEWDPIAGAL QDSPSSQPSP
LPPVSISTQA PGTSDLPAPA DLTVYSCPVY MGGPLGTAKL QSRNIVMHLP LPTKLTPNTC
VQRRVHVCSP PLS