ID HERC2_DROME Reviewed; 4912 AA.
AC Q9VR91; Q8MZ96; Q9N2R1;
DT 04-APR-2006, integrated into UniProtKB/Swiss-Prot.
DT 04-APR-2006, sequence version 3.
DT 03-AUG-2022, entry version 158.
DE RecName: Full=Probable E3 ubiquitin-protein ligase HERC2;
DE EC=2.3.2.26;
DE AltName: Full=HECT domain and RCC1-like domain-containing protein 2;
DE AltName: Full=HECT-type E3 ubiquitin transferase HERC2;
GN Name=HERC2; ORFNames=CG11734;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000269|PubMed:10731132};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000305}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [3] {ECO:0000305, ECO:0000312|EMBL:AAM29298.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM A).
RC STRAIN=Berkeley {ECO:0000269|PubMed:12537569};
RC TISSUE=Testis {ECO:0000269|PubMed:12537569};
RX PubMed=12537569; DOI=10.1186/gb-2002-3-12-research0080;
RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A.,
RA Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M.,
RA Celniker S.E.;
RT "A Drosophila full-length cDNA resource.";
RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).
RN [4] {ECO:0000305, ECO:0000312|EMBL:AAF61856.1}
RP NUCLEOTIDE SEQUENCE [MRNA] OF 4170-4912 (ISOFORM B).
RX PubMed=10720573; DOI=10.1101/gr.10.3.319;
RA Ji Y., Rebert N.A., Joslin J.M., Higgins M.J., Schultz R.A., Nicholls R.D.;
RT "Structure of the highly conserved HERC2 gene and of multiple partially
RT duplicated paralogs in human.";
RL Genome Res. 10:319-329(2000).
RN [5]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-1776, AND IDENTIFICATION BY
RP MASS SPECTROMETRY.
RC TISSUE=Embryo;
RX PubMed=18327897; DOI=10.1021/pr700696a;
RA Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.;
RT "Phosphoproteome analysis of Drosophila melanogaster embryos.";
RL J. Proteome Res. 7:1675-1682(2008).
CC -!- FUNCTION: Probable E3 ubiquitin-protein ligase which accepts ubiquitin
CC from an E2 ubiquitin-conjugating enzyme in the form of a thioester and
CC then directly transfers the ubiquitin to targeted substrates.
CC {ECO:0000250}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=S-ubiquitinyl-[E2 ubiquitin-conjugating enzyme]-L-cysteine +
CC [acceptor protein]-L-lysine = [E2 ubiquitin-conjugating enzyme]-L-
CC cysteine + N(6)-ubiquitinyl-[acceptor protein]-L-lysine.;
CC EC=2.3.2.26;
CC -!- PATHWAY: Protein modification; protein ubiquitination.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytoskeleton, microtubule organizing
CC center, centrosome, centriole {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=B {ECO:0000269|PubMed:10731132};
CC IsoId=Q9VR91-1; Sequence=Displayed;
CC Name=A {ECO:0000269|PubMed:12537569};
CC IsoId=Q9VR91-2; Sequence=VSP_051976, VSP_051977;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014298; AAF50913.3; -; Genomic_DNA.
DR EMBL; AY113293; AAM29298.1; -; mRNA.
DR EMBL; AF189221; AAF61856.1; -; mRNA.
DR RefSeq; NP_608388.2; NM_134544.3. [Q9VR91-1]
DR SMR; Q9VR91; -.
DR BioGRID; 59326; 5.
DR IntAct; Q9VR91; 20.
DR STRING; 7227.FBpp0290558; -.
DR iPTMnet; Q9VR91; -.
DR PaxDb; Q9VR91; -.
DR PRIDE; Q9VR91; -.
DR EnsemblMetazoa; FBtr0301344; FBpp0290558; FBgn0031107. [Q9VR91-1]
DR GeneID; 33035; -.
DR KEGG; dme:Dmel_CG11734; -.
DR UCSC; CG11734-RB; d. melanogaster. [Q9VR91-1]
DR CTD; 8924; -.
DR FlyBase; FBgn0031107; HERC2.
DR VEuPathDB; VectorBase:FBgn0031107; -.
DR eggNOG; KOG0939; Eukaryota.
DR eggNOG; KOG1426; Eukaryota.
DR GeneTree; ENSGT00940000154975; -.
DR HOGENOM; CLU_000101_0_0_1; -.
DR InParanoid; Q9VR91; -.
DR OMA; GPRFKCK; -.
DR PhylomeDB; Q9VR91; -.
DR Reactome; R-DME-983168; Antigen processing: Ubiquitination & Proteasome degradation.
DR SignaLink; Q9VR91; -.
DR UniPathway; UPA00143; -.
DR BioGRID-ORCS; 33035; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 33035; -.
DR PRO; PR:Q9VR91; -.
DR Proteomes; UP000000803; Chromosome X.
DR Bgee; FBgn0031107; Expressed in mouthpart and 20 other tissues.
DR ExpressionAtlas; Q9VR91; baseline and differential.
DR Genevisible; Q9VR91; DM.
DR GO; GO:0005814; C:centriole; IEA:UniProtKB-SubCell.
DR GO; GO:0005737; C:cytoplasm; ISS:FlyBase.
DR GO; GO:0016020; C:membrane; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; ISS:FlyBase.
DR GO; GO:0046872; F:metal ion binding; IEA:InterPro.
DR GO; GO:0061630; F:ubiquitin protein ligase activity; ISS:FlyBase.
DR GO; GO:0004842; F:ubiquitin-protein transferase activity; ISS:FlyBase.
DR GO; GO:0016567; P:protein ubiquitination; ISS:FlyBase.
DR GO; GO:0009966; P:regulation of signal transduction; IEA:UniProt.
DR CDD; cd08664; APC10-HERC2; 1.
DR CDD; cd00078; HECTc; 1.
DR Gene3D; 2.130.10.30; -; 3.
DR Gene3D; 2.30.30.30; -; 1.
DR InterPro; IPR004939; APC_su10/DOC_dom.
DR InterPro; IPR021097; CPH_domain.
DR InterPro; IPR001199; Cyt_B5-like_heme/steroid-bd.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR000569; HECT_dom.
DR InterPro; IPR035983; Hect_E3_ubiquitin_ligase.
DR InterPro; IPR037976; HERC2_APC10.
DR InterPro; IPR010606; Mib_Herc2.
DR InterPro; IPR037252; Mib_Herc2_sf.
DR InterPro; IPR009091; RCC1/BLIP-II.
DR InterPro; IPR000408; Reg_chr_condens.
DR InterPro; IPR014722; Rib_L2_dom2.
DR InterPro; IPR015940; UBA.
DR InterPro; IPR009060; UBA-like_sf.
DR Pfam; PF11515; Cul7; 1.
DR Pfam; PF00173; Cyt-b5; 1.
DR Pfam; PF00632; HECT; 1.
DR Pfam; PF06701; MIB_HERC2; 1.
DR Pfam; PF00415; RCC1; 15.
DR PRINTS; PR00633; RCCNDNSATION.
DR SMART; SM01337; APC10; 1.
DR SMART; SM00119; HECTc; 1.
DR SUPFAM; SSF159034; SSF159034; 1.
DR SUPFAM; SSF46934; SSF46934; 1.
DR SUPFAM; SSF49785; SSF49785; 1.
DR SUPFAM; SSF50985; SSF50985; 3.
DR SUPFAM; SSF56204; SSF56204; 1.
DR PROSITE; PS51284; DOC; 1.
DR PROSITE; PS50237; HECT; 1.
DR PROSITE; PS51416; MIB_HERC2; 1.
DR PROSITE; PS00626; RCC1_2; 1.
DR PROSITE; PS50012; RCC1_3; 18.
DR PROSITE; PS50030; UBA; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Cytoplasm; Cytoskeleton; Phosphoprotein;
KW Reference proteome; Repeat; Transferase; Ubl conjugation pathway.
FT CHAIN 1..4912
FT /note="Probable E3 ubiquitin-protein ligase HERC2"
FT /id="PRO_0000229741"
FT REPEAT 634..685
FT /note="RCC1 1"
FT /evidence="ECO:0000255"
FT REPEAT 686..739
FT /note="RCC1 2"
FT /evidence="ECO:0000255"
FT REPEAT 741..789
FT /note="RCC1 3"
FT /evidence="ECO:0000255"
FT REPEAT 791..843
FT /note="RCC1 4"
FT /evidence="ECO:0000255"
FT REPEAT 844..897
FT /note="RCC1 5"
FT /evidence="ECO:0000255"
FT DOMAIN 1917..1990
FT /note="MIB/HERC2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00749"
FT DOMAIN 2511..2557
FT /note="UBA"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00212"
FT DOMAIN 2780..2958
FT /note="DOC"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00614"
FT REPEAT 2985..3036
FT /note="RCC1 6"
FT /evidence="ECO:0000255"
FT REPEAT 3037..3090
FT /note="RCC1 7"
FT /evidence="ECO:0000255"
FT REPEAT 3091..3142
FT /note="RCC1 8"
FT /evidence="ECO:0000255"
FT REPEAT 3144..3194
FT /note="RCC1 9"
FT /evidence="ECO:0000255"
FT REPEAT 3197..3248
FT /note="RCC1 10"
FT /evidence="ECO:0000255"
FT REPEAT 3250..3300
FT /note="RCC1 11"
FT /evidence="ECO:0000255"
FT REPEAT 3302..3352
FT /note="RCC1 12"
FT /evidence="ECO:0000255"
FT REPEAT 4049..4099
FT /note="RCC1 13"
FT /evidence="ECO:0000255"
FT REPEAT 4101..4153
FT /note="RCC1 14"
FT /evidence="ECO:0000255"
FT REPEAT 4155..4205
FT /note="RCC1 15"
FT /evidence="ECO:0000255"
FT REPEAT 4207..4259
FT /note="RCC1 16"
FT /evidence="ECO:0000255"
FT REPEAT 4261..4311
FT /note="RCC1 17"
FT /evidence="ECO:0000255"
FT REPEAT 4313..4363
FT /note="RCC1 18"
FT /evidence="ECO:0000255"
FT REPEAT 4365..4415
FT /note="RCC1 19"
FT /evidence="ECO:0000255"
FT DOMAIN 4547..4882
FT /note="HECT"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00104"
FT REGION 1..67
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1102..1129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1428..1475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1659..1681
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1994..2018
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2381..2412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2572..2620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3352..3374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3953..4000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4891..4912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..27
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1108..1123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1665..1681
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2587..2608
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3953..3993
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 4850
FT /note="Glycyl thioester intermediate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00104"
FT MOD_RES 1776
FT /note="Phosphothreonine"
FT /evidence="ECO:0000269|PubMed:18327897"
FT VAR_SEQ 635..636
FT /note="HN -> RK (in isoform A)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_051976"
FT VAR_SEQ 637..4912
FT /note="Missing (in isoform A)"
FT /evidence="ECO:0000303|PubMed:12537569"
FT /id="VSP_051977"
FT CONFLICT 4388
FT /note="R -> W (in Ref. 4; AAF61856)"
FT /evidence="ECO:0000305"
FT CONFLICT 4393
FT /note="A -> T (in Ref. 4; AAF61856)"
FT /evidence="ECO:0000305"
FT CONFLICT 4481
FT /note="Y -> C (in Ref. 4; AAF61856)"
FT /evidence="ECO:0000305"
FT CONFLICT 4575
FT /note="I -> T (in Ref. 4; AAF61856)"
FT /evidence="ECO:0000305"
FT CONFLICT 4652
FT /note="T -> M (in Ref. 4; AAF61856)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 4912 AA; 529997 MW; 39F45CEBEB6350DF CRC64;
MFNRQASGGA GSSGQGAGSS QTASAAPVSA GVGVGGGGGA SGAAAGAGSA AGSGSGSGSG
SAAPPSHSPD LYLRPLPFLD AKWLRADLIA SLRNAADGAV LWNHLIQDCE LVSSPTAPLL
NARGQLSYLG DDGKHYCGAQ QLKCSCCQPE YCGPLSACNC DGCRPLDSDT AIKKLTTQAA
AQQAAHLSQA TDMVLSGWLW SQPPSQQARL DCQRSMISEL QELALRAAGN CLSAQHLRQQ
LFIYERYFVA LKRERERERK STTSAQAVAS VTTANTTIAA TTANAAVESQ PAEQQAEHGA
LGLARVATHA ALNFSFAFLR RAWRSGEDTE MCSELLSEAL ASLQELPEAT LFEAAVVSSL
WLEVMERSIK FLRLVALGDP MGNRCTAPLN DRHTALCLLL ELGVQKGTLA ATLECVVLLL
LLWEKDRAGN DNRDMPRKTG APLQRILLRY QKIGATAKGG GIAGGEPAPV ASATETFLRF
LSLPKSSAAI VDLRRAAVVI ISHLDRMVQP LMPRCLLRGG QATAASSSSA PQQRIYALGW
PSLSKDQQGF SAEPTLSWSG ESAGVPTLSC RFQIKQVACC ETQMLILSQE GKLYTWRLAK
PEAEPLPMEE VAHDVFISIA GHCEGRHFLA IDSNHNAYSW GTGEDHRLGH GDTHARAVPT
KIAALEQHCV QSVYCGCSYS AAITCGGNLL TWGRGTYARL GHGNSDDRSL PTLVVALSDH
MVVDVALGSG DAHSLALTSE GLVFAWGDGD YGKLGNGNCN GSLQPILVES LPRVQRVFAG
SQFSVALSSE GQLYTWGKAT CLGHQLVERS VQGCSVPRLV SSLQHKRIVD VAVSVAHCLA
LSSSGEVFGW GRNDSQQICP ASVSSEPLLR TPILVSLPTF PASGIACTSA QSLVWTQSSH
QGVPRRAPFV VDLGEPTFRL LDQLLGMCSS QDNRQTPNQE SECIAVACLN LFRLQLLALI
ANGVEPRQVG LASGSRLLCS LKTRILGLAG GSQVLRTIQV AAQQALQDGW SVLLPTAAER
AQTLTSLLPS EPGQASSAGH RFMTDLLVSS LMAEGGLESA LQHAIRLESS SCSADCGDGV
HLPLLQLITR LLSNNAALSQ TRLSPTLGKQ QEKEEEEREQ QHQEPSTSPS LSLLHRFQRL
LLSHIHQAQP EEETTGAEAL LLQYMHALIP ACVATLQKAH ELALQCREPG ENSFGQMVGH
LGRVLQADIS DALLNELLVG LLLLKRDRPQ CLGGLRWARL FLPLLRVLDR LNRALGEGEL
RDGDDMGWPG IICRGGPKGG PPPADPETHY VRRADMENLL LDGSRCIILA GYVCDLSGYN
CESETLRSVL DSGLGKDLTA EMSSQVHRTA MEHILEHHKL GKYMVQASTE EKSSAPGPSR
LTHFSSECAL AQLLGLVANL MCSGPALQPA ELQCRQLDKS SLLSGGLQLL QPSNPFDEEK
GEARSSHSCH STAGNTPTEL PPPLPMQQQL APGRGKSQLQ LRADAFISGL AEARVSEPPV
AAWLALTERY CKAHNLMWHQ EFATEHPVQE LERLLSAVLI RHQYLGGLVL NALETEVPPP
RQLGEIIRLV HQAKWSVVRT RQQLNRSYKE VCAPILERLR FLLYEVRPAI SPQQRGLRRL
PILQRPPRFK MLVRRLLQEL RSSRQPAKPE DLLNASIQQE QEKKPQSMPK QELEEQEEEE
TLLRRLNERQ IHGEGELDPA LMQDIVDFAL QDSCDVETAR RAMYCQMQRY QLRLAGLQIV
QQLLELHGLL DAAQYSLLNG FLGLHLKSTS SGGGSTGSGS SLHVLGQLNM ISAYQKARLL
LAQSRVLDWA VRELRRLVNQ EQQGHARGKD STNLGTYVLL KRLPRARFLL SVFGLLAKEL
GPNELGFLIN SGLLGTVLGL LAQTGGEGAT GGQVHGELSI LYEDSVLKQK SSKAQLSGPD
LAKLMKIGTR IVRGADWKWG DQDGNPPGEG RIISEVGEDG WVRVEWYTGA TNSYRMGKEG
QYDLQLADSA LNVASPTEPE REDVSGSEAS PTSDSHPSKL LRHCAAKLLQ ILAVGTGLHG
AQLDKDALRG MTSMFRTIIY PKPSMSNISY ALGWLNLGFI RAISGDCPRL CLELSTPGWL
SHYLSLLEQP AGNEAGVYRQ LHCLRLLQLI LAQWGAEEEP RMPALVHQLF ATLGRIALHC
PGDASLLPTA EGKARVLLTA SHSGSVAEEL VALLRRLHTL PSWNPVINSF LAQKLCVAAE
LLAEQSHSTA LDSEQVFVLG VLGAMGGHDL RPRVGLHCFH EGSHMVIASF TPKGRCLLAP
GGVGSGVGFV KVQLPAVMPH LDHTVFSLSR LPMNEMLLNA WTVLLYGPAP ELRELPSSAD
GRLDLALLRA QQLQMAVLHT NGVLYRHQVA LRRILKQRAP GSIYASPDEP DRSDAESQQP
GEQDQQLSSG SGQEPQLLIQ CILLRATQAS PVKACYSYMD LATAALNCIQ SLATQAHQEL
SEGGGVPPNG RALSSPPQPT MVHGVPVYNV ARKEQKPSEQ VEQKSKWPAA ATDAQLIGQI
MEMGFTRRTV ELALKQLSLQ AEIMPTPEQI VQWILEHPDV CANTIEEDTL PLASSASSHD
PEADSDNECP SSNSTTSSST SSDTVEGQPM AVSGPAPPVK FESRKDFQTA DLYALYVRGL
VRPGMTVRCC RDFEEIKQGD MGTVLIVDTE GLHDLNVQVD WRNHGSTYWV CFVHIELVEA
AQTHHQPRPP PIAVGARVRL RTSSLRYGML CPLRLGRSQG SSAIGVVSSV RSKQLTVDFP
DQPAWQGHIN EVELVASQPT SATLPSLGDS CSQMPPSDLI EDWSRCIRSL TVSSNEAAAK
HLLNGSNQPW QSCSSGPCRH WIRLELHDRI LVHSLTLKVS PEDHSHMPSL LEIRVGDCVD
SLKEYTWIPV PAGASRVLLM QQVPTYYPWV EVVVKQCQNN GIQCKIHGIK FVGRRQQPDL
QHILANAQFL ASEYSAGVGP GSTAGAGAVS TSHEEAAAAP EQDLPCTVMV WGLNDKEQLG
GLKGSKVKVP TFSQTISRLR PIHIAGGSKS LFIVSQDGKV YACGEGTNGR LGLGVTHNVP
LPHQLPVLRQ YVVKKVAVHS GGKHALALTL DGKVFSWGEG EDGKLGHGNR TTLDKPRLVE
ALRAKKIRDV ACGSSHSAAI SSQGELYTWG LGEYGRLGHG DNTTQLKPKL VTALAGRRVV
QVACGSRDAQ TLALTEDGAV FSWGDGDFGK LGRGGSEGSD TPHEIERLSG IGVVQIECGA
QFSLALTRAG EVWTWGKGDY YRLGHGGDQH VRKPQPIGGL RGRRVIHVAV GALHCLAVTD
AGQVYAWGDN DHGQQGSGNT FVNKKPALVI GLDAVFVNRV ACGSSHSIAW GLPNASSDEE
KRGPVPFSST RDPLGGSSLG IYEAETMQTL KQEAKPLNQS SLSESLALET PAARQAALGH
VLRAMSILQA RQLIVAALTS HSKVNFKERG AVGGEEDHLI GGPIMGAPLQ LAETIAQGGG
EAPADATDAG LQEHSPEAAV DALTGGMSGG ANTLPPLSAG PLSAFQSLTG SLSMSGSLSS
SALPQHKHSR MSASAMSVMA ATMTQQEEML SHISHCHGLD DFGGLLGEPE AKSLVELLKL
AVCGRCGPPS TSQTIADTLI SLGAGTPAVA AMLLETCITE LEDLCTSRHC LGKLPKPVMQ
ESSHPYVDNV NVTGVVRIPG AEMLRLEFDS QCSTEKRNDP LVIMDGTGRV LAMRSGREFA
HWAPEIRVLG DELRWKFSSD SSVNGWGWRF WVHAIMPAAT LGESGSDRAV LSQPSMALVM
SLLDSRLAPR QPSVLLRLAS ALAACSQLGA LTTAQRIWSL RKLHAVLLLE QAPRPQDPSL
STLLQPLIPE LLRQYEYEEP QVRGGIHLMH SDYFKTLAAL ACDMQLDAAL PATSSSSSSS
AAPGAISSTG DVHKWAWFKR YCIAVRVAQS LIRRTELPRA FCLEVRKKFA EMLPSSSSNS
NANPGCQSPG ASMLNSTTSL SSSTVSNVSP PPGITGEQPD LHCHAHQLES TSTLLHEDHT
LFQAPHDAQL LQWLNRRPDD WALSWGGAST IYGWGHNHRG QLGGLEGSRI KTPTPCEALS
LLRPVQLAGG EQSLFAVTPD GKLFATGYGS GGRLGVGGSD SWAIPTLLGS LQHVFVKKVA
VNSGGKHCLA LTTEGEVYAW GEGEDGKLGH GNRMSYDRPK LVEHLNGMSV ADIACGSAHS
AAITASGHVL TWGKGRYGRL GHGDSEDQLR PKLVEALLGY RAIDIACGSG DAQTLCITDD
DNVWSWGDGD YGKLGRGGSD GCKLPYKIES LAGLGVVKVE CGSQFSVALT KSGAVYTWGK
GDFHRLGHGS VDHVRRPKKV AALQGKKIIS IATGSLHCVA CSDSGEVYTW GDNDEGQLGD
GTVTAIQRPR LVAALQGKHI VKVTCGSAHT LALSTSQLSE RLRPLPNPPL EYDLVRDLAP
EALHARLILL HHFSELVCPC LAMLPISGDL SLGALKDVLV YNIKEAAFRK VIQTTMVRDK
QHGPVIELNR IQVKRSRNRC NGLAGIDGMK SVFGQMVQKL PLLTQEALAL PHRVWKVKFV
GESVDDCGGG YSESIAEMCD ELQNGSVPLL INTPNGRGEA GANRDCFLLD PTLSSVLQMN
MFRFLGVLMG IAVRTGSPLS INLAEPVWRQ LTGEVLRPTD LTEVDRDYVA GLLCIRNMDD
DPKLFTALEL PFSTSSARGH EVPLSTRYTH ISPRNRAEYV RLALGFRLHE FDEQVKAVRD
GMSKVIPVPL LSLFSAAELQ AMVCGSPDIP LGLLKSVATY KGFDPSSALV TWFWEVMEEF
TNQERSLFLR FVWGRTRLPR TIADFRGRDF VLQVLEKNPP DHFLPESYTC FFLLKMPRYS
CKAVLLEKLK YAIHFCKSID TDEYARVAMG EPTEATGSED NSDLESVASH EG