HMU_HALWD
ID HMU_HALWD Reviewed; 9159 AA.
AC Q18DN4;
DT 31-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 25-JUL-2006, sequence version 1.
DT 03-AUG-2022, entry version 89.
DE RecName: Full=Halomucin;
DE Flags: Precursor;
GN Name=hmu; OrderedLocusNames=HQ_1081A;
OS Haloquadratum walsbyi (strain DSM 16790 / HBSQ001).
OC Archaea; Euryarchaeota; Stenosarchaea group; Halobacteria; Haloferacales;
OC Haloferacaceae; Haloquadratum.
OX NCBI_TaxID=362976;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 16790 / HBSQ001;
RX PubMed=16820047; DOI=10.1186/1471-2164-7-169;
RA Bolhuis H., Palm P., Wende A., Falb M., Rampp M., Rodriguez-Valera F.,
RA Pfeiffer F., Oesterhelt D.;
RT "The genome of the square archaeon Haloquadratum walsbyi: life at the
RT limits of water activity.";
RL BMC Genomics 7:169-169(2006).
CC -!- FUNCTION: May protect the organism from desiccation stress. May also
CC contribute to the rigidity and maintenance of the unique square cell
CC morphology of H.walsbyi.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC -!- PTM: Probably glycosylated with sugar containing sialic acid. This may
CC further contribute to its overall negative charge, thereby creating an
CC aqueous shield covering the cells.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AM180088; CAJ51211.1; -; Genomic_DNA.
DR RefSeq; WP_011570378.1; NC_008212.1.
DR SMR; Q18DN4; -.
DR STRING; 362976.HQ_1081A; -.
DR PRIDE; Q18DN4; -.
DR EnsemblBacteria; CAJ51211; CAJ51211; HQ_1081A.
DR GeneID; 4194131; -.
DR KEGG; hwa:HQ_1081A; -.
DR eggNOG; arCOG03439; Archaea.
DR eggNOG; arCOG06233; Archaea.
DR eggNOG; arCOG07534; Archaea.
DR eggNOG; arCOG07873; Archaea.
DR eggNOG; arCOG10954; Archaea.
DR HOGENOM; CLU_222693_0_0_2; -.
DR OMA; APDNVYV; -.
DR Proteomes; UP000001975; Chromosome.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR Gene3D; 3.10.100.10; -; 2.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR013784; Carb-bd-like_fold.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR011493; GLUG.
DR InterPro; IPR006626; PbH1.
DR Pfam; PF07581; Glug; 1.
DR SMART; SM00112; CA; 1.
DR SMART; SM00034; CLECT; 2.
DR SMART; SM00710; PbH1; 13.
DR SUPFAM; SSF49313; SSF49313; 1.
DR SUPFAM; SSF49452; SSF49452; 2.
DR SUPFAM; SSF56436; SSF56436; 2.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 2.
DR PROSITE; PS50268; CADHERIN_2; 1.
PE 3: Inferred from homology;
KW Glycoprotein; Lectin; Reference proteome; Repeat; Secreted; Signal.
FT SIGNAL 1..30
FT /evidence="ECO:0000255"
FT CHAIN 31..9159
FT /note="Halomucin"
FT /id="PRO_0000259666"
FT DOMAIN 644..776
FT /note="C-type lectin 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00040"
FT DOMAIN 929..1060
FT /note="C-type lectin 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00040"
FT DOMAIN 7686..7793
FT /note="Cadherin"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00043"
FT REGION 1310..1351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1756..3380
FT /note="V-G-G-L motif-rich region"
FT REGION 3484..3514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4878..4912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 6570..6589
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 7047..7097
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 7660..7702
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 7888..7923
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 8212..8237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 8369..8614
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1310..1337
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3484..3499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4890..4912
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7047..7081
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7901..7922
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8377..8458
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8472..8535
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8560..8614
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 9159 AA; 927738 MW; 2170B9429C2A41E7 CRC64;
MSQTAKPIFA VVVALIVLIS GVAFIGSVSA QQPNLVQNGS FENSSNDFSS DNNYNTLVAD
STAITGWRVS SGEVDQVGSY WSPQDGSVSI DLSGSEPGVI EQNVTGLEAG KRYELTYYYS
GHRVQEGKYE AGVEIADLNI TETASSPGDW TLATHTFTAD STAETLTFTQ ITPSSGAKGM
AIDNVSIVES SDPIDTAAPT VSVNQPAGGA TLTTSDVALN ASANETGNWT YSVDGGPNQT
ATDANGTKTL NVTLSGLADG SHTATVYIKD DGGNIGIDTV SFTISSAPTV TTSSGGTNYT
ASTGGFVVDE NLTVTNPDGG TIDGATIAIG SGFNASVDTL AIDEAVAPNN SITSTNYNGT
TGVLTLDGTA SAAEMQTVLR TVTYAYSGET TASARDIDVS FALGTGGNNS STPTTAQITV
TVSSDTTAPS VTIDQPVEGS TLTTGDVAFD ASANKTGNWT YSVDGDPNQT ATGANGTQTL
NVTLSGLADG SHTATVYIKD DGGNVDTDTV SFTVSTASSL TTSSSKTDYT ASDGGFVVDD
SLTVTSPDDT LINGATVAIG SGFDASEDTL AVNETVASNN NITNTNYNSA TGVLTLDGTA
SADEMQTVLR TVTYTYSGDA TASARDLDVS FSLGTGGNIV FDPTTGNYYE LVSETVTWKT
AKDEAENRSH LGLQGYLATL TSERENDETA SRFTFDKAWI GASDASTEGD WKWVTGPERG
TLFWEGDESG SEQNGEYAGW ASTDPNAIQG ENYAFINQNL EWIDQKNGVN YNYLVEYGGL
NGSSSPTAQR TVTVDTEAPT LTTSSGSVTN TTASGGYLVD DKLTVTDPDG GGIDTATVSI
SQGFDPSADR LSVNTTLATN RGITSTYDDT TGVLNVSANS SATVTADDFQ AVLRTVTYNF
TGTSVTNESD RTVPIRFALD ANQEQVTAYD GHYYEYVSDS VSFDTARSEA ENRSHLGLEG
YLATLTSEQE DDTIHQQFDQ ESWIGASDAE TEGDWKWVTG PENGTLFWKD GSTQNGEYAG
WEGGEPNSQN LDENYAEINF DDYTGWNDQV DNQGYLVEYG GLTTDSAITA QRNVTIDTGA
PNVSNVTIRR IDGGGVVTTN DTIEVSATVT DVSSIQSVTA SAIAFDAGTV NLSDDGPNSS
AADDVYSATF TVGPDPIEQT QSVTVTATDD AGNGGTPPGD LVYDSITVNP EGTTDFRTVR
LPRSFENPVV IAKPLESTSS GNVAPDNRRG HTRIRNVQST SFEIRVEEWS VQDDEDHPAA
NVSYIVAEAG TTTLDDSTKV SAGTITLNQN DGFQSITFEE SLNSPVAFTQ PQTVNDPDAV
STRNNNVGSN GLDSKIEDDQ NNGADGNPHG TETLGYIAIE QGDSVLGETG FTAGTQGNVD
ETLTSINFGD SYPAGFVAAM QTTGGTEQSY LRYDDRTDTG VRVRIEEDPV DNSDRHNSET
VGYLAWNAST QVTGTRSNEL SVDTSAPQID DLSASFTTGS PPVSAGDEFR VTATVTDAGA
VSTVEADVAA LDADPGTITL DDLGNDTYEG TFEVGQNPDP TRAVAVTVTD SLGNNASATT
IAGSQASWDH EVNGTVQTGT FAGNGSAVDP YIIDSLVDLQ AINKNATTRA YNYRLGTDID
ASTTRSWNDG KGFAPVGAVN GRDIGEPFSG SLDGDGYTIS NLSVDQRSTD TNALGLVGKL
DSDGAVRNLT LANASVAGDD DIGAAVGKSA GTVRNVTVSG MVDGDQRIGG VVGTVTSTGT
VTNTTAVANT SGNQGVGGLI GESSGTVNNA SAGGAVTATG QYAGGLIGDH QSSTAVTDVN
ASGAVTGASY TGGLVGRGQA GVINASANGD VTSTTGEYVG GLVGQLQAQH GDEEVRNVTA
SGNVSSGGQY VGGLIGDVRD DGTNYVAMSE AHATGNVNTT YADESGFNNA YVGGLVGHFK
GSEFTDISAT GDVTSTTGNE VGGLVGRVAM EQERDIVANA SATGNVTTTG QRVGGLIGYH
RTGTILRNVT ATGDVSTDGQ RVGGLVGDTD ASITDASATG EVTTSTSGEY VGGLVGQLQA
QHGDEEVRNV TASGNVSSGG RYVGGLIGDV SDDGTHYVAM SEAHATGHVN TTYADESGFN
NAYVGGLVGH FEGSEFTDIS ATGDVTSTAG NEVGGLVGRV AMEQERDIVA NASATGNVTT
TGQRVGGLIG YHRTGTILRN VTATGDVSTD GQRVGGLVGD TDASITDASA TGEVTTSTTG
EYVGGLVGQL RAQHGDEEVR NVTASGNVSS GGQYVGGLMG YVDDNNNRGR YVAMSEAHAT
GNVNTTYADE SGFNNAYVGG LVGHFKGSEF TDISATGDVT STTGNEVGGL VGRVVMRNNG
IVANASATGN VTTTGQRVGG LIGYHRTGTI LRNVTATGDV STDGQRVGGL VGDTDASITD
ASATGEVTTS TTGEYVGGLV GQLRAQHGDE EVRNVTASGN VSSGGQYVGG LMGYVDDNNN
EGRYVAMSEA HATGNVNTTY DGGSNAYVGG LVGRFKGSEF TDISATGDVT STTGNEVGGL
VGRVAMRNNG IVANASATGN VTTTGQQIGG LIGYHQNGDG VETSYARGDV STDGNYVGGL
IGDTNDGSIT DSYARGDVNS SGDYVGGLVG EANGDVTRAY ASGRVEGDGT DIGGLVGTNN
GGTLSDAYWD RGATNQTAAT GSGTPPGATG YGTVGDTRAL EMQGRAPTQF MAALNYTSPW
KLTSTYPIFQ RESATSGSLP ATVDTIEAST ATVVQTQQMT VKLNATINGS RVGPGYLITV
SDSNGLAELE GQRALTDQNG TATFTFAEQS AGTFTPTFEA VSDLNVTATA TVEVNEGAAR
TYIREDGTEL TAVYSGNGTV DDPYEIDSLA DLQAINNNTA ARDNHYELVA NIDTSATNNT
SWNNGNGFEP ITEYTGSLDG NGRTISNLSI NRSGANGVGL VDTLGSEGTI QNLTIQNASI
TGDNDLGVAV GINNGRVENV TASGTVSGND RIGGLVGTVN SGGVITASTA AATTSGSKSI
GGLVGQNNGA VNNASASGTV TATGQYAGGL IGDHRSTISI VDTNASGAVT GTSSTGGLIG
RAQANVTDSS ASGRVNGTTG SNVGGLIGEH DPSQVATIAN VSASGDVSTS GGVRVGGLVG
SSISGSPVTM LNARAEGNVT TSDLSGTDAF TGGLIGEIQN AKNVTNVSAT GQVNASIAGN
QVVTGRVGGL IGSFEHESAS RIVANASATG DVATDTNGQV GGLIGTYSGG GTVEDSNARG
NVSATGQSVG GLLGSADASI IRRSSATGDV NSTGSNVGGL IGKHDPSQVA TIANVSASGN
VSTRNQGAGG LVGAIQTSPV SISDAGASGD VTAAVGSGRG YVGGLIGEIR NAKNVTNVSA
SGQVNVSANT GFNGEHVGGL IGVIDHESGA NIVANASATG DVNADAAGPT GGLIGGTEGT
LGTVQDSYAQ GNVSGTGPVG GLIGQTNGDT ARVYASGRVE GNSGLGGLIG DNSGQISESY
WDKGATAQSD ATGSGTPGGA TGYGSVGDTT PAPQMQGRAA TELMDGLHYT TTWNVTRGYP
VLQAQSNGTE QPPRTVDTVT ATNASAIQSE QVSVSVTVTT AGTDTGTGTA AGLIVSAQDT
GGLTSLNNAT AVTNETGTAT LTITESDTGT FSPTFTVVGY STATANATVT VSDGAVRTYV
REDGEELTAV YSGNGTADNP YLIDSLADLQ AIDNSSAARA DNYRLAEDID ASATSDASWN
DGSGFEPITP FTGSLDGNDT TISNLSINRS GASTAGLVGA LDSNGAIQNL TIQNASVTGS
DELGVVAGTN DGTIQNLTVE DASIDGVNDL GVAVGTNNGT VQNVTASGAV RGSDRIGGLV
GTVNSGGVVT DATTTATANG SQSIGGLVGE SNGAVNNASA SGTVTATDQY AGGLIGDHQS
TTPVTDVNAS GAVTGTSYIG GLLGRGQADV INASAGGSVN GTSGGYVGGL IGQAKSSSGN
PITVSNAHAS GVVNATNTGT DAYTGGLVGE FQNAGNVTDV SATGDVNASE PEGGNLVGGL
FGELHHDSAD HILTNASATG DVDATGQRVG GLIGYYKDGD GVETSYARGN VSTDGDYVGG
LIGDTNNDGS ITDSYARGDV TTSGDKVGGL VGETEDVDVT RSYASGRVKG GSTVGGLVGN
NTGQLSGAYW DKGATNQTAA TGSGSSGGAT GFGTVGDDTQ ALEMQGRAPT QFMSALDYTS
PWKLTSTYPV FQRESNTSGS LLAPVDTIEA TGTTAVQTQQ ITVEVNATIN GSPVGPGYLI
NVSDSNGLAE LDSKTALTDA NGTATFTFAG PTAGTFEPEF EAVSNSAASA TATVTVESGA
VRTFIREDGT ELTAVYRGNG TADSPYEVDS LADLQAIDNS TAAHGDQYTL VANIDASATD
ESSWNGGGGF EPIANATDEA FTGTLNGNGH TISNLSVDRS GANRAGLVGT LGSNGTIQNL
TIQNASITGN NDVGTAVGTS AGTIQNVTAS GTVNGNDRIG GVVGEVTADG LVTDSTASVN
TSGNQAIGGL VGQSSGAVNN ASASGEVTAT SKYAGGLIGD HQSTTPVTDV NASGTVSGTS
SAGGLIGRAQ ANVTDATATG DVVSTSGTKV GGLIGNHQAG QTIVDVSATG TVSSDGDDVG
GLVGFTRASV RNATASGEVE STTGANVGGL VGRLSHDDST HVTNVTATGT VSAGADHVGG
LVGFIEDSDD NGNITMSGAR ATGDIEMTSD DGGPASVGGL VGRFEHGSAI TDVSATGDVS
SVGGNEVGGL IGRLDQLAST DVLKNASATG DVTTTGQKVG GLIGFHRTGS VDNSHAQGNV
SAGSGDNVGG LIGVHGRNGG RVGDSYARGD VNTSGNNVGG LIGRSEGKVL RVYATGRVEG
GSAVGGLVGK NDGGQLSESY WDKGATDKSD ATGSDTPATV SGYGSVGDTT PAPQMQGRAA
TELMEGLNYT TTWNVTRGYP VLQAKSTGTE QLPRTVDTVT ATNASAIQSE QVSVSVTVTT
TDSNIREDFV ISVQDTGGLT SLENATAVTN DTGVATFNIT ESSPDTYTPT FGVAGYSTAT
VNATVTVDNG AVRTYTREDD TELTAVYRGN GTPDSPYLID TLADLQSINI DSGTRNEQYR
LAADIDASAT NESSWNGGRG FEPITDFTGS LNGDGYAISN LSIDRGNEAS VGLFGDTNAG
SSITNVTLVQ PAVTGGQGTG PLVGSHGGSI SRTVVTGGTA TTEADGEAHL GGLVGILVDD
AKITQSHTSA IVDANGHNEA GGFAGDIGSN ARVEQVSATG GVKNGGSEIG GLFGNASSGS
EIVEAFAAGN VSGTTNVGGL LGRQDGSAVT VDRAYWDEQS TGQTSSAGDT ETALPTVKMQ
GTAATEFMPG LNFTTTWNTT RDYPVLQVQT TGTEQPPRTV DTIDATNASL VQSEQVSVSI
TVTASEADSR NGFLISVQDA AGLTSLENAT AVTNETGTAA FNLTESDAGT FTPTFGVAGD
ASATTNATLT VKQGAVRTYT REDGTALTAV YTGNGTPESP YEIDSLADLQ AIDNSSTARN
NSYRLTADID ASATIESSWN DGSGFEPIAN ATDDDEAFTG SLDGDGHTIS NLSIDRSGAD
TVGLFGELDT AGVLENVTLQ NASVTGGNDV GLVAGTSNGT LRNVTTTGTV TGNNRVGGLI
GTTELNSVLT ESSAAVNTSG SQRVGGLVGT AGGIVNTSTA NGTVTATGQY AGGLIGESQG
TIPVTNTAAS GNVTGTADVG GLIGRTNTTV RNSSASGVVN STSGDGVGGL IGRSLAAVHD
SSASGAVSST GGNSVGGLIG NAGADVTNSA GRGNVISSTG NEVGGLVGRL EDSSNIRDSH
ARGTVTAAGS DVGGLAGTID TGNATRVFAT GQVEGNSAVG GLVGKNNGGT LSDVYWDKGA
TNQTNATGSA TPTGADGYGL VNDDTPAERM QGEAATAFMS ALNYTTTWNT TRGYPVLQAQ
ATGTEQPPRR VETIDATDAS SAQTEQVAIS ITVTAADSES TDGFVITVRE SDGLSGLEDA
TAVTDQDGTA TFSFSESNAN SYKPTFGVAG DTAVSTTANI TVKEGAVRTY TREDGTNLTG
TYIGSGTPDD PYLANSLTDL QVINKNATTR DEHYQLTDDI NASATIESWN GGNGFEPIAN
ATDEAFTGTL DGNGTTISNL SIDRSGADSV GLISEVGTSG VMENLTLQNA SVIGSNDVGV
VAGTSNGTIR NVTATGTVAG DNKIGGLVGA SNSSAVVADS STAVTTSGTR QIGGLVGQSS
GSVTNSSASG SVTASGGFAG GLVGDLDVTT PEELANVSAS GNVSSGGQYV GGLVGLGTST
SSSGNQLTIT EAQASGDVNT TYNGGNDAYV GGLAGKLSNV GNVTDVTATG DVSSTSGNDV
GGLAGELVHD STSFTLRNAT ATGDVTTTGQ RVGGLIGAHR NGQALTNATA TGNVSTDGNN
VGGLIGYQPD DKTVSDVTAT GDVSTEGNNA GGLIGQTRAS LSDATASGEV QSSTGDKIGG
LVGYLKLSAE KTTNVTATGN VSTDGNNVGG LIGHAEGVES TSQTTISTAR ASGDVTSTDG
NDIGGLIGNA SFQNDADTVT NASATGDVGT DSATGSSVGG LIGSQSSGQV RNSYARGNVT
TSGDNVGGLV GSAAGKQIVD SYARGNITTE GTNVGGIAGK LSGDVKRVYA SGQVTGDNKV
GGLVGSGGTL SDAYWDKGAS TQTNATGNKS GLGTGTPNGV AGYGSIGSDV PATEMQGQSP
SKFMSALNYT STWSLVNGYP QLRAETAPSF ASDTTPPTLT SVTSTDGTTI QLTIDGGVSG
IDTSSIDNTD FAVSNNSILS TDTNTSGTDT NTTQTVTLSL SSSVETRPVT VDITSQAGGI
TDRSGNKLSN ITTTLPGIDT VSVSDDTNDD GIVTTGDRVQ VTVESNTDTN TDTDTDTTLS
SVTVNATDYD DETATLSPNG TNVETGASIH SGIIIVGSTP TDGSDQSLTV NVTDDTGNAA
VRTIQVSGVT VDTKAPEITS VSRAGGDSVD VSIQSGLSGI EKSSISAADF TVEPGKVASV
NTSAVTDGSN QTQTVRIGLT DAIRGKTPTV TINSSSDGIL DKAGNGQGED STSSNESSDG
TESDQGDPED DIGGINTGIA PAVTLTTSDL DSSTLSTVDV TYNATGEVGS SDDIEIQLYN
DNNSTTTPIA TRTVGSVEGL TSLTVPSSAV GGGTFTGRAT LVNTSAGNTE LANKSKQIVA
YENVSTSVTA GSLGGKITVK HDFGSLDPAN AQIRVSPITI GSKFTTQTVT PAPAQKQGTV
TIPVPKQANS RFNVQTEVVD TSRNRQIQSS LGYGCVGSQR DPCSTVSETG TTVVDATTGN
SVGIEVNATV DHNRGAVEWY LHPTGSPEQK DISIVDGVDA STPLAVTIAV DDFDPVFMLG
TGNADGWEKT GVDKNTKEIS INVTPAEAYV EPDIRNPDPR EWPLSDHTAS KRYGAIVDMI
AVSMEGRVNP GYRNHLDGAF IGTNAQAFSV PKSSAGGSDS AGSLSITVAA PHYETDGTTV
NTGFYNARIP KSVLAAWGIS PGQVTATYKG ESVSGSSLTV EDKKGAIFVS MPVTYSSGTV
TVSGSQDSSD TTAPTVSNVS LDSDGTGNLT FTVDSNEQLG VATDNVSVSI DGPNTNDVYT
FNRDDLIQSG SGPYTYALDL DSAQPYDNGG GTYTATVDTA TDSAGNDGGG SGLTDSHDHT
VTGSTPTFVS SGTVTTPENP GGTLLDVNAI DGEGGLFDSG VKYTITGGAD SSSFTVDGKT
GAVSFADSVD FESPADADGD NAYELDVTAS TANANATQSI SVTITDVDEQ PTGQVNLTGG
DPGDRTISIV GNKTGYTFDA SKIVDPEGTG VSYTWDFGIG DTSTGSTVTR DYDPGSYLEE
TTVSPTLTVD DGSKQTVIDV SVTFYSDIDG DGLADDNEAT GVPTDNDDDN DGIPDDEDQE
PALGEMSSIS GTITDTNGSR VTQGDVTIIS SDGTHRESVS LDSNGKFDTT VPAGNYRLVV
ENTSAPVHER EDITVGPQTP TSQNLTLEGS GTVAGKFINP DEKSASNIPV QIASRDGGET
YYTKTDSTGE YEVAVEPGDY VVAPLGNDSG NASREVSVEL GETIQQSVTL DPQPVETAAS
LSIASGPGSV TADGHQMVVI PEVTDGLLQI QIANNSDPNR DISVVEDPSE LENFGVTNET
KFRIRVTVTN YTPHTLFWAL RDAEFNSKPN ATNPTATDII ITGSPVSLAT TSTQQKRVGP
LVSEDPSTVS WPSGAADTAD SQYNQTVYFS VYDLSTRPES LRDRLTGLIL STNAQRISLP
EVSNDRLRTW IAGPRYKTAS GTKYEGFYQA QIPQSQLEEW GVADHPTHQL FGEYKSSERN
LTVEDVDGGI SVDVSNISYS ASYVDIKADS TAPVPESALE DDSSNQDSGD DSSNQDSGDD
SSSQNDDGDN SSNQDSGDDS SSQNDDGDNS SNQDSGDDSS SQNDDGDNSS NQDSGDDSSS
QNDDGDDSSS QNDDGDNSSN QDSGDDSSSQ NDDGDNSSNQ DSGDDSSSQN DDGDNSSNQD
SGDDSSSQND DGDNKPNSAA AVGAESGSEM GGETGGESQA GGGDGSSGSS SDAAGGGSSG
GSSSGDSGGS SSGNSGGSSS GNSGGSSSGS GSSTGSMIAD ALTVITAPLR WFGSLSTAGK
AAVAATTGAG AAGAAYGLGG DRIQTPLNIA RRRFQSWLRR RIRGSSRSQI SKLLARLRRL
KWAKIRTQIA GVRKYFTRSY WRELIAKRRR LGSREGMKNW LKSKYRGNRK RKYRGWLRGR
LRSGVNWVAG GLLRGAAPAW LGVVSGPAAT AVSVITGEIR RWIEDKAMDR FDSTRKRYAK
LAIQSSAWIT TVETRLWQLL SGEDSPSSSR SLAAIAGESA SELNEVGVDS VDQLASADPE
QLASALEIDE SAVAEWVNRA GHASGSTERP AFIETRNGKR IQARYEQIAE IVQTGVSIPT
ISVGSVHSIT DIGGRQLARV TGWLQGDVSS ILSGAFERIQ SFSCRLLYRV SIWIYGPSGA
VESIDGIGPE YSDRLVQEGI TDVAVLSACS AERLSERINV SSSQTYRWIT QATAETPDTR
GLHQRLVAGV VRVESVFIAM KTKSSVQLES NRLREDHFSS QPLSEKEMNQ LAVVGITTVS
QLAAINPDRL GAGVGIDTKT AEEWVEMAQV YEMHLNNNS