ZCH18_MOUSE
ID ZCH18_MOUSE Reviewed; 948 AA.
AC Q0P678; Q3TKW3; Q52KI3; Q6KAL8; Q9CU64;
DT 13-NOV-2007, integrated into UniProtKB/Swiss-Prot.
DT 19-SEP-2006, sequence version 1.
DT 03-AUG-2022, entry version 111.
DE RecName: Full=Zinc finger CCCH domain-containing protein 18;
DE AltName: Full=Nuclear protein NHN1;
GN Name=Zc3h18; Synonyms=Nhn1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC STRAIN=C57BL/6J; TISSUE=Thymus;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=C57BL/6J; TISSUE=Brain, and Eye;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 225-948 (ISOFORM 1).
RC TISSUE=Spleen;
RX PubMed=15449545; DOI=10.1093/dnares/11.2.127;
RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S.,
RA Saga Y., Kitamura H., Nakagawa T., Nagase T., Ohara O., Koga H.;
RT "Prediction of the coding sequences of mouse homologues of FLJ genes: the
RT complete nucleotide sequences of 110 mouse FLJ-homologous cDNAs identified
RT by screening of terminal sequences of cDNA clones randomly sampled from
RT size-fractionated libraries.";
RL DNA Res. 11:127-135(2004).
RN [4]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-45; SER-92 AND SER-530, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Liver;
RX PubMed=17242355; DOI=10.1073/pnas.0609836104;
RA Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.;
RT "Large-scale phosphorylation analysis of mouse liver.";
RL Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007).
RN [5]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-530; SER-847 AND SER-863, AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19144319; DOI=10.1016/j.immuni.2008.11.006;
RA Trost M., English L., Lemieux S., Courcelles M., Desjardins M.,
RA Thibault P.;
RT "The phagosomal proteome in interferon-gamma-activated macrophages.";
RL Immunity 30:143-154(2009).
RN [6]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-530, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Embryonic fibroblast;
RX PubMed=19131326; DOI=10.1074/mcp.m800451-mcp200;
RA Sweet S.M., Bailey C.M., Cunningham D.L., Heath J.K., Cooper H.J.;
RT "Large scale localization of protein phosphorylation by use of electron
RT capture dissociation mass spectrometry.";
RL Mol. Cell. Proteomics 8:904-912(2009).
RN [7]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-45; SER-92; THR-105; SER-106;
RP SER-114 AND SER-530, AND IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE
RP ANALYSIS].
RC TISSUE=Brain, Brown adipose tissue, Heart, Kidney, Liver, Lung, Pancreas,
RC Spleen, and Testis;
RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001;
RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R.,
RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.;
RT "A tissue-specific atlas of mouse protein phosphorylation and expression.";
RL Cell 143:1174-1189(2010).
RN [8]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-810, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Embryonic fibroblast;
RX PubMed=23806337; DOI=10.1016/j.molcel.2013.06.001;
RA Park J., Chen Y., Tishkoff D.X., Peng C., Tan M., Dai L., Xie Z., Zhang Y.,
RA Zwaans B.M., Skinner M.E., Lombard D.B., Zhao Y.;
RT "SIRT5-mediated lysine desuccinylation impacts diverse metabolic
RT pathways.";
RL Mol. Cell 50:919-930(2013).
RN [9]
RP METHYLATION [LARGE SCALE ANALYSIS] AT ARG-245 (ISOFORM 2), AND
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Embryo;
RX PubMed=24129315; DOI=10.1074/mcp.o113.027870;
RA Guo A., Gu H., Zhou J., Mulhern D., Wang Y., Lee K.A., Yang V., Aguiar M.,
RA Kornhauser J., Jia X., Ren J., Beausoleil S.A., Silva J.C., Vemulapalli V.,
RA Bedford M.T., Comb M.J.;
RT "Immunoaffinity enrichment and mass spectrometry analysis of protein
RT methylation.";
RL Mol. Cell. Proteomics 13:372-387(2014).
CC -!- SUBUNIT: Interacts with ZFC3H1 in a RNase-insensitive manner.
CC {ECO:0000250|UniProtKB:Q86VM9}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q0P678-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q0P678-2; Sequence=VSP_029456, VSP_029457;
CC -!- SEQUENCE CAUTION:
CC Sequence=BAD21439.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK017933; BAB31010.3; -; mRNA.
DR EMBL; AK166802; BAE39030.1; -; mRNA.
DR EMBL; BC030495; AAH30495.1; -; mRNA.
DR EMBL; BC094330; AAH94330.1; -; mRNA.
DR EMBL; AK131189; BAD21439.1; ALT_INIT; mRNA.
DR CCDS; CCDS22736.1; -. [Q0P678-1]
DR RefSeq; NP_001025164.1; NM_001029993.1.
DR RefSeq; NP_001025165.1; NM_001029994.1. [Q0P678-1]
DR RefSeq; NP_001297579.1; NM_001310650.1.
DR AlphaFoldDB; Q0P678; -.
DR BioGRID; 217907; 2.
DR IntAct; Q0P678; 4.
DR MINT; Q0P678; -.
DR STRING; 10090.ENSMUSP00000017622; -.
DR iPTMnet; Q0P678; -.
DR PhosphoSitePlus; Q0P678; -.
DR EPD; Q0P678; -.
DR jPOST; Q0P678; -.
DR MaxQB; Q0P678; -.
DR PaxDb; Q0P678; -.
DR PeptideAtlas; Q0P678; -.
DR PRIDE; Q0P678; -.
DR ProteomicsDB; 298507; -. [Q0P678-1]
DR ProteomicsDB; 298508; -. [Q0P678-2]
DR Antibodypedia; 30727; 29 antibodies from 12 providers.
DR DNASU; 76014; -.
DR Ensembl; ENSMUST00000093073; ENSMUSP00000090761; ENSMUSG00000017478. [Q0P678-1]
DR GeneID; 76014; -.
DR KEGG; mmu:76014; -.
DR UCSC; uc009nsl.1; mouse. [Q0P678-2]
DR UCSC; uc009nsn.1; mouse. [Q0P678-1]
DR CTD; 124245; -.
DR MGI; MGI:1923264; Zc3h18.
DR VEuPathDB; HostDB:ENSMUSG00000017478; -.
DR eggNOG; ENOG502R7WP; Eukaryota.
DR GeneTree; ENSGT00730000111190; -.
DR HOGENOM; CLU_012901_0_0_1; -.
DR InParanoid; Q0P678; -.
DR OrthoDB; 618568at2759; -.
DR PhylomeDB; Q0P678; -.
DR BioGRID-ORCS; 76014; 18 hits in 72 CRISPR screens.
DR ChiTaRS; Zc3h18; mouse.
DR PRO; PR:Q0P678; -.
DR Proteomes; UP000000589; Chromosome 8.
DR RNAct; Q0P678; protein.
DR Bgee; ENSMUSG00000017478; Expressed in cranial cartilage and 112 other tissues.
DR ExpressionAtlas; Q0P678; baseline and differential.
DR Genevisible; Q0P678; MM.
DR GO; GO:0016607; C:nuclear speck; ISO:MGI.
DR GO; GO:0032991; C:protein-containing complex; ISO:MGI.
DR GO; GO:1990904; C:ribonucleoprotein complex; ISO:MGI.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0140262; F:mRNA cap binding complex binding; ISO:MGI.
DR GO; GO:0030674; F:protein-macromolecule adaptor activity; ISO:MGI.
DR GO; GO:0050779; P:RNA destabilization; ISO:MGI.
DR InterPro; IPR041367; Znf-CCCH_4.
DR InterPro; IPR000571; Znf_CCCH.
DR InterPro; IPR036855; Znf_CCCH_sf.
DR Pfam; PF18044; zf-CCCH_4; 1.
DR SMART; SM00356; ZnF_C3H1; 1.
DR SUPFAM; SSF90229; SSF90229; 1.
DR PROSITE; PS50103; ZF_C3H1; 1.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Coiled coil; Isopeptide bond;
KW Metal-binding; Methylation; Nucleus; Phosphoprotein; Reference proteome;
KW Ubl conjugation; Zinc; Zinc-finger.
FT CHAIN 1..948
FT /note="Zinc finger CCCH domain-containing protein 18"
FT /id="PRO_0000311243"
FT ZN_FING 215..241
FT /note="C3H1-type"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00723"
FT REGION 1..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 272..296
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..922
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 395..460
FT /evidence="ECO:0000255"
FT COILED 916..945
FT /evidence="ECO:0000255"
FT COMPBIAS 1..84
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 107..121
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..137
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..189
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..205
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..294
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 392..483
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 502..517
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 518..540
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..602
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 603..621
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 635..649
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 676..723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 747..773
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 815..840
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..863
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 888..917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 1
FT /note="N-acetylmethionine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 6
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 33
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 45
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:17242355,
FT ECO:0007744|PubMed:21183079"
FT MOD_RES 58
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q6TQE1"
FT MOD_RES 64
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 71
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 75
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 80
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 92
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:17242355,
FT ECO:0007744|PubMed:21183079"
FT MOD_RES 105
FT /note="Phosphothreonine"
FT /evidence="ECO:0007744|PubMed:21183079"
FT MOD_RES 106
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:21183079"
FT MOD_RES 114
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:21183079"
FT MOD_RES 169
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 483
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 528
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 530
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:17242355,
FT ECO:0007744|PubMed:19131326, ECO:0007744|PubMed:19144319,
FT ECO:0007744|PubMed:21183079"
FT MOD_RES 532
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 810
FT /note="N6-acetyllysine"
FT /evidence="ECO:0007744|PubMed:23806337"
FT MOD_RES 838
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 847
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:19144319"
FT MOD_RES 863
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:19144319"
FT MOD_RES 888
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT MOD_RES 891
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT CROSSLNK 506
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT CROSSLNK 618
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT CROSSLNK 657
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT CROSSLNK 813
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT CROSSLNK 903
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q86VM9"
FT VAR_SEQ 226..317
FT /note="GNCTWGMSCRFIHPGVNDKGNYSLITKAEPFPPNGAPPLGPHPLMPANPWGG
FT PVVDEILPPPPPEPPTESAWERGLRHAKEVLKKATIRKEQ -> DVVTPLSTLLPPIPN
FT SIPLRGQVKGTKYPVFTLCCALKSPAPEARLLSPGSRAQLGRQQGGGLRDRTAVALFSL
FT VSLELGRRCRLRPWHCT (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_029456"
FT VAR_SEQ 318..948
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_029457"
FT MOD_RES Q0P678-2:245
FT /note="Omega-N-methylarginine"
FT /evidence="ECO:0007744|PubMed:24129315"
SQ SEQUENCE 948 AA; 105694 MW; A56716FF866C847B CRC64;
MDVAESPELD PHSPEDEEQP ALSDDDILRE SGSEQDLDGA GERASDLEEE ENATRVQSQE
ETRSDEEDRA SEPKSQDQDS EAHELSRGPA GSPCEEGDDV EEDGTSDLRD EASSVTRELD
EHELDYDEEV PEEPAPAAQE EEAEKAGAEE EEEKGEGAPG EEGKPDVQSV GEQEPTEAAK
EKKKEDDDGE IDDGEIDDDD LEEGEVKDPS DRKVRPRPTC RFFMKGNCTW GMSCRFIHPG
VNDKGNYSLI TKAEPFPPNG APPLGPHPLM PANPWGGPVV DEILPPPPPE PPTESAWERG
LRHAKEVLKK ATIRKEQEPD FEEKRFTVTI GEDDREFDKE NEVFRDWNSR VPRDVRDTTL
EPYADPYYDY EIERFWRGGQ YENFRVQYTE AEPYHNYRER ERERERENRQ RERERERERD
RERERRQRER ERERERERDK ERQRRKEEWE RERAKRDEKD RQHRDRDRDK DREKDKEKPK
PRSPQPPSRQ AEPPKKESTS VGPQVKRADE WKDPWRRSKS PKKKLGVSVS PSRARRRRKT
SASSASASNS SRSSSRSSSY SGSGSSRSRS RSSSYSSYSS RSSRHSSFSG SRSRSRSFSS
SPSPSPTPSP HRPPVRTKGE PAPPPGKAGE KSIKKPAPPP APPQATKTTA PGPEPAKPGD
LREARRKERQ TRTPPRRRTL SGSGSGSGSS YSGSSSRSRS LSVSSVSSVS SATSSSSSVH
SVDSDDMYAD LASPVSSASS RSPTPAQTKK ERGKSKKEDG VREEKRRRDP SAQPPKSSKA
PAGGKASQQA AAPQPAVPGQ PQQGSFVAHK EIKLTLLNKA ADKGSRKRYE PSDKDRQSPP
AKKANLSPDR GSRDRKSGGR MGSPKPERQR GQNAKAPAAP ADRKRPLSPQ SKGSSKVTSV
PGKATDTATA GTKSGKASTL SRREELLKQL KAVEDAIARK RAKIPGKV