ID ZFHX4_CHICK Reviewed; 3573 AA.
AC O73590;
DT 20-FEB-2007, integrated into UniProtKB/Swiss-Prot.
DT 20-FEB-2007, sequence version 2.
DT 03-AUG-2022, entry version 114.
DE RecName: Full=Zinc finger homeobox protein 4;
DE AltName: Full=Zinc finger homeodomain protein 4;
DE Short=ZFH-4;
DE AltName: Full=Zinc finger/apterous-related homeobox protein;
GN Name=ZFHX4; Synonyms=ZAX;
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Phasianinae; Gallus.
OX NCBI_TaxID=9031;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Red jungle fowl;
RX PubMed=15592404; DOI=10.1038/nature03154;
RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., Ponting C.P.,
RA Bork P., Burt D.W., Groenen M.A.M., Delany M.E., Dodgson J.B.,
RA Chinwalla A.T., Cliften P.F., Clifton S.W., Delehaunty K.D., Fronick C.,
RA Fulton R.S., Graves T.A., Kremitzki C., Layman D., Magrini V.,
RA McPherson J.D., Miner T.L., Minx P., Nash W.E., Nhan M.N., Nelson J.O.,
RA Oddy L.G., Pohl C.S., Randall-Maher J., Smith S.M., Wallis J.W.,
RA Yang S.-P., Romanov M.N., Rondelli C.M., Paton B., Smith J., Morrice D.,
RA Daniels L., Tempest H.G., Robertson L., Masabanda J.S., Griffin D.K.,
RA Vignal A., Fillon V., Jacobbson L., Kerje S., Andersson L.,
RA Crooijmans R.P., Aerts J., van der Poel J.J., Ellegren H., Caldwell R.B.,
RA Hubbard S.J., Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M.,
RA Arakawa H., Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K.,
RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E.,
RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M.,
RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., Miller M.M.,
RA Inoko H., Shiina T., Kaufman J., Salomonsen J., Skjoedt K., Wong G.K.-S.,
RA Wang J., Liu B., Wang J., Yu J., Yang H., Nefedov M., Koriabine M.,
RA Dejong P.J., Goodstadt L., Webber C., Dickens N.J., Letunic I., Suyama M.,
RA Torrents D., von Mering C., Zdobnov E.M., Makova K., Nekrutenko A.,
RA Elnitski L., Eswara P., King D.C., Yang S.-P., Tyekucheva S.,
RA Radakrishnan A., Harris R.S., Chiaromonte F., Taylor J., He J.,
RA Rijnkels M., Griffiths-Jones S., Ureta-Vidal A., Hoffman M.M., Severin J.,
RA Searle S.M.J., Law A.S., Speed D., Waddington D., Cheng Z., Tuzun E.,
RA Eichler E., Bao Z., Flicek P., Shteynberg D.D., Brent M.R., Bye J.M.,
RA Huckle E.J., Chatterji S., Dewey C., Pachter L., Kouranov A.,
RA Mourelatos Z., Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M.,
RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O.,
RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., Betran E.,
RA Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., Furey T.S.,
RA Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., Eyras E.,
RA Castelo R., Abril J.F., Castellano S., Camara F., Parra G., Guigo R.,
RA Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., Mardis E.R.,
RA Wilson R.K.;
RT "Sequence and comparative analysis of the chicken genome provide unique
RT perspectives on vertebrate evolution.";
RL Nature 432:695-716(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 2792-2917.
RC STRAIN=White leghorn; TISSUE=Embryo;
RX PubMed=9473273; DOI=10.1006/abio.1997.2500;
RA Peale F.V. Jr., Mason K., Hunter A.W., Bothwell M.;
RT "Multiplex display polymerase chain reaction amplifies and resolves related
RT sequences sharing a single moderately conserved domain.";
RL Anal. Biochem. 256:158-168(1998).
CC -!- FUNCTION: May play a role in neural and muscle differentiation (By
CC similarity). May be involved in transcriptional regulation.
CC {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the krueppel C2H2-type zinc-finger protein
CC family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; U26150; AAC06188.1; -; mRNA.
DR STRING; 9031.ENSGALP00000025229; -.
DR PaxDb; O73590; -.
DR VEuPathDB; HostDB:geneid_395904; -.
DR eggNOG; KOG1146; Eukaryota.
DR InParanoid; O73590; -.
DR OrthoDB; 15351at2759; -.
DR PhylomeDB; O73590; -.
DR Proteomes; UP000000539; Unplaced.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF00096; zf-C2H2; 2.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 23.
DR SMART; SM00451; ZnF_U1; 7.
DR SUPFAM; SSF46689; SSF46689; 4.
DR SUPFAM; SSF57667; SSF57667; 5.
DR PROSITE; PS00027; HOMEOBOX_1; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 13.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.
PE 2: Evidence at transcript level;
KW Coiled coil; DNA-binding; Homeobox; Metal-binding; Nucleus;
KW Reference proteome; Repeat; Repressor; Transcription;
KW Transcription regulation; Zinc; Zinc-finger.
FT CHAIN 1..3573
FT /note="Zinc finger homeobox protein 4"
FT /id="PRO_0000278467"
FT ZN_FING 609..632
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 640..663
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 695..719
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 763..785
FT /note="C2H2-type 4; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 913..937
FT /note="C2H2-type 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 969..991
FT /note="C2H2-type 6"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1017..1041
FT /note="C2H2-type 7"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1168..1191
FT /note="C2H2-type 8"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1197..1220
FT /note="C2H2-type 9"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1348..1370
FT /note="C2H2-type 10"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1376..1399
FT /note="C2H2-type 11"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1492..1518
FT /note="C2H2-type 12"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1544..1568
FT /note="C2H2-type 13"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 1886..1909
FT /note="C2H2-type 14"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT DNA_BIND 2072..2131
FT /note="Homeobox 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2169..2228
FT /note="Homeobox 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT ZN_FING 2255..2279
FT /note="C2H2-type 15; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 2436..2458
FT /note="C2H2-type 16"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT DNA_BIND 2548..2607
FT /note="Homeobox 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT ZN_FING 2618..2641
FT /note="C2H2-type 17"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT DNA_BIND 2874..2933
FT /note="Homeobox 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT ZN_FING 2952..2976
FT /note="C2H2-type 18; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 3360..3384
FT /note="C2H2-type 19; degenerate"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 3404..3428
FT /note="C2H2-type 20"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 1..54
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 425..479
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 522..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1096..1132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1250..1340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1442..1476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1577..1596
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1795..1843
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1933..2013
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2278..2300
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2318..2426
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2499..2553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2704..2788
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2820..2875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3060..3174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3287..3343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3449..3468
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3518..3543
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 3271..3316
FT /evidence="ECO:0000255"
FT COMPBIAS 1..26
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..54
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..453
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 542..558
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..603
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1096..1110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1111..1127
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1280..1317
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1318..1336
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1580..1596
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1795..1830
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1937..1964
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1978..2010
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2318..2334
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2343..2360
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2361..2426
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2499..2515
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2520..2553
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2704..2722
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2741..2776
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2820..2845
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2846..2867
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3062..3101
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3102..3131
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3140..3174
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3287..3318
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3319..3343
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3451..3468
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3518..3533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 2823
FT /note="K -> E (in Ref. 2; AAC06188)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 3573 AA; 394548 MW; CEB0613699D7993A CRC64;
METCDSPTIS RQENGQSTSK LCGTTQLDNE VPEKVAGMEP DRENSSTDDN LRTDERKSEI
LLGFSVENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DNESEISELE
DSDVENLTGE IVYQPDGSAY IIEDSKESGQ NAQTGANSKL FSTAMFLDSL TSAGEKNEQS
ASAPMSFYPQ IINTFHIASS LGKPFTADQA FPNTSALAGV GPVLHSFRVY DLRHKRDKDY
LTSDGSAKNS CVSKDVPNNV DLSKFDGCVS DGKRKPVLMC FLCKLSFGYI RSFVTHAVHD
HRMTLNEEEQ KLLSNKYVSA IIQGIGKDKE PLISFLEPKK STSVYPHFST TNLIGPDPTF
RGLWSAFHVE NGDSLQAGFA FLKGSASTAG SAEQPVGITQ MPKAEVNLGG LSSLVANAPI
TSVSLSHSSS ESNKLSESKD QENNCERQKE TNTLHPNGEF PIKSEPTEPV EEEDEDTYSN
ELDDDEVLGE LADSIGSKDF PLLNQSISPL SSSVLKFIEK GTSSSSATVS DDTDKTKQTA
AHRHNSNVTS NYSISGKDFA DASASKDSPT ALHPNETARG DEDSSVTPHQ HSFTPSTPSA
GDGSPGSGIE CPKCDTVLGS SRSLGGHMTM MHSRNSCKTL KCPKCNWHYK YQQTLEAHMK
EKHPEPGGSC VYCKTGQPHP RLARGESYTC GYKPFRCEVC NYSTTTKGNL SIHMQSDKHL
NNVQNLQNGN GEQVYGHTAP PSNPALSGCG TPSPSKPKQK PTWRCEVCDY ETNVARNLRI
HMTSEKHMHN MMLLQQNMKQ IQHNLHLGLA PAEAELYQYY LAQNIGLTGM KLENPADPQM
MINPFQLDPA TAAALAPGLG ELSPYISDPA LKLFQCAVCN KFTSDSLEAL SVHVSSERLL
PEEEWRAVIG DIYQCKLCNY NTQLKANFQL HCKTDKHMQK YQLVAHIKEG GKTNEWRLKC
IAIGNPVHLK CNACDYYTNS VDKLRLHTTN HRHEAALKLY KHLQKHESTV NPDSCYYYCA
LCDYSTKVKL NLVQHVRSVK HQQTEGLRKL QLHQQGLAPE EDNLSEIFFV KDCPPNELEV
AQLGCRTCDI SLTEQGEDTE GSAKSTSVAI GDDKDSSERD NTEAKKSSKD SVNTVVGAQQ
LLLAKEEDGA AKKSKPPEDN KFCHEQFYQC PYCNYNSRDP NRIQMHVLSQ HSMQPVICCP
LCQDVLSNKM HLQLHLTHLH SVSPDCVEKL LMTVPVPDVM MPNSMLLPAA ASEKSERDTP
ATITAEGSGK YSGESPVDEK STPGTDESKP GMEIKSEEQK PPKESAETPD WNKSGSKDIK
TTDSMPDQLN EQQKKQQLSV SDRHVYKYRC NHCSLAFKTM QKLQIHSQYH AIRAATMCSL
CQRSFRTFQA LKKHLEAGHP ELNEAELQQL CASLPVNGEL WAEGESMGQD DHALEQEIER
DYEMEQEGKA SPVGSDSSSI PDDMGSEPKR TLPFRKGPNF TMEKFLDPSR PYKCTVCKES
FTQKNILLVH YNSVSHLHKL KKVLQEASSP VPQETNSNTD NKPYKCSICN VAYSQSSTLE
IHMRSVLHQT KARAAKLEPS GNISSGNSVA GNVNSPSQGI LDSMSLPAVN SKEPHLDAKE
LNKKQASELI SAQPTHHPPQ SPAQLQMQLQ HELQQQAAFF QPQFLNPAFL PHFPMTPEAL
LQFQQPQFLF PFYIPGTEFS LSPDLGLPGS ATFGMPGMAG MTGSLLEDLK QQIQTQHHVG
QTQLQILQQQ AQQYQSTQPQ LQSQKQQQQS SKLMKAEQTS LVSTDCQLIK DMPSYKESEE
ISEKQEKPKQ EFTNESEGLK ENKDMKKPKS SEPAIPPPRI ASGARGNAAK ALLENFGFEL
VIQYNENRQK VQKKGKTGEG ENTDKLECGT CSKLFSNILI LKSHQEHVHG QFFPYGALEK
FARQYREAYD KLYPISPSSP ETPPPPPPPP PPPPPPPPPT PSQPSSAGAG KIQNTTPTPL
QAPPPTPPPP PPPPPPPPPP PPPPSAPPQV QLPVSLDLPL FPPIMMQPVQ HPALPPQLAL
QLPPMDTLSA DLTQLCQQQL GLDPNFLRHS QFKRPRTRIT DDQLKILRAY FDINNSPSEE
QIQEMAEKSG LSQKVIKHWF RNTLFKERQR NKDSPYNFSN PPITVLEDIR IDPQPSAVEP
YKSDASFSKR SSRTRFTDYQ LRVLQDFFDT NAYPKDDEIE QLSTVLNLPT RVIVVWFQNA
RQKARKSYEN QAETKDNEKR ELTNERYIRT SNMQYQCKKC SVVFPRIFDL ITHQKKQCYK
DEDDDAQDES QTEDSMDASD QTVYKNCTVS GQNDASKSLA VTAASSGSGS STPLIPSPKP
EPEKASPKSE STEKPKPNET ISKQTDTTSQ SSKPVQSASV TPSDPQPSAS QPQQQKQSQI
IGRPPSTSQT TPVPSSPLPI SMTPLQNSLP PQLLQYQCDQ CTVAFPTLEL WQEHQHMHFL
AAQNQFIHSQ FLERPMDMPY MIFDPNNPLM TGQLLNSSLA QMPPQTGSSH AAHPATVSGS
MKRKLDDKED NNCSEKEGGN SGEDQHRDKR LRTTITPEQL EILYEKYLLD SNPTRKMLDH
IAREVGLKKR VVQVWFQNTR ARERKGQFRA VGPAQSHKRC PFCRALFKAK SALESHIRSR
HWNEGKQAGY SLPPSPLIST EDGGDSPQKY IFFDYPSLSL AKTELSSENE LASTVSTPIS
KTAEMSPKNL LSPSSFKAES SEDIENLNAP PADSAYDQNK TDFDETSSIN TAISDATTGD
EGNNEMESTT GSSGDAKPAS PPKEPKPLVN DALTKAATTP TNENTDDKFL FSLTSPSIHF
SEKDGDHDQS YYITDDPDDN ADRSETSSIA DPSSPNPFGA SNPFKSKNSD RPGHKRFRTQ
MSNLQLKVLK ACFSDYRTPT MQECEMLGNE IGLPKRVVQV WFQNARAKEK KFKINIGKPF
MINQTGPDGT KPECSLCGVK YSARLSIRDH IFSKQHITKV RETVGSQLDR EKDYLAPTTV
RQLMAQQELD RIKKATDVLG LTVQQPGMMD SSSLHGISLP AAYPGLPGLP PVLLPGMNGP
SSLPGFPQSS NTLTSPGAGM LGFPTSATSS PALSLSSAPS KPLLQTPPPP PPPPPPPPPP
PPPPPPPPSS SLSGQQTEQQ SKESEKKNTI NKPNKVKKIK EEELEANKPE KHLKKEEKIS
SALSVLGKVV GEAHVDPTQL QALQNAIAGD PASFIGGQFL PYFIPGFASY FTPQLPGTVQ
GGYLPPVCGM ESLFPYGPAV PQTIAGLSPG ALLQQYQQYQ QNLQDSLQKQ QKQQQEQQQK
QVQAKSSKAE NDQQQNSSDT SETKEDRSSA TESTKEEPQL DSKSADFSDT YIVPFVKYEF
ICRKCQMMFT DEDAAVNHQK SFCYFGQPLI DPQETVLRVP VSKYQCLACD VAISGNEALS
QHLQSSLHKE KTIKQAMRNA KEHVRLLPHS VCSPNPNTTS TSQSAASSNN TYPHLSCFSM
KSWPNILFQA SARKAASSPS SPPSLSLPST VTSSLCSTSG VQTSLPTESC SDESDSELSQ
KLEDLDNSLE VKAKPASGLD GNFNSIRMDM FSV