MSP1_PLAFM
ID MSP1_PLAFM Reviewed; 1701 AA.
AC P08569;
DT 01-AUG-1988, integrated into UniProtKB/Swiss-Prot.
DT 10-MAY-2004, sequence version 3.
DT 25-MAY-2022, entry version 91.
DE RecName: Full=Merozoite surface protein 1;
DE AltName: Full=Merozoite surface antigens;
DE AltName: Full=PMMSA;
DE AltName: Full=p190;
DE Flags: Precursor;
GN Name=MSP-1;
OS Plasmodium falciparum (isolate mad20 / Papua New Guinea).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=5841;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=3079521; DOI=10.1016/0022-2836(87)90649-8;
RA Tanabe K., Mackay M., Goman M., Scaife J.G.;
RT "Allelic dimorphism in a surface antigen gene of the malaria parasite
RT Plasmodium falciparum.";
RL J. Mol. Biol. 195:273-287(1987).
RN [2]
RP SEQUENCE REVISION TO 821; 1220; 1403; 1569 AND 1629.
RA Tanabe K.;
RL Submitted (NOV-2003) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-115.
RX PubMed=3004972; DOI=10.1002/j.1460-2075.1985.tb04154.x;
RA Mackay M., Goman M., Bone N., Hyde J.E., Scaife J., Certa U.,
RA Stunnenberg H., Bujard H.;
RT "Polymorphism of the precursor for the major surface antigens of Plasmodium
RT falciparum merozoites: studies at the genetic level.";
RL EMBO J. 4:3823-3829(1985).
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor, GPI-anchor.
CC -!- PTM: The merozoite surface antigen contains the sequences of the
CC 83 kDa, 42 kDa and 19 kDa antigens, which are the major surface antigens
CC of merozoites. Maturation takes place during the schizont stage.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X05624; CAA29112.2; -; Genomic_DNA.
DR AlphaFoldDB; P08569; -.
DR BMRB; P08569; -.
DR SMR; P08569; -.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR010901; MSP1_C.
DR InterPro; IPR024730; MSP1_EGF_1.
DR Pfam; PF12946; EGF_MSP1_1; 1.
DR Pfam; PF07462; MSP1_C; 1.
PE 3: Inferred from homology;
KW Cell membrane; Disulfide bond; Glycoprotein; GPI-anchor; Lipoprotein;
KW Malaria; Membrane; Merozoite; Repeat; Signal.
FT SIGNAL 1..19
FT /evidence="ECO:0000255"
FT CHAIN 20..1680
FT /note="Merozoite surface protein 1"
FT /id="PRO_0000024556"
FT PROPEP 1681..1701
FT /note="Removed in mature form"
FT /evidence="ECO:0000250"
FT /id="PRO_0000024557"
FT REGION 89..118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 322..344
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 704..739
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 889..936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1230..1259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1451..1472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 89..117
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..725
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 889..932
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1239..1259
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT LIPID 1680
FT /note="GPI-anchor amidated serine"
FT /evidence="ECO:0000250"
FT CARBOHYD 110
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 239
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 470
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 536
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 607
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 802
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 899
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 919
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 965
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 991
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1089
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1196
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1588
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT DISULFID 1594..1605
FT /evidence="ECO:0000250"
FT DISULFID 1599..1615
FT /evidence="ECO:0000250"
FT DISULFID 1617..1628
FT /evidence="ECO:0000250"
FT DISULFID 1636..1649
FT /evidence="ECO:0000250"
FT DISULFID 1643..1663
FT /evidence="ECO:0000250"
FT DISULFID 1665..1679
FT /evidence="ECO:0000250"
SQ SEQUENCE 1701 AA; 193721 MW; 40461E9DA599E6E1 CRC64;
MKIIFFLCSF LFFIINTQCV THESYQELVK KLEALEDAVL TGYSLFQKEK MVLNEGTSGT
AVTTSTPGSS GSVTSGGSVA SVASVASGGS GGSVASGGSG NSRRTNPSDN SSDSNTKTYA
DLKHRVQNYL FTIKELKYPE LFDLTNHMLT LSKNVDGFKY LIDGYEEINE LLYKLNFYYD
LLRAKLNDAC ANSYCQIPFN LKIRANELDV LKKIVFGYRK PLDNIKDNVG KMEDYIKKNK
TTIANINELI EGSKKTIDQN KNADNEEGKK KLYQAQYNLF IYNKQLQEAH NLISVLEKRI
DTLKKNENIK KLLEDIDKIK TDAENPTTGS KPNPLPENKK KEVEGHEEKI KEIAKTIKFN
IDSLFTDPLE LEYYLREKNK KVDVTPKSQD PTKSVQIPKV PYPNGIVYPL PLTDIHNSLA
ADNDKNSYGD LMNPDTKEKI NEKIITDNKE RKIFINNIKK QIDLEEKNIN HTKEQNKKLL
EDYEKSKKDY EELLEKFYEM KFNNNFDKDV VDKIFSARYT YNVEKQRYNN KFSSSNNSVY
NVQKLKKALS YLEDYSLRKG ISEKDFNHYY TLKTGLEADI KKLTEEIKSS ENKILEKNFK
GLTHSANASL EVSDIVKLQV QKVLLIKKIE DLRKIELFLK NAQLKDSIHV PNIYKPQNKP
EPYYLIVLKK EVDKLKEFIP KVKDMLKKEQ AVLSSITQPL VAASETTEDG GHSTHTLSQS
GETEVTEETE VTEETVGHTT TVTITLPPKE ESAPKEVKVV ENSIEHKSND NSQALTKTVY
LKKLDEFLTK SYICHKYILV SNSSMDQKLL EVYNLTPEEE NELKSCDPLD LLFNIQNNIP
AMYSLYDSMN NDLQHLFFEL YQKEMIYYLH KLKEENHIKK LLEEQKQITG TSSTSSPGNT
TVNTAQSATH SNSQNQQSNA SSTNTQNGVA VSSGPAVVEE SHDPLTVLSI SNDLKGIVSL
LNLGNKTKVP NPLTISTTEM EKFYENILKN NDTYFNDDIK QFVKSNSKVI TGLTETQKNA
LNDEIKKLKD TLQLSFDLYN KYKLKLDRLF NKKKELGQDK MQIKKLTLLK EQLESKLNSL
NNPHNVLQNF SVFFNKKKEA EIAETENTLE NTKILLKHYK GLVKYYNGES SPLKTLSEVS
IQTEDNYANL EKFRALSKID GKLNDNLHLG KKKLSFLSSG LHHLITELKE VIKNKNYTGN
SPSENNKKVN EALKSYENFL PEAKVTTVVT PPQPDVTPSP LSVRVSGSSG STKEETQIPT
SGSLLTELQQ VVQLQNYDEE DDSLVVLPIF GESEDNDEYL DQVVTGEAIS VTMDNILSGF
ENEYDVIYLK PLAGVYRSLK KQIEKNIITF NLNLNDILNS RLKKRKYFLD VLESDLMQFK
HISSNEYIIE DSFKLLNSEQ KNTLLKSYKY IKESVENDIK FAQEGISYYE KVLAKYKDDL
ESIKKVIKEE KEKFPSSPPT TPPSPAKTDE QKKESKFLPF LTNIETLYNN LVNKIDDYLI
NLKAKINDCN VEKDEAHVKI TKLSDLKAID DKIDLFKNTN DFEAIKKLIN DDTKKDMLGK
LLSTGLVQNF PNTIISKLIE GKFQDMLNIS QHQCVKKQCP ENSGCFRHLD EREECKCLLN
YKQEGDKCVE NPNPTCNENN GGCDADATCT EEDSGSSRKK ITCECTKPDS YPLFDGIFCS
SSNFLGISFL LILMLILYSF I