MSP1_PLAFP
ID MSP1_PLAFP Reviewed; 1726 AA.
AC P50495;
DT 01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-1996, sequence version 1.
DT 25-MAY-2022, entry version 84.
DE RecName: Full=Merozoite surface protein 1;
DE AltName: Full=Gp195;
DE AltName: Full=Merozoite surface antigens;
DE AltName: Full=PMMSA;
DE Flags: Precursor;
GN Name=MSP-1;
OS Plasmodium falciparum (isolate Palo Alto / Uganda).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=57270;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=3049134; DOI=10.1016/0014-4894(88)90002-1;
RA Chang S.P., Kramer K.J., Yamaga K.M., Kato A., Case S.E., Siddiqui W.A.;
RT "Plasmodium falciparum: gene structure and hydropathy profile of the major
RT merozoite surface antigen (gp195) of the Uganda-Palo Alto isolate.";
RL Exp. Parasitol. 67:1-11(1988).
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor, GPI-anchor.
CC -!- PTM: Merozoite surface antigen contain the sequence of 83 kDa, 42 kDa
CC and 19 kDa antigens which are the major surface antigens of merozoites.
CC The maturation take place during schizont.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; M37213; AAA29611.1; -; Genomic_DNA.
DR PDB; 1OB1; X-ray; 2.90 A; C/F=1613-1705.
DR PDBsum; 1OB1; -.
DR AlphaFoldDB; P50495; -.
DR BMRB; P50495; -.
DR SMR; P50495; -.
DR EvolutionaryTrace; P50495; -.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR010901; MSP1_C.
DR InterPro; IPR024730; MSP1_EGF_1.
DR Pfam; PF12946; EGF_MSP1_1; 1.
DR Pfam; PF07462; MSP1_C; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Cell membrane; Disulfide bond; Glycoprotein; GPI-anchor;
KW Lipoprotein; Malaria; Membrane; Merozoite; Repeat; Signal.
FT SIGNAL 1..19
FT /evidence="ECO:0000255"
FT CHAIN 20..1705
FT /note="Merozoite surface protein 1"
FT /id="PRO_0000024559"
FT PROPEP 1706..1726
FT /note="Removed in mature form"
FT /evidence="ECO:0000250"
FT /id="PRO_0000024560"
FT REGION 61..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 735..771
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 914..961
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1254..1284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1476..1497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 61..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 735..754
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 914..957
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1264..1284
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT LIPID 1705
FT /note="GPI-anchor amidated serine"
FT /evidence="ECO:0000250"
FT CARBOHYD 133
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 272
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 501
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 567
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 638
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 827
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 924
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 944
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 990
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1016
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1114
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1221
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 1613
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT DISULFID 1619..1630
FT /evidence="ECO:0000250"
FT DISULFID 1624..1640
FT /evidence="ECO:0000250"
FT DISULFID 1642..1653
FT /evidence="ECO:0000250"
FT DISULFID 1661..1674
FT /evidence="ECO:0000250"
FT DISULFID 1668..1688
FT /evidence="ECO:0000250"
FT DISULFID 1690..1704
FT /evidence="ECO:0000250"
SQ SEQUENCE 1726 AA; 196175 MW; 5B59CEEFA2F9A026 CRC64;
MKIIFFLCSF LFFIINTQCV THESYQELVK KLEALEDAVL TGYGLFHKEK MILNEEEITT
KGASAQSGTS GTSGTSGTSG TSGTSGTSAQ SGTSGTSAQS GTSGTSAQSG TSGTSGTSGT
SPSSRSNTLP RSNTSSGASP PADASDSDAK SYADLKHRVR NYLFTIKELK YPELFDLTNH
MLTLCDNIHG FKYLIDGYEE INELLYKLNF YFDLLRAKLN DVCANDYCQI PFNLKIRANE
LDVLKKLVFG YRKPLDNIKD NVGKMEDYIK KNKTTIANIN ELIEGSKKTI DQNKNADNEE
GKKKLYQAQY DLSIYNKQLE EAHNLISVLE KRIDTLKKNE NIKELLDKIN EIKNPPPANS
GNTPNTLLDK NKKIEEHEEK IKEIAKTIKF NIDSLFTDPL ELEYYLREKN KKVDVTPKSQ
DPTKSVQIPK VPYPNGIVYP LPLTDIHNSL AADNDKNSYG DLMNPDTKEK INEKIITDNK
ERKIFINNIK KQIDLEEKKI NHTKEQNKKL LEDYEKSKKD YEELLEKFYE MKFNNNFDKD
VVDKIFSARY TYNVEKQRYN NKFSSSNNSV YNVQKLKKAL SYLEDYSLRK GISEKDFNHY
YTLKTGLEAD IKKLTEEIKS SENKILEKNF KGLTHSANAS LEVYDIVKLQ VQKVLLIKKI
EDLRKIELFL KNAQLKDSIH VPNIYKPQNK PEPYYLIVLK KEVDKLKEFI PKVKDMLKKE
QAVLSSITQP LVAASETTED GGHSTHTLSQ SGETEVTEET EETEETVGHT TTVTITLPPK
EVKVVENSIE HKSNDNSQAL TKTVYLKKLD EFLTKSYICH KYILVSNSSM DQKLLEVYNL
TPEEENELKS CDPLDLLFNI QNNIPAMYSL YDSMNNDLQH LFFELYQKEM IYYLHKLKEE
NHIKKLLEEQ KQITGTSSTS SPGNTTVNTA QSATHSNSQN QQSNASSTNT QNGVAVSSGP
AVVEESHDPL TVLSISNDLK GIVSLLNLGN KTKVPNPLTI STTEMEKFYE NILKNNDTYF
NDDIKQFVKS NSKVITGLTE TQKNALNDEI KKLKDTLQLS FDLYNKYKLK LDRLFNKKKE
LGQDKMQIKK LTLLKEQLES KLNSLNNPHN VLQNFSVFFN KKKEAEIAET ENTLENTKIL
LKHYKGLVKY YNGESSPLKT LSEVSIQTED NYANLEKFRV LSKIDGKLND NLHLGKKKLS
FLSSGLHQLI TELKEVIKNK NYTGNSPSEN NKKVNEALKS YENFLPEAKV TTVVTPPQPD
VTPSPLSVRV SGSSGSTKEE TQIPTSGSLL TELQQVVQLQ NYDEEDDSLV VLPIFGESED
NDEYLDQVVT GEAISVTMDN ILSGFENEYD VIYLKPLAGV YRSLKKQIEK NIFTFNLNLN
DILNSRLKKR KYFLDVLESD LMQFKHISSN EYIIEDSFKL LNSEQKNTLL KSYKYIKESV
ENDIKFAQEG ISYYEKVLAK YKDDLESIKK VIKEEKEKFP SSPPTTPPSP AKTDEQKKES
KFLPFLTNIE TLYNNLVNKI DDYLINLKAK INDCNVEKDE AHVKITKLSD LKAIDDKIDL
FKNHNDFDAI KKLINDDTKK DMLGKLLSTG LVQNFPNTII SKLIEGKFQD MLNISQHQCV
KKQCPENSGC FRHLDEREEC KCLLNYKQEG DKCVENPNPT CNENNGGCDA DAKCTEEDSG
SNGKKITCEC TKPDSYPLFD GIFCSSSNFL GISFLLILML ILYSFI