G156_PARPR
ID G156_PARPR Reviewed; 2715 AA.
AC P13837;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 01-JAN-1990, sequence version 1.
DT 02-DEC-2020, entry version 77.
DE RecName: Full=G surface protein, allelic form 156;
DE Flags: Precursor;
GN Name=156G;
OS Paramecium primaurelia.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5886;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=156;
RX PubMed=3783679; DOI=10.1016/0022-2836(86)90380-3;
RA Prat A., Katinka M., Caron F., Meyer E.;
RT "Nucleotide sequence of the Paramecium primaurelia G surface protein. A
RT huge protein with a highly periodic structure.";
RL J. Mol. Biol. 189:47-60(1986).
CC -!- FUNCTION: This protein is the surface antigen or immobilization antigen
CC of Paramecium primaurelia.
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor, GPI-anchor.
CC -!- INDUCTION: Expression of G protein occurs at low temperatures (14-32
CC degrees Celsius).
CC -!- DOMAIN: It has internal homologies and a highly periodic structure with
CC 34 periods of about 75 residues, each period containing 8 cysteines,
CC except for four half periods. A variable part of 475 residues comprises
CC 4 almost identical periods in the middle of the protein.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X03882; CAA27514.1; -; Genomic_DNA.
DR PIR; A23475; A23475.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR002895; Paramecium_SA.
DR InterPro; IPR016201; PSI.
DR Pfam; PF01508; Paramecium_SA; 31.
DR SMART; SM00639; PSA; 33.
DR SMART; SM00423; PSI; 10.
PE 2: Evidence at transcript level;
KW Cell membrane; Glycoprotein; GPI-anchor; Lipoprotein; Membrane; Repeat;
KW Signal.
FT SIGNAL 1..20
FT /evidence="ECO:0000255"
FT CHAIN 21..2715
FT /note="G surface protein, allelic form 156"
FT /id="PRO_0000021310"
FT REPEAT 111..171
FT /note="PSA 1"
FT REPEAT 177..237
FT /note="PSA 2"
FT REPEAT 243..303
FT /note="PSA 3"
FT REPEAT 309..366
FT /note="PSA 4"
FT REPEAT 372..404
FT /note="PSA 5"
FT REPEAT 405..467
FT /note="PSA 6"
FT REPEAT 473..530
FT /note="PSA 7"
FT REPEAT 536..596
FT /note="PSA 8"
FT REPEAT 602..673
FT /note="PSA 9"
FT REPEAT 688..748
FT /note="PSA 10"
FT REPEAT 752..812
FT /note="PSA 11"
FT REPEAT 820..895
FT /note="PSA 12"
FT REPEAT 934..1001
FT /note="PSA 13"
FT REPEAT 1008..1067
FT /note="PSA 14"
FT REPEAT 1073..1141
FT /note="PSA 15"
FT REPEAT 1147..1215
FT /note="PSA 16"
FT REPEAT 1221..1289
FT /note="PSA 17"
FT REPEAT 1295..1363
FT /note="PSA 18"
FT REPEAT 1369..1437
FT /note="PSA 19"
FT REPEAT 1443..1507
FT /note="PSA 20"
FT REPEAT 1513..1578
FT /note="PSA 21"
FT REPEAT 1586..1652
FT /note="PSA 22"
FT REPEAT 1693..1751
FT /note="PSA 23"
FT REPEAT 1759..1819
FT /note="PSA 24"
FT REPEAT 1827..1898
FT /note="PSA 25"
FT REPEAT 1904..1976
FT /note="PSA 26"
FT REPEAT 1984..2044
FT /note="PSA 27"
FT REPEAT 2080..2149
FT /note="PSA 28"
FT REPEAT 2155..2215
FT /note="PSA 29"
FT REPEAT 2219..2286
FT /note="PSA 30"
FT REPEAT 2290..2355
FT /note="PSA 31"
FT REPEAT 2359..2430
FT /note="PSA 32"
FT REPEAT 2434..2500
FT /note="PSA 33"
FT REPEAT 2505..2573
FT /note="PSA 34"
SQ SEQUENCE 2715 AA; 279551 MW; 97BE359AB9C7C298 CRC64;
MNNKFIIFSL LLALVASQTY SLTSCTCAQL LSEGDCIKNV SLGCSWDTTK KTCGVSTTPV
TPTVTYAAYC DTFAETDCPK AKPCTDCGNY AACAWVESKC TFFTGCTPFA KTLDSECQAI
SNRCITDGTH CVEVDACSTY KKQLPCAKNA AGSLCYWDTT NNTCVDANTC DKLPATFATD
KDCRDVISTC TTKTGGGCVD SGNNCSDQTL EIQCVWNKLK TTSCYWDGAA CKDRICDNAP
TSLTTDDACK TFRTDGTCTT KANGGCVTRT TCAAATIQAS CIKNSSGGDC YWTGTACVDK
ACANTPTTIA TNSACAGFVT GCITKSGGGC VVNGACSVAN VQAACVKNPS NFDCIWDTTC
KEKTCANAST TNNTHDLCTS YLSTCTVKSG GGCQNRTCAN APTTMTTNDA CEAYFTGNNC
ITKSGGGCVT NTTCAAITLE AACVKNSSGS TCFWDTASSS CKDKTCVNAP ATNTTHDLCQ
PFLNTCTVNS TSAGCVEKTC ENSLVLAICD KDTSSRACIW KGKCYKKQCV LASSATTTHA
DCQTYHSTCT LSNSGTGCVP LPLKCEAITI EAACNLKANG QPCGWNGSQC IDKACSTASK
TFTTTSQCTG HISTCVANNP VTVNGSLTIQ GCQDLPTTCA RRKSSENCEI TRVGFPTCLW
VSSSTSCVEK SCATASTVGT TGALSAGGFT FSGCQTYLNT CISNNTADGC IAKPSSCSSL
VSSNCRDGSK ASGDCYWNGS SCVDKTCANI IQTTHNSCNT TFNQCTVNNG GTACQTLATA
CTSYSTQENC KFTSTNKNCV WTGLACRNAT CADAPDTTAY DSDTECLAYP TPSETCTVVY
KVGAQGCVSK SANCSDYMTS AQCHKTLTNL TANDDCKWIV DRCYALSSFA TGACTTFKGT
KTMCEGYRAG CTNTVGAASS ASCTLDCTLK TGSGLTFADC QALDSTCSVK KDGTGCIAIQ
STCAGYGSTA ANCFRSSASG TAGYCAMNTN CQSVTSAAEC AFVTGLTGLD HSKCQLYHSS
CTSLKDGTGC QEYKTTCSGY AATNNCATSG QGKCFFDVEC LRFSNCASIT GTGLTTAICG
TYDAGCVANV NGTACQEKLA TCDLYLTQNS CSTSAAAATA DKCAWSGTAC LAVTTVGTHC
PYVTGTGLTD LICAAYNANC TANKAGTACQ EKKATCNLYT TEATCSTSAA AATADKCAWS
GAACLAVTTV ATECAYVTGT GLTDLICAAY NANCTANKAG TACQEKKATC NLYTTEATCS
TSAAAATADK CAWSGAACLA VTTVATECAY VTGTGLTNAI CAAYNANCTA NKAGTACQEK
KATCNLYTTE ATCSTSAAAA TADKCAWSGA ACLAVTTVAT ECAYVTGTGL TNAICAAYNA
NCTANKAGTA CQEKKATCNL YTTEATCSTS AAAATADKCA WSGAACLAVT TVATECAYVT
GTGLTKAICA TYNAGCINLK DGTGCQEAKA NCKDYTTSNK CTAQTTSTLS CLWIDNSCYP
VTDLNCSVIT GLGFVHAQCQ AYSTGCTSVS DGSKCQDFKS TCEQYPGTTL GCTKTASTKC
YLQGSACITI SNVATDCAKI TGSAGTITFE ICQSYNTGCS VNRARSACVQ QQAQCSGYTS
AMTSCYKSGA GLCIASTNTD TACVAATAAT CDAVYLGAGN YSSANCNEMK AGCTNNGTTA
CVAKTCANAA GITFNHTNCN SYLNTCTVNS GNSACQTMAS KCADQTQASC LYSVEGECVV
VGTSCVRKTC DTAATDATRD DDTECSTYQQ SCTVARLGAC QARAACATYK SSLQCKFNTS
GGKCFWNPTN KTCVDLNCGN IEATTLYDTH NECVAVDATL ACTVRATNGA AAQGCMARGA
CASYTIEEQC KTNASNGVCV WNTNANLPAP ACQDKSCTSA PTSTTTHNDC YAYYNTATVK
CTVVATPSNS GGNPTLGGCQ QTAACSSYID KEQCQINANG DPCGWNGTQC ADKSCATASA
TADYDDDTKC RAYITNKCTV SDSGQGCVEI PATCETMTQK QCYYNKAGDP CYWTGTACIT
KSCDNAPDAT ATADECNTYL AGCTLNNVKC KTKVCEDFAF ATDALCKQAI STCTTNGTNC
VTRGTCFQAL SQAGCVTSST NQQCEWIPAV LNASNVITSP AYCTIKNCST APITLTSEAA
CAGYFTNCTT KNGGGCVTKS TCSAVTIDVA CTTALNGTVC AWDSAQNKCR DKDCQDFSGT
THAACQAQRA GCTAGAGGKC ARVQNCEQTS VRAACIEGTN GPCLWIDKYQ NTDGTKGACF
RYTSCKSLNW NNDSSCKWIS NKCTTNGSNC VGITLCSETN TDGGCVTGYD GACIQSVPDL
NSSDPKVCKP YTSCADAFYT THSDCQIASS KCTTNGTTGC IALGSCSSYT VQAGCYFNDK
GTLYTSGVIT STGICTWDTT SSSCRDQSCA DLTGTTHATC SSQLSTCTSD GTTCLLKGAC
TSYTTQTACT TAVGSDGACY WELASATNNN TAKCRLLTCA DIQNGTATNV CSVALSTCVS
NGTACIPKAN CSTYTSKVAC NSGGLDGICV FTQSTATGAA AGTGTCALMT ACTVANNDQT
ACQAARDRCS WTAASGTRAT AVASKCATHT CATNQATNGA CTRFLNWDKK TQQVCTLVSG
ACTATDPSSF SSNDCFLVSG YTYTCNASTS KCGVCTAVVV QPNTTDNNTN TTDNNTTTDS
GYILGLSIVL GYLMF
//