G168_PARPR
ID G168_PARPR Reviewed; 2704 AA.
AC P17053;
DT 01-AUG-1990, integrated into UniProtKB/Swiss-Prot.
DT 01-AUG-1990, sequence version 1.
DT 02-DEC-2020, entry version 75.
DE RecName: Full=G surface protein, allelic form 168;
DE Flags: Precursor;
GN Name=168G;
OS Paramecium primaurelia.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5886;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=168;
RX PubMed=2308165; DOI=10.1016/0022-2836(90)90263-l;
RA Prat A.;
RT "Conserved sequences flank variable tandem repeats in two alleles of the G
RT surface protein of Paramecium primaurelia.";
RL J. Mol. Biol. 211:521-535(1990).
CC -!- FUNCTION: This protein is the surface antigen or immobilization antigen
CC of Paramecium primaurelia.
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor, GPI-anchor.
CC -!- INDUCTION: Expression of G protein occurs at low temperatures (14-32
CC degrees Celsius).
CC -!- DOMAIN: It has internal homologies and a highly periodic structure with
CC 37 periods of about 75 residues, each period containing 8 cysteines,
CC except for four half periods. A variable part of 475 residues comprises
CC 4 almost identical periods in the middle of the protein.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; X52133; CAA36378.1; -; Genomic_DNA.
DR PIR; S09118; S09118.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR002895; Paramecium_SA.
DR InterPro; IPR016201; PSI.
DR Pfam; PF01508; Paramecium_SA; 31.
DR SMART; SM00639; PSA; 33.
DR SMART; SM00423; PSI; 12.
PE 2: Evidence at transcript level;
KW Cell membrane; Glycoprotein; GPI-anchor; Lipoprotein; Membrane; Repeat;
KW Signal.
FT SIGNAL 1..20
FT /evidence="ECO:0000255"
FT CHAIN 21..2704
FT /note="G surface protein, allelic form 168"
FT /id="PRO_0000021311"
FT REPEAT 112..165
FT /note="PSA 1"
FT /evidence="ECO:0000255"
FT REPEAT 172..231
FT /note="PSA 2"
FT /evidence="ECO:0000255"
FT REPEAT 238..297
FT /note="PSA 3"
FT /evidence="ECO:0000255"
FT REPEAT 304..360
FT /note="PSA 4"
FT /evidence="ECO:0000255"
FT REPEAT 400..460
FT /note="PSA 5"
FT /evidence="ECO:0000255"
FT REPEAT 468..523
FT /note="PSA 6"
FT /evidence="ECO:0000255"
FT REPEAT 530..590
FT /note="PSA 7"
FT /evidence="ECO:0000255"
FT REPEAT 596..667
FT /note="PSA 8"
FT /evidence="ECO:0000255"
FT REPEAT 683..742
FT /note="PSA 9"
FT /evidence="ECO:0000255"
FT REPEAT 747..806
FT /note="PSA 10"
FT /evidence="ECO:0000255"
FT REPEAT 815..881
FT /note="PSA 11"
FT /evidence="ECO:0000255"
FT REPEAT 929..994
FT /note="PSA 12"
FT /evidence="ECO:0000255"
FT REPEAT 1003..1061
FT /note="PSA 13"
FT /evidence="ECO:0000255"
FT REPEAT 1069..1123
FT /note="PSA 14"
FT /evidence="ECO:0000255"
FT REPEAT 1141..1196
FT /note="PSA 15"
FT /evidence="ECO:0000255"
FT REPEAT 1214..1269
FT /note="PSA 16"
FT /evidence="ECO:0000255"
FT REPEAT 1287..1342
FT /note="PSA 17"
FT /evidence="ECO:0000255"
FT REPEAT 1360..1415
FT /note="PSA 18"
FT /evidence="ECO:0000255"
FT REPEAT 1433..1495
FT /note="PSA 19"
FT /evidence="ECO:0000255"
FT REPEAT 1503..1566
FT /note="PSA 20"
FT /evidence="ECO:0000255"
FT REPEAT 1576..1641
FT /note="PSA 21"
FT /evidence="ECO:0000255"
FT REPEAT 1684..1740
FT /note="PSA 22"
FT /evidence="ECO:0000255"
FT REPEAT 1750..1807
FT /note="PSA 23"
FT /evidence="ECO:0000255"
FT REPEAT 1817..1887
FT /note="PSA 24"
FT /evidence="ECO:0000255"
FT REPEAT 1893..1965
FT /note="PSA 25"
FT /evidence="ECO:0000255"
FT REPEAT 1974..2033
FT /note="PSA 26"
FT /evidence="ECO:0000255"
FT REPEAT 2070..2137
FT /note="PSA 27"
FT /evidence="ECO:0000255"
FT REPEAT 2145..2204
FT /note="PSA 28"
FT /evidence="ECO:0000255"
FT REPEAT 2209..2274
FT /note="PSA 29"
FT /evidence="ECO:0000255"
FT REPEAT 2348..2419
FT /note="PSA 30"
FT /evidence="ECO:0000255"
FT REPEAT 2424..2489
FT /note="PSA 31"
FT /evidence="ECO:0000255"
SQ SEQUENCE 2704 AA; 278776 MW; 40EA0A0B18EE2119 CRC64;
MNNKFIIFSL LLALVASQTY SLTSCTCAQL LSEGDCIKNV SLGCSWDTTK KTCGVSTTPV
TPTVTYAAYC DTFAETDCPK AKPCTDCGNY AACAWVESKC TFFTGCTPFA KTLDSECQAI
SNRCITDGTH CVEVDACSTY KKQLPCVKNA AGSLCYWDTT NNTCDKLPAT FATDKDCRDV
ISTCTTKTGG GCVDSGNNCS DQTLEIQCVW NKLKTTSCYW DGAACKDRIC DNAPTSLTTD
DACKTFRTDG TCTTKANGGC VTRTTCAAAT IQASCIKNSS GGDCYWTGTA CVDKTCANAP
TTMTTNSACA GFVTGCITKS GGGCVANGAC SVANVQAACV KNSSNFDCIW DTTCKEKTCA
NAPTTNNTHD LCTSYLSTCT VKSGGGCQNR SCANAPTTMT TNDACEAYLT GNNCITKSGG
GCVTNTTCAA ITLEAACVKN SSGSTCFWDT ASSSCKDKTC VNAPATNTTH DLCQAFLNTC
TVNSTSAGCV EKTCENSLVL AICDKDTSSR ACIWKGKCYK KQCVLASSAT TTHADCQTYH
STCTLSNSGT GCVPLPLKCE AITIEAACNL KANGQPCGWN GSQCIDKACS TASKTFTTTS
QCTGHISTCV ANNPVTVNGS LTIQGCQDLP TSCAARKSSE NCEIARVGFP TCLWVSSSTS
CVEKSCATAS TVGTTGALSA GGFTFSGCQT YLNTCISNNT ADGCIAKPSS CSSLVSSNCR
DGSKASGDCY WNGSSCVDKT CANITLTSHA SCYSIFNQCT VNNGGTACQT LATACTSYST
QENCKFTSTN KNCVWTGLAC RNATCADAPD TTAYDSDTEC LAYPTPSETC TVVYKVGAQG
CVSKSANCSD YMTSAQCHKT LTNLTANDDC KWIVDRCYAL SSFATGACTT FKGNKTMCEG
YRAGCTNTVG AASSASCTLD CTLKTGSGLT FADCQALDST CSVKKDGTGC IVIQSTCAGY
GSTATNCFRS SASGTAGYCA MNTNCQSVTS AAECAFVTGL TGLDHSKCQL YHSSCTSLKD
GTGCQEYKTA CSSYATGNTC ANSVQGKCFD DATDCLRFAN CASITGTGLT NTICVTYDPG
CVANVNGTAC QEKLATCAAY LTQNSCSTST AGTCAWSGSA CLTVVDANVA TECAYITGTG
LTNAICAGYN AKCTVNRAGT ACQKKEALCA TYAAVQATCS QSDAGLCAWS GSACLTVVDA
NVATECPYIT GTGLTNAICA GYNAKCTVNR AGTACQKKEA LCATYAAVQA TCSQSDAGLC
AWSGSACLTV VDANVATECP YITGTGLTDA ICAGYNAKCT VNRAGTACQK KEALCATYAA
VQATCSQSDA GLCAWSGSAC LTVVDANVAT ECPYITGTGL TNAICAGYNA KCTVNRAGTA
CQKKEALCAT YAAVQATCSQ SDAGLCAWSG SACLTVVDAN VATECAYITG TGLTDAICAG
YNAKCTNLKD GTGCQDEKAT CKLYTTQNKC TSQTTGPLSC LWFDNSCSPI TDVTCSAIVQ
SGLDHAQCQA YSTGCTSVSD GSKCQDFKTT CEQYAGTALS CTKTATSKCY LQGSNCITIS
NVATDCAKIT GSAGTITYEI CQSYNTGCSV NRARSACVQQ QAQCSGYTSA MTSCYKSGAG
LCIASTNTDT ACVAATAATT CDAVYLGTGN YSSANCNEMK AGCTNNGATA CVAKTCANAV
VIFNHTNCNG YLNTCTVNSG NSACQTMASK CADQTQASCL YSVEGECVVV GTSCVRKTCD
TAATDATRDD DTECSAYQQS CTVARLGACQ ARAACASYKS SLQCKFNTSG GRCFWNPTNK
TCVDLNCGNI EASTLYDTHN ECVVVDATLA CTVRATNGAA VQGCMARGAC SSYTIEEQCK
TNASNGVCVW NTNANLPAPA CQDKSCTSAP TSTTTHNDCY AYYNTATVKC TVVATPSNSG
GNPTLGGCQQ TAACSSYIDK EQCQINANGE PCGWNGTQCA DKSCATAPAT ADYDDDTKCR
AYITNKCTVS DSGQGCVEIP ATCETMTQKQ CYYNKAGDPC YWTGTACITK SCDNAPDATA
TADECNTYLA GCTLDNVKCK TKVCEDFAFA TDALCKQAIS TCTTNGTNCV TRGTCFQALS
QAGCVTSSTN QQCEWIPAVL NASNVITSPA YCTIKNCSTA PITLTSEAAC AGYFTNCTTK
NGGGCVTKST CSAVTIDVAC TTALNGTVCA WDSAQNKCRD KDCQDFSGTT HAACQAQRAG
CTAGASGKCA RVQNCEQTSV RAACIEGTNG PCLWIDKYQN TDGTKGACFR YTSCKSLNWN
NDSSCKWISN KCTTNGSNCV GITLCSETNT DGGCVTGYDG ACIQSVPALN SSDPKVCKPY
TSCADAFYTT HSDCQIASSK CTTNGTTGCI ALGSCSSYTA QAGCYFNDKG TLYTSGVITS
TGICTWDTTS SSCRDQSCAD LTGTTHATCS SQLSTCTSDG TTCLLKGACT SYTTQTACTT
AVGSDGACYW ELASATNNNT AKCRLLTCAD IQNGTATNVC SVALSTCVSN GTACIPKANC
STYTSKIACN SGGLDGICVF TQSTATGAAA GTGTCALMTA CTVANNDQTA CQAARDRCSW
TAASGTGATA VASKCATHTC ATNQATNGAC TRFLNWDKKT QQVCTLVSGA CTATDPSTLS
SNDCFLVSGY TYTWNASTSK CGVCTAVVVQ PNTTDNNTNT TDNNTTTDSG YILGLSIVLG
YLMF