YSM5_CAEEL
ID YSM5_CAEEL Reviewed; 645 AA.
AC Q10125;
DT 01-FEB-1996, integrated into UniProtKB/Swiss-Prot.
DT 02-DEC-2020, sequence version 4.
DT 03-AUG-2022, entry version 113.
DE RecName: Full=Uncharacterized protein F52C9.5;
DE Flags: Precursor;
GN ORFNames=F52C9.5 {ECO:0000312|WormBase:F52C9.5};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284603; CCD71608.2; -; Genomic_DNA.
DR PIR; T16417; T16417.
DR RefSeq; NP_498133.3; NM_065732.4.
DR AlphaFoldDB; Q10125; -.
DR PaxDb; Q10125; -.
DR EnsemblMetazoa; F52C9.5.1; F52C9.5.1; WBGene00018675.
DR UCSC; F52C9.5; c. elegans.
DR WormBase; F52C9.5; CE54167; WBGene00018675; -.
DR eggNOG; ENOG502S0RB; Eukaryota.
DR GeneTree; ENSGT00970000196729; -.
DR HOGENOM; CLU_464011_0_0_1; -.
DR InParanoid; Q10125; -.
DR OrthoDB; 1531014at2759; -.
DR PRO; PR:Q10125; -.
DR Proteomes; UP000001940; Chromosome III.
DR Bgee; WBGene00018675; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR InterPro; IPR003609; Pan_app.
DR Pfam; PF00024; PAN_1; 3.
DR SMART; SM00473; PAN_AP; 3.
DR PROSITE; PS50948; PAN; 3.
PE 3: Inferred from homology;
KW Disulfide bond; Glycoprotein; Reference proteome; Repeat; Signal.
FT SIGNAL 1..23
FT /evidence="ECO:0000255"
FT CHAIN 24..645
FT /note="Uncharacterized protein F52C9.5"
FT /id="PRO_0000014284"
FT DOMAIN 135..214
FT /note="PAN 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DOMAIN 281..369
FT /note="PAN 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DOMAIN 378..465
FT /note="PAN 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT REGION 30..58
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 92..129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 225..247
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 556..582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 92..106
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 107
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
FT CARBOHYD 421
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
FT CARBOHYD 570
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
FT DISULFID 161..187
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 165..175
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 281..369
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 313..341
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 317..329
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 378..465
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 407..436
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
FT DISULFID 411..422
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00315"
SQ SEQUENCE 645 AA; 72817 MW; 86F6BCC924CFAF22 CRC64;
MPSSHRLSAT ILIFLSLTYI SSSSNLVEDI TDKQDSSEDD DHLQTFPTPP PIGTTTASAD
SLFNRMNKIL RKEEKQKSQN FQIFNEKDLV TNSNANPYFS TTRKPKNRSD SSQKARDPDP
QNQVIAGIPD LSDPCFRRYE NSIIVNAQPY ERRSSTGLIH CKSHCLNSQI GVYSCRSFVY
DNVNRVCDLF AHVGDQAPAR LLKFQTRDYF EPTDIVHCLS MINGESSSSA PSSEDEDSPP
SPPPSAPIVA LATNTDKRDE HEEMTETIED ITVASPSSDS CPRGKQSTFL RTEGFELFSH
DDQELVVGDV AECAKACIEN KINGVALKCK SFDFLSSTST CAFTSEAAVP VGNGQLKQRE
DASYHEKICV SKSFVESCPS TFFSRHPQMI LVGFAESVSD SPSFEHCFDT CLNSYQLFGF
NCTSGMYYFE ENQLNCILNS ENRNTQRELF TEENTDIVDY FEVECTTPRS KQSKRKMAGV
RNFETDAIGA DKMVTDHEDV EEDGSKWESW SECQDGKQTR RKICANFNQI EDCAEEVRDC
VDEIDSTDMR MSIKRAGELE NNDHEQIEDN NTDASEDPVP TKEEIAEVKQ KIRRTGFKCP
LNECCRVFLS CSYGLRHNSH TKQLEWCRRP CDPSLSSFKR SRLLR