F37C4_CAEEL
ID F37C4_CAEEL Reviewed; 556 AA.
AC O44400; Q86MF4;
DT 13-SEP-2005, integrated into UniProtKB/Swiss-Prot.
DT 23-JAN-2007, sequence version 3.
DT 03-AUG-2022, entry version 122.
DE RecName: Full=Protein F37C4.5;
GN ORFNames=F37C4.5;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2] {ECO:0000305}
RP PROTEIN SEQUENCE OF 2-9; 35-89; 96-100; 118-130; 259-268 AND 456-463
RP (ISOFORM A), IDENTIFICATION BY MASS SPECTROMETRY, AND ACETYLATION AT ALA-2.
RA Bienvenut W.V.;
RL Submitted (SEP-2005) to UniProtKB.
CC -!- INTERACTION:
CC O44400; Q21319: scrm-8; NbExp=2; IntAct=EBI-312011, EBI-312006;
CC O44400; Q9NAD6: sta-1; NbExp=2; IntAct=EBI-312011, EBI-312137;
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=a {ECO:0000303|PubMed:9851916};
CC IsoId=O44400-1; Sequence=Displayed;
CC Name=b {ECO:0000303|PubMed:9851916};
CC IsoId=O44400-2; Sequence=VSP_051824;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO081250; CCD70196.1; -; Genomic_DNA.
DR EMBL; FO081250; CCD70197.1; -; Genomic_DNA.
DR PIR; T32567; T32567.
DR RefSeq; NP_001023185.1; NM_001028014.4. [O44400-1]
DR RefSeq; NP_001023186.1; NM_001028015.3.
DR AlphaFoldDB; O44400; -.
DR SMR; O44400; -.
DR BioGRID; 42285; 38.
DR DIP; DIP-26757N; -.
DR IntAct; O44400; 11.
DR STRING; 6239.F37C4.5a.1; -.
DR World-2DPAGE; 0020:O44400; -.
DR EPD; O44400; -.
DR PaxDb; O44400; -.
DR PeptideAtlas; O44400; -.
DR EnsemblMetazoa; F37C4.5a.1; F37C4.5a.1; WBGene00018145. [O44400-1]
DR EnsemblMetazoa; F37C4.5b.1; F37C4.5b.1; WBGene00018145. [O44400-2]
DR GeneID; 177146; -.
DR KEGG; cel:CELE_F37C4.5; -.
DR UCSC; F37C4.5a.1; c. elegans. [O44400-1]
DR CTD; 177146; -.
DR WormBase; F37C4.5a; CE17048; WBGene00018145; -. [O44400-1]
DR WormBase; F37C4.5b; CE33640; WBGene00018145; -. [O44400-2]
DR eggNOG; ENOG502QWQ0; Eukaryota.
DR GeneTree; ENSGT00970000195975; -.
DR HOGENOM; CLU_010457_2_0_1; -.
DR InParanoid; O44400; -.
DR OMA; IANQHTH; -.
DR OrthoDB; 426277at2759; -.
DR PhylomeDB; O44400; -.
DR PRO; PR:O44400; -.
DR Proteomes; UP000001940; Chromosome IV.
DR Bgee; WBGene00018145; Expressed in adult organism and 4 other tissues.
DR InterPro; IPR011935; CHP02231.
DR InterPro; IPR037291; DUF4139.
DR InterPro; IPR025554; DUF4140.
DR PANTHER; PTHR31005; PTHR31005; 1.
DR Pfam; PF13598; DUF4139; 1.
DR Pfam; PF13600; DUF4140; 1.
DR TIGRFAMs; TIGR02231; TIGR02231; 1.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Direct protein sequencing;
KW Reference proteome.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0000269|Ref.2"
FT CHAIN 2..556
FT /note="Protein F37C4.5"
FT /id="PRO_0000087151"
FT MOD_RES 2
FT /note="N-acetylalanine"
FT /evidence="ECO:0000269|Ref.2"
FT VAR_SEQ 1..240
FT /note="Missing (in isoform b)"
FT /evidence="ECO:0000303|PubMed:9851916"
FT /id="VSP_051824"
SQ SEQUENCE 556 AA; 61444 MW; 3E06216A9741A0BB CRC64;
MAHIVQTHKL TLHDVAIDQA LVYSSDSNCA ELKRTFQVEL AHGYNEVKVQ NLPFDLVQDS
IRVSGAGEAV IHDVSVKNQE GAEFVIPERV LAIKAIFEEK ERAKDKVADS RVAVQKRIEG
LDNLITEVAK HGKDGAFHFD GRTIESLNAL HGFHQDTTVD LRAQIRTLDQ DLRKAEEEYA
RASQDYDNTG YRWRNSAQYA SIIVESEAGG AAQLTITYQV NNVSWTPFYD IRVTAGVEAE
MHVTYFGKVR QYSGEDWKTV PLVLSTARPA HGVKQLPKLG ALEASIVVPE PECNRGGRGG
YGGGYAQDSV VMACAAPMME MGRSRKSMKM SYAAVKSSNI ASEFSIGRPA TIDDRTDEYK
VNIGQFTLDT KLSNVTVPSR NATAFLVANS VNTSDYPLVA GQASIFLDGA FVNKTEFEDA
VVSQKFEVSL GVDPNIRIEY KPVRNYQEQS GTVEKINSQV TEKTTAVTNL RPNSVLLTIR
EQLPRSTDSR IKVHLNTPEA VEVDEASVEP TVGAAITPEK ILDYTVQLAP GQSSTFVVKY
TTEHPQAEQI RYEEKF