YMV2_CAEEL
ID YMV2_CAEEL Reviewed; 1463 AA.
AC P34504; P34505; P34506; P90907; Q6BEV9;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 18-SEP-2013, sequence version 8.
DT 03-AUG-2022, entry version 159.
DE RecName: Full=Uncharacterized protein K04H4.2;
DE Flags: Precursor;
GN ORFNames=K04H4.2;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=7906398; DOI=10.1038/368032a0;
RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M., Bonfield J.,
RA Burton J., Connell M., Copsey T., Cooper J., Coulson A., Craxton M.,
RA Dear S., Du Z., Durbin R., Favello A., Fraser A., Fulton L., Gardner A.,
RA Green P., Hawkins T., Hillier L., Jier M., Johnston L., Jones M.,
RA Kershaw J., Kirsten J., Laisster N., Latreille P., Lightning J., Lloyd C.,
RA Mortimore B., O'Callaghan M., Parsons J., Percy C., Rifken L., Roopra A.,
RA Saunders D., Shownkeen R., Sims M., Smaldon N., Smith A., Smith M.,
RA Sonnhammer E., Staden R., Sulston J., Thierry-Mieg J., Thomas K.,
RA Vaudin M., Vaughan K., Waterston R., Watson A., Weinstock L.,
RA Wilkinson-Sproat J., Wohldman P.;
RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
RT elegans.";
RL Nature 368:32-38(1994).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=c;
CC IsoId=P34504-3; Sequence=Displayed;
CC Name=b;
CC IsoId=P34504-1; Sequence=VSP_012169, VSP_012168, VSP_026510,
CC VSP_026511;
CC Name=a;
CC IsoId=P34504-2; Sequence=VSP_012169, VSP_012168, VSP_026510,
CC VSP_026511, VSP_002449;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z27078; CAA81587.2; -; Genomic_DNA.
DR EMBL; Z27078; CAA81588.3; -; Genomic_DNA.
DR EMBL; Z27078; CAH04706.4; -; Genomic_DNA.
DR PIR; B88553; B88553.
DR PIR; S40992; S40992.
DR PIR; S40994; S40994.
DR RefSeq; NP_001022664.4; NM_001027493.5.
DR RefSeq; NP_499058.3; NM_066657.5.
DR RefSeq; NP_499059.2; NM_066658.4.
DR AlphaFoldDB; P34504; -.
DR BioGRID; 41513; 12.
DR IntAct; P34504; 11.
DR MINT; P34504; -.
DR STRING; 6239.K04H4.2c.1; -.
DR EPD; P34504; -.
DR PaxDb; P34504; -.
DR PeptideAtlas; P34504; -.
DR PRIDE; P34504; -.
DR EnsemblMetazoa; K04H4.2a.1; K04H4.2a.1; WBGene00010573. [P34504-2]
DR EnsemblMetazoa; K04H4.2b.1; K04H4.2b.1; WBGene00010573. [P34504-1]
DR EnsemblMetazoa; K04H4.2c.1; K04H4.2c.1; WBGene00010573. [P34504-3]
DR GeneID; 176315; -.
DR UCSC; K04H4.2a.2; c. elegans. [P34504-1]
DR CTD; 176315; -.
DR WormBase; K04H4.2a; CE32463; WBGene00010573; -. [P34504-2]
DR WormBase; K04H4.2b; CE36653; WBGene00010573; -. [P34504-1]
DR WormBase; K04H4.2c; CE47897; WBGene00010573; -. [P34504-3]
DR eggNOG; KOG1217; Eukaryota.
DR HOGENOM; CLU_255206_0_0_1; -.
DR InParanoid; P34504; -.
DR OMA; NICCPIN; -.
DR OrthoDB; 123591at2759; -.
DR PRO; PR:P34504; -.
DR Proteomes; UP000001940; Chromosome III.
DR Bgee; WBGene00010573; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:UniProtKB-KW.
DR InterPro; IPR007026; CC_domain.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR InterPro; IPR006150; Cys_repeat_1.
DR Pfam; PF04942; CC; 2.
DR SMART; SM00494; ChtBD2; 1.
DR SMART; SM00289; WR1; 19.
DR SUPFAM; SSF57625; SSF57625; 1.
PE 3: Inferred from homology;
KW Alternative splicing; Chitin-binding; Disulfide bond; Reference proteome;
KW Signal.
FT SIGNAL 1..17
FT /evidence="ECO:0000255"
FT CHAIN 18..1463
FT /note="Uncharacterized protein K04H4.2"
FT /id="PRO_0000014291"
FT DOMAIN 94..164
FT /note="Chitin-binding type-2"
FT REGION 70..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 720..773
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 841..869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..88
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 720..742
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 141..154
FT /evidence="ECO:0000250"
FT VAR_SEQ 165
FT /note="P -> R (in isoform a and isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_012169"
FT VAR_SEQ 166..679
FT /note="Missing (in isoform a and isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_012168"
FT VAR_SEQ 721
FT /note="D -> V (in isoform a and isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_026510"
FT VAR_SEQ 722..882
FT /note="Missing (in isoform a and isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_026511"
FT VAR_SEQ 1281..1440
FT /note="Missing (in isoform a)"
FT /evidence="ECO:0000305"
FT /id="VSP_002449"
SQ SEQUENCE 1463 AA; 153114 MW; 00A97AEFD99B3A7B CRC64;
MLRNLILITL LVASGHGQTP VIGGTCKLGT ADVQIGGKQT QFFLKCETTA DSAEGEGVWV
VKSRAAAAPS SVPSVPAENT QPQQHPKARK PASPNICEQD NGARESEVCA VSATCLQAHN
DFPSSYLQCD QTTLRWVRKS CQENFLFNFE QQTCIVPKRM SSLSPSTSSP SNTENPCSKC
PLGSACRNGN CIPLTTSNLC SDGSPPNNTC TRDPYSCPKG HFCTAQKVCC PSTALQSSIG
CSTVCTIDES CPKGMTCQNN CCEERKLLRH PKVYRYATVE ATNTIFEVDN DIFDSAAIES
LPTQKPQRLD EIMAPGITPT PTRTTEPPKL RCLSSNTDEV NSLGGASSSS ATCGGTNANC
TSDEDCPTTF KCYQGCCKLA VCPRSLTAVK FTCKTQYHCR ANEHCFFGGC CPKTIELAVI
KSQVLTMSKD NEHTKETEKL IIGDCEVDTR VKKCDIDIIC PEMSECVDGI CCKQPPKARC
GNGLMALSIP VHCSLSDDCP IASRCEYGKC CPFLSESADS TSDSVGETTP VIIKEEIIST
ATKVWKKVDK TSGVSINKNK CLSTQRCDLH TLCPPDFTCS LSGKCCKLNI HCPDGTVPET
SCQSASNHDH CPSSSHKCTL LNKEHFACCY SPGLVVEGSV TAEVSSECPI GSVEVDPRFG
TSCRYSLQCP SPYFCNQRGQ QASGLVCTFS SCSNSNPCSV GTCNNGYCCS SGSNSGSAII
DSDTNSTTNP SQPETTKTKN NTKKSNSSKK HRKPKKKDVD PLSDPLLQND FPIGPPGYGF
PEHLSNLDEV LIRAQGDGVS CAGGFQSSLI CSVGSECPAG LHCDTAINLC CPLLLPLTDP
KNPKKRKTKR RKQKQDENEM EASANFPDSD PARFSSYSCG CMGGGSSNCV GCQNAPQIIT
IPQNSCPGGG YSVGGCSSGY CATGYSCIQN QCCPSYNSAP RISVYTCPSG GNAVGACMSG
RCASGYTCSN NVCCPQTTTT NPFVCPDGTQ AAGGCVNGQC GTGYTCSNGL CCAGTSTTVK
CLDGSDAVGA CIPSCTGDGC GGVQVSYYCG SGYTCTTGNI CCPINSCPNG GEVLGPTING
LCPTGYTVQG NLCCSATCTD GSTGLPSVNG VCIDGYSLTN GVCCPASVTC TDEISIGPCT
GTGFNGGCPA GYACDSNQVN CCPVVRYTDE SCQVGPAIDG LCPPGYVVVY IPNSPLITNG
VNPGTCIDLQ CTTGLCLTAN QIGDCDTATD AGTCPTGYTC FTNAGICCST TTFSRLRIGN
SRQMAQKPNY GRPLHSYMPP RFGGPSSSCS DGSLSSGPCM NGLCGIGLEC QNGKCCSPSS
NKPAGLLQSK CPSGDTAVSG CFPNGSCGTG YECVSSLNLC CPPGQPQTFP SFPGNNNGFN
INNNNRFGSL SMSPRPIGAR CQLDGECVGQ AEGLSMCHAG VCQCSPIAYT QGIACVRRKS
FQMNDDPVID AANDDKSSSS VSV