YL57_CAEEL
ID YL57_CAEEL Reviewed; 471 AA.
AC P34437; Q629J7; Q8MQ44; Q8MQ45;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 06-JUN-2002, sequence version 2.
DT 03-AUG-2022, entry version 138.
DE RecName: Full=Putative zinc finger protein F44E2.7;
GN ORFNames=F44E2.7;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=7906398; DOI=10.1038/368032a0;
RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M., Bonfield J.,
RA Burton J., Connell M., Copsey T., Cooper J., Coulson A., Craxton M.,
RA Dear S., Du Z., Durbin R., Favello A., Fraser A., Fulton L., Gardner A.,
RA Green P., Hawkins T., Hillier L., Jier M., Johnston L., Jones M.,
RA Kershaw J., Kirsten J., Laisster N., Latreille P., Lightning J., Lloyd C.,
RA Mortimore B., O'Callaghan M., Parsons J., Percy C., Rifken L., Roopra A.,
RA Saunders D., Shownkeen R., Sims M., Smaldon N., Smith A., Smith M.,
RA Sonnhammer E., Staden R., Sulston J., Thierry-Mieg J., Thomas K.,
RA Vaudin M., Vaughan K., Waterston R., Watson A., Weinstock L.,
RA Wilkinson-Sproat J., Wohldman P.;
RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
RT elegans.";
RL Nature 368:32-38(1994).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- INTERACTION:
CC P34437; O16893: CELE_F13A2.4; NbExp=4; IntAct=EBI-2416085, EBI-2416091;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=a;
CC IsoId=P34437-1; Sequence=Displayed;
CC Name=b;
CC IsoId=P34437-2; Sequence=VSP_015936;
CC Name=c;
CC IsoId=P34437-3; Sequence=VSP_015937, VSP_015938;
CC Name=d;
CC IsoId=P34437-4; Sequence=VSP_015939;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO081383; CCD71225.1; -; Genomic_DNA.
DR EMBL; FO081383; CCD71226.1; -; Genomic_DNA.
DR EMBL; FO081383; CCD71227.1; -; Genomic_DNA.
DR EMBL; FO081383; CCD71228.1; -; Genomic_DNA.
DR PIR; S44819; S44819.
DR RefSeq; NP_001022588.1; NM_001027417.3. [P34437-4]
DR RefSeq; NP_741256.1; NM_171218.4. [P34437-1]
DR RefSeq; NP_741257.1; NM_171219.1. [P34437-3]
DR RefSeq; NP_741258.1; NM_171220.4.
DR AlphaFoldDB; P34437; -.
DR BioGRID; 41447; 3.
DR IntAct; P34437; 2.
DR STRING; 6239.F44E2.7a; -.
DR iPTMnet; P34437; -.
DR EPD; P34437; -.
DR PaxDb; P34437; -.
DR PeptideAtlas; P34437; -.
DR PRIDE; P34437; -.
DR EnsemblMetazoa; F44E2.7a.1; F44E2.7a.1; WBGene00018420. [P34437-1]
DR EnsemblMetazoa; F44E2.7b.1; F44E2.7b.1; WBGene00018420. [P34437-2]
DR EnsemblMetazoa; F44E2.7c.1; F44E2.7c.1; WBGene00018420. [P34437-3]
DR EnsemblMetazoa; F44E2.7d.1; F44E2.7d.1; WBGene00018420. [P34437-4]
DR GeneID; 176245; -.
DR KEGG; cel:CELE_F44E2.7; -.
DR UCSC; F44E2.7a; c. elegans. [P34437-1]
DR CTD; 176245; -.
DR WormBase; F44E2.7a; CE29802; WBGene00018420; -. [P34437-1]
DR WormBase; F44E2.7b; CE30994; WBGene00018420; -. [P34437-2]
DR WormBase; F44E2.7c; CE30995; WBGene00018420; -. [P34437-3]
DR WormBase; F44E2.7d; CE24975; WBGene00018420; -. [P34437-4]
DR eggNOG; ENOG502TH0V; Eukaryota.
DR HOGENOM; CLU_046468_0_0_1; -.
DR InParanoid; P34437; -.
DR OMA; LWSDHTM; -.
DR OrthoDB; 1402258at2759; -.
DR PRO; PR:P34437; -.
DR Proteomes; UP000001940; Chromosome III.
DR Bgee; WBGene00018420; Expressed in embryo and 4 other tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR InterPro; IPR013087; Znf_C2H2_type.
DR SMART; SM00355; ZnF_C2H2; 3.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 2.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; DNA-binding; Metal-binding; Nucleus;
KW Reference proteome; Repeat; Zinc; Zinc-finger.
FT CHAIN 1..471
FT /note="Putative zinc finger protein F44E2.7"
FT /id="PRO_0000046903"
FT ZN_FING 26..54
FT /note="C2H2-type 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 293..317
FT /note="C2H2-type 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT ZN_FING 344..367
FT /note="C2H2-type 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042"
FT REGION 80..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 132..176
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 210..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 418..471
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 81..96
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 219..244
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..433
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 435..464
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..92
FT /note="Missing (in isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_015936"
FT VAR_SEQ 1..60
FT /note="Missing (in isoform c)"
FT /evidence="ECO:0000305"
FT /id="VSP_015937"
FT VAR_SEQ 61..85
FT /note="DHTMDTLGLIWSESKRSYDRDTSCS -> MWKKTEQNRQMIQGTAHRPDEGS
FT EI (in isoform c)"
FT /evidence="ECO:0000305"
FT /id="VSP_015938"
FT VAR_SEQ 87..88
FT /note="Missing (in isoform d)"
FT /evidence="ECO:0000305"
FT /id="VSP_015939"
SQ SEQUENCE 471 AA; 55140 MW; BA069A4C233E7CB7 CRC64;
MVKPTSSLLQ RYKNNVRVDP STLNHYLCYY CGKTLSDRLE YQQHMLKVHE VMSQSYQLWS
DHTMDTLGLI WSESKRSYDR DTSCSLSVPA SPMSRESRNR NSDDDYYDYV YEDEKPKRRR
IDVEQSRRSV AAAAEKHRKL LEEQRRKAEA EYERKRKADE KKKEKAIKDE EKRKAEDLRK
CLQLQQQKTA AAAQIRETAE KAAVARRMST FEVGESSEQL AKPEGEKRKR RRTESRWREI
ESDSDKPEVT ALVKKILEES KKKEASSQEV EDADLVPELS TRKPYDNTEH VSKLCKVCKK
GPHYTFANLF EHYQDLHNAT IKSLHYYGFN GNKLIGKKNL IERDHCQRCV IKFPRARDYF
AHMIKHHVYE SVRCQLDFEN ATNADVEARM MFRDRILTLG YNFKFEQVAD PNLVSDVLEP
GQEPSTSAEQ EDPSSLKIVK LEEPELEEQK ALKPESEAME QQEDVHSLNP E
//