ID SOR1_CAEEL Reviewed; 1000 AA.
AC P34619; Q1NZ36;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 01-FEB-1994, sequence version 1.
DT 03-AUG-2022, entry version 119.
DE RecName: Full=Sop-2-related protein 1;
GN Name=sor-1; ORFNames=ZK1236.3;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=7906398; DOI=10.1038/368032a0;
RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M., Bonfield J.,
RA Burton J., Connell M., Copsey T., Cooper J., Coulson A., Craxton M.,
RA Dear S., Du Z., Durbin R., Favello A., Fraser A., Fulton L., Gardner A.,
RA Green P., Hawkins T., Hillier L., Jier M., Johnston L., Jones M.,
RA Kershaw J., Kirsten J., Laisster N., Latreille P., Lightning J., Lloyd C.,
RA Mortimore B., O'Callaghan M., Parsons J., Percy C., Rifken L., Roopra A.,
RA Saunders D., Shownkeen R., Sims M., Smaldon N., Smith A., Smith M.,
RA Sonnhammer E., Staden R., Sulston J., Thierry-Mieg J., Thomas K.,
RA Vaudin M., Vaughan K., Waterston R., Watson A., Weinstock L.,
RA Wilkinson-Sproat J., Wohldman P.;
RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
RT elegans.";
RL Nature 368:32-38(1994).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [3]
RP FUNCTION, INTERACTION WITH SOP-2, SUBCELLULAR LOCATION, DEVELOPMENTAL
RP STAGE, AND DISRUPTION PHENOTYPE.
RX PubMed=16501168; DOI=10.1242/dev.02275;
RA Zhang T., Sun Y., Tian E., Deng H., Zhang Y., Luo X., Cai Q., Wang H.,
RA Chai J., Zhang H.;
RT "RNA-binding proteins SOP-2 and SOR-1 form a novel PcG-like complex in C.
RT elegans.";
RL Development 133:1023-1033(2006).
CC -!- FUNCTION: Acts synergistically with sop-2 to maintain the
CC transcriptionally repressive state of homeotic genes throughout
CC development. Not required to initiate repression, but required to
CC maintain it during later stages of development. Also required to repress
CC expression of other genes. Binds RNA in a sequence-independent manner.
CC {ECO:0000269|PubMed:16501168}.
CC -!- SUBUNIT: Binds through its N-terminal region to the N-terminal region
CC of sop-2.
CC -!- INTERACTION:
CC P34619; Q965H3: sop-2; NbExp=4; IntAct=EBI-326963, EBI-331594;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:16501168}. Note=Forms
CC subnuclear bodies. Sop-2 is required for nuclear localization.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=a;
CC IsoId=P34619-1; Sequence=Displayed;
CC Name=b;
CC IsoId=P34619-2; Sequence=VSP_020783;
CC -!- DEVELOPMENTAL STAGE: Expressed at all developmental stages with levels
CC declining as development proceeds. {ECO:0000269|PubMed:16501168}.
CC -!- DISRUPTION PHENOTYPE: Worms exhibit early larval lethality and
CC anterior-to-posterior cell fate transformation in a Hox gene-dependent
CC manner. Hermaphrodites are sterile and show vulval defects and partial
CC hermaphrodite-to-male sexual transformation.
CC {ECO:0000269|PubMed:16501168}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO080723; CCD66163.1; -; Genomic_DNA.
DR EMBL; FO080723; CCD66164.1; -; Genomic_DNA.
DR PIR; S44898; S44898.
DR RefSeq; NP_001040893.1; NM_001047428.3.
DR RefSeq; NP_001040894.1; NM_001047429.1. [P34619-2]
DR AlphaFoldDB; P34619; -.
DR BioGRID; 41393; 5.
DR DIP; DIP-27140N; -.
DR IntAct; P34619; 1.
DR STRING; 6239.ZK1236.3a; -.
DR EPD; P34619; -.
DR PaxDb; P34619; -.
DR PeptideAtlas; P34619; -.
DR EnsemblMetazoa; ZK1236.3a.1; ZK1236.3a.1; WBGene00023405.
DR EnsemblMetazoa; ZK1236.3b.1; ZK1236.3b.1; WBGene00023405. [P34619-2]
DR GeneID; 176189; -.
DR UCSC; ZK1236.3a; c. elegans. [P34619-1]
DR CTD; 176189; -.
DR WormBase; ZK1236.3a; CE00532; WBGene00023405; sor-1. [P34619-1]
DR WormBase; ZK1236.3b; CE40223; WBGene00023405; sor-1. [P34619-2]
DR eggNOG; ENOG502TITC; Eukaryota.
DR HOGENOM; CLU_348247_0_0_1; -.
DR InParanoid; P34619; -.
DR PRO; PR:P34619; -.
DR Proteomes; UP000001940; Chromosome III.
DR Bgee; WBGene00023405; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR GO; GO:0016604; C:nuclear body; IDA:UniProtKB.
DR GO; GO:0016607; C:nuclear speck; IDA:WormBase.
DR GO; GO:0005654; C:nucleoplasm; IDA:WormBase.
DR GO; GO:0003723; F:RNA binding; IDA:UniProtKB.
DR GO; GO:0040029; P:regulation of gene expression, epigenetic; IMP:UniProtKB.
PE 1: Evidence at protein level;
KW Alternative splicing; Nucleus; Reference proteome; Repressor; RNA-binding;
KW Transcription; Transcription regulation.
FT CHAIN 1..1000
FT /note="Sop-2-related protein 1"
FT /id="PRO_0000065562"
FT REGION 355..374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 379..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 466..509
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 633..720
FT /note="RNA-binding"
FT REGION 948..1000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..372
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..407
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 474..509
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 954..1000
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..190
FT /note="Missing (in isoform b)"
FT /evidence="ECO:0000305"
FT /id="VSP_020783"
SQ SEQUENCE 1000 AA; 113420 MW; D230C9A6F84928E2 CRC64;
MTINIKYSSK FSSSKTSSSE ELKPKTYIPA YYQPPVSMPK YYVNWLRIKL SLNKLKNIRA
IYLFDQCQNF NQYQESSRKF SAGQVSSLTP WYSNFSNYST VILRMKDTSL PPLENPSNGG
TYLFNLITVF PSIALSFNYS WRGDTGPSIS IFSFSLSVLF FSLFPQHKNI CARAWCRPFR
RSLSFSLRFI MSSEPASSST EKVPEEPHPH SIKHKFQGPQ FVIPRALSDY VLNVNNQTPE
SYEKALNAKY GRDDYITLCL HVTSLCTEYG PLDDGPEYVL LCDSRARVDK LIEDLVKLLE
IDTDYVQLEL HGGKRLHLQK PDAVLRDIAY KQSNEGSDKF FLEMKLVPSE SMKAKIMKQE
EEEEKARKHG QYQQYQEYHQ QHQAMNDGQS SSSVPSTSSP SCSSEANRKE METVREPAGP
SELMRAINAP VAPAPVVIKI ETPVALPEED ETLMDDDEMP SLTVEAPSEE ASFEAEQPSP
QVPQASIEGP SQQQQIPGTS QQKRQVARGS RTNMISYHDL PPGTGNAPPM ACPQVTLKLE
KNVPFEAKIR AVAGYTRKPI SEVQKMRPSD LESIFHSICI ASVQRIKRRN ELVQQLQEIN
AQSCKSPTMT MNKKFTLAKA YQRVQNEIEK IDREQILPQQ YMNMPPMPPQ GQQRLPPPAY
PPGILPPQQN RQQGVPPQFQ RSPQFMIGPD GQRYAHPYMQ LPNSNQRARI LNTSSVQPSE
EVRNRLVKIE AMAMNMAQLN PPRPPPPQPP HRALQGELQF LRPGAPDPCN FRPDSKQTYN
NTYVTVASPA TLTNSIIPWH FPPYEKSGRL NVSNTIKAIN EYRLLCNSRQ ADPASFLEFY
FLGDPMPHFN KILSIADYNM YLSRRRCDEA DVKIHRMSHS DQLQLYLLEL QSDESNVEKW
KTFYRIMQWD LPLNNEFPRI LLPSSLDIGR PVVDRKKKSI DQVMNHIHRM HSQRPPSMGN
SSTSSEASST SPTNAATATS SPASNRPTTS TAQPPTLNPT