WD82A_XENLA
ID WD82A_XENLA Reviewed; 313 AA.
AC Q640J6;
DT 06-MAR-2007, integrated into UniProtKB/Swiss-Prot.
DT 25-OCT-2004, sequence version 1.
DT 03-AUG-2022, entry version 82.
DE RecName: Full=WD repeat-containing protein 82-A {ECO:0000305};
GN Name=wdr82-a;
OS Xenopus laevis (African clawed frog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Xenopus.
OX NCBI_TaxID=8355;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Embryo;
RG NIH - Xenopus Gene Collection (XGC) project;
RL Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Regulatory component of the SET1 complex implicated in the
CC tethering of this complex to transcriptional start sites of active
CC genes. Facilitates histone H3 'Lys-4' methylation (H3K4me) via
CC recruitment of the SETD1A or SETD1B to the 'Ser-5' phosphorylated C-
CC terminal domain (CTD) of RNA polymerase II large subunit (POLR2A). Part
CC of a transcription termination checkpoint that promotes transcription
CC termination of long non-coding RNAs (lncRNAs).
CC {ECO:0000250|UniProtKB:Q6UXN9}.
CC -!- SUBUNIT: Component of the SET1 complex. {ECO:0000250|UniProtKB:Q6UXN9}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q6UXN9}.
CC Chromosome {ECO:0000250|UniProtKB:Q8BFQ4}. Note=Associates with
CC chromatin (By similarity). Recruited at sites of high RNA polymerase II
CC occupancy (By similarity). {ECO:0000250|UniProtKB:Q6UXN9,
CC ECO:0000250|UniProtKB:Q8BFQ4}.
CC -!- SIMILARITY: Belongs to the WD repeat SWD2 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC082629; AAH82629.1; -; mRNA.
DR RefSeq; NP_001087972.1; NM_001094503.1.
DR RefSeq; XP_018112333.1; XM_018256844.1.
DR AlphaFoldDB; Q640J6; -.
DR SMR; Q640J6; -.
DR BioGRID; 104737; 1.
DR IntAct; Q640J6; 1.
DR DNASU; 494657; -.
DR GeneID; 494657; -.
DR KEGG; xla:494657; -.
DR CTD; 494657; -.
DR Xenbase; XB-GENE-6255612; wdr82.L.
DR OMA; MAFRDYN; -.
DR OrthoDB; 1146727at2759; -.
DR Proteomes; UP000186698; Chromosome 4L.
DR Bgee; 494657; Expressed in egg cell and 19 other tissues.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0006353; P:DNA-templated transcription, termination; IEA:UniProtKB-KW.
DR GO; GO:0110064; P:lncRNA catabolic process; ISS:UniProtKB.
DR GO; GO:0032785; P:negative regulation of DNA-templated transcription, elongation; ISS:UniProtKB.
DR GO; GO:0140744; P:negative regulation of lncRNA transcription; ISS:UniProtKB.
DR GO; GO:0071027; P:nuclear RNA surveillance; ISS:UniProtKB.
DR Gene3D; 2.130.10.10; -; 1.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR037867; Swd2/WDR82.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_repeat.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR19861; PTHR19861; 1.
DR Pfam; PF00400; WD40; 3.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF50978; SSF50978; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 3.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
PE 2: Evidence at transcript level;
KW Chromosome; Nucleus; Reference proteome; Repeat; Transcription;
KW Transcription regulation; Transcription termination; WD repeat.
FT CHAIN 1..313
FT /note="WD repeat-containing protein 82-A"
FT /id="PRO_0000279689"
FT REPEAT 19..58
FT /note="WD 1"
FT REPEAT 105..144
FT /note="WD 2"
FT REPEAT 146..184
FT /note="WD 3"
FT REPEAT 192..231
FT /note="WD 4"
FT REPEAT 236..276
FT /note="WD 5"
FT REPEAT 280..313
FT /note="WD 6"
SQ SEQUENCE 313 AA; 35147 MW; A36C18347926E916 CRC64;
MKLTDGVLRS FRVAKVFREN SDKINCFDFS PTGETVISSS DDDSIVLYDC QEGKPKRTLY
SKKYGVDLIR YTHAANTVVY SSNKIDDTIR YLSLHDNKYI RYFPGHSKRV VALSMSPVDD
TFISASLDKT IRLWDLRSPN CQGLMHLQGK PVCSFDPEGL IFAAGVNSEM VKLYDLRSFD
KGPFATFKMQ YDRTCEWTSL KFSQDGKLIL MSTNGGFLRL VDAFKGAVMH TFGGYNNSKA
VTLEASFTPD SQFIMIGSED GKIHVWNCES GMKVAVLDGK HTGPITCLQF NPKFMTFASA
CSNMAFWLPT IDD