INT9_MACFA
ID INT9_MACFA Reviewed; 637 AA.
AC Q4R5Z4;
DT 31-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 19-JUL-2005, sequence version 1.
DT 03-AUG-2022, entry version 55.
DE RecName: Full=Integrator complex subunit 9;
DE Short=Int9;
GN Name=INTS9; ORFNames=QtsA-19691;
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Testis;
RG International consortium for macaque cDNA sequencing and analysis;
RT "DNA sequences of macaque genes expressed in brain or testis and its
RT evolutionary implications.";
RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Component of the Integrator (INT) complex, a complex involved
CC in the transcription of the small nuclear RNAs (snRNAs) U1 and U2 and in
CC their 3'-box-dependent processing. The Integrator complex is associated
CC with the C-terminal domain (CTD) of the RNA polymerase II largest subunit
CC (POLR2A) and is recruited to the U1 and U2 snRNA genes. Mediates
CC recruitment of cytoplasmic dynein to the nuclear envelope, probably as a
CC component of the INT complex. {ECO:0000250|UniProtKB:Q9NV88}.
CC -!- SUBUNIT: Belongs to the multiprotein complex Integrator, at least
CC composed of INTS1, INTS2, INTS3, INTS4, INTS5, INTS6, INTS7, INTS8,
CC INTS9/RC74, INTS10, INTS11/CPSF3L and INTS12 (By similarity). Interacts
CC with ESRRB; ESRRB is probably not a core component of the Integrator
CC complex, and this association acts as a bridge for interaction with the
CC complex and attracts the transcriptional machinery (By similarity).
CC {ECO:0000250|UniProtKB:Q8K114, ECO:0000250|UniProtKB:Q9NV88}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q9NV88}.
CC -!- MISCELLANEOUS: Although strongly related to RNA-specific endonuclease
CC proteins, it lacks the HXHXDH motif that binds zinc and participates in
CC the catalytic center. Its function as an endonuclease is therefore
CC uncertain.
CC -!- SIMILARITY: Belongs to the metallo-beta-lactamase superfamily. RNA-
CC metabolizing metallo-beta-lactamase-like family. INTS9 subfamily.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB169398; BAE01481.1; -; mRNA.
DR RefSeq; NP_001270778.1; NM_001283849.1.
DR AlphaFoldDB; Q4R5Z4; -.
DR SMR; Q4R5Z4; -.
DR STRING; 9541.XP_005563022.1; -.
DR GeneID; 101867367; -.
DR CTD; 55756; -.
DR eggNOG; KOG1138; Eukaryota.
DR Proteomes; UP000233100; Unplaced.
DR GO; GO:0032039; C:integrator complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0016180; P:snRNA processing; IEA:InterPro.
DR Gene3D; 3.60.15.10; -; 1.
DR InterPro; IPR022712; Beta_Casp.
DR InterPro; IPR027074; Integrator_9su.
DR InterPro; IPR001279; Metallo-B-lactamas.
DR InterPro; IPR036866; RibonucZ/Hydroxyglut_hydro.
DR PANTHER; PTHR46094; PTHR46094; 1.
DR Pfam; PF16661; Lactamase_B_6; 1.
DR SMART; SM01027; Beta-Casp; 1.
DR SUPFAM; SSF56281; SSF56281; 1.
PE 2: Evidence at transcript level;
KW Isopeptide bond; Nucleus; Reference proteome; Ubl conjugation.
FT CHAIN 1..637
FT /note="Integrator complex subunit 9"
FT /id="PRO_0000259558"
FT REGION 527..553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CROSSLNK 58
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q9NV88"
SQ SEQUENCE 637 AA; 71425 MW; 6F6164CE93DE87D7 CRC64;
MKLYCLSGHP TLPCNVLKFK STTIMLDCGL DMTSTLNFLP LPLVQSPRLS SLPGWSLKDG
NAFLDKTELI DLSTVDVILI SNYHCMMALP YITEHTGFTG TVYATEPTVQ IGRLLMEELV
NFIERVPKAQ SASLWKNKDI QRLLPSPLKD AVEVSTWRRC YTMQEVNSAL SKIQLVGFSQ
KIELFGAVQV TPLSSGYALG SSNWIIQSHY EKVSYVSGSS LLTTHPQPMD QASLKNSDVL
VLTGLTQIPT ANPDGMVGEF CSNLALTVRN GGNVLVPCYP SGVIYDLLEC LYQYIDSAGL
SSVPLYFISP VANSSLEFSQ IFAEWLCHNK QSKVYLPEPP FPHAELIQTN KLKHYPSIHG
DFSNDFRQPC VVFTGHPSLR FGDVVHFMEL WGKSSLNTVI FTEPDFSYLE ALAPYQPLAM
KCIYCPIDTR LNFIQVSKLL KEVQPLHVVC PEQYTQPPPA QSHRMDLMID CQPPAMSYRR
AEVLALPFKR RYEKIEIMPE LADSLVPMEI KPGISLATVS AVLHTKDNKH LLQPPPRPAQ
PTSGKKRKRV SDDVPDCKVL KPLLSGSIPV EQFVQTLEKH GFSDIKVEDT AKGHIVLLQE
AETLIQIEED STHIICDNDE MLRVRLRDLV LKFLQKF