ID INT4_MOUSE Reviewed; 964 AA.
AC Q8CIM8; Q3TQQ4; Q8C2H1; Q91YV5; Q9CSY4;
DT 31-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 137.
DE RecName: Full=Integrator complex subunit 4;
DE Short=Int4;
GN Name=Ints4;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3), AND NUCLEOTIDE
RP SEQUENCE [LARGE SCALE MRNA] OF 622-964 (ISOFORM 1).
RC STRAIN=C57BL/6J, and NOD; TISSUE=Thymus;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=FVB/N; TISSUE=Mammary tumor;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Liver, Pancreas, Spleen, and Testis;
RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001;
RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R.,
RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.;
RT "A tissue-specific atlas of mouse protein phosphorylation and expression.";
RL Cell 143:1174-1189(2010).
RN [4]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-27, AND IDENTIFICATION BY MASS
RP SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Embryonic fibroblast;
RX PubMed=23806337; DOI=10.1016/j.molcel.2013.06.001;
RA Park J., Chen Y., Tishkoff D.X., Peng C., Tan M., Dai L., Xie Z., Zhang Y.,
RA Zwaans B.M., Skinner M.E., Lombard D.B., Zhao Y.;
RT "SIRT5-mediated lysine desuccinylation impacts diverse metabolic
RT pathways.";
RL Mol. Cell 50:919-930(2013).
CC -!- FUNCTION: Component of the Integrator (INT) complex, which is involved
CC in transcription of the U1 and U2 small nuclear RNAs (snRNAs) and in
CC their 3'-box-dependent processing. The Integrator complex associates
CC with the C-terminal domain (CTD) of the largest subunit of RNA
CC polymerase II (POLR2A) and is recruited to the U1 and U2 snRNA genes.
CC Mediates recruitment of cytoplasmic dynein to the nuclear envelope,
CC probably as a component of the INT complex.
CC {ECO:0000250|UniProtKB:Q96HW7}.
CC -!- SUBUNIT: Component of the multiprotein Integrator complex, composed of
CC at least INTS1, INTS2, INTS3, INTS4, INTS5, INTS6, INTS7, INTS8,
CC INTS9/RC74, INTS10, INTS11/CPSF3L and INTS12.
CC {ECO:0000250|UniProtKB:Q96HW7}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q96HW7}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q8CIM8-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8CIM8-2; Sequence=VSP_021456;
CC Name=3;
CC IsoId=Q8CIM8-3; Sequence=VSP_021454, VSP_021455;
CC -!- SIMILARITY: Belongs to the Integrator subunit 4 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH13813.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAE37328.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK011676; BAB27773.1; -; mRNA.
DR EMBL; AK088635; BAC40468.1; -; mRNA.
DR EMBL; AK163383; BAE37328.1; ALT_FRAME; mRNA.
DR EMBL; BC013710; AAH13710.1; -; mRNA.
DR EMBL; BC013813; AAH13813.1; ALT_INIT; mRNA.
DR CCDS; CCDS21460.1; -. [Q8CIM8-1]
DR RefSeq; NP_081532.1; NM_027256.2. [Q8CIM8-1]
DR AlphaFoldDB; Q8CIM8; -.
DR SMR; Q8CIM8; -.
DR BioGRID; 221739; 20.
DR IntAct; Q8CIM8; 9.
DR MINT; Q8CIM8; -.
DR STRING; 10090.ENSMUSP00000026126; -.
DR iPTMnet; Q8CIM8; -.
DR PhosphoSitePlus; Q8CIM8; -.
DR EPD; Q8CIM8; -.
DR MaxQB; Q8CIM8; -.
DR PaxDb; Q8CIM8; -.
DR PeptideAtlas; Q8CIM8; -.
DR PRIDE; Q8CIM8; -.
DR ProteomicsDB; 269492; -. [Q8CIM8-1]
DR ProteomicsDB; 269493; -. [Q8CIM8-2]
DR ProteomicsDB; 269494; -. [Q8CIM8-3]
DR Antibodypedia; 31280; 132 antibodies from 26 providers.
DR DNASU; 101861; -.
DR Ensembl; ENSMUST00000026126; ENSMUSP00000026126; ENSMUSG00000025133. [Q8CIM8-1]
DR GeneID; 101861; -.
DR KEGG; mmu:101861; -.
DR UCSC; uc009ijg.2; mouse. [Q8CIM8-3]
DR UCSC; uc009ijh.2; mouse. [Q8CIM8-1]
DR CTD; 92105; -.
DR MGI; MGI:1917164; Ints4.
DR VEuPathDB; HostDB:ENSMUSG00000025133; -.
DR eggNOG; KOG2259; Eukaryota.
DR GeneTree; ENSGT00390000010128; -.
DR HOGENOM; CLU_012910_1_0_1; -.
DR InParanoid; Q8CIM8; -.
DR OMA; PHMQGSF; -.
DR OrthoDB; 256550at2759; -.
DR PhylomeDB; Q8CIM8; -.
DR TreeFam; TF315047; -.
DR Reactome; R-MMU-6807505; RNA polymerase II transcribes snRNA genes.
DR BioGRID-ORCS; 101861; 26 hits in 79 CRISPR screens.
DR ChiTaRS; Ints4; mouse.
DR PRO; PR:Q8CIM8; -.
DR Proteomes; UP000000589; Chromosome 7.
DR RNAct; Q8CIM8; protein.
DR Bgee; ENSMUSG00000025133; Expressed in dorsal pancreas and 253 other tissues.
DR ExpressionAtlas; Q8CIM8; baseline and differential.
DR Genevisible; Q8CIM8; MM.
DR GO; GO:0032039; C:integrator complex; ISS:HGNC.
DR GO; GO:0005730; C:nucleolus; ISO:MGI.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0016180; P:snRNA processing; ISS:HGNC.
DR Gene3D; 1.25.10.10; -; 3.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR026003; Cohesin_HEAT.
DR Pfam; PF12765; Cohesin_HEAT; 1.
DR SUPFAM; SSF48371; SSF48371; 1.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Isopeptide bond; Nucleus;
KW Reference proteome; Repeat; Ubl conjugation.
FT CHAIN 1..964
FT /note="Integrator complex subunit 4"
FT /id="PRO_0000259539"
FT REPEAT 67..106
FT /note="HEAT 1"
FT REPEAT 146..184
FT /note="HEAT 2"
FT REPEAT 191..229
FT /note="HEAT 3"
FT REPEAT 230..264
FT /note="HEAT 4"
FT REPEAT 278..314
FT /note="HEAT 5"
FT REPEAT 370..406
FT /note="HEAT 6"
FT REPEAT 407..445
FT /note="HEAT 7"
FT REPEAT 447..485
FT /note="HEAT 8"
FT MOD_RES 27
FT /note="N6-acetyllysine"
FT /evidence="ECO:0007744|PubMed:23806337"
FT CROSSLNK 792
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO1); alternate"
FT /evidence="ECO:0000250|UniProtKB:Q96HW7"
FT CROSSLNK 792
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2); alternate"
FT /evidence="ECO:0000250|UniProtKB:Q96HW7"
FT VAR_SEQ 221..223
FT /note="LQL -> VTG (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_021454"
FT VAR_SEQ 224..964
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_021455"
FT VAR_SEQ 507..964
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_021456"
SQ SEQUENCE 964 AA; 108193 MW; BF00ADA2FF1076FF CRC64;
MAAHLKKRVY EEFTKVVQQQ QEEIATKKLR LTKPSKSAAL HIDLCKATSP ADALQYLLQF
ARKPVEAESV EGVVRILLEH YYKENDPSVR LKIASLLGLL SKTAGFSPDC IMDDAINILQ
NEKSHQVLAQ LLDTLLAIGS KLPENQATQV RLVDVACKHL TDTSHGVRNK CLQLLGNLGS
LEKSVTKDTE GSAARDVQKI IGDHFSDQDP RVRTAAIKAM LQLHERGLKL HQTIYNQACK
LLSDDYEQVR SAAVQLIWVV SQLYPESIVP IPSSNEEIRL VDDAFGKICH MVSDGSWVVR
VQAAKLLGSM EQVSSHFLEQ TLDKKLMSDL RRKRTAHERA KELYSSGEFS SGRKWGDDAP
KEEIDTGAVN LIESGACGAF VHGLEDEMYE VRIAAVEALC MLAQSSPSFA EKCLDFLVDM
FNDEIEEVRL QSIHTMRKIS NNITLREDQL DTVLAVLEDS SRDIREALHE LLCCTNVSTK
EGIHLALVEL LKNLTKYPTD RDSIWKCLKF LGSRHPTLVL PLVPELLSTH PFFDTAEPDM
DDPAYIAVLV LIFNAAKTCP TMPALFSDHT LRHYAYLRDS LSHLVPALRL PGRKLVSSTV
PSNITPHEDP SQQFLQQSLE RVYSVQHLDP QGAQELLEFT IRDLQRLGEL QSELAGVADF
SATYLQCQLL LIKALQEKLW NVAAPLYLKQ SDLASAAAKQ IMEETYKMEF MYSGVENKQV
VIIQHMRLQA KALQLIVTAR TTRGVDPLFG MCEKFLQEVD FFQRCFIADL PHLQDSFVDK
LLDLMPRLMA SKPVEVIKIL QTMLRQSTFL HLPLPEQIHK ASATIIEPAG ESDNPLRFTS
GLVVALDVDA TLEHVQDPQN TVKVQVLYPD GQAQMIHPKP ADFRNPGPGR HRLLTQVYLS
HTAWTEPCQV EVRLLLAYNS GARIPKSPWL EGSEMSPQVE TSIEGTIPFS KPVKVYIMPK
PARR