INT4_DICDI
ID INT4_DICDI Reviewed; 1233 AA.
AC Q54LH5;
DT 22-JUL-2008, integrated into UniProtKB/Swiss-Prot.
DT 24-MAY-2005, sequence version 1.
DT 25-MAY-2022, entry version 87.
DE RecName: Full=Integrator complex subunit 4 homolog;
GN Name=ints4; ORFNames=DDB_G0286633;
OS Dictyostelium discoideum (Slime mold).
OC Eukaryota; Amoebozoa; Evosea; Eumycetozoa; Dictyostelia; Dictyosteliales;
OC Dictyosteliaceae; Dictyostelium.
OX NCBI_TaxID=44689;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AX4;
RX PubMed=15875012; DOI=10.1038/nature03481;
RA Eichinger L., Pachebat J.A., Gloeckner G., Rajandream M.A., Sucgang R.,
RA Berriman M., Song J., Olsen R., Szafranski K., Xu Q., Tunggal B.,
RA Kummerfeld S., Madera M., Konfortov B.A., Rivero F., Bankier A.T.,
RA Lehmann R., Hamlin N., Davies R., Gaudet P., Fey P., Pilcher K., Chen G.,
RA Saunders D., Sodergren E.J., Davis P., Kerhornou A., Nie X., Hall N.,
RA Anjard C., Hemphill L., Bason N., Farbrother P., Desany B., Just E.,
RA Morio T., Rost R., Churcher C.M., Cooper J., Haydock S., van Driessche N.,
RA Cronin A., Goodhead I., Muzny D.M., Mourier T., Pain A., Lu M., Harper D.,
RA Lindsay R., Hauser H., James K.D., Quiles M., Madan Babu M., Saito T.,
RA Buchrieser C., Wardroper A., Felder M., Thangavelu M., Johnson D.,
RA Knights A., Loulseged H., Mungall K.L., Oliver K., Price C., Quail M.A.,
RA Urushihara H., Hernandez J., Rabbinowitsch E., Steffen D., Sanders M.,
RA Ma J., Kohara Y., Sharp S., Simmonds M.N., Spiegler S., Tivey A.,
RA Sugano S., White B., Walker D., Woodward J.R., Winckler T., Tanaka Y.,
RA Shaulsky G., Schleicher M., Weinstock G.M., Rosenthal A., Cox E.C.,
RA Chisholm R.L., Gibbs R.A., Loomis W.F., Platzer M., Kay R.R.,
RA Williams J.G., Dear P.H., Noegel A.A., Barrell B.G., Kuspa A.;
RT "The genome of the social amoeba Dictyostelium discoideum.";
RL Nature 435:43-57(2005).
CC -!- FUNCTION: Component of the Integrator complex, a complex involved in
CC the small nuclear RNAs (snRNA) U1 and U2 transcription and in their 3'-
CC box-dependent processing. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the Integrator subunit 4 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAFI02000089; EAL64072.1; -; Genomic_DNA.
DR RefSeq; XP_637586.1; XM_632494.1.
DR AlphaFoldDB; Q54LH5; -.
DR STRING; 44689.DDB0234080; -.
DR PaxDb; Q54LH5; -.
DR EnsemblProtists; EAL64072; EAL64072; DDB_G0286633.
DR GeneID; 8625726; -.
DR KEGG; ddi:DDB_G0286633; -.
DR dictyBase; DDB_G0286633; ints4.
DR eggNOG; KOG2259; Eukaryota.
DR HOGENOM; CLU_267504_0_0_1; -.
DR InParanoid; Q54LH5; -.
DR OMA; CIKLIWI; -.
DR PRO; PR:Q54LH5; -.
DR Proteomes; UP000002195; Chromosome 4.
DR GO; GO:0032039; C:integrator complex; IBA:GO_Central.
DR GO; GO:0016180; P:snRNA processing; IBA:GO_Central.
DR Gene3D; 1.25.10.10; -; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR SUPFAM; SSF48371; SSF48371; 1.
PE 3: Inferred from homology;
KW Nucleus; Reference proteome; Repeat.
FT CHAIN 1..1233
FT /note="Integrator complex subunit 4 homolog"
FT /id="PRO_0000344376"
FT REPEAT 236..273
FT /note="HEAT 1"
FT REPEAT 275..310
FT /note="HEAT 2"
FT REPEAT 487..524
FT /note="HEAT 3"
FT REPEAT 525..561
FT /note="HEAT 4"
FT REPEAT 563..597
FT /note="HEAT 5"
FT REGION 426..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 767..796
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 993..1057
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1014..1046
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1233 AA; 141988 MW; 330219236A04458A CRC64;
MNKKSHQQQV LQQQQQQQQQ QFQQQHFQQQ QQQQQFQQQQ QQQQQLQQQQ LQQQQQQQIQ
QQQQIQQQQQ QQQQQQQQIQ QQQQPQQIQQ PQQQLPNFSV PIINQSIISN IKLYSTFETL
LLSLSSTDTR EQTKAIIYLS SLTINNKSNG IDLVYCIQKQ LVVEAHNHIK VLLINLLGDI
SLDPFIDSFH ILNKLLYLIR NEKSKKVLSS CLININKISK SLTFSKGNDV NSSSSSLINQ
IIEYIEPLLH STSPLVRRET IVLLGSIVKN LDKESEEIQI LLLNYLKDTD FRVREASLKS
LSVIFQRGAS LSVNKLYQSI ILLLLDSFEQ VRLECIKLIW IFGNIYPNHI VVSGGTKIRL
VDDVFKKICN AVNDSSVIVR NCACKLLGCT YDVSLNYLIQ TLSKEVMVWG KGKQYQIGHS
SITKQRLQQK QQQQQQQQQQ QQQQPPQQQP SQQPNQQPNQ QQTNVSGTHI ATPEGDFDVV
GSDSLNILES GVIGAFIQGL EDEFYEVRSS AIDSMCELSV RNDEFAQKNI DFLVDIFNDE
IESVRINSIN SLRKIGNNVV IKEEQLHIIL ANLESSSKEE RQSLHRLLTS IHLSNYSCLH
ATTQALLMNL SRYPYDIHSI FETLKIIGQT NPFTEFIVDD LLRIDPKFAS VEPNMDDIFY
VAVMVLVLNS CIKNRNILSL LPSFSFQHHL YFKDKYPKYF PQNLQKDSNI ILAPTLKYSS
VVNNNNNKNN NNNTTTTSII NDNGEINQFL NLTLSLLYGD NNNIINNNNT NNNNNNNNNN
NNNNNNNNNN NNDENDENLN YGFQRLFLNQ KKEIQLNKLF QECKRNFKRI SSICLSLKPT
SDFYNKYLKI LTTIVSSSRR FKTITTSNQL NHLLYKLNHK FIGVDNNCKL ILMEIGIYSF
IIEILSTITL STSSPPTTTT TTSSTSLFTS INSTPIELSN DQINQLTKKL NEYFKFSNDF
KLEISTSIKS LLDLFNIKNN NINNNNNIIN KKDKKLKENE ENEENEENEN NENENEKENG
KNKEKEKNEN DNENENENER PSKIQKTTSE LIKTTTTTST STSTLIHYNL IEWTDNYIPP
ILKISNLIKK KSVDVILPIP NDKPIEFLDI FPLKLMINAK IENIRNLDSL FIQSTWQSIH
TGQLNSQIHI IPKSAFTPTK PLSYNISCSI YLSLQNIRHN RLSSKESIPL RISIVEQFLN
NNNNNNVTII PLSKFVIFNI LPVDNVNLLP NLI