THOC7_DANRE
ID THOC7_DANRE Reviewed; 202 AA.
AC Q6DGZ3;
DT 13-NOV-2007, integrated into UniProtKB/Swiss-Prot.
DT 16-AUG-2004, sequence version 1.
DT 03-AUG-2022, entry version 92.
DE RecName: Full=THO complex subunit 7 homolog;
GN Name=thoc7; ORFNames=zgc:92711;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC TISSUE=Eye;
RG NIH - Zebrafish Gene Collection (ZGC) project;
RL Submitted (JUL-2004) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Required for efficient export of polyadenylated RNA. Acts as
CC component of the THO subcomplex of the TREX complex which is thought to
CC couple mRNA transcription, processing and nuclear export, and which
CC specifically associates with spliced mRNA and not with unspliced pre-
CC mRNA. TREX is recruited to spliced mRNAs by a transcription-independent
CC mechanism, binds to mRNA upstream of the exon-junction complex (EJC)
CC and is recruited in a splicing- and cap-dependent manner to a region
CC near the 5' end of the mRNA where it functions in mRNA export to the
CC cytoplasm via the TAP/NXF1 pathway. {ECO:0000250|UniProtKB:Q6I9Y2}.
CC -!- SUBUNIT: Component of the THO subcomplex of the transcription/export
CC (TREX) complex which seems to have a dynamic structure involving ATP-
CC dependent remodeling. {ECO:0000250|UniProtKB:Q6I9Y2}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000250|UniProtKB:Q6I9Y2}. Nucleus
CC {ECO:0000250|UniProtKB:Q6I9Y2}. Nucleus speckle
CC {ECO:0000250|UniProtKB:Q6I9Y2}.
CC -!- SIMILARITY: Belongs to the THOC7 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BC076191; AAH76191.1; -; mRNA.
DR RefSeq; NP_001003429.1; NM_001003429.2.
DR AlphaFoldDB; Q6DGZ3; -.
DR SMR; Q6DGZ3; -.
DR STRING; 7955.ENSDARP00000022217; -.
DR PaxDb; Q6DGZ3; -.
DR Ensembl; ENSDART00000005639; ENSDARP00000022217; ENSDARG00000015394.
DR GeneID; 445035; -.
DR KEGG; dre:445035; -.
DR CTD; 80145; -.
DR ZFIN; ZDB-GENE-040801-17; thoc7.
DR eggNOG; KOG3215; Eukaryota.
DR GeneTree; ENSGT00390000002873; -.
DR HOGENOM; CLU_087727_0_0_1; -.
DR InParanoid; Q6DGZ3; -.
DR OMA; WANSKND; -.
DR OrthoDB; 1394258at2759; -.
DR PhylomeDB; Q6DGZ3; -.
DR TreeFam; TF319308; -.
DR Reactome; R-DRE-159236; Transport of Mature mRNA derived from an Intron-Containing Transcript.
DR Reactome; R-DRE-72187; mRNA 3'-end processing.
DR Reactome; R-DRE-73856; RNA Polymerase II Transcription Termination.
DR PRO; PR:Q6DGZ3; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Chromosome 11.
DR Bgee; ENSDARG00000015394; Expressed in early embryo and 29 other tissues.
DR GO; GO:0005737; C:cytoplasm; ISS:UniProtKB.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; ISS:UniProtKB.
DR GO; GO:0000445; C:THO complex part of transcription export complex; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006406; P:mRNA export from nucleus; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR008501; THOC7/Mft1.
DR Pfam; PF05615; THOC7; 1.
PE 2: Evidence at transcript level;
KW Coiled coil; Cytoplasm; mRNA processing; mRNA splicing; mRNA transport;
KW Nucleus; Reference proteome; RNA-binding; Transport.
FT CHAIN 1..202
FT /note="THO complex subunit 7 homolog"
FT /id="PRO_0000310756"
FT REGION 181..202
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 86..167
FT /evidence="ECO:0000255"
SQ SEQUENCE 202 AA; 23643 MW; 6E8A4828D6BD6BF1 CRC64;
MGSITDDEVI RKRLLIDGDG AGDDRRINVL MKSFTKWCHS SFSPEEGMSQ YQRMMMSLAQ
CEFSMGKTLL VYNMNLKEME NYEGIYTDIE KSIASAHEKI AECKKEIQRA KRIRKNRQEY
DALARVIKQH PDRHETLKQL EALDKDLQQL SHIKENVEDK LELRKKQFHV LLTTIQELQQ
TLENDEKMES DDTQDSPMEN GD