THO5B_ARATH
ID THO5B_ARATH Reviewed; 819 AA.
AC F4K4J0; Q94BS3; Q9FMM6;
DT 19-MAR-2014, integrated into UniProtKB/Swiss-Prot.
DT 28-JUN-2011, sequence version 1.
DT 25-MAY-2022, entry version 64.
DE RecName: Full=THO complex subunit 5B;
DE AltName: Full=THO complex subunit 5;
DE Short=AtTHO5;
GN Name=THO5B; Synonyms=THO5, THOC5A; OrderedLocusNames=At5g42920;
GN ORFNames=MBD2.12;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=9501997; DOI=10.1093/dnares/4.6.401;
RA Nakamura Y., Sato S., Kaneko T., Kotani H., Asamizu E., Miyajima N.,
RA Tabata S.;
RT "Structural analysis of Arabidopsis thaliana chromosome 5. III. Sequence
RT features of the regions of 1,191,918 bp covered by seventeen physically
RT assigned P1 clones.";
RL DNA Res. 4:401-414(1997).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP IDENTIFICATION BY MASS SPECTROMETRY, AND SUBUNIT.
RX PubMed=20634427; DOI=10.1073/pnas.0911341107;
RA Yelina N.E., Smith L.M., Jones A.M., Patel K., Kelly K.A., Baulcombe D.C.;
RT "Putative Arabidopsis THO/TREX mRNA export complex is involved in transgene
RT and endogenous siRNA biosynthesis.";
RL Proc. Natl. Acad. Sci. U.S.A. 107:13948-13953(2010).
CC -!- FUNCTION: Acts as component of the THO subcomplex of the TREX complex
CC which is thought to couple mRNA transcription, processing and nuclear
CC export. {ECO:0000250}.
CC -!- SUBUNIT: Component of the THO complex, which is composed of THO1, THO2,
CC THO3, THO5, THO6 and THO7. {ECO:0000269|PubMed:20634427}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=F4K4J0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=F4K4J0-2; Sequence=VSP_053743;
CC -!- SIMILARITY: Belongs to the THOC5 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB09194.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB008264; BAB09194.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002688; AED94887.1; -; Genomic_DNA.
DR EMBL; CP002688; AED94888.1; -; Genomic_DNA.
DR EMBL; AY039925; AAK64029.1; -; mRNA.
DR EMBL; AY091346; AAM14285.1; -; mRNA.
DR RefSeq; NP_568616.1; NM_123657.3. [F4K4J0-2]
DR RefSeq; NP_974873.1; NM_203144.4. [F4K4J0-1]
DR AlphaFoldDB; F4K4J0; -.
DR SMR; F4K4J0; -.
DR BioGRID; 19554; 45.
DR IntAct; F4K4J0; 2.
DR STRING; 3702.AT5G42920.2; -.
DR iPTMnet; F4K4J0; -.
DR PaxDb; F4K4J0; -.
DR PRIDE; F4K4J0; -.
DR ProteomicsDB; 246460; -. [F4K4J0-1]
DR EnsemblPlants; AT5G42920.1; AT5G42920.1; AT5G42920. [F4K4J0-2]
DR EnsemblPlants; AT5G42920.2; AT5G42920.2; AT5G42920. [F4K4J0-1]
DR GeneID; 834304; -.
DR Gramene; AT5G42920.1; AT5G42920.1; AT5G42920. [F4K4J0-2]
DR Gramene; AT5G42920.2; AT5G42920.2; AT5G42920. [F4K4J0-1]
DR KEGG; ath:AT5G42920; -.
DR Araport; AT5G42920; -.
DR TAIR; locus:2160006; AT5G42920.
DR eggNOG; KOG2216; Eukaryota.
DR HOGENOM; CLU_019074_0_0_1; -.
DR InParanoid; F4K4J0; -.
DR OMA; SLMKLKW; -.
DR OrthoDB; 1048314at2759; -.
DR PRO; PR:F4K4J0; -.
DR Proteomes; UP000006548; Chromosome 5.
DR ExpressionAtlas; F4K4J0; baseline and differential.
DR Genevisible; F4K4J0; AT.
DR GO; GO:0000347; C:THO complex; IDA:UniProtKB.
DR GO; GO:0000445; C:THO complex part of transcription export complex; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0006406; P:mRNA export from nucleus; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0032786; P:positive regulation of DNA-templated transcription, elongation; IBA:GO_Central.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR019163; THO_Thoc5.
DR PANTHER; PTHR13375; PTHR13375; 1.
DR Pfam; PF09766; FmiP_Thoc5; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; mRNA processing; mRNA splicing; mRNA transport;
KW Nucleus; Reference proteome; RNA-binding; Transport.
FT CHAIN 1..819
FT /note="THO complex subunit 5B"
FT /id="PRO_0000425590"
FT REGION 285..332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..330
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..117
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14593172"
FT /id="VSP_053743"
SQ SEQUENCE 819 AA; 92577 MW; 8F4D02F6E600BC0F CRC64;
MEDGEIEEGM VTADEFPTPE VTTIETIQPP REPGKSPLEL LRESKTSVEE IVAKMLSMKK
QGNHKSEIRE LLTQMFLNFV NLRQANRAIL TEEDKVKAET ERAKAPVDFT TLQLHNLMYE
KSHYVKAIKA CRDFKSKYPD IDLVPEQDFF RHAPEAIKDQ SLSSDSSHVL MPKRLNFELH
QRKELCKHRA RLEQQKKSLL ETIAERKKFL SSLPLHLKSL KKASLPVQNH LGIQHTKKLK
QHNLAELLPP PLYVLYSQLL AQKEAFEESI ELEVVGSLKD AQAYARQQSR KDSGMSSNTE
SSRLEDDGPD DDDDGQRRRK RPKKLTSKEG SDKAGLYQVH PLKIVLHIYD DEIPDTKSLK
LVILKFEYLL KLNVVCVGAE GSQDGPEKNI FCNLFPDDAG LEPPHQSTKL ILGDGQTFDE
NRTSRPYKWV QHLAGIDISP VLLGQEAHNT DPAKSDTFVP DLSLYRQQHR VQTVLRRIRL
RKKAHLALAE QLDLLMKHEL PVVNCEDAPW ALHKVLCALD SWLHIQSSAS KSCSLTLNSV
EQVPEPMEID VDGRSISGKE DFESIREDGE LPSLVTAAAS LTSSNHTPSK VSNQARSRQL
ALMTKNLDSP ISKGKSPSFK KYEDDLDLVL DDDSEIDEPT GRTEAHVEEL CPEKADNSWV
DYGSREFALV FSRKTDGGKL WKLEAMVQIS MEYPLRPPLF SLSLHASSSS GNENGTNESD
HYNELRAMEA EVNLHMLKII PSDQENYLLS HQIRCLAMLF DYYVDDPSPD SKRGTATTVV
DVGLCKPVDG KLLVRSFRGR DHRKMISWKG RGCASGYPC