ID TFB1A_ARATH Reviewed; 591 AA.
AC Q3ECP0; Q9LFZ6;
DT 08-MAR-2011, integrated into UniProtKB/Swiss-Prot.
DT 08-NOV-2005, sequence version 1.
DT 25-MAY-2022, entry version 107.
DE RecName: Full=General transcription and DNA repair factor IIH subunit TFB1-1;
DE Short=AtTFB1-1 {ECO:0000303|PubMed:15645454};
DE Short=TFIIH subunit TFB1-1;
DE AltName: Full=RNA polymerase II transcription factor B subunit 1-1;
GN Name=TFB1-1 {ECO:0000305}; Synonyms=GTF2H1-1 {ECO:0000305};
GN OrderedLocusNames=At1g55750; ORFNames=F20N2.15;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP COMPONENT OF TFIIH CORE COMPLEX, AND NOMENCLATURE.
RX PubMed=15645454; DOI=10.1002/em.20094;
RA Kunz B.A., Anderson H.J., Osmond M.J., Vonarx E.J.;
RT "Components of nucleotide excision repair and DNA damage tolerance in
RT Arabidopsis thaliana.";
RL Environ. Mol. Mutagen. 45:115-127(2005).
CC -!- FUNCTION: Component of the general transcription and DNA repair factor
CC IIH (TFIIH) core complex, which is involved in general and
CC transcription-coupled nucleotide excision repair (NER) of damaged DNA
CC and, when complexed to CAK, in RNA transcription by RNA polymerase II.
CC In NER, TFIIH acts by opening DNA around the lesion to allow the
CC excision of the damaged oligonucleotide and its replacement by a new
CC DNA fragment. In transcription, TFIIH has an essential role in
CC transcription initiation. When the pre-initiation complex (PIC) has
CC been established, TFIIH is required for promoter opening and promoter
CC escape. Phosphorylation of the C-terminal domain (CTD) of the largest
CC subunit of RNA polymerase II by the kinase module CAK controls the
CC initiation of transcription. {ECO:0000250|UniProtKB:P32780}.
CC -!- SUBUNIT: Component of the 7-subunit TFIIH core complex composed of XPB,
CC XPD, TFB1/GTF2H1, GTF2H2/P44, TFB4/GTF2H3, TFB2/GTF2H4 and TFB5/GTF2H5,
CC which is active in NER. The core complex associates with the 3-subunit
CC CDK-activating kinase (CAK) module composed of CYCH1/cyclin H1, CDKD
CC and MAT1/At4g30820 to form the 10-subunit holoenzyme (holo-TFIIH)
CC active in transcription. {ECO:0000250|UniProtKB:P32780,
CC ECO:0000305|PubMed:15645454}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- SIMILARITY: Belongs to the TFB1 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAF79503.1; Type=Erroneous gene model prediction; Note=The predicted gene has been split into 2 genes: At1g55750 and At1g55760.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC002328; AAF79503.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE33293.1; -; Genomic_DNA.
DR EMBL; CP002684; ANM59713.1; -; Genomic_DNA.
DR EMBL; AK226523; BAE98663.1; -; mRNA.
DR RefSeq; NP_001319242.1; NM_001333725.1.
DR RefSeq; NP_175971.3; NM_104451.4.
DR AlphaFoldDB; Q3ECP0; -.
DR SMR; Q3ECP0; -.
DR BioGRID; 27249; 7.
DR IntAct; Q3ECP0; 7.
DR STRING; 3702.AT1G55750.1; -.
DR PaxDb; Q3ECP0; -.
DR PRIDE; Q3ECP0; -.
DR ProteomicsDB; 234406; -.
DR EnsemblPlants; AT1G55750.1; AT1G55750.1; AT1G55750.
DR EnsemblPlants; AT1G55750.6; AT1G55750.6; AT1G55750.
DR GeneID; 842024; -.
DR Gramene; AT1G55750.1; AT1G55750.1; AT1G55750.
DR Gramene; AT1G55750.6; AT1G55750.6; AT1G55750.
DR KEGG; ath:AT1G55750; -.
DR Araport; AT1G55750; -.
DR TAIR; locus:2020447; AT1G55750.
DR eggNOG; KOG2074; Eukaryota.
DR HOGENOM; CLU_017639_2_0_1; -.
DR InParanoid; Q3ECP0; -.
DR OMA; ENVPAKM; -.
DR OrthoDB; 946128at2759; -.
DR PhylomeDB; Q3ECP0; -.
DR PRO; PR:Q3ECP0; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q3ECP0; baseline and differential.
DR Genevisible; Q3ECP0; AT.
DR GO; GO:0000439; C:transcription factor TFIIH core complex; IBA:GO_Central.
DR GO; GO:0005675; C:transcription factor TFIIH holo complex; IBA:GO_Central.
DR GO; GO:0006281; P:DNA repair; IBA:GO_Central.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR GO; GO:0070816; P:phosphorylation of RNA polymerase II C-terminal domain; IBA:GO_Central.
DR GO; GO:0006360; P:transcription by RNA polymerase I; IBA:GO_Central.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IBA:GO_Central.
DR Gene3D; 1.10.3970.10; -; 1.
DR InterPro; IPR005607; BSD_dom.
DR InterPro; IPR035925; BSD_dom_sf.
DR InterPro; IPR027079; Tfb1/GTF2H1.
DR InterPro; IPR013876; TFIIH_BTF_p62_N.
DR PANTHER; PTHR12856; PTHR12856; 1.
DR Pfam; PF03909; BSD; 1.
DR Pfam; PF08567; PH_TFIIH; 1.
DR SMART; SM00751; BSD; 2.
DR SUPFAM; SSF140383; SSF140383; 2.
DR PROSITE; PS50858; BSD; 2.
PE 2: Evidence at transcript level;
KW DNA damage; DNA repair; Nucleus; Reference proteome; Repeat; Transcription;
KW Transcription regulation.
FT CHAIN 1..591
FT /note="General transcription and DNA repair factor IIH
FT subunit TFB1-1"
FT /id="PRO_0000406092"
FT DOMAIN 112..166
FT /note="BSD 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00036"
FT DOMAIN 191..243
FT /note="BSD 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00036"
SQ SEQUENCE 591 AA; 67514 MW; 26CEAA2CE8178C2A CRC64;
MAGGQIEKLV KYKSTVKDPG TPGFLRIREG MLLFVPNDPK SDSKLKVLTQ NIKSQKYTKE
GSNKPPWLNL TNKQAKSHIF EFENYPDMHA CRDFITKALA KCELEPNKSV VSTSSEQLSI
KELELRFKLL RENSELQRLH KQFVESKVLT EDEFWATRKK LLGKDSIRKS KQQLGLKSMM
VSGIKPSTDG RTNRVTFNLT PEIIFQIFAE KPAVRQAFIN YVPSKMTEKD FWTKYFRAEY
LYSTKNTAVA AAEAAEDEEL AVFLKPDEIL ARETRHKIRR VDPTLDMEAD QGDDYTHLMD
HGIQRDGTMD VVEPQNDQFK RSLLQDLNRH AAVVLEGRSI DVESEDTRIV AEALTRVKQV
SKADGETTKD ANQERLERMS RVAGMEDLQA PQNFPLAPLS IKDPRDYFES QQGNVLNVPR
GAKGLKRNVH EAYGLLKESI LEIRATGLSD PLIKPEVSFE VFSSLTRTIA TAKNINGKNP
RESFLDRLPK STKDEVLHHW TSIQELLKHF WSSYPITTTY LHTKVGKLKD AMSNTYSKLE
AMKESVQSDL RHQVSLLVRP MQQALDAAFH HYEVDLQRRT AKSGERPNGY V