ID USF1_RABIT Reviewed; 310 AA.
AC O02818; O02819;
DT 15-DEC-1998, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-1997, sequence version 1.
DT 25-MAY-2022, entry version 111.
DE RecName: Full=Upstream stimulatory factor 1;
DE AltName: Full=Major late transcription factor 1;
GN Name=USF1;
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS USF1A AND USF1B).
RC STRAIN=New Zealand white; TISSUE=Lung;
RX PubMed=9287355; DOI=10.1074/jbc.272.37.23398;
RA Gao E., Wang Y., Alcorn J.L., Mendelson C.R.;
RT "The basic helix-loop-helix-zipper transcription factor USF1 regulates
RT expression of the surfactant protein-A gene.";
RL J. Biol. Chem. 272:23398-23406(1997).
CC -!- FUNCTION: Transcription factor that binds to a symmetrical DNA sequence
CC (E-boxes) (5'-CACGTG-3') that is found in a variety of viral and
CC cellular promoters. Regulates the expression of the surfactant protein-
CC A (SP-A) gene.
CC -!- SUBUNIT: Efficient DNA binding requires dimerization with another bHLH
CC protein. Binds DNA as a homodimer or a heterodimer (USF1/USF2).
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=USF1A;
CC IsoId=O02818-1; Sequence=Displayed;
CC Name=USF1B;
CC IsoId=O02818-2; Sequence=VSP_002163;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF003894; AAC48764.1; -; mRNA.
DR EMBL; AF003895; AAC48765.1; -; mRNA.
DR RefSeq; NP_001076104.1; NM_001082635.1. [O02818-1]
DR AlphaFoldDB; O02818; -.
DR SMR; O02818; -.
DR STRING; 9986.ENSOCUP00000002031; -.
DR GeneID; 100009324; -.
DR KEGG; ocu:100009324; -.
DR CTD; 7391; -.
DR eggNOG; KOG1318; Eukaryota.
DR InParanoid; O02818; -.
DR OrthoDB; 1345445at2759; -.
DR Proteomes; UP000001811; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR Gene3D; 4.10.280.10; -; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SUPFAM; SSF47459; SSF47459; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; DNA-binding; Isopeptide bond; Nucleus;
KW Reference proteome; Transcription; Transcription regulation;
KW Ubl conjugation.
FT CHAIN 1..310
FT /note="Upstream stimulatory factor 1"
FT /id="PRO_0000127498"
FT DOMAIN 199..254
FT /note="bHLH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00981"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 171..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 271..292
FT /note="Leucine-zipper"
FT COMPBIAS 192..209
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CROSSLNK 306
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:P22415"
FT VAR_SEQ 131..158
FT /note="Missing (in isoform USF1B)"
FT /evidence="ECO:0000303|PubMed:9287355"
FT /id="VSP_002163"
SQ SEQUENCE 310 AA; 33510 MW; 46EE3E5F6FE08E0E CRC64;
MKGQQKTAET EEGTVQIQEG AVATGEDPTS VAIASIQSAA TFPDPNVKYV FRTENGGQVM
YRVIQVSEGQ LDGQTEGTGA ISGYPATQSM TQAVIQGAFT SDDTVDTEGT AAETHYTYFP
STAVGDGAGG TTSGSTAAVV TTQGSEALLG QATPPGTGQF FVMMSPQEVL QGGSQRSIAP
RTHPYSPKSA APRTTRDEKR RAQHNEVERR RRDKINNWIV QLSKIIPDCS MESTKSGQSK
GGILSKACDY IQELRQSNHR LSEELQGLDQ LQLDNDVLRQ QVEDLKNKNL LLRAQLRHHG
LEVVIKNDSN
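
The feature coordinates in this entry are 1-based and inclusive, so the VAR_SEQ feature (131..158, "Missing (in isoform USF1B)") can be applied to the displayed USF1A sequence by deleting that range. The following is a minimal Python sketch of that step, not part of the UniProt record: the helper names (`canonical_sequence`, `feature_slice`, `apply_missing`) are hypothetical, and the sequence and coordinates are copied by hand from the SQ and FT lines above rather than parsed from the file.

```python
# Sequence lines copied from the SQ block of the entry (isoform USF1A, displayed).
SQ_LINES = """
MKGQQKTAET EEGTVQIQEG AVATGEDPTS VAIASIQSAA TFPDPNVKYV FRTENGGQVM
YRVIQVSEGQ LDGQTEGTGA ISGYPATQSM TQAVIQGAFT SDDTVDTEGT AAETHYTYFP
STAVGDGAGG TTSGSTAAVV TTQGSEALLG QATPPGTGQF FVMMSPQEVL QGGSQRSIAP
RTHPYSPKSA APRTTRDEKR RAQHNEVERR RRDKINNWIV QLSKIIPDCS MESTKSGQSK
GGILSKACDY IQELRQSNHR LSEELQGLDQ LQLDNDVLRQ QVEDLKNKNL LLRAQLRHHG
LEVVIKNDSN
"""


def canonical_sequence(sq_text: str) -> str:
    """Join the 10-residue blocks of an SQ section into one string."""
    return "".join(sq_text.split())


def feature_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as used on FT lines."""
    return seq[start - 1:end]


def apply_missing(seq: str, start: int, end: int) -> str:
    """Apply a 'Missing' VAR_SEQ feature by deleting residues start..end."""
    return seq[:start - 1] + seq[end:]


USF1A = canonical_sequence(SQ_LINES)

# FT VAR_SEQ 131..158, /note="Missing (in isoform USF1B)"
USF1B = apply_missing(USF1A, 131, 158)

# FT DOMAIN 199..254 (bHLH) and FT CROSSLNK 306 (SUMO2 isopeptide bond).
bhlh = feature_slice(USF1A, 199, 254)
crosslink_residue = feature_slice(USF1A, 306, 306)

# Residues at every seventh position of FT REGION 271..292 (Leucine-zipper),
# i.e. positions 271, 278, 285 and 292.
zipper_heptad = [USF1A[pos - 1] for pos in range(271, 293, 7)]

if __name__ == "__main__":
    print("USF1A length:", len(USF1A))
    print("USF1B length:", len(USF1B))
    print("bHLH length:", len(bhlh))
    print("Residue 306:", crosslink_residue)
    print("Zipper heptad residues:", zipper_heptad)
```

Comparing `len(USF1A)` with the 310 AA stated on the ID and SQ lines is a quick check that the sequence was copied intact before any feature coordinates are applied. Because the coordinates are inclusive, the deleted VAR_SEQ range covers 158 - 131 + 1 = 28 residues.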