PO5F1_PIG
ID PO5F1_PIG Reviewed; 360 AA.
AC Q9TSV5;
DT 01-FEB-2003, integrated into UniProtKB/Swiss-Prot.
DT 01-MAY-2000, sequence version 1.
DT 03-AUG-2022, entry version 143.
DE RecName: Full=POU domain, class 5, transcription factor 1;
DE AltName: Full=Octamer-binding protein 3;
DE Short=Oct-3;
DE AltName: Full=Octamer-binding protein 4;
DE Short=Oct-4;
DE AltName: Full=Octamer-binding transcription factor 3;
DE Short=OTF-3;
GN Name=POU5F1; Synonyms=OCT3, OCT4;
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Large white; TISSUE=Fibroblast;
RX PubMed=11169259; DOI=10.1034/j.1399-0039.2001.057001055.x;
RA Chardon P., Rogel-Gaillard C., Cattolico L., Duprat S., Vaiman M.,
RA Renard C.;
RT "Sequence of the swine major histocompatibility complex region containing
RT all non-classical class I genes.";
RL Tissue Antigens 57:55-65(2001).
RN [2]
RP BIOTECHNOLOGY, AND FUNCTION.
RX PubMed=19738434; DOI=10.4161/cc.8.19.9589;
RA Roberts R.M., Telugu B.P., Ezashi T.;
RT "Induced pluripotent stem cells from swine (Sus scrofa): why they may prove
RT to be important.";
RL Cell Cycle 8:3078-3081(2009).
CC -!- FUNCTION: Transcription factor that binds to the octamer motif (5'-
CC ATTTGCAT-3'). Forms a trimeric complex with SOX2 or SOX15 on DNA and
CC controls the expression of a number of genes involved in embryonic
CC development such as YES1, FGF4, UTF1 and ZFP206. Critical for early
CC embryogenesis and for embryonic stem cell pluripotency.
CC {ECO:0000250|UniProtKB:Q01860, ECO:0000269|PubMed:19738434}.
CC -!- SUBUNIT: Interacts with PKM. Interacts with WWP2. Interacts with UBE2I
CC and ZSCAN10. Interacts with PCGF1. Interacts with ESRRB; recruits ESRRB
CC near the POU5F1-SOX2 element in the NANOG proximal promoter; the
CC interaction is DNA independent. Interacts with ZNF322. Interacts with
CC MAPK8 and MAPK9; the interaction allows MAPK8 and MAPK9 to
CC phosphorylate POU5F1 on Ser-355. Interacts (when phosphorylated on Ser-
CC 355) with FBXW8. Interacts with FBXW4. Interacts with SOX2 and SOX15;
CC binds synergistically with either SOX2 or SOX15 to DNA (By similarity).
CC {ECO:0000250|UniProtKB:P20263, ECO:0000250|UniProtKB:Q01860}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm. Nucleus. Note=Expressed in a diffuse
CC and slightly punctuate pattern. Colocalizes with MAPK8 and MAPK9 in the
CC nucleus. {ECO:0000250|UniProtKB:P20263, ECO:0000250|UniProtKB:Q01860}.
CC -!- DOMAIN: The POU-specific domain mediates interaction with PKM.
CC {ECO:0000250|UniProtKB:Q01860}.
CC -!- DOMAIN: The 9aaTAD motif is a transactivation domain present in a large
CC number of yeast and animal transcription factors.
CC {ECO:0000250|UniProtKB:Q01860}.
CC -!- PTM: Sumoylation enhances the protein stability, DNA binding and
CC transactivation activity. Sumoylation is required for enhanced YES1
CC expression (By similarity). {ECO:0000250|UniProtKB:P20263}.
CC -!- PTM: Ubiquitinated; undergoes 'Lys-63'-linked polyubiquitination by
CC WWP2 leading to proteasomal degradation.
CC {ECO:0000250|UniProtKB:P20263}.
CC -!- PTM: ERK1/2-mediated phosphorylation at Ser-111 promotes nuclear
CC exclusion and proteasomal degradation. Phosphorylation at Thr-235 and
CC Ser-236 decrease DNA-binding and alters ability to activate
CC transcription. {ECO:0000250|UniProtKB:Q01860}.
CC -!- BIOTECHNOLOGY: POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four
CC Yamanaka factors. When combined, these factors are sufficient to
CC reprogram differentiated cells to an embryonic-like state designated
CC iPS (induced pluripotent stem) cells. iPS cells exhibit the morphology
CC and growth properties of ES cells and express ES cell marker genes.
CC {ECO:0000269|PubMed:19738434}.
CC -!- SIMILARITY: Belongs to the POU transcription factor family. Class-5
CC subfamily. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ251914; CAB63862.1; -; Genomic_DNA.
DR RefSeq; NP_001106531.1; NM_001113060.1.
DR AlphaFoldDB; Q9TSV5; -.
DR SMR; Q9TSV5; -.
DR STRING; 9823.ENSSSCP00000001474; -.
DR PaxDb; Q9TSV5; -.
DR PRIDE; Q9TSV5; -.
DR Ensembl; ENSSSCT00015062014; ENSSSCP00015024895; ENSSSCG00015046092.
DR Ensembl; ENSSSCT00025108244; ENSSSCP00025048995; ENSSSCG00025077765.
DR Ensembl; ENSSSCT00030095752; ENSSSCP00030044105; ENSSSCG00030068457.
DR Ensembl; ENSSSCT00035034185; ENSSSCP00035013523; ENSSSCG00035025921.
DR Ensembl; ENSSSCT00040103222; ENSSSCP00040046730; ENSSSCG00040074631.
DR Ensembl; ENSSSCT00045067635; ENSSSCP00045048058; ENSSSCG00045038915.
DR Ensembl; ENSSSCT00050007076; ENSSSCP00050003046; ENSSSCG00050005168.
DR Ensembl; ENSSSCT00055060100; ENSSSCP00055048158; ENSSSCG00055030204.
DR Ensembl; ENSSSCT00060050970; ENSSSCP00060021742; ENSSSCG00060037650.
DR Ensembl; ENSSSCT00065024219; ENSSSCP00065009893; ENSSSCG00065018209.
DR Ensembl; ENSSSCT00070048504; ENSSSCP00070040957; ENSSSCG00070024285.
DR GeneID; 100127461; -.
DR KEGG; ssc:100127461; -.
DR CTD; 5460; -.
DR eggNOG; KOG3802; Eukaryota.
DR HOGENOM; CLU_066243_0_0_1; -.
DR InParanoid; Q9TSV5; -.
DR OrthoDB; 841019at2759; -.
DR TreeFam; TF316413; -.
DR Proteomes; UP000008227; Unplaced.
DR Proteomes; UP000314985; Chromosome 7.
DR Genevisible; Q9TSV5; SS.
DR GO; GO:0000785; C:chromatin; ISS:AgBase.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000987; F:cis-regulatory region sequence-specific DNA binding; ISS:AgBase.
DR GO; GO:0019955; F:cytokine binding; ISS:AgBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0010629; P:negative regulation of gene expression; ISS:AgBase.
DR GO; GO:0010628; P:positive regulation of gene expression; ISS:AgBase.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.260.40; -; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR InterPro; IPR013847; POU.
DR InterPro; IPR000327; POU_dom.
DR InterPro; IPR015585; POU_dom_5.
DR PANTHER; PTHR11636:SF86; PTHR11636:SF86; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00157; Pou; 1.
DR PRINTS; PR00028; POUDOMAIN.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00352; POU; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR SUPFAM; SSF47413; SSF47413; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00035; POU_1; 1.
DR PROSITE; PS00465; POU_2; 1.
DR PROSITE; PS51179; POU_3; 1.
PE 1: Evidence at protein level;
KW Cytoplasm; Developmental protein; DNA-binding; Homeobox; Isopeptide bond;
KW Nucleus; Phosphoprotein; Reference proteome; Transcription;
KW Transcription regulation; Ubl conjugation.
FT CHAIN 1..360
FT /note="POU domain, class 5, transcription factor 1"
FT /id="PRO_0000100751"
FT DOMAIN 138..212
FT /note="POU-specific"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00530"
FT DNA_BIND 230..289
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 1..50
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 87..118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 180..186
FT /note="DNA-binding"
FT /evidence="ECO:0000250|UniProtKB:P20263"
FT REGION 193..196
FT /note="DNA-binding"
FT /evidence="ECO:0000250|UniProtKB:P20263"
FT REGION 287..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOTIF 4..12
FT /note="9aaTAD"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT BINDING 157
FT /ligand="DNA"
FT /ligand_id="ChEBI:CHEBI:16991"
FT /evidence="ECO:0000250|UniProtKB:P20263"
FT BINDING 164
FT /ligand="DNA"
FT /ligand_id="ChEBI:CHEBI:16991"
FT /evidence="ECO:0000250|UniProtKB:P20263"
FT MOD_RES 111
FT /note="Phosphoserine; by MAPK"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT MOD_RES 235
FT /note="Phosphothreonine"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT MOD_RES 236
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT MOD_RES 289
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT MOD_RES 290
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q01860"
FT MOD_RES 355
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:P20263"
FT CROSSLNK 123
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO)"
FT /evidence="ECO:0000250|UniProtKB:P20263"
SQ SEQUENCE 360 AA; 38342 MW; 9BEC167A66409A9C CRC64;
MAGHLASDFA FSPPPGGGGD GPGGPEPGWV DPRTWLSFQG PPGGSGIGPG VGPGAEVWGL
PACPPPYDFC GGMAYCAPQV GVGLVPQGGL ETPQPEGEAG AGVESNSEGA SPEPCAAPAG
AAKLDKEKLE PNPEESQDIK ALQKDLEQFA KLLKQKRITL GYTQADVGLT LGVLFGKVFS
QTTICRFEAL QLSFKNMCKL RPLLQKWVEE ADNNENLQEI CKAETLVQAR KRKRTSIENR
VRGNLESMFL QCPKPTLQQI SHIAQQLGLE KDVVRVWFCN RRQKGKRSSS DYSQREDFEA
AGSPFPGGPV SFPLAPGPHF GTPGYGGPHF TTLYSSVPFP EGEAFPSVSV TPLGSPMHSN