MSGN1_CHICK
ID MSGN1_CHICK Reviewed; 159 AA.
AC Q9DEQ9;
DT 29-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2001, sequence version 1.
DT 03-AUG-2022, entry version 111.
DE RecName: Full=Mesogenin-1;
DE AltName: Full=Protein cMespo;
GN Name=MSGN1; Synonyms=MESPO;
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Phasianinae; Gallus.
OX NCBI_TaxID=9031;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, AND DEVELOPMENTAL STAGE.
RC TISSUE=Embryo;
RX PubMed=11025230; DOI=10.1016/s0925-4773(00)00424-x;
RA Buchberger A., Bonneick S., Arnold H.-H.;
RT "Expression of the novel basic-helix-loop-helix transcription factor cMespo
RT in presomitic mesoderm of chicken embryos.";
RL Mech. Dev. 97:223-226(2000).
CC -!- FUNCTION: Involved in specifying the paraxial, but not dorsal,
CC mesoderm. May regulate the expression of T-box transcription factors
CC required for mesoderm formation and differentiation (By similarity).
CC {ECO:0000250, ECO:0000269|PubMed:11025230}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00981}.
CC -!- DEVELOPMENTAL STAGE: Expressed in the presomitic mesoderm preceding the
CC formation of somites. At stage 4 expressed in and around the primitive
CC streak. During subsequent development, expression domain persists, and
CC gradually retracts in parallel to the retracting Hensen's node towards
CC the caudal end. Expression begins to accumulate in gastrulating
CC mesoderm and is later restricted to paraxial mesoderm, prior to the
CC onset of somite formation. No expression is seen within somites, nor in
CC the tailbud mesoderm. {ECO:0000269|PubMed:11025230}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJ292363; CAC15548.1; -; mRNA.
DR RefSeq; NP_990015.2; NM_204684.2.
DR AlphaFoldDB; Q9DEQ9; -.
DR SMR; Q9DEQ9; -.
DR STRING; 9031.ENSGALP00000026522; -.
DR PaxDb; Q9DEQ9; -.
DR Ensembl; ENSGALT00000026573; ENSGALP00000026522; ENSGALG00000016469.
DR GeneID; 395419; -.
DR KEGG; gga:395419; -.
DR CTD; 343930; -.
DR VEuPathDB; HostDB:geneid_395419; -.
DR eggNOG; KOG4029; Eukaryota.
DR GeneTree; ENSGT00530000063712; -.
DR HOGENOM; CLU_084234_1_0_1; -.
DR InParanoid; Q9DEQ9; -.
DR OMA; SSWDWKN; -.
DR OrthoDB; 1493381at2759; -.
DR PhylomeDB; Q9DEQ9; -.
DR TreeFam; TF325707; -.
DR PRO; PR:Q9DEQ9; -.
DR Proteomes; UP000000539; Chromosome 3.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0001707; P:mesoderm formation; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR Gene3D; 4.10.280.10; -; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR040259; Mesogenin/MesP.
DR PANTHER; PTHR20937; PTHR20937; 1.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SUPFAM; SSF47459; SSF47459; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; Differentiation; DNA-binding; Nucleus;
KW Reference proteome; Transcription; Transcription regulation.
FT CHAIN 1..159
FT /note="Mesogenin-1"
FT /id="PRO_0000330030"
FT DOMAIN 95..149
FT /note="bHLH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00981"
FT REGION 79..101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 159 AA; 17469 MW; 0480B647C2099C61 CRC64;
MEDTLGSEHS VCLSSWDWKN TAGAFELHSV SSPHSLSPTP SFESYSSSPC PAAAETPYSG
GGLVGYGLVD FPPAYLPSPG QARLPKGTKV RMSAQRRRKA SEREKLRMRT LADALHTLRN
YLPPAYSQRG QPLTKIQTLK CTIKYISELT ELLNSVKRA