LHX3_HALRO
ID LHX3_HALRO Reviewed; 692 AA.
AC Q25132; C4B8H2;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 05-OCT-2010, sequence version 2.
DT 25-MAY-2022, entry version 97.
DE RecName: Full=LIM/homeobox protein Lhx3;
DE Short=Hr-Lhx3;
DE Short=LIM homeobox protein 3;
DE AltName: Full=LIM/homeobox protein LIM;
DE Short=HrLIM;
GN Name=LHX3; Synonyms=LIM;
OS Halocynthia roretzi (Sea squirt) (Cynthia roretzi).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Stolidobranchia;
OC Pyuridae; Halocynthia.
OX NCBI_TaxID=7729;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A), TISSUE SPECIFICITY, AND
RP DEVELOPMENTAL STAGE.
RC TISSUE=Egg;
RX PubMed=7669687; DOI=10.1016/0925-4773(95)00359-9;
RA Wada S., Katsuyama Y., Yasugi S., Saiga H.;
RT "Spatially and temporally regulated expression of the LIM class homeobox
RT gene Hrlim suggests multiple distinct functions in development of the
RT ascidian, Halocynthia roretzi.";
RL Mech. Dev. 51:115-126(1995).
RN [2]
RP SEQUENCE REVISION TO C-TERMINUS.
RC TISSUE=Egg;
RA Wada S., Saiga H.;
RL Submitted (FEB-2010) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM B), ALTERNATIVE SPLICING, TISSUE
RP SPECIFICITY, AND DEVELOPMENTAL STAGE.
RX PubMed=20123132; DOI=10.1016/j.gep.2010.01.004;
RA Kobayashi M., Takatori N., Nakajima Y., Kumano G., Nishida H., Saiga H.;
RT "Spatial and temporal expression of two transcriptional isoforms of Lhx3, a
RT LIM class homeobox gene, during embryogenesis of two phylogenetically
RT remote ascidians, Halocynthia roretzi and Ciona intestinalis.";
RL Gene Expr. Patterns 10:98-104(2010).
CC -!- FUNCTION: May be involved in the determination of cell lineage of
CC endoderm and notochord cells before gastrulation, and in specification
CC of a distinct group of cells in neural tissue after gastrulation.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=a; Synonyms=Hr-Lhx3a;
CC IsoId=Q25132-1; Sequence=Displayed;
CC Name=b; Synonyms=Hr-Lhx3b;
CC IsoId=Q25132-2; Sequence=VSP_039723;
CC -!- TISSUE SPECIFICITY: Isoform a and isoform b are differentially
CC expressed: isoform a is maternally expressed in the animal half of
CC early cleavage stage embryos, and zygotically in the sensory vesicle
CC and the visceral ganglion lineages from the neurula stage onward. In
CC contrast, isoform b is expressed only zygotically in the endoderm,
CC notochord and mesenchyme lineages during cleavage stage.
CC {ECO:0000269|PubMed:20123132, ECO:0000269|PubMed:7669687}.
CC -!- DEVELOPMENTAL STAGE: Isoform a but not isoform b is expressed
CC maternally. Both isoform a and isoform b are expressed zygotically.
CC {ECO:0000269|PubMed:20123132, ECO:0000269|PubMed:7669687}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D38572; BAA07578.2; -; mRNA.
DR EMBL; AB500704; BAH58772.1; -; mRNA.
DR AlphaFoldDB; Q25132; -.
DR SMR; Q25132; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001781; Znf_LIM.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00412; LIM; 2.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00132; LIM; 2.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50023; LIM_DOMAIN_2; 2.
PE 2: Evidence at transcript level;
KW Alternative splicing; Developmental protein; DNA-binding; Homeobox;
KW LIM domain; Metal-binding; Nucleus; Repeat; Zinc.
FT CHAIN 1..692
FT /note="LIM/homeobox protein Lhx3"
FT /id="PRO_0000075813"
FT DOMAIN 280..339
FT /note="LIM zinc-binding 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00125"
FT DOMAIN 340..402
FT /note="LIM zinc-binding 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00125"
FT DNA_BIND 410..469
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 93..134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 480..509
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..132
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..277
FT /note="MNLMFHQARNNNSFMVAATSNQQHHKTALHVLEDKASGQVYDNQRIIASEQH
FT SYGEYYIDEGKENSIPSESSARDYCGKELNKMASVTKESYVGGRLVDEDEDDYDDEGDR
FT EVYEDDITFDDDDDDENDDDDSDVKMNLRTNVNKLRFDSGFEYTLPTSSPENMTSFNPM
FT SDSSHLFSDYSKHNSTILANGKSGTSYLTSTPRRIYDNTNMINIPEETKNIPTCYNGNN
FT HVPSESIFDNPLSSGANRRMTSPGHKIQPSTCSDMLFTLLAKDNKIEA -> MKNIDMV
FT GGNHICSDGVFTTPCSKLSDLLENNSWEQQKSEKSWRHHPYGTPLGASHNLKKRLEKRR
FT LDSGYSSCSPAASLFDSMCSVVQPPTKCIRFATSPVPINFTSEEEKESSTQKSSDSIFK
FT SLQSYSPVSDASENSPNLIPSIERENFLPPTRDVPEIFDIIWSENPTRGNDTVNAKSNE
FT IDDYSDIMKVLT (in isoform b)"
FT /evidence="ECO:0000303|PubMed:20123132"
FT /id="VSP_039723"
FT CONFLICT 306
FT /note="G -> C (in Ref. 1; BAA07578)"
FT /evidence="ECO:0000305"
FT CONFLICT 366
FT /note="G -> C (in Ref. 1; BAA07578)"
FT /evidence="ECO:0000305"
FT CONFLICT 559..560
FT /note="Missing (in Ref. 1; BAA07578)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 692 AA; 77345 MW; 9F1EA011243339E4 CRC64;
MNLMFHQARN NNSFMVAATS NQQHHKTALH VLEDKASGQV YDNQRIIASE QHSYGEYYID
EGKENSIPSE SSARDYCGKE LNKMASVTKE SYVGGRLVDE DEDDYDDEGD REVYEDDITF
DDDDDDENDD DDSDVKMNLR TNVNKLRFDS GFEYTLPTSS PENMTSFNPM SDSSHLFSDY
SKHNSTILAN GKSGTSYLTS TPRRIYDNTN MINIPEETKN IPTCYNGNNH VPSESIFDNP
LSSGANRRMT SPGHKIQPST CSDMLFTLLA KDNKIEAEIP KCTGCEHRIF DRFILKVQDK
PWHSQGLKCN DCSAQLSEKC FSRGNLVFCK DDFFKRFGTK CTACGHGIPP TEVIRRAQDN
VYHLEGFCCF LCHEKMGTGD QFYLLEDNRL VCKKDYEQAK SRDADIENGV KRPRTTITAK
QLETLKSAYN QSPKPARHVR EQLSSETGLD MRVVQVWFQN RRAKEKRIKR DTGRQRWGHF
FSRNQLPSGP TSPISAPVTT GQKKRSNTKC GNRLIHCKDN QQNSQSHSSP LDGSPVFPIG
SIGTGMQGEI GASFQSGCAV GIEGTSDLPS VMDPKNLPPY SETLMSYTEA NPSHFVHGQN
PSDVTYMNTL PVGGSHVPLT AVPQHNISAF SLPFHMEGNM VPDPSTQFMM ATGHLGSQDT
DLVSNSSGRS NFSDLSASPG SWLGELEHVQ PF