MAT2_EUGGR
ID MAT2_EUGGR Reviewed; 758 AA.
AC P31916; P31917; Q8SN99;
DT 01-JUL-1993, integrated into UniProtKB/Swiss-Prot.
DT 15-MAR-2004, sequence version 3.
DT 25-MAY-2022, entry version 42.
DE RecName: Full=Maturase-like protein 2;
GN Name=mat2;
OS Euglena gracilis.
OG Plastid; Chloroplast.
OC Eukaryota; Discoba; Euglenozoa; Euglenida; Spirocuta; Euglenophyceae;
OC Euglenales; Euglenaceae; Euglena.
OX NCBI_TaxID=3039;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Z / UTEX 753;
RX PubMed=8346031; DOI=10.1093/nar/21.15.3537;
RA Hallick R.B., Hong L., Drager R.G., Favreau M.R., Monfort A., Orsat B.,
RA Spielmann A., Stutz E.;
RT "Complete sequence of Euglena gracilis chloroplast DNA.";
RL Nucleic Acids Res. 21:3537-3544(1993).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=Z / UTEX 753;
RX PubMed=8595563;
RA Zhang L., Jenkins K.P., Stutz E., Hallick R.B.;
RT "The Euglena gracilis intron-encoded mat2 locus is interrupted by three
RT additional group II introns.";
RL RNA 1:1079-1088(1995).
CC -!- SUBCELLULAR LOCATION: Plastid, chloroplast.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z11874; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; X70810; CAC69146.1; -; Genomic_DNA.
DR PIR; S34499; S34499.
DR PIR; S34500; S34500.
DR AlphaFoldDB; P31916; -.
DR GO; GO:0009507; C:chloroplast; IEA:UniProtKB-SubCell.
PE 4: Predicted;
KW Chloroplast; Plastid.
FT CHAIN 1..758
FT /note="Maturase-like protein 2"
FT /id="PRO_0000217279"
SQ SEQUENCE 758 AA; 93025 MW; 9327463EA1FB3DB2 CRC64;
MYQSNLNLQK KVTSDLLYYS WLSLKCGWKY FFEIHNYCLF NSISRSWFKR TSSLIKKGFF
IYPTVPLKIK NFFLSCTKKN FNLLKFKIVE NAFLIIIKNF FIYKIYVQSM NLIECIFNVT
SFSFMKPFFC KQCPNLLFSL TFFPFFKNDF LQKKKYFNSN KNFVKNFFSN EYFFSSSNSF
FSIKFWDSQI KNFMTLKIIK LFDYVHKVRL KNIFSKFTYD SFFFVEIDKM FNLNLVNISS
NLIYNSIENF GCSLLSSFFL NLYILEGDFF LDRFIFKICF KRNLFKTFFS FKKVSFFYQY
SLKNFIPLRL EKNFFVSSFL KEVNSGKYHN IDIFYLFNNK VFTVYEKNIY YVRYLNFLIF
GFLSSKNFIF FFKLKYLFFL RNKLYFNFRE VQIFSSSNDK VIFLGVYIAY NKIYNFFEKL
RVNKKYFLNV FQKIITKHNT FLKALKNIFH YSRCFNFFKK NCYPNFYKKK NFSFFNFYTE
SFRILKFFDA FFINTHHFFL PVELITSTKL VNFQKYTMYS FDFYNQKLSI LLKDILENFN
QFLSCSLVSI DLNLYNCLFE FKKHLVLLYN YYSPVYSFFS KRQRYKLNFN SFNYYSFSNF
SGQNRSFFSS KANQFRFFKF FVPFKVFLKK LRLLGFIHPF KFRPIGNVRL LLFEDKFILR
NFGFFVYSVL NWFSICENFS HLRFFVELIR ESCFLTLCRK HNKMKLWSYS VYTFDLVFSK
SVYRTISFFP TRKFIFNLKR KSFLVDVRFN LDETIFLE