ID Y1239_ARATH Reviewed; 373 AA.
AC Q9LR97; B3H657; Q2V4L6; Q3E7C4; Q94K54;
DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot.
DT 01-OCT-2000, sequence version 1.
DT 03-AUG-2022, entry version 91.
DE RecName: Full=UPF0725 protein At1g23950;
GN OrderedLocusNames=At1g23950; ORFNames=T23E23.13;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC STRAIN=cv. Columbia; TISSUE=Callus, and Flower bud;
RX PubMed=14993207; DOI=10.1101/gr.1515604;
RA Castelli V., Aury J.-M., Jaillon O., Wincker P., Clepet C., Menard M.,
RA Cruaud C., Quetier F., Scarpelli C., Schaechter V., Temple G., Caboche M.,
RA Weissenbach J., Salanoubat M.;
RT "Whole genome sequence comparisons and 'full-length' cDNA sequences: a
RT combined approach to evaluate and improve Arabidopsis genome annotation.";
RL Genome Res. 14:406-413(2004).
RN [5]
RP ACETYLATION [LARGE SCALE ANALYSIS] AT THR-2, CLEAVAGE OF INITIATOR
RP METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS SPECTROMETRY
RP [LARGE SCALE ANALYSIS].
RX PubMed=22223895; DOI=10.1074/mcp.m111.015131;
RA Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., Meinnel T.,
RA Giglione C.;
RT "Comparative large-scale characterisation of plant vs. mammal proteins
RT reveals similar and idiosyncratic N-alpha acetylation features.";
RL Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=5;
CC Name=1;
CC IsoId=Q9LR97-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q9LR97-2; Sequence=VSP_036248;
CC Name=3;
CC IsoId=Q9LR97-3; Sequence=VSP_036249;
CC Name=4;
CC IsoId=Q9LR97-4; Sequence=VSP_036252, VSP_036253;
CC Name=5;
CC IsoId=Q9LR97-5; Sequence=VSP_036250, VSP_036251;
CC -!- SIMILARITY: Belongs to the UPF0725 (EMB2204) family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BX813888; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC Sequence=BX815042; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC Sequence=BX816706; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC002423; AAF87155.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30453.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30454.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30455.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30456.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE30457.1; -; Genomic_DNA.
DR EMBL; AF370288; AAK44103.1; -; mRNA.
DR EMBL; AY063057; AAL34231.1; -; mRNA.
DR EMBL; BX813888; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BX815042; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; BX816706; -; NOT_ANNOTATED_CDS; mRNA.
DR RefSeq; NP_001031085.2; NM_001036008.2. [Q9LR97-5]
DR RefSeq; NP_001077591.1; NM_001084122.1. [Q9LR97-3]
DR RefSeq; NP_564210.2; NM_102242.3. [Q9LR97-4]
DR RefSeq; NP_973904.1; NM_202175.3. [Q9LR97-1]
DR RefSeq; NP_973905.1; NM_202176.2. [Q9LR97-2]
DR AlphaFoldDB; Q9LR97; -.
DR SMR; Q9LR97; -.
DR iPTMnet; Q9LR97; -.
DR PaxDb; Q9LR97; -.
DR PRIDE; Q9LR97; -.
DR ProteomicsDB; 243183; -. [Q9LR97-1]
DR EnsemblPlants; AT1G23950.1; AT1G23950.1; AT1G23950. [Q9LR97-4]
DR EnsemblPlants; AT1G23950.2; AT1G23950.2; AT1G23950. [Q9LR97-1]
DR EnsemblPlants; AT1G23950.3; AT1G23950.3; AT1G23950. [Q9LR97-2]
DR EnsemblPlants; AT1G23950.4; AT1G23950.4; AT1G23950. [Q9LR97-5]
DR EnsemblPlants; AT1G23950.5; AT1G23950.5; AT1G23950. [Q9LR97-3]
DR GeneID; 839006; -.
DR Gramene; AT1G23950.1; AT1G23950.1; AT1G23950. [Q9LR97-4]
DR Gramene; AT1G23950.2; AT1G23950.2; AT1G23950. [Q9LR97-1]
DR Gramene; AT1G23950.3; AT1G23950.3; AT1G23950. [Q9LR97-2]
DR Gramene; AT1G23950.4; AT1G23950.4; AT1G23950. [Q9LR97-5]
DR Gramene; AT1G23950.5; AT1G23950.5; AT1G23950. [Q9LR97-3]
DR KEGG; ath:AT1G23950; -.
DR Araport; AT1G23950; -.
DR TAIR; locus:2199867; AT1G23950.
DR InParanoid; Q9LR97; -.
DR OMA; EQNANCL; -.
DR PhylomeDB; Q9LR97; -.
DR PRO; PR:Q9LR97; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q9LR97; baseline and differential.
DR Genevisible; Q9LR97; AT.
DR InterPro; IPR006462; MS5.
DR PANTHER; PTHR31260; PTHR31260; 2.
DR Pfam; PF04776; protein_MS5; 1.
DR TIGRFAMs; TIGR01572; A_thl_para_3677; 2.
PE 1: Evidence at protein level;
KW Acetylation; Alternative splicing; Reference proteome.
FT INIT_MET 1
FT /note="Removed"
FT /evidence="ECO:0007744|PubMed:22223895"
FT CHAIN 2..373
FT /note="UPF0725 protein At1g23950"
FT /id="PRO_0000363130"
FT MOD_RES 2
FT /note="N-acetylthreonine"
FT /evidence="ECO:0007744|PubMed:22223895"
FT VAR_SEQ 146..150
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14993207"
FT /id="VSP_036248"
FT VAR_SEQ 151..180
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000305"
FT /id="VSP_036249"
FT VAR_SEQ 236..252
FT /note="KESEWQATDWISMYLEL -> RNQSGKPLIGFLCIWNL (in isoform
FT 5)"
FT /evidence="ECO:0000305"
FT /id="VSP_036250"
FT VAR_SEQ 253..373
FT /note="Missing (in isoform 5)"
FT /evidence="ECO:0000305"
FT /id="VSP_036251"
FT VAR_SEQ 264..273
FT /note="KPEVLSKLEI -> VSFTYIYIN (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:14593172"
FT /id="VSP_036252"
FT VAR_SEQ 274..373
FT /note="Missing (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:14593172"
FT /id="VSP_036253"
SQ SEQUENCE 373 AA; 41825 MW; 0CB12126362B07D1 CRC64;
MTTEANSTAE EGYSVQRDFW RQAAKSDGFD LENISLPPGT NGIVMGLIPY DCQRARHYPF
PVLVKLYAKF GLHRYNMLKG TSFQLATLMK FNMLPNYISS FYMTLLAHDP DPAAGSSQKT
FQVRVDEQQF GSLDINCSIA RPKHEGDLLE VSTETPFMPH FHGGALGDGI FKVELPDCLS
DTALNELAGA VLRGELPEHV FDDALYARAG GIFQGELPDW PSDDVLNDGK RFYMVKESEW
QATDWISMYL ELVITTTDKS ISIKPEVLSK LEIVKVAIET ATKDEEPSNE RLKAYRAHVY
ITFKGLAEPR AHERVFEIGE HVERQAIVRR VMGHRGDLTL KGKLCGGQYI KKRSLALKSG
KKSQKCKKQA LVG