Y4424_ARATH
ID Y4424_ARATH Reviewed; 494 AA.
AC Q67XQ0; O23282; Q66GK0; Q680W1;
DT 27-JUL-2011, integrated into UniProtKB/Swiss-Prot.
DT 11-OCT-2004, sequence version 1.
DT 03-AUG-2022, entry version 101.
DE RecName: Full=DUF21 domain-containing protein At4g14240;
DE AltName: Full=CBS domain-containing protein CBSDUF1;
GN Name=CBSDUF1; OrderedLocusNames=At4g14240; ORFNames=dl3160c, FCAALL.149;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=9461215; DOI=10.1038/35140;
RA Bevan M., Bancroft I., Bent E., Love K., Goodman H.M., Dean C.,
RA Bergkamp R., Dirkse W., van Staveren M., Stiekema W., Drost L., Ridley P.,
RA Hudson S.-A., Patel K., Murphy G., Piffanelli P., Wedler H., Wedler E.,
RA Wambutt R., Weitzenegger T., Pohl T., Terryn N., Gielen J., Villarroel R.,
RA De Clercq R., van Montagu M., Lecharny A., Aubourg S., Gy I., Kreis M.,
RA Lao N., Kavanagh T., Hempel S., Kotter P., Entian K.-D., Rieger M.,
RA Schaefer M., Funk B., Mueller-Auer S., Silvey M., James R., Monfort A.,
RA Pons A., Puigdomenech P., Douka A., Voukelatou E., Milioni D.,
RA Hatzopoulos P., Piravandi E., Obermaier B., Hilbert H., Duesterhoeft A.,
RA Moores T., Jones J.D.G., Eneva T., Palme K., Benes V., Rechmann S.,
RA Ansorge W., Cooke R., Berger C., Delseny M., Voet M., Volckaert G.,
RA Mewes H.-W., Klosterman S., Schueller C., Chalwatzis N.;
RT "Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis
RT thaliana.";
RL Nature 391:485-488(1998).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC STRAIN=cv. Columbia;
RA Cheuk R.F., Chen H., Kim C.J., Shinn P., Ecker J.R.;
RT "Arabidopsis ORF clones.";
RL Submitted (AUG-2004) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y.,
RA Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases.
RN [6]
RP GENE FAMILY, AND NOMENCLATURE.
RX PubMed=19400948; DOI=10.1186/1471-2164-10-200;
RA Kushwaha H.R., Singh A.K., Sopory S.K., Singla-Pareek S.L., Pareek A.;
RT "Genome wide expression analysis of CBS domain containing proteins in
RT Arabidopsis thaliana (L.) Heynh and Oryza sativa L. reveals their
RT developmental and stress regulation.";
RL BMC Genomics 10:200-200(2009).
RN [7]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19376835; DOI=10.1104/pp.109.138677;
RA Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A.,
RA Grossmann J., Gruissem W., Baginsky S.;
RT "Large-scale Arabidopsis phosphoproteome profiling reveals novel
RT chloroplast kinase substrates and phosphorylation networks.";
RL Plant Physiol. 150:889-903(2009).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Multi-pass membrane
CC protein {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q67XQ0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q67XQ0-2; Sequence=VSP_041627;
CC -!- SEQUENCE CAUTION:
CC Sequence=CAB10203.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=CAB78466.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z97335; CAB10203.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161538; CAB78466.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE83399.1; -; Genomic_DNA.
DR EMBL; CP002687; AEE83400.1; -; Genomic_DNA.
DR EMBL; BT015402; AAU05525.1; -; mRNA.
DR EMBL; AK176768; BAD44531.1; -; mRNA.
DR EMBL; AK175696; BAD43459.1; -; mRNA.
DR EMBL; AK175756; BAD43519.1; -; mRNA.
DR EMBL; AK176131; BAD43894.1; -; mRNA.
DR PIR; A71404; A71404.
DR RefSeq; NP_001031633.2; NM_001036556.2. [Q67XQ0-2]
DR RefSeq; NP_193160.3; NM_117501.5. [Q67XQ0-1]
DR AlphaFoldDB; Q67XQ0; -.
DR SMR; Q67XQ0; -.
DR STRING; 3702.AT4G14240.1; -.
DR iPTMnet; Q67XQ0; -.
DR PaxDb; Q67XQ0; -.
DR PRIDE; Q67XQ0; -.
DR ProteomicsDB; 242853; -. [Q67XQ0-1]
DR EnsemblPlants; AT4G14240.1; AT4G14240.1; AT4G14240. [Q67XQ0-1]
DR EnsemblPlants; AT4G14240.2; AT4G14240.2; AT4G14240. [Q67XQ0-2]
DR GeneID; 827065; -.
DR Gramene; AT4G14240.1; AT4G14240.1; AT4G14240. [Q67XQ0-1]
DR Gramene; AT4G14240.2; AT4G14240.2; AT4G14240. [Q67XQ0-2]
DR KEGG; ath:AT4G14240; -.
DR Araport; AT4G14240; -.
DR TAIR; locus:2129540; AT4G14240.
DR eggNOG; KOG2118; Eukaryota.
DR InParanoid; Q67XQ0; -.
DR OMA; QNEMPYA; -.
DR PhylomeDB; Q67XQ0; -.
DR PRO; PR:Q67XQ0; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; Q67XQ0; baseline and differential.
DR Genevisible; Q67XQ0; AT.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0010960; P:magnesium ion homeostasis; IEA:InterPro.
DR CDD; cd04590; CBS_pair_CorC_HlyC_assoc; 1.
DR Gene3D; 3.10.580.10; -; 2.
DR InterPro; IPR045095; ACDP.
DR InterPro; IPR046342; CBS_dom_sf.
DR InterPro; IPR002550; CNNM.
DR InterPro; IPR044751; Ion_transp-like_CBS.
DR PANTHER; PTHR12064; PTHR12064; 1.
DR Pfam; PF01595; DUF21; 1.
DR SUPFAM; SSF54631; SSF54631; 1.
DR PROSITE; PS51371; CBS; 2.
DR PROSITE; PS51846; CNNM; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; CBS domain; Glycoprotein; Membrane;
KW Reference proteome; Repeat; Transmembrane; Transmembrane helix.
FT CHAIN 1..494
FT /note="DUF21 domain-containing protein At4g14240"
FT /id="PRO_0000411678"
FT TOPO_DOM 1..43
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 44..64
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 65..93
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT TRANSMEM 94..114
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 115..121
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT TRANSMEM 122..142
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 143..159
FT /note="Cytoplasmic"
FT /evidence="ECO:0000255"
FT TRANSMEM 160..180
FT /note="Helical"
FT /evidence="ECO:0000255"
FT TOPO_DOM 181..494
FT /note="Extracellular"
FT /evidence="ECO:0000255"
FT DOMAIN 31..213
FT /note="CNNM transmembrane"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU01193"
FT DOMAIN 232..292
FT /note="CBS 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00703"
FT DOMAIN 297..352
FT /note="CBS 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00703"
FT DOMAIN 364..425
FT /note="CBS 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00703"
FT REGION 459..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 459..478
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 350
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 385
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT VAR_SEQ 74..82
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|Ref.4, ECO:0000303|Ref.5"
FT /id="VSP_041627"
FT CONFLICT 134
FT /note="F -> Y (in Ref. 1; CAB10203/CAB78466 and 4;
FT AAU05525)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 494 AA; 53582 MW; BE6A0EF1936654B2 CRC64;
MHLINAVAAA RILSGIGQSN GNNGGEAIPF GSFEWITYAG ISCFLVLFAG IMSGLTLGLM
SLGLVELEIL QRSGTPNEKK QAAAIFPVVQ KQHQLLVTLL LCNAMAMEGL PIYLDKLFNE
YVAIILSVTF VLAFGEVIPQ AICTRYGLAV GANFVWLVRI LMTLCYPIAF PIGKILDLVL
GHNDALFRRA QLKALVSIHS QEAGKGGELT HDETTIISGA LDLTEKTAQE AMTPIESTFS
LDVNSKLDWE AMGKILARGH SRVPVYSGNP KNVIGLLLVK SLLTVRPETE TLVSAVCIRR
IPRVPADMPL YDILNEFQKG SSHMAAVVKV KGKSKVPPST LLEEHTDESN DSDLTAPLLL
KREGNHDNVI VTIDKANGQS FFQNNESGPH GFSHTSEAIE DGEVIGIITL EDVFEELLQE
EIVDETDEYV DVHKRIRVAA AAAASSIARA PSSRKLLAQK GTGGQNKQGQ TNKVPGQEQD
KMLGTITEPI RRNN