Y4837_ARATH
ID Y4837_ARATH Reviewed; 606 AA.
AC P58223; O49507; Q8H0Y7;
DT 14-AUG-2001, integrated into UniProtKB/Swiss-Prot.
DT 14-AUG-2001, sequence version 1.
DT 03-AUG-2022, entry version 134.
DE RecName: Full=KH domain-containing protein At4g18375;
GN OrderedLocusNames=At4g18375; ORFNames=F28J12.2;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617198; DOI=10.1038/47134;
RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B.,
RA Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M.,
RA de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M.,
RA Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D.,
RA Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J.,
RA Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B.,
RA Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J.,
RA Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R.,
RA Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M.,
RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J.,
RA Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S.,
RA Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A.,
RA Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M.,
RA Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D.,
RA Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E.,
RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S.,
RA Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R.,
RA Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W.,
RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H.,
RA Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=P58223-1; Sequence=Displayed;
CC Name=2;
CC IsoId=P58223-2; Sequence=VSP_008899, VSP_008900;
CC -!- MISCELLANEOUS: [Isoform 2]: May be due to a competing acceptor site.
CC {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=CAA16717.1; Type=Erroneous gene model prediction; Note=The predicted gene At4g18370 has been split into 2 genes: At4g18375 and At4g18370.; Evidence={ECO:0000305};
CC Sequence=CAB78839.1; Type=Erroneous gene model prediction; Note=The predicted gene At4g18370 has been split into 2 genes: At4g18375 and At4g18370.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL021710; CAA16717.1; ALT_SEQ; Genomic_DNA.
DR EMBL; AL161548; CAB78839.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002687; AEE84036.1; -; Genomic_DNA.
DR EMBL; AY133701; AAM91635.1; -; mRNA.
DR EMBL; BT001108; AAN64172.1; -; mRNA.
DR RefSeq; NP_193572.1; NM_117948.5. [P58223-1]
DR AlphaFoldDB; P58223; -.
DR SMR; P58223; -.
DR STRING; 3702.AT4G18375.2; -.
DR MEROPS; S01.441; -.
DR PaxDb; P58223; -.
DR PRIDE; P58223; -.
DR ProteomicsDB; 242886; -. [P58223-1]
DR EnsemblPlants; AT4G18375.2; AT4G18375.2; AT4G18375. [P58223-1]
DR GeneID; 827566; -.
DR Gramene; AT4G18375.2; AT4G18375.2; AT4G18375. [P58223-1]
DR KEGG; ath:AT4G18375; -.
DR Araport; AT4G18375; -.
DR TAIR; locus:2831364; AT4G18375.
DR eggNOG; KOG2190; Eukaryota.
DR HOGENOM; CLU_018025_0_0_1; -.
DR InParanoid; P58223; -.
DR PhylomeDB; P58223; -.
DR PRO; PR:P58223; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; P58223; baseline and differential.
DR Genevisible; P58223; AT.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0010468; P:regulation of gene expression; IBA:GO_Central.
DR Gene3D; 3.30.1370.10; -; 5.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR Pfam; PF00013; KH_1; 5.
DR SMART; SM00322; KH; 5.
DR SUPFAM; SSF54791; SSF54791; 5.
DR PROSITE; PS50084; KH_TYPE_1; 5.
PE 2: Evidence at transcript level;
KW Alternative splicing; Nucleus; Reference proteome; Repeat; RNA-binding.
FT CHAIN 1..606
FT /note="KH domain-containing protein At4g18375"
FT /id="PRO_0000050164"
FT DOMAIN 35..99
FT /note="KH 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT DOMAIN 138..210
FT /note="KH 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT DOMAIN 311..380
FT /note="KH 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT DOMAIN 394..455
FT /note="KH 4"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT DOMAIN 535..599
FT /note="KH 5"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 532
FT /note="L -> F (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14593172"
FT /id="VSP_008899"
FT VAR_SEQ 533..606
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14593172"
FT /id="VSP_008900"
SQ SEQUENCE 606 AA; 65761 MW; 61F135BBB8647C0C CRC64;
MVERKKRKQI QRNNSESNRN QKRRISHDKI NRDELVVYRI LCPIDVVGGV IGKSGKVINA
IRHNTKAKIK VFDQLHGCSQ RVITIYCSVK EKQEEIGFTK SENEPLCCAQ DALLKVYDAI
VASDEENNTK TNVDRDDNKE CRLLVPFSQS SSLIGKAGEN IKRIRRRTRA SVKVVSKDVS
DPSHVCAMEY DNVVVISGEP ESVKQALFAV SAIMYKINPR ENIPLDSTSQ DVPAASVIVP
SDLSNSVYPQ TGFYSNQDHI LQQGAGVPSY FNALSVSDFQ GYAETAANPV PVFASSLPVT
HGFGGSSRSE ELVFKVLCPL CNIMRVIGKG GSTIKRIREA SGSCIEVNDS RTKCGDDECV
IIVTATESPD DMKSMAVEAV LLLQEYINDE DAENVKMQLL VSSKVIGCVI GKSGSVINEI
RKRTNANICI SKGKKDDLVE VSGEVSSVRD ALIQIVLRLR EDVLGDKDSV ATRKPPARTD
NCSFLSGSSN AGYTLPSFMS SMASTSGFHG YGSFPAGDNV LGSTGPYSYG RLPSSSALEI
LIPAHAMSKV MGKGGGNLEN IRRISGAMIE ISASKTSHGD HIALLSGTLE QMRCAENLVQ
AFVMST
//