SUGP1_ARATH
ID SUGP1_ARATH Reviewed; 443 AA.
AC Q94C11; Q9SUZ3;
DT 24-JAN-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-DEC-2001, sequence version 1.
DT 25-MAY-2022, entry version 111.
DE RecName: Full=SURP and G-patch domain-containing protein 1-like protein;
DE AltName: Full=Splicing factor 4-like protein;
DE Short=SF4-like protein;
GN OrderedLocusNames=At3g52120; ORFNames=F4F15.230;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130713; DOI=10.1038/35048706;
RA Salanoubat M., Lemcke K., Rieger M., Ansorge W., Unseld M., Fartmann B.,
RA Valle G., Bloecker H., Perez-Alonso M., Obermaier B., Delseny M.,
RA Boutry M., Grivell L.A., Mache R., Puigdomenech P., De Simone V.,
RA Choisne N., Artiguenave F., Robert C., Brottier P., Wincker P.,
RA Cattolico L., Weissenbach J., Saurin W., Quetier F., Schaefer M.,
RA Mueller-Auer S., Gabel C., Fuchs M., Benes V., Wurmbach E., Drzonek H.,
RA Erfle H., Jordan N., Bangert S., Wiedelmann R., Kranz H., Voss H.,
RA Holland R., Brandt P., Nyakatura G., Vezzi A., D'Angelo M., Pallavicini A.,
RA Toppo S., Simionati B., Conrad A., Hornischer K., Kauer G., Loehnert T.-H.,
RA Nordsiek G., Reichelt J., Scharfe M., Schoen O., Bargues M., Terol J.,
RA Climent J., Navarro P., Collado C., Perez-Perez A., Ottenwaelder B.,
RA Duchemin D., Cooke R., Laudie M., Berger-Llauro C., Purnelle B., Masuy D.,
RA de Haan M., Maarse A.C., Alcaraz J.-P., Cottet A., Casacuberta E.,
RA Monfort A., Argiriou A., Flores M., Liguori R., Vitale D., Mannhaupt G.,
RA Haase D., Schoof H., Rudd S., Zaccaria P., Mewes H.-W., Mayer K.F.X.,
RA Kaul S., Town C.D., Koo H.L., Tallon L.J., Jenkins J., Rooney T., Rizzo M.,
RA Walts A., Utterback T., Fujii C.Y., Shea T.P., Creasy T.H., Haas B.,
RA Maiti R., Wu D., Peterson J., Van Aken S., Pai G., Militscher J.,
RA Sellers P., Gill J.E., Feldblyum T.V., Preuss D., Lin X., Nierman W.C.,
RA Salzberg S.L., White O., Venter J.C., Fraser C.M., Kaneko T., Nakamura Y.,
RA Sato S., Kato T., Asamizu E., Sasamoto S., Kimura T., Idesawa K.,
RA Kawashima K., Kishida Y., Kiyokawa C., Kohara M., Matsumoto M., Matsuno A.,
RA Muraki A., Nakayama S., Nakazaki N., Shinpo S., Takeuchi C., Wada T.,
RA Watanabe A., Yamada M., Yasuda M., Tabata S.;
RT "Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana.";
RL Nature 408:820-822(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M.,
RA Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G.,
RA Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L.,
RA Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C.,
RA Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J.,
RA Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A.,
RA Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C.,
RA Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X.,
RA Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M.,
RA Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K.,
RA Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A.,
RA Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
CC -!- INTERACTION:
CC Q94C11; O23160: MYB73; NbExp=3; IntAct=EBI-25516347, EBI-25506855;
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=1;
CC Comment=A number of isoforms are produced. According to EST
CC sequences.;
CC Name=1;
CC IsoId=Q94C11-1; Sequence=Displayed;
CC -!- SEQUENCE CAUTION:
CC Sequence=CAB41332.1; Type=Erroneous gene model prediction; Note=The predicted gene At3g52120 has been split into 2 genes: At3g52115 and At3g52120.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL049711; CAB41332.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002686; AEE78896.1; -; Genomic_DNA.
DR EMBL; AY037259; AAK59860.1; -; mRNA.
DR EMBL; AY094004; AAM16265.1; -; mRNA.
DR PIR; T49091; T49091.
DR RefSeq; NP_566957.1; NM_115071.4. [Q94C11-1]
DR AlphaFoldDB; Q94C11; -.
DR SMR; Q94C11; -.
DR BioGRID; 9694; 2.
DR IntAct; Q94C11; 1.
DR STRING; 3702.AT3G52120.1; -.
DR PaxDb; Q94C11; -.
DR PRIDE; Q94C11; -.
DR ProteomicsDB; 228282; -. [Q94C11-1]
DR EnsemblPlants; AT3G52120.1; AT3G52120.1; AT3G52120. [Q94C11-1]
DR GeneID; 824376; -.
DR Gramene; AT3G52120.1; AT3G52120.1; AT3G52120. [Q94C11-1]
DR KEGG; ath:AT3G52120; -.
DR Araport; AT3G52120; -.
DR TAIR; locus:2083750; AT3G52120.
DR eggNOG; KOG0965; Eukaryota.
DR HOGENOM; CLU_624709_0_0_1; -.
DR PhylomeDB; Q94C11; -.
DR PRO; PR:Q94C11; -.
DR Proteomes; UP000006548; Chromosome 3.
DR ExpressionAtlas; Q94C11; baseline and differential.
DR Genevisible; Q94C11; AT.
DR GO; GO:0005654; C:nucleoplasm; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 1.10.10.790; -; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; PTHR23340; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 1.
DR SUPFAM; SSF109905; SSF109905; 1.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; mRNA processing; mRNA splicing; Nucleus;
KW Reference proteome.
FT CHAIN 1..443
FT /note="SURP and G-patch domain-containing protein 1-like
FT protein"
FT /id="PRO_0000097704"
FT REPEAT 142..185
FT /note="SURP motif"
FT DOMAIN 360..407
FT /note="G-patch"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00092"
FT REGION 45..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 83..141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 198..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 241..272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 285..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..122
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 200..221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..264
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 443 AA; 48693 MW; B293DB80B299D647 CRC64;
MDKGAPPSIF VNDGSFMERF RQLQQEKDKD KDKVVQVEDS KPVKIISNPK PAANKISIGL
KPNDAQKKGG KLAFSLKQKS KLLAPPVKLG TEEDEDDEDV KHEQGFGSVK RQKLEQRDTP
VKSAKVSDVA PPPPSDPTVK KVADKLASFV AKHGRPFEHI TRQKNPGDTP FKFLFDENCA
DYKYYVFRLA EEEKLISQTK DSGVLHSGDA GSRTSTAAIP LQKPAYQQTG YQIPASALYD
TPVEPGASSR SAQASITRPS DSDSFSGPRG ADPLSMMEFY MKKAAQEEKM RRPRQSKDEM
PPPASLQGPS ETSSTDPGKR GHHMGDYIPL EELDKFLSKC NDAAAQKATK EAAEKAKIQA
DNVGHKLLSK MGWKEGEGIG SSRKGMADPI MAGDVKTNNL GVGASAPGEV KPEDDIYEQY
KKRMMLGYKH RPNPLGNPRK AYY
//