ID U2A2A_ORYSJ Reviewed; 574 AA.
AC Q2R0Q1; B9G8F7; Q84P67;
DT 14-OCT-2008, integrated into UniProtKB/Swiss-Prot.
DT 11-JUL-2006, sequence version 2.
DT 25-MAY-2022, entry version 107.
DE RecName: Full=Splicing factor U2af large subunit A;
DE AltName: Full=U2 auxiliary factor 65 kDa subunit A;
DE AltName: Full=U2 small nuclear ribonucleoprotein auxiliary factor large subunit A;
DE Short=U2 snRNP auxiliary factor large subunit A;
GN Name=U2AF65A; OrderedLocusNames=Os11g0636900, LOC_Os11g41820;
GN ORFNames=OsJ_34542 {ECO:0000312|EMBL:EEE52422.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=16188032; DOI=10.1186/1741-7007-3-20;
RG The rice chromosomes 11 and 12 sequencing consortia;
RT "The sequence of rice chromosomes 11 and 12, rich in disease resistance
RT genes and recent gene duplications.";
RL BMC Biol. 3:20-20(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare;
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
RN [4]
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare;
RX PubMed=24280374; DOI=10.1186/1939-8433-6-4;
RA Kawahara Y., de la Bastide M., Hamilton J.P., Kanamori H., McCombie W.R.,
RA Ouyang S., Schwartz D.C., Tanaka T., Wu J., Zhou S., Childs K.L.,
RA Davidson R.M., Lin H., Quesada-Ocampo L., Vaillancourt B., Sakai H.,
RA Lee S.S., Kim J., Numa H., Itoh T., Buell C.R., Matsumoto T.;
RT "Improvement of the Oryza sativa Nipponbare reference genome using next
RT generation sequence and optical map data.";
RL Rice 6:4-4(2013).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038;
RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S.,
RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., Cong L.,
RA Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., Wang J.,
RA Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., Wang J., Wang X.,
RA Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., Zhang Z., Bao J., Han Y.,
RA Dong L., Ji J., Chen P., Wu S., Liu J., Xiao Y., Bu D., Tan J., Yang L.,
RA Ye C., Zhang J., Xu J., Zhou Y., Yu Y., Zhang B., Zhuang S., Wei H.,
RA Liu B., Lei M., Yu H., Li Y., Xu H., Wei S., He X., Fang L., Zhang Z.,
RA Zhang Y., Huang X., Su Z., Tong W., Li J., Tong Z., Li S., Ye J., Wang L.,
RA Fang L., Lei T., Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F.,
RA Xu H., Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q.,
RA Li W., Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J.,
RA Gao L., Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M.,
RA McDermott J., Samudrala R., Wang J., Wong G.K.-S., Yang H.;
RT "The genomes of Oryza sativa: a history of duplications.";
RL PLoS Biol. 3:266-281(2005).
RN [6]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Nipponbare;
RX PubMed=12869764; DOI=10.1126/science.1081288;
RG The rice full-length cDNA consortium;
RT "Collection, mapping, and annotation of over 28,000 cDNA clones from
RT japonica rice.";
RL Science 301:376-379(2003).
RN [7]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 219-536.
RC STRAIN=cv. Nipponbare;
RX PubMed=12684538; DOI=10.1073/pnas.0737574100;
RA Cooper B., Clarke J.D., Budworth P., Kreps J., Hutchison D., Park S.,
RA Guimil S., Dunn M., Luginbuehl P., Ellero C., Goff S.A., Glazebrook J.;
RT "A network of rice genes associated with stress response and seed
RT development.";
RL Proc. Natl. Acad. Sci. U.S.A. 100:4945-4950(2003).
CC -!- FUNCTION: Necessary for the splicing of pre-mRNA. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- DOMAIN: N-terminal RS domain has a very strong bias in favor of D over
CC S.
CC -!- SIMILARITY: Belongs to the splicing factor SR family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAO72620.1; Type=Frameshift; Evidence={ECO:0000305};
CC Sequence=AAO72620.1; Type=Miscellaneous discrepancy; Note=Sequencing errors.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000010; ABA94914.2; -; Genomic_DNA.
DR EMBL; AP008217; BAF28693.1; -; Genomic_DNA.
DR EMBL; AP014967; BAT14972.1; -; Genomic_DNA.
DR EMBL; CM000148; EEE52422.1; -; Genomic_DNA.
DR EMBL; AK073768; -; NOT_ANNOTATED_CDS; mRNA.
DR EMBL; AY224501; AAO72620.1; ALT_SEQ; mRNA.
DR RefSeq; XP_015617576.1; XM_015762090.1.
DR AlphaFoldDB; Q2R0Q1; -.
DR SMR; Q2R0Q1; -.
DR IntAct; Q2R0Q1; 1.
DR STRING; 4530.OS11T0636900-02; -.
DR PaxDb; Q2R0Q1; -.
DR PRIDE; Q2R0Q1; -.
DR EnsemblPlants; Os11t0636900-02; Os11t0636900-02; Os11g0636900.
DR GeneID; 4350981; -.
DR Gramene; Os11t0636900-02; Os11t0636900-02; Os11g0636900.
DR KEGG; osa:4350981; -.
DR eggNOG; KOG0120; Eukaryota.
DR HOGENOM; CLU_021795_4_1_1; -.
DR InParanoid; Q2R0Q1; -.
DR OMA; MTQWDIK; -.
DR OrthoDB; 896650at2759; -.
DR Proteomes; UP000000763; Chromosome 11.
DR Proteomes; UP000007752; Chromosome 11.
DR Proteomes; UP000059680; Chromosome 11.
DR ExpressionAtlas; Q2R0Q1; baseline and differential.
DR Genevisible; Q2R0Q1; OS.
DR GO; GO:0000243; C:commitment complex; IBA:GO_Central.
DR GO; GO:0016607; C:nuclear speck; IBA:GO_Central.
DR GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR GO; GO:0089701; C:U2AF complex; IBA:GO_Central.
DR GO; GO:0008187; F:poly-pyrimidine tract binding; IBA:GO_Central.
DR GO; GO:0030628; F:pre-mRNA 3'-splice site binding; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 3.30.70.330; -; 3.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR003954; RRM_dom_euk.
DR InterPro; IPR006529; U2AF_lg.
DR Pfam; PF00076; RRM_1; 1.
DR SMART; SM00360; RRM; 3.
DR SMART; SM00361; RRM_1; 2.
DR SUPFAM; SSF54928; SSF54928; 2.
DR TIGRFAMs; TIGR01642; U2AF_lg; 1.
DR PROSITE; PS50102; RRM; 2.
PE 2: Evidence at transcript level;
KW mRNA processing; mRNA splicing; Nucleus; Reference proteome; Repeat;
KW RNA-binding.
FT CHAIN 1..574
FT /note="Splicing factor U2af large subunit A"
FT /id="PRO_0000352271"
FT DOMAIN 238..321
FT /note="RRM 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT DOMAIN 358..436
FT /note="RRM 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT DOMAIN 479..565
FT /note="RRM 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT REGION 1..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 141..170
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 101
FT /note="R -> S (in Ref. 6; AK073768)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 574 AA; 63402 MW; 82F9954912BF5730 CRC64;
MAEHEEQPYE GNGNGGDPAP ASAYAEYPAP EGSPPAAAAK PTGFSDGATD GGRSQHETQP
HDGRSSKSRE RERERDKDKE RDRDRDRDRR DRDRGDKDRD RDRHREHRDR SERREHHDRE
RSDDRDRRRG HDSERRRDRD RDGHRRHRSR SRSPSKGRDR RSRSRSRSRS SKRVSGFDQG
PQAAIPALAA GAAPGQVPVV APAISGMLPN MFNLTQTPFT PLVIQPQAMT QQATRHARRV
YVGGLPPTAN EHTVAVYFNQ VMAAVGGNTA GPGDAVLNVY INHDKKFAFV EMRSVEEASN
AMALDGIMFE GAPVKVRRPT DYNPSLAAAL GPSQPNPNLN LAAVGLTPGS AGGLEGPDRI
FVGGLPYYFT EAQVRELLES FGPLRGFDLV KDRETGNSKG YAFCVYQDLN VTDIACAALN
GIKMGDKTLT VRRANQGASQ PRPEQESMLL HVQQQAQMQK LMFQVGGGAL PTKVVCLTQV
VSPDELRDDE EYEDIVQDMR EEGCRYGNLV KVVIPRPDPS GAPVAGVGRV FLEFADVESS
TKAKNGMHGR KFANNQVVAV FYPEDKFAEG QYDG