位置:首页 > 蛋白库 > PR39B_ARATH
PR39B_ARATH
ID   PR39B_ARATH             Reviewed;        1036 AA.
AC   F4KHG8; Q9FL22;
DT   23-FEB-2022, integrated into UniProtKB/Swiss-Prot.
DT   28-JUN-2011, sequence version 1.
DT   03-AUG-2022, entry version 72.
DE   RecName: Full=Pre-mRNA-processing factor 39-2 {ECO:0000303|PubMed:28971960};
DE            Short=AtPRP39-2 {ECO:0000303|PubMed:28971960};
DE   AltName: Full=PRP39 homolog B {ECO:0000305};
DE   AltName: Full=Pre-mRNA-processing factor 39b {ECO:0000303|PubMed:28971960};
GN   Name=PRP39-2 {ECO:0000303|PubMed:28971960};
GN   Synonyms=PRP39b {ECO:0000303|PubMed:28971960};
GN   OrderedLocusNames=At5g46400 {ECO:0000312|Araport:AT5G46400};
GN   ORFNames=MPL12.20 {ECO:0000312|EMBL:BAB11095.1};
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia;
RX   PubMed=9679202; DOI=10.1093/dnares/5.2.131;
RA   Kaneko T., Kotani H., Nakamura Y., Sato S., Asamizu E., Miyajima N.,
RA   Tabata S.;
RT   "Structural analysis of Arabidopsis thaliana chromosome 5. V. Sequence
RT   features of the regions of 1,381,565 bp covered by twenty one physically
RT   assigned P1 and TAC clones.";
RL   DNA Res. 5:131-145(1998).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia;
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
RN   [3]
RP   GENE FAMILY, AND NOMENCLATURE.
RC   STRAIN=cv. Columbia;
RX   PubMed=28971960; DOI=10.1534/genetics.117.300149;
RA   Kanno T., Lin W.-D., Fu J.L., Chang C.-L., Matzke A.J.M., Matzke M.;
RT   "A genetic screen for pre-mRNA splicing mutants of Arabidopsis thaliana
RT   identifies putative U1 snRNP components RBM25 and PRP39a.";
RL   Genetics 207:1347-1359(2017).
CC   -!- FUNCTION: Involved in pre-mRNA splicing.
CC       {ECO:0000250|UniProtKB:Q93ZR3}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:P39682}.
CC   -!- SIMILARITY: Belongs to the PRP39 family. {ECO:0000305}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=BAB11095.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AB010698; BAB11095.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; CP002688; AED95379.1; -; Genomic_DNA.
DR   RefSeq; NP_199452.2; NM_124010.3.
DR   AlphaFoldDB; F4KHG8; -.
DR   SMR; F4KHG8; -.
DR   STRING; 3702.AT5G46400.1; -.
DR   iPTMnet; Q9FL22; -.
DR   PaxDb; F4KHG8; -.
DR   PRIDE; F4KHG8; -.
DR   ProteomicsDB; 199352; -.
DR   EnsemblPlants; AT5G46400.1; AT5G46400.1; AT5G46400.
DR   GeneID; 834683; -.
DR   Gramene; AT5G46400.1; AT5G46400.1; AT5G46400.
DR   Araport; AT5G46400; -.
DR   TAIR; locus:2170453; AT5G46400.
DR   eggNOG; KOG1258; Eukaryota.
DR   HOGENOM; CLU_007864_0_0_1; -.
DR   InParanoid; F4KHG8; -.
DR   OMA; KEKIKCR; -.
DR   OrthoDB; 887474at2759; -.
DR   Proteomes; UP000006548; Chromosome 5.
DR   ExpressionAtlas; F4KHG8; baseline and differential.
DR   GO; GO:0000243; C:commitment complex; IBA:GO_Central.
DR   GO; GO:0005685; C:U1 snRNP; IBA:GO_Central.
DR   GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR   GO; GO:0000395; P:mRNA 5'-splice site recognition; IBA:GO_Central.
DR   Gene3D; 1.25.40.10; -; 2.
DR   InterPro; IPR003107; HAT.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   SMART; SM00386; HAT; 4.
DR   SUPFAM; SSF48452; SSF48452; 2.
PE   3: Inferred from homology;
KW   mRNA processing; mRNA splicing; Nucleus; Reference proteome; Repeat.
FT   CHAIN           1..1036
FT                   /note="Pre-mRNA-processing factor 39-2"
FT                   /id="PRO_0000454966"
FT   REPEAT          62..94
FT                   /note="HAT 1"
FT                   /evidence="ECO:0000255"
FT   REPEAT          96..128
FT                   /note="HAT 2"
FT                   /evidence="ECO:0000255"
FT   REPEAT          131..166
FT                   /note="HAT 3"
FT                   /evidence="ECO:0000255"
FT   REPEAT          168..201
FT                   /note="HAT 4"
FT                   /evidence="ECO:0000255"
FT   REPEAT          278..310
FT                   /note="HAT 5"
FT                   /evidence="ECO:0000255"
FT   REPEAT          312..344
FT                   /note="HAT 6"
FT                   /evidence="ECO:0000255"
FT   REGION          1..24
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          595..618
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          714..767
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          995..1036
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        714..731
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        732..754
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1036 AA;  118658 MW;  803D10A71DBE5F96 CRC64;
     MVTTEVRTAV SDKEPLQRSP ELDSSTDFLD NDRLKETFSS GALDFDEWTL LISEIETTSF
     PDDIEKLCLV YDAFLLEFPL CHGYWRKYAY HKIKLCTLED AVEVFERAVQ AATYSVAVWL
     DYCAFAVAAY EDPHDVSRLF ERGLSFIGKD YSCCTLWDKY IEYLLGQQQW SSLANVYLRT
     LKYPSKKLDL YYKNFRKIAA SLKEKIKCRI DVNGDLSSDP MEEDLVHTRH TDEEISIVVR
     ELMGPSSSSA VSKALHTYLS IGEQFYQDSR QLMEKISCFE TQIRRPYFHV KPLDTNQLDN
     WHAYLSFGET YGDFDWAINL YERCLIPCAN YTEFWFRYVD FVESKGGREL ANFALARASQ
     TFVKSASVIH LFNARFKEHV GDASAASVAL SRCGEELGFG FVENVTKKAN MEKRLGNFEA
     AVTTYREALN KTLIGKENLE TTARLYVQFS RLKYVITNSA DDAAQILLEG NENVPHCKLL
     LEELMRLLMM HGGSRQVDLL DPIIDKELSH QADSSDGLSA EDKEEISNLY MEFIDLSGTI
     HDVRKALGRH IKLFPHSARA KLRGSRPSGN LFRELIQRRE KTRERLNQDL LTNKGISSIV
     DSPPKEKKES SLDSYGTQSK DAVRADYVNT EPNQGCLTSG HLVEGNDNVI ERETLCESQS
     DLSMGLKANE GGKRSHEVSL PIQASPEHGF VTKQAHFSSN SVDTVKSDAI VIQPSGSQSP
     QSYQSQESLR QTGRNRYHRR DLNQMHRDSK PRSQERPPQM PYSPVGTGRE ILGQHMAFTH
     QDNRVALQSS TSQNPQNQFQ NSALQMHPVV QTSNAYPQSQ IHGQHMIVSP PESQNPQNQC
     QNSTSQVQTS FAYPQTQIPQ NPVQSNYQQE GQMQSHEAYN QMWQQYYYSY YYYQQQQQLM
     SEQPQPNQNP QPQLDQNLVQ LLSKQYQSQA KTQYLQPQQV EQVNTQQQSQ EPQNQQQIQF
     QQQQQQQEWF QQQQQWQQQQ YLLYIQQQQL QGEAKGDEQR LSMPQGSTTN SDIQKSQESG
     AVNEANLSSD TSISSI
 
 
维奥蛋白资源库 - 中文蛋白资源 CopyRight © 2010-2024