PRP39_DANRE
ID PRP39_DANRE Reviewed; 752 AA.
AC Q1JPZ7; Q801W4;
DT 31-OCT-2006, integrated into UniProtKB/Swiss-Prot.
DT 31-OCT-2006, sequence version 2.
DT 03-AUG-2022, entry version 90.
DE RecName: Full=Pre-mRNA-processing factor 39;
DE AltName: Full=PRP39 homolog;
GN Name=prpf39; ORFNames=si:dz261o22.3;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tuebingen;
RX PubMed=23594743; DOI=10.1038/nature12111;
RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA Clee C., Oliver K., Clark R., Riddle C., Elliot D., Threadgold G.,
RA Harden G., Ware D., Begum S., Mortimore B., Kerry G., Heath P.,
RA Phillimore B., Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S.,
RA Pelan S., Griffiths G., Smith M., Glithero R., Howden P., Barker N.,
RA Lloyd C., Stevens C., Harley J., Holt K., Panagiotidis G., Lovell J.,
RA Beasley H., Henderson C., Gordon D., Auger K., Wright D., Collins J.,
RA Raisen C., Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D.,
RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S.,
RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E.,
RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., Babbage A.,
RA Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., Wray P.,
RA Ellington A., Matthews N., Ellwood M., Woodmansey R., Clark G., Cooper J.,
RA Tromans A., Grafham D., Skuce C., Pandian R., Andrews R., Harrison E.,
RA Kimberley A., Garnett J., Fosker N., Hall R., Garner P., Kelly D., Bird C.,
RA Palmer S., Gehring I., Berger A., Dooley C.M., Ersan-Urun Z., Eser C.,
RA Geiger H., Geisler M., Karotki L., Kirn A., Konantz J., Konantz M.,
RA Oberlander M., Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G.,
RA Osoegawa K., Zhu B., Rapp A., Widaa S., Langford C., Yang F.,
RA Schuster S.C., Carter N.P., Harrow J., Ning Z., Herrero J., Searle S.M.,
RA Enright A., Geisler R., Plasterk R.H., Lee C., Westerfield M.,
RA de Jong P.J., Zon L.I., Postlethwait J.H., Nusslein-Volhard C.,
RA Hubbard T.J., Roest Crollius H., Rogers J., Stemple D.L.;
RT "The zebrafish reference genome sequence and its relationship to the human
RT genome.";
RL Nature 496:498-503(2013).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=AB; TISSUE=Skin;
RG NIH - Zebrafish Gene Collection (ZGC) project;
RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Involved in pre-mRNA splicing. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the PRP39 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL591492; CAD87784.1; -; Genomic_DNA.
DR EMBL; BC116540; AAI16541.1; -; mRNA.
DR RefSeq; NP_001004520.1; NM_001004520.1.
DR AlphaFoldDB; Q1JPZ7; -.
DR SMR; Q1JPZ7; -.
DR STRING; 7955.ENSDARP00000061671; -.
DR PaxDb; Q1JPZ7; -.
DR PRIDE; Q1JPZ7; -.
DR Ensembl; ENSDART00000170482; ENSDARP00000139799; ENSDARG00000100209.
DR GeneID; 368864; -.
DR KEGG; dre:368864; -.
DR CTD; 55015; -.
DR ZFIN; ZDB-GENE-030616-420; prpf39.
DR eggNOG; KOG1258; Eukaryota.
DR GeneTree; ENSGT00390000005033; -.
DR HOGENOM; CLU_007434_2_0_1; -.
DR InParanoid; Q1JPZ7; -.
DR OMA; NYCAFKV; -.
DR OrthoDB; 887474at2759; -.
DR PhylomeDB; Q1JPZ7; -.
DR TreeFam; TF314746; -.
DR PRO; PR:Q1JPZ7; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Chromosome 20.
DR Bgee; ENSDARG00000100209; Expressed in somite and 27 other tissues.
DR GO; GO:0000243; C:commitment complex; IBA:GO_Central.
DR GO; GO:0005685; C:U1 snRNP; IBA:GO_Central.
DR GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR GO; GO:0000395; P:mRNA 5'-splice site recognition; IBA:GO_Central.
DR Gene3D; 1.25.40.10; -; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR SMART; SM00386; HAT; 7.
DR SUPFAM; SSF48452; SSF48452; 2.
PE 2: Evidence at transcript level;
KW mRNA processing; mRNA splicing; Nucleus; Reference proteome; Repeat.
FT CHAIN 1..752
FT /note="Pre-mRNA-processing factor 39"
FT /id="PRO_0000259650"
FT REPEAT 180..212
FT /note="HAT 1"
FT REPEAT 214..246
FT /note="HAT 2"
FT REPEAT 254..289
FT /note="HAT 3"
FT REPEAT 408..440
FT /note="HAT 4"
FT REPEAT 442..474
FT /note="HAT 5"
FT REPEAT 700..731
FT /note="HAT 6"
FT REGION 1..148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 347..374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 678..703
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..60
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..101
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 678..698
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 10
FT /note="G -> R (in Ref. 2; AAI16541)"
FT /evidence="ECO:0000305"
FT CONFLICT 217
FT /note="Q -> R (in Ref. 2; AAI16541)"
FT /evidence="ECO:0000305"
FT CONFLICT 355
FT /note="A -> T (in Ref. 2; AAI16541)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 752 AA; 85947 MW; 764733541159D874 CRC64;
MEDSGESMTG MLDSKSPESG DSPAMEGTTG TDDVTGLSTS DLTTEQPPES QEQTQPVSDM
EFSVEHLKTA VQNIDQSASP AEPAAENSEQ PPESNGQQED QSEQPDDVKE AGQGDSESPS
NMELEDAPKE PAEPAAEADP AAPQEPELPT EYERLSKVVE DNPEDFNGWV YLLQYVEQEN
HLLGSRKAFD AFFLHYPYCY GYWKKYADIE RKHGYIQMAD EVYRRGLQAI PLSVDLWLHY
ITFLRENQDT SDGEAESRIR ASYEHAVLAC GTDFRSDRLW EAYIAWETEQ GKLANVTAIY
DRLLCIPTQL YSQHFQKFKD HVQSNNPKHF LSEEEFVSLR VELANANKPS GDEDAETEAP
GEELPPGTED LPDPAKRVTE IENMRHKVIE TRQEMFNHNE HEVSKRWAFE EGIKRPYFHV
KALEKTQLNN WREYLDFELE NGTPERVVVL FERCLIACAL YEEFWIKYAK YLESYSTEAV
RHIYKKACTV HLPKKPNVHL LWAAFEEQQG SIDEARSILK AVEVSVPGLA MVRLRRVSLE
RRHGNMEEAE ALLQDAITNG RNSSESSFYS VKLARQLVKV QKSIGRAKKV LLEAVEKDET
NPKLYLNLLE LEYSGDVQQN EAEIIACFDR ALSSSMALES RITFSQRKVD FLEDFGSDIN
TLMAAYEQHQ RLLAEQESFK RKAENGSEEP DAKRQRTDDQ SVASGQMMDM QANHAGYNYN
NWYQYNSWGS QNSWGQYGQY GQYNQYYPPP PT