UTP20_YEAST
ID UTP20_YEAST Reviewed; 2493 AA.
AC P35194; D6VPZ8;
DT 01-FEB-1994, integrated into UniProtKB/Swiss-Prot.
DT 15-MAY-2007, sequence version 3.
DT 03-AUG-2022, entry version 179.
DE RecName: Full=U3 small nucleolar RNA-associated protein 20;
DE Short=U3 snoRNA-associated protein 20;
DE AltName: Full=U three protein 20;
GN Name=UTP20; OrderedLocusNames=YBL004W; ORFNames=YBL0101;
OS Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.
OX NCBI_TaxID=559292;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 204508 / S288c;
RX PubMed=7813418; DOI=10.1002/j.1460-2075.1994.tb06923.x;
RA Feldmann H., Aigle M., Aljinovic G., Andre B., Baclet M.C., Barthe C.,
RA Baur A., Becam A.-M., Biteau N., Boles E., Brandt T., Brendel M.,
RA Brueckner M., Bussereau F., Christiansen C., Contreras R., Crouzet M.,
RA Cziepluch C., Demolis N., Delaveau T., Doignon F., Domdey H.,
RA Duesterhus S., Dubois E., Dujon B., El Bakkoury M., Entian K.-D.,
RA Feuermann M., Fiers W., Fobo G.M., Fritz C., Gassenhuber J., Glansdorff N.,
RA Goffeau A., Grivell L.A., de Haan M., Hein C., Herbert C.J.,
RA Hollenberg C.P., Holmstroem K., Jacq C., Jacquet M., Jauniaux J.-C.,
RA Jonniaux J.-L., Kallesoee T., Kiesau P., Kirchrath L., Koetter P.,
RA Korol S., Liebl S., Logghe M., Lohan A.J.E., Louis E.J., Li Z.Y.,
RA Maat M.J., Mallet L., Mannhaupt G., Messenguy F., Miosga T., Molemans F.,
RA Mueller S., Nasr F., Obermaier B., Perea J., Pierard A., Piravandi E.,
RA Pohl F.M., Pohl T.M., Potier S., Proft M., Purnelle B., Ramezani Rad M.,
RA Rieger M., Rose M., Schaaff-Gerstenschlaeger I., Scherens B.,
RA Schwarzlose C., Skala J., Slonimski P.P., Smits P.H.M., Souciet J.-L.,
RA Steensma H.Y., Stucka R., Urrestarazu L.A., van der Aart Q.J.M.,
RA Van Dyck L., Vassarotti A., Vetter I., Vierendeels F., Vissers S.,
RA Wagner G., de Wergifosse P., Wolfe K.H., Zagulski M., Zimmermann F.K.,
RA Mewes H.-W., Kleine K.;
RT "Complete DNA sequence of yeast chromosome II.";
RL EMBO J. 13:5795-5809(1994).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=ATCC 204508 / S288c;
RX PubMed=24374639; DOI=10.1534/g3.113.008995;
RA Engel S.R., Dietrich F.S., Fisk D.G., Binkley G., Balakrishnan R.,
RA Costanzo M.C., Dwight S.S., Hitz B.C., Karra K., Nash R.S., Weng S.,
RA Wong E.D., Lloyd P., Skrzypek M.S., Miyasato S.R., Simison M., Cherry J.M.;
RT "The reference genome sequence of Saccharomyces cerevisiae: Then and now.";
RL G3 (Bethesda) 4:389-398(2014).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1214-2493.
RC STRAIN=ATCC 204508 / S288c;
RX PubMed=8091860; DOI=10.1002/yea.320100006;
RA Wolfe K.H., Lohan A.J.E.;
RT "Sequence around the centromere of Saccharomyces cerevisiae chromosome II:
RT similarity of CEN2 to CEN4.";
RL Yeast 10:S41-S46(1994).
RN [4]
RP SUBCELLULAR LOCATION [LARGE SCALE ANALYSIS].
RX PubMed=14562095; DOI=10.1038/nature02026;
RA Huh W.-K., Falvo J.V., Gerke L.C., Carroll A.S., Howson R.W.,
RA Weissman J.S., O'Shea E.K.;
RT "Global analysis of protein localization in budding yeast.";
RL Nature 425:686-691(2003).
RN [5]
RP FUNCTION, INTERACTION WITH MPP10 AND SNORNA U3, IDENTIFICATION IN SSU
RP PROCESSOME, AND SUBCELLULAR LOCATION.
RX PubMed=15590835; DOI=10.1128/ec.3.6.1619-1626.2004;
RA Bernstein K.A., Gallagher J.E.G., Mitchell B.M., Granneman S.,
RA Baserga S.J.;
RT "The small-subunit processome is a ribosome assembly intermediate.";
RL Eukaryot. Cell 3:1619-1626(2004).
CC -!- FUNCTION: Involved in nucleolar processing of pre-18S ribosomal RNA and
CC ribosome assembly. {ECO:0000269|PubMed:15590835}.
CC -!- SUBUNIT: Interacts with snoRNA U3. Interacts with MPP10. Component of
CC the ribosomal small subunit (SSU) processome composed of at least 40
CC protein subunits and snoRNA U3. {ECO:0000269|PubMed:15590835}.
CC -!- INTERACTION:
CC P35194; Q05022: RRP5; NbExp=2; IntAct=EBI-1871, EBI-16011;
CC P35194; P53254: UTP22; NbExp=3; IntAct=EBI-1871, EBI-1878;
CC -!- SUBCELLULAR LOCATION: Cytoplasm. Nucleus, nucleolus.
CC -!- SIMILARITY: Belongs to the UTP20 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z35765; CAA84821.1; -; Genomic_DNA.
DR EMBL; Z26494; CAA81266.1; -; Genomic_DNA.
DR EMBL; BK006936; DAA07118.1; -; Genomic_DNA.
DR PIR; S45734; S45734.
DR RefSeq; NP_009551.2; NM_001178244.1.
DR PDB; 6KE6; EM; 3.40 A; RP=1-2493.
DR PDB; 6LQP; EM; 3.20 A; RP=1-2493.
DR PDB; 6LQQ; EM; 4.10 A; RP=1-2493.
DR PDB; 6LQR; EM; 8.60 A; RP=1-2493.
DR PDB; 6LQS; EM; 3.80 A; RP=1-2493.
DR PDB; 6LQT; EM; 4.90 A; RP=1-2493.
DR PDB; 6LQU; EM; 3.70 A; RP=1-2493.
DR PDB; 6LQV; EM; 4.80 A; RP=1-2493.
DR PDB; 6ZQB; EM; 3.90 A; UT=1-2493.
DR PDB; 6ZQC; EM; 3.80 A; UT=1-2493.
DR PDB; 6ZQD; EM; 3.80 A; UT=1-2493.
DR PDB; 6ZQE; EM; 7.10 A; UT=1-2493.
DR PDB; 7AJT; EM; 4.60 A; UT=1-2493.
DR PDB; 7AJU; EM; 3.80 A; UT=1-2493.
DR PDB; 7D4I; EM; 4.00 A; RP=1-2493.
DR PDB; 7D5T; EM; 6.00 A; RP=1-2493.
DR PDB; 7D63; EM; 12.30 A; RP=1-2493.
DR PDBsum; 6KE6; -.
DR PDBsum; 6LQP; -.
DR PDBsum; 6LQQ; -.
DR PDBsum; 6LQR; -.
DR PDBsum; 6LQS; -.
DR PDBsum; 6LQT; -.
DR PDBsum; 6LQU; -.
DR PDBsum; 6LQV; -.
DR PDBsum; 6ZQB; -.
DR PDBsum; 6ZQC; -.
DR PDBsum; 6ZQD; -.
DR PDBsum; 6ZQE; -.
DR PDBsum; 7AJT; -.
DR PDBsum; 7AJU; -.
DR PDBsum; 7D4I; -.
DR PDBsum; 7D5T; -.
DR PDBsum; 7D63; -.
DR AlphaFoldDB; P35194; -.
DR SMR; P35194; -.
DR BioGRID; 32698; 114.
DR ComplexPortal; CPX-1604; Small ribosomal subunit processome, variant 1.
DR ComplexPortal; CPX-1607; Small ribosomal subunit processome, variant 2.
DR ComplexPortal; CPX-1608; Small ribosomal subunit processome, variant 3.
DR DIP; DIP-2826N; -.
DR IntAct; P35194; 43.
DR MINT; P35194; -.
DR STRING; 4932.YBL004W; -.
DR iPTMnet; P35194; -.
DR MaxQB; P35194; -.
DR PaxDb; P35194; -.
DR PRIDE; P35194; -.
DR EnsemblFungi; YBL004W_mRNA; YBL004W; YBL004W.
DR GeneID; 852282; -.
DR KEGG; sce:YBL004W; -.
DR SGD; S000000100; UTP20.
DR VEuPathDB; FungiDB:YBL004W; -.
DR eggNOG; KOG1823; Eukaryota.
DR GeneTree; ENSGT00390000016813; -.
DR HOGENOM; CLU_000327_0_0_1; -.
DR InParanoid; P35194; -.
DR OMA; LAWIFKF; -.
DR BioCyc; YEAST:G3O-28910-MON; -.
DR Reactome; R-SCE-6791226; Major pathway of rRNA processing in the nucleolus and cytosol.
DR PRO; PR:P35194; -.
DR Proteomes; UP000002311; Chromosome II.
DR RNAct; P35194; protein.
DR GO; GO:0030686; C:90S preribosome; IDA:SGD.
DR GO; GO:0005737; C:cytoplasm; IDA:SGD.
DR GO; GO:0005730; C:nucleolus; IDA:SGD.
DR GO; GO:0005654; C:nucleoplasm; IDA:SGD.
DR GO; GO:0030688; C:preribosome, small subunit precursor; IDA:SGD.
DR GO; GO:0032040; C:small-subunit processome; IDA:SGD.
DR GO; GO:0003729; F:mRNA binding; HDA:SGD.
DR GO; GO:0000480; P:endonucleolytic cleavage in 5'-ETS of tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA); IMP:SGD.
DR GO; GO:0000447; P:endonucleolytic cleavage in ITS1 to separate SSU-rRNA from 5.8S rRNA and LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA); IMP:SGD.
DR GO; GO:0000472; P:endonucleolytic cleavage to generate mature 5'-end of SSU-rRNA from (SSU-rRNA, 5.8S rRNA, LSU-rRNA); IMP:SGD.
DR GO; GO:0030490; P:maturation of SSU-rRNA; IC:ComplexPortal.
DR Gene3D; 1.25.10.10; -; 3.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR011430; DRIM.
DR Pfam; PF07539; DRIM; 1.
DR SUPFAM; SSF48371; SSF48371; 4.
PE 1: Evidence at protein level;
KW 3D-structure; Cytoplasm; Nucleus; Reference proteome; Repeat;
KW Ribonucleoprotein; Ribosome biogenesis; rRNA processing.
FT CHAIN 1..2493
FT /note="U3 small nucleolar RNA-associated protein 20"
FT /id="PRO_0000202465"
FT REPEAT 227..264
FT /note="HEAT 1"
FT REPEAT 495..532
FT /note="HEAT 2"
FT REPEAT 576..613
FT /note="HEAT 3"
FT REPEAT 845..882
FT /note="HEAT 4"
FT REPEAT 1176..1214
FT /note="HEAT 5"
FT REPEAT 1216..1252
FT /note="HEAT 6"
FT REPEAT 1342..1380
FT /note="HEAT 7"
FT REPEAT 1393..1430
FT /note="HEAT 8"
FT REPEAT 1480..1520
FT /note="HEAT 9"
FT REPEAT 1522..1558
FT /note="HEAT 10"
FT REPEAT 1588..1625
FT /note="HEAT 11"
FT REPEAT 1630..1667
FT /note="HEAT 12"
FT REPEAT 1890..1927
FT /note="HEAT 13"
FT REPEAT 1953..1992
FT /note="HEAT 14"
FT REPEAT 2120..2157
FT /note="HEAT 15"
FT REPEAT 2358..2397
FT /note="HEAT 16"
FT REGION 2457..2493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 2440
FT /note="R -> S (in Ref. 3; CAA81266)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 2493 AA; 287560 MW; F6ED4E3E9AE0F468 CRC64;
MAKQRQTTKS SKRYRYSSFK ARIDDLKIEP ARNLEKRVHD YVESSHFLAS FDQWKEINLS
AKFTEFAAEI EHDVQTLPQI LYHDKKIFNS LVSFINFHDE FSLQPLLDLL AQFCHDLGPD
FLKFYEEAIK TLINLLDAAI EFESSNVFEW GFNCLAYIFK YLSKFLVKKL VLTCDLLIPL
LSHSKEYLSR FSAEALSFLV RKCPVSNLRE FVRSVFEKLE GDDEQTNLYE GLLILFTESM
TSTQETLHSK AKAIMSVLLH EALTKSSPER SVSLLSDIWM NISKYASIES LLPVYEVMYQ
DFNDSLDATN IDRILKVLTT IVFSESGRKI PDWNKITILI ERIMSQSENC ASLSQDKVAF
LFALFIRNSD VKTLTLFHQK LFNYALTNIS DCFLEFFQFA LRLSYERVFS FNGLKFLQLF
LKKNWQSQGK KIALFFLEVD DKPELQKVRE VNFPEEFILS IRDFFVTAEI NDSNDLFEIY
WRAIIFKYSK LQNTEIIIPL LERIFSTFAS PDNFTKDMVG TLLKIYRKED DASGNNLLKT
ILDNYENYKE SLNFLRGWNK LVSNLHPSES LKGLMSHYPS LLLSLTDNFM LPDGKIRYET
LELMKTLMIL QGMQVPDLLS SCMVIEEIPL TLQNARDLTI RIKNVGAEFG KTKTDKLVSS
FFLKYLFGLL TVRFSPVWTG VFDTLPNVYT KDEALVWKLV LSFIKLPDEN QNLDYYQPLL
EDGANKVLWD SSVVRLRDTI DTFSHIWSKY STQNTSIIST TIERRGNTTY PILIRNQALK
VMLSIPQVAE NHFVDIAPFV YNDFKTYKDE EDMENERVIT GSWTEVDRNV FLKTLSKFKN
IKNVYSATEL HDHLMVLLGS RNTDVQKLAL DALLAYKNPT LNKYRDNLKN LLDDTLFKDE
ITTFLTENGS QSIKAEDEKV VMPYVLRIFF GRAQVPPTSG QKRSRKIAVI SVLPNFKKPY
INDFLSLASE RLDYNYFFGN SHQINSSKAT LKTIRRMTGF VNIVNSTLSV LRTNFPLHTN
SVLQPLIYSI AMAYYVLDTE STEEVHLRKM ASNLRQQGLK CLSSVFEFVG NTFDWSTSME
DIYAVVVKPR ISHFSDENLQ QPSSLLRLFL YWAHNPSLYQ FLYYDEFATA TALMDTISNQ
HVKEAVIGPI IEAADSIIRN PVNDDHYVDL VTLICTSCLK ILPSLYVKLS DSNSISTFLN
LLVSITEMGF IQDDHVRSRL ISSLISILKG KLKKLQENDT QKILKILKLI VFNYNCSWSD
IEELYTTISS LFKTFDERNL RVSLTELFIE LGRKVPELES ISKLVADLNS YSSSRMHEYD
FPRILSTFKG LIEDGYKSYS ELEWLPLLFT FLHFINNKEE LALRTNASHA IMKFIDFINE
KPNLNEASKS ISMLKDILLP NIRIGLRDSL EEVQSEYVSV LSYMVKNTKY FTDFEDMAIL
LYNGDEEADF FTNVNHIQLH RRQRAIKRLG EHAHQLKDNS ISHYLIPMIE HYVFSDDERY
RNIGNETQIA IGGLAQHMSW NQYKALLRRY ISMLKTKPNQ MKQAVQLIVQ LSVPLRETLR
IVRDGAESKL TLSKFPSNLD EPSNFIKQEL YPTLSKILGT RDDETIIERM PIAEALVNIV
LGLTNDDITN FLPSILTNIC QVLRSKSEEL RDAVRVTLGK ISIILGAEYL VFVIKELMAT
LKRGSQIHVL SYTVHYILKS MHGVLKHSDL DTSSSMIVKI IMENIFGFAG EEKDSENYHT
KVKEIKSNKS YDAGEILASN ISLTEFGTLL SPVKALLMVR INLRNQNKLS ELLRRYLLGL
NHNSDSESES ILKFCHQLFQ ESEMSNSPQI PKKKVKDQVD EKEDFFLVNL ESKSYTINSN
SLLLNSTLQK FALDLLRNVI TRHRSFLTVS HLEGFIPFLR DSLLSENEGV VISTLRILIT
LIRLDFSDES SEIFKNCARK VLNIIKVSPS TSSELCQMGL KFLSAFIRHT DSTLKDTALS
YVLGRVLPDL NEPSRQGLAF NFLKALVSKH IMLPELYDIA DTTREIMVTN HSKEIRDVSR
SVYYQFLMEY DQSKGRLEKQ FKFMVDNLQY PTESGRQSVM ELINLIITKA NPALLSKLSS
SFFLALVNVS FNDDAPRCRE MASVLISTML PKLENKDLEI VEKYIAAWLK QVDNASFLNL
GLRTYKVYLK SIGFEHTIEL DELAIKRIRY ILSDTSVGSE HQWDLVYSAL NTFSSYMEAT
ESVYKHGFKD IWDGIITCLL YPHSWVRQSA ANLVHQLIAN KDKLEISLTN LEIQTIATRI
LHQLGAPSIP ENLANVSIKT LVNISILWKE QRTPFIMDVS KQTGEDLKYT TAIDYMVTRI
GGIIRSDEHR MDSFMSKKAC IQLLALLVQV LDEDEVIAEG EKILLPLYGY LETYYSRAVD
EEQEELRTLS NECLKILEDK LQVSDFTKIY TAVKQTVLER RKERRSKRAI LAVNAPQISA
DKKLRKHARS REKRKHEKDE NGYYQRRNKR KRA
//