UTP20_MOUSE
ID UTP20_MOUSE Reviewed; 2788 AA.
AC Q5XG71; Q80V22; Q8BXH9; Q8CHL6; Q99K11;
DT 12-APR-2005, integrated into UniProtKB/Swiss-Prot.
DT 12-APR-2005, sequence version 2.
DT 23-FEB-2022, entry version 118.
DE RecName: Full=Small subunit processome component 20 homolog;
DE AltName: Full=Down-regulated in metastasis protein;
GN Name=Utp20; Synonyms=Drim;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-916 AND 1324-2788.
RC STRAIN=C57BL/6J; TISSUE=Mammary gland, and Oocyte;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 1878-2788.
RA Daigo Y., Takayama I., Fujino M.A.;
RT "Isolation and characterization of novel human and mouse genes, which are
RT expressed in the digestive tract.";
RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1929-2788.
RC STRAIN=C57BL/6J; TISSUE=Cerebellum;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [5]
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Pancreas, Spleen, and Testis;
RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001;
RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R.,
RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.;
RT "A tissue-specific atlas of mouse protein phosphorylation and expression.";
RL Cell 143:1174-1189(2010).
CC -!- FUNCTION: Involved in 18S pre-rRNA processing. Associates with U3
CC snoRNA (By similarity). {ECO:0000250}.
CC -!- SUBUNIT: Interacts with FBL and PPP1R26. {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus {ECO:0000250}.
CC Note=Colocalizes with NCL in the nucleolus. {ECO:0000250}.
CC -!- SIMILARITY: Belongs to the UTP20 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH84586.1; Type=Miscellaneous discrepancy; Note=Contaminating sequence. Potential poly-A sequence.; Evidence={ECO:0000305};
CC Sequence=BAC53793.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAC53793.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC155820; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AC164567; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC005522; AAH05522.1; -; mRNA.
DR EMBL; BC048955; AAH48955.1; -; mRNA.
DR EMBL; BC084586; AAH84586.1; ALT_SEQ; mRNA.
DR EMBL; AB052760; BAC53793.1; ALT_SEQ; mRNA.
DR EMBL; AK047081; BAC32954.2; -; mRNA.
DR SMR; Q5XG71; -.
DR STRING; 10090.ENSMUSP00000004470; -.
DR iPTMnet; Q5XG71; -.
DR PhosphoSitePlus; Q5XG71; -.
DR EPD; Q5XG71; -.
DR MaxQB; Q5XG71; -.
DR PaxDb; Q5XG71; -.
DR PeptideAtlas; Q5XG71; -.
DR PRIDE; Q5XG71; -.
DR ProteomicsDB; 297973; -.
DR MGI; MGI:1917933; Utp20.
DR eggNOG; KOG1823; Eukaryota.
DR InParanoid; Q5XG71; -.
DR PhylomeDB; Q5XG71; -.
DR Reactome; R-MMU-6791226; Major pathway of rRNA processing in the nucleolus and cytosol.
DR ChiTaRS; Utp20; mouse.
DR PRO; PR:Q5XG71; -.
DR Proteomes; UP000000589; Unplaced.
DR RNAct; Q5XG71; protein.
DR GO; GO:0030686; C:90S preribosome; IBA:GO_Central.
DR GO; GO:0005730; C:nucleolus; ISS:UniProtKB.
DR GO; GO:0005886; C:plasma membrane; ISO:MGI.
DR GO; GO:0032040; C:small-subunit processome; IBA:GO_Central.
DR GO; GO:0006364; P:rRNA processing; ISS:UniProtKB.
DR Gene3D; 1.25.10.10; -; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR011430; DRIM.
DR Pfam; PF07539; DRIM; 1.
DR SUPFAM; SSF48371; SSF48371; 3.
PE 1: Evidence at protein level;
KW Coiled coil; Nucleus; Phosphoprotein; Reference proteome; Repeat;
KW rRNA processing.
FT CHAIN 1..2788
FT /note="Small subunit processome component 20 homolog"
FT /id="PRO_0000080012"
FT REPEAT 165..202
FT /note="HEAT 1"
FT REPEAT 1841..1878
FT /note="HEAT 2"
FT REGION 771..795
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 866..908
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1718..1752
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2598..2617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 2691..2768
FT /evidence="ECO:0000255"
FT COMPBIAS 771..786
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 878..897
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1721..1737
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 788
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:O75691"
FT MOD_RES 2640
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:O75691"
FT CONFLICT 1883
FT /note="Y -> C (in Ref. 3; BAC53793)"
FT /evidence="ECO:0000305"
FT CONFLICT 2190
FT /note="V -> R (in Ref. 3; BAC53793)"
FT /evidence="ECO:0000305"
FT CONFLICT 2397
FT /note="V -> L (in Ref. 2; AAH05522)"
FT /evidence="ECO:0000305"
FT CONFLICT 2472
FT /note="L -> V (in Ref. 4; BAC32954)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 2788 AA; 317744 MW; 258424B5802D4636 CRC64;
MKPKPLSHKT ENTYRFLTFA ERLGNVNIDI IHRIDRTASY DEDVETYFFE ALLKWRELNL
TEHFGKFYKE VIDKCQSFNQ LVYHQNEIVQ SLKTHLQIRN SLAYQPLLDL VVQLARDLQT
DFYPHFEDFF LTITSILETQ DTELLEWAFT SLSYLYKYLW RLMVKDMSKI YSLYSTLLAH
KKLHIRNFAA ESFTFLMRKV SDKNALFNLM FLDLNEHPEK VEGVGQLLFE MCKGVRNMFH
SCTGQALKLL LQKLGPVTET ETQLPWILVG ETLKTMAKSS VVYIYKEHFG VFFDCLQESL
LELHNKVTEA NCCENSEQMR RLLETYLIVV KHGSGSKITR PADVCGVLSE ALQTASLSTS
CRKTLLDVVS ALLLAENVSL PETLIKETVE KVFESKFERR SVLDFSEVMF AMKQFEQLFL
PSFLLYIENC FLMDNSVVSD EALAILAKLI LHKAPPPTAG SMAIEKYPLV FSQQTVGSYL
KQRKADSKRR KEQFPVLSHL LSIVQLPPNK DATYLSRSWA ALVVLPHLRP LEKEKTISLV
SCFIESLFLA VDRGSFGKGH LFVLCQAVNT LLSLEESSEL LHLVPVGRVK HLVLTSPTEP
SVLLLADLYY QRLALCGCKG PLSEEALMEL FPKLQANIST GVSKIRLLTI RILNHFDIRL
PVSMEDDGLS ERQSAFAILR QAELVPATVS DYREKLLHLR KLRHDVVQGA VPQGRLQEVP
LRYLLGMLYV NFSALWDPVI ELISSHAYGM ENKQFWNVCY EHLEKAASHA EKELHKDVRD
EESTGDESWE QTQEGDVGDL YQQQLALKTD CRERLDHTNF RFLLWRALAK FPERVEPRSR
ELSPLFLRFI NNEYYPADLQ VAPTQDLRKK GRGAVAEEEE EEEPAAGEDE ELEEEAVPTE
DAPQKKKTRR AAAKQLIAHL QVFSKFSNPR ALYLESKLYE LYLQLLLHQD QAVQKITLDC
IMTYRHPHIL PYRENLQRLL DDRSFKEEIV HFNISEDNTV VKAAHRADLF PILMRILYGR
MKNKTGSKTQ GKSASGTRMA IVLRFLAGTQ PEEIQLFLDL LSEPVKHFKD GDCCSAVIQA
VEDLDVSKVL PVGRQHGVLN SLEVVLKNIS HLISTYLPKI LQILLCMTAT VSHILDQREK
IQLRFINPLK NLRRLGIKMV TDIFLDWESY QFKAEEIDAV FHGTVWPQIC RLGSESQYSP
TPLLKLISIW SRNARYFPLL AKQKPGHPEY DILTNVFAVL SAKNLSEATA SIIMDIVDDL
LNLPDFQPTE AVPSLPVTGC VYADVAEDTE PVTVGGRLVL PHVPAILQYL SKTTISAEKV
KKKKNRAQVS KELGILSKIS KFMKDREQCS LLITLLLPFL LRGNVAQDTE LDILVTVQNL
LQHCLHPAHF LRPLAKLFSV IKNKLSRQLL CTVFQMSDFE SRLKYITDIV KLNAFDKRHL
DDINFDVRFS AFQTITSNIK AMQTVDADYL IAVMHNCFYN MEIGDMSLSD NASICLTSII
KRLAALNVTE KEYKEIIHRT LLEKLRKGLK SQTESVQHDY TLILSCLIQT FPNQLEFKDL
VQLTHCHDPE MDFFENMKHI QIHRRARALK KLAKQLLEGQ VVLSSKSLQN YIMPYAMAPI
LDEKMLKHEN ITIAATEVIG AICRHLSWPA YVYYLKHFIH VLQSGQINQK LAVSLLVIVL
EAFHFDYKTL EEQMGNVKNE ENTVEMAELL EPEAMEVEDM DEAGKEQASE RLSDSKEALG
APEAAASEGT VAKEQECISK SVSFLPRNKE ELERTIQTIQ GAITGDILPR LHKCLASATK
REEEHKLVKS KVVNDEEVVR VPLAFAMVKL MRSLPREVME ANLPSILLKV CVLLKNRAQE
IRDIARSTLS KIIEDLGVHF LQYVLKELQT TLVRGYQVHV LTFTVYTLLQ GLSSKLQVGD
LDSCLHIMTE IFNHELFGAL AEEKEVKQIL SKVMEARRSK SYDSYEILGK FVGKQQVTKL
ILPLKEILQN TTSLKLARKV HETLRRIIAG LIVNPDMTAD ALLLLSYGLV SENLPLLTEK
EKKPAAPVPD ARLPPQSCLL LPATPVRGGP KAVVNKKTNM HIFIESGLRL LHLSLKTSRI
KSSSEHVLEM LDPFVSVLIN CLGAQDVKVI TGALQCLIWV LRFPLPSIAS KAEQLTKHLF
LLLKNYARVG AARGQNFHLV VNCFKCVTIV VKKVKSHQIT EKQLQVLLAY AEEDIYDTSR
QATAFGLLKA ILSRKLLVPE IDDIMRKVSK LAISAQNEPA RVQCRQVFLK YILDYPLGEK
LRPNLEFMLA QLNYEHETGR ESTLEMIAYL FETFPQGLLH EHCGMFFIPL CLMMVNDDSA
MCKRMASMAI KSLLSKVDRE KKDWLFGLVT SWFEAKKRLN RQLAALACGL FVESEGVDFE
RRLGTLLPVI EKEIDPENFK DIIEETEEKA ADRLLFGFLT LMRKLIKECS IIHFTKPSET
LSKIWSHVHS HLRHPHSWVW LTAAQIFGLL FASCQPEELI QKWKGKKTKK KTSDPIAVRF
LTSDLGQKMK SISLASCHQL HSKFLDESLG EQVVKNLLFI AKVLYLLELE SGNKRGEVKD
SEEQDTLADA LAREAAEEKA GAGGKMESNR EKKEEPSKPA TLMWLIQKLS RMAKLEAAYS
PRNPLKRTCI FKFLGAVAVD LGVDRVKPYL PLIIAPLFRE LNSTFAEQDP VLKNLSQEII
ELLKKLVGLE SFSLAFASVQ KQASEKRALR KKRKALEFVT NPDIAAKKKL KKHKNKSEAK
KRKIEFLRPG YKAKRQKSHS LRDLAMVE
//