CS047_MOUSE
ID CS047_MOUSE Reviewed; 413 AA.
AC Q8R3Y5; Q3US50; Q8BVL8; Q8C1M9;
DT 26-JUN-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-JUN-2003, sequence version 2.
DT 25-MAY-2022, entry version 115.
DE RecName: Full=Uncharacterized protein C19orf47 homolog;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3).
RC STRAIN=C57BL/6J; TISSUE=Embryonic head, Head, and Tongue;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=Czech II; TISSUE=Mammary tumor;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [3]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-269, AND IDENTIFICATION BY
RP MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC TISSUE=Pancreas, and Testis;
RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001;
RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R.,
RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.;
RT "A tissue-specific atlas of mouse protein phosphorylation and expression.";
RL Cell 143:1174-1189(2010).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q8R3Y5-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8R3Y5-2; Sequence=VSP_026277, VSP_026280;
CC Name=3;
CC IsoId=Q8R3Y5-3; Sequence=VSP_026277, VSP_026278, VSP_026279;
CC -!- MISCELLANEOUS: [Isoform 3]: Due to an intron retention. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAC25259.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK009456; BAC25259.1; ALT_FRAME; mRNA.
DR EMBL; AK077369; BAC36772.1; -; mRNA.
DR EMBL; AK140814; BAE24486.1; -; mRNA.
DR EMBL; BC023369; AAH23369.2; -; mRNA.
DR CCDS; CCDS21026.1; -. [Q8R3Y5-1]
DR RefSeq; NP_001116239.1; NM_001122767.1.
DR RefSeq; NP_780316.3; NM_175107.5.
DR AlphaFoldDB; Q8R3Y5; -.
DR SMR; Q8R3Y5; -.
DR BioGRID; 211418; 1.
DR iPTMnet; Q8R3Y5; -.
DR PhosphoSitePlus; Q8R3Y5; -.
DR EPD; Q8R3Y5; -.
DR MaxQB; Q8R3Y5; -.
DR PaxDb; Q8R3Y5; -.
DR PeptideAtlas; Q8R3Y5; -.
DR PRIDE; Q8R3Y5; -.
DR DNASU; 66367; -.
DR GeneID; 66367; -.
DR KEGG; mmu:66367; -.
DR UCSC; uc009fwp.2; mouse. [Q8R3Y5-3]
DR MGI; MGI:1913617; 2310022A10Rik.
DR eggNOG; KOG3930; Eukaryota.
DR InParanoid; Q8R3Y5; -.
DR OrthoDB; 1034351at2759; -.
DR PhylomeDB; Q8R3Y5; -.
DR BioGRID-ORCS; 66367; 1 hit in 73 CRISPR screens.
DR PRO; PR:Q8R3Y5; -.
DR Proteomes; UP000000589; Unplaced.
DR RNAct; Q8R3Y5; protein.
DR GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR CDD; cd09531; SAM_CS047; 1.
DR Gene3D; 1.10.150.50; -; 1.
DR InterPro; IPR039161; C19orf47-like.
DR InterPro; IPR040772; C19orf47_SAM.
DR InterPro; IPR041477; DUF5577.
DR InterPro; IPR013761; SAM/pointed_sf.
DR PANTHER; PTHR21359; PTHR21359; 1.
DR Pfam; PF17740; DUF5577; 1.
DR SUPFAM; SSF47769; SSF47769; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Isopeptide bond; Phosphoprotein; Reference proteome;
KW Ubl conjugation.
FT CHAIN 1..413
FT /note="Uncharacterized protein C19orf47 homolog"
FT /id="PRO_0000291861"
FT REGION 108..158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 232..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 290..336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 129..158
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 303..320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 115
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8N9M1"
FT MOD_RES 141
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8N9M1"
FT MOD_RES 269
FT /note="Phosphoserine"
FT /evidence="ECO:0007744|PubMed:21183079"
FT MOD_RES 296
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8N9M1"
FT MOD_RES 342
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8N9M1"
FT CROSSLNK 239
FT /note="Glycyl lysine isopeptide (Lys-Gly) (interchain with
FT G-Cter in SUMO2)"
FT /evidence="ECO:0000250|UniProtKB:Q8N9M1"
FT VAR_SEQ 3..15
FT /note="FRIGKNLLFNLRK -> PLPSLFQ (in isoform 2 and isoform
FT 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_026277"
FT VAR_SEQ 174..218
FT /note="ALAHREEESLVVPTKRRRVTAEMEGKYIIHMPKGTTPRTRKILEQ -> GEG
FT TWASVGQSCRGIWVSRGCECVRVCVCVTSKCGEVLQRPSESH (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_026278"
FT VAR_SEQ 219..413
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_026279"
FT VAR_SEQ 367..413
FT /note="KSSAEVKFAIKRTLVGPRGSSSSESLGAQMDHAGTVSVFKRLGQRTF -> P
FT TVRCILPDPPAPLASQRPPRRRWRRTCKDC (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_026280"
FT CONFLICT 49
FT /note="G -> K (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 55
FT /note="A -> P (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 66
FT /note="S -> N (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 74
FT /note="E -> K (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 179
FT /note="E -> D (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 196
FT /note="M -> N (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 197
FT /note="E -> G (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 206
FT /note="K -> D (in Ref. 1; BAC25259)"
FT /evidence="ECO:0000305"
FT CONFLICT 308
FT /note="A -> T (in Ref. 1; BAC25259/BAC36772)"
FT /evidence="ECO:0000305"
FT CONFLICT 313
FT /note="S -> T (in Ref. 1; BAC25259/BAC36772)"
FT /evidence="ECO:0000305"
FT CONFLICT 332
FT /note="Q -> E (in Ref. 1; BAC25259/BAC36772)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 413 AA; 44410 MW; AEFF1D9CF6F15C3E CRC64;
MGFRIGKNLL FNLRKAPGSR VKARKTMVSV TMATSEWIQF FKEAGIPPGP AVNYAVMFVD
NRIQKSMLLD LNKEIMNELG VTVVGDIIAI LKHAKVVHRQ DMCKAATESV PCNPSPLQGE
LRRGASSAAS RMIANSLNHD SPPHTPTRRS DNSTSKISVT VSNKMAAKSA KAAALAHREE
ESLVVPTKRR RVTAEMEGKY IIHMPKGTTP RTRKILEQQQ AAKGLHRTSV FDRLGAESKA
DTTTGTKPTG VFSRLGATPE MDEDLAWDSD NDSSSSSVLQ YAGVLKKLGR GPTKASAQPA
LTVKAKAASS ATSTATTPKL RRLALPSRPG LQKKPDSLPK VSILQRLGKA AVVSEAQDSQ
VTSTKSKSSA EVKFAIKRTL VGPRGSSSSE SLGAQMDHAG TVSVFKRLGQ RTF