PROB1_HUMAN
ID PROB1_HUMAN Reviewed; 1015 AA.
AC E7EW31; B4E007;
DT 19-OCT-2011, integrated into UniProtKB/Swiss-Prot.
DT 28-JUN-2011, sequence version 2.
DT 03-AUG-2022, entry version 64.
DE RecName: Full=Proline-rich basic protein 1;
GN Name=PROB1; Synonyms=C5orf65;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15372022; DOI=10.1038/nature02919;
RA Schmutz J., Martin J., Terry A., Couronne O., Grimwood J., Lowry S.,
RA Gordon L.A., Scott D., Xie G., Huang W., Hellsten U., Tran-Gyamfi M.,
RA She X., Prabhakar S., Aerts A., Altherr M., Bajorek E., Black S.,
RA Branscomb E., Caoile C., Challacombe J.F., Chan Y.M., Denys M.,
RA Detter J.C., Escobar J., Flowers D., Fotopulos D., Glavina T., Gomez M.,
RA Gonzales E., Goodstein D., Grigoriev I., Groza M., Hammon N., Hawkins T.,
RA Haydu L., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A.,
RA Lopez F., Lou Y., Martinez D., Medina C., Morgan J., Nandkeshwar R.,
RA Noonan J.P., Pitluck S., Pollard M., Predki P., Priest J., Ramirez L.,
RA Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., Thayer N.,
RA Tice H., Tsai M., Ustaszewska A., Vo N., Wheeler J., Wu K., Yang J.,
RA Dickson M., Cheng J.-F., Eichler E.E., Olsen A., Pennacchio L.A.,
RA Rokhsar D.S., Richardson P., Lucas S.M., Myers R.M., Rubin E.M.;
RT "The DNA sequence and comparative analysis of human chromosome 5.";
RL Nature 431:268-274(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 548-1015.
RC TISSUE=Thymus;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
CC -!- SEQUENCE CAUTION:
CC Sequence=BAG64269.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAH14854.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC135457; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AK303170; BAG64269.1; ALT_INIT; mRNA.
DR EMBL; AK316483; BAH14854.1; ALT_INIT; mRNA.
DR CCDS; CCDS54909.1; -.
DR RefSeq; NP_001155018.1; NM_001161546.1.
DR AlphaFoldDB; E7EW31; -.
DR STRING; 9606.ENSP00000416033; -.
DR iPTMnet; E7EW31; -.
DR PhosphoSitePlus; E7EW31; -.
DR BioMuta; PROB1; -.
DR EPD; E7EW31; -.
DR jPOST; E7EW31; -.
DR MassIVE; E7EW31; -.
DR PaxDb; E7EW31; -.
DR PeptideAtlas; E7EW31; -.
DR PRIDE; E7EW31; -.
DR ProteomicsDB; 18759; -.
DR Antibodypedia; 64696; 5 antibodies from 5 providers.
DR DNASU; 389333; -.
DR Ensembl; ENST00000434752.4; ENSP00000416033.2; ENSG00000228672.4.
DR GeneID; 389333; -.
DR KEGG; hsa:389333; -.
DR MANE-Select; ENST00000434752.4; ENSP00000416033.2; NM_001161546.2; NP_001155018.1.
DR UCSC; uc011czc.2; human.
DR CTD; 389333; -.
DR DisGeNET; 389333; -.
DR GeneCards; PROB1; -.
DR HGNC; HGNC:41906; PROB1.
DR HPA; ENSG00000228672; Tissue enriched (skeletal).
DR neXtProt; NX_E7EW31; -.
DR OpenTargets; ENSG00000228672; -.
DR VEuPathDB; HostDB:ENSG00000228672; -.
DR eggNOG; ENOG502S8N1; Eukaryota.
DR GeneTree; ENSGT00730000111496; -.
DR HOGENOM; CLU_012186_0_0_1; -.
DR InParanoid; E7EW31; -.
DR OMA; AQFECVE; -.
DR OrthoDB; 226448at2759; -.
DR PhylomeDB; E7EW31; -.
DR TreeFam; TF343894; -.
DR PathwayCommons; E7EW31; -.
DR BioGRID-ORCS; 389333; 15 hits in 1068 CRISPR screens.
DR ChiTaRS; PROB1; human.
DR GenomeRNAi; 389333; -.
DR Pharos; E7EW31; Tdark.
DR PRO; PR:E7EW31; -.
DR Proteomes; UP000005640; Chromosome 5.
DR RNAct; E7EW31; protein.
DR Bgee; ENSG00000228672; Expressed in quadriceps femoris and 95 other tissues.
DR GO; GO:0005654; C:nucleoplasm; IDA:HPA.
DR InterPro; IPR027838; DUF4585.
DR Pfam; PF15232; DUF4585; 1.
PE 2: Evidence at transcript level;
KW Reference proteome.
FT CHAIN 1..1015
FT /note="Proline-rich basic protein 1"
FT /id="PRO_0000413693"
FT REGION 1..111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 259..460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 488..678
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 690..883
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 991..1015
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..37
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 279..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..460
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..546
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 559..578
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 747..769
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 814..829
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..871
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 763
FT /note="V -> A (in Ref. 2; BAG64269/BAH14854)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1015 AA; 106917 MW; 4BF1673A980B2609 CRC64;
MLTALAPPAL PGIPRQLPTA PARRQDSSGS SGSYYTAPGS PEPPDVGPDA KGPANWPWVA
PGRGAGAQPR LSVSAQNSRQ RHGPGSGFPR GPGSGPRPPQ PQLRTLPSGE MEVIFGVGPL
FGCSGADDRE AQQQFTEPAF ISPLPPGPAS PAAVPRQSQV PDGGSRWATY LELRPRGPSP
AAPAQFECVE VALEEGAAPA RPRTVPKRQI ELRPRPQSPP RAAGAPRPRL LLRTGSLDES
LGPLQAAAGF VQTALARKLS PEAPAPSSAT FGSTGRSEPE TRETARSTHV VLEKAKSRPL
RVRDNSAPAK APRPWPSLRE RAIRRDKPAP GTEPLGPVSS SIFLQSEEKI QEARKTRFPR
EAPDRTVQRA RSPPFECRIP SEVPSRAVRP RSPSPPRQTP NGAVRGPRCP SPQNLSPWDR
TTRRVSSPLF PEASSEWENQ NPAVEETVSR RSPSPPILSQ WNQCVAGERS PSLEAPSLWE
IPHSAVADAV EPRSSPSPPA FFPWEAPDRP IGTWGPSPQE TWDPMGPGSS IAFTQEAQNG
LTQEELAPPT PSAPGTPEPT EMQSPSTREI SDLAFGGSQQ SPEVAAPEPP GSHPVGTLDA
DKCPEVLGPG EAASGRPRMA IPRPRDVRKL VKTTYAPGFP AGAQGSGLPA PPADPCGEEG
GESKTQEPPA LGPPAPAHYT SVFIKDFLPV VPHPYEPPEP SFDTVARDAS QPNGVLRRRA
ENSTAKPFKR TEIRLPGALA LGRRPEVTSR VRARGPGGEN RDVEAQRLVP DGDGRTSPLG
GARSSSQRSP VGPAGVRSPR PGSPQMQASP SPGIAPKPKT PPTAPEPAAA VQAPLPREPL
ALAGRTAPAQ PRAASAPPTD RSPQSPSQGA RRQPGAAPLG KVLVDPESGR YYFVEAPRQP
RLRVLFDPES GQYVEVLLPP SSPGPPHRVY TPLALGLGLY PPAYGPIPSL SLPPSPGPQA
LGSPQLPWVS EAGPLDGTYY LPVSGTPNPA PPLLLCAPPS SSGPTQPGKG SLFPL