DOP1_HUMAN
ID DOP1_HUMAN Reviewed; 2465 AA.
AC Q5JWR5; Q86XV1; Q9H5J5; Q9NSL4; Q9UPN5; Q9Y414;
DT 21-AUG-2007, integrated into UniProtKB/Swiss-Prot.
DT 20-FEB-2007, sequence version 1.
DT 03-AUG-2022, entry version 102.
DE RecName: Full=Protein dopey-1 {ECO:0000305};
GN Name=DOP1A {ECO:0000312|HGNC:HGNC:21194};
GN Synonyms=DOPEY1 {ECO:0000312|HGNC:HGNC:21194}, KIAA1117;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=14574404; DOI=10.1038/nature02055;
RA Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L.,
RA Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R.,
RA Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D.,
RA Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J.,
RA Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H.,
RA Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J.,
RA Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P.,
RA Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V.,
RA Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J.,
RA Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E.,
RA Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J.,
RA French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J.,
RA Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C.,
RA Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A.,
RA Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R.,
RA Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M.,
RA Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K.,
RA Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R.,
RA Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M.,
RA Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A.,
RA Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L.,
RA Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I.,
RA Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y.,
RA Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E.,
RA Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A.,
RA Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W.,
RA Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M.,
RA West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J.,
RA Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M.,
RA Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I.,
RA Rogers J., Beck S.;
RT "The DNA sequence and analysis of human chromosome 6.";
RL Nature 425:805-811(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 908-2465.
RC TISSUE=Brain;
RX PubMed=10470851; DOI=10.1093/dnares/6.3.197;
RA Kikuno R., Nagase T., Ishikawa K., Hirosawa M., Miyajima N., Tanaka A.,
RA Kotani H., Nomura N., Ohara O.;
RT "Prediction of the coding sequences of unidentified human genes. XIV. The
RT complete sequences of 100 new cDNA clones from brain which code for large
RT proteins in vitro.";
RL DNA Res. 6:197-205(1999).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1212-2465.
RC TISSUE=Amygdala;
RX PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D.,
RA Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A.,
RA Wiemann S., Schupp I.;
RT "The full-ORF clone resource of the German cDNA consortium.";
RL BMC Genomics 8:399-399(2007).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1326-2465.
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [5]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1910-2465.
RC TISSUE=Brain;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [6]
RP VARIANT [LARGE SCALE ANALYSIS] HIS-1155.
RX PubMed=16959974; DOI=10.1126/science.1133427;
RA Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D.,
RA Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., Buckhaults P.,
RA Farrell C., Meeh P., Markowitz S.D., Willis J., Dawson D., Willson J.K.V.,
RA Gazdar A.F., Hartigan J., Wu L., Liu C., Parmigiani G., Park B.H.,
RA Bachman K.E., Papadopoulos N., Vogelstein B., Kinzler K.W.,
RA Velculescu V.E.;
RT "The consensus coding sequences of human breast and colorectal cancers.";
RL Science 314:268-274(2006).
CC -!- FUNCTION: May be involved in protein traffic between late Golgi and
CC early endosomes. {ECO:0000250|UniProtKB:Q03921}.
CC -!- SUBCELLULAR LOCATION: Golgi apparatus membrane
CC {ECO:0000250|UniProtKB:Q03921}; Peripheral membrane protein
CC {ECO:0000250|UniProtKB:Q03921}.
CC -!- SIMILARITY: Belongs to the dopey family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB15631.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL121716; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AL139333; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AB029040; BAA83069.2; -; mRNA.
DR EMBL; AL050080; CAB43259.1; -; mRNA.
DR EMBL; AL162056; CAB82395.1; -; mRNA.
DR EMBL; AK027030; BAB15631.1; ALT_INIT; mRNA.
DR EMBL; BC048342; AAH48342.1; -; mRNA.
DR CCDS; CCDS4996.1; -.
DR PIR; T47141; T47141.
DR RefSeq; NP_055833.2; NM_015018.3.
DR RefSeq; XP_016866052.1; XM_017010563.1.
DR RefSeq; XP_016866053.1; XM_017010564.1.
DR AlphaFoldDB; Q5JWR5; -.
DR BioGRID; 116672; 10.
DR IntAct; Q5JWR5; 2.
DR STRING; 9606.ENSP00000237163; -.
DR GlyGen; Q5JWR5; 1 site, 1 O-linked glycan (1 site).
DR iPTMnet; Q5JWR5; -.
DR PhosphoSitePlus; Q5JWR5; -.
DR BioMuta; DOPEY1; -.
DR DMDM; 156630499; -.
DR EPD; Q5JWR5; -.
DR jPOST; Q5JWR5; -.
DR MassIVE; Q5JWR5; -.
DR MaxQB; Q5JWR5; -.
DR PaxDb; Q5JWR5; -.
DR PeptideAtlas; Q5JWR5; -.
DR PRIDE; Q5JWR5; -.
DR ProteomicsDB; 63399; -.
DR Antibodypedia; 31656; 42 antibodies from 14 providers.
DR DNASU; 23033; -.
DR Ensembl; ENST00000349129.7; ENSP00000195654.3; ENSG00000083097.15.
DR GeneID; 23033; -.
DR KEGG; hsa:23033; -.
DR MANE-Select; ENST00000349129.7; ENSP00000195654.3; NM_015018.4; NP_055833.2.
DR UCSC; uc003pjs.2; human.
DR CTD; 23033; -.
DR DisGeNET; 23033; -.
DR GeneCards; DOP1A; -.
DR HGNC; HGNC:21194; DOP1A.
DR HPA; ENSG00000083097; Low tissue specificity.
DR MalaCards; DOP1A; -.
DR neXtProt; NX_Q5JWR5; -.
DR OpenTargets; ENSG00000083097; -.
DR PharmGKB; PA134924787; -.
DR VEuPathDB; HostDB:ENSG00000083097; -.
DR eggNOG; KOG3613; Eukaryota.
DR GeneTree; ENSGT00390000016421; -.
DR InParanoid; Q5JWR5; -.
DR OrthoDB; 29961at2759; -.
DR PhylomeDB; Q5JWR5; -.
DR TreeFam; TF316855; -.
DR PathwayCommons; Q5JWR5; -.
DR SignaLink; Q5JWR5; -.
DR BioGRID-ORCS; 23033; 7 hits in 1068 CRISPR screens.
DR ChiTaRS; DOPEY1; human.
DR GenomeRNAi; 23033; -.
DR Pharos; Q5JWR5; Tdark.
DR PRO; PR:Q5JWR5; -.
DR Proteomes; UP000005640; Chromosome 6.
DR RNAct; Q5JWR5; protein.
DR Bgee; ENSG00000083097; Expressed in calcaneal tendon and 191 other tissues.
DR ExpressionAtlas; Q5JWR5; baseline and differential.
DR Genevisible; Q5JWR5; HS.
DR GO; GO:0005829; C:cytosol; IEA:GOC.
DR GO; GO:0005768; C:endosome; IBA:GO_Central.
DR GO; GO:0000139; C:Golgi membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005802; C:trans-Golgi network; IBA:GO_Central.
DR GO; GO:0006895; P:Golgi to endosome transport; IEA:InterPro.
DR GO; GO:0015031; P:protein transport; IEA:UniProtKB-KW.
DR InterPro; IPR040314; DOP1.
DR InterPro; IPR007249; Dopey_N.
DR PANTHER; PTHR14042; PTHR14042; 1.
DR Pfam; PF04118; Dopey_N; 1.
PE 2: Evidence at transcript level;
KW Golgi apparatus; Membrane; Phosphoprotein; Protein transport;
KW Reference proteome; Transport.
FT CHAIN 1..2465
FT /note="Protein dopey-1"
FT /id="PRO_0000297947"
FT REGION 559..600
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 705..733
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1282..1315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 566..586
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 628..646
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 710..733
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1300..1315
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 1266
FT /note="Phosphoserine"
FT /evidence="ECO:0000250|UniProtKB:Q8BL99"
FT VARIANT 596
FT /note="R -> Q (in dbSNP:rs4706980)"
FT /id="VAR_034690"
FT VARIANT 1155
FT /note="D -> H (in a breast cancer sample; somatic
FT mutation)"
FT /evidence="ECO:0000269|PubMed:16959974"
FT /id="VAR_036607"
FT VARIANT 1781
FT /note="Q -> L (in dbSNP:rs9444039)"
FT /id="VAR_034691"
SQ SEQUENCE 2465 AA; 277355 MW; 97C58882414BAA6E CRC64;
MNTEELELLS DSKYRNYVAA IDKALKNFEY SSEWADLISA LGKLNKVLQN NAKYQVVPKK
LTIGKRLAQC LHPALPGGVH RKALETYEII FKIIGPKRLA KDLFLYSSGL FPLLANAAMS
VKPTLLSLYE IYYLPLGKTL KPGLQGLLTG ILPGLEEGSE YYERTNMLLE KVAAAVDQSA
FYSALWGSLL TSPAVRLPGI TYVLAHLNRK LSMEDQLYII GSDIELMVEA VSTSVQDSSV
LVQRSTLDLI LFCFPFHMSQ ATRPDMIRIL SAALHVVLRR DMSLNRRLYA WLLGFDNNGA
IIGPRSTRHS NPEEHATYYF TTFSKELLVQ AMVGILQVNG FGEENTLMQD LKPFRILISL
LDKPELGPVI LEDVLIEVFR TLYSQCKAEL DLQTEPPFSK DHAQLSSKLR ENKKTAELIK
TANLLFNSFE PYYMWDYVAR WFEECCRRTL HVRLQIGPGD SNDSSELQLT NFCLLVDFLL
DIVSLPTRSM RVLCQETYIE IQTEHLPQLL LRMISALTSH LQTLHLSELT DSLRLCSKIL
SKVQPPLLSA STGGVLQFPS GQNNSVKEWE DKKVSSVSHE NPTEVFEDGE NPPSSRSSES
GFTEFIQYQA DRTDDIDREL SEGQGAAAIP IGSTSSETET ASTVGSEETI IQTPSVVTQG
TATRSRKTAQ KTAMQCCLEY VQQFLTRLIN LYIIQNNSFS QSLATEHQGD LGREQGETSK
WDRNSQGDVK EKNISKQKTS KEYLSAFLAA CQLFLECSSF PVYIAEGNHT SELRSEKLET
DCEHVQPPQW LQTLMNACSQ ASDFSVQSVA ISLVMDLVGL TQSVAMVTGE NINSVEPAQP
LSPNQGRVAV VIRPPLTQGN LRYIAEKTEF FKHVALTLWD QLGDGTPQHH QKSVELFYQL
HNLVPSSSIC EDVISQQLTH KDKKIRMEAH AKFAVLWHLT RDLHINKSSS FVRSFDRSLF
IMLDSLNSLD GSTSSVGQAW LNQVLQRHDI ARVLEPLLLL LLHPKTQRVS VQRVQAERYW
NKSPCYPGEE SDKHFMQNFA CSNVSQVQLI TSKGNGEKPL TMDEIENFSL TVNPLSDRLS
LLSTSSETIP MVVSDFDLPD QQIEILQSSD SGCSQSSAGD NLSYEVDPET VNAQEDSQMP
KESSPDDDVQ QVVFDLICKV VSGLEVESAS VTSQLEIEAM PPKCSDIDPD EETIKIEDDS
IQQSQNALLS NESSQFLSVS AEGGHECVAN GISRNSSSPC ISGTTHTLHD SSVASIETKS
RQRSHSSIQF SFKEKLSEKV SEKETIVKES GKQPGAKPKV KLARKKDDDK KKSSNEKLKQ
TSVFFSDGLD LENWYSCGEG DISEIESDMG SPGSRKSPNF NIHPLYQHVL LYLQLYDSSR
TLYAFSAIKA ILKTNPIAFV NAISTTSVNN AYTPQLSLLQ NLLARHRISV MGKDFYSHIP
VDSNHNFRSS MYIEILISLC LYYMRSHYPT HVKVTAQDLI GNRNMQMMSI EILTLLFTEL
AKVIESSAKG FPSFISDMLS KCKVQKVILH CLLSSIFSAQ KWHSEKMAGK NLVAVEEGFS
EDSLINFSED EFDNGSTLQS QLLKVLQRLI VLEHRVMTIP EENETGFDFV VSDLEHISPH
QPMTSLQYLH AQPITCQGMF LCAVIRALHQ HCACKMHPQW IGLITSTLPY MGKVLQRVVV
SVTLQLCRNL DNLIQQYKYE TGLSDSRPLW MASIIPPDMI LTLLEGITAI IHYCLLDPTT
QYHQLLVSVD QKHLFEARSG ILSILHMIMS SVTLLWSILH QADSSEKMTI AASASLTTIN
LGATKNLRQQ ILELLGPISM NHGVHFMAAI AFVWNERRQN KTTTRTKVIP AASEEQLLLV
ELVRSISVMR AETVIQTVKE VLKQPPAIAK DKKHLSLEVC MLQFFYAYIQ RIPVPNLVDS
WASLLILLKD SIQLSLPAPG QFLILGVLNE FIMKNPSLEN KKDQRDLQDV THKIVDAIGA
IAGSSLEQTT WLRRNLEVKP SPKIMVDGTN LESDVEDMLS PAMETANITP SVYSVHALTL
LSEVLAHLLD MVFYSDEKER VIPLLVNIMH YVVPYLRNHS AHNAPSYRAC VQLLSSLSGY
QYTRRAWKKE AFDLFMDPSF FQMDASCVNH WRAIMDNLMT HDKTTFRDLM TRVAVAQSSS
LNLFANRDVE LEQRAMLLKR LAFAIFSSEI DQYQKYLPDI QERLVESLRL PQVPTLHSQV
FLFFRVLLLR MSPQHLTSLW PTMITELVQV FLLMEQELTA DEDISRTSGP SVAGLETTYT
GGNGFSTSYN SQRWLNLYLS ACKFLDLALA LPSENLPQFQ MYRWAFIPEA SDDSGLEVRR
QGIHQREFKP YVVRLAKLLR KRAKKNPEED NSGRTLGWEP GHLLLTICTV RSMEQLLPFF
NVLSQVFNSK VTSRCGGHSG SPILYSNAFP NKDMKLENHK PCSSKARQKI EEMVEKDFLE
GMIKT