MUC19_BOVIN
ID MUC19_BOVIN Reviewed; 4596 AA.
AC P98091;
DT 01-FEB-1996, integrated into UniProtKB/Swiss-Prot.
DT 23-FEB-2022, sequence version 2.
DT 03-AUG-2022, entry version 86.
DE RecName: Full=Mucin-19 {ECO:0000250|UniProtKB:Q7Z5P9};
DE Short=MUC-19 {ECO:0000250|UniProtKB:Q7Z5P9};
DE AltName: Full=Submaxillary mucin-like protein {ECO:0000303|PubMed:2204065};
DE Flags: Precursor;
GN Name=MUC19 {ECO:0000250|UniProtKB:Q7Z5P9};
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=9913;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hereford;
RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42;
RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., Puiu D.,
RA Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., Marcais G.,
RA Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.;
RT "A whole-genome assembly of the domestic cow, Bos taurus.";
RL Genome Biol. 10:R42.01-R42.10(2009).
RN [2]
RP NUCLEOTIDE SEQUENCE [MRNA] OF 4045-4596, SUBCELLULAR LOCATION, AND TISSUE
RP SPECIFICITY.
RC TISSUE=Submandibular gland;
RX PubMed=2204065; DOI=10.1073/pnas.87.17.6798;
RA Bhargava A.K., Woitach J.T., Davidson E.A., Bhavanandan V.P.;
RT "Cloning and cDNA sequence of a bovine submaxillary gland mucin-like
RT protein containing two distinct domains.";
RL Proc. Natl. Acad. Sci. U.S.A. 87:6798-6802(1990).
CC -!- FUNCTION: May function in ocular mucus homeostasis.
CC {ECO:0000250|UniProtKB:Q7Z5P9}.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305|PubMed:2204065}.
CC -!- TISSUE SPECIFICITY: Submaxillary mucosae. {ECO:0000269|PubMed:2204065}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAA30657.1; Type=Miscellaneous discrepancy; Note=Probable cloning artifact.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NKLS02000005; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; M36192; AAA30657.1; ALT_SEQ; mRNA.
DR PIR; A36054; A36054.
DR AlphaFoldDB; P98091; -.
DR STRING; 9913.ENSBTAP00000051018; -.
DR PaxDb; P98091; -.
DR PRIDE; P98091; -.
DR eggNOG; KOG1216; Eukaryota.
DR Proteomes; UP000009136; Unplaced.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; Unchr_dom_Cys-rich.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 2.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 2.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57567; SSF57567; 3.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
DR PROSITE; PS51233; VWFD; 3.
PE 2: Evidence at transcript level;
KW Disulfide bond; Glycoprotein; Reference proteome; Repeat; Secreted; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT CHAIN 22..4596
FT /note="Mucin-19"
FT /id="PRO_0000158960"
FT DOMAIN 351..522
FT /note="VWFD 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DOMAIN 688..868
FT /note="VWFD 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DOMAIN 1147..1320
FT /note="VWFD 3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DOMAIN 4371..4437
FT /note="VWFC"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00220"
FT DOMAIN 4504..4588
FT /note="CTCK"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00039"
FT REGION 208..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1589..1695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1784..3205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3333..3578
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3615..3642
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3660..3713
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3769..3935
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3984..4014
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4060..4364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 208..247
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 272..330
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1589..1622
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1648..1693
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1807..1939
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1974..2047
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2053..2085
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2101..2120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2136..2268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2280..2414
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2421..2449
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2465..2481
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2497..2533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2541..2597
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2609..2743
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2750..2778
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2794..2810
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2826..2862
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2870..2926
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2938..3072
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3079..3107
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3123..3139
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3155..3205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3619..3642
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3769..3802
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3808..3935
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4079..4245
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4259..4364
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CARBOHYD 4061
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 4427
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT CARBOHYD 4510
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255"
FT DISULFID 375..521
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 690..825
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 711..867
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 730..738
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 1149..1284
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 1171..1319
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 1180..1281
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 1196..1203
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00580"
FT DISULFID 4504..4551
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00039"
FT DISULFID 4518..4565
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00039"
FT DISULFID 4527..4581
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00039"
FT DISULFID 4531..4583
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00039"
FT CONFLICT 4129
FT /note="E -> K (in Ref. 2; AAA30657)"
FT /evidence="ECO:0000305"
FT CONFLICT 4151
FT /note="R -> G (in Ref. 2; AAA30657)"
FT /evidence="ECO:0000305"
FT CONFLICT 4167
FT /note="S -> T (in Ref. 2; AAA30657)"
FT /evidence="ECO:0000305"
FT CONFLICT 4269
FT /note="A -> R (in Ref. 2; AAA30657)"
FT /evidence="ECO:0000305"
FT CONFLICT 4458
FT /note="N -> K (in Ref. 2; AAA30657)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 4596 AA; 462120 MW; ECD646BF70341097 CRC64;
MKLIFLCLVV ALCIFCKNGE ALFYRLNSDD KIAERKSEIQ KRESVGTESF GWEVGAGRGN
AAFAFGASGS SSFGDSSFSS KTVEGDQVAR SGRSISSDLG DTGLRSGDTF VGDSSGNLET
GLGSSGQQGL KIDELERDSL SGTASVGAGF KDLGSDVSSS VETGSFGWEV GAGRVNAAFD
FGASGSSSFG DSDFSSKTVE GNRVVRSEGS ISSDLGDTSL RSVSTDVGDR SENLESNLGS
SGQQGLEINE LGGDGLSGSA SVEDELKGFA SDASSSGGNI WSSNSGSGEG NKGEAGLGTS
GQNVSDETGV SSTGITSSSD YSTSGPLSTP EKGSHIPEAT PKYSETNAII GEASTWGKGA
YKAFNGRVFS FESSCTYTFC RHCVESGGDF NIEIKRNNDS EIEKITVIID NNDVSIFGDI
LLVNGESVQI PYNNKLIHIK KYGEHNVLNS RRGILSLMWD KNNKLSLTLH KQYPTCGLCG
NFNSTPGDDI NEHIADSKIP DDCSKAVSKS YEVCEDGVQY CNKIIGTYFE KCGKVSTLSS
DYKMICIDEY CQSRDRTSTC DTYSELSRLC ASDGPGTFES WRDDPDVVCE KPICPEKHIY
KECGPSNPAT CSNVAPFQDT ECVSGCTCPE GYLLDDIGEK GRCVLKSDCP CESNGKVYQS
GEVREGSCGS LCTCQEAKWS CTKTLCPGRC KIEGSLITTF DGVKYNHPGN CHFLAIHDKD
WSISVELRPC PSGQSGTCLN SVTLLLNSSV QVDKYVFNRD GTVTNDKFGN LGYYYSDKIQ
IFNASSSYLQ AETYFHGKMQ IQIFPVMQLY VSMPPNQFTD TVGLCGSHNN RAEDDFMSSQ
NILEKTSQAF ASSWEMMPCP KASTASCISI EKERFAERHC GILLDLSGPF ASCHSIVDPK
PYHEECKKYT CTCENSQDCL CTILGNYVKA CAEKETSMVG WRAGLCDQSC PSGLVFKYNV
KTCNSSCRSL SERDKSCDME GISVDGCTCP DGMYKNNEGN CVSKSQCDCY INDEVMQPGK
LIHIDDNKCV CRDGILLCQT PIDLTLQNCS GGAEYVDCRN PKAQRRVDST CSTRNIPSFD
ENLPCKRGCY CPEGMVRNSK GSCVFPDDCP CSFGGREYDQ GSVTSVGCNK CTCIKGSWNC
TQNECQTTCH IYGEGHVRTF DGKSYSFDGL CQYSFIEDYC GRENGTFRIL TESVPCCEDG
LTCSRKIIVA FQDQNIVLHD GKVTAVKTTE SKECELNGNS YSVHTVGLYL ILKFLNGITI
IWDKNTRISV ILDPRWNGQV CGLCGNNNGD LKDDFTTRYS SVAAGTLEFG NSWKTSQECS
DTVAQTFPCD SNPYCKAWAV RKCEIIRDST FRECHNKVDP NEYYDACIEE ACACDMEGKY
LGFCTAVAMY AEACSAVGVC VTWRKPDLCP VYCDYYNAPG EFSWHYEPCG TVTAKTCKDR
VIGQKFSALL EGCYAKCPDS APYLDENTMK CVSLAECSCF YNDIVPAGGV IQDNCGRTCY
CIAGELECSE TAPTNSTYTV STTTATSILS TKAAITLATN SSGTVASIPG ITSSSEITGT
TLSFLSETFT TGVTRTPAPI TSTAGSVGTT GLVGSTFTSS GRISGSTGVS VSTITETEDG
STGDTGFRVG GTEGPTAPVR GEEDGTPGQP STGVTSSEKQ GPQELQKASQ PPLGARAQMQ
TQLSQTQPLE ANQRLPDHQL VKQLENEAEQ LEVKMLPPLE LLVITLLEQQ EIIYSEKELV
YLTFLTSSMP ESTTKRRRKT GIYAAGSEKN VHLYETTRTI IIGSGTSIPP SGAPVTPEPP
LISTGASAGP PASSESTVTL PGATGTDVLR SGTSLPVSGG AVTPASSPGG SSATAGPGVG
SETTVQVSGA TGTDVLRSGT SLPVSGAAVS PGSSPGRSRA TAVSGEGSQP TVALSGATGT
SAGPSGTRSA SSGIPATPGS TTGRAAGAGT PGVDSQQTAS LPAAARPTAL GPGTSAPSGE
TSESRSSVPG GSETTQQPGA GSESPTLSPG VTRTTALRGS ETRVPSTGVS GLPGSTQGGS
AATGGSGAGS GPTAPVSGET RTSVISGTNV PVSGAPVTPG SSAGSSGAPG AGGPGSETAS
PLSGAAGTSA TGSGTSIPPS GAPVTPEPPL ISTGASAGPP ASSESTVTLP GATGTDVLRS
GTSLPVSGGA VTPASSPGGS SATAGPAVGS ETTVQVSGAT GTDVLRSGTS LPVSGAAVSP
GSSPGRSRAT AVSGEGSQPT VALSGATGTS AGPSGTRSSS SGIPATPGST TGRAAGAGTP
GVDSQQTARL PAAARTTAPG SGSSAPSGET SESRSSVPGG SETTQQPGAG SEPTTLSPGV
TRTTALRGSE TGVPSTGVSG LPGSTQGGSA ATGSSGAGSE PTAPVSGETR TSVISGANVP
VSGAPVTPGS SAGSSAAPGA RAPGSETTSP LSGAAGTSAI GSGTSIPPSG APVTPEPPLR
STEASARPPA SSESTVTLPG ATGTDVLRPG TSLPVSGGAV TPASSPGGSS ATAGPGVGSE
TTVQVSGATG ADVLRSGTSL PVSGAAVSPG SSPGRSGATA VSGEGSQPTV ALSGATGTSA
GPSGTRFSSS GIPVTPGSTT GRAAGAGTPG VDSQQTARLP AAARTTAPGS GSSAPSGETS
ESRSSVPGGS ETTQQPGAGS EPTTLSPGVT RTTALRGSET GVPSTGVSGL PGSTQGGSAA
TGSSGAGSEP TAPVSGETRT SVISGANVPV SGAPVTPGSS AGSSAAPGAR APGSETTSPL
SGAAGTSAIG SGTSIPPSGA PVTPEPPLRS TEASARPPAS SESTVTLPGA TGTDVLRPGT
SLPVSGGAVT PASSPGGSSA TAGPGVGSET TVQVSGATGA DVLRSGTSLP VSGAAVSPGS
SPGRSGATAV SGEGSQPTVA LSGATGTSAG PSGTRFSSSG IPATPGSTTG RAAGAGTPGV
DSQQTARLPA AARTTAPGSG SSAPSGETSE SRSSVPGGSE TTQQPGAGSE PTTLSPGVTR
TTALRGSETG VPSTGVSGLP GSTQGGSAAT GSSGAGSEPT APVSGETRTS VISGANVPVS
GAPVTPGSSA GSSAAPGARA PGSETTSPLS GAAGTSAIGS GTSIPPSGAP VTPEPPLRST
EASARPPASS ESTVTLPGAT GTDVLRPGTS LPVSGGAVTP ASSPGGSSAT AGPGVGSETT
VQVSGETATH VKGSNTNESS TEISKTTGAT AGLTLTSKSS IISSATRALS SSVTKATVTY
DVVSWTTGSS SGRSRTNVIE SASSVSSAEQ IAPSLSTNGL AGTTRISDVV ARTIRPSYGI
SGTTGSSIDE IVTTNTSPEF TETNRFSVVR LRTTRPSSGE IGTTLTESST SASSSEESGT
TGSIAGLRRT NRISLIRSGT TRPSSGETQT TVIESRVSGS SDQGLGTIGS TAGLMRTTRI
SVVVSGTTGP SSGKTGSTLS EFRTSGSLVK GSETTESTTG LARMTRISXG GSRTTRPSSG
ETGTTVIESR TSGSPSEGLG RTGSTAGLTR TTSISVVGSA TTEPSSRETE TTVTESXNNG
SLGEGSGTTG AIAGLTRTTR ISGVGSGTTR PSSGETRTTV IKSITRRTSA EGSETTGSAG
GLIIATRISS ADLLTPGPLS GETRTTVIGS GTSGKSGEVS GLTQSPAELT TTTRISHVAS
GTRAPSSGMT RTTVTSGVAS RTSGLSSGEK GTSVTETRTS GSSIEGSQTT GTADRLTITT
RTSVVVSGTD APSSGTSGTI RSSVDLTGTT KVSVVGTGTI EPSTVESWTT EPRDLGSSTT
LFSAGAIGTT RPGTSGASRP SVVGSETAGP LSAKTETTVI RSGSSGSSLE GRGTSGSTDG
LTGTTTISFV GLGTTGPSAR GSRPTGKGDI RSSTTVSSVD ATGNIRSGGS GTTGPSIVGS
ETVGPSSGEA GTTATGSGTS GKSAERLGTT VSTDGLRRTT RISLVSLGTT GPSSGVMRTT
QTSIVGLETT RSSTGVLATT STSAEGLXTT GPSPGGLWTT GTSVEGSETT ESSTGKITGA
RRTTWESGSE VATYEGTSGK FSKAAISGSS HTEATTLIVS NSTSGTGLRP EDNTAVAGGQ
ATGRVTGTTK VIPGTTVAPG SSNTESTTSL GESRTRIGRI TGATTGTSER SSPGSKTGNT
GAISGTTVAP RSSNTGATTS LGSGETSQGG IKIVTMGVTT GTTIAPGSSN TKATTPTEVR
TTTEVRTATE TTTSRHSSDA TGSGIQTGIT GTGSGTTSSP GGFNAEATTF KEHVRTTETR
ILSGTTRGAS GTTVIPESSN TGTSTGVGRQ TSTAVVSGRV TGVSESSSPG TSKEASETTT
GPGISTTGST SKSNRITTSS RIPYPETTVV ATGEQETETK TGCTTSLPPP PACYGPLGEK
KSPGDIWTAN CHKCTCTDAE TVDCKLKECP SPPTCKPEER LVKFKDNDTC CEIAYCEPRT
CLFNNNDYEV GASFADPNNP CISYSCHNTG FVAVVQDCPK QTWCAEEDRV YDSTKCCYTC
KPYCRSSSVN VTVNYNGCKK KVEMARCAGE CKKTIKYDYD IFQLKNSCLC CQEENYEYRE
IDLDCPDGGT IPYRYRHIIT CSCLDICQQS MTSTVS
//