ID VW5B2_MOUSE Reviewed; 1248 AA.
AC Q3UR50; Q2TB01; Q80WS8; Q8BR57;
DT 10-JUN-2008, integrated into UniProtKB/Swiss-Prot.
DT 10-JUN-2008, sequence version 2.
DT 03-AUG-2022, entry version 104.
DE RecName: Full=von Willebrand factor A domain-containing protein 5B2;
GN Name=Vwa5b2;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3).
RC STRAIN=C57BL/6J; TISSUE=Corpora quadrigemina, and Spinal ganglion;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1067-1248 (ISOFORM 1).
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q3UR50-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q3UR50-2; Sequence=VSP_034144, VSP_034145;
CC Name=3;
CC IsoId=Q3UR50-5; Sequence=VSP_034141, VSP_034146;
CC -!- SEQUENCE CAUTION:
CC Sequence=AAI10636.1; Type=Miscellaneous discrepancy; Note=Probable cloning artifact.; Evidence={ECO:0000305};
CC Sequence=AAI25017.1; Type=Miscellaneous discrepancy; Note=Probable cloning artifact.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK045595; BAC32429.1; -; mRNA.
DR EMBL; AK141799; BAE24838.1; -; mRNA.
DR EMBL; AC087898; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC110635; AAI10636.1; ALT_SEQ; mRNA.
DR EMBL; BC125016; AAI25017.1; ALT_SEQ; mRNA.
DR CCDS; CCDS37288.1; -. [Q3UR50-2]
DR CCDS; CCDS49792.1; -. [Q3UR50-1]
DR RefSeq; NP_001138425.1; NM_001144953.1. [Q3UR50-1]
DR RefSeq; NP_872574.2; NM_182636.4. [Q3UR50-2]
DR RefSeq; XP_006522361.1; XM_006522298.3.
DR AlphaFoldDB; Q3UR50; -.
DR STRING; 10090.ENSMUSP00000097652; -.
DR iPTMnet; Q3UR50; -.
DR PhosphoSitePlus; Q3UR50; -.
DR PaxDb; Q3UR50; -.
DR PRIDE; Q3UR50; -.
DR ProteomicsDB; 297830; -. [Q3UR50-1]
DR ProteomicsDB; 297831; -. [Q3UR50-2]
DR ProteomicsDB; 297832; -. [Q3UR50-5]
DR Antibodypedia; 52386; 51 antibodies from 10 providers.
DR Ensembl; ENSMUST00000096197; ENSMUSP00000093911; ENSMUSG00000046613. [Q3UR50-1]
DR Ensembl; ENSMUST00000100074; ENSMUSP00000097652; ENSMUSG00000046613. [Q3UR50-2]
DR GeneID; 328643; -.
DR KEGG; mmu:328643; -.
DR UCSC; uc007yqb.1; mouse. [Q3UR50-2]
DR UCSC; uc012acu.1; mouse. [Q3UR50-1]
DR UCSC; uc029swo.1; mouse. [Q3UR50-5]
DR CTD; 90113; -.
DR MGI; MGI:2681859; Vwa5b2.
DR VEuPathDB; HostDB:ENSMUSG00000046613; -.
DR eggNOG; ENOG502QW0V; Eukaryota.
DR GeneTree; ENSGT00940000157096; -.
DR HOGENOM; CLU_389132_0_0_1; -.
DR InParanoid; Q3UR50; -.
DR OMA; XESLAMA; -.
DR OrthoDB; 955432at2759; -.
DR PhylomeDB; Q3UR50; -.
DR BioGRID-ORCS; 328643; 2 hits in 71 CRISPR screens.
DR ChiTaRS; Vwa5b2; mouse.
DR PRO; PR:Q3UR50; -.
DR Proteomes; UP000000589; Chromosome 16.
DR RNAct; Q3UR50; protein.
DR Bgee; ENSMUSG00000046613; Expressed in primary visual cortex and 113 other tissues.
DR ExpressionAtlas; Q3UR50; baseline and differential.
DR Genevisible; Q3UR50; MM.
DR Gene3D; 3.40.50.410; -; 1.
DR InterPro; IPR013694; VIT.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR Pfam; PF13757; VIT_2; 1.
DR Pfam; PF13768; VWA_3; 1.
DR SUPFAM; SSF53300; SSF53300; 1.
DR PROSITE; PS51468; VIT; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Reference proteome.
FT CHAIN 1..1248
FT /note="von Willebrand factor A domain-containing protein
FT 5B2"
FT /id="PRO_0000339303"
FT DOMAIN 1..138
FT /note="VIT"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00801"
FT DOMAIN 354..527
FT /note="VWFA"
FT REGION 184..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 590..650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 672..710
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 751..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1008..1037
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1126..1168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 591..627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..706
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1152
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1154..1168
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..944
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_034141"
FT VAR_SEQ 574..602
FT /note="GQEPGWQSLAGSVFPSPEEVLSATSPGTE -> VGLGWGIPGGPGGRKSLYL
FT TRSLSLLRIL (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_034144"
FT VAR_SEQ 603..1248
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_034145"
FT VAR_SEQ 945..989
FT /note="VDATTREVLPGALQVWSSDPAELSGMSASQDQLAAAPLSTAVHSK -> MSA
FT SQDQLAAAPLSTAVHSKGRAQMQWIGHRWALDHLLVLGDPTV (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:16141072"
FT /id="VSP_034146"
SQ SEQUENCE 1248 AA; 133414 MW; 15918846B4268369 CRC64;
MPGLYCPTSW TPLPLTDSCV RAYAKGPCLS LRARLTYHNP QPQPVEGVFV YPLAEAEVVS
GFEAEAAGRR VSFQLHSRRR SQAACCRALG PGLGTSTPRR CAQGHLVLNL AQARSTLVLP
TGLVAAAGTM TVTLCSSREL PSRPDGVMHV ALPTVFTPLA QPNLPGSPRS PGLCDDSPTS
CFGVGSPEEE RPTWEQPTAT PDVFSGPARC PAPYTFSFEM LVTGPCLLAG LESPSHALRA
DALPHASSAA TIRVTLAEGH QCDRALEILL HPSEPHQPHL MLETGSLSSA EYEAQVRARH
DFQRLQQRDS GGERQVWFLQ RRFHKDILLN PVLVLNFCPD LSSKPGHLNA ATRELLFLLD
GSGAGHKDAI VLAVKSLPAQ TLVNLAIFGT LVQPLFPESR PCSDDTVQLI CESIETLQTV
NGPPDMLAVL DWALGQPQHR AYPRQMFLIT AASPTAATTH QALEFMRWHR GAARCFSFAL
APACRQLLHD LSVLSRGQAY FLRPGERLQP KLVQALRKAL EPALSDISVD WFVPDAVEAL
LTPREIPALY PGDQLLGYCS LFRVDGFRSH ALGGQEPGWQ SLAGSVFPSP EEVLSATSPG
TEPTHTTEPL GTGTVSAELS SPWAVGDSEQ SMEALTDPVM DPGPNPSSDT AIWRRIFQSS
YIREQYVLTH CSASPEPGPG STCSSESPGS QGPGSPSGSR PLDPPSQQGC RSLAWVEPAG
SRSCPLPVPP PSPFKVGAMS AEVLGRRQRA ALAGRSLSSP SGRANPVPGR ARHPSLDAIP
DGLGPEPGQQ LGQGLDDSGN LLSPAPLDWD MLMEPSFLFK PVPSSAESAP PAECLPPQAP
RCHVVIRALC GEQPMCWEVG VGLEELWGPG DGSQPESLPM REAAWDQALH RLTAASVVQD
NEQLALRGRA ETRAEQGRVR RSWLRAIQTS KVSSAPSCFT CPVAVDATTR EVLPGALQVW
SSDPAELSGM SASQDQLAAA PLSTAVHSKG HQGGCSAGAW DLDLNDNSKS ALGEPISPTG
DHHGLPHQPP ASSRLSLGRH RRLCSSNKGQ THENSNDGSN HDYLPLVRLQ EAPGSFRLDE
PFCAAVCIPQ ERLCRASPFA AHRASLSPTS ASSPWAFLSP GIGQGDSATA SCSQSPSSGS
EGPGQVDSGR GSDTEASEGM ERQDSSDLRG RTWATAVALA WLEHRCAAAF GEWELTASKA
DCWLRAQHLP DGLDLTALKA AARGLFLLLR HWDQNLQLHL LCYSPSNV