SH321_HUMAN
ID SH321_HUMAN Reviewed; 640 AA.
AC A4FU49; B4DLI6; D3DPS6; J3KQM5; Q5VTK7; Q86XZ6; Q8N445; Q96DN4; Q9H5W5;
DT 20-MAY-2008, integrated into UniProtKB/Swiss-Prot.
DT 20-MAY-2008, sequence version 2.
DT 03-AUG-2022, entry version 121.
DE RecName: Full=SH3 domain-containing protein 21;
GN Name=SH3D21; Synonyms=C1orf113;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3).
RC TISSUE=Signet-ring cell carcinoma, Teratocarcinoma, and
RC Umbilical cord blood;
RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H.,
RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M.,
RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K.,
RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T.,
RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M.,
RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S.,
RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H.,
RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,
RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N.,
RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y.,
RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K.,
RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T.,
RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T.,
RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y.,
RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H.,
RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y.,
RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H.,
RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O.,
RA Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";
RL Nat. Genet. 36:40-45(2004).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16710414; DOI=10.1038/nature04727;
RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A.,
RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C.,
RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K.,
RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C.,
RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W.,
RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J.,
RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J.,
RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y.,
RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J.,
RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H.,
RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L.,
RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J.,
RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S.,
RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K.,
RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R.,
RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M.,
RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S.,
RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J.,
RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W.,
RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N.,
RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V.,
RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J.,
RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E.,
RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S.,
RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M.,
RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H.,
RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E.,
RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G.,
RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.;
RT "The DNA sequence and biological annotation of human chromosome 1.";
RL Nature 441:315-321(2006).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 162-640 (ISOFORMS 3 AND 4), AND
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 379-640 (ISOFORM 2).
RC TISSUE=Brain, Liver, and Ovary;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=5;
CC Name=1;
CC IsoId=A4FU49-1; Sequence=Displayed;
CC Name=2;
CC IsoId=A4FU49-3; Sequence=VSP_040924;
CC Name=3;
CC IsoId=A4FU49-4; Sequence=VSP_040923, VSP_040925, VSP_040926;
CC Name=4;
CC IsoId=A4FU49-5; Sequence=VSP_040927, VSP_040928;
CC Name=5;
CC IsoId=A4FU49-6; Sequence=VSP_045936;
CC -!- MISCELLANEOUS: [Isoform 1]: Gene prediction based on EST data and
CC similarity to mouse and macaca fascicularis orthologs.
CC -!- MISCELLANEOUS: [Isoform 3]: May be produced at very low levels due to a
CC premature stop codon in the mRNA, leading to nonsense-mediated mRNA
CC decay. {ECO:0000305}.
CC -!- MISCELLANEOUS: [Isoform 4]: May be produced at very low levels due to a
CC premature stop codon in the mRNA, leading to nonsense-mediated mRNA
CC decay. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAI01677.1; Type=Erroneous translation; Note=Wrong choice of CDS.; Evidence={ECO:0000305};
CC Sequence=AAI01679.1; Type=Erroneous translation; Note=Wrong choice of CDS.; Evidence={ECO:0000305};
CC Sequence=BAB15504.1; Type=Erroneous initiation; Note=Truncated N-terminus.; Evidence={ECO:0000305};
CC Sequence=BAB71191.1; Type=Erroneous translation; Note=Wrong choice of CDS.; Evidence={ECO:0000305};
CC Sequence=EAX07375.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC Sequence=EAX07376.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK026591; BAB15504.1; ALT_INIT; mRNA.
DR EMBL; AK297015; BAG59548.1; -; mRNA.
DR EMBL; AK056459; BAB71191.1; ALT_SEQ; mRNA.
DR EMBL; AL591845; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CH471059; EAX07374.1; -; Genomic_DNA.
DR EMBL; CH471059; EAX07375.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CH471059; EAX07376.1; ALT_SEQ; Genomic_DNA.
DR EMBL; BC036763; AAH36763.1; -; mRNA.
DR EMBL; BC048273; AAH48273.1; -; mRNA.
DR EMBL; BC101676; AAI01677.1; ALT_SEQ; mRNA.
DR EMBL; BC101678; AAI01679.1; ALT_SEQ; mRNA.
DR CCDS; CCDS30674.2; -. [A4FU49-3]
DR RefSeq; NP_001156002.1; NM_001162530.1. [A4FU49-6]
DR RefSeq; NP_078952.4; NM_024676.4. [A4FU49-3]
DR RefSeq; XP_016857835.1; XM_017002346.1.
DR AlphaFoldDB; A4FU49; -.
DR BioGRID; 122844; 2.
DR IntAct; A4FU49; 6.
DR MINT; A4FU49; -.
DR STRING; 9606.ENSP00000421294; -.
DR iPTMnet; A4FU49; -.
DR PhosphoSitePlus; A4FU49; -.
DR BioMuta; SH3D21; -.
DR EPD; A4FU49; -.
DR jPOST; A4FU49; -.
DR MassIVE; A4FU49; -.
DR MaxQB; A4FU49; -.
DR PaxDb; A4FU49; -.
DR PeptideAtlas; A4FU49; -.
DR PRIDE; A4FU49; -.
DR ProteomicsDB; 663; -. [A4FU49-1]
DR ProteomicsDB; 664; -. [A4FU49-3]
DR ProteomicsDB; 665; -. [A4FU49-4]
DR ProteomicsDB; 666; -. [A4FU49-5]
DR Antibodypedia; 51050; 60 antibodies from 10 providers.
DR DNASU; 79729; -.
DR Ensembl; ENST00000453908.8; ENSP00000403476.2; ENSG00000214193.11. [A4FU49-6]
DR Ensembl; ENST00000505871.7; ENSP00000421294.1; ENSG00000214193.11. [A4FU49-3]
DR GeneID; 79729; -.
DR KEGG; hsa:79729; -.
DR MANE-Select; ENST00000453908.8; ENSP00000403476.2; NM_001162530.2; NP_001156002.1. [A4FU49-6]
DR UCSC; uc010oia.2; human. [A4FU49-1]
DR CTD; 79729; -.
DR DisGeNET; 79729; -.
DR GeneCards; SH3D21; -.
DR HGNC; HGNC:26236; SH3D21.
DR HPA; ENSG00000214193; Tissue enhanced (intestine, thyroid gland).
DR neXtProt; NX_A4FU49; -.
DR OpenTargets; ENSG00000214193; -.
DR PharmGKB; PA142672497; -.
DR VEuPathDB; HostDB:ENSG00000214193; -.
DR eggNOG; KOG4348; Eukaryota.
DR GeneTree; ENSGT00940000160627; -.
DR HOGENOM; CLU_029596_1_0_1; -.
DR InParanoid; A4FU49; -.
DR OMA; SKAKPSW; -.
DR OrthoDB; 1577081at2759; -.
DR PhylomeDB; A4FU49; -.
DR TreeFam; TF351294; -.
DR PathwayCommons; A4FU49; -.
DR SignaLink; A4FU49; -.
DR BioGRID-ORCS; 79729; 8 hits in 1040 CRISPR screens.
DR ChiTaRS; SH3D21; human.
DR GenomeRNAi; 79729; -.
DR Pharos; A4FU49; Tdark.
DR PRO; PR:A4FU49; -.
DR Proteomes; UP000005640; Chromosome 1.
DR RNAct; A4FU49; protein.
DR Bgee; ENSG00000214193; Expressed in right lobe of thyroid gland and 145 other tissues.
DR ExpressionAtlas; A4FU49; baseline and differential.
DR Genevisible; A4FU49; HS.
DR GO; GO:0005654; C:nucleoplasm; IDA:HPA.
DR GO; GO:0005886; C:plasma membrane; IDA:HPA.
DR GO; GO:0007015; P:actin filament organization; IBA:GO_Central.
DR GO; GO:0016477; P:cell migration; IBA:GO_Central.
DR CDD; cd12142; SH3_D21-like; 1.
DR InterPro; IPR036028; SH3-like_dom_sf.
DR InterPro; IPR001452; SH3_domain.
DR InterPro; IPR035468; SH3D21_SH3.
DR Pfam; PF07653; SH3_2; 1.
DR PRINTS; PR00452; SH3DOMAIN.
DR SMART; SM00326; SH3; 1.
DR SUPFAM; SSF50044; SSF50044; 1.
DR PROSITE; PS50002; SH3; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Coiled coil; Reference proteome; SH3 domain.
FT CHAIN 1..640
FT /note="SH3 domain-containing protein 21"
FT /id="PRO_0000337129"
FT DOMAIN 65..126
FT /note="SH3"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00192"
FT REGION 1..60
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 133..361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..551
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 618..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 572..626
FT /evidence="ECO:0000255"
FT COMPBIAS 158..196
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 203..227
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..257
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 460..535
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..96
FT /note="MVQSELQLQPRAGGRAEAASWGDRGNDKGGLGNPDMPSVSPGPQRPPKLSSL
FT AYDSPPDYLQTVSHPEVYRVLFDYQPEAPDELALRRGDVVKVLS -> MIKEIEDGWWL
FT GKRNGQLGAFPSNFVELLDSGPPSLGNPDMPSVSPGPQRPP (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040923"
FT VAR_SEQ 1..30
FT /note="MVQSELQLQPRAGGRAEAASWGDRGNDKGG -> MIKEIEDGWWLGKKNGQL
FT GAFPSNFVELLDSGPPS (in isoform 2)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040924"
FT VAR_SEQ 1..30
FT /note="MVQSELQLQPRAGGRAEAASWGDRGNDKGG -> MEVLVLAGYRAQKEDELS
FT LAPGDVVRQVRWVPARGWLRGEFGGRYGLFPERLVQEIPETLRGSGEARRPRCARRRGH
FT PAKHPRPQRWCKVNFSYSPEQADELKLQAGEIVEMIKEIEDGWWLGKKNGQLGAFPSNF
FT VELLDSGPPS (in isoform 5)"
FT /evidence="ECO:0000305"
FT /id="VSP_045936"
FT VAR_SEQ 173..179
FT /note="TSRTPSR -> RKSTAAR (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040925"
FT VAR_SEQ 180..640
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:14702039,
FT ECO:0000303|PubMed:15489334"
FT /id="VSP_040926"
FT VAR_SEQ 191..232
FT /note="PNGGFQSGGSYHPGRKRSKTQTPQQRSVSSQEEEHSSPVKAP -> GRAQQP
FT GKGPLCEENPHAGQDCHPREAPSSRERPQLQEDPGS (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_040927"
FT VAR_SEQ 233..640
FT /note="Missing (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_040928"
FT VARIANT 217
FT /note="S -> A (in dbSNP:rs12121759)"
FT /id="VAR_043619"
FT VARIANT 455
FT /note="A -> S (in dbSNP:rs12121759)"
FT /id="VAR_056763"
FT CONFLICT 327
FT /note="K -> E (in Ref. 1; BAB71191)"
FT /evidence="ECO:0000305"
FT CONFLICT 413
FT /note="E -> G (in Ref. 1; BAB71191)"
FT /evidence="ECO:0000305"
FT CONFLICT 541..542
FT /note="TR -> NS (in Ref. 1; BAB71191)"
FT /evidence="ECO:0000305"
FT CONFLICT 582
FT /note="E -> G (in Ref. 1; BAB71191)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 640 AA; 70519 MW; 59992262EE553FA3 CRC64;
MVQSELQLQP RAGGRAEAAS WGDRGNDKGG LGNPDMPSVS PGPQRPPKLS SLAYDSPPDY
LQTVSHPEVY RVLFDYQPEA PDELALRRGD VVKVLSKTTE DKGWWEGECQ GRRGVFPDNF
VLPPPPIKKL VPRKVVSRES APIKEPKKLM PKTSLPTVKK LATATTGPSK AKTSRTPSRD
SQKLTSRDSG PNGGFQSGGS YHPGRKRSKT QTPQQRSVSS QEEEHSSPVK APSVKRTPMP
DKTATPERPP APENAPSSKK IPAPDKVPSP EKTLTLGDKA SIPGNSTSGK IPAPDKVPTP
EKMVTPEDKA SIPENSIIPE ETLTVDKPST PERVFSVEES PALEAPPMDK VPNPKMAPLG
DEAPTLEKVL TPELSEEEVS TRDDIQFHHF SSEEALQKVK YFVAKEDPSS QEEAHTPEAP
PPQPPSSERC LGEMKCTLVR GDSSPRQAEL KSGPASRPAL EKPHPHEEAT TLPEEAPSND
ERTPEEEAPP NEQRPLREEV LPKEGVASKE EVTLKEELPP KEEVAPKEEV PPIERAFAQK
TRPIKPPPDS QETLALPSLV PQNYTENKNE GVDVTSLRGE VESLRRALEL MEVQLERKLT
DIWEELKSEK EQRRRLEVQV MQGTQKSQTP RVIHTQTQTY