ANKHM_CAEEL
ID ANKHM_CAEEL Reviewed; 2620 AA.
AC Q21920; A3RMT6; A3RMT7; D3YT52; L8E837; L8E927; L8EC41; Q21927; Q9TW88;
DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot.
DT 22-SEP-2009, sequence version 3.
DT 03-AUG-2022, entry version 170.
DE RecName: Full=Ankyrin repeat and KH domain-containing protein mask-1 {ECO:0000305};
DE AltName: Full=Multiple ankyrin repeats single KH domain homolog {ECO:0000312|WormBase:R11A8.7a};
GN Name=mask-1 {ECO:0000312|WormBase:R11A8.7a};
GN ORFNames=R11A8.7 {ECO:0000312|WormBase:R11A8.7a};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1] {ECO:0000312|EMBL:CAM35836.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000312|EMBL:CAM35836.1};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000250|UniProtKB:Q9VCA8}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=8;
CC Name=a {ECO:0000312|WormBase:R11A8.7a};
CC IsoId=Q21920-1; Sequence=Displayed;
CC Name=b {ECO:0000312|WormBase:R11A8.7b};
CC IsoId=Q21920-2; Sequence=VSP_052628;
CC Name=c {ECO:0000312|WormBase:R11A8.7c};
CC IsoId=Q21920-3; Sequence=VSP_052627, VSP_052628;
CC Name=d {ECO:0000312|WormBase:R11A8.7d};
CC IsoId=Q21920-4; Sequence=VSP_052627;
CC Name=e {ECO:0000312|WormBase:R11A8.7e};
CC IsoId=Q21920-5; Sequence=VSP_053226, VSP_052628;
CC Name=f {ECO:0000312|WormBase:R11A8.7f};
CC IsoId=Q21920-6; Sequence=VSP_053227;
CC Name=g {ECO:0000312|WormBase:R11A8.7g};
CC IsoId=Q21920-7; Sequence=VSP_053227, VSP_052628;
CC Name=h {ECO:0000312|WormBase:R11A8.7h};
CC IsoId=Q21920-8; Sequence=VSP_053226, VSP_053227, VSP_052628;
CC -!- SIMILARITY: Belongs to the mask family. {ECO:0000255}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284604; CAA94370.2; -; Genomic_DNA.
DR EMBL; BX284604; CAB54294.2; -; Genomic_DNA.
DR EMBL; BX284604; CAM35836.1; -; Genomic_DNA.
DR EMBL; BX284604; CAM35837.1; -; Genomic_DNA.
DR EMBL; BX284604; CCQ25661.1; -; Genomic_DNA.
DR EMBL; BX284604; CCQ25715.1; -; Genomic_DNA.
DR EMBL; BX284604; CCQ25716.1; -; Genomic_DNA.
DR EMBL; BX284604; CBK19467.1; -; Genomic_DNA.
DR PIR; T24157; T24157.
DR PIR; T24158; T24158.
DR RefSeq; NP_001122794.1; NM_001129322.2. [Q21920-3]
DR RefSeq; NP_001122795.1; NM_001129323.2. [Q21920-4]
DR RefSeq; NP_001255486.1; NM_001268557.1. [Q21920-5]
DR RefSeq; NP_001263772.1; NM_001276843.1. [Q21920-6]
DR RefSeq; NP_001263773.1; NM_001276844.1. [Q21920-7]
DR RefSeq; NP_001263774.1; NM_001276845.1.
DR RefSeq; NP_501915.2; NM_069514.2. [Q21920-1]
DR RefSeq; NP_501916.2; NM_069515.3. [Q21920-2]
DR AlphaFoldDB; Q21920; -.
DR SMR; Q21920; -.
DR BioGRID; 43029; 3.
DR IntAct; Q21920; 2.
DR STRING; 6239.R11A8.7f; -.
DR EPD; Q21920; -.
DR PaxDb; Q21920; -.
DR PeptideAtlas; Q21920; -.
DR PRIDE; Q21920; -.
DR EnsemblMetazoa; R11A8.7a.1; R11A8.7a.1; WBGene00011240. [Q21920-1]
DR EnsemblMetazoa; R11A8.7b.1; R11A8.7b.1; WBGene00011240. [Q21920-2]
DR EnsemblMetazoa; R11A8.7c.1; R11A8.7c.1; WBGene00011240. [Q21920-3]
DR EnsemblMetazoa; R11A8.7d.1; R11A8.7d.1; WBGene00011240. [Q21920-4]
DR EnsemblMetazoa; R11A8.7e.1; R11A8.7e.1; WBGene00011240. [Q21920-5]
DR EnsemblMetazoa; R11A8.7f.1; R11A8.7f.1; WBGene00011240. [Q21920-6]
DR EnsemblMetazoa; R11A8.7g.1; R11A8.7g.1; WBGene00011240. [Q21920-7]
DR EnsemblMetazoa; R11A8.7h.1; R11A8.7h.1; WBGene00011240. [Q21920-8]
DR EnsemblMetazoa; R11A8.7h.2; R11A8.7h.2; WBGene00011240. [Q21920-8]
DR GeneID; 177927; -.
DR UCSC; R11A8.7a; c. elegans. [Q21920-1]
DR CTD; 177927; -.
DR WormBase; R11A8.7a; CE43470; WBGene00011240; mask-1. [Q21920-1]
DR WormBase; R11A8.7b; CE43456; WBGene00011240; mask-1. [Q21920-2]
DR WormBase; R11A8.7c; CE40772; WBGene00011240; mask-1. [Q21920-3]
DR WormBase; R11A8.7d; CE40773; WBGene00011240; mask-1. [Q21920-4]
DR WormBase; R11A8.7e; CE44619; WBGene00011240; mask-1. [Q21920-5]
DR WormBase; R11A8.7f; CE48093; WBGene00011240; mask-1. [Q21920-6]
DR WormBase; R11A8.7g; CE48142; WBGene00011240; mask-1. [Q21920-7]
DR WormBase; R11A8.7h; CE48120; WBGene00011240; mask-1. [Q21920-8]
DR eggNOG; KOG4369; Eukaryota.
DR GeneTree; ENSGT00940000174194; -.
DR InParanoid; Q21920; -.
DR OMA; GWLEMER; -.
DR OrthoDB; 74671at2759; -.
DR PhylomeDB; Q21920; -.
DR SignaLink; Q21920; -.
DR PRO; PR:Q21920; -.
DR Proteomes; UP000001940; Chromosome IV.
DR Bgee; WBGene00011240; Expressed in adult organism and 4 other tissues.
DR ExpressionAtlas; Q21920; baseline and differential.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR Gene3D; 1.25.40.20; -; 6.
DR Gene3D; 3.30.1370.10; -; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR Pfam; PF12796; Ank_2; 5.
DR Pfam; PF13606; Ank_3; 1.
DR Pfam; PF13637; Ank_4; 1.
DR Pfam; PF00013; KH_1; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 22.
DR SMART; SM00322; KH; 1.
DR SUPFAM; SSF48403; SSF48403; 3.
DR SUPFAM; SSF54791; SSF54791; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 2.
DR PROSITE; PS50088; ANK_REPEAT; 15.
DR PROSITE; PS50084; KH_TYPE_1; 1.
PE 3: Inferred from homology;
KW Alternative splicing; ANK repeat; Coiled coil; Cytoplasm;
KW Reference proteome; Repeat; RNA-binding.
FT CHAIN 1..2620
FT /note="Ankyrin repeat and KH domain-containing protein
FT mask-1"
FT /id="PRO_0000312832"
FT REPEAT 254..283
FT /note="ANK 1"
FT /evidence="ECO:0000255"
FT REPEAT 288..318
FT /note="ANK 2"
FT /evidence="ECO:0000255"
FT REPEAT 361..390
FT /note="ANK 3"
FT /evidence="ECO:0000255"
FT REPEAT 402..431
FT /note="ANK 4"
FT /evidence="ECO:0000255"
FT REPEAT 437..466
FT /note="ANK 5"
FT /evidence="ECO:0000255"
FT REPEAT 470..502
FT /note="ANK 6"
FT /evidence="ECO:0000255"
FT REPEAT 507..536
FT /note="ANK 7"
FT /evidence="ECO:0000255"
FT REPEAT 538..566
FT /note="ANK 8"
FT /evidence="ECO:0000255"
FT REPEAT 568..597
FT /note="ANK 9"
FT /evidence="ECO:0000255"
FT REPEAT 600..629
FT /note="ANK 10"
FT /evidence="ECO:0000255"
FT REPEAT 634..663
FT /note="ANK 11"
FT /evidence="ECO:0000255"
FT REPEAT 667..697
FT /note="ANK 12"
FT /evidence="ECO:0000255"
FT REPEAT 1234..1263
FT /note="ANK 13"
FT /evidence="ECO:0000255"
FT REPEAT 1267..1296
FT /note="ANK 14"
FT /evidence="ECO:0000255"
FT REPEAT 1301..1330
FT /note="ANK 15"
FT /evidence="ECO:0000255"
FT REPEAT 1334..1363
FT /note="ANK 16"
FT /evidence="ECO:0000255"
FT REPEAT 1369..1398
FT /note="ANK 17"
FT /evidence="ECO:0000255"
FT REPEAT 1403..1432
FT /note="ANK 18"
FT /evidence="ECO:0000255"
FT REPEAT 1436..1465
FT /note="ANK 19"
FT /evidence="ECO:0000255"
FT REPEAT 1471..1500
FT /note="ANK 20"
FT /evidence="ECO:0000255"
FT REPEAT 1504..1533
FT /note="ANK 21"
FT /evidence="ECO:0000255"
FT REPEAT 1537..1566
FT /note="ANK 22"
FT /evidence="ECO:0000255"
FT DOMAIN 1807..1873
FT /note="KH"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00117"
FT REGION 699..726
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 994..1032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1192..1229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1621..1720
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1759..1804
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1899..1962
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1976..2010
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2067..2143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2267..2294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2307..2343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2372..2391
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2429..2448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2496..2620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1596..1648
FT /evidence="ECO:0000255"
FT COMPBIAS 1008..1029
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1192..1210
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1211..1226
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1660..1683
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1684..1701
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1770..1804
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1899..1946
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1981..2010
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2067..2083
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2100..2143
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2320..2343
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2496..2532
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2550..2620
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..1193
FT /note="Missing (in isoform c and isoform d)"
FT /evidence="ECO:0000305"
FT /id="VSP_052627"
FT VAR_SEQ 1..1115
FT /note="Missing (in isoform e and isoform h)"
FT /evidence="ECO:0000305"
FT /id="VSP_053226"
FT VAR_SEQ 1136
FT /note="L -> FSV (in isoform f, isoform g and isoform h)"
FT /evidence="ECO:0000305"
FT /id="VSP_053227"
FT VAR_SEQ 1906..1927
FT /note="Missing (in isoform b, isoform c, isoform e, isoform
FT g and isoform h)"
FT /evidence="ECO:0000305"
FT /id="VSP_052628"
SQ SEQUENCE 2620 AA; 287078 MW; 626D93A935EFAA24 CRC64;
MAHLNMFNHL IPLEMDMDGA DEEKRQRVFS SFYHFGARLY DCLHTIAIEI EDDDCEHFPT
KAISTVLKIL NFEHFLSSDK CRFPPVSNLL DIIESRQMDD CQILFKISDL LEDHKSEKPE
RFGPYNPLDP KKVPIKNAVD ALTSVSSMAY SFLATTFAEE LMKAAIRDIY VFDEDSLEDN
DETQLSDENG VFDSTPSNNE KKDSIINFRQ LPSIDSQIVQ QNAMLLLAAR VGIEQFVEYS
HEIGVMQFRG DKLSKITPLM EAAASSSETI VRRLLELGAD PNVASIPNCN TALIYAASTD
GRDVVREILM TEGPKKPDVY LINNHYHDAM MEVALVGGTD TLKEFLEMGY RPRFLNLRQQ
ERDSALTLSA QKGHIKIVTA IMDYYEKNPP QTEEEKQELC LERYSALMEA AMEGHIDVCK
LMLSRGTPAD LCTEVTIEPS PLIVASAGGY PEVVEVLLAA GAKIEELSNK KNTPLMEACA
GDQGDQAGVV KLLLSKHAEV DVSNPDTGDT PLSLAARNGY IAIMKMLIEK GGDLTAGKTS
PIVEAARNGH LECIQFILAH CKTIPQDQLS RALVSAADFG SLLIVEEVIR AGADLNFEQD
ERTALMKAAK GDHFEVVQLL LSKGASVNFK SSKNDATALS LACSEGNMEI AEFLIRNGAD
PMLKMDDGVN CFMEVARHGS IDLMSLLVEF TKGNMPMDKD PPKLGITRCS SKNGKKRRKG
MPSGQDMLSM FNGMYPKRKG SKQMGLHEMP FSTQEIDMLT HLLKMQQQMV AYETHKSTET
ETAEIKKVLR AIESVYGFTS EGKINFPPPP NRQDMDKLYN GELVPNIKLW AELVSHGWLE
MERKIGRPIE LSSFQQCNEG HSTNAAAAVS AVAAAATGMD SQTYLASVFA KMNNGEEMPR
VPATVGSLNA ASAAMTGISF HSDDAMRLFG GASFATKLLS DNKKPCNHQQ YASIHHIQEG
AFRAALIKMS SMFRERNGCA ISVRDMESNF PIEHQEERFG PSKPIPSGPK KTSLTAPNPA
DTSDVTTKQP GAMKKDGLEA KKIYPGIIKL AAEMEKLFRA NPTESNRDLA LTTAYIASAL
PDHFCSELQL ESGDRILKKL LSGLTEKQKN TVISRMRSVV NKESGSSLLR RSVDNLTDKR
IKEDYLKLFR DSTDCAFYDK CVQEKNHLLK AIEIQKKGKT SSGTLISTSS KSLMAKSVQS
QQQQGQLRRT HSEGDGAERA KSRSNAIDKA TETTLETPLT IACANGHKDI VELLLKEGAN
IEHRDKKGFS PLIIAATAGH SSVVEVLLKN HAAIEAQSDR TKDTALSLAC SGGRKDVVEL
LLAHGANKEH RNVSDYTPLS LASSGGYIEI VNMLLTAGSE INSRTGSKLG ISPLMLASMN
GHREATRVLL EKGSDINAQI ETNRNTALTL ASFQGRTEVV KLLLAYNANV EHRAKTGLTP
LMECASGGYV DVGNLLIAAG ADTNASPVQQ TKDTALTISA EKGHEKFVRM LLNGDAAVDV
RNKKGCTALW LACNGGYLST AQALLEKGAD PDMFDNRKIS PMMAAFRKGH VEIVKYMVNS
AKQFPNEQDL IRAQQTAETD DIKKKCGECI DIIRSAKKAQ AESAELAAQK LLELIDEEKV
QKEVKKQKQK DKKIKKKEEK KIKKQEAEPE PEPEPEPEPV PEPEPVVISE PVPEPVPIVV
EEPPKEPPKP RRNRRKTNPD GVPKGPKVVV EPKASIAEEP SEMPYEPIVV TIPPPAKIHA
PMVSPGSYSE SEEWCKAGKE GKKVKSSKKS GYGAPSSAGS SQAKESSTTS SVISDQTPPY
EIDTRNESSW KLTIPAYAAS RVIGKGGSNV NAVREATGAI IEINKIQESN KQAERTVLAK
GTPEMVRYAM NIINYMIYDA DVLVTDAIRT VLRGNLSVAS SFSSEGTSKS AVDSTYAPSS
IPKSLSSASI ARQSASPIPQ QSSQRSAKSH HHQKDSGGGN VWHQRMAARE EKVEPLMETK
RISQSPKQAP QIPSTQQQSK LQSRQDQASE TLVRVAPAEN FVAPVPASSI AAPSRPNQNV
LDRVIAPSLR REPTTTPLIQ PVHPVQSVQS VQHMQQQQLA RPEQKLAQPC LPDPIGQRYS
QPISRPQSSV QVSQSSFSKA PGTRPSTDFS RAPGPPQQQQ TQNNMTTARN ELFDEQLAFG
QFKPTGMNSV ATVIPAKPSV NNQNDDKNGN SDDFDFSKMR MFDEGKVGNI WGKSDEDSAW
GGLFSQFLPQ LGANSSLNNS NSKNDSSNEW GQNEFISQLL INSSLPNATS SPQGASTISS
APVQQPSTSS VTTGLSSLEL KGWMPASFAP SARDPNRSQP PLFARSQSNS VANSTSTNIQ
QQQQQQIQHL QQQQALQQQQ QRIQQFQQQY QQHQSQSSQQ PSDLMSSKFS MLQSQQQQQQ
LYNQMQSLGQ DGQGSNLYNH LAAQLLTHQE SSTGAPGPTS SQLANSYYPS PSYTDASVLG
QISMPSLSQR GIKQFDGFNN DSDGVIAAIL EQQQQQKKQG LAQQSFMHNS QQPQPFGAPS
NASANQSRLG MIQPRPQPPP FVAPQAPPGF SSLGNASSTT NPSRTSMQQM YQQYGQSSQQ
QPYGQMPQAM DWNRLGQQQQ SASGQQNHQS SSSNKWSSNW