SFI1_MOUSE
ID SFI1_MOUSE Reviewed; 1216 AA.
AC Q3UZY0; Q3KQN2; Q5NC00; Q6ZQ99; Q80ZJ4; Q80ZY0; Q8R0V8; Q9CTY8;
DT 20-MAY-2008, integrated into UniProtKB/Swiss-Prot.
DT 11-OCT-2005, sequence version 1.
DT 03-AUG-2022, entry version 106.
DE RecName: Full=Protein SFI1 homolog;
GN Name=Sfi1; Synonyms=Kiaa0542;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 5).
RC TISSUE=Embryonic tail;
RX PubMed=14621295; DOI=10.1093/dnares/10.4.167;
RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S.,
RA Saga Y., Nagase T., Ohara O., Koga H.;
RT "Prediction of the coding sequences of mouse homologues of KIAA gene: III.
RT The complete nucleotide sequences of 500 mouse KIAA-homologous cDNAs
RT identified by screening of terminal sequences of cDNA clones randomly
RT sampled from size-fractionated libraries.";
RL DNA Res. 10:167-180(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC STRAIN=C57BL/6J; TISSUE=Pituitary, and Tongue;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.,
RA Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R.,
RA Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T.,
RA Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A.,
RA Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B.,
RA Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M.,
RA Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S.,
RA Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D.,
RA Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M.,
RA Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H.,
RA Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V.,
RA Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S.,
RA Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H.,
RA Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N.,
RA Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F.,
RA Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G.,
RA Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z.,
RA Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S.,
RA Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K.,
RA Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R.,
RA van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H.,
RA Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M.,
RA Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C.,
RA Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S.,
RA Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K.,
RA Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M.,
RA Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C.,
RA Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A.,
RA Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C57BL/6J;
RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X.,
RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y.,
RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S.,
RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R.,
RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K.,
RA Eichler E.E., Ponting C.P.;
RT "Lineage-specific biology revealed by a finished genome assembly of the
RT mouse.";
RL PLoS Biol. 7:E1000112-E1000112(2009).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 3 AND 4), AND NUCLEOTIDE
RP SEQUENCE [LARGE SCALE MRNA] OF 357-1216 (ISOFORM 2).
RC STRAIN=Czech II, and FVB/N;
RC TISSUE=Colon, Eye, Mammary tumor, and Salivary gland;
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA project:
RT the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
CC -!- FUNCTION: Plays a role in the dynamic structure of centrosome-
CC associated contractile fibers via its interaction with CETN2.
CC {ECO:0000250}.
CC -!- SUBUNIT: Interacts with CETN2 (via C-terminus). {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytoskeleton, microtubule organizing
CC center, centrosome, centriole {ECO:0000250}. Note=Localized close to
CC the centriole. {ECO:0000250}.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=5;
CC Name=1;
CC IsoId=Q3UZY0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q3UZY0-2; Sequence=VSP_033714;
CC Name=3;
CC IsoId=Q3UZY0-3; Sequence=VSP_033711, VSP_033714, VSP_033715;
CC Name=4;
CC IsoId=Q3UZY0-4; Sequence=VSP_033709, VSP_033710;
CC Name=5;
CC IsoId=Q3UZY0-5; Sequence=VSP_033712, VSP_033713, VSP_033716;
CC -!- DOMAIN: CETN2-binding regions contain a conserved Trp residue in their
CC C-terminal ends, which seems critical for interaction with CETN2.
CC -!- SIMILARITY: Belongs to the SFI1 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAH48950.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK129159; BAC97969.1; -; mRNA.
DR EMBL; AK019095; BAB31543.1; -; mRNA.
DR EMBL; AK133558; BAE21725.1; -; mRNA.
DR EMBL; AL671968; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BX572640; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; BC026390; AAH26390.1; -; mRNA.
DR EMBL; BC046305; AAH46305.1; -; mRNA.
DR EMBL; BC048950; AAH48950.1; ALT_INIT; mRNA.
DR EMBL; BC106124; AAI06125.1; -; mRNA.
DR CCDS; CCDS36094.1; -. [Q3UZY0-1]
DR RefSeq; NP_084483.2; NM_030207.2. [Q3UZY0-1]
DR RefSeq; XP_006514959.1; XM_006514896.3. [Q3UZY0-1]
DR RefSeq; XP_006514961.1; XM_006514898.3. [Q3UZY0-2]
DR AlphaFoldDB; Q3UZY0; -.
DR SMR; Q3UZY0; -.
DR BioGRID; 219683; 2.
DR STRING; 10090.ENSMUSP00000080066; -.
DR iPTMnet; Q3UZY0; -.
DR PhosphoSitePlus; Q3UZY0; -.
DR jPOST; Q3UZY0; -.
DR PaxDb; Q3UZY0; -.
DR PRIDE; Q3UZY0; -.
DR ProteomicsDB; 261326; -. [Q3UZY0-1]
DR ProteomicsDB; 261327; -. [Q3UZY0-2]
DR ProteomicsDB; 261328; -. [Q3UZY0-3]
DR Antibodypedia; 45464; 56 antibodies from 17 providers.
DR Ensembl; ENSMUST00000066391; ENSMUSP00000067261; ENSMUSG00000023764. [Q3UZY0-2]
DR Ensembl; ENSMUST00000081318; ENSMUSP00000080066; ENSMUSG00000023764. [Q3UZY0-1]
DR GeneID; 78887; -.
DR KEGG; mmu:78887; -.
DR UCSC; uc007hrs.1; mouse. [Q3UZY0-2]
DR UCSC; uc007hrt.1; mouse. [Q3UZY0-1]
DR UCSC; uc007hrx.1; mouse. [Q3UZY0-3]
DR CTD; 9814; -.
DR MGI; MGI:1926137; Sfi1.
DR VEuPathDB; HostDB:ENSMUSG00000023764; -.
DR eggNOG; KOG4775; Eukaryota.
DR GeneTree; ENSGT00940000154110; -.
DR InParanoid; Q3UZY0; -.
DR OMA; QTHFCDW; -.
DR OrthoDB; 941268at2759; -.
DR PhylomeDB; Q3UZY0; -.
DR TreeFam; TF328940; -.
DR Reactome; R-MMU-2565942; Regulation of PLK1 Activity at G2/M Transition.
DR Reactome; R-MMU-380259; Loss of Nlp from mitotic centrosomes.
DR Reactome; R-MMU-380270; Recruitment of mitotic centrosome proteins and complexes.
DR Reactome; R-MMU-380284; Loss of proteins required for interphase microtubule organization from the centrosome.
DR Reactome; R-MMU-380320; Recruitment of NuMA to mitotic centrosomes.
DR Reactome; R-MMU-5620912; Anchoring of the basal body to the plasma membrane.
DR Reactome; R-MMU-8854518; AURKA Activation by TPX2.
DR BioGRID-ORCS; 78887; 22 hits in 62 CRISPR screens.
DR ChiTaRS; Sfi1; mouse.
DR PRO; PR:Q3UZY0; -.
DR Proteomes; UP000000589; Chromosome 11.
DR RNAct; Q3UZY0; protein.
DR Bgee; ENSMUSG00000023764; Expressed in bronchus and 267 other tissues.
DR ExpressionAtlas; Q3UZY0; baseline and differential.
DR Genevisible; Q3UZY0; MM.
DR GO; GO:0005814; C:centriole; IEA:UniProtKB-SubCell.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
DR GO; GO:0019902; F:phosphatase binding; ISS:UniProtKB.
DR InterPro; IPR030516; SFI1.
DR PANTHER; PTHR22028:SF4; PTHR22028:SF4; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Cytoplasm; Cytoskeleton; Reference proteome; Repeat.
FT CHAIN 1..1216
FT /note="Protein SFI1 homolog"
FT /id="PRO_0000334622"
FT REPEAT 114..146
FT /note="HAT 1"
FT REPEAT 148..177
FT /note="HAT 2"
FT REPEAT 246..278
FT /note="HAT 3"
FT REPEAT 375..407
FT /note="HAT 5"
FT REPEAT 1122..1154
FT /note="HAT 6"
FT REGION 87..106
FT /note="Interaction with CETN2"
FT /evidence="ECO:0000250"
FT REGION 451..470
FT /note="Interaction with CETN2"
FT /evidence="ECO:0000250"
FT REGION 617..636
FT /note="Interaction with CETN2"
FT /evidence="ECO:0000250"
FT REGION 940..967
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1003..1029
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1066..1085
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..7
FT /note="MEKKIGS -> MTAEVNGSTSGNH (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_033709"
FT VAR_SEQ 115..1216
FT /note="Missing (in isoform 4)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_033710"
FT VAR_SEQ 197..425
FT /note="CFWWSKWRWRLGQAHAEHALHAVAVKHRALSLQLQGWLRWQEQLLISQRDRR
FT KEATAVQHYQHWQKQRSLKAWLKYLQICRVKRWQNEMAVQFHRATVLQIHFCDWQWAWE
FT WRQSLSAHQALVVKLAGRMVLRRAFTHWKHYMLLQAEEAAQREAAAEHRQHYLLYSCFR
FT AFKDNVTQARLQQTRKKLAQQLRDTTLLHRFWNLWQSRIEQREERVQTPSLHAALSHYR
FT -> W (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_033711"
FT VAR_SEQ 362..366
FT /note="YSCFR -> VRAWS (in isoform 5)"
FT /evidence="ECO:0000303|PubMed:14621295"
FT /id="VSP_033712"
FT VAR_SEQ 367..1125
FT /note="Missing (in isoform 5)"
FT /evidence="ECO:0000303|PubMed:14621295"
FT /id="VSP_033713"
FT VAR_SEQ 604..635
FT /note="Missing (in isoform 2 and isoform 3)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_033714"
FT VAR_SEQ 978..1216
FT /note="Missing (in isoform 3)"
FT /evidence="ECO:0000303|PubMed:15489334"
FT /id="VSP_033715"
FT VAR_SEQ 1182
FT /note="E -> EVRPGQPRASPWLSFLSACLVPPSRPCPQ (in isoform 5)"
FT /evidence="ECO:0000303|PubMed:14621295"
FT /id="VSP_033716"
FT CONFLICT 23
FT /note="T -> A (in Ref. 4; AAI06125)"
FT /evidence="ECO:0000305"
FT CONFLICT 133
FT /note="I -> S (in Ref. 1; BAC97969)"
FT /evidence="ECO:0000305"
FT CONFLICT 134
FT /note="F -> L (in Ref. 1; BAC97969)"
FT /evidence="ECO:0000305"
FT CONFLICT 759
FT /note="H -> R (in Ref. 4; AAH26390)"
FT /evidence="ECO:0000305"
FT CONFLICT 779
FT /note="R -> I (in Ref. 2; BAB31543)"
FT /evidence="ECO:0000305"
FT CONFLICT 1098
FT /note="Q -> H (in Ref. 4; AAH26390)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1216 AA; 144033 MW; 304079976848CD01 CRC64;
MEKKIGSRSF RDGVVKKPCS PKTLPLKKSS AFSGIQREPS RSCHSIYYHA SQNWTRYRLQ
ELRIRCVARK FLYLWIRVTF GRVTPSRARI FHEQKILQKV FGEWREEWWV SQREWKLCVR
ADCHYRYYLY NLIFQNWKTF VHQQREMRKR FRIAEHHDTK QKMCQAWKSW LIYMVSRRTK
LHMKTTALEF RRQSVLCFWW SKWRWRLGQA HAEHALHAVA VKHRALSLQL QGWLRWQEQL
LISQRDRRKE ATAVQHYQHW QKQRSLKAWL KYLQICRVKR WQNEMAVQFH RATVLQIHFC
DWQWAWEWRQ SLSAHQALVV KLAGRMVLRR AFTHWKHYML LQAEEAAQRE AAAEHRQHYL
LYSCFRAFKD NVTQARLQQT RKKLAQQLRD TTLLHRFWNL WQSRIEQREE RVQTPSLHAA
LSHYRVTVLH KCVRVWLRYV HKRQWQQLLR ARADGHFQQR ALPAAFYTWY RGWLWHQQRR
ILHTKAVRFH RGTLEKQVFA LWRQKMSQHR ENCLAERMAI LQAEQQLLRR FWFVWHQQAA
VCQLERQQQA MAIAHHHSGL LRRAFCIWKE STQGFRIERM GRAQAAHFHS AQLLSRAWSM
WRECLALRLE EQQKLKCAAL HSQCILLRRA LQKWLVYQNR VRSVLREVAA RERQHNRQLL
WWALHLWREN TMARLDGAKK TSQARVHYSR TLCSKVLVQW REVTSVQIYY RQKEAAALRE
ARKALDRGRL QNWFQHWRFC SQRAAQQRFQ LGQAAQHHHW QLLMEAMARW KAHHLGCIRK
KFLQRQAAQL LAQRLSRACF CQWRKQLAVR KQEQWGTARA LWLWAFSLQA KVWTAWLGFV
LERRRKKARL ERAMQAYQQQ LLQEGATRLL RFTAGTKAFR QQLQAQQQVQ AAHSLHCAVR
HCAELWKKKV LGPGKTSQPP APTTFSKRVT FKDSFLSGHA AEAGDATQET KKLRAPPSQG
VLGSLAGAAG EPCHLDLNAA RSSRKQPRRP SFLLERLGSQ RSPEWYSLGE QQLEKPPEEE
STALLGGSSL TRPFLPGVLP NVPGPKLPPT ASPGLELLPP SSIMPHAAGG TARVSAKPSI
PGPQPWGCPS LPRDLDPQLL PGDSISTRTE PVYGSEATGH TELEAELEGI QQQLQHYQTT
KQNLWSCQRQ ANSLRRWLEL SQEEPKSEDL HLEEQVKTEL EEVELQVQQL AKELEAQRQP
VGTCIARVRA LRRALC