ID F214A_DANRE Reviewed; 989 AA.
AC Q1LV22; G1K2M7; Q08CB9;
DT 15-JAN-2008, integrated into UniProtKB/Swiss-Prot.
DT 21-MAR-2012, sequence version 2.
DT 03-AUG-2022, entry version 64.
DE RecName: Full=Protein FAM214A;
GN Name=fam214a; ORFNames=si:dkey-266j9.3;
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tuebingen;
RX PubMed=23594743; DOI=10.1038/nature12111;
RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA Clee C., Oliver K., Clark R., Riddle C., Elliot D., Threadgold G.,
RA Harden G., Ware D., Begum S., Mortimore B., Kerry G., Heath P.,
RA Phillimore B., Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S.,
RA Pelan S., Griffiths G., Smith M., Glithero R., Howden P., Barker N.,
RA Lloyd C., Stevens C., Harley J., Holt K., Panagiotidis G., Lovell J.,
RA Beasley H., Henderson C., Gordon D., Auger K., Wright D., Collins J.,
RA Raisen C., Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D.,
RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S.,
RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E.,
RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., Babbage A.,
RA Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., Wray P.,
RA Ellington A., Matthews N., Ellwood M., Woodmansey R., Clark G., Cooper J.,
RA Tromans A., Grafham D., Skuce C., Pandian R., Andrews R., Harrison E.,
RA Kimberley A., Garnett J., Fosker N., Hall R., Garner P., Kelly D., Bird C.,
RA Palmer S., Gehring I., Berger A., Dooley C.M., Ersan-Urun Z., Eser C.,
RA Geiger H., Geisler M., Karotki L., Kirn A., Konantz J., Konantz M.,
RA Oberlander M., Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G.,
RA Osoegawa K., Zhu B., Rapp A., Widaa S., Langford C., Yang F.,
RA Schuster S.C., Carter N.P., Harrow J., Ning Z., Herrero J., Searle S.M.,
RA Enright A., Geisler R., Plasterk R.H., Lee C., Westerfield M.,
RA de Jong P.J., Zon L.I., Postlethwait J.H., Nusslein-Volhard C.,
RA Hubbard T.J., Roest Crollius H., Rogers J., Stemple D.L.;
RT "The zebrafish reference genome sequence and its relationship to the human
RT genome.";
RL Nature 496:498-503(2013).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-687.
RC STRAIN=AB; TISSUE=Skin;
RG NIH - Zebrafish Gene Collection (ZGC) project;
RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q1LV22-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q1LV22-2; Sequence=VSP_042436;
CC -!- SIMILARITY: Belongs to the FAM214 family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAI24302.1; Type=Miscellaneous discrepancy; Note=Contaminating sequence. Potential poly-A sequence.; Evidence={ECO:0000305};
CC Sequence=CAK04512.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC Sequence=CAM56662.1; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX908748; CAK04512.1; ALT_INIT; Genomic_DNA.
DR EMBL; BX936456; CAK04512.1; JOINED; Genomic_DNA.
DR EMBL; BX936456; CAM56662.1; ALT_INIT; Genomic_DNA.
DR EMBL; BX908748; CAM56662.1; JOINED; Genomic_DNA.
DR EMBL; BC124301; AAI24302.1; ALT_SEQ; mRNA.
DR AlphaFoldDB; Q1LV22; -.
DR STRING; 7955.ENSDARP00000059207; -.
DR PaxDb; Q1LV22; -.
DR ZFIN; ZDB-GENE-050419-204; fam214a.
DR eggNOG; KOG2306; Eukaryota.
DR InParanoid; Q1LV22; -.
DR PhylomeDB; Q1LV22; -.
DR PRO; PR:Q1LV22; -.
DR Proteomes; UP000000437; Genome assembly.
DR Proteomes; UP000814640; Unplaced.
DR InterPro; IPR025261; DUF4210.
DR InterPro; IPR033473; FAM214/SPAC3H8.04_C.
DR Pfam; PF13889; Chromosome_seg; 1.
DR Pfam; PF13915; DUF4210; 1.
DR SMART; SM01177; DUF4210; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Reference proteome.
FT CHAIN 1..989
FT /note="Protein FAM214A"
FT /id="PRO_0000315616"
FT REGION 244..295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 393..477
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 525..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 656..686
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..277
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 278..293
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..455
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 457..474
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 525..560
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 571..595
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 596..639
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 656..676
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 702
FT /note="T -> TQRCSPTPLK (in isoform 2)"
FT /evidence="ECO:0000305"
FT /id="VSP_042436"
FT CONFLICT 57
FT /note="T -> N (in Ref. 2; AAI24302)"
FT /evidence="ECO:0000305"
FT CONFLICT 352
FT /note="S -> R (in Ref. 2; AAI24302)"
FT /evidence="ECO:0000305"
FT CONFLICT 424
FT /note="S -> T (in Ref. 2; AAI24302)"
FT /evidence="ECO:0000305"
FT CONFLICT 427
FT /note="S -> P (in Ref. 2; AAI24302)"
FT /evidence="ECO:0000305"
FT CONFLICT 551
FT /note="N -> D (in Ref. 2; AAI24302)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 989 AA; 109985 MW; 3A10762A022B052A CRC64;
MKPDRDAAEE FFEYDAEEFL VFLTLLITEG RTPECSVKGR TEGVHCPPAQ SAMPVLTKHE
CSDKIPQCRQ ARRTRSEVML LWRNHIPIMI EVMLLPDCCY SDEGPTTDCT DLNDPAIKQD
ALLLERWTLQ PVPRQSGDRF IEEKTLLLAV RSYVFFSQLS AWLSASHGIV PRNILYRISA
ADEELIWNFS QTPSEHAFPV PNVSHSVALK VRVQSLPRQP RYPVLKCSIH SGLAFLGKRA
LEHGEGGNQA GDNRSSLRLP RSPLFSRSLH PSPPSHSPLN TRKCPPRPES PLPPGKAVKW
LYSRLNGGID TPPSEPYSLC TNGAESPKTS RTESPIRGFK SLSITDPLVI PSPSSISGET
NPLIGSLLQE RQEVIARIAQ RLNFCDPTAP HLPDALFTSQ EPPGHKTTWN STQDKECLKK
SKDSLFSVPH PQNHNGNSLE IPERSRSSLF DTPLSPRTRT RLDRVDRESK TSPKPATCRR
LVLSDQSAEG SLIADAVQDI SRLIQERLQH SYSLLNGTYK LKTSQNEQVG SNHGAQTNGF
VSLSSHKKPT NAPNGEESTD PHISHATKCC RSPDSSRRKP DCSPRPLKVA SLKLEDHSVT
KSQPLTASNH QHYVSRESWT SLKNNSSHAS SPQENGLTQI GYHQPFKNRV AISEKEAEKH
VRDGSTCLEK DENQEPHSSL SSTPANLTCN ISSLAPTESN QTSCGNWKKQ TRHSIDGTAT
KAFHPCTGLP LLSSPVPQRK SQTGYFDLDT SLIHCRGVPW AANRRVLKRS QDYDESQHQI
LSASAPPANL SLLGNFEECV LNYRLEPLGT VEGFTAEVGA SGTFCPSHMT LPVDVSFYSV
SDDNAPSPYM GVINLESLGK RGYRVPPSGT IQVTLFNPNK TVVKMFVVMY DLRDMPASHQ
TFLRQRTFSV PVKREFNGQS NKKTSLGQGR TLRYLVHLRF QSSKSGKIYL HRDIRLLFSR
KSMEVDSGAA YELKSFTELP ADPPFSPRC