CAGE1_MACFA
ID CAGE1_MACFA Reviewed; 840 AA.
AC Q95JR0; Q4R6M9; Q95JY2;
DT 20-MAR-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-DEC-2001, sequence version 1.
DT 25-MAY-2022, entry version 45.
DE RecName: Full=Cancer-associated gene 1 protein homolog;
DE Short=CAGE-1;
GN Name=CAGE1; ORFNames=QtsA-12423, QtsA-14351, QtsA-17576;
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC TISSUE=Testis;
RX PubMed=12498619; DOI=10.1186/1471-2164-3-36;
RA Osada N., Hida M., Kusuda J., Tanuma R., Hirata M., Suto Y., Hirai M.,
RA Terao K., Sugano S., Hashimoto K.;
RT "Cynomolgus monkey testicular cDNAs for discovery of novel human genes in
RT the human genome sequence.";
RL BMC Genomics 3:36-36(2002).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC TISSUE=Testis;
RG International consortium for macaque cDNA sequencing and analysis;
RT "DNA sequences of macaque genes expressed in brain or testis and its
RT evolutionary implications.";
RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 273-840 (ISOFORM 2).
RC TISSUE=Testis;
RA Hashimoto K., Osada N., Hida M., Kusuda J., Tanuma R., Hirai M., Terao K.,
RA Sugano S.;
RT "Isolation of novel full-length cDNA clones from macaque testis cDNA
RT libraries.";
RL Submitted (AUG-2001) to the EMBL/GenBank/DDBJ databases.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=Q95JR0-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q95JR0-2; Sequence=VSP_023905;
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB62990.1; Type=Erroneous initiation; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB070119; BAB63064.1; -; mRNA.
DR EMBL; AB169153; BAE01246.1; -; mRNA.
DR EMBL; AB070045; BAB62990.1; ALT_INIT; mRNA.
DR AlphaFoldDB; Q95JR0; -.
DR SMR; Q95JR0; -.
DR STRING; 9541.XP_005554154.1; -.
DR eggNOG; KOG3650; Eukaryota.
DR Proteomes; UP000233100; Unplaced.
DR InterPro; IPR029381; CAGE1.
DR Pfam; PF15066; CAGE1; 2.
PE 2: Evidence at transcript level;
KW Alternative splicing; Coiled coil; Reference proteome.
FT CHAIN 1..840
FT /note="Cancer-associated gene 1 protein homolog"
FT /id="PRO_0000280751"
FT REGION 800..840
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 303..559
FT /evidence="ECO:0000255"
FT COMPBIAS 800..819
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 648..669
FT /note="Missing (in isoform 2)"
FT /evidence="ECO:0000303|Ref.2, ECO:0000303|Ref.3"
FT /id="VSP_023905"
FT CONFLICT 3
FT /note="K -> R (in Ref. 2; BAE01246)"
FT /evidence="ECO:0000305"
FT CONFLICT 315
FT /note="T -> I (in Ref. 2; BAE01246)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 840 AA; 97587 MW; A875627BE3D4D716 CRC64;
MNKDYQIFWP SPSDPVRFEV DTSHEKVESI SESDTMNVSN LSQGIMLSDS PICMETTSTT
CDLPQNEIKN FERENEYEST LCEDAYGTLD NLLNDNNIEN YSKNVLTQPV DTISISSLRQ
FDTVCKFHCV EAFDDEMTEK PEFQSQVYNY AKDNNIKQDS FREENPMETS VSASTDQLGN
EYFRQPPPRS PPLIHCSGET LKFPEKSLAK STAKESALNP SQPPSFVCKT AVPSKEIQNY
GEIPEMSVSY AKEVTAEGVE RPEIVSTWSS AGISWRSKAS QENCEMPDME QSAESLQPVQ
EDMALNEILK KLKHTNRKQE ARIQELQCSN LYLEKRVKEL QMKTTKQQVF IDVIDKLKEN
VEELIEEKYK IILEKNDTKK TLQNLQEILA NTQKHLQESR NDKEMLQLQF KKIKANYVRL
QERYMTEMQQ KNKSVSQYLE MDKTLSKKEE EVKRLQQLRK EQEKVTASAL DLLKREKETQ
EQEFLSLQEE FQKRDKANLE ERQKLKSRLE KLLTQVKNLQ FMSENERAKN IKLQQQINEV
KNENKKLKQH VARSEEQNYV PKSETAQLKE QLEEVMKSDI TKDTKMTHSN LLLDCSPCEE
ESLNPADIER SSQLASKMHS LLALMVGLLK CQDITNSDAE HFKESEKVSD IMLQRLKSLH
LKKKNLDKEL LKHKDRITTF RDLIAKEKAF QDHAIKVTDC DSDEAKSIRD VPTFLGAKLD
KYHSLNEELD FLITKLGCLL ESKESHCNRL IEENDKYQRH LGSLIKKVTS YEEIIECADQ
RLAISHSQIA HLEKRNKHLE DLIRKPREKA RKPRSKSLEN HPKSMTMMPA VFKENRNDLD