TOPZ1_MACFA
ID TOPZ1_MACFA Reviewed; 1687 AA.
AC G7NY55; Q95JY4;
DT 21-MAR-2012, integrated into UniProtKB/Swiss-Prot.
DT 25-JAN-2012, sequence version 1.
DT 25-MAY-2022, entry version 25.
DE RecName: Full=Protein TOPAZ1 {ECO:0000250|UniProtKB:E5FYH1};
DE AltName: Full=Testis- and ovary-specific PAZ domain-containing protein 1 {ECO:0000250|UniProtKB:E5FYH1};
GN Name=TOPAZ1; ORFNames=QtsA-12362;
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-696.
RC TISSUE=Testis;
RX PubMed=12498619; DOI=10.1186/1471-2164-3-36;
RA Osada N., Hida M., Kusuda J., Tanuma R., Hirata M., Suto Y., Hirai M.,
RA Terao K., Sugano S., Hashimoto K.;
RT "Cynomolgus monkey testicular cDNAs for discovery of novel human genes in
RT the human genome sequence.";
RL BMC Genomics 3:36-36(2002).
CC -!- FUNCTION: Important for normal spermatogenesis and male fertility.
CC Specifically required for progression to the post-meiotic stages of
CC spermatocyte development. Seems to be necessary for normal expression
CC levels of a number of testis-expressed gene transcripts, although its
CC role in this process is unclear. {ECO:0000250|UniProtKB:E5FYH1}.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000250|UniProtKB:E5FYH1}.
CC -!- SEQUENCE CAUTION:
CC Sequence=BAB62988.1; Type=Frameshift; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001277; EHH51375.1; -; Genomic_DNA.
DR EMBL; AB070043; BAB62988.1; ALT_FRAME; mRNA.
DR AlphaFoldDB; G7NY55; -.
DR STRING; 9541.XP_005546877.1; -.
DR eggNOG; ENOG502QPIV; Eukaryota.
DR Proteomes; UP000009130; Chromosome 2.
DR Proteomes; UP000233100; Unplaced.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007283; P:spermatogenesis; IEA:UniProtKB-KW.
DR InterPro; IPR038952; TOPAZ1.
DR InterPro; IPR029435; TOPAZ1_dom.
DR PANTHER; PTHR35671; PTHR35671; 1.
DR Pfam; PF14669; Asp_Glu_race_2; 1.
PE 2: Evidence at transcript level;
KW Cytoplasm; Differentiation; Reference proteome; Spermatogenesis.
FT CHAIN 1..1687
FT /note="Protein TOPAZ1"
FT /id="PRO_0000416059"
FT REGION 1..131
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..339
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 596..632
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 880..916
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..62
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 78..123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 319..337
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 598..620
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 880..901
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 902..916
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT CONFLICT 105
FT /note="F -> L (in Ref. 2; BAB62988)"
FT /evidence="ECO:0000305"
FT CONFLICT 219
FT /note="F -> S (in Ref. 2; BAB62988)"
FT /evidence="ECO:0000305"
FT CONFLICT 285
FT /note="M -> I (in Ref. 2; BAB62988)"
FT /evidence="ECO:0000305"
FT CONFLICT 694..696
FT /note="RKE -> KKK (in Ref. 2; BAB62988)"
FT /evidence="ECO:0000305"
SQ SEQUENCE 1687 AA; 190437 MW; 8FF3E8673A8E86C8 CRC64;
MRRPPPLGPT TASGPEGNVR NLRKRQAPGP GAAGGCGPEA GGRGENRQKR RMVARATPGR
GEVKSDKSVA ASGAGKAARR RVEGRRGQVS PSDRRGLEAA KEAEFPLQTE RHTKEKRKVT
EASSDDPQPG FDLVRKESLT SSESFQTVEC LRSLGKEGIV EGIKRRIRNK KLKSLENPPL
KITENEATQN IKVEFQDELY KNTLKYSCNI LSPEVENNFV FKLRDCNCFP HSKDCNDENN
LPYEPDGGCM HVAENFSKKE NFRSLAEKSD TNNIPQLLQT EENVMGVNKL LPEESDLYQS
KINGLLPCLQ REKNKYSIEE SSVGRKPRKR MKLSEKADET VTQMNFSNEY NKSELMLQEN
QMIADGKEAE AKSPLNVLRK VSHNTVSLMD HLLSVPEMVE KETSSEHHVN AVFQKTIEPL
LKEETENASE PLGYENMALK EDFKSKSCIG KSPEYHIERR SSREDLRSDS EELKLSCQRT
IPMTGKRTWP YYSCARISAW CWKKASLPES SYFLPGSQKS CKKVDVPKHQ TNKTHLTDSK
LLLQSSLTET NTESSSKEKL DSNLNCLFSV SAVEHTLMVI KEPIIKDDKK IKSEELSRSG
SEVISNTTED TQLTSDTQSL TGNKKRDRGN LTKLNLTAAS KDGQEANNST GKTIHRKACV
AKQTFVVPDL VKILNTGRLT NFKIPLLKNK TKKRKEVNAK SSEREGYSPL ELLDNLSGAD
TRQNRSKENV SMTMLGPQTL SIQNSVTPVQ ASSDSFYNKN SCSISPSFTK HGNSSKPSNH
FSEPGNIVSN KEVASLTVEN NAFSCDPGYV EKSPSFCCNK QETFRPVSSE VRGRKITKNF
SEVGFPDILK AYEDDVLLID VIQDDPDLFG VSNEGELSFT SEVPRISQEP NVPGEHQSTD
SKYVETPVKK EPSDDLRELP VLDCGPIKPD ICASNSAASE IKHDPKDANT SLGEVANETS
ENETLGDFSE QIKGSDLDEK HRFTDKVITK EEKENIYEVR KSKDSRNADI MVGECQFAAP
VPKPLCLLVP PLNLSGHQED TILNTWMNDF RFLGKHSVLK LQNPETCEIF KREKNVGVFQ
KSLGLMIPYK YCKFHFNTLR GCERPLCKFA HVPEQGDEKV CMDVFKKYIN INELCLLQRA
VNVFMEYYRK FPPGIYFDLQ VLNDLLNSLL KHCLLKEVFQ IVNLSIMVKM LPSLKILLNI
FEHVATMKLR NAVPALIDIF CKLVEAGMVL DPEHFNYIVK LLYQVQASKQ EITAVLEMKS
RLQMRQFKKN WKCDLDSALN KLEHCKEKGD WTKLGKLYIN VKMGCEKFAD FQTFCACIAE
TLTKNCEDER PDTPFCEFAE TVSKDPQNSK VDKGVLGRIG ISAMYFYHKL LQWSKGRKVL
DKLYELKIHF ASLKGLIGPE KLASRCQIVN VAAEIFLKSG SLDGALWVMR ESEWIIDTPL
WPCDRLDVLN RHNLLCTIAH ETLAKSLYRQ TFEVLQNLPG FQNSQETVEV SQYSLLFNKL
LGSCIESNSL GMSSSVAEFM ISKSIPIDFS FLRRLITSLG RSRLWLKARA HYKSALSLGC
YPPLEGNLYR KLLLIPSYLS EIEMLLAIEI FMVSNASSIQ SPGTSTQILQ IVLKRCEDNQ
SRSNDDYQAA VERLIMAARI SDPKLFVKHM TVNVNKEQVY SLEHCSALKW LKENMKWAGK
VWLFSNH