ROA1_CAEEL
ID ROA1_CAEEL Reviewed; 346 AA.
AC Q22037; Q336L3; Q336L4; Q95X69;
DT 25-OCT-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1996, sequence version 1.
DT 03-AUG-2022, entry version 146.
DE RecName: Full=Heterogeneous nuclear ribonucleoprotein A1;
DE Short=hnRNP A1;
GN Name=hrp-1 {ECO:0000312|WormBase:F42A6.7a};
GN Synonyms=rbp-1 {ECO:0000312|EMBL:BAA01645.1}; ORFNames=F42A6.7;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1] {ECO:0000305, ECO:0000312|EMBL:BAA01645.1}
RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A).
RC STRAIN=Bristol N2 {ECO:0000312|EMBL:BAA01645.1};
RX PubMed=1354852; DOI=10.1093/nar/20.15.4001;
RA Iwasaki M., Okumura K., Kondo Y., Igarashi H., Tanaka T.;
RT "cDNA cloning of a novel heterogeneous nuclear ribonucleoprotein gene
RT homologue in Caenorhabditis elegans using hamster prion protein cDNA as a
RT hybridisation probe.";
RL Nucleic Acids Res. 20:4001-4007(1992).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [3] {ECO:0000305}
RP PROTEIN SEQUENCE OF 8-61; 65-91; 116-136; 143-149; 156-175; 189-197 AND
RP 217-235 (ISOFORM A), AND IDENTIFICATION BY MASS SPECTROMETRY.
RA Bienvenut W.V.;
RL Submitted (SEP-2005) to UniProtKB.
RN [4]
RP FUNCTION, AND TELOMERE-BINDING.
RX PubMed=15122256; DOI=10.1038/ng1356;
RA Joeng K.S., Song E.J., Lee K.-J., Lee J.;
RT "Long lifespan in worms with long telomeric DNA.";
RL Nat. Genet. 36:607-611(2004).
CC -!- FUNCTION: This protein is a component of ribonucleosomes.
CC Overexpression gradually increases telomere length, leading to increase
CC lifespan. {ECO:0000269|PubMed:15122256}.
CC -!- SUBCELLULAR LOCATION: Nucleus. Chromosome, telomere. Note=Binds to
CC telomeres.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=a {ECO:0000269|PubMed:1354852};
CC IsoId=Q22037-1; Sequence=Displayed;
CC Name=b {ECO:0000303|PubMed:9851916};
CC IsoId=Q22037-2; Sequence=VSP_051843, VSP_051844;
CC Name=c;
CC IsoId=Q22037-3; Sequence=VSP_051843;
CC Name=d;
CC IsoId=Q22037-4; Sequence=VSP_051844;
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; D10877; BAA01645.1; -; mRNA.
DR EMBL; FO081370; CCD71134.1; -; Genomic_DNA.
DR EMBL; FO081370; CCD71135.1; -; Genomic_DNA.
DR EMBL; FO081370; CCD71136.1; -; Genomic_DNA.
DR EMBL; FO081370; CCD71137.1; -; Genomic_DNA.
DR PIR; S35500; S35500.
DR RefSeq; NP_001023199.1; NM_001028028.3.
DR RefSeq; NP_001040944.1; NM_001047479.3.
DR RefSeq; NP_001040945.1; NM_001047480.3. [Q22037-4]
DR RefSeq; NP_500326.2; NM_067925.8.
DR AlphaFoldDB; Q22037; -.
DR SMR; Q22037; -.
DR BioGRID; 42241; 2.
DR IntAct; Q22037; 1.
DR STRING; 6239.F42A6.7d.1; -.
DR iPTMnet; Q22037; -.
DR EPD; Q22037; -.
DR PaxDb; Q22037; -.
DR PeptideAtlas; Q22037; -.
DR EnsemblMetazoa; F42A6.7a.1; F42A6.7a.1; WBGene00001999. [Q22037-1]
DR EnsemblMetazoa; F42A6.7a.2; F42A6.7a.2; WBGene00001999. [Q22037-1]
DR EnsemblMetazoa; F42A6.7b.1; F42A6.7b.1; WBGene00001999. [Q22037-2]
DR EnsemblMetazoa; F42A6.7b.2; F42A6.7b.2; WBGene00001999. [Q22037-2]
DR EnsemblMetazoa; F42A6.7b.3; F42A6.7b.3; WBGene00001999. [Q22037-2]
DR EnsemblMetazoa; F42A6.7b.4; F42A6.7b.4; WBGene00001999. [Q22037-2]
DR EnsemblMetazoa; F42A6.7c.1; F42A6.7c.1; WBGene00001999. [Q22037-3]
DR EnsemblMetazoa; F42A6.7c.2; F42A6.7c.2; WBGene00001999. [Q22037-3]
DR EnsemblMetazoa; F42A6.7d.1; F42A6.7d.1; WBGene00001999. [Q22037-4]
DR EnsemblMetazoa; F42A6.7d.2; F42A6.7d.2; WBGene00001999. [Q22037-4]
DR GeneID; 177101; -.
DR UCSC; F42A6.7a.1; c. elegans.
DR CTD; 177101; -.
DR WormBase; F42A6.7a; CE17059; WBGene00001999; hrp-1. [Q22037-1]
DR WormBase; F42A6.7b; CE31510; WBGene00001999; hrp-1. [Q22037-2]
DR WormBase; F42A6.7c; CE29317; WBGene00001999; hrp-1. [Q22037-3]
DR WormBase; F42A6.7d; CE39251; WBGene00001999; hrp-1. [Q22037-4]
DR eggNOG; KOG0118; Eukaryota.
DR GeneTree; ENSGT00940000156757; -.
DR InParanoid; Q22037; -.
DR OMA; WGNNRQN; -.
DR OrthoDB; 1202220at2759; -.
DR PhylomeDB; Q22037; -.
DR PRO; PR:Q22037; -.
DR Proteomes; UP000001940; Chromosome IV.
DR Bgee; WBGene00001999; Expressed in embryo and 4 other tissues.
DR GO; GO:0000781; C:chromosome, telomeric region; IDA:WormBase.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:1990904; C:ribonucleoprotein complex; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; ISS:WormBase.
DR GO; GO:0031581; P:hemidesmosome assembly; IGI:WormBase.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; ISS:WormBase.
DR Gene3D; 3.30.70.330; -; 2.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR Pfam; PF00076; RRM_1; 2.
DR SMART; SM00360; RRM; 2.
DR SUPFAM; SSF54928; SSF54928; 2.
DR PROSITE; PS50102; RRM; 2.
PE 1: Evidence at protein level;
KW Alternative splicing; Chromosome; Direct protein sequencing; Nucleus;
KW Reference proteome; Repeat; Ribonucleoprotein; RNA-binding; Telomere.
FT CHAIN 1..346
FT /note="Heterogeneous nuclear ribonucleoprotein A1"
FT /id="PRO_0000081833"
FT DOMAIN 23..123
FT /note="RRM 1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT DOMAIN 114..191
FT /note="RRM 2"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00176"
FT REGION 92..111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 189..346
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 92..110
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 189..212
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 269..284
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..307
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 323..346
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VAR_SEQ 1..38
FT /note="Missing (in isoform b and isoform c)"
FT /evidence="ECO:0000305"
FT /id="VSP_051843"
FT VAR_SEQ 255
FT /note="D -> GN (in isoform b and isoform d)"
FT /evidence="ECO:0000305"
FT /id="VSP_051844"
SQ SEQUENCE 346 AA; 36343 MW; 48B95818D8BB9A54 CRC64;
MTDVEIKAEN GSGDASLEPE NLRKIFVGGL TSNTTDDLMR EFYSQFGEIT DIIVMRDPTT
KRSRGFGFVT FSGKTEVDAA MKQRPHIIDG KTVDPKRAVP RDDKNRSESN VSTKRLYVSG
VREDHTEDML TEYFTKYGTV TKSEIILDKA TQKPRGFGFV TFDDHDSVDQ CVLQKSHMVN
GHRCDVRKGL SKDEMSKAQM NRDRETRGGR SRDGQRGGYN GGGGGGGGWG GPAQRGGPGA
YGGPGGGGQG GYGGDYGGGW GQQGGGGQGG WGGPQQQQGG GGWGQQGGGG QGGWGGPQQQ
QQGGWGGPQQ GGGGGGWGGQ GQQQGGWGGQ SGAQQWAHAQ GGNRNY