IORF_CVHOC
ID IORF_CVHOC Reviewed; 207 AA.
AC Q4VID0; Q6TNF4;
DT 17-APR-2007, integrated into UniProtKB/Swiss-Prot.
DT 05-JUL-2005, sequence version 1.
DT 03-AUG-2022, entry version 52.
DE RecName: Full=Protein I;
DE AltName: Full=Accessory protein N2;
DE AltName: Full=N internal ORF protein;
DE Short=IORF;
DE AltName: Full=Protein in nucleocapsid ORF;
GN Name=N; Synonyms=I; ORFNames=7b;
OS Human coronavirus OC43 (HCoV-OC43).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Betacoronavirus; Embecovirus.
OX NCBI_TaxID=31631;
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RC STRAIN=Isolate 19572 Belgium 2004;
RX PubMed=15914223; DOI=10.1016/j.virol.2005.04.010;
RA Vijgen L., Keyaerts E., Lemey P., Moes E., Li S., Vandamme A.M.,
RA Van Ranst M.;
RT "Circulation of genetically distinct contemporary human coronavirus OC43
RT strains.";
RL Virology 337:85-92(2005).
RN [2]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RC STRAIN=Isolate ATCC VR-759, and Isolate clinical OC43-Paris;
RX PubMed=15280490; DOI=10.1128/jvi.78.16.8824-8834.2004;
RA St Jean J.R., Jacomy H., Desforges M., Vabret A., Freymuth F., Talbot P.J.;
RT "Human respiratory coronavirus OC43: genetic stability and neuroinvasion.";
RL J. Virol. 78:8824-8834(2004).
RN [3]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RC STRAIN=Isolate ATCC VR-759;
RX PubMed=15650185; DOI=10.1128/jvi.79.3.1595-1604.2005;
RA Vijgen L., Keyaerts E., Moes E., Thoelen I., Wollants E., Lemey P.,
RA Vandamme A.M., Van Ranst M.;
RT "Complete genomic sequence of human coronavirus OC43: molecular clock
RT analysis suggests a relatively recent zoonotic coronavirus transmission
RT event.";
RL J. Virol. 79:1595-1604(2005).
CC -!- FUNCTION: Structural protein that is not essential for viral
CC replication, either in tissue culture or in its natural host.
CC {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000250}.
CC -!- MISCELLANEOUS: The gene encoding this protein is included within the N
CC gene (alternative ORF).
CC -!- SIMILARITY: Belongs to the coronavirus I protein family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY903460; AAX85683.1; -; Genomic_RNA.
DR EMBL; AY585228; -; NOT_ANNOTATED_CDS; Genomic_RNA.
DR EMBL; AY585229; -; NOT_ANNOTATED_CDS; Genomic_RNA.
DR EMBL; AY391777; AAR01020.1; ALT_SEQ; Genomic_RNA.
DR Proteomes; UP000007552; Genome.
DR Proteomes; UP000100580; Genome.
DR Proteomes; UP000159995; Genome.
DR Proteomes; UP000180344; Genome.
DR CDD; cd21662; embe-CoV_Protein-I_like; 1.
DR InterPro; IPR004876; Corona_nucI.
DR InterPro; IPR044311; N2-like_embe-CoV.
DR Pfam; PF03187; Corona_I; 1.
PE 3: Inferred from homology;
KW Virion.
FT CHAIN 1..207
FT /note="Protein I"
FT /id="PRO_0000284099"
FT VARIANT 1..92
FT /note="Missing (in strain: Isolate ATCC VR-759)"
FT VARIANT 30
FT /note="L -> P (in strain: Isolate clinical OC43-Paris)"
FT VARIANT 194
FT /note="A -> V (in strain: Isolate ATCC VR-759 and Isolate
FT clinical OC43-Paris)"
FT VARIANT 204
FT /note="P -> L (in strain: Isolate ATCC VR-759 and Isolate
FT clinical OC43-Paris)"
SQ SEQUENCE 207 AA; 22895 MW; 93F17A9D17364769 CRC64;
MASSSGPISP TSLEMFKPGV EELNPSKLLL LSNHQEGMLY PTILGSLELL SFKRERSLSL
QKDKVCLLHQ ESQLLKLRGT GTDTTDVLLK QPMATSVNCC HDGIFTIWEQ DRMLKTSTAP
ILTESTGSLA TRLMSIPRLT LSIGTQVAMR LFRLGFRLAR YSLRVTILKA QEGLLLIPDL
LRAHPAEPLV QDRAVEPILA IEPPPLV