IORF_CVP67
ID IORF_CVP67 Reviewed; 207 AA.
AC Q8BB22;
DT 17-APR-2007, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-AUG-2022, entry version 46.
DE RecName: Full=Protein I;
DE AltName: Full=Accessory protein N2;
DE AltName: Full=N internal ORF protein;
DE Short=IORF;
DE AltName: Full=Protein in nucleocapsid ORF;
GN Name=N; Synonyms=I;
OS Porcine hemagglutinating encephalomyelitis virus (strain 67N) (HEV-67N).
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Betacoronavirus; Embecovirus.
OX NCBI_TaxID=230237;
OH NCBI_TaxID=9823; Sus scrofa (Pig).
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC RNA].
RX PubMed=12237422; DOI=10.1099/0022-1317-83-10-2411;
RA Sasseville A.M.-J., Boutin M., Gelinas A.-M., Dea S.;
RT "Sequence of the 3'-terminal end (8.1 kb) of the genome of porcine
RT haemagglutinating encephalomyelitis virus: comparison with other
RT haemagglutinating coronaviruses.";
RL J. Gen. Virol. 83:2411-2416(2002).
CC -!- FUNCTION: Structural protein that is not essential for viral
CC replication, either in tissue culture or in its natural host.
CC {ECO:0000250}.
CC -!- SUBCELLULAR LOCATION: Virion {ECO:0000250}.
CC -!- MISCELLANEOUS: The gene encoding this protein is included within the N
CC gene (alternative ORF).
CC -!- SIMILARITY: Belongs to the coronavirus I protein family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AY078417; AAL80037.1; -; Genomic_RNA.
DR Proteomes; UP000007546; Genome.
DR CDD; cd21662; embe-CoV_Protein-I_like; 1.
DR InterPro; IPR004876; Corona_nucI.
DR InterPro; IPR044311; N2-like_embe-CoV.
DR Pfam; PF03187; Corona_I; 1.
PE 3: Inferred from homology;
KW Reference proteome; Virion.
FT CHAIN 1..207
FT /note="Protein I"
FT /id="PRO_0000284100"
SQ SEQUENCE 207 AA; 23051 MW; E00E7F571BA592ED CRC64;
MASLSGPISP TSLEMFKPGV EEFNPSKLLL LSNHQEGLLY PTILGSLELL SFKRERSLNL
QRDKVCLLHQ ESHLLKLRGT GTDTTDVLLK QPTAISVNCC HDGTFTTWEQ DRMPKTSTAP
TLTESSGSLV TRLILIPRLT LSIGIQVAMR LFRLGFRLAR YSLKVTILKA QEGLLLIPDL
LRVHPIEPLV QDRVVEPILA IEPLPLV