Corpus overview


MeSH Disease

HGNC Genes

SARS-CoV-2 proteins

ProteinS (1)

ProteinN (1)


SARS-CoV-2 Proteins
    displaying 1 - 1 records in total 1
    records per page

    The heterogeneous landscape and early evolution of pathogen-associated CpG and UpA HGNC dinucleotides in SARS CoV-2

    Authors: Andrea Di Gioacchino; Petr Sulc; Anastassia V Komarova; Benjamin D Greenbaum; Remi Monasson; Simona Cocco

    doi:10.1101/2020.05.06.074039 Date: 2020-05-07 Source: bioRxiv

    SARS-CoV-2 infection MESHD can lead to acute respiratory syndrome MESHD in patients, which can be due in part to dysregulated immune signalling. We analyze here the occurrences of CpG dinucleotides, which are putative pathogen-associated molecular patterns, along the viral sequence. Carrying out a comparative analysis with other ssRNA viruses and within the Coronaviridae family, we find the CpG content of SARS-CoV-2, while low compared to other betacoronaviruses, widely fluctuates along its primary sequence. While the CpG relative abundance and its associated CpG force parameter [1] are low for the spike protein (S PROTEIN) and comparable to circulating seasonal coronaviruses such as HKU1, they are much greater and comparable to SARS and MERS for the 3-end of the viral genome. In particular, the nucleocapsid protein (N PROTEIN), whose transcripts are relatively abundant in the cytoplasm of infected cells and present in the 3UTRs of all subgenomic RNA, has high CpG content. We speculate this dual nature of CpG content can confer to SARS-CoV-2 high ability to both enter the host and trigger pattern recognition receptors (PRRs) in different contexts. We then investigate the evolution of synonymous mutations since the outbreak of the COVID-19 MESHD COVID-19 MESHD pandemic. Using a new application of selective forces on dinucleotides to estimate context driven mutational processes, we find that synonymous mutations seem driven both by the viral codon bias and by the high value of the CpG force in the N protein PROTEIN, leading to a loss in CpG content. Sequence motifs preceding these CpG-loss-associated loci match recently identified binding patterns of the Zinc Finger anti-viral Protein ( ZAP HGNC) protein.

The ZB MED preprint Viewer preVIEW includes all COVID-19 related preprints from medRxiv and bioRxiv, from ChemRxiv, from ResearchSquare, from arXiv and from and is updated on a daily basis (7am CET/CEST).
The web page can also be accessed via API.



MeSH Disease
HGNC Genes
SARS-CoV-2 Proteins

Export subcorpus as...

This service is developed in the project nfdi4health task force covid-19 which is a part of nfdi4health.

nfdi4health is one of the funded consortia of the National Research Data Infrastructure programme of the DFG.