Middle East Respiratory Syndrome Coronavirus (MERS CoV)is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.Key words codon usage; camel; Coronavirus; helicase; inhibitor Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging virus infection causing lethal pneumonia and high case fatality.1-3) The infection with MERS CoV has now extended to become a global concern.
4-7)Recently, a serious outbreak of infection was recorded in South Korea and China. 8,9) MERS CoV is a single stranded RNA virus with a relatively large genome of ca. 30 kb.10) The virus was firstly isolated from humans. Currently there is still much debate in confirming the potential hosts and mode of transmission of the virus. Camels are incriminated as a potential host of the virus, and a considerable number of genomes of MERS CoV from camels were isolated and sequenced. [11][12][13] Therefore, there are two main sources of MERS CoV genomic data: humans and camels.The genetic code is comprised of 61 different codons corresponding to the standard 20 amino acids. Two of these amino acids are encoded by one codon (tyrosine (Tyr) and methionine (Met), while the other amino acids are encoded by several codons-up to 6 codons for 1 amino acid. Therefore, alternative codons present within 1 amino acid production codes are called synonymous codons.14) Codon bias infers the preference of one synonymous codon over the other, and may be over-represented to indicate overuse of a certain codon or rare codons that are rarely used for a certain gene. There are two main forces affecting the composition of a certain genome as well as codon usage...