DQA) 23 24 ¶ These authors contributed equally to this work. 25 26 Abstract 27 The complete genome of a new rhabdovirus infecting papaya (Carica papaya L.) 28 was sequenced and characterized. The genome consists of 13,469 nucleotides with29 six canonical open reading frames (ORFs) predicted from the antigenomic strand. In 30 addition, two overlapping short ORFs were predicted between ORFs 3 and 4. 31 Phylogenetic analyses using amino acid sequences from the nucleocapsid, 32 glycoprotein and polymerase, grouped the virus with members of the genus 33 Cytorhabdovirus, with rice stripe mosaic virus, yerba mate chlorosis-associated 34 virus and Colocasia bobone disease-associated virus as closest relatives. The 3' 35 leader and 5' trailer sequences were 144 and 167 nt long, respectively. Each end 36 contains complementary sequences prone to form panhandle structures. The motif 37 3'-AUUCUUUUUG-5', conserved across rhabdoviruses, was identified in all but one 38 intergenic regions; whereas the motif 3'-ACAAAAACACA-5' was found in three 39 intergenic junctions. This is the first complete genome of a cytorhabdovirus 40 infecting papaya. The virus was prevalent in commercial plantings of Los Ríos, the 41 most important papaya producing province of Ecuador. During the final stage of this 42 manuscript preparation, the genome of a bean-associated cytorhabdovirus became 43 available. Nucleotide identity (97%) between both genomes indicated that the two 44 viruses are strains of the same species, for which we propose the name papaya 45 cytorhabdovirus E. 46 47 Introduction 48 The Rhabdoviridae, a negative-sense RNA virus family, contains viruses that infect a 49 wide range of hosts including vertebrates, invertebrates and plants [1]. Virions have a 50 helical, bullet-shape morphology, surrounded by a host-derived membrane [2]. 51 Rhabdovirus genomes range from 11 to 16 kilobases (kbp) with only non-segmented 52 ones classically assigned to the taxa. However, virus species with bipartite genomes 53 have recently been included in the family [3,4]. Based on host type, genomic 54 organization and other biological features, rhabdoviruses currently are organized in 13 55 genera: Cytorhabdovirus, Dichorhavirus, Ephemerovirus, Lyssavirus, 56 Novirhabdovirus, Nucleorhabdovirus, Perhabdovirus, Sigmavirus, Sprivivirus, 57 Tibrovirus, Tupoavirus, Varicosavirus and Vesiculovirus 58 (http:ictvonline.org/virusTaxonomy.asp). Next generation sequencing (NGS) 59 techniques have led to the discovery of several novel rhabdovirus species for which 60 new genera have been proposed [5,6,7].61 The genome organization of rhabdoviruses has a canonical arrangement of five genes: 62 3'-N-P-M-G-L-5' that encode the nucleocapsid protein, phosphoprotein, matrix 63 protein, glycoprotein and the large polymerase, respectively. Terminal regions have 64 non-coding regulatory sequences denoted, respectively, as 3'-leader (l) and 5' trailer 65 (t) [8-9]. Additional "accessory" genes have been observed in arrangements that differ 66 among rhabdoviruses [10]. 67 6...