“…Identification of ~40,000 small-gene families in phage contigs. To predict novel small genes in phages, we first downloaded IMG/VR (Paez-Espino et al, 2017a; Roux et al, 2021), which contains 2,377,994 viral contigs for a combined total of over 48 billion bases of DNA. This database represents a large collection of viral datasets (Bushman et al, 2019; Espínola et al, 2018; Garcia et al, 2020; Gregory et al, 2019, 2020; Mehrshad et al, 2021; Mobilian et al, 2020; Nayfach et al, 2021a; Paez-Espino et al, 2017b, 2019; Roux et al, 2019; Schulz et al, 2020). From these viral contigs, we…”