7We applied simulation-based approaches to characterize how microbial community structure 8 influences the amount of sequencing effort to reconstruct metagenomes that are assembled from short 9 read sequences. An initial analysis evaluated the quantity, completion, and contamination of complete-10 metagenome-assembled genome (complete-MAG) equivalents, a bioinformatic-pipeline normalized 11 metric for MAG quantity, as a function of sequencing effort, on four preexisting sequence read datasets 12 taken from a maize soil, an estuarine sediment, the surface ocean, and the human gut. These datasets 13 were subsampled to varying degrees of completeness in order to simulate the effect of sequencing 14 effort on MAG retrieval. Modeling suggested that sequencing efforts beyond what is typical in 15 published experiments (1 to 10 Gbp) would generate diminishing returns in terms of MAG binning. A 16 second analysis explored the theoretical relationship between sequencing effort and the proportion of 17 available metagenomic DNA sequenced during a sequencing experiment as a function of community 18 richness, evenness, and genome size. Simulations from this analysis demonstrated that while 19 community richness and evenness influenced the amount of sequencing required to sequence a 20 community metagenome to exhaustion, the effort necessary to sequence an individual genome to a 21 target fraction of exhaustion was only dependent on the relative abundance of the corresponding 22 organism and its genome size. A software tool, GRASE, was created to assist investigators further 23 explore this relationship. Re-evaluation of the relationship between sequencing effort and binning 24 success in the context of the relative abundance of genomes, as opposed to base pairs, provides a 25 framework to design sequencing experiments based on the relative abundance of microbes in an 26 environment rather than arbitrary levels of sequencing effort. 27 3 Background 28The reconstruction of high-accuracy short read sequences into metagenome-assembled genomes 29 (MAGs) is a recent approach to characterize microbial metabolisms within complex communities [1]. 30The recent creation of ~8,000 MAGs from largely uncultured organisms across the tree of life [2], the 31 spatial characterization of microbial metabolisms and ecology across Earth's oceans [3], and the 32 characterization of the potential impact that fermentation-based microbial metabolisms have on 33 biogeochemical cycling in subsurface sediment environments [4] provide a few examples of how 34MAGs have helped to elucidate the relationships between microbial ecology, microbial metabolisms, 35 and biogeochemistry. 36Sampling environmental microbial DNA involves selecting a target environment, sequencing 37 effort, bioinformatic pipeline software and parameters, metabolism characterization software (i.e., for 38 gene identification and similarity searches) and databases ( Fig. 1). At present, there is little information 39 to guide how much sequencing is appropriate to achieve scientific go...