Abstract-Neuroscientists increasingly use computational tools to build and simulate models of the brain. The amounts of data involved in these simulations are immense and efficiently managing this data is key.One particular problem in analyzing this data is the scalable execution of range queries on spatial models of the brain. Known indexing approaches do not perform well, even on today's small models containing only few million densely packed spatial elements. The problem of current approaches is that with the increasing level of detail in the models, the overlap in the tree structure also increases, ultimately slowing down query execution. The neuroscientists' need to work with bigger and more importantly, with increasingly detailed (denser) models, motivates us to develop a new indexing approach.To this end we developed FLAT, a scalable indexing approach for dense data sets. We based the development of FLAT on the key observation that current approaches suffer from overlap in case of dense data sets. We hence designed FLAT as an approach with two phases, each independent of density.Our experimental results confirm that FLAT achieves independence from data set size as well as density and also outperforms R-Tree variants in terms of I/O overhead from a factor of two up to eight. I. INTRODUCTIONScientists in various disciplines increasingly use computational tools to simulate, process and analyze experimental data. Computational tools make it substantially simpler for them to conduct scientific tasks. At the same time however, scientists are also increasingly buried in the data deluge produced by their tools. Being able to access the relevant parts of their data, i.e., their spatial models, quickly in order to analyze, understand, and prepare new experiments is pivotal for them.In this paper we thus develop a new index that efficiently supports scientists in executing range queries on dense data sets stemming from increasingly detailed spatial models.The work presented in this paper is motivated by our collaboration with the Blue Brain Project (BBP [17]). With data acquired in anatomical research on the cortex of the rat brain the neuroscientists in the BBP build biophysically realistic models, the most detailed computer models of the brain to date, for simulation based research in neuroscience. The project began by focusing on the elementary building block of the neocortex, a neocortical column of about 10,000 neurons. Morphologically speaking, each of these neurons has branches extending into large parts of the tissue in order to receive and send out information to other neurons. Figure 1 (left) shows a cell morphology, with cylinders modeling the branching of the dendrite and axon in three dimensions.
Abstract-Spatial joins are becoming increasingly ubiquitous in many applications, particularly in the scientific domain. While several approaches have been proposed for joining spatial datasets, each of them has a strength for a particular type of density ratio among the joined datasets. More generally, no single proposed method can efficiently join two spatial datasets in a robust manner with respect to their data distributions. Some approaches do well for datasets with contrasting densities while others do better with similar densities. None of them does well when the datasets have locally divergent data distributions.In this paper we develop TRANSFORMERS, an efficient and robust spatial join approach that is indifferent to such variations of distribution among the joined data. TRANSFORM-ERS achieves this feat by departing from the state-of-the-art through adapting the join strategy and data layout to local density variations among the joined data. It employs a join method based on data-oriented partitioning when joining areas of substantially different local densities, whereas it uses big partitions (as in space-oriented partitioning) when the densities are similar, while seamlessly switching among these two strategies at runtime. We experimentally demonstrate that TRANSFORMERS outperforms state-of-the-art approaches by a factor of between 2 and 8.
No abstract
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.