Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a search engine for cell atlases. Using transcriptome data from mouse cell atlases we show how scfind can be used to evaluate marker genes, to perform in silico gating, and to identify both cell-type specific and housekeeping genes. Moreover, we have developed a subquery optimization routine to ensure that long and complex queries return meaningful results. To make scfind more user friendly and accessible, we use indices of PubMed abstracts and techniques from natural language processing to allow for arbitrary queries. Finally, we show how scfind can be used for multi-omics analyses by combining single-cell ATAC-seq data with transcriptome data. easy to interface with other widely used resources, e.g. the genome wide association study (GWAS) catalog (MacArthur et al., 2017) , the gene ontology (GO) knowledgebase (The Gene Ontology Consortium, 2017) , the disease ontology Medical Subject Headings (MeSH) (Sewell, 1964) , and collections of clinically important genetic variants (Cariaso and Lennon, 2012;Landrum et al., 2016) .Many of the challenges in working with single cell data stem from its large size. Typically, a single cell RNA-seq (scRNA-seq) dataset is represented as a gene-by-cell expression matrix with ~20,000 genes and may include millions of cells. For epigenetic data, e.g. scATAC-seq, the situation is often worse as the matrix can have many more rows, each representing a peak. Even though the matrix is sparse, working with such a dataset places high demands on computer hardware. Many operations are not only time consuming, but require advanced bioinformatics skills. To allow for both interactive and high-throughput queries of a large cell atlas, novel algorithms and data structures are required.To ensure that large single cell datasets can be accessed by a wide range of users, the underlying software must (i) allow for complex queries, (ii) take full advantage of the single-cell resolution of the data, (iii) support different types of assays besides RNA-seq, (iv) be fast and easy to use, (v) be possible to run interactively through a web browser to access large repositories of data, (vi) be possible to run locally by a user interested in analyzing their own data. However, none of the computational tools available today for collections of scRNA-seq datasets, e.g. Panglao (Franzén et al., 2019) , the UCSC Cell Browser (Haeussler et al., 2019) , scRNASeqDB (Cao et al., 2017) , SCPortalen (Abugessaisa et al., 2018) , and the EBI Single Cell Expression Atlas (Athar et al., 2019) , provide the required functionality and versatility.Here, we present scfind, a search engine that makes single cell data accessible to a wide range of users by enabling sophisticated queries for large datasets through an interface which is both very fast and familiar to users from any backg...