The assessment of mutagenicity is essential in drug discovery, as it may lead to cancer and germ cells damage. Although in silico methods have been proposed for mutagenicity prediction, their performance is hindered by the scarcity of labeled molecules. However, experimental mutagenicity testing can be time-consuming and costly. One solution to lower the annotation cost is active learning, where the algorithm actively selects the most valuable samples from a vast chemical space and hands them over to the oracle (e.g., human expert) for annotation. In this paper, we propose TOX-AL, a deep active learning framework, which can actively explore the chemical space and identify the most valuable molecules, resulting in competitive performance with a small number of labeled samples. Compared to the random strategy, TOX-AL can reduce about 57% of training molecules. Surprisingly, TOX-AL exhibits outstanding molecular structural discriminability, allowing it to pick molecules with high structural similarity but opposite properties.