Drug metabolite identification is a bottleneck of drug metabolism studies. Ion mobility-mass spectrometry (IM-MS) enables the measurement of collision cross section (CCS), a unique physical property related to an ion's gas-phase size and shape, which can be used to increase the confidence in the identification of unknowns. A current limitation to the application of IM-MS to the identification of drug metabolites is the lack of reference CCS values. In this work, we present the production of a large-scale database of drug and drug metabolite CCS values, assembled using high-throughput in vitro drug metabolite generation and a rapid IM-MS analysis with automated data processing. Subsequently, we used this database to train a machine learning-based CCS prediction model, employing a combination of conventional 2D molecular descriptors and novel 3D descriptors. This novel prediction model enables the prediction of different CCS values for different protomers, conformers, and positional isomers for the first time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.