The central nervous system can generate various behaviours, including motor responses, which we can observe through video recordings. Recent advancements in genetics, automated behavioural acquisition at scale, and machine learning enable us to link behaviours to their underlying neural mechanisms causally. Moreover, in some animals, such as the Drosophila larva, this mapping is possible at unprecedented scales of millions of animals and single neurons, allowing us to identify the neural circuits generating particular behaviours. These high-throughput screening efforts are invaluable, linking the activation or suppression of specific neurons to behavioural patterns in millions of animals. This provides a rich dataset to explore how diverse nervous system responses can be to the same stimuli. However, challenges remain in identifying subtle behaviours from these large datasets, including immediate and delayed responses to neural activation or suppression, and understanding these behaviours on a large scale. We introduce several statistically robust methods for analyzing behavioural data in response to these challenges: 1) A generative physical model that regularizes the inference of larval shapes across the entire dataset. 2) An unsupervised kernel-based method for statistical testing in learned behavioural spaces aimed at detecting subtle deviations in behaviour. 3) A generative model for larval behavioural sequences, providing a benchmark for identifying complex behavioural changes. 4) A comprehensive analysis technique using suffix trees to categorize genetic lines into clusters based on common action sequences. We showcase these methodologies through a behavioural screen focused on responses to an air puff, analyzing data from 280,716 larvae across 568 genetic lines.