Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g., multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learning can uncover hidden structures in sets of images (e.g., resting state functional MRI) or find sub-populations in large cohorts. By considering different functional neuroimaging applications, we illustrate how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps. Scikit-learn contains a very large set of statistical learning algorithms, both supervised and unsupervised, and its application to neuroimaging data provides a versatile tool to study the brain.
Machine learning is a pervasive development at the intersection of statistics and computer science. While it can benefit many data-related applications, the technical nature of the research literature and the corresponding algorithms slows down its adoption. Scikit-learn is an open-source software project that aims at making machine learning accessible to all, whether it be in academia or in industry. It benefits from the general-purpose Python language, which is both broadly adopted in the scientific world, and supported by a thriving ecosystem of contributors. Here we give a quick introduction to scikit-learn as well as to machine-learning basics.
BackgroundIn the context of sensory and cognitive-processing deficits in ADHD patients, there is considerable evidence of altered event related potentials (ERP). Most of the studies, however, were done on ADHD children. Using the independent component analysis (ICA) method, ERPs can be decomposed into functionally different components. Using the classification method of support vector machine, this study investigated whether features of independent ERP components can be used for discrimination of ADHD adults from healthy subjects.MethodsTwo groups of age- and sex-matched adults (74 ADHD, 74 controls) performed a visual two stimulus GO/NOGO task. ERP responses were decomposed into independent components by means of ICA. A feature selection algorithm defined a set of independent component features which was entered into a support vector machine.ResultsThe feature set consisted of five latency measures in specific time windows, which were collected from four different independent components. The independent components involved were a novelty component, a sensory related and two executive function related components. Using a 10-fold cross-validation approach, classification accuracy was 92%.ConclusionsThis study was a first attempt to classify ADHD adults by means of support vector machine which indicates that classification by means of non-linear methods is feasible in the context of clinical groups. Further, independent ERP components have been shown to provide features that can be used for characterizing clinical populations.
Brain oscillations exhibit long-range temporal correlations (LRTCs), which reflect the regularity of their fluctuations: low values representing more random (decorrelated) while high values more persistent (correlated) dynamics. LRTCs constitute supporting evidence that the brain operates near criticality, a state where neuronal activities are balanced between order and randomness. Here, healthy adults used closed-loop brain training (neurofeedback, NFB) to reduce the amplitude of alpha oscillations, producing a significant increase in spontaneous LRTCs post-training. This effect was reproduced in patients with post-traumatic stress disorder, where abnormally random dynamics were reversed by NFB, correlating with significant improvements in hyperarousal. Notably, regions manifesting abnormally low LRTCs (i.e., excessive randomness) normalized toward healthy population levels, consistent with theoretical predictions about selforganized criticality. Hence, when exposed to appropriate training, spontaneous cortical activity reveals a residual capacity for "self-tuning" its own temporal complexity, despite manifesting the abnormal dynamics seen in individuals with psychiatric disorder. Lastly, we observed an inverse-U relationship between strength of LRTC and oscillation amplitude, suggesting a breakdown of long-range dependence at high/low synchronization extremes, in line with recent computational models. Together, our findings offer a broader mechanistic framework for motivating research and clinical applications of NFB, encompassing disorders with perturbed LRTCs.
BackgroundThere are numerous event-related potential (ERP) studies in relation to attention-deficit hyperactivity disorder (ADHD), and a substantial number of ERP correlates of the disorder have been identified. However, most of the studies are limited to group differences in children. Independent component analysis (ICA) separates a set of mixed event-related potentials into a corresponding set of statistically independent source signals, which are likely to represent different functional processes. Using a support vector machine (SVM), a classification method originating from machine learning, this study aimed at investigating the use of such independent ERP components in differentiating adult ADHD patients from non-clinical controls by selecting a most informative feature set. A second aim was to validate the predictive power of the SVM classifier by means of an independent ADHD sample recruited at a different laboratory.MethodsTwo groups of age-matched adults (75 ADHD, 75 controls) performed a visual two stimulus go/no-go task. ERP responses were decomposed into independent components, and a selected set of independent ERP component features was used for SVM classification.ResultsUsing a 10-fold cross-validation approach, classification accuracy was 91%. Predictive power of the SVM classifier was verified on the basis of the independent ADHD sample (17 ADHD patients), resulting in a classification accuracy of 94%. The latency and amplitude measures which in combination differentiated best between ADHD patients and non-clinical subjects primarily originated from independent components associated with inhibitory and other executive operations.ConclusionsThis study shows that ERPs can substantially contribute to the diagnosis of ADHD when combined with up-to-date methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.