Background Spastic cerebral palsy, the most common pediatric-onset disabling condition with an estimated prevalence of 0.2% in children, is a complex condition characterized by stiff movement, muscle contractures, and abnormal gait that can diminish quality of life. Spastic CP accounts for approximately 83% of all CP cases and frequently co-occurs with other complex conditions, like epilepsy. An estimated 42% of spastic CP cases have co-occurring epilepsy. Unfortunately, CP is often difficult to diagnose. Although most children with CP are born with it or acquire it immediately after birth, many are not identified until after 19 months of age with CP diagnosis often not confirmed until 5 years of age. New bioinformatic approaches to identify CP earlier are needed. Recent studies indicate that altered DNA methylation patterns associated with CP may have diagnostic value. The potential confounding effects of co-occurrent epilepsy on these patterns are not known. We evaluated machine learning classification of CP patients with or without co-occurring epilepsy.
Results Whole blood samples were collected from 30 study participants diagnosed with epilepsy (n=4), spastic CP (n=10), both (n=8), or neither (n=8). A novel Support-Vector-Machine learning algorithm was developed to identify methylation loci that have ability to classify CP from controls in the presence or absence of epilepsy. This algorithm was also employed to measure classification ability of identified methylation loci. After preprocessing of data, isolation of important methylation loci was performed in a binary comparison between CP and controls, as well as in a 4-way scheme, encapsulating epilepsy diagnoses. The classification ability was similarly assessed. CP Classification performance wasevaluated with and without inclusion of epilepsy as a feature. Median F1 scoreswere 0.67 in 4-class comparison, and 1.0 in the binary classification, outperforming Linear-Discriminant-Analysis (0.57 and 0.86, respectively).
Conclusion This novel algorithm was able to classify study participants with spastic CPand/or epilepsy from controls with significant performance. The algorithm shows promise for rapid identification in methylation data of diagnostic methylation loci. In this model, Support Vector Machines outperformed Linear Discriminant Analysis in classification. In the evaluation of epigenetics-based diagnostics for CP, epilepsy may not be a significant confounding factor.