Objective
To evaluate the diagnostic performance of artificial intelligence (AI)–based algorithms for identifying the presence of interictal epileptiform discharges (IEDs) in routine (20‐min) electroencephalography (EEG) recordings.
Methods
We evaluated two approaches: a fully automated one and a hybrid approach, where three human raters applied an operational IED definition to assess the automated detections grouped into clusters by the algorithms. We used three previously developed AI algorithms: Encevis, SpikeNet, and Persyst. The diagnostic gold standard (epilepsy or not) was derived from video‐EEG recordings of patients' habitual clinical episodes. We compared the algorithms with the gold standard at the recording level (epileptic or not). The independent validation data set (not used for training) consisted of 20‐min EEG recordings containing sharp transients (epileptiform or not) from 60 patients: 30 with epilepsy (with a total of 340 IEDs) and 30 with nonepileptic paroxysmal events. We compared sensitivity, specificity, overall accuracy, and the review time‐burden of the fully automated and hybrid approaches, with the conventional visual assessment of the whole recordings, based solely on unrestricted expert opinion.
Results
For all three AI algorithms, the specificity of the fully automated approach was too low for clinical implementation (16.67%; 63.33%; 3.33%), despite the high sensitivity (96.67%; 66.67%; 100.00%). Using the hybrid approach significantly increased the specificity (93.33%; 96.67%; 96.67%) with good sensitivity (93.33%; 56.67%; 76.67%). The overall accuracy of the hybrid methods (93.33%; 76.67%; 86.67%) was similar to the conventional visual assessment of the whole recordings (83.33%; 95% confidence interval [CI]: 71.48–91.70%; p > .5), yet the time‐burden of review was significantly lower (p < .001).
Significance
The hybrid approach, where human raters apply the operational IED criteria to automated detections of AI‐based algorithms, has high specificity, good sensitivity, and overall accuracy similar to conventional EEG reading, with a significantly lower time‐burden. The hybrid approach is accurate and suitable for clinical implementation.
The detection of epileptic seizures in long-term electroencephalographic (EEG) recordings is a time-consuming and tedious task requiring specially trained medical experts. The EpiScan seizure detection algorithm developed by the Austrian Institute of Technology (AIT) has proven to achieve high detection performance with a robust false alarm rate in the clinical setting. This paper introduces a novel time domain method for detection of epileptic seizure patterns with focus on irregular and distorted rhythmic activity. The method scans the EEG for sequences of similar epileptiform discharges and uses a combination of duration and similarity measure to decide for a seizure. The resulting method was tested on an EEG database with 275 patients including over 22000h of unselected and uncut EEG recording and 623 seizures. Used in combination with the EpiScan algorithm we increased the overall sensitivity from 70% to 73% while reducing the false alarm rate from 0.33 to 0.30 alarms per hour.
Background: Ongoing or recurrent seizure activity without prominent motor features is a common burden in neurological critical care patients and people with epilepsy during ICU stays. Continuous EEG (CEEG) is the gold standard for detecting ongoing ictal EEG patterns and monitoring functional brain activity. However CEEG review is very demanding and time consuming. The purpose of the present multirater, EEG expert reviewer study, is to test and assess the clinical feasibility of an automatic EEG pattern detection method (Neurotrend).Methods: Four board certified EEG reviewers used Neurotrend to annotate 76 CEEG datasets à 6 h (in total 456 h of EEG) for rhythmic and periodic EEG patterns (RPP), unequivocal ictal EEG patterns and burst suppression. All reviewers had a predefined time limit of 5 min (± 2 min) per CEEG dataset and were compared to a predefined gold standard (conventional EEG review with unlimited time). Subanalysis of specific features of RPP was conducted as well. We used Gwet's AC1 and AC2 coefficients to calculate interrater agreement (IRA) and multirater agreement (MRA). Also, we determined individual performance measures for unequivocal ictal EEG patterns and burst suppression. Bonferroni-Holmes correction for multiple testing was applied to all statistical tests.Results: Mean review time was 3.3 min (± 1.9 min) per CEEG dataset. We found substantial IRA for unequivocal ictal EEG patterns (0.61–0.79; mean sensitivity 86.8%; mean specificity 82.2%, p < 0.001) and burst suppression (0.68–0.71; mean sensitivity 96.7%; mean specificity 76.9% p < 0.001). Two reviewers showed substantial IRA for RPP (0.68–0.72), whereas the other two showed moderate agreement (0.45–0.54), compared to the gold standard (p < 0.001). MRA showed almost perfect agreement for burst suppression (0.86) and moderate agreement for RPP (0.54) and unequivocal ictal EEG patterns (0.57).Conclusions: We demonstrated the clinical feasibility of an automatic critical care EEG pattern detection method on two levels: (1) reasonable high agreement compared to the gold standard, (2) reasonable short review times compared to previously reported EEG review times with conventional EEG analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.