Machine Learning for Seizure Onset Zone Detection from sEEG Data

A Comprehensive Approach to Automated Epileptogenic Zone Identification
Alexiou Anna, Saraf Agam, Tsang Austin, Rodiere Zacharie, Warner Tyler
Machine Learning Class - CS 7641 | Georgia Institute of Technology
October 2025

Literature Review

Epilepsy affects over 50 million people worldwide, and about 30% of patients suffer from drug-resistant epilepsy, for whom surgery is often the only curative option. The success of surgical intervention depends critically on the accurate identification of the epileptogenic zone (EZ), specifically the Seizure Onset Zone (SOZ), the region where seizures originate.

Stereo-electroencephalography (sEEG), which involves intracranial recordings from depth electrodes, has become a gold standard for delineating the SOZ due to its ability to capture high-resolution spatiotemporal activity.

Key Biomarkers for SOZ Detection

Traditional approaches rely heavily on expert visual review, which is time-consuming, subjective, and prone to inter-rater variability. Machine learning methods including CNNs, RNNs, GNNs, and Transformers have shown promise in automating this process.

Dataset Description

We will employ four multi-center datasets to ensure variability in acquisition protocols, electrode configurations, and patient demographics. This heterogeneity will allow us to test the generalizability of our models across centers and clinical settings.

HUP iEEG Dataset

De-identified patient data (n=58) containing electrophysiologic data for interictal and ictal periods from the Hospital of the University of Pennsylvania.

View Dataset
Epilepsy iEEG Interictal Multicenter Dataset

De-identified patient data (n=39) containing iEEG recordings with sleep/wake state annotations from multiple centers including NIH, Johns Hopkins, and University of Miami.

View Dataset
Fragility Multi-Center Retrospective Study

De-identified patient data (n=100) containing iEEG and EEG data from 5 centers, with 4 centers' data publicly available.

View Dataset
CHUL Dataset (Centrale Hospitalier Universitaire de Lyon)

Intracranial EEG recordings from patients undergoing pre-surgical evaluation. This dataset is not publicly available but will be used for training.

Problem Definition and Motivation

For patients with drug-resistant focal epilepsy, surgical resection of the SOZ can be curative. However, the main clinical challenge lies in accurately identifying the SOZ from sEEG recordings. Mislocalization leads to failed surgeries and persistent seizures.

Current manual approaches are inefficient and limited by subjective interpretation. Our motivation is to develop machine learning models that can automatically detect the SOZ with high accuracy, robustness across datasets, and interpretability aligned with known biomarkers.

The project's impact extends beyond accuracy: automating SOZ detection can reduce diagnostic time, improve consistency, and potentially expand surgical candidacy by making evaluation more scalable.

Proposed Solutions

Preprocessing Pipeline

We adopt a multi-pronged preprocessing pipeline combining time-frequency decomposition with clinically validated biomarkers:

Wavelet Packet Transform (WPT)

Five levels of decomposition with Daubechies-4 basis to generate localized time-frequency features for each channel.

High-Frequency Oscillations

Automated HFO detection (80-500 Hz) from WPT representation provides features sensitive to microstructural pathological activity.

Interictal Spikes

Interictal epileptiform discharges are standard clinical markers of epileptogenic zones.

Connectivity Features

Functional connectivity measures including Amplitude Envelope Correlation (AEC) and Phase-Locking Value (PLV).

Machine Learning Models

Transformer Encoder with Spatial Attention

Learns channel-wise temporal representations with spatial attention module to highlight inter-channel relationships. Uses pretraining via spatial contrastive learning.

Transformer Model Architecture
Multi-Feature Deep Learning Model

Reproduction of a recent deep learning approach that utilizes multiple feature types and claims to achieve strong performance on seizure onset zone detection. We aim to reproduce and validate this methodology on our multi-center datasets.

Multi-Feature Model Architecture
Diffusion Convolutional Gated Recurrent Unit (DCGRU) - Self-Supervised Exploration

Self-supervised exploration using DCGRU to model the graph structure of sEEG electrodes. This approach learns signal representations that may enable anomaly detection on learned features if we develop a robust model of normal signal patterns.

DCGRU Model Architecture
Support Vector Machine (SVM)

Classical machine learning baseline using extracted time-frequency features, providing interpretability and comparison to deep models.

SVM Model Architecture

Expected Results and Evaluation

Evaluation Metrics

Since the SOZ is a physical region in the brain but we can only assign binary labels to each electrical contact, evaluation will be performed in terms of channel-wise binary classification:

≥0.80
AUROC
High
Recall
Balanced
F1 Score

Expected Performance

By triangulating results across models and datasets, we aim to validate the robustness of our approach and ensure consistent performance with minimal per-patient variability.

Team Contributions

Anna Alexiou
Literature review, Data preprocessing, Methodology, Results and Discussion
Agam Saraf
Team collaboration and project development
Austin Tsang
Team collaboration and project development
Zacharie Rodiere
Project initiator, proposed methods (transformer & GNNs), datasets, background readings, GitHub/GitLab management, Results & Discussion, Gantt chart
Tyler Warner
Introduction/background, problem definition, methods, results & discussion, Gantt chart

Project Timeline - Gantt Chart

Our project timeline showing the planned phases, milestones, and team responsibilities throughout the semester.

Project Award Eligibility

🏆 Outstanding Project Award Consideration

We would like to opt-in to be considered for the "Outstanding Project" award. Our team is committed to delivering high-quality research that advances the field of automated seizure onset zone detection using machine learning.

We believe our comprehensive approach combining multiple datasets, advanced preprocessing techniques, and diverse machine learning architectures positions us well for this recognition. Our project addresses a critical clinical need and demonstrates both technical innovation and practical impact.

References

[1] M.-C. Picot, M. Baldy-Moulinier, J.-P. Daurès, P. Dujols, and A. Crespel, "The prevalence of epilepsy and pharmacoresistant epilepsy in adults: A population-based study in a Western European country", Epilepsia, vol. 49, no. 7, pp. 1230–1238, 2008, doi: 10.1111/j.1528-1167.2008.01579.x.
[2] J. de Tisi, G. S. Bell, J. L. Peacock, A. W. McEvoy, W. F. J. Harkness, J. W. Sander, and J. S. Duncan, "The long-term outcome of adult epilepsy surgery, patterns of seizure remission, and relapse: a cohort study", The Lancet, vol. 378, no. 9800, pp. 1388–1395, 2011, doi: 10.1016/S0140-6736(11)60890-8.
[3] F. Bartolomei, S. Lagarde, F. Wendling, A. McGonigal, V. Jirsa, M. Guye, and C. Bénar, "Defining epileptogenic networks: Contribution of SEEG and signal analysis", Epilepsia, vol. 58, no. 7, pp. 1131–1147, Jul. 2017, doi: 10.1111/epi.13791.
[4] M. A. G. Matarrese, A. Loppini, L. Fabbri, E. Tamilia, M. S. Perry, J. R. Madsen, J. Bolton, S. S. D. Stone, P. L. Pearl, S. Filippi, and C. Papadelis, "Spike propagation mapping reveals effective connectivity and predicts surgical outcome in epilepsy", Brain, vol. 146, no. 9, pp. 3898–3912, Sep. 2023, doi: 10.1093/brain/awad118.
[5] S. Tang, J. A. Dunnmon, K. Saab, X. Zhang, Q. Huang, F. Dubost, D. L. Rubin, and C. Lee-Messer, "Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis", arXiv preprint arXiv:2104.08336, 2021.
[6] Y. Wang, Y. Yang, S. Li, Z. Su, J. Guo, P. Wei, J. Huang, G. Kang, G. Zhao "Automatic Localization of Seizure Onset Zone Based on Multi-Epileptogenic Biomarkers Analysis of Single-Contact from Interictal SEEG", PMC9774098, Bioengineering, 2022.