ML4SOZ

Literature Review

Epilepsy affects over 50 million people worldwide, and about 30% of patients suffer from drug-resistant epilepsy, for whom surgery is often the only curative option. The success of surgical intervention depends critically on the accurate identification of the epileptogenic zone (EZ), specifically the Seizure Onset Zone (SOZ), the region where seizures originate.

Stereo-electroencephalography (sEEG), which involves intracranial recordings from depth electrodes, has become a gold standard for delineating the SOZ due to its ability to capture high-resolution spatiotemporal activity.

Key Biomarkers for SOZ Detection

High Frequency Oscillations (HFOs): 80-500 Hz signals considered reliable markers
Interictal Spikes: Interictal epileptiform discharges remain standard clinical markers
Phase-Amplitude Coupling (PAC): Captures cross-frequency interactions
Connectivity: Functional connectivity adds valuable information for SOZ localization

Traditional approaches rely heavily on expert visual review, which is time-consuming, subjective, and prone to inter-rater variability. Machine learning methods including CNNs, RNNs, GNNs, and Transformers have shown promise in automating this process.

Dataset Description

We will employ four multi-center datasets to ensure variability in acquisition protocols, electrode configurations, and patient demographics. This heterogeneity will allow us to test the generalizability of our models across centers and clinical settings.

HUP iEEG Dataset

De-identified patient data (n=58) containing electrophysiologic data for interictal and ictal periods from the Hospital of the University of Pennsylvania.

View Dataset

Epilepsy iEEG Interictal Multicenter Dataset

De-identified patient data (n=39) containing iEEG recordings with sleep/wake state annotations from multiple centers including NIH, Johns Hopkins, and University of Miami.

View Dataset

Fragility Multi-Center Retrospective Study

De-identified patient data (n=100) containing iEEG and EEG data from 5 centers, with 4 centers' data publicly available.

View Dataset

CHUL Dataset (Centrale Hospitalier Universitaire de Lyon)

Intracranial EEG recordings from patients undergoing pre-surgical evaluation. This dataset is not publicly available but will be used for training.

Problem Definition and Motivation

For patients with drug-resistant focal epilepsy, surgical resection of the SOZ can be curative. However, the main clinical challenge lies in accurately identifying the SOZ from sEEG recordings. Mislocalization leads to failed surgeries and persistent seizures.

Current manual approaches are inefficient and limited by subjective interpretation. Our motivation is to develop machine learning models that can automatically detect the SOZ with high accuracy, robustness across datasets, and interpretability aligned with known biomarkers.

The project's impact extends beyond accuracy: automating SOZ detection can reduce diagnostic time, improve consistency, and potentially expand surgical candidacy by making evaluation more scalable.

Proposed Solutions

Preprocessing Pipeline

We adopt a multi-pronged preprocessing pipeline combining time-frequency decomposition with clinically validated biomarkers:

Wavelet Packet Transform (WPT)

Five levels of decomposition with Daubechies-4 basis to generate localized time-frequency features for each channel.

High-Frequency Oscillations

Automated HFO detection (80-500 Hz) from WPT representation provides features sensitive to microstructural pathological activity.

Interictal Spikes

Interictal epileptiform discharges are standard clinical markers of epileptogenic zones.

Connectivity Features

Functional connectivity measures including Amplitude Envelope Correlation (AEC) and Phase-Locking Value (PLV).

Machine Learning Models

Transformer Encoder with Spatial Attention

Learns channel-wise temporal representations with spatial attention module to highlight inter-channel relationships. Uses pretraining via spatial contrastive learning.

Multi-Feature Deep Learning Model

Reproduction of a recent deep learning approach that utilizes multiple feature types and claims to achieve strong performance on seizure onset zone detection. We aim to reproduce and validate this methodology on our multi-center datasets.

Diffusion Convolutional Gated Recurrent Unit (DCGRU) - Self-Supervised Exploration

Self-supervised exploration using DCGRU to model the graph structure of sEEG electrodes. This approach learns signal representations that may enable anomaly detection on learned features if we develop a robust model of normal signal patterns.

Support Vector Machine (SVM)

Classical machine learning baseline using extracted time-frequency features, providing interpretability and comparison to deep models.

Expected Results and Evaluation

Evaluation Metrics

Since the SOZ is a physical region in the brain but we can only assign binary labels to each electrical contact, evaluation will be performed in terms of channel-wise binary classification:

≥0.80

AUROC

High

Recall

Balanced

F1 Score

Expected Performance

Transformer model: Highest AUROC (≥0.80)
Multi-Feature Model: Reproduction of published results with validation on our datasets
DCGRU (Self-supervised): Exploratory approach for anomaly detection on learned features
SVM: Baseline performance (AUROC ~0.60-0.65)

By triangulating results across models and datasets, we aim to validate the robustness of our approach and ensure consistent performance with minimal per-patient variability.

Team Contributions

Anna Alexiou

Literature review, Data preprocessing, Methodology, Results and Discussion

Agam Saraf

Team collaboration and project development

Austin Tsang

Team collaboration and project development

Zacharie Rodiere

Project initiator, proposed methods (transformer & GNNs), datasets, background readings, GitHub/GitLab management, Results & Discussion, Gantt chart

Tyler Warner

Introduction/background, problem definition, methods, results & discussion, Gantt chart

Project Timeline - Gantt Chart

Our project timeline showing the planned phases, milestones, and team responsibilities throughout the semester.

Project Award Eligibility

🏆 Outstanding Project Award Consideration

We would like to opt-in to be considered for the "Outstanding Project" award. Our team is committed to delivering high-quality research that advances the field of automated seizure onset zone detection using machine learning.

We believe our comprehensive approach combining multiple datasets, advanced preprocessing techniques, and diverse machine learning architectures positions us well for this recognition. Our project addresses a critical clinical need and demonstrates both technical innovation and practical impact.

References

[1] M.-C. Picot, M. Baldy-Moulinier, J.-P. Daurès, P. Dujols, and A. Crespel, "The prevalence of epilepsy and pharmacoresistant epilepsy in adults: A population-based study in a Western European country", Epilepsia, vol. 49, no. 7, pp. 1230–1238, 2008, doi: 10.1111/j.1528-1167.2008.01579.x.

[2] J. de Tisi, G. S. Bell, J. L. Peacock, A. W. McEvoy, W. F. J. Harkness, J. W. Sander, and J. S. Duncan, "The long-term outcome of adult epilepsy surgery, patterns of seizure remission, and relapse: a cohort study", The Lancet, vol. 378, no. 9800, pp. 1388–1395, 2011, doi: 10.1016/S0140-6736(11)60890-8.

[3] F. Bartolomei, S. Lagarde, F. Wendling, A. McGonigal, V. Jirsa, M. Guye, and C. Bénar, "Defining epileptogenic networks: Contribution of SEEG and signal analysis", Epilepsia, vol. 58, no. 7, pp. 1131–1147, Jul. 2017, doi: 10.1111/epi.13791.

[4] M. A. G. Matarrese, A. Loppini, L. Fabbri, E. Tamilia, M. S. Perry, J. R. Madsen, J. Bolton, S. S. D. Stone, P. L. Pearl, S. Filippi, and C. Papadelis, "Spike propagation mapping reveals effective connectivity and predicts surgical outcome in epilepsy", Brain, vol. 146, no. 9, pp. 3898–3912, Sep. 2023, doi: 10.1093/brain/awad118.

[5] S. Tang, J. A. Dunnmon, K. Saab, X. Zhang, Q. Huang, F. Dubost, D. L. Rubin, and C. Lee-Messer, "Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis", arXiv preprint arXiv:2104.08336, 2021.

[6] Y. Wang, Y. Yang, S. Li, Z. Su, J. Guo, P. Wei, J. Huang, G. Kang, G. Zhao "Automatic Localization of Seizure Onset Zone Based on Multi-Epileptogenic Biomarkers Analysis of Single-Contact from Interictal SEEG", PMC9774098, Bioengineering, 2022.