Honours Thesis: Deep Learning Driven Epilepsy Diagnosis

At a Glance

Novel hybrid bilinear deep neural network achieved state-of-the-art accuracy in epileptic seizure classification using scalp-EEG. Proposed algorithms exhibit generalisation across unseen datasets and high discriminative ability between similar EEG patterns.

Background & Problem Statement

Epilepsy

Fig.1 - Epilepsy manifested in EEG signals.

Critical treatment and prognosis procedures for epilepsy start with and rely on the correct identification of epileptic seizure type. Video-EEG is the most common diagnostic tool that neurologists use to support a diagnosis. However, EEG based epilepsy diagnosis is often confounded by several factors:
  • Overlapping clinical and EEG features across different epilepsy types makes identifying key bio-markers difficult
  • Inherent subjectivity in visual examination (of EEG) contributes to variability in clinical interpretation based on the EEG reader’s level of expertise
  • Visual examination of video-EEG results in a tedious and time-consuming process particularly where hour’s or day’s worth of EEG needs to be reviewed visually
The uncertainty inherent in contemporary seizure diagnostic tools motivated this investigation into developing novel deep learning algorithms that can classify epileptic seizures at high accuracy.

Algorithm

Hybrid Bilinear Model

Fig.2 - Hybrid Bilinear Model. The feature extractors in the model can be replaced with any ML extractor. Here, I have chosen to use a CNN and ConvLSTM.

CNN Extractor ConvLSTM Extractor

Fig.3a) - CNN Feature Extractor; Fig.3b) - ConvLSTM Feature Extractor

This study proposes the use of bilinear models for the purpose of seizure-type classification. Bilinear models have been shown to be exceptionally effective at fine-grained recognition and differentiating between similar patterns (e.g. classification of different dog breeds). There is considerable overlap in EEG patterns between certain epilepsy types, which motivated the use of the more discriminative bilinear architecture.
Fig 2. illustrates the architecture of the hybrid bilinear network - the outputs of the 'base' feature extractors are multiplied through matrix outer product to derive the bilinear vector. The hybrid-bilinear model investigated in this work employs both CNN and RNN as feature extractors, where one network (ConvLSTM) extracts temporal features and the other (CNN) models spatial features, which are combined into second-order statistics through bilinear pooling. Fig 3. describes the two 'base' feature extractors used in this study. Note that the feature extractors can be replaced with any ML extractor. The training process for the bilinear algorithm is detailed in Fig 4.

Training Algorithm

Fig.4 - Bilinear Model Training Steps.

Results

The algorithm was trained/evaluated on two datasets - the Temple University Hospital Seizure Corpus (TUH) and the EPILEPSIAE dataset. On the TUH dataset, the hybrid model achieved F1-score of 97.40%, exceeding the previously established benchmark.
The same algorithm, without any adjustments to the architecture, was trained and evaluated on the EPILEPSIAE dataset, achieving a comparable performance of 97.00%. The comparable results achieved on EPILEPSIAE demonstrates the generalisation ability of the algorithm. Since EPILEPSIAE is a significantly smaller dataset, the results also demonstrate the ability for the algorithm to learn from smaller amounts of data.
The bilinear models work well as the bilinear vector obtained by multiplying the output of feature extractors can effectively model interactions between pairs of local features. Thus, they can discriminate well when the input data is similar and can thus differentiate similar EEG artifacts in different seizure classes.

My Contributions

This project was independently proposed and completed by myself for my undergraduate Honour's Thesis. To learn more, please refer to this published paper.