Machine learning using multimodal clinical, electroencephalographic, and magnetic resonance imaging data can predict incident depression in adults with epilepsy: A pilot study

Abstract

Objective

This study was undertaken to develop a multimodal machine learning (ML) approach for predicting incident depression in adults with epilepsy.

Methods

We randomly selected 200 patients from the Calgary Comprehensive Epilepsy Program registry and linked their registry-based clinical data to their first-available clinical electroencephalogram (EEG) and magnetic resonance imaging (MRI) study. We excluded patients with a clinical or Neurological Disorders Depression Inventory for Epilepsy (NDDI-E)-based diagnosis of major depression at baseline. The NDDI-E was used to detect incident depression over a median of 2.4 years of follow-up (interquartile range [IQR] = 1.5–3.3 years). A ReliefF algorithm was applied to clinical as well as quantitative EEG and MRI parameters for feature selection. Six ML algorithms were trained and tested using stratified threefold cross-validation. Multiple metrics were used to assess model performances.

Results

Of 200 patients, 150 had EEG and MRI data of sufficient quality for ML, of whom 59 were excluded due to prevalent depression. Therefore, 91 patients (41 women) were included, with a median age of 29 (IQR = 22–44) years. A total of 42 features were selected by ReliefF, none of which was a quantitative MRI or EEG variable. All models had a sensitivity > 80%, and five of six had an F1 score ≥ .72. A multilayer perceptron model had the highest F1 score (median = .74, IQR = .71–.78) and sensitivity (84.3%). Median area under the receiver operating characteristic curve and normalized Matthews correlation coefficient were .70 (IQR = .64–.78) and .57 (IQR = .50–.65), respectively.

Significance

Multimodal ML using baseline features can predict incident depression in this population. Our pilot models demonstrated high accuracy for depression prediction. However, overall performance and calibration can be improved. This model has promise for identifying those at risk for incident depression during follow-up, although efforts to refine it in larger populations along with external validation are required.

0