A review of usable EEG-based solutions for epileptic seizure prediction Abir Affes1,* , Afef Mdhaffar1,2 , Mohamed Jmaiel1,2 and Bernd Freisleben3 1 ReDCAD Laboratory, ENIS, University of Sfax, Tunisia 2 Digital Research Center of Sfax, 3021 Sfax, Tunisia 3 Department of Mathematics and Computer Science, University of Marburg, D-35043 Marburg, Germany. Abstract An epileptic seizure is a sudden event caused by an abnormal activity of the brain leading to severe consequences for affected patients, such as serious injuries and even sudden death. Therefore, it is crucial to predict epileptic seizures using wearable sensors. Electroencephalography (EEG) signals are the most adequate sensor signals leading to an accurate prediction. However, EEG sensors contain several electrodes, making them not usable on a daily basis. To solve this issue, researchers opted to reduce the number of electrodes by selecting the most pertinent EEG channels. In this paper, we present a literature review of epileptic seizure prediction approaches. We consider the different steps along the pathway, from data preprocessing to classification and performance evaluation. In addition, with the aim of a final product that can be worn daily for continuous monitoring, we study existing works that are related to EEG channel selection. We also discuss possible future studies to design a usable and efficient seizure prediction system. Keywords Epileptic seizure prediction, EEG channel selection, Deep learning, Electroencephalogram 1. Introduction Epilepsy is a neurological disease that causes epileptic seizures, accompanied by a loss of consciousness, leading to serious injuries [1]. Therefore, it is necessary to propose a usable solution that can be worn daily by affected patients for continuous monitoring, which predicts seizures before they happen. This is a challenging problem attracting researchers from several disciplines. In the last years, epilepsy-related research has matured thanks to the availability of data and the shown transition between the seizure phases. Furthermore, advanced analytics has attracted the attention of researchers to use predictive modeling, machine learning algorithms, and deep learning methods in detecting and predicting epileptic seizures. This paper presents a literature review of existing approaches to predict epileptic seizures. Our study starts by discussing data preprocessing that is essential for algorithms to operate efficiently TACC 2023: Tunisian-Algerian Joint Conference on Applied Computing, November 6 - 8, Sousse, Tunisia * Corresponding author. $ abir.affes@stud.enis.tn (A. Affes); afef.mdhaffar@enis.tn (A. Mdhaffar); mohamed.jmaiel@redcad.org (M. Jmaiel); freisleben@uni-marburg.de (B. Freisleben)  0000-0001-9333-6811 (A. Affes) © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings and fulfil the intended purpose. We also compare the performance of prediction systems based on standard popular algorithms or novel methods. We describe the used data, the proposed methods dealing with the problem and their related results. Existing studies demonstrate that the records of the electrical activity of the brain (EEG) are the most informative data items that can be used to predict epilepsy seizures. However, EEG sensors cannot not be worn on a daily basis due to high number of electrodes, which is related to the number of channels (i.e., one channel corresponds to two electrodes). Thus, researchers proposed novel approaches that deal with EEG channel selection. These approaches allow us to reduce the number of electrodes composing an EEG sensor to make it suitable for daily use. This paper discusses these approaches and presents future research leading to the development of usable EEG-based systems for epileptic seizure prediction. The remainder of this paper is organized as follows. Section 2 describes and discusses some related approaches that were developed in the context of seizure prediction. Section 3 presents and discusses relevant studies that deal with EEG channel selection. Section 4 discusses the objectives and challenges related to the development of a usable EEG-based system for epileptic seizures prediction. Section 5 summarizes the paper. 2. Epileptic seizure prediction Uncontrolled seizures are a major handicap for people with epilepsy in their daily life. The difficulties encountered by these people can be mitigated by developing algorithms capable of anticipating epileptic episodes. Aimed at enhancing the patients’s life quality, several algorithms were proposed for the detection and prediction of seizures. Recent research has principally focused on advancing the prediction of an episode, aiming to allow enough time to take necessary precautions before seizures start. In the following, we discuss several published approaches of seizure prediction. 2.1. Approaches to epileptic seizure prediction For epileptic seizure prediction, Truong et al. [2] used three databases: (1) the Freiburg Hospital intra-cranial EEG (iEEG) data set, (2) the CHB-MIT data set, and (3) the American Epilepsy Society Seizure Prediction Challenge data set. They selected 13 out of 24 patients from the CHB-MIT database and 13 out of 21 patients from the Freiburg iEEG database. The continuous data was segmented into 30-seconds windows. To deal with imbalance between classes, Truong et al. [2] generated more preictal segments. The authors first applied the Short-Time-Fourier- Trasnform (STFT) to transform raw data into spectrograms. Then, they proposed a CNN that automatically extracts features and classifies the data in a patient-oriented manner. As a post- processing configuration, the authors set an alarm only if 240 seconds per 300 seconds positive predictions were generated. The proposed model achieved a sensitivity of 81.4%, 81.2%, and 75%, and a FPR of 0.06/h, 0.16/h, and 0.21/h, with the 3 databases, respectively. The seizures were predicted at most 5 minutes before the onset. Fujiwara et al. [3] aimed to predict epileptic seizures using heart rate variability (HRV) analysis. First, HRV features were extracted from the data of epileptic patients. Then, an anomaly monitoring technique called multivariate statistical process control (MSPC) was implemented to predict the seizures. Fujiwara et al. [3] used ECG data collected from 11 patients. They defined the preictal episode to be 15 min before and 5 min after the seizure’s onset. The interictal data was recorded at least 50 min before or after the seizure’s onset. The advantage of using MSPC was the fact that, for modeling, it only needs the interictal data considered as the normal data. The proposed approach was not efficient with the seizures that happened during sleep. Considering the 8 patients with wakening preictal episodes, it achieved an average sensitivity of 91% with a FPR of 0.7/h. Fujiwara et al. [3] showed that for seizure prediction, HRV individuality must be considered among the patients. Khan et al. [4] used a CNN to predict focal epileptic seizures. Their model was evaluated using data from 28 patients from the Mount Sinai Hospital and data from the CHB-MIT database. Only the records including one single seizure were considered. The data was first transformed using continuous wavelet transform applied to each channel. This step permits to exploit both time and frequency information. The proposed model is a CNN with six convolutional layers, two dense layers and a softmax layer that outputs the probability distribution of the input data. The seizure prediction problem is defined as a multi-class classification (i.e., the target output is either interictal, preictal, or ictal labels). To deal with the imbalanced data sets, Khan et al. [4] selected random examples from the interictal data set, so the classes were balanced. One challenge presented in this study is to estimate the optimal length of preictal period also considered as the prediction horizon (SPH) in a patient-independent way. During the learning process, a grid search on many possible values and cross-validation were used to select the appropriate preictal period. The final length was set to be 10 min before the seizure onset. The proposed model achieved a sensitivity of 87.8% and a FPR of 0.142/h during the test phase. The correctly predicted seizures were reported 4 to 10 min before the seizure onset. Shahbazi and Aghajan [5] developed a CNN-LSTM network to predict seizures in a patient- specific manner. To evaluate the model, the authors considered 14 out of 24 cases from the CHB-MIT database. The records were segmented into windows with 10-seconds length. Preictal data was extracted from the 30 min that precede the seizure’s onset. To deal with imbalanced sets, only a number of the interictal segments equal to the preictal segments was considered. Shahbazi and Aghajan [5] initially applied STFT to transform the raw data segments into multichannel images. The features from spectral, spatial, and temporal domains were then learned by the CNN-LSTM model to classify the segments into preictal and interictal data. The CNN was initially pre-trained to classify the data. Then, the whole CNN-LSTM was trained using the initialized CNN weights. To obtain an effective prediction, the event was reported only if 8 of 10 successive samples are classified as preictal segments. The evaluation results were, on average, a sensitivity of 98.21% and a FPR of 0.13/h. The mean prediction time was 44.74 minutes. Shasha et al. [6] proposed a lightweight solution for the epileptic seizure prediction problem, which can efficiently operate on edge computing platforms. They considered 19 patients from the CHB-MIT database to validate their approach. Shasha et al. [6] chose the preictal data to be the 15 min preceding the ictal phase. First, the EEG recordings were transmitted to the edge computing platform. After segmentation, the correlation matrices were extracted using Pearson correlation coefficient (PCC). These features represent the synchronization strength between all pairs of EEG channels, during the different brain states. Then, the matrices were fed into a simple CNN to classify the preictal and interictal signals. The prediction results were 92.91% of sensitivity and 89.98% of accuracy. Zhang et al. [7] presented an approach based on a bidirectional long short-time memory (Bi- LSTM) for the prediction of the epileptic seizure. They selected 13 patients from the CHB-MIT database to evaluate their method. First, they segmented the signals into 5 seconds windows. Then, for the feature extraction, Zhang et al. [7] calculated an optimized multidimensional sample entropy that considers all the 23 EEG channels. For the classification step, the Bi-LSTM network was trained using data from the 13 patients and then, tested on each patient individually. The proposed method achieved, on average, 80% of accuracy, 86.67% of sensitivity and a FPR of 0.26/h. The authors also compared 3 intervention times equal to 2, 5 and 10 min. Based on the model performance, they finally chose the prediction time to be 5 minutes. 2.2. Discussion The previously discussed approaches that deal with the prediction of epileptic seizures are shown in Table 1. In addition to the details about the considered data and the analysis process, we also specify if the proposed solution can be a real life practical product, i.e., the proposed approach involving the data type can be convenient to use in real life. Furthermore, if many algorithms were tested, only the results corresponding to the best performance are indicated in the tables. Existing studies have demonstrated the feasibility of epileptic seizure monitoring. Prediction systems aim to improve the life quality of patients with refractory epilepsy (i.e., drug-resistant). Although the precital period is not visually recognizable when analyzing the related data, state-of-the-art research studies have shown the transition between the interictal and the ictal state and have shown the possibility of predicting epileptic seizures. A perfect prediction is not reached yet, but with the currently achieved performance, patients can still be warned for the majority of their seizures. Regarding the used parameters, Fujiwara et al. [3] used ECG to monitor the epileptic seizures. The advantage of using non-EEG solutions is that they are more promising with regard to their practical use in daily life. Using the HR variable, Fujiwara et al. [3] obtained a good performance. However, they defined the preictal period to last from the 15 min before the seizure until 5 min after the actual onset. This way, predicting a seizure at a time close or matching the onset can not be counted as a successful prediction. On the other hand, unlike non-EEG solutions that are usually associated to specific types of seizures, EEG solutions were tested with a variety of seizures that encompasses focal and generalized seizures, with or without motor symptoms. Therefore, EEG is proven to be the most valuable source for epilepsy monitoring. Making the EEG a daily life tool that will be worn continuously by the patients is not realistic. Consequently, it is important to transform the conventional EEG-based solutions into a minimalistic and portable product. Indeed, the main objective is to reduce the number of EEG channels to the minimum and keep the same performance. We need to further investigate this issue. In almost all the studied approaches that use EEG data [2, 8, 9, 4, 5], the authors used techniques such as STFT, DFT, and wavelet transforms to convert the raw data into a form that reveals the frequency aspect in addition to the temporal aspect. They have shown the efficiency of using time-frequency features in comparison with using the raw data. Some of the studies relied on handcrafted feature extraction, while others used intelligent Table 1 Related work on epileptic seizure prediction Real life Patient- Features Related prac- Seizure’s specific Data engi- Classifier Performance* work tical type vs cross- neering product patient MSPC ECG Focal patient- handcrafted [3] yes anomaly sen = 91%, / HR seizures specific extraction detection FPR = 0.7/h cross- [4] EEG no variety* CNN CNN sen = 87.8%, patient FPR = 0.142/h patient- [5] EEG no variety CNN CNN+LSTM sen = 98.21%, specific FPR = 0.13/h EEG + patient- [2] no variety CNN CNN 3 databases: iEEG specific sen = 81.4% & 81.2% & 75%, FPR = 0.06/h & 0.16/h & 0.21/h cross- [6] EEG no variety PCC CNN Ac = 89.98%, patient sen = 92.91% cross- entropy [7] EEG no variety Bi-LSTM sen = 86.67%, patient method FPR = 0.26/h variety : focal, lateral, and generalized seizure onsets. Performance: sen = sensitivity/recall; FPR = False Positive Rate; prec= precision; spec = specificity; Ac = Accuracy. techniques for automatic feature extraction. They commonly used CNNs [8, 4, 5, 2] to learn and extract the most relevant information. For data classification, the majority of the approaches used ML algorithms that have shown optimistic results. In the related work on seizure prediction, the majority of the approaches are essentially based on the use of a single type of DL network, such as a CNN or an RNN variant (e.g., RNN vanilla, LSTM, GRU). Epileptic seizures are temporal intervals or events characterized by their sequential phases. Hence, during a seizure, the signal segments are correlated. Consequently, previous studies demonstrated the effectiveness of the recurrent aspect in DL models used to analyze the EEG data. One approach [5] is based on a combination of a CNN and an LSTM. It achieved promising results. In this work, preictal data was choosen to be at most 30 minutes before the seizure onset. However, for 12 among 14 patients, their model predicted the seizure before an horizon going from 32 to 103 minutes. Thus, the reported predictions are likely not correct classifications. Based on the different RNNs proposed in [10, 11, 12, 13, 9], the GRU network is shown to be adequate to analyze the EEG data in the context of seizure prediction. It has shown good results in temporal sequence modeling and especially in the detection of different seizure phases [11, 9, 14]. 3. EEG channel selection Contrary to the other eventual measurements, EEG can effectively describe all the seizure types, either with hidden symptoms or with apparent altered behavior. It is up to now the most adequate tool to use for seizure prediction task. In the literature, many researchers addressed the problem of EEG channel selection with regard to epilepsy disorder. In general, their aim is to reach the same or better classification results while using a reduced number of EEG channels compared to using all the initial channels. 3.1. Approaches to EEG channel selection for seizure monitoring In the context of epileptic seizure prediction, Jana and Mukherjee [15] presented a patient- specific solution to select the best EEG channels. The proposed algorithm was based on the use of a CNN for automatic feature extraction and EEG data classification. The CHB-MIT database was used to validate the approach. The proposed channel reduction algorithm proceeded by elimination. First, the authors calculated the average classification accuracy using all channels. Then, they reduced each channel temporally and checked the average classification accuracy using the remaining channels. The channel was permanently removed if it did not affect the accuracy result. The first iteration reduced the number of channels to 15. Using the same technique, two other iterations reduced the total number to 6 most important channels. The selection was stopped when removing any of the remaining channels provided less accuracy. The proposed method achieved 99.47%, 97.83% and 92.36% in terms of average classification accuracy, sensitivity and specificity, respectively. The procedure reduced the EEG channels by 72.73%. The proposed method was designed to reduce the channels while keeping the same accuracy as using all the channels. Shoka et al. [16] aimed to detect the epileptic seizure. They presented a five-step algorithm to analyze EEG data. The first step was dimensionality reduction. It consisted of selecting the most relevant EEG channels. The next steps were feature extraction from the selected channels, feature averaging, classification using seven different classifiers, and finally testing using a cross-validation method. The proposed method was evaluated using the CHB-MIT data set. To select the most important channels that reveal the seizure, the variance of the signal amplitude was calculated from the several channels. Then, only the data from the three channels with the maximum variance was analyzed and considered for seizure detection. The authors set a fixed number of channels to select for all the patients. In the context of epileptic seizure detection, Karimi and Kassiri [17] proposed a patient- specific approach that includes channel selection. The solution consisted of three main steps: (1) EEG channel reduction toward dimensionality reduction, (2) feature extraction, and (3) data classification using a non-linear SVM. The authors selected 5 (out of 23) channels for every patient that can determine the seizure state from the non-seizure state. In fact, the selection was based on the ratio between the average energy of the wideband ictal signal and interictal signal. The proposed algorithm was validated on the CHB-MIT data set. The model achieved a detection sensitivity of 96.87% and a specificity of 99.95%. With only 5 selected channels, the authors reduced the processing power/time by a factor of 4.6 for univariate feature extraction and 25.3 for bivariate features. The results showed that the proposed automatic patient-specific channel selection method improved the seizure detection performance. However, the authors statically set the number of channels to select. To improve the seizure detection performance, Jana et al. [18] proposed an EEG channel selection algorithm. It also allowed to decrease the computational cost. The approach was based on binary particle swarm optimization (BPSO) integrated with the extreme learning machine (ELM) classifier. To determine the best size of channels set and identify the best combination for seizure detection, the authors measured a value called fitness function of 50 random combinations of channels for 100 iterations. The fitness function assumed a balanced relation between the classification accuracy and the number of channels. Jana et al. [18] defined their fitness function with weights equal to 0.7 and 0.3 for the accuracy and the channels reduction, respectively. To evaluate the proposed approach, the data from two patients from the CHB-MIT data set was processed. After channel optimization, the ELM classifier tested the final set of channels to estimate the seizure detection performance. For the first subject, the proposed solution improved the accuracy by approximately 3% after reducing the number of channels to 6. For the second subject, the model achieved an accuracy of 75.47% after the channel selection compared to an accuracy of 73.69% with all the channels. In this approach, the authors needed to choose the weights of the fitness function. Furthermore, testing many combinations can be time consuming for real applications. Towards an efficient seizure prediction, Daoud and Bayoumi [19] included in their study an algorithm that selects the most important EEG channels. In fact, they proposed several DL models to classify preictal and interictal signals and then compared the results before and after reducing the channels. They chose the preictal signal to be one hour before the seizure onset and the interictal phase to be at least four hours away from the seizures. They considered raw EEG data as their input without preprocessing. The four proposed models are (1) a simple MLP, (2) a CNN to extract spatial features from the electrodes’ locations followed by an MLP for classification, (3) an ensemble of a CNN and a Bi-LSTM and (4) a deep convolutional autoencoder (DCAE). Daoud and Bayoumi [19] have shown results indicating enhancements after using the CNN and the Bi-LSTM instead of MLP. To select the channels, they calculated the product of variance and entropy of the preictal signals for each channel. Indeed, the authors selected the 8 channels with the highest values and test the selection. The prediction results were compared to the ones using all the channels. Iteratively, in case of a decreased performance, the next high-rated channel was added to the selection. Daoud and Bayoumi [19] considered 8 patients to evaluate their approach. Using only 10 channels on average among the considered subjects, they achieved the same performance as all the channels. Indeed, the three presented models (1) CNN-BiLSTM, (2) DCAE-BiLSTM and (3) DCAE-BiLSTM with channel selection achieved the same prediction performance in terms of accuracy, precision, sensitivity, and FPR. The benefits of channel selection were a reduced training time and using less parameters. On the other hand, the authors had to fix the number of channels to test at first. Chakrabarti et al. [20] tested different configurations of artificial neural networks (ANN) to detect epileptic seizures. For feature extraction and channel reduction, they applied PCA. It was used to reduce high-dimensional data sets while keeping the most relevant information. The proposed approach was evaluated using 10 patients’ records from the CHB-MIT data set. Based on PCA and the channels’ contributions, several combinations were tested. The number of selected channels ranges from 3 to 19. The 3-layer ANN achieved the best performance with 85.6% accuracy when using all initial channels. After applying the channel selection algorithm, the 4-layer ANN achieved 86.7% accuracy using only 18 among the 23 channels. Yuan et al. [21] considered an SSDA-based approach to select the EEG channels by looking for correlation between them. They ranked the channels based on a zero-stimulus method. Half of the number of initial channels that were high-rated was selected and considered in seizure detection. The presented results showed an improvement of the accuracy by 2% after the channel reduction. In the next research, Yuan et al. [22] proposed an approach that considers the contribution of each channel when classifying the data. They introduced two attention-based models. The first one considered local-based attention that measures the energy independently for each channel. The second model, named global-based attention, was an MLP model that consider the relation between all channels. Next, Yuan et al. [23] introduced a novel attention- based model called MuVAN. Yuan et al. [23] used three data sets with a similar data structure for three different applications: (i) epileptic seizure detection, (ii) sleep stage classification, and (iii) physical activity recognition. Knowing that intended patterns are related to an interval rather than to a point, the authors divided data into many multivariate temporal records that are fed into a 2-layer bidirectional gated recurrent unit (Bi-GRU). Two attention mechanisms were proposed: (i) a location-based and (ii) a context-based mechanism. The first mechanism does not capture any existing correlation between the views, while the second algorithm captures the hidden connections. With a conventional softmax operation, the elements were rated, so that important ones are top-ranked and irrelevant elements have close-to-zero scores. However, instead of ignoring the impertinent elements, Yuan et al. [23] took into account the contribution of some irrelevant views in the decision making. For all considered data sets, the MuVAN model outperformed the baseline approaches. The methods proposed in Yuan et al. [22, 23] consider each of the available channels when taking the final decision (i.e., without reducing the number of channels). Shah et al. [24] aimed to reduce spatial information (i.e., reducing the number of EEG channels) to minimize the EEG signal artifacts. The proposed channel selection was based on testing many configurations and evaluate their performance to detect the epileptic seizure. The authors used epilepsy domain knowledge to designate the subsets of channels whose sizes range between a minimum of 8 channels and a maximum of 20. To compare the performance of the different configurations, the authors used a DL algorithm that consists of a CNN and a BiLSTM. The 8-channel configuration presented a tolerable reduction in performance compared with a all channels configurations, with 33.44% of sensitivity, 85.51% of specificity, and 38.19 FA per day. In [25], an EEG channel selection method was proposed not only in the context of seizure detection, but also for other applications that use EEG records. The authors applied the Gumbel- softmax trick to develop their concrete selector layer. The layer was composed of K neurons. Based on the Gumbel-softmax measures, each neuron selected one single channel with the highest probability from all the input channels. To address the problem of duplicate channel selection, Strypsteen and Bertrand [25] created a dependency between layer’s neurons by employing a regularization function. Consequently, the selected channels were distinct and their number corresponded to the neurons number which was set by the researchers. To validate the channel selection layer, two EEG-related data sets were tested. The first one was associated to a classification task for motor execution. The authors tested different number of neurons (i.e., number of channels to be selected). Selecting 50 out of 128 channels achieved the best performance. Moreover, with only 8 channels, the model’s performance decreased by only 5% compared to the best performance. The second data set was related to a regression problem for auditory attention decoding. With only 10 per 64 channels, the model achieved an accuracy close to that with the all-channel baseline. The proposed selection method achieved the same or better performance as the baseline approaches. It also requires less computations and hence, less processing time. However, the number of the selected channels depends on the researchers’ decision (the size of the channels subset is equal to the number of neurons in the Gumbel-softmax layer). 3.2. Discussion The previously discussed EEG channel selection approaches related to epilepsy disorder are summarized in Table 2. Table 2 Related work on channel selection Patient- Prior pa- Detection Patient- specific Number Channel Related rameter (D) or specific number of Results selection work specifi- Prediction selection of channels method cation (P) channels Testing combina- [15] no no no 6/23 P same tions and elimination Ranking and [16] yes no no 3/23 D - selection Ranking and [17] yes yes no 5/23 D improved selection average Testing com- [18] yes yes yes D improved 5.5/23 binations average Ranking and [19] yes yes yes P same 10/23 selection Testing com- [20] yes no no 18/23 D improved binations Ranking and [21] yes no no 11/23 D improved selection Testing com- [24] no no no 8/22 D decreased binations Prior parameter specification: selection related parameters are given to the algorithm. Research line: D for epileptic seizure detection and P for prediction. Model performance improvement: the performance after channel selection compared to the performance using all channels. Using different methodologies, some researchers obtained the same results before and after channel reduction [15, 19, 26]. Others managed to improve their model performance after channel selection [17, 18, 20]. This indicates that some channels can be contaminated by noise or do not yield relevant information, which can significantly affect the performance of the model in detecting or predicting the seizures. Shah et al. [24] also obtained poorer values after reducing the number of EEG channels. This might be related to the elimination of pertinent channels. The majority of existing approaches use channel selection as part of seizure analysis or seizure detection processes [16, 17, 18, 20, 21, 22, 24, 26]. They help medical staff in analyzing EEG data after the medical consultation. In particular, these approaches allow us to better localize past seizures that have already happened in short periods of time. Another main goal presented in related work is to reduce power consumption of seizure detection systems [18, 26]. In almost all existing approaches, the authors selected a specific number of channels based on personal choice, on a doctor’s opinion, or based on previous research. They generally used different ways to rank the channels or test different combinations. This leads to select the same number of channels for all patients. However, epilepsy is a very distinctive condition that manifests itself differently and uniquely for each person. It can start from several locations and affect brain areas with different sizes. This makes the one-size-fits-all approaches inappropriate for epilepsy management. Additionally, a patient with epileptic disorder frequently undergoes the same type of seizures. Therefore, a channel selection method should be a patient-specific solution to get optimal parameters and best evaluations. On the other hand, testing multiple candidate channel combinations and all possible sizes of subsets with a DL approach for each patient is computationally very demanding and time-consuming for practical use. 4. Future prospects Based on findings discussed above, some considerations that could improve the outcome of seizure prediction systems should be explored. In fact, to develop an optimal solution that accurately predicts all types of epileptic seizure with a minimum of discomfort to the patient, we present some research lines. In regard with the data, the first essential step is to select many databases and carry out data preprocessing to ensure a high-quality data. Also, it is important to determine the optimal feature extraction method. To get an accurate classification of the data, we must choose and design the right ML algorithm for the intended task and find the optimal hyperparameters that results in the best performance of the algorithm. In addition, to obtain a product that can be easily used by the patients for long periods of time, while achieving a high prediction performance, we must enhance the solution by reducing the used EEG channels, thereby reducing the number of electrodes. We confirm that testing different channel combinations is time-consuming. Consequently, we must consider an automated approach that minimizes the amount of time and effort and results in few selected channels. The exclusion of noisy and irrelevant channels must be studied in terms of the prediction performance as we need to target better prediction results. Also, the number of selected channels must be set automatically and distinctively for each patient based on his or her physiological signals. 5. Conclusion In this paper, we first analyzed and discussed the most pertinent research works related to the prediction of epileptic seizures. Then, we presented the state of the art concerned with a specific type of data frequently used in the context of epilepsy, i.e., EEG. This study involved research work that deals with the selection of EEG channels. Based on our literature review, we defined several avenues to improve the outcome of the epileptic seizure predictors. In the future, we plan to implement a usable EEG-based system for epilepsy seizure prediction. The target system should solve the challenges identified in this work. Acknowledgments This work is supported by the German Academic Exchange Service (DAAD) in the Ta’ziz Science Cooperations Program (AirFit Project; 57682841). References [1] WHO, Epilepsy: A Public Health Imperative - Summary, Technical Report WHO/MS- D/MER/19.2, World Health Organization, Geneva, 2019. [2] N. D. Truong, A. D. Nguyen, L. Kuhlmann, M. r. Bonyadi, J. Yang, S. Ippolito, O. Kavehei, Convolutional Neural Networks for Seizure Prediction Using Intracranial and Scalp Elec- troencephalogram, Neural Networks 105 (2018) 104–111. doi:10.1016/j.neunet.2018. 04.018. [3] K. Fujiwara, M. Miyajima, T. Yamakawa, E. Abe, Y. Suzuki, Y. Sawada, M. Kano, T. Maehara, K. Ohta, T. Sasai-Sakuma, T. Sasano, M. Matsuura, E. Matsushima, Epileptic Seizure Prediction Based on Multivariate Statistical Process Control of Heart Rate Variability Features, IEEE Transactions on Biomedical Engineering 63 (2016) 1321–1332. doi:10. 1109/TBME.2015.2512276. [4] H. Khan, L. Marcuse, M. Fields, K. Swann, B. Yener, Focal Onset Seizure Prediction Using Convolutional Networks, IEEE Transactions on Biomedical Engineering 65 (2018) 2109–2118. doi:10.1109/TBME.2017.2785401. [5] M. Shahbazi, H. Aghajan, A Generalizable Model for Seizure Prediction based on Deep Learning Using CNN-LSTM Architecture, in: Proceedings of the IEEE Global Confer- ence on Signal and Information Processing (GlobalSIP), 2018, pp. 469–473. doi:10.1109/ GlobalSIP.2018.8646505. [6] Z. Shasha, D. Chen, R. Ranjan, k. Hengjin, Y. Tang, A. Zomaya, A lightweight solution to epileptic seizure prediction based on EEG synchronization measurement, The Journal of Supercomputing 77 (2021) 1–19. doi:10.1007/s11227-020-03426-4. [7] Q. Zhang, J. Ding, W. Kong, Y. Liu, Q. Wang, T. Jiang, Epilepsy prediction through optimized multidimensional sample entropy and Bi-LSTM, Biomedical Signal Processing and Control 64 (2021) 102293. doi:https://doi.org/10.1016/j.bspc.2020.102293. [8] I. Korshunova, Epileptic Seizure Prediction using Deep Learning, Master’s thesis, Univer- siteit Gent, Belgique, 2014-2015. [9] M. Larmuseau, Epileptic Seizure Prediction using Deep Learning, Master’s thesis, Univer- siteit Gent, Belgique, 2015-2016. [10] P. Thodoroff, J. Pineau, A. Lim, Learning Robust Features using Deep Learning for Automatic Seizure Detection, in: Proceedings of the 1st Machine Learning in Health Care (MLHC) 2016, 2016, pp. 178–190. [11] S. S. Talathi, Deep Recurrent Neural Networks for seizure detection and early seizure detection systems, arXiv e-prints (2017) arXiv:1706.03283. [12] M. Golmohammadi, S. Ziyabari, V. Shah, E. von Weltin, C. Campbell, I. Obeid, J. Picone, Gated recurrent networks for seizure detection, in: 2017 IEEE Signal Processing in Medicine and Biology Symposium (SPMB), 2017, pp. 1–5. doi:10.1109/SPMB.2017.8257020. [13] G. Choi, C. Park, J. Kim, K. Cho, T.-J. Kim, H. Bae, K. Min, K.-Y. Jung, J. Chong, A Novel Multi-scale 3D CNN with Deep Neural Network for Epileptic Seizure Detection, in: 2019 IEEE International Conference on Consumer Electronics (ICCE), 2019, pp. 1–2. doi:10.1109/ICCE.2019.8661969. [14] Y. Xu, Q. Kong, Q. Huang, W. Wang, M. Plumbley, Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging, in: 2017 International Joint Conference on Neural Networks (IJCNN 2017), 2017, pp. 3461–3466. doi:10.1109/IJCNN. 2017.7966291. [15] R. Jana, I. Mukherjee, Deep Learning based Efficient Epileptic Seizure Prediction with EEG Channel Optimization, Biomedical Signal Processing and Control 68 (2021) 102767. doi:https://doi.org/10.1016/j.bspc.2021.102767. [16] A. Shoka, H. Alkinani, A. El-Sherbeny, A. El-Sayed, M. Dessouky, Automated Seizure Diagnosis System Based on Feature Extraction and Channel Selection Using EEG Signals, Brain Informatics 8 (2021). doi:10.1186/s40708-021-00123-7. [17] M. R. Karimi, H. Kassiri, A Multi-Feature Nonlinear-SVM Seizure Detection Algorithm with Patient-Specific Channel Selection and Feature Customization, in: 2020 IEEE International Symposium on Circuits and Systems (ISCAS), 2020, pp. 1–5. doi:10.1109/ISCAS45731. 2020.9180729. [18] G. C. Jana, A. Tripathi, A. Agrawal, EEG Channel Selection Approach for Seizure Detection Based on Integrated BPSO and ELM, in: 2020 7th International Conference on Signal Processing and Integrated Networks (SPIN), 2020, pp. 856–861. doi:10.1109/SPIN48934. 2020.9071106. [19] H. Daoud, M. A. Bayoumi, Efficient Epileptic Seizure Prediction Based on Deep Learning, IEEE Transactions on Biomedical Circuits and Systems 13 (2019) 804–813. doi:10.1109/ TBCAS.2019.2929053. [20] S. Chakrabarti, A. Swetapadma, P. K. Pattnaik, A Channel Selection Method for Epileptic EEG Signals, in: Proceedings of the Emerging Technologies in Data Mining and Information Security (IEMIS), 2018, pp. 565–573. doi:10.1007/978-981-13-1951-8_51. [21] Y. Yuan, G. Xun, K. Jia, A. Zhang, A Multi-view Deep Learning Method for Epileptic Seizure Detection using Short-time Fourier Transform, in: Proceedings of the 8th ACM In- ternational Conference on Bioinformatics, Computational Biology and Health Informatics (ACM BCB), 2017, pp. 213–222. doi:10.1145/3107411.3107419. [22] Y. Yuan, G. Xun, F. Ma, Q. Suo, H. Xue, K. Jia, A. Zhang, A Novel Channel-aware Attention Framework for Multi-channel EEG Seizure detection via Multi-view Deep Learning, in: Proceedings of the IEEE EMBS International Conference on Biomedical Health Informatics (BHI), 2018, pp. 206–209. doi:10.1109/BHI.2018.8333405. [23] Y. Yuan, G. Xun, F. Ma, Y. Wang, N. Du, K. Jia, L. Su, A. Zhang, MuVAN: A Multi- view Attention Network for Multivariate Temporal Data, in: Proceedings of the IEEE International Conference on Data Mining (ICDM), 2018, pp. 717–726. doi:10.1109/ICDM. 2018.00087. [24] V. Shah, M. Golmohammadi, S. Ziyabari, E. Von Weltin, I. Obeid, J. Picone, Optimizing Channel Selection for Seizure Detection, in: Proceedings of the IEEE Signal Processing in Medicine and Biology Symposium (SPMB), 2017, pp. 1–5. doi:10.1109/SPMB.2017. 8257019. [25] T. Strypsteen, A. Bertrand, End-to-end Learnable EEG Channel Selection for Deep Neural Networks with Gumbel-Softmax, Journal of Neural Engineering 18 (2021) 0460a9. doi:10. 1088/1741-2552/ac115d. [26] S. Faul, W. Marnane, Dynamic, Location-based Channel Selection for Power Consumption Reduction in EEG Analysis, Computer Methods and Programs in Biomedicine 108 (2012) 1206–1215. doi:10.1016/j.cmpb.2012.06.005.