Overlapping filter-bank convolutional neural network for multisubject multicategory motor imagery BCI

doi:10.21203/rs.3.rs-2137240/v2

Download PDF

Method Article

Overlapping filter-bank convolutional neural network for multisubject multicategory motor imagery BCI

https://doi.org/10.21203/rs.3.rs-2137240/v2

This work is licensed under a CC BY 4.0 License

Journal Publication

published 10 Jul, 2023

Read the published version in BioData Mining →

You are reading this latest preprint version

Background

Recently, CNN-based models have been widely used in motor imagery brain-computer interfaces (BCIs) due to their powerful feature representation ability. However, in multisubject motor imagery BCI, the discriminative frequency bands vary from subject to subject. Thus, using CNNs to extract discriminative features from EEG signals of different frequency components is a promising method in multisubject EEG recognition.

Methods

This paper presents a novel overlapping filter-bank CNN to incorporate discriminative information from multiple frequency components in multisubject motor imagery recognition. Specifically, two overlapping filter banks with fixed low-cut frequency or sliding low-cut frequency are employed to obtain multiple frequency component representations of EEG signals. Then, multiple CNN models are trained separately. Finally, the output probabilities of multiple CNN models are integrated to determine the predicted EEG label.

Results

Experiments were conducted based on three popular CNN backbone models and two public datasets. We compared the performance of overlapping filter-bank CNN with the state-of-the-art methods and traditional nonoverlapping-based CNN, and the results showed that the overlapping filter-bank CNN was efficient and universal in improving multisubject motor imagery BCI performance.

Conclusion

The proposed overlapping filter bank CNN framework with fixed low-cut frequency is an efficient and universal method to improve the performance of multisubject motor imagery BCI.

brain-computer interface (BCI)

multisubject BCI

motor imagery (MI)

convolutional neural network (CNN)

overlapping filter bank

Brain computer interface (BCI) is a communication interface directly established between the brain and the computer, which can realize the interconnection between brain and external objects, and achieve the interactive integration of biological intelligence and machine intelligence. It has important applications in the fields of medicine, neurobiology and psychology [1]. Many mature paradigms have emerged in the BCI field, including motor imagery BCI, steady-state visual evoked potential (SSVEP) BCI, P300 visual-evoked potentials BCI and emotional BCI [1; 2].

Motor imagery electroencephalography (MI-EEG) is a kind of endogenous spontaneous EEG with low environmental requirements. Thus, MI-EEG is widely used in BCI. The MI-BCI system collects EEG signals when the subject performs specific motor imagery, then recognizes the MI content according to the EEG signals, and converts the recognition results into control commands of the peripheral devices[3]. EEG signals have the characteristics of a low signal-to-noise ratio and low spatial resolution, so effective features and classifiers are the keys to the success of the MI recognition system, and BCI researchers have increasingly proposed many algorithms for MI classification.

1.1 Common spatial pattern-related method

The common spatial pattern (CSP) algorithm and its variants have been widely applied to construct spatial filters and extract highly discriminative features in EEG-based MI classification by maximizing the variance difference between two classes of EEG signals [4]. The outstanding performance of the filter-bank common spatial pattern (FBCSP) won the BCI Competition IV in the 2a dataset and 2b dataset [5]. The FBCSP used a filter bank consisting of 9 nonoverlapping subband bandpass filters covering the frequency range of 4 to 40 Hz to preprocess the signal. Then, the CSP features were extracted and selected for specific subjects by the mutual information-based rough set reduction algorithm and fed to the naïve Bayesian Parzen window classifier. The filter-bank regularized common spatial pattern was proposed to simultaneously solve the dependency problems on frequency bands and sample-based covariance estimation, and the proposed method improved the mean classification accuracy compared with other CSP-based methods [6]. Zhang proposed a hybrid network consisting of a CNN and a long-term short-term memory network for extracting temporal and spatial features from CSP features [7].

1.2 Deep learning-based method

Recently, deep learning algorithms have developed quickly, and related algorithms have been proposed for EEG-based BCI. In particular, CNNs have been widely used in EEG-based MI classification due to their ability to effectively extract temporal and spatial features from EEG signals. Schirrmeister proposed a Shallow ConvNet and a Deep ConvNet for end-to-end MI-based EEG recognition and showed better performance compared with the FBCSP algorithm. In addition, the CNN visualization results showed that the proposed model learned to use spectral power characteristics from different frequency bands [8]. EEGNet was proposed by Lawhern to suggest that a compact CNN can be applied and provide robust performance across many BCI paradigms, such as P300 event-related potential, feedback error-related negativity, movement-related cortical potential and sensory motor rhythm (MI recognition) [9]. Chen designed a deep learning approach termed filter-bank spatial filtering and temporal-spatial CNN for MI decoding. Filter-bank spatial filtering extracts the feature presentation of raw EEG signals, and the temporal-spatial CNN implements a decoding procedure. A stagewise training strategy including optimizing the triplet loss and cross-entropy loss was proposed to mitigate the optimization difficulty [10]. Li proposed an end-to-end EEG decoding framework that regards the original multichannel EEG as the input and improves the classification accuracy through a channel projection mixed-scale convolutional neural network (CP-MixedNet) and amplitude perturbation data augmentation [11]. Sakhavi proposed a novel filter-bank convolutional network (FBCNet) for MI classification, which extracted a multiview data representation through a filter bank, extracted spatial features through depthwise convolutional layers, and effectively aggregated temporal information by the proposed variance layer [12]. Zhao built a multibranch 3D convolutional neural network, where the 3D representation was generated by transforming EEG signals into a series of 2D arrays focusing on the spatial distribution of channels [13]. Li proposed a novel temporal-spectral-based squeeze-and-excitation feature fusion network, which extracted high-dimensional temporal features and discriminative spectral representations from raw EEG signals via deep-temporal convolution block and multilevel wavelet convolutions, respectively. Channelwise discriminative responses were highlighted by constructing interdependencies among different domain features [14]. Five adaptive schemes of the EEG-BCI system based on CNN were proposed for decoding MI-EEG. The adaptive transfer learning method fine-tuned an extensively trained, pretrained model and adjusted it to adapt the target subject [15].

1.3 Multisubject calibration-free MI-BCI

At present, research on MI-BCIs mainly focuses on subject-dependent systems, in which a model is built for a single target subject and has achieved satisfactory results [6; 8]. However, a subject-dependent system needs to collect data for calibrating the target subject, which is time-consuming and only applicable to the target subject. Therefore, research on multisubject calibration-free BCI systems has appeared. Kwon constructed a large MI-based EEG database consisting of 54 subjects performing left- and right-hand MI and proposed a subject-independent multibranch CNN framework for subject-independent BCI. The spectral–spatial inputs are individually trained through the CNN and then combined by a concatenation fusion technique to make predictions [16]. Zhang proposed a convolutional recurrent attention model for subject-independent EEG signal analysis. Specifically, they split an EEG trial into multiple temporal slices, utilized the spatial-temporal block to extract the spatial-temporal information of every temporal slice, and finally, leveraged a recurrent attention mechanism to explore the temporal dynamics among different temporal slices. The improved performance in the experiment indicated that the proposed convolutional recurrent attention model can utilize potential invariant EEG patterns among different subjects [17]. To improve the classification accuracy of multisubject motor imagery, Autthasan designed a novel end-to-end multitask learning architecture called MIN2Net, which applied a deep metric learning method in a multitask autoencoder model to learn discriminative potential representations and make predictions simultaneously [18]. Luo proposed a twin cascaded softmax convolutional neural network (TCSCNN) for multisubject MI-BCIs. The cascaded softmax structure was applied to achieve subject recognition and MI recognition simultaneously, and the twin EEG and twin structure were employed to further improve the performance [19].

In multisubject MI-BCI systems, the individual differences in EEG signals cause great difficulties for research [20]. In particular, the effective frequency band related to event-related desynchronization (ERD) and event-related synchronization (ERS) varies from subject to subject. Utilizing the discriminative information of different frequency bands is key to improving the capability of multisubject MI-BCI, but existing CNN models perform poorly in this aspect. This paper presents a novel overlapping filter-bank CNN framework for multisubject motor imagery EEG recognition. Through the proposed overlapping filter bank, the filtered EEG forces the CNN to learn from different frequency bands. To combine the discriminative ability from different frequency bands, ensemble probability from multiple CNNs is employed to make predictions.

The main contributions of this paper are as follows.

1) A filter-bank CNN framework is proposed to enable different CNN models to learn discriminative information from multiple EEG frequency bands for multisubject MI-BCI.

2) A novel overlapping filter bank with a fixed low-cut frequency outperforms other filter banks in the multisubject MI-BCI experiments.

3) Comprehensive experimental evaluations of three popular CNN backbone models using two benchmark datasets for the overlapping filter-bank CNN framework demonstrate the effectiveness and universality of the proposed framework.

The rest of this paper is organized as follows. Section 2 presents the proposed overlapping filter-bank CNN framework. Section 3 introduces the dataset and experimental settings in detail. Then, the experimental results and discussion are presented in Section 4. Section 5 gives the conclusions.

2.1 Analysis of FBCSP

The FBCSP algorithm is a successful example of using discriminative information from multiple frequency bands and won the champion of BCI Competition IV dataset 2a and dataset 2b[5]. To date, the FBCSP has been applied and modified by numerous researchers[4; 6; 21].

In the FBCSP algorithm, a nonoverlapping bandpass filter bank is employed first to decompose the EEG into multiple frequency bands. Thus, the discriminative information of different frequency bands can be collected in the following feature extraction. Second, the CSP algorithm is applied to design a spatial filter, and the log variance feature is extracted from the EEG signals filtered by the spatial filter.

$${v}_{p}=log\frac{var\left({W}_{p}E\right)}{{\sum }_{i=1}^{I}var\left({W}_{i}E\right)}$$

where E is the raw EEG signals and W_i is the i th spatial filter [22]. As CSP is a supervised feature learning method using a spatial filter to maximize the variance differences between the two classes of EEG signals, it is more like a classification method rather than a feature extraction method. The CSP features can directly be used in classification by a threshold. However, the discriminative ability of a single classifier built on a specific EEG frequency band is usually weak in classification. Thus, the following feature selection and classifier training procedures are applied.

Analysis of the FBCSP method shows that it is indeed an ensemble learning method. Multiple weak classifiers (CSP features extracted in different frequency bands) are combined (training of classifier) to build a strong classifier. This is a valuable experience that should be used in CNN-based MI recognition methods.

However, this framework of training and combining weak classifiers from different frequency bands is difficult to transfer directly. Compared with the simple linear spatial filter provided by the CSP, the complexity of the CNN model is much higher. If the input bandpass filtered EEG does not have enough discriminative information, the CNN begins to search for less discriminative information and even indiscriminative information, which finally results in overfitting. In conclusion, the CNN model performs better based on wide frequency band EEG, which contains more discriminative information. The discriminative information in the EEG signals filtered by a narrow bandpass filter (4 Hz in FBCSP) is insufficient, so a narrow bandpass filter easily causes overfitting in the CNN-based method.

2.2 Overlapping filter bank

To supply enough discriminative information for the CNN model, the filter bank should have a wider passband. Compared with the nonoverlapping filter bank applied in FBCSP, filter banks with overlapping frequency bands are applied before the CNN model in this paper. Two kinds of overlapping filter banks are proposed. The first type is a fixed-low-cut-frequency filter bank, the second type is a sliding-low-cut-frequency filter bank, and the specific setting is described in Table 1.

Table 1
The bandpass filters included different overlapping filter banks.
Filter bank	First filter	Second filter	Third filter		Last filter
Fixed-0 Hz	0–8 Hz	0–12 Hz	0–16 Hz	…	0–36 Hz
Fixed-4 Hz	4–12 Hz	4–16 Hz	4–20 Hz	…	4–36 Hz
Sliding-0 Hz	0–12 Hz	4–16 Hz	8–20 Hz	…	24–36 Hz
Sliding-4 Hz	4–16 Hz	8–20 Hz	12–24 Hz	…	24–36 Hz

2.3 Overlapping filter-bank CNN

Due to the powerful feature learning capabilities of the CNN model, it is applied to extract discriminative information from multiple EEG frequency bands. The flowchart of the proposed overlapping filter-bank CNN (OFBCNN) is shown in Fig. 1. Multiple CNN models are trained by the EEG signal filtered by the overlapping filter bank, and the output probabilities are combined to make the final prediction. First, the raw EEG is filtered by a bandpass filter:

$\widehat{{E}_{i}}={F}_{i}\left(E\right)$

(2)

where E represents the raw EEG, F_i represents the i th bandpass filter in the filter bank and $\widehat{{E}_{i}}$ represents the filtered EEG. Second, the filtered EEG signals are input into each CNN model to complete the model training process separately. The prediction probabilities are the output of the CNN model:

${O}_{i}={M}_{i}\left(\widehat{{E}_{i}}\right)$

(3)

where M_i is the CNN model for $\widehat{{E}_{i}}$ and O_i is the output prediction probability of each CNN. Finally, the output probabilities of all CNN models are summed to determine the final prediction:

$L=\text{a}\text{r}\text{g}\text{m}\text{a}\text{x}\left(\sum _{i=1}^{N}{O}_{i}\right)$

(4)

where N is the total number of bandpass filters in the overlapping filter bank and L is the predicted label.

2.4 CNN backbone

Three popular CNN backbone models, including shallow ConvNet, deep ConvNet[8] and EEGNet[9], are applied in the proposed overlapping filter-bank CNN framework (OFBCNN) in this paper.

Deep ConvNet is a deep CNN proposed by Schirrmeister that includes four “Conv Pool Blocks” and a classification layer. A standard “Conv Pool Block” includes a convolutional layer and a max pooling layer. However, the first “Conv Pool Block” includes a temporal convolution to extract low-level temporal feature representations, a spatial filter (convolution along all EEG channels) and a max pooling layer. For specific network details, see[8].

Shallow ConvNet is a shallower network with only one “Conv Pool Block”, including a temporal convolution, spatial filtering layer and an average pooling layer. This architecture is inspired by the FBCSP and uses the first two convolutional layers to replace the bandpass and CSP spatial filters in the FBCSP. For specific network details, see [8].

EEGNet is a robust end-to-end network showing outstanding performance in multiple BCI paradigms, i.e., P300 visual evoked potential, error-related negative response (ERN), motor-related cortical potential (MRCP) and sensorimotor rhythm (SMR). EEGNet consists of three convolutional pooling blocks and a softmax layer. Each convolutional layer is followed by the batch norm, ELU activation function, max pooling and dropout. For specific details of the network, see [9].

3.1 BCI Competition IV dataset 2a

The BCI Competition IV dataset 2a includes a four-category MI-based EEG signal recognition task. This dataset includes MI-based EEG signals from 9 subjects. For each subject, there were 72 trials in each class in the training set and test set, recorded at different times. According to four types of prompts, including MI of the left hand, right hand, tongue, and foot, the subjects performed MI, and EEG signals were monitored from 22 channels. The sampling rate of the EEG signals was 250 Hz, and the resolution of the amplifier was 100 mV. In each trial, the prompt appeared on the screen in seconds, and the execution time of the MI task was between the third second and sixth second. A 0.5–100 Hz bandpass filter and a 50 Hz notch filter were used to preprocess the collected EEG signals. After recording, an expert marked the artefact trial. For more details, see[23]. The experiments in this paper used all the training data of 9 subjects as the training set and all the test data of 9 subjects as the test set to construct a multisubject MI recognition task.

3.2 High-Gamma dataset

The High-Gamma dataset is a large-scale EEG dataset composed of four types of EEG signals from 14 subjects performing motor execution tasks (left hand, right hand, feet, and rest) [8]. For each subject, there were 13 runs and approximately 1,000 four-second trials. The first 11 runs included approximately 880 trials belonging to the training set, and the last 2 runs consisted of approximately 160 trials belonging to the test set. EEG signals were collected from 128 channels (44 channels were used in the following experiments according to Braindecode [8]) and resampled at 250 Hz. More details can be found in [8]. All training EEG signals from 14 subjects were used as the training set, and all the testing EEG signals from 14 subjects were used as the test set for multisubject EEG recognition.

3.3 Experimental setup

The EEG signals were only minimally preprocessed to encourage the CNN model to learn features by itself. The 4.5 second EEG segment was used, from 0.5 seconds before the MI prompt appeared to 4 seconds after it appeared. The EEG signals were filtered by the overlapping filter bank, and then the filtered EEG signals were normalized using channel exponential running standardization, as configured in [8]. After that, the EEG signals were input into each CNN model. The Adam optimization method was used in the training phase [24]. The experiment in this article was implemented in Python using the PyTorch[25] and Braindecode[8] frameworks.

To reduce the influence of randomness and epoch number, the average and maximum test accuracy were used to evaluate the accuracy of the model. In addition, the variance in the test accuracy was used to assess the convergence of the model.

To verify the effectiveness, universality and feasibility of the proposed overlapping filter-bank CNN algorithm, a series of experiments were conducted. First, the OFBCNN was compared with the original CNN and the CNN with the nonoverlapping filter bank applied in the FBCSP. Second, the performance of OFBCNN was compared with the state-of-the-art algorithm. Finally, a parameter sensitivity test was performed.

4.1 Performance comparison of the proposed OFBCNN with the original CNN and the nonoverlapping filter-bank CNN in multisubject MI recognition

4.1.1 Classification accuracy comparison

The accuracy of OFBCNN was compared with that of the original CNN model and the nonoverlapping filter-bank CNN in the multisubject MI recognition task. Since that training usually converges within 400 epochs, the maximum number of epochs for training was set to 500 to ensure the convergence of the model. The accuracy of the test in a particular epoch was greatly affected by randomness. To evaluate the overall performance of the model, the measurements applied in the experimental evaluation included (1) the maximum test accuracy in 500 epochs and (2) the average test accuracy between 401 and 500 epochs.

Table 2

Performance comparison of the proposed OFBCNN with the nonoverlapping filter-bank CNN and original CNN in multisubject MI recognition. /%
CNN-Dataset-Low-Cut Frequency	Max Accuracy				Average Accuracy
CNN-Dataset-Low-Cut Frequency	Original	NOFBCNN	Sliding OFBCNN	Fixed OFBCNN	Original	NOFBCNN	Sliding OFBCNN	Fixed OFBCNN
EEGNet-2a-0	70.83	65.46	68.06	72.85	68.57	60.76	65.15	71.76
Shallow-2a-0	75.05	70.63	73.68	77.85	72.80	68.31	71.51	76.57
Deep-2a-0	73.95	67.82	70.70	76.68	71.42	65.14	68.27	75.53
EEGNet-2a-4	63.40	58.18	61.96	66.19	61.68	54.49	59.23	63.40
Shallow-2a-4	68.50	64.55	69.56	71.98	65.44	62.58	67.65	69.97
Deep-2a-4	59.24	59.76	64.01	63.08	56.76	58.26	61.89	61.38
EEGNet-HG-0	85.46	74.00	80.45	86.99	83.16	70.08	76.20	85.60
Shallow-HG-0	89.96	89.29	90.54	90.92	88.69	87.50	89.19	89.91
Deep-HG-0	92.14	87.06	88.48	93.10	89.68	80.88	83.94	91.90
EEGNet-HG-4	81.49	68.35	76.16	83.12	79.11	64.85	71.29	80.18
Shallow-HG-4	88.81	86.34	88.30	88.95	87.15	84.38	86.76	87.51
Deep-HG-4	84.99	83.26	85.54	85.65	80.37	75.85	80.11	82.16
Average	77.82	72.89	76.45	79.78	75.40	69.42	73.43	77.99

The results are given in Table 2. The first column represents the original CNN model name, dataset, and low-cut frequency of the filter bank encoded in the form of a "model-dataset-filter". For example, "EEGNet-2a-0" indicates that OFBCNN was built based on the EEGNet model, evaluated on the BCI Competition IV dataset 2a, and the low-cut frequency of the filter bank is 0 Hz. NOFBCNN denotes the nonoverlapping filter-bank CNN with bandpass filters including 0–4 Hz (if the low-cut frequency is 0), 4–8 Hz, 8–12 Hz, 12–16 Hz … 32–36 Hz (as the configuration in FBCSP). The highest accuracy of each experimental setting is shown in bold.

Some conclusions can be drawn from the above experimental results. (1) The fixed OFBCNN achieved the highest average accuracies of 79.78 (maximum accuracy) and 77.99 (average test accuracy between 401 and 500 epochs). (2) The performance improvements provided by the fixed OFBCNN are universal for the CNN backbone, EEG data and low-cut frequency. (3) Shallow ConvNet with fixed OFBCNN achieved the highest accuracy in multisubject MI recognition tasks. (4) The overall performance of the nonoverlapping filter-bank CNN was the worst because of the narrow bandpass filters applied before CNN.

Moreover, the accuracy boxplots of the fixed OFBCNN, sliding OFBCNN, NOFBCNN and original CNN for BCI Competition IV dataset 2a are compared in Fig. 2. The test accuracy of 400–500 epochs was used to draw the boxplot. The upper and lower edges of the box represent the 25th and 75th percentiles of the accuracy, and the horizontal lines in the box represent the median of the data. The lines running above and below the box represent the maximum and minimum values. Outliers are marked with dots. As seen in the boxplot, the fixed OFBCNN achieved the highest or close to the highest accuracy in all configurations. In addition, the box body of the fixed OFBCNN was short, which indicates small fluctuations in accuracy. NOFBCNN had the lowest accuracy and the largest accuracy fluctuation due to insufficient discriminative information in narrow-band EEG signals.

To confirm the significance of the accuracy improvement, a paired-sample one-sided Student’s t test was conducted (the null hypothesis was that the accuracy of the OFBCNN model was equal to the accuracy of the original/NOFBCNN model, against the alternative that the accuracy of the OFBCNN model was greater than the accuracy of the original/NOFBCNN model). The p values are shown in Fig. 3. These results showed that the accuracy improvement of the fixed OFBCNN compared with the original model and NOFBCNN was significant.

4.1.2 Convergence comparison

To verify the convergence improvement provided by the OFBCNN, the variance in test accuracy in epochs 400–500 was compared with that of the original CNN and NOFBCNN by the variance in the test accuracy. The results are shown in Table 3. A smaller variance in the test accuracy means better convergence and stability.

We can draw conclusions from Table 3 that the fixed OFBCNN was the most stable model, and the average improvement factor (the original CNN accuracy variance divided by the proposed fixed OFBCNN accuracy variance) was 2.06.

Table 3

The convergence comparison of the proposed OFBCNN with the nonoverlapping filter-bank CNN and original CNN in multisubject MI recognition.
CNN-Dataset-Low-Cut Frequency	Original	NOFBCNN	Sliding OFBCNN	Fixed OFBCNN
EEGNet-2a-0 Hz	9.24	36.67	13.33	3.45
Shallow-2a-0 Hz	15.32	10.00	8.77	3.42
Deep-2a-0 Hz	12.11	16.67	10.00	3.53
EEGNet-2a-4 Hz	23.55	23.33	16.67	13.33
Shallow-2a-4 Hz	14.82	9.33	7.74	5.81
Deep-2a-4 Hz	10.85	13.15	8.67	5.00
EEGNet-HG-0 Hz	7.94	40.00	25.00	4.28
Shallow-HG-0 Hz	3.90	7.71	4.35	2.25
Deep-HG-0 Hz	13.33	85.00	35.00	3.88
EEGNet-HG-4 Hz	8.30	25.00	25.00	20.00
Shallow-HG-4 Hz	7.05	9.13	5.79	4.98
Deep-HG-4 Hz	60.00	105.00	45.00	20.00
Average	15.53	31.75	17.11	7.49

4.2 Compared with the state-of-the-art algorithm

To fully show the superiority of the proposed method, the fixed OFBCNN was compared with the state-of-the-art algorithm on the BCI Competition IV dataset 2a. In addition to the Shallow/Deep ConvNet from BrainDecode and EEGNet, the following models were included in comparison:

CNN++ [26]: CNN + + is an improved version of the CNN model consisting of 5 CNN and max pooling layers with an input fully connected (FC) layer. Inspired by CSP, a channel projection by a fully connected layer is conducted before the convolution layer.
PSTNet [27]: PSTNet is a CNN architecture based on a self-attention mechanism that extracts distinguishable spatial-temporal features through the attention mechanism in the time domain and space domain.
TS_SEFFNet [14]: TS_SEFFNet designed the DT-Conv block and MS-Conv block for feature extraction and finally used the SE-Feature-Fusion block for feature fusion.

The maximum number of epochs for training was also set to 500 for performance comparison, and other configurations are the same as those in the original literature. The average test accuracy between 401 and 500 epochs was used for comparison, and the results are shown in Fig. 4. The proposed method had a higher accuracy than the other state-of-the-art algorithms in the comparison.

4.3 Parameter selection

The key parameter in the proposed OFBCNN framework is the frequency step of the filter bank. In previous results, the frequency step of bandpass filters was set as 4 Hz. The influence of different frequency steps was tested in this section. As the excellent capability of the fixed OFBCNN based on Shallow ConvNet was shown previously, it was employed in the test on the BCI Competition IV dataset 2a, and the results are shown in Table 4. The first bandpass filter was set as shown in Table 1, and the following bandpass filters were set according to the frequency step. For example, if the low-cut frequency was 0 Hz and the frequency step was 1 Hz, the bandpass filter frequency of the overlapping filter bank was 0–8 Hz, 0–9 Hz, 0–10 Hz, …, 0–36 Hz, and thus, this overlapping filter bank had 29 bandpass filters. To visually show the influence of the frequency step on the recognition accuracy, the results in Table 4 are shown in Fig. 5.

If the frequency step is smaller, more bandpass filters will exist in the OFBCNN, and more CNN models need to be trained. In contrast, if the frequency step is larger, fewer bandpass filters will exist. However, the OFBCNN cannot take advantage of the discriminative information from all frequency bands, and the probability ensemble of a few models will diminish the discriminatory ability. Therefore, the selection of the frequency step in the filter bank was a compromise between performance and computational complexity. Based on the results in Table 4, 4 Hz was selected as an empirical value.

Table 4

The average testing accuracy comparison of different frequency steps. /%
Step	1 Hz	2 Hz	3 Hz	4 Hz	5 Hz	6 Hz	7 Hz	8 Hz	9 Hz	10 Hz	11 Hz	12 Hz	13 Hz	14 Hz
0 Hz	76.65	76.62	76.52	76.57	76.36	76.48	76.26	76.49	76.28	76.09	76.07	76.23	76.23	76.05
4 Hz	70.29	70.22	70.14	69.97	69.79	69.97	69.63	69.61	69.55	69.28	69.68	69.51	68.47	68.47

In this paper, an overlapping filter-bank CNN framework was proposed to enable the CNN model to learn discriminative information from multiple frequency bands of EEG for multisubject MI-BCI. Specifically, the novel overlapping filter bank with a fixed low cut frequency and overlapping filter bank with a sliding low cut frequency were applied for preprocessing EEG. Comprehensive experimental evaluations using two benchmark datasets were conducted to test the effectiveness and universality of the proposed overlapping filter-bank CNN framework. The experimental results showed that the fixed OFBCNN achieved the highest classification accuracy.

Acknowledgements

Not applicable.

Authors’ contributions

Method design and manuscript preparation (Jing Luo), algorithm implementation and data analysis (Qi Mao, Zhenghao Shi), critical review and revision of the manuscript (Xiaoyong Ren, Xinhong Hei). All authors read and approved the final manuscript.

Funding

This work is jointly supported by the National Natural Science Foundation of China (Grant Nos.61906152 and 62076198), Key Research and Development Program of Shaanxi (Program Nos. 2021GY-080 and 2020GXLH-Y005), and the Beilin District Science and Technology Plan Project (Grant No. GX2245).

Availability of data and materials

The datasets used and/or analysed during the current study are from public datasets.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

M.M. Shanechi, Brain–machine interfaces from motor to mood. Nature Neuroscience 22 (2019) 1554–1564.
C. Liu, J. Jin, I. Daly, H. Sun, Y. Huang, X. Wang, and A. Cichocki, Bispectrum-based hybrid neural network for motor imagery classification. Journal of Neuroscience Methods 375 (2022) 109593.
W.C. Jin, S. Huh, and S. Jo, Improving performance in motor imagery BCI-based control applications via virtually embodied feedback. Computers in Biology and Medicine 127 (2020).
F. Lotte, L. Bougrain, A. Cichocki, M. Clerc, M. Congedo, A. Rakotomamonjy, and F. Yger, A review of classification algorithms for EEG-based brain–computer interfaces: a 10 year update. Journal of Neural Engineering 15 (2018) 031005.
K.K. Ang, Z.Y. Chin, C. Wang, C. Guan, and H. Zhang, Filter bank common spatial pattern algorithm on BCI competition IV datasets 2a and 2b. Frontiers in Neuroscience 6 (2012) 39.
S.-H. Park, D. Lee, and S.-G. Lee, Filter bank regularized common spatial pattern ensemble for small sample motor imagery classification. IEEE Transactions on Neural Systems and Rehabilitation Engineering 26 (2018) 498–505.
R. Zhang, Q. Zong, L. Dou, and X. Zhao, A novel hybrid deep learning scheme for four-class motor imagery classification. Journal of Neural Engineering 16 (2019) 066004.
R.T. Schirrmeister, J.T. Springenberg, L.D.J. Fiederer, M. Glasstetter, K. Eggensperger, M. Tangermann, F. Hutter, W. Burgard, and T. Ball, Deep learning with convolutional neural networks for EEG decoding and visualization. Human Brain Mapping 38 (2017) 5391–5420.
V.J. Lawhern, A.J. Solon, N.R. Waytowich, S.M. Gordon, C.P. Hung, and B.J. Lance, EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces. Journal of Neural Engineering 15 (2018) 056013.
J. Chen, Z. Yu, Z. Gu, and Y. Li, Deep temporal-spatial feature learning for motor imagery-based brain–computer interfaces. IEEE Transactions on Neural Systems and Rehabilitation Engineering 28 (2020) 2356–2366.
Y. Li, X.-R. Zhang, B. Zhang, M.-Y. Lei, W.-G. Cui, and Y.-Z. Guo, A channel-projection mixed-scale convolutional neural network for motor imagery EEG decoding. IEEE Transactions on Neural Systems and Rehabilitation Engineering 27 (2019) 1170–1180.
R. Mane, E. Chew, K. Chua, K.K. Ang, N. Robinson, A.P. Vinod, S.-W. Lee, and C. Guan, FBCNet: A Multi-view Convolutional Neural Network for Brain-Computer Interface. arXiv preprint arXiv:2104.01233 (2021).
X. Zhao, H. Zhang, G. Zhu, F. You, S. Kuang, and L. Sun, A multi-branch 3D convolutional neural network for EEG-based motor imagery classification. IEEE Transactions on Neural Systems and Rehabilitation Engineering 27 (2019) 2164–2177.
Y. Li, L. Guo, Y. Liu, J. Liu, and F. Meng, A Temporal-Spectral-Based Squeeze-and- Excitation Feature Fusion Network for Motor Imagery EEG Decoding. IEEE Transactions on Neural Systems and Rehabilitation Engineering 29 (2021) 1534–1545.
K. Zhang, N. Robinson, S.-W. Lee, and C. Guan, Adaptive transfer learning for EEG motor imagery classification with deep Convolutional Neural Network. Neural Networks 136 (2021) 1–10.
O.Y. Kwon, M.H. Lee, C. Guan, and S.W. Lee, Subject-Independent Brain–Computer Interfaces Based on Deep Convolutional Neural Networks. IEEE Transactions on Neural Networks and Learning Systems 31 (2020) 3839–3852.
D. Zhang, L. Yao, K. Chen, and J. Monaghan, A convolutional recurrent attention model for subject-independent EEG signal analysis. IEEE Signal Processing Letters 26 (2019) 715–719.
P. Autthasan, R. Chaisaen, T. Sudhawiyangkul, S. Kiatthaveephong, P. Rangpong, N. Dilokthanakul, G. Bhakdisongkhram, H. Phan, C. Guan, and T. Wilaiprasitporn, Min2net: End-to-end multi-task learning for subject-independent motor imagery eeg classification. IEEE Transactions on Biomedical Engineering (2021) 1–1.
J. Luo, W. Shi, N. Lu, J. Wang, H. Chen, Y. Wang, X. Lu, X. Wang, and X. Hei, Improving the performance of multisubject motor imagery-based BCIs using twin cascaded softmax CNNs. Journal of Neural Engineering 18 (2021) 036024.
V. Jayaram, M. Alamgir, Y. Altun, and B. Scholkopf, Transfer Learning in Brain-Computer Interfaces. IEEE Computational Intelligence Magazine 11 (2015) 20–31.
J. Luo, J. Wang, R. Xu, and K. Xu, Class discrepancy-guided sub-band filter-based common spatial pattern for motor imagery classification. Journal of Neuroscience Methods 323 (2019) 98–107.
I. Xygonakis, A. Athanasiou, N. Pandria, D. Kugiumtzis, and P.D. Bamidis, Decoding motor imagery through common spatial pattern filters at the EEG source space. Computational Intelligence and Neuroscience 2018 (2018).
M. Tangermann, K.-R. Müller, A. Aertsen, N. Birbaumer, C. Braun, C. Brunner, R. Leeb, C. Mehring, K.J. Miller, and G.R. Müller-Putz, Review of the BCI competition IV. Frontiers in Neuroscience 6 (2012).
D.P. Kingma, and J. Ba, Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, and L. Antiga, Pytorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems 32 (2019) 8026–8037.
Y. Zhao, S. Yao, S. Hu, S. Chang, R. Ganti, M. Srivatsa, S. Li, and T. Abdelzaher, On the improvement of classifying EEG recordings using neural networks, 2017 IEEE International Conference on Big Data (Big Data), 2017, pp. 1709–1711.
L. Xiuling, Y. Shen, J. Liu, J. Yang, P. Xiong, and F. Lin, Parallel Spatial–Temporal Self-Attention CNN-Based Motor Imagery Classification for BCI. Frontiers in Neuroscience 14 (2020).

No competing interests reported.

Download PDF

Journal Publication

published 10 Jul, 2023

Read the published version in BioData Mining →

Editorial decision: Major revision
20 Feb, 2023
Reviews received at journal
03 Feb, 2023
Reviewers agreed at journal
14 Jan, 2023
Reviews received at journal
23 Dec, 2022
Reviewers agreed at journal
19 Dec, 2022
Reviewers invited by journal
18 Dec, 2022
Editor assigned by journal
08 Dec, 2022
Submission checks completed at journal
08 Dec, 2022
First submitted to journal
29 Nov, 2022

You are reading this latest preprint version

Overlapping filter-bank convolutional neural network for multisubject multicategory motor imagery BCI

Status:

Journal Publication

Version 2

Abstract

Background

Methods

Results

Conclusion

Figures

1 Background

1.1 Common spatial pattern-related method

1.2 Deep learning-based method

1.3 Multisubject calibration-free MI-BCI

2 Methods

2.1 Analysis of FBCSP

2.2 Overlapping filter bank

2.3 Overlapping filter-bank CNN

2.4 CNN backbone

3 Experimental Data

3.1 BCI Competition IV dataset 2a

3.2 High-Gamma dataset

3.3 Experimental setup

4 Results And Discussion

4.1 Performance comparison of the proposed OFBCNN with the original CNN and the nonoverlapping filter-bank CNN in multisubject MI recognition

4.1.1 Classification accuracy comparison

4.1.2 Convergence comparison

4.2 Compared with the state-of-the-art algorithm

4.3 Parameter selection

5 Conclusion

Declarations

References

Additional Declarations

Status:

Journal Publication

Version 2