This study protocol was approved by the Institutional Ethics Committee of the Second Affiliated Hospital of the Army Medical University on March 25, 2020 (Chongqing, China, approval number: 2020-078-01). Patients were recruited from March 27, 2020 to April 29, 2020. Written informed consent was obtained from each patient. Twenty-three American Society of Anaesthesiology (ASA) physical status I or II adult patients, aged from 20 to 70 years old, scheduled to undergo elective laparoscopic abdominal surgery were recruited. Exclusion criteria included patients with neurological and cardiovascular diseases or a known allergy history of anaesthetics.
All patients underwent preoperative fasting for at least 8 h. The placement of the chest electrodes was the same for all participants. The five-leads were located at five different positions on the chest. The upper left position was at the junction of the midclavicular line on the left edge of the sternum and the first intercostal space. The lower left position was at the junction of the left midline of the clavicle and the level of the xiphoid process. The upper right position was at the junction between the midclavicular line on the right edge of the sternum and the first intercostal space. The lower right position was at the horizontal junction of the right clavicle midline and the xiphoid process, and the middle position was at the fourth intercostal space on the left edge of the sternum. After the electrodes were placed on the patient chest wall, anaesthesia was usually induced with intravenous midazolam, propofol, sufentanil, and cisatracurium. Sevoflurane together with propofol and remifentanil were used to maintain anaesthesia. Table 1 summarises this information in detail. Physiological signals (such as ECG, BP, HR, and SpO2) were measured to guarantee the safety of the patients under different anaesthesia states. The attending anaesthetist adjusted the DoA accordingly, using these observed signals and their own experience. From the various monitoring feedback information observed, attending anaesthetists need to analyze, synthesize, and judge the vital function indicators of patients according to their own experience and to make timely adjustments and interventions as needed to keep the vital signs as normal or close to the normal physiological state as possible, to adjust the DoA and maintain it at an appropriate level.
In this study, ECG signals were recorded from twenty-three adult patients under general anaesthesia. The signals were recorded using a Philips MP60 monitor (Intellivue; Philips, Foster City, CA, USA). The operation time was 1—3 h. Raw ECG data were sampled at a 500-Hz sampling frequency.
Expert assessment of consciousness level
The expert assessment of consciousness level (EACL) is the average value of the DoA assessment score determined by five experienced anaesthesiologists (i.e., attending physicians) based on clinical recordings and their own experience [27]. An experienced anaesthesiologist trained for many years with rich clinical experience can be familiar with health risks evaluation and accurately assess the DoA through clinical signs, surgical stimulations, the dose of the anaesthetic agent, etc. combined with his or her own clinical experience. Thus, such an expert can perform anaesthesia-related operations proficiently and correctly handle various problems in anaesthesia even if he or she is not in the operating room during surgery. The states of general anaesthesia are classified as: anaesthesia induction, anaesthesia maintenance, and anaesthesia recovery. Anaesthesia induction means that the anaesthesia depth gradually increases, anaesthesia maintenance means that the anaesthesia depth is relatively stable, and anaesthesia recovery means that the anaesthesia depth gradually decreases. These three states represent the different states of anaesthesia depth. The obtained EACL value is a single number from 0–100, similar to the BIS (with 100 denoting ‘fully awake’ and 0 denoting ‘isoelectricity’). During surgery, the clinical information recorded included: (1) vital signs (e.g., HR, BP, SpO2), (2) anaesthetic events, including induction, intubation and extubation of anaesthesia, addition of muscle relaxant drugs, and airway management, (3) surgical events, including the start and end of the surgical procedure and the occurrence of noxious stimulus, (4) other clinical signs, including unusual responses, movement, and arousability under induction and recovery, and (5) any other related events, such as lacrimation, sweating, and patient demography.
ECG preprocessing
Body movements and medical device frequency noise are the main artifacts in ECG recordings. These artifacts seriously affect the analysis results of the ECG signals. Therefore, data preprocessing is essential for distinguishing different anaesthesia states, and can normalize and facilitate subsequent analysis. The specific process is detailed in additional file 1(1).
Frequency domain algorithm
Wavelet transform is a typical nonlinear analysis technique and one of the most useful methods for biological signal analysis, especially in cases of continuous signals with various frequency features [35, 36]. Therefore, in this study, discrete wavelet transform was used for the frequency domain analysis of the HRV power. The calculation formula for the HRV power is detailed in additional file 1(2). Entropy, as a nonlinear dynamic parameter that can measure the incidence of new information in a time series, can be described as a regularity or degree of randomness indicator. SampEn is an improved algorithm from entropy. The calculation formula for the SampEn is detailed in additional file 1(3).
Machinelearning algorithms
Logistic regression is a classification algorithm used to predict the probability of classifying dependent variables. A support vector machine is a supervised learning algorithm that can be applied to classification problems. The calculation formula for the support vector machine approach is detailed in additional file 1(4). A decision tree is a multi-classification supervised learning algorithm. The calculation formula for the decision tree method is detailed in additional file 1(5). An artificial neural network is a nonparametric parallel computing model, which is similar to the neural structure of the human brain [37]. It usually consists of an input layer, a hidden layer, an output layer, and numerous interconnected nodes in multiple layers. The deep neural network developed from the artificial neural network was used in this study. The flowchart of the deep neural network construction is shown in Fig. 1. The deep neural network is detailed in additional file 1(6).
Performance analysis
The performance of four models was quantified based on the results of cross-validation using the precision, recall, and classification accuracy. Precision is defined as the ratio of the number of correct classifications of an anaesthesia state to the total number of classifications of the same type of anaesthesia state. Recall is defined as the ratio of the number of correct classifications of an anaesthesia state to the number of actual occurrences of this anaesthesia state. Classification accuracy is defined as the ratio of the total number of correctly identified anaesthesia states to the sum of all anaesthesia states. The calculation formulas for the precision, recall, and classification accuracy are detailed in additional file 1(7).
Statistical analysis
There are no standardized methods for sample size calculation based on machine learning algorithms. Thus, the sample size calculations in this pilot feasibility study were based on previous reports [32, 34]. The sample size is 23 cases in this study, including 46000 datasets. The average datasets per patient is 2000 datasets. Statistical analyses were performed using SPSS 22.0 (SPSS Inc., Chicago, IL) and Python (version 3.6.5) software. Data were expressed as mean (SD) or percentage, where appropriate. Ternary classification outcome parameters were expressed as events (percentages). Data are presented as tables, box-and-whisker diagrams, and correlation graphs. In addition, we calculated the distribution of the four features in the three anaesthesia states. The Pearson’s correlation coefficient between the EACL and the four features of the deep neural network model was also calculated to estimate the efficacy of the proposed method. The performances of four classification methods were compared: the logistic regression, support vector machine, decision tree, and proposed deep neural network methods. Owing to the small sample size in this study, the sample does not satisfy a normal distribution. Therefore, the four classification methods were compared using the Chi-square test. p < 0.05 was considered statistically significant.