Diagnosis of Schizophrenia with Functional Connectome Data: A Graph-Based Convolutional Neural Network Approach

doi:10.21203/rs.3.rs-786309/v1

Download PDF

Research Article

Diagnosis of Schizophrenia with Functional Connectome Data: A Graph-Based Convolutional Neural Network Approach

https://doi.org/10.21203/rs.3.rs-786309/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 17 Jan, 2022

Read the published version in BMC Neuroscience →

You are reading this latest preprint version

Previous deep learning methods have not captured graph or network representations of brain structural or functional connectome data. To address this, we developed the Brain Graph Covariance Pooling Network (BrainGCPNet) by incorporating global covariance pooling and BrainNetCNN into the self-attention mechanism. Resting-state functional magnetic resonance imaging data were obtained from 171 patients with schizophrenia spectrum disorders (SSDs) and 161 healthy controls (HCs). We conducted an ablation analysis of the proposed BrainGCPNet and quantitative performance comparisons with competing methods using the nested tenfold cross validation strategy. The performance of our model was compared with competing methods. Discriminative connections were visualized using the gradient-based explanation method and compared with the results obtained using functional connectivity analysis. The BrainGCPNet showed an accuracy of 83·13%, outperforming other competing methods. Among the top 10 discriminative connections, some were associated with the default mode network and auditory network. Interestingly, these regions were also significant in the functional connectivity analysis. Our findings suggest that the proposed BrainGCPNet can classify patients with SSDs and HCs with higher accuracy than other models. Visualization of salient regions provides important clinical information. These results highlight the potential use of the BrainGCPNet in the diagnosis of schizophrenia.

Cellular & Molecular Neuroscience

Brain network

Convolutional neural network

Functional connectome

Schizophrenia

Self-attention mechanism

Convolutional neural networks (CNNs) are extremely efficient architectures in image and audio recognition tasks. CNNs performed better than other DNNs in the classification of Alzheimer’s disease versus mild cognitive impairment or normal controls [1, 2]. We also previously reported 84·15–84·43% classification accuracies for schizophrenia (SZ) using a 3D CNN model, outperforming support vector machine (SVM) and other 3D CNN models [3]. However, a critical limitation of CNNs is that receptive fields of their filters for feature extraction do not exactly capture graph or network representations of structural or functional connectome data of the brain. Recent research has shown that the representations produced by CNNs can be strengthened by integrating learning mechanisms into the network that help capture graph or network representations between features; one of these models is the BrainNetCNN [4].

The BrainNetCNN, a type of CNN, is composed of novel edge-to-edge (E2E), edge-to-node (E2N), and node-to-graph (N2G) convolutional filters that leverage the topological locality of brain network data. With structural connectome data, the BrainNetCNN framework outperformed a variety of other methods in predicting neurodevelopment [5]. Another model is a global covariance pooling (second-order pooling). Compared with global average pooling (first-order pooling) in existing deep CNNs, global covariance pooling produces covariance matrices deciphering higher order representations with the potential to enhance the nonlinear modeling capacity of deep CNNs [6]. However, a drawback of global covariance pooling is that the second-order pooling block is only applicable at the end of the network. To overcome this, Gao and colleagues proposed a novel network model introducing global second-order pooling across lower to higher layers to exploit holistic image information throughout a network [7]. With the self-attention strategy [8], high-order statistical representations can be trained at every layer, outperforming other methods [6]. Based on current trends, we hypothesized that the BrainNetCNN framework combined with global covariance pooling and self-attention mechanisms would achieve a higher performance with functional connectome data. We named this new model the Brain Graph Covariance Pooling Network (BrainGCPNet). The aims of the present study were to perform an ablation study using the BrainGCPNet to analyze resting-state functional magnetic resonance imaging (rsfMRI) data and compare its accuracy in classifying SZ versus healthy controls (HCs) with those of other networks. In addition, we sought to develop an explainable saliency map showing significant connections discriminating between SZ and HCs. These connections were compared to the results of functional connectivity (FC) obtained with univariate analysis.

Demographic and clinical characteristics

The diagnoses of patients were SZ (n = 128), schizophreniform disorder (n = 40), and schizoaffective disorder (n = 3). There were no significant differences in age and sex between the SSD and HC groups. However, education was higher in the SSD group compared to the HC group (Table 1).

Ablation study on the BrainGCPNet

When two convolutional layers with the GCP block were used, the highest accuracy obtained was 83·13%. Regardless of the number of layers, accuracy was consistently better (6–7%) in the network with the GCP block compared to the network without the GCP block (Table 2). Unexpectedly, performance was the best with one E2E layer (Table 3). As for the hidden units of N2G in the GCP block, performance was slightly better with 10 units (Table 4). When the number of output dimensionality of the typical convolutional layer was 64, the best performance was observed. Minimal variations of performance were observed with different numbers of output dimensionality of the E2E layer of the GCP block (Table 5)

Performance comparison with other competing methods

The proposed BrainGCPNet showed the best accuracy (83·13%) and area under the curve (89·42%). Its permutation test (10 000 times) was significant (p < 0·001). The next best model was SENet (Table 6 and Figure 2).

Discriminative connections

Regarding the connectivity strength between nodes, the top 10 discriminative connections were between the brain regions of the left posterior cingulate gyrus and right posterior cingulate gyrus; left thalamus and right thalamus; left calcarine sulcus and right cuneus; and left Heschl’s gyrus and right Heschl’s gyrus. The brain regions with highest nodal strength were the left calcarine sulcus, right amygdala, left putamen, right thalamus, and right supramarginal gyrus (Table 7 and Figure 3).

Functional connectivity analysis

Compared to the HC group, the SSD group exhibited significantly increased FC between the brain regions of the cingulate gyrus and inferior frontal gyrus; left superior frontal gyrus and right inferior frontal gyrus; left angular gyrus and left inferior frontal gyrus; and right cuneus and left calcarine sulcus. Additionally, the SSD group showed decreased FC between the brain regions of the putamen and insular cortex, left Heschl’s gyrus and right Heschl’s gyrus and left superior temporal gyrus and, right superior temporal gyrus compared to the HC group (Table 8). Partial correlation analysis revealed significant positive relationships between the connectivity of the left anterior cingulate gyrus and left triangularis inferior frontal gyrus and positive symptom subscale, connectivity of the right anterior cingulate gyrus and left orbital inferior frontal gyrus and negative symptom subscale and connectivity of the right cuneus and left calcarine and positive symptom subscale, general psychopathology subscale and, total score of the PANSS (Supplement Table 1).

To overcome the limitation of previous deep learning (DL) methods not capturing graph or network representations of connectome data, we developed the BrainGCPNet by incorporating global covariance pooling and BrainNetCNN into the self-attention mechanism. Advancement of scientific knowledge is described below in terms of methodological and clinical aspects.

Methodological aspects

In the ablation study, favorable performance was reported in the network architecture composed of only two convolutional layers with GCP blocks. Relevant studies [8,9] have shown that deep convolutional networks with covariance pooling outperformed other competing methods in a large-scale visual recognition task. Unlike the results of the BrainNetCNN study [5], there was no benefit when stacking multiple E2E layers for our classification task. It seems that the features transformed by multiple E2E layers have a negative effect on extracting higher order features of the next E2N layer, which was not used in their original study [5]. Although ten units performed slightly better, the overall difference was small.

The best accuracy was obtained with the BrainGCPNet, compared with other competing methods. Two characteristics of our model may have contributed to this superiority. First, because second-order pooling in the E2N layer captures higher order representations, this may lead to more discriminative features. The covariance matrix produced by second-order pooling is known to improve representation power by quadratic modeling. The i-th row can be interpreted to indicate statistical dependency between the i-th brain region and all other brain regions. We believe that this technique is a proper approach for neuroimaging data in which correlations or network features between brain regions are crucial. This may be partially supported by the finding that performance dropped a little when second-order pooling was not used in our model. Second, adopting the self-attention mechanism may enhance classification performance by effectively learning graph-wise high-order representations at every convolutional layer and recalibrating filter responses. Because the self-attention mechanism allows covariance pooling to be conveniently plugged into any location of the convolutional layers, it helps build deep CNNs. Unlike the BrainNetCNN, our model is flexible and has a deep structure. This may have contributed to the capture of richer statistics of deep features and improvement of the representation and generalization abilities of deep CNNs. Although implementation of a typical convolutional layer before the GCP block was necessary to apply the self-attention mechanism, it may be criticized that features extracted from this convolutional layer may not contain graph or network representations of our connectome data. However, we regarded this convolutional layer as a local sparse feature extractor since a typical convolution operation with small kernels does not significantly distort the topological characteristic of the correlation matrix.

Clinical implications

Using multivariate DL techniques, neuroimaging-based single-subject prediction of psychiatric disorders has gained increasing attention in recent years. Several studies have employed DL methods to classify SZ and HCs. Using sMRI data, two studies applied a DBN to the original pre-processed images and obtained accuracy rates of 91% and 73·6%, respectively [10,11]. Applying SAE with weight sparsity control to rsfMRI data, classification of SZ vs. and controls with an accuracy of 85·5% was reported [12]. Other studies with rsfMRI data reported accuracy rates of 79–92% in SZ using autoencoder-based two- or three-stage architecture [13-15]. FC analysis with rsfMRI data produces a correlation matrix representing inter-correlation between voxels or regions. However, previous DL methods used in the classification of SZ and HCs did not capture inter-correlation features of the brain network. For example, DNN with weight sparsity control requires a 1D input feature vector, thereby losing spatial information between voxels or regions. Although the CNN used in our previous work [3] can preserve spatial locality with the use of 3D data, this model did not capture topological locality of the brain network. The accuracy of our model, BrainGCPNet, was the best in the classification of SSDs and HCs, outperforming BrainNetCNN by 6·09%. This suggests our proposed model is an optimally designed approach to capture inter-correlation features of functional connectome data.

Using the gradient-based explanation method, we identified discriminative functional connections between the brain regions contributing significantly to the recognition of SSDs. Among the top 10 discriminative connections, some regions (posterior cingulate gyrus and angular gyrus) were related to the DMN and others to the auditory network (superior temporal gyrus and Heschl’s gyrus) and thalamus network. The DMN is involved in complex self-referential stimuli, such as mental time travel, perspective taking, and theory of mind [16]. Accumulating evidence suggests that the DMN is abnormal in SZ, although the results have been mixed [8]. It is of interest that the best discriminative connectivity was an interhemispheric connection in the posterior cingulate gyrus. The posterior cingulate cortex (PCC), a key node in the DMN, has a central role in supporting internally-directed cognition showing increased activity when individuals retrieve autobiographical memories or plan for the future [17]. Increases and decreases of resting FC around the PCC have been reported in both patients and their first-degree relatives [18, 19]. In SZ, the auditory cortex is closely associated with auditory verbal hallucinations, which have been proposed to be a result of abnormally elevated resting-state activity in the auditory cortex or from the DMN [20]. The second most discriminative connection was an interhemispheric connection in the thalamus. The thalamus represents an essential hub for cognitive processes and an interface between sensory and motor systems. Brain-wide analysis of FC in SZ revealed that thalamic-related aberrant connectivities were prominent at the chronic stage of SZ [21]. In the task-based fMRI data, the most-significant, stable and discriminative FC changes involved increased correlations between thalamus and other cortical regions [22]. Interestingly, some of these regions (cingulate gyrus, superior temporal gyrus, Heschl’s gyrus and cuneus) were also found to be significant in the FC analysis.

Overall pattern of the FC analysis was a more widespread occurrence of increased connectivities in patients with SZ compared with HCs. This seems in contrast with the results of previous studies that global/average connectivity strength was significantly reduced in SZ compared to controls [23, 24]. However, it should be noted that there are many other studies reporting increased connectivities in the resting-state DMN [25], thalamo-sensorimotor link [26] and computational modeling [27]. The most prominent aberrant connections in SZ were between the cingulate and inferior frontal gyri. Our findings are partially supported by the results of Li et al study [21]. that in patients with first episode SZ, 90% of the FC changes involved the frontal lobes, mostly the inferior frontal gyrus whereas the PCC was one of the areas showing the most prominent changes in chronic SZ. Interestingly, we found positive relationship between the connectivity strength of anterior cingulate gyrus with inferior frontal gyrus and positive or negative symptoms. The anterior cingulate cortex (ACC) is known to be involved in the affect regulation, conflict monitoring and executive control of cognition [28]. The inferior frontal gyrus is involved in attention control and response inhibition [29]. Therefore, it may be speculated that aberrant connectivity in the ACC and inferior frontal gyrus affects their functioning which may in turn lead to development of positive or negative symptoms in SZ. We observed decreased connectivities in the Heschl’s gyrus and superior temporal gyrus. Similarly, Venkataraman and colleagues reported decreased connectivity between the temporal cortices bilaterally in SZ [30]. However, no significant correlation was found between these hypoconnectivity and psychopathology. Lastly, increased connectivity between the cuneus and calcarine sulcus was shown in individuals with SZ compared to HCs. The cuneus (Brodmann area 17) receives visual information from the same-sided superior quadrantic retina and is primarily involved in basic visual processing. The calcarine fissure is a deep sulcus located on the medial surface of the occipital lobe. Multiple lines of evidence indicate that there are reduced intrinsic visual cortical connectivity [31] and decreased connection in high-visual network which was found to be correlated with the severity of positive symptoms in SZ [32]. Thus, our findings on the connectivity plus its correlation with psychopathology suggest that impaired visual networks may also contribute to the development of psychopathology in SZ. On the other hand, while medial, superior, and inferior frontal gyri were found to be significant in the FC analysis, these regions were not identified as such in the gradient-based explanation method. In addition, bilateral connections between the same regions were highly prominent in the gradient-based explanation method, whereas unilateral or bilateral connections between different regions were more common in the FC analysis. These discrepancies may be due to the different methodologies used in the two analyses.

This study has several limitations. First, because the number of subjects used for the training and test phases was small, it is unclear how well these findings will generalize to different samples. Validation experiments will also be necessary if the transfer classification model is applied to a clinical population at a new imaging site. Second, although the proportion of patients with an antipsychotic naïve or free state was approximately 33%, most of the patients were on medication at the time of scanning. Antipsychotics are known to affect FC [33], this factor should be controlled if possible. Despite these caveats, this is the first study to apply a graphical approach based on the CNN to functional connectome data in SSDs. Overall, the BrainGCPNet showed high accuracy in the classification of SSDs and HCs, outperforming other competing methods. Some of the discriminative connections were associated with DMN and auditory network brain regions. Furthermore, some of the discriminative connections identified by DL and conventional univariate methods were similar. These results highlight a potential use of the BrainGCPNet in the diagnosis of SZ.

Participants

All participating patients (n = 171) met DSM-IV-TR criteria for schizophrenia spectrum disorders (SSDs) according to the Structured Clinical Interview for DSM-IV (SCID) [34, 35]. Individuals with alcohol- or drug-use disorders within the past 6 months, intellectual disability (IQ ≤ 70), current or historical neurological disorders, pregnancy, and claustrophobia were excluded from the study. HCs were required to have no previous or current psychiatric disorders, neurological disorders, or significant medical conditions. All participants were presented with a detailed description of the study design to ensure that they fully understood the procedures and gave written informed consent. The study was approved by the Ethics Committee of Jeonbuk National University. All procedures were performed in accordance with relevant guidelines.

Clinical assessment

The severity of symptoms was evaluated within a week of fMRI scanning using the positive and negative syndrome scale [36] and, the Calgary Depression Scale for Schizophrenia⁷. The PANSS and CDSS were administered by trained psychiatrists.

Image Acquisition And Preprocessing

Resting-state functional and structural MRI (rsfMRI and sMRI) data were obtained at the Jeonbuk National University Hospital on a 3T Verio scanner (Siemens Magnetom Verio, Erlangen, Germany) using a 12-channel standard quadrature head coil. We collected a 5-min resting-state scan consisting of 150 contiguous echo-planar imaging functional images (TR: 2000 ms; TE: 30 ms; flip angle: 90°; FOV: 240 mm; image matrix: 64 × 64 mm; voxel size = 1·0 × 1·0 × 1·0 mm [3]; 176 slices). MRI data preprocessing was conducted in a standard way using the Statistical Parametric Mapping software package, ver·12. The criteria for excessive head motion were translation > 2 mm or rotation > 2° in any direction. Participants for whom more than 10% of volumes showed excessive head motion were excluded from the analysis. The linear trend was removed through the time course, and the band-pass filter (0·008 < f < 0·09 Hz) was applied.

Functional Connectivity Analysis

Time series of the voxels within the ROI were averaged to generate the regional time series for the automated anatomical labeling (AAL) atlas. The FC matrix was computed by correlating time series data between every pair in the AAL atlas using the CONN toolbox. Group comparison was performed using ANCOVA with education as covariate. For the contrast map, we applied the cluster-level extent threshold of p < 0·01, which was corrected for multiple comparisons using the false discovery rate. Partial correlations were carried out controlling for age, sex, education, duration of illness, chlorpromazine equivalent doses and head motion (framewise displacement) on the relationship between the rsFC z-values of brain regions showing significant between- group differences and PANSS. The significance level was set at a cluster-level of p < 0·05, and data were not corrected for multiple comparison because of the exploratory nature of the evaluation.

Graph Covariance Pooling Block

The brain FC can be expressed as the complete graph $G=\left(E, B\right),$ where $B$ is a set of nodes reflecting 116 brain regions and $E$ is a weighted adjacency matrix of edges. To capture graph representations of a functional connectome, we adopted graph-wise convolutional filters in the BrainNetCNN [5], which were composed of E2E, E2N, and N2G. However, the block was modified by applying two more methods, i.e., second-order pooling and the self-attention mechanism, and was named the graph covariance pooling (GCP) block (Fig. 1).

Unlike the BrainNetCNN [5] second-order pooling was inserted before the row convolutional filter in the E2N layer. To this end, the 3D output tensor ${\mathbf{x}}_{\text{e}\text{e}}{\in \mathbb{R}}^{h\times w\times {c}^{{\prime }}}$ of the E2E layer was reshaped to the two-dimensional matrix ${\mathbf{F}}_{\text{r}\text{e}}:{\mathbf{x}}_{\text{e}\text{e}}\to {\mathbf{x}}_{\mathbf{e}{\mathbf{e}}^{{\prime }}}{\in \mathbb{R}}^{h\times M}$ where the i-th row indicates the representations of i-th brain regions. Given the matrix ${\mathbf{x}}_{\text{e}{\text{e}}^{{\prime }}}$ consisting of M-samples and h-dimensional features, the sample covariance matrix of ${\mathbf{x}}_{\text{e}{\text{e}}^{{\prime }}}$ can be written as follows:

$${\mathbf{F}}_{\text{c}\text{o}\text{v}}: {\mathbf{x}}_{\text{e}{\text{e}}^{{\prime }}}\to {\mathbf{x}}_{\text{c}\text{o}\text{v}}={\mathbf{x}}_{\text{e}{\text{e}}^{{\prime }}}\mathbf{A}{\mathbf{x}}_{\text{e}{\text{e}}^{{\prime }}}^{\text{T}} , \mathbf{A}=\frac{1}{M}\left(\mathbf{I}-\frac{1}{M}{\mathbf{J}\mathbf{J}}^{\text{T}}\right) \left(5\right)$$

where $\mathbf{I}$ is the$M\times M$ identity matrix, $\mathbf{J}$ represents the $M$-dimensional vector, which is composed of one, and $\text{T}$ denotes the matrix transpose. We performed a row-wise group convolutional filter by considering each row as a group, ${\mathbf{F}}_{\text{r}\text{c}\text{o}\text{n}\text{v}}:{{\mathbf{x}}_{\text{c}\text{o}\text{v}}\to \mathbf{x}}_{\text{e}\text{n}}{\in \mathbb{R}}^{h\times 1}$, to maintain characteristics of the functional connectome data. Through the proposed E2N layer, the input tensor from the E2E layer was transformed into region-wise sparse representations corresponding to the number of brain regions, which can be defined as follows:

$${\mathbf{F}}_{\text{E}2\text{N}}={{\mathbf{F}}_{\text{r}\text{c}\text{o}\text{n}\text{v}}\circ \mathbf{F}}_{\text{c}\text{o}\text{v}}\circ {\mathbf{F}}_{\text{r}\text{e}} :{\mathbf{x}}_{\text{e}\text{e}}\to {\mathbf{x}}_{\text{e}\text{n}}{\in \mathbb{R}}^{h\times 1} \left(6\right)$$

The GCP block is a computational module that can build the enhanced tensor ${\tilde{\mathbf{X}}\in \mathbb{R}}^{h\times w\times c}$ from its original tensor ${\mathbf{X}\in \mathbb{R}}^{h\times w\times c}$, which can be defined as follows:

$${\mathbf{F}}_{\text{G}\text{B}\text{C}\text{P}}={\mathbf{F}}_{\text{E}\text{X}}\circ {\mathbf{F}}_{\text{S}\text{E}} :\mathbf{X} \to \tilde{\mathbf{X}} \left(3\right)$$

where${ \mathbf{F}}_{\text{S}\text{E}}={\mathbf{F}}_{\text{E}2\text{N}}{\circ \mathbf{F}}_{\text{E}2\text{E}}$ is the squeeze function and ${ \mathbf{F}}_{\text{E}\text{X}}= {\mathbf{F}}_{\text{N}2\text{G}}$ denotes the excitation function. For the squeeze operation, an input tensor was fed to the E2E layer to encode the edge strengths over a pair of brain regions. To decrease the computational cost of second-order pooling at the following layer, we also reduced the number of channels from $c$ to ${c}^{{\prime }}$, and the E2E layer of the proposed squeeze operation is defined as follows:

$${\mathbf{F}}_{\text{E}2\text{E}}:\mathbf{X}\to {\mathbf{x}}_{\text{e}\text{e}} {\in \mathbb{R}}^{h\times w\times {c}^{\text{'}}} \left(4\right)$$

For the excitation operation, we employed the N2G layer. This aims to summarize the responses of all brain regions into a single response. In the N2G layer, the dimensionality of input vector ${\mathbf{x}}_{\text{e}\text{n}}$ was decreased by passing the bottleneck layer with a reduced ratio. We then increased the vector from the reduced size to its original size, and activated the output vector using the sigmoid function, ${\mathbf{F}}_{\text{f}\text{c}}: {\mathbf{x}}_{\text{e}\text{n}}\to {\mathbf{x}}_{\text{n}\text{g}}{\in \mathbb{R}}^{h\times 1}$. The final enhanced tensor $\tilde{\mathbf{X}}$ computed by the proposed excitation operation can be obtained by

${\mathbf{F}}_{\text{N}2\text{G}}={\mathbf{F}}_{\text{r}\text{m}\text{u}\text{l}} :\mathbf{X}, { \mathbf{x}}_{\text{n}\text{g}}\to$ $\tilde{\mathbf{X}}{\in \mathbb{R}}^{h\times w\times c} \left(7\right)$

where${ \mathbf{F}}_{\text{r}\text{m}\text{u}\text{l}}$ denotes a row-wise multiplication between the input tensor $\mathbf{X}=[{\mathbf{x}}_{1},{\mathbf{x}}_{2},\dots ,{\mathbf{x}}_{h}{\in \mathbb{R}}^{w\times c}]$ and the weight vector${ \mathbf{x}}_{\text{n}\text{g}}=[{a}_{1},{a}_{2},\dots ,{a}_{h}]$. The output tensor $\tilde{\mathbf{X}}$ was highlighted, helping to boost representation discriminability.

Braingcpnet Architecture

The BrainGCPNet consists of a typical convolutional layer, GCP block, and fully connected classification layer (Fig. 1). For a detailed description, see the Supplementary Material.

Experiments

We conducted an ablation analysis on the proposed BrainGCPNet and quantitative performance comparisons with competing methods using the nested 10-fold cross validation strategy. To avoid possible bias caused by the random dataset partitioning, the cross-validation was repeated 10 times independently, and the average score was reported. Hyperparameters such as varying regularization factors, weight decay, and network architecture, were empirically tuned and optimized. We optimized two important hyperparameters, an initial learning rate and a weighting factor of L1 regularization, using Bayesian optimization. The performance was evaluated using accuracy, sensitivity, and specificity. Also, we plotted the receiver operating characteristic curve of BrainGCPNet and other competing methods, including SVM, fully connected neural network, CNN, squeeze and excitation network (SENet) [8], and BrainNetCNN [5]. For a detailed description, see the Supplementary Material.

Discriminative connections

To discover discriminative functional connections between the brain regions that make significant contributions to the recognition of SDDs, we used the gradient-based explanation method [37]. To obtain an explainable saliency map, after choosing a target class (SSDs or HCs), we fed validation data to the explanation method, and entire saliency maps were linearly integrated and normalized. Connectivity strength between the nodes and nodal strength (sum of edge weights attached to a node within a network) were estimated.

Availability of data and materials

All the data presented and analysed in this study are fully available from the authors upon request.

ACKNOWLEDGEMENTS

The corresponding author would like to thank all participants in the study and father for guidance and support. This study was supported by a grant of the Korean Mental Health Technology R&D Project, Ministry of Health & Welfare, Republic of Korea (HL19C0015) and a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI18C2383).

Authors’ contributions

Y-CC and I-SO conceptualized the study. UT, JS, W-SK, CL, and Y-CC performed the study and acquired data. K-HO and W-SK conducted experiment and statistical analysis. K-HO drafted the manuscript. N-IK, K-HL, JS, I-SO and S-WK critically reviewed the manuscript and Y-CC finalized it. All authors approved the final manuscript.

Funding

This study was supported by a grant of the Korean Mental Health Technology R&D Project, Ministry of Health & Welfare, Republic of Korea (HL19C0015) and a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI18C2383).

Ethics approval and consent for publication

All participating patients (n = 171) met DSM-IV-TR criteria for schizophrenia spectrum disorders (SSDs) according to the Structured Clinical Interview for DSM-IV (SCID). Individuals with alcohol- or drug-use disorders within the past 6 months, intellectual disability (IQ ≤ 70), current or historical neurological disorders, pregnancy, and claustrophobia were excluded from the study. Healthy controls were required to have no previous or current psychiatric disorders, neurological disorders, or significant medical conditions. The study was approved by the Ethics Committee of Jeonbuk National University.

Competing interests

The authors report no biomedical financial interests or potential conflicts of interest.

Author details

¹Department of Computer Science and Engineering, Jeonbuk National University, Jeonju, Korea. ²Department of Psychiatry, Jeonbuk National University, Medical School, Jeonju, Korea. ³ Research Institute of Clinical Medicine of Jeonbuk National University-Biomedical Research Institute of Jeonbuk National University Hos pital, Jeonju, Korea. ⁴Department of Psychiatry, Maeumsarang Hospital, Wanju, Jeollabuk-do, Korea. ⁵Brainnetome Center and National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China. ⁶ University of Chinese Academy of Sciences; CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing 100049, China. ⁷Department of Psychiatry, Chonnam National University Medical School, Gwangju, Republic of Korea.

Hosseini-Asl, E., Gimel'farb, G., El-Baz, A., 2016. Alzheimer's disease diagnostics by a deeply supervised adaptable 3D convolutional network. arXiv preprint arXiv:1607.00556. https://arxiv.org/abs/1607.00556.
Oh, K., Chung, Y.-C., Kim, K.W., Kim, W.-S., Oh, I.-S., 2019a. Classification and visualization of Alzheimer’s disease using volumetric convolutional neural network and transfer learning. Scientific Reports 9(1), 1-16. https://pubmed.ncbi.nlm.nih.gov/31796817/.
Oh, K., Kim, W., Shen, G., Piao, Y., Kang, N.-I., Oh, I.-S., Chung, Y.C., 2019b. Classification of schizophrenia and normal controls using 3D convolutional neural network and outcome visualization. Schizophrenia research 212, 186-195. https://pubmed.ncbi.nlm.nih.gov/31395487/.
Bruna, J., Zaremba, W., Szlam, A., LeCun, Y., 2013. Spectral networks and locally connected networks on graphs. arXiv preprint arXiv:1312.6203. https://arxiv.org/abs/1312.6203.
Kawahara, J., Brown, C.J., Miller, S.P., Booth, B.G., Chau, V., Grunau, R.E., Zwicker, J.G., Hamarneh, G., 2017. BrainNetCNN: Convolutional neural networks for brain networks; towards predicting neurodevelopment. NeuroImage 146, 1038-1049. https://www.sciencedirect.com/science/article/abs/pii/ S1053811916305237.
Wang, Q., Xie, J., Zuo, W., Zhang, L., Li, P., 2019. Deep cnns meet global covariance pooling: Better representation and generalization. arXiv preprint arXiv:1904.06836. https://ieeexplore.ieee.org/abstract/document/9001240.
Kim Y-K, Won S-D, Lee K-M, et al. A study on the reliability and validity of the Korean version of the Calgary Depression Scale for Schizophrenia (K-CDSS). Journal of Korean Neuropsychiatric Association 2005; 44(4): 446-55. https://www.koreamed.org/SearchBasic.php?RID=2341113.
Hu, M.-L., Zong, X.-F., Mann, J.J., Zheng, J.-J., Liao, Y.-H., Li, Z.-C., He, Y., Chen, X.-G., Tang, J.-S., 2017. A review of the functional and anatomical default mode network in schizophrenia. Neuroscience bulletin 33(1), 73-84. https://link.springer.com/article/10.1007/s12264-016-0090-1.
Gao, Z., Xie, J., Wang, Q., Li, P., 2019. Global second-order pooling convolutional networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3024-3033. https://openaccess.thecvf.com/content_CVPR_2019/html/Gao_Global_Second- Order_Pooling_Convolutional_Networks_CVPR _2019_paper.html.
Pinaya, W.H., Gadelha, A., Doyle, O.M., Noto, C., Zugman, A., Cordeiro, Q., Jackowski, A.P., Bressan, R.A., Sato, J.R., 2016. Using deep belief network modelling to characterize differences in brain morphometry in schizophrenia. Scientific reports 6, 38897. https://www.nature.com/articles/srep38897.
Plis, S.M., Hjelm, D.R., Salakhutdinov, R., Allen, E.A., Bockholt, H.J., Long, J.D., Johnson, H.J., Paulsen, J.S., Turner, J.A., Calhoun, V.D., 2014. Deep learning for neuroimaging: a validation study. Frontiers in neuroscience 8, 229. https://www.frontiersin.org/articles/10.3389/fnins.2014.00229/full.
Kim, J., Calhoun, V.D., Shim, E., Lee, J.-H., 2016. Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage 124, 127-146. https://www.sciencedirect.com/science/article/abs/pii/S1053811915003985.
Han, S., Huang, W., Zhang, Y., Zhao, J., Chen, H., 2017. Recognition of early-onset schizophrenia using deep-learning method, Applied Informatics. SpringerOpen, pp. 1-6. https://applied-informatics-j.springerope n.com/articles/10.1186/s40535-017-0044-3.
Patel, P., Aggarwal, P., Gupta, A., 2016. Classification of schizophrenia versus normal subjects using deep learning, Proceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing, pp. 1-6. https://dl.acm.org/doi/abs/10.1145/3009977.3010050.
Zeng, L.-L., Wang, H., Hu, P., Yang, B., Pu, W., Shen, H., Chen, X., Liu, Z., Yin, H., Tan, Q., 2018. Multi-site diagnostic classification of schizophrenia using discriminant deep learning with functional connectivity MRI. EBioMedicine 30, 74-85. https://www.sciencedirect.com/science/article/pii/S2352396418301014.
Molnar-Szakacs, I., Arzy, S., 2009. Searching for an integrated self-representation. Communicative & Integrative Biology 2(4), 365-367. https://www.tandfonline.com/doi/full/10.4161/cib.2.4.8290.
Leech, R., Sharp, D.J., 2014. The role of the posterior cingulate cortex in cognition and disease. Brain 137(1), 12-32. https://academic.oup.com/brain/article/137/1/12/358120?login=true.
Whitfield-Gabrieli, S., Thermenos, H.W., Milanovic, S., Tsuang, M.T., Faraone, S.V., McCarley, R.W., Shenton, M.E., Green, A.I., Nieto-Castanon, A., LaViolette, P., 2009. Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia. Proceedings of the National Academy of Sciences 106(4), 1279-1284. https://www.pnas.org/content/106/4/1279.short.
Zhou, Y., Liang, M., Tian, L., Wang, K., Hao, Y., Liu, H., Liu, Z., Jiang, T., 2007. Functional disintegration in paranoid schizophrenia using resting-state fMRI. Schizophrenia research 97(1-3), 194-205. https://www.sciencedirect.com/science/article/abs/pii/S0920996407002393.
Northoff, G., 2014. Are auditory hallucinations related to the brain's resting state activity? A'neurophenomenal resting state hypothesis'. Clinical Psychopharmacology and Neuroscience 12(3), 189. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4293163/.
Li, T., Wang, Q., Zhang, J., Rolls, E.T., Yang, W., Palaniyappan, L., Zhang, L., Cheng, W., Yao, Y., Liu, Z., 2017b. Brain-wide analysis of functional connectivity in first-episode and chronic stages of schizophrenia. Schizophrenia bulletin 43(2), 436-448. https://academic.oup.com/schizophreniabulletin/article/43/2/436/2503518?login=true.
Gheiratmand, M., Rish, I., Cecchi, G.A., Brown, M.R., Greiner, R., Polosecki, P.I., Bashivan, P., Greenshaw, A.J., Ramasubbu, R., Dursun, S.M., 2017. Learning stable and predictive network-based patterns of schizophrenia and its clinical symptoms. NPJ schizophrenia 3(1), 1-12. https://www.nature.com/articles/articles/s41537-017-0022-8.
Argyelan, M., Gallego, J.A., Robinson, D.G., Ikuta, T., Sarpal, D., John, M., Kingsley, P.B., Kane, J., Malhotra, A.K., Szeszko, P.R., 2015. Abnormal resting state FMRI activity predicts processing speed deficits in first-episode psychosis. Neuropsychopharmacology 40(7), 1631-1639. https://www.nature.com/articles/npp20157.
Lynall, M.-E., Bassett, D.S., Kerwin, R., McKenna, P.J., Kitzbichler, M., Muller, U., Bullmore, E., 2010. Functional connectivity and brain networks in schizophrenia. Journal of Neuroscience 30(28), 9477-9487. https://www.jneurosci.org/content/30/28/9477.short#page.
Libby, L.A., Ragland, J.D., 2011. FMRI as a measure of cognition related brain circuitry in schizophrenia, Brain Imaging in Behavioral Neuroscience. Springer, pp. 253-267. https://rd.springer.com/chapter/10.1007/7854_2011_173.
Cheng, W., Palaniyappan, L., Li, M., Kendrick, K.M., Zhang, J., Luo, Q., Liu, Z., Yu, R., Deng, W., Wang, Q., 2015. Voxel-based, brain-wide association study of aberrant functional connectivity in schizophrenia implicates thalamocortical circuitry. npj Schizophrenia 1, 15016. https://www.nature.com/articles/npjschz201516/fig_tab.
Yang, G.J., Murray, J.D., Wang, X.-J., Glahn, D.C., Pearlson, G.D., Repovs, G., Krystal, J.H., Anticevic, A., 2016. Functional hierarchy underlies preferential connectivity disturbances in schizophrenia. Proceedings of the National Academy of Sciences 113(2), E219-E228. https://www.pnas.org/content/113/2/E219.short.
Carter, C.S., Botvinick, M.M., Cohen, J.D., 1999. The contribution of the anterior cingulate cortex to executive processes in cognition. Reviews in the Neurosciences 10(1), 49-58. https://www.degruyter.com/document/doi/10.1515/REVNEURO.1999.10.1.49/html.
Swick, D., Ashley, V., Turken, U., 2008. Left inferior frontal gyrus is critical for response inhibition. BMC neuroscience 9(1), 1-11. https://bmcneurosci.biomedcentral.com/articles/10.1186/1471-2202-9-102.
Venkataraman, A., Whitford, T.J., Westin, C.-F., Golland, P., Kubicki, M., 2012. Whole brain resting state functional connectivity abnormalities in schizophrenia. Schizophrenia research 139(1-3), 7-12. https://www.sciencedirect.com/science/article/abs/pii/S0920996412002538.
van de Ven, V., Jagiela, A.R., Oertel-Knöchel, V., Linden, D.E., 2017. Reduced intrinsic visual cortical connectivity is associated with impaired perceptual closure in schizophrenia. NeuroImage: Clinical 15, 45-52. https://www.sciencedirect.com/science/article/pii/S2213158217300918.
Li, P., Fan, T.-T., Zhao, R.-J., Han, Y., Shi, L., Sun, H.-Q., Chen, S.-J., Shi, J., Lin, X., Lu, L., 2017a. Altered brain network connectivity as a potential endophenotype of schizophrenia. Scientific reports 7(1), 1-9. https://www.nature.com/articles/s41598-017-05774-3.
Lui, S., Li, T., Deng, W., Jiang, L., Wu, Q., Tang, H., Yue, Q., Huang, X., Chan, R.C., Collier, D.A., 2010. Short-term effects of antipsychotic treatment on cerebral function in drug-naive first-episode schizophrenia revealed by “resting state” functional magnetic resonance imaging. Archives of general psychiatry 67(8), 783-792. https://jamanetwork.com/journals/jamapsychiatry/article-abstract/210850.
First MB. The structured clinical interview for DSM-IV axis I disorders. Biometrics Research Department 1997. https://psycnet.apa.org/record/2004-12821-011.
Han O, Hong J. Structured clinical interview for DSM-IV axis I disorder-Korean version. Seoul: Hana Medical Publishing 2000. https://www.koreascience.or.kr/article/JAKO201032059186148.page.
Yi, J.S., Ahn, Y.M., Shin, H.K., An, S.K., Joo, Y.H., Kim, S.H., Yoon, D.J., Jho, K.H., Koo, Y.J., Lee, J.Y., 2001. Reliability and Validity of the Korean Version of the Positive and Negative Syndrome Scale. Journal of Korean Neuropsychiatric Association 40(6), 1090-1105. https://www.koreamed.org/SearchBasic. php?RID=2340616.
Behzadi, Y., Restom, K., Liau, J., Liu, T.T., 2007. A component based noise correction method (CompCor) for BOLD and perfusion based fMRI. Neuroimage 37(1), 90-101. https://www.sciencedirect.com/science/article/abs/pii/S1053811907003837.

Table 1. Demographic and clinical characteristics of patients with SSDs and HCs

Characteristics	SSDs (n = 171)	HCs (n = 161)	p-value (2 Tailed)
Age (years)	34·38 (10·61)	33·73 (10·96)	0·597^b
Sex
Male (%)	89 (52·05)	74 (45·96)	0·259^a
Female (%)	82 (47·95)	87 (54·04)	0·259^a
Education (years)	13·90 (2·44)	15·26 (2·07)	<0·001^b
Duration of illness (months)	77·70 (96·50)	-	-
CDSS Total	5·88 (5·83)	-	-
PANSS
Positive symptoms	13·69 (8·00)	-	-
Negative symptoms	11·57 (6·55)	-	-
General psychopathology	24·80 (11·35)	-	-
Total score	50·05 (23·26)	-	-
Medication
Naive/Free (%)	28 (16·37)/29 (16·96)	-	-
Chlorpromazine equivalent (mg/day)	449·33(351·495) (n=114)	-	-

Data given as mean (SD). ^aSignificant T statistic for the Chi-square test; ^bSignificant T statistic for the independent two sample t-test.

CDSS, Calgary Depression Scale for Schizophrenia; HCs, Healthy Controls; PANSS, Positive and Negative Syndrome Scale; SSDs, Schizophrenia Spectrum Disorders

Table 2. Performance comparison by the number of convolutional layers with or without GCP block

Layers	Accuracy	Sensitivity	Specificity
1	81·93 / 75·30	84·21 / 76·61	79·50 / 73·91
2	83·13 / 76·79	85·96 / 79·65	80·12 / 73·68
3	83·02 / 76·81	85·27 / 79·53	80·88 / 73·91
4	82·23 / 75·70	85·38 / 76·88	78·88 / 74·53

Data given as with GCP / without GCP (%).

Table 3. Performance comparison by the number of E2E layers in GCP block

The Number of E2E layers	Accuracy	Sensitivity	Specificity
1	83·13	85·96	80·12
2	82·23	85·38	78·88
3	80·72	83·04	78·26

Data given as %.

Table 4. Performance comparison by the number of the hidden units of N2G in GBCP block

Number of hidden units	Accuracy	Sensitivity	Specificity
5	82·83	87·13	78·26
10	83·13	85·96	80·12
15	82·53	87·72	77·02
20	82·83	85·96	79·50

Data given as %.

Table 5. Performance comparison by the number of the output channels

E2E layer	Convolutional layer
E2E layer	8	12	16	20
16	80·42	79·95	80·12	80·72
32	81·33	81·93	81·33	82·50
64	82·23	82·83	83·13	83·02
96	81·63	82·23	83·02	82·83
128	81·93	81·33	82·23	82·50

Data given as %.

Table 6. Performance comparison of the BrainGCPNet with competing methods

	Accuracy	Sensitivity	Specificity	AUC
SVM-PCA	74·90	77·96	71·55	78·85
SVM	72·34	76·91	67·40	76·25
FNNs	74·59	77·72	71·25	78·82
CNNs	76·79	79·65	73·68	80·69
BrainNetCNNs	77·04	78·98	75·00	81·74
SENet	81·21	83·38	79·10	86·85
BrainGPNet	82·04	84·47	79·63	88·41
BrainGCPNet	83·13	85·96	80·12	89·42

Data given as %, AUC; Area under the curve; CNNs; Convolutional Neural Networks, FNNs; Fully Connected Neural Networks, GCPN; Graph Covariance Pooling Network, PCA; Principal Component Analysis, SENet; Squeeze and Excitation Network, SVM; Support Vector Machine.

Table 7. Top 10 discriminative connections

	Connectivity strength	Nodal strength
1	Left posterior cingulate gyrus – Right posterior cingulate gyrus (1)	Left calcarine sulcus
2	Right thalamus – Left thalamus (1)	Right amygdala
3	Right cuneus – Left calcarine sulcus (0·71)	Left putamen
4	Right superior temporal gyrus – Left superior temporal gyrus (0·69)	Right thalamus
5	Right Heschl’s gyrus – Left Heschl’s gyrus (0·69)	Right supramarginal gyrus
6	Left lingual gyrus – Left calcarine sulcus (0·67)	Right putamen
7	Right cuneus – Right calcarine sulcus (0·59)	Right caudate nucleus
8	Right caudate nucleus – Left caudate nucleus (0·57)	Right calcarine sulcus
9	Left lingual gyrus – Right lingual gyrus (0·56)	Left posterior cingulate gyrus
10	Right supramarginal gyrus – Left angular gyrus (0·55)	Left angular gyrus

Table 8. Aberrant functional connections in patients with schizophrenia spectrum disorders

Brain region	t value	Effect size	p-unc	p- FDR	Brain region
SSDs > HCs
Left posterior cingulate gyrus	6·38	0·150	<0·001	<0·001	Left orbital inferior frontal gyrus
	5·17	0·130	<0·001	<0·001	Right orbital inferior frontal gyrus
	4·35	0·110	<0·001	0·005	Left triangularis inferior frontal gyrus
Right posterior cingulate gyrus	6·33	0·140	<0·001	<0·001	Left orbital inferior frontal gyrus
	5·20	0·130	<0·001	<0·001	Right orbital inferior frontal gyrus
	4·24	0·099	<0·001	0·007	Left triangularis inferior frontal gyrus
Left orbito medial frontal gyrus	4·72	0·130	<0·001	0·002	Right orbital inferior frontal gyrus
	4·70	0·110	<0·001	0·002	Right operculum inferior frontal gyrus
	4·24	0·120	<0·001	0·007	Left orbital inferior frontal gyrus
Right orbito medial frontal gyrus	4·04	0·097	<0·001	0·001	Left operculum inferior frontal gyrus
	3·82	0·090	<0·001	0·001	Left triangularis inferior frontal gyrus
	3·40	0·081	<0·001	0·001	Right triangularis inferior frontal gyrus
Left anterior cingulate gyrus	4·21	0·110	<0·001	0·001	Left triangularis inferior frontal gyrus
	3·83	0·100	<0·001	0·001	Right triangularis inferior frontal gyrus
	3·82	0·093	<0·001	0·001	Left operculum inferior frontal gyrus
Right anterior cingulate gyrus	4·42	0·110	<0·001	0·004	Left orbital inferior frontal gyrus
	3·78	0·094	<0·001	0·001	Left triangularis inferior frontal gyrus
	3·48	0·091	<0·001	0·001	Right triangularis inferior frontal gyrus
Left superior frontal gyrus	4·54	0·110	<0·001	0·003	Right operculum inferior frontal gyrus
	4·37	0·110	<0·001	0·005	Right triangularis inferior frontal gyrus
Left precuneus	4·19	0·100	<0·001	0·008	Left orbital inferior frontal gyrus
Left angular gyrus	4·24	0·120	<0·001	0·007	Left triangularis inferior frontal gyrus
Right cuneus	4·17	0·130	<0·001	0·008	Left calcarine sulcus
Left calcarine sulcus	5·40	0·140	<0·001	<0·001	Left cerebellum 6
Left middle cingulate gyrus	4·73	0·110	<0·001	0·002	Left triangularis inferior frontal gyrus
SSDs < HCs
Left putamen	-5·20	-0·120	<0·001	<0·001	Right insular cortex
Right putamen	-5·94	-0·140	<0·001	<0·001	Right insular cortex
	-5·20	-0·110	<0·001	<0·001	Left insular cortex
Left Heschl’s gyrus	-6·38	-0·160	<0·001	<0·001	Right Heschl’s gyrus
	-4·45	-0·110	<0·001	0·004	Right superior temporal gyrus
Left superior temporal gyrus	-4·80	-0·140	<0·001	0·001	Right superior temporal gyrus
Whole-brain thresholded at FDR corrected p < 0.01, FDR, False Discovery Rate; HCs, Healthy Controls; SSDs, Schizophrenia spectrum disorders.

No competing interests reported.

SupplementalMaterial.docx

Download PDF

Journal Publication

published 17 Jan, 2022

Read the published version in BMC Neuroscience →

Editorial decision: Major revision
01 Oct, 2021
Reviews received at journal
21 Sep, 2021
Reviewers agreed at journal
09 Sep, 2021
Reviews received at journal
03 Sep, 2021
Reviewers agreed at journal
17 Aug, 2021
Reviewers invited by journal
17 Aug, 2021
Editor assigned by journal
17 Aug, 2021
Editor invited by journal
17 Aug, 2021
Submission checks completed at journal
17 Aug, 2021
First submitted to journal
06 Aug, 2021

You are reading this latest preprint version

Diagnosis of Schizophrenia with Functional Connectome Data: A Graph-Based Convolutional Neural Network Approach

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Participants

Clinical assessment

Image Acquisition And Preprocessing

Functional Connectivity Analysis

Graph Covariance Pooling Block

Braingcpnet Architecture

Experiments

Discriminative connections

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1