Nanobiomechanical data classified by Deep learning based on convolutional neural networks

doi:10.21203/rs.3.rs-3235928/v1

Download PDF

Article

Nanobiomechanical data classified by Deep learning based on convolutional neural networks

https://doi.org/10.21203/rs.3.rs-3235928/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Nanobiomechanical data have an interest in biomedical research, but the capability of deep learning (DL) based on convolutional neural networks (CNN) has not been explored to classify such data. We propose to use these strategies to treat nanobiomechanical data acquired by atomic force microscopy (AFM) on Candida albicans living cells, an opportunistic pathogenic micro-organism of medical interest. Data, acquired by force spectroscopy, allowed us to generate force vs. distance curves (FD curves) which its profile is linked to nanobiomechanical properties of C. albicans. DL was applied to classify FD curves, considered as images, into 3 groups: adhesive nanodomains, non-adhesive domains or in between domains. We achieved a real multiclass classification with a validation accuracy, macro-average of F1, and the weighted average of 92%, without the need to perform the usual dropout or weight regularisation methods. Transfer learning with a pre-trained (PT) VGG16 architecture with and without fine tuning (FT) permitted us to verify that our model is less computationally complex and better fitted. The generalisation was done by classifying on other C. albicans cells with more that 99% of confidence, to finally predict 16,384 FD curves in less than 90 seconds. This model could be employed by a non-machine learning specialist as the trained model can be downloaded to predict the adhesiveness, within seconds, on C. albicans cells characterized by AFM.

Physical sciences/Nanoscience and technology/Other nanotechnology/Computational nanotechnology

Biological sciences/Cell biology/Cell adhesion/Mechanotransduction

Physical sciences/Mathematics and computing/Computer science

Health sciences/Diseases/Infectious diseases/Fungal infection

Artificial intelligence (AI) is a burgeoning field that is impacting virtually every aspect of human life and of our societies, mainly because of advances in AI algorithms, big data, cloud computing¹ and the advent of quantum machine learning (QML). Machine learning (ML) is an adaptive subfield solution of AI, which aims to extract patterns from real-world raw data to acquire knowledge and make decisions². Deep Learning (DL) is a subfield of ML and represents an impactful approach with trainable parameters and significantly better performance than traditional ML algorithms, considering the size of the dataset³. DL can be supervised, semi-supervised, or unsupervised learning and is based on artificial Neural Networks (NNs). Today, DL is used to analyse big data collected in biomedical procedures, with the aim of helping clinicians to make decisions⁴. For instance, it has been showed that convolutional neural networks (CNN or ConvNet) are well adapted to medical image recognition, allowing one to diagnose Alzheimer's disease from magnetic resonance images⁵, or for cancer diagnosis using for example microscopic images of blood cancer cells⁶ or thermal images from breast thermography databases⁷. But as these examples rely on qualitative recognition procedures based on the extraction of textural or morphological features, such as shape or size, only a few works demonstrate the medical utility of CNNs in the real world⁸. On the other hand, ML and mainly CNN-based DL in the field of cell nanobiomechanic remain largely unexplored⁹. Nanobiomechanic refers to the measurement of forces in cells at the piconewton level. Because these forces are involved in cell regulation and physiology, measuring them can provide for example information about the difference between a diseased and a healthy cell¹⁰. In this regard, the nanobiomechanical properties of cells (stiffness, elasticity, viscoelasticity, adhesion) could be used as label-free biomarkers, because they could represent a fingerprint of many pathological situations and could correlate with numerous diseases¹¹. Atomic force microscopy is a tool of choice for measuring the nanobiomechanical properties of living cells. Indeed, AFM, in force spectroscopy mode, can record force-distance curves (FD curves), where the force experienced by the probe is plotted as a function of the probe-sample separation distance. These curves can then be interpreted through different physical models, thereby giving access to the nanomechanical and nano-adhesive properties of cell surfaces¹². Among the few articles on the subject, Minelli et al.¹³ demonstrated the possibility to automatically classify surgically removed brain tissues as normal or cancerous on the basis of a feed-forward neural-network. Another work was reported by Paul Müller et al.¹⁴, where a Python package named nanite¹⁵ was used to automate basic AFM FD curves analysis operations (base line correction, contact point retrieval or model fitting) with data from different AFM manufacturers¹⁶. The authors reported the analysis of 200 FD curves by traditional ML algorithms such as Random Forest or AdaBoost, finding the lowest mean squared error (MSE) for the Extra Trees regressor.

In contrast, in this work, we apply CNN to recognise and classify FD curves as images, acquired on Candida albicans cells. C. albicans is an opportunistic pathogenic micro-organism of medical interest as it is responsible for skin, nail, oral cavity, and vulvovaginal infections (candidaemia)¹⁷. The mechanism used by C. albicans to infect cells and tissues consists first in adhering to a host cell via surface proteins, making it therefore crucial to understand the organisation and adhesive properties of these proteins. Such proteins are called adhesins, such as the agglutinin-like sequences (ALS) proteins, which contain amyloid-like sequences and can assemble as nanodomains. For instance these proteins have been mapped at the surface of C. albicans using AFM and published by some of the present authors¹⁸. In that previous work, different adhesive nanodomains resulting from ALS proteins aggregation were elucidated by means of AFM based high-resolution force mapping, generating images containing 16,384 FD curves. The aggregation degree of the ALS proteins was correlated to their stiffness and adhesiveness, representing a nanobiomechanical signature. Here, we propose to use CNN based supervised DL to classify the FD curves acquired in that study. Hence, we used 2400 FD curves for training and 600 FD curves for validation. We further demonstrated the performance of our algorithm on 900 humanly classified FD curves coming from another cell and finally we applied the algorithm without supervision on an entire matrix of 16,384 FD curves. These latter were classified in around 90 seconds. We have specifically used CNN to analyse and extract significant features of the FD curves to deduce the adhesiveness degree of C. albicans, showing that ConvNet are well suited to classify nanomechanical data as images, collected from living cells.

The adjustment and performance of the CNN model could be visualised by graphs of training and validation losses. Figure 1.a depicts these metrics, both the validation and the training losses decrease and stabilise, reaching its minimum after 80 until 100 epochs (complete passes of the training data set through the algorithm). Our model shows an optimal generalisation to validation data (optimal fit) and then is not under- or over-fitted because the curves of the training and validation losses overlap. The accuracy of the model is depicted in the Fig. 1.b. The plot illustrates that both the training and the validation accuracies are linearly increasing until they reach 92%. This behaviour is attained in the model without dropout or weight regularisations.

Our mode had a hold-out of 80 − 20 (80% of data for training and 20% of the data for validating). The performance of the training model was tested on the validation dataset, which consisted of 200 images of FD curves of each class. Hence the final model used 2400 images of FD curves (800 images for each of the 3 classes) for training and 600 images for validating the performance of the model (200 images per class). The classification capability of the model is then appreciated in a confusion matrix (Fig. 1.c).

The Fig. 1.c illustrates that the true positives (TP) are high for the three predicted classes on the validation dataset (it can be seen on the diagonal going from up-left corner to the bottom-right corner). On 200 FD curves belonging to the domains denoted as Inter, the algorithm correctly classifies 174 curves as Inter and misclassify 4 curves as Nano and 22 as Out. On 200 FD curves belonging to the domains denoted as Nano, the algorithm correctly classifies 199 curves as Nano and misclassify 1 curve as Inter and 0 as Out. On 200 FD curves belonging to the domains denoted as Out, the algorithm correctly classifies 182 curves as Out and misclassify 18 curves as Inter and 0 as Nano. From the confusion matrix, we can make a classification report that includes the commonly useful metric parameters as follows. The precision for Inter, Nano and Out classes are 0.89, 0.99 and 0.88, respectively (Fig. 1.d). The recall (sensitivity or true positive rate) for Inter, Nano and Out classes are 0.86, 0.97 and 0.92 respectively. The F1-score, which combines the precision and recall scores, for Inter, Nano and Out classes are 0.88, 0.98 and 0.89, respectively. This last parameter is used to obtain 0.92 of macro-averaged F1 score and 0.92 for the weighted average, finally the general accuracy was of 92%.

The trained model predicts each FD curve in a few milliseconds giving probabilities displayed following the order in which the model classify the FD curves (see Fig. 3b), i.e. the first number corresponds to the Inter class of the FD curve, the second to the Nano class and the third to the Out class. Then by feeding a random FD curve image, the algorithm recognises that it belongs to an intermediate part of the cell (Fig. 2.a) with a probability of 99.99% (displayed as .9999 considering 1 as 100% of probability), it also gives a probability of 5.61 x 10^− 5 that it belongs to a nanodomain and a probability of 3.96 x 10^− 8 that it belongs to an external part of the membrane cell, interpreting these as 0% probability, for the latter 2. Figure 2.b also shows the recognition of an FD curve corresponding to part of a nanodomain, in this case only with the retract curve, with 100% confidence. It also includes the probability value corresponding to an FD curve of an Inter class as small as 6.5 x 10^− 25 and 5.3 x 10^− 27 belonging to the Out class, which can be considered as a 0% probability. The model can display a list of all predicted probabilities.

The objective of our classification algorithm is to generalise on unseen data coming from other C. albicans cells. To validate this objective, we worked on a different JPK quantitative imaging data with also 128 X 128 nanoindentations (16,384 FD curves) corresponding to another C. albicans cell (Fig. 3.a). We then processed in 2 steps: first, 300 FD curves from each class (Inter, Nano and Out) were selected and exported by using the free ROI. Figure 3.b depicts the prediction result on the testing dataset, it gives the classification of the 3 classes in just 5 seconds using GPU once the trained mode was loaded. The prediction was 298, 304 and 298 for Inter, Nano and Out, respectively, which represents more than 99% confidence. In a second step, the algorithm was fed with the full matrix made of 16,384 FD curves. In less than 90 seconds, it provides a classification that is presented in Fig. 3.c. Further tests were performed to corroborate the performance of our model. A test dataset was prepared as the previous one, but in this case, each image of the FD curves included only the retract curve, allowing us to corroborate that the main feature the CNN is taking into consideration to learn and predict is the adhesion force of the curve.

The above tests have allowed us to be fully confident that our model of extension .h5 (available together with the code in GitHub) is powerful and can predict on any dataset or any group of FD curves exported as images without needing to be labelled. Simply exporting this file from an AFM adhesion map of a C. albicans cell makes the prediction over 99% reliable, giving the amount of nanodomain FD curves correlated to a degree of adhesion.

Even without a large training dataset, dropout and weight regularisation methods that are usually required in DL, applied on image classification, to avoid overfitting our DL model fitted very well, obtaining high accuracy and good metric parameters such as accuracy. It seems that when using images of curves as input to the CNN, there is no need to use regularisation methods. The high confidence in the prediction of the real multiclass classification is attributed to the fact that the data preparation was very clean using the powerful atomicJ software, but also that the images are not complex as it is usually used with CNN. The use of FD curves as images to feed the CNN is another way of using the CNN to achieve high accuracy and specially to predict with high confidence. We found even better performance when a binary classification was performed. We built a similar model that had just 800 images of FD curves of two classes for the training, to classify between nanodomain or to non-adhesive nanodomain. In this case, we surprisingly achieved, in less than 20 epochs, 100% accuracy, a perfect confusion matrix and 100% confidence in prediction (not shown in this article, found in the code link), which is so difficult to achieve with more complex images such as cells, animals or other inanimate objects. In conventional analysis of AFM FD curves, an AFM specialist must pre-process these curves by correcting the baseline or slope shift. We observed that due to the high capacity of the CNN, pre-processing of FD curves is no longer necessary due to the following two important properties of CNN¹⁹: First, the patterns that a CNN learns are translation invariant ²⁰, that is when a CNN learns a certain pattern for example in lower-left corner of the image (in this case, a part of the FD curve) the CNN can recognise it anywhere in another image, similar to the functioning of visual organ in humans. Secondly, CNN can learn spatial hierarchies of patterns,²¹ which means that a first convolutional layer would learn a small local pattern such as the CP in the FD curve and a second convolutional layer would learn a bigger pattern related to the previous feature layer, and so on. These behaviours, validated by the images obtained of the different CNN layers, are working very well when the model has images of curves. Finally, our model had 14,152,259 trainable parameters compared to the 27,626,307 trainable parameters of the pre-trained VGG16 architecture²² commonly used when there is not a large dataset to train (1.4 million labelled images and 1,000 different classes, available in Keras). The bootstrap extractor technique was used, i.e. a fully connected layer, similar to ours, was integrated into the pre-trained layers. This technique achieved 95% accuracy, but the complexity of learning the model is higher, as it has more convolutional layers and therefore a more complex model architecture that requires more time to learn. In terms of losses and accuracy for this pre-trained VGG16, the validation curve shows a noisy behaviour, indicating that the model is not very well fitted compared to our better fitted model.

Cell immobilization and quantitative adhesion images

An AFM, sometimes called bioAFM, can perform nanoindentations measurements on living cells. Nanoindentation refers to the act of pressing a nanometric cantilever tip against a cell surface to deduce the nanobiomechanics of the cell surface. For that, force-distance curves are recorded, from which parameters such as elastic modulus, adhesion, and stiffness can be extracted. A typical force-distance (FD) curve is illustrated in Fig. 4.a. It is composed of two parts; one is the approach or extend, where the contact point (CP) is visible. The CP indicates the position at which the tip touches the cell surface; it is mainly used to calculate the Young’s modulus. The other part of the force curve is the retract part or withdraw, used to obtain the maximum adhesion force (MA_F) and the stiffness deduced from a linear fit on the slope of the curve (LF).

Before conducting AFM nanoindentations, C. albicans (ABC Plataform Bug Bank, Nancy, France) were immobilised into microwells made of microstructured polydimethylsiloxane (PDMS) stamps following a convective method reported in another publication^23,24 (see Fig. 4.b). This immobilisation procedure permitted us to both obtain images and quantify adhesive properties of single C. albicans cells. The big data related to the adhesiveness degree of the C. albicans cell surface were obtained using an AFM Nanowizard III (Bruker, Billerica, USA), as reported elsewhere²⁵. Quantitative Imaging mode was conducted, using MLCT AUWH (Bruker) cantilevers (nominal spring constant of 0.01 N/m), to obtain high resolution AFM images of 128 X 128 pixels which represent 128 X 128 nanoindentations giving 16,384 FD curves such as of Fig. 4.a¹⁸. In these AFM images some adhesive nanodomains were observed and highlighted by coloured circle-like shapes that measure around 170 nm in diameter (close-up in the Fig. 4.c).

Nanobiomechanical data processing

The pre-processing of the big data used for the training and validation was done by using an open-source java-based software, named AtomicJ²⁶. It is a graphical user-friendly interface (GUI) application for the analysis of AFM recordings on biological samples. The software permits to directly open force maps of several extensions. We opened one file having JPK quantitative imaging data (.jpk-qi-data) corresponding to one C. albicans cell, illustrated in Fig. 4.c. Free-hand region of interest (ROI) command was used to select three different parts of the cell wall corresponding to nanodomain, intermediate and outside nanodomain. We decided to create these 3 classes because 3 different adhesion magnitudes were visually identified, corresponding to different maximum adhesion forces (MA_F), in Fig. 4.d, 4.e, 4.f. These different plots (cured via AtomicJ software of their axes) were labelled as class Nanodomain, Intermediate or Outside and used for image recognitions in our CNN architecture; actually, our model takes these AFM curves as input images. For the training part of the DL, 1000 images coming from different nanodomain parts such as of Fig. 4.d were generated, 1000 images from the intermediate between a nanodomain and an outside part such as of Fig. 4.e, were generated. Finally, 1000 images coming from different parts of the outside nanodomain such as of Fig. 4.f were stored. The dataset containing all JPG images were stored in a Zip file, of 138 Mb in size (the type and resolution of the different images were chosen with the AtomicJ software).

Convolution Neural Network (CNN) based deep learning architecture

CNN are conceived specially to work with dimensional(D) spaces²⁷ which are multidimensional array data known as tensors. The learning in the neural networks could be formulated as an optimisation problem, denoted as follows²⁸.

is a non-linear activation function. This is a non-convex optimisation problem that could be experimentally ameliorated by randomly initialising the network weights and choosing ReLU as non-linear activation function, as it is done in this work. To conduct the DL algorithm, the open-source Google platform TensorFlow²⁹ was employed and Keras as Python based open-source API¹⁹, built under TensorFlow environment. Google Collaboratory was used as coding environment, under graphical processing unit (GPU) as hardware accelerator to train and validate at 92 % of accuracy.

We implemented a simple hierarchical CNN architecture similar to the LeNET 5²⁷, but with 7 layers, which is well adapted for image recognition. After conducting several training tests, the best retained CNN architecture is illustrated in Fig. 5. It is composed of four convolutional layers of neurons; in the first one it is included the input shape of our images indicated with 3 depth dimensions (red, green, blue or RGB) with a size of 224 X 224. To define the feature map, we used a first convolutional layer with 32 filters, 3 kernel size, the second layer with 64 filters, the third layer with 128 filters and the last with 256 filters. A MaxPooling was then used to reduce the down sample feature maps, after each convolutional layer. Finally, the output was fed to a flatten function to work with the same dimensionality of the 2 dense layers. As the goal was to obtain a 3-image classification, the last layer had 3 outputs and a softmax activation.

Data and code availability

Training, validation and test real-world datasets are available at https://github.com/users/AdrianMartinezRivas. The Python code to reproduce results are also there.

Acknowledgements

AMR discloses support for publication of this work to Instituto Politécnico Nacional (IPN) through its sabbatical year and COTEBAL programs and to have had the permission to stay in France for a period. We disclose support for the research of this work from the Agence Nationale de la Recherche (ANR) in France, grant number ANR-20-CE42-0017. E. D is researcher at Centre National de la Recherche Scientifique(CNRS).

Author contributions

Conceptualisation: A.M.R., E.D. The manuscript was written by A.M.R in discussion and correction with all other authors. All authors have reviewed and approved the final manuscript.

Competing interests

The authors declare no competing interests.

Fosso Wamba, S., Bawack, R. E., Guthrie, C., Queiroz, M. M. & Carillo, K. D. A. Are we preparing for a good AI society? A bibliometric review and research agenda. Technol. Forecast. Soc. Change 164, 120482 (2021).
Ian Goodfellow, Yoshua Bengio & Aaron Courville. Deep Learning. (MIT Press, 2016).
Azuri, I., Rosenhek-Goldian, I., Regev-Rudzki, N., Fantner, G. & Cohen, S. R. The role of convolutional neural networks in scanning probe microscopy: a review. Beilstein J. Nanotechnol. 12, 878–901 (2021).
Arun Bhavsar, K. et al. A Comprehensive Review on Medical Diagnosis Using Machine Learning. Comput. Mater. Contin. 67, 1997–2014 (2021).
Islam, J. & Zhang, Y. Brain MRI analysis for Alzheimer’s disease diagnosis using an ensemble system of deep convolutional neural networks. Brain Inform. 5, 2 (2018).
Kumar, D. et al. Automatic Detection of White Blood Cancer From Bone Marrow Microscopic Images Using Convolutional Neural Networks. IEEE Access 8, 142521–142531 (2020).
Zuluaga-Gomez, J., Al Masry, Z., Benaggoune, K., Meraghni, S. & Zerhouni, N. A CNN-based methodology for breast cancer diagnosis using thermal images. Comput. Methods Biomech. Biomed. Eng. Imaging Vis. 9, 131–145 (2021).
Kleppe, A. et al. Designing deep learning studies in cancer diagnostics. Nat. Rev. Cancer 21, 199–211 (2021).
Zhou, G., Zhang, B., Tang, G., Yu, X.-F. & Galluzzi, M. Cells nanomechanics by atomic force microscopy: focus on interactions at nanoscale. Adv. Phys. X 6, 1866668 (2021).
Chen, J. Nanobiomechanics of living cells: a review. Interface Focus 4, 20130055 (2014).
Stylianou, A., Lekka, M. & Stylianopoulos, T. AFM assessing of nanomechanical fingerprints for cancer early diagnosis and classification: from single cell to tissue level. Nanoscale 10, 20930–20945 (2018).
Formosa-Dague, C., Duval, R. E. & Dague, E. Cell biology of microbes and pharmacology of antimicrobial drugs explored by Atomic Force Microscopy. Semin. Cell Dev. Biol. 73, 165–176 (2018).
Minelli, E. et al. A fully-automated neural network analysis of AFM force-distance curves for cancer tissue diagnosis. Appl. Phys. Lett. 111, 143701 (2017).
Müller, P. et al. nanite: using machine learning to assess the quality of atomic force microscopy-enabled nano-indentation data. BMC Bioinformatics 20, 465 (2019).
Nanite documentation. Nanite 3.1.2 documentation. https://nanite.readthedocs.io/en/stable/ (2021).
afmformats documentation. Documentation — afmformats 0.16.4 documentation. https://afmformats.readthedocs.io/en/stable/index.html#index (2021).
Martin, H., Kavanagh, K. & Velasco-Torrijos, T. Targeting adhesion in fungal pathogen Candida albicans. Future Med. Chem. 13, 313–334 (2021).
Formosa, C. et al. Multiparametric imaging of adhesive nanodomains at the surface of Candida albicans by atomic force microscopy. Nanomedicine Nanotechnol. Biol. Med. 11, 57–65 (2015).
Chollet, F. Deep Learning with Python, Second Edition. Manning Publications https://www.manning.com/books/deep-learning-with-python-second-edition (2021).
Zhang, H. & Arodz, T. Learning Invariance in Deep Neural Networks. in Computational Science – ICCS 2021 (eds. Paszynski, M., Kranzlmüller, D., Krzhizhanovskaya, V. V., Dongarra, J. J. & Sloot, P. M. A.) 64–74 (Springer International Publishing, 2021). doi:10.1007/978-3-030-77967-2_6.
Razzaghi, P., Abbasi, K. & Bayat, P. Learning spatial hierarchies of high-level features in deep neural network. J. Vis. Commun. Image Represent. 70, 102817 (2020).
Simonyan, K. & Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. Preprint at http://arxiv.org/abs/1409.1556 (2015).
Dague, E. et al. Assembly of live micro-organisms on microstructured PDMS stamps by convective/capillary deposition for AFM bio-experiments. Nanotechnology 22, 395102 (2011).
Formosa, C. et al. Generation of living cell arrays for atomic force microscopy studies. Nat. Protoc. 10, 199–204 (2015).
L. Chopinet; C. Formosa ; M.P. Rols ; R.E. Duval ; E. Dague. Imaging living cells surface and quantifying its properties at high resolution using AFM in QI^™ mode. Micron (2013).
Hermanowicz, P., Sarna, M., Burda, K. & Gabryś, H. AtomicJ: An open source software for analysis of force curves. Rev. Sci. Instrum. 85, 063703 (2014).
Lecun, Y., Bottou, L., Bengio, Y. & Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
Rene Vidal and Joan Bruna. Mathematics of Deep Learning. (2017).
Abadi, M. et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. (2016).

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Nanobiomechanical data classified by Deep learning based on convolutional neural networks

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Material and Methods

Cell immobilization and quantitative adhesion images

Nanobiomechanical data processing

Convolution Neural Network (CNN) based deep learning architecture

Declarations

References

Additional Declarations

Status:

Version 1