According to the World Health Organization, lung cancer is the leading cause of cancer-related deaths worldwide, accounting for the highest mortality rates among both men and women. Lung cancer is a significant public health concern; however, this cancer is often diagnosed at advanced stages when treatment options are limited (WHO 2023). The use of immunohistochemistry has improved diagnostic accuracy in the classification of lung carcinoma, but the interpretation of the results remains challenging in some instances and the pathologists must be aware of many interpretation pitfalls (Yatabe et al. 2019). Often, molecular methods are difficult to apply to diagnosis due to the high costs and the long time required to complete the analysis (Mincione et al. 2015). In the same way, various equipment can have many disadvantages due to high cost, bulky and expert dependent subjective evaluations (Pala et al. 2022). Therefore, new technologies are emerging to try to improve the lung cancer detection at low cost, such as computational methods and AI applied to microscopic images of cancer.
Human eyes are very efficient at detecting patterns in images. However, the eye has limitations in its ability to analyze certain patterns of shapes, colors and sizes. In these cases, computerized image analysis can quantify changes in the structure that are imperceptible to human eyes. Furthermore, computer image analysis can reduce the time and cost of detections, with practical potential for use and distribution in healthcare systems (Melo et al. 2022; Tenorio et al. 2017).
According to da Silva et al. (2021), the identification of parameters that could successfully differentiate cancer images and aid in the diagnosis is of great interest. The present study aimed to find a different path in the automated analysis of microscopic images of human lung cancer by suggesting mathematical algorithms to quantitatively differentiate the spatial distribution patterns of the cell nuclei in SqCC and benign lung tissue. It is worth mentioning that the shape patterns shown by the spatial distribution of lung cells vary greatly and are difficult to differentiate quantitatively by human eye.
Overall, automated cancer detection involves, at least: 1) image preprocessing; 2) extracting identifiable features; and 3) correlating the features to the diagnosis (da Silva et al. 2021). A major difficulty in automating image analysis is segmentation, which is the separation of the analyzed structure and the image background, commonly resulting in black and white images (binarization). The present study used the spatial distribution of cell nuclei. The nuclei (region of interest) in routine staining present good contrast from the rest of the image (background), as they appear with color numbering in pixels clearly distinguishable from other color numbers in the background by the computer. Thus, in computer vision, the nucleus represents a good structure for image segmentation and binarization, and consequently, the results were satisfactory in the segmentation for automated nucleus analysis described in the methodology. Many previous studies used manual image processing to obtain results in microscopic images of cancer; therefore, these manual analyses are dependent on a human operator (subjective), reducing their automation potential. On the other hand, a highlight of the present study is the automation in the processing of lung microscopic images, which allows practical use in automatic software and AI.
Diagnosing cancer in small nodules and small biopsy samples is still a challenge; therefore, methods for analyzing small areas are of particular importance. In this sense, another highlight in our study is analyzing cell nucleus at high magnification in an area of a few micrometers, which allows the use of small lung samples with less invasive techniques and analysis of very small nodules.
The nucleus of lung cells varies greatly and form irregular/complex images even in benign tissue. However, the distribution and shape of nuclei in cancer can result in an even more complex image. Therefore, the present study used this increase in the complexity of the cell nucleus as shape marker to characterize lung carcinoma. Fractal dimension, lacunarity and multifractality are algorithms used to analyze the complexity/irregularity of biological images (Casey et al. 2024; Lee et al. 2014; Costa and Nogueira 2015; Sánchez-Chávez et al. 2024). According to da Silva et al. (2021), the formation of a tumor generally progresses with distortion of the tissue architecture, altering its complexity compared to the healthy counterpart.
Many structures in the human body, such as the distribution of cells in the lung, cannot be well described using classical Euclidean geometry, which considers lines, circles, squares, triangles and three-dimensional spaces, such as spheres, cubes and cylinders. The fractal dimension can measure how full the image space is occupied. As the fractal dimension has fractional values, fractal methods can better describe the degree of complexity and irregularity in images. Briefly applied to digital images, the fractal dimension can quantify how much of the image the structures occupy between dimensions; for example, between dimension 2 of a square (height and width) and dimension 3 of a cube (height, width and depth), and thus quantify fractional values to better describe complex biological structures. According to Pala et al. (2022), the larger the value of the fractal dimension, the more developed the properties of the structure at different scales. Therefore, the fractal dimension provides a relationship between fractal continuity and information richness (depth and detail), so if the detail and richness of the image are high, the fractal dimension value is also high. Many anatomical structures have been shown to demonstrate self-similarity properties, such as the lung with its bronchioalveolar ramifications. In this sense, fractal morphometry assessing architectural distortion and correlating with a specific pattern can enhance the recognition of histological structures (Mancini et al. 2023).
The present study demonstrated the ability of fractal dimension to detect significant differences between benign lung tissue and SqCC; showing an increase in the complexity/irregularity of cell nuclei in this cancer. The fractal curve also describes the way in which the length measurement between two points increases as the scale decreases (Costa et al. 2013; Hernández and Ventura 2022). Several studies have shown the use of fractal methods to describe complex pathological structures including colon, prostate and breast cancer for diagnosis, staging and prognosis (Lee et al. 2014; da Silva et al. 2021). The data showed consistent correlations, providing strong evidence that macroscopic fractal dimension and lacunarity measurements reflect differences in the tumor cells that give rise to them (Mambetsariev et al. 2019). Fractal geometry could give insights into tumor morphology and could become a useful tool for analyzing irregular tumor growth patterns (Mincione et al. 2015). According to Namazi and Kiminezhadmalaie (2015), the damaged DNA exhibits higher degree of fractality compared to normal DNA in lung cancer.
Some studies have shown the ability of the fractal dimension to detect changes in parameters of certain types of lung cancer. According to Lee et al. (2014), the fractal dimension of carcinoma epithelial architecture stained with pancytokeratin can differentiate adenocarcinoma from squamous cell carcinoma of the lung. Mambetsariev et al. (2019) showed an increased fractal dimension in microscopic images and immunohistochemistry (C-MET, P-C-MET, FAK, P-FAK) for small cell lung cancer. However, we did not find previous studies showing the efficiency of fractal dimension in detecting changes in the spatial distribution of cell nuclei in lung SqCC, as shown in the present study.
According to Lee et al. (2014), the use of fractal dimension to differentiate lung cancer subtypes is a nascent technology, but studies with a large number of cases are still needed. In this way, the present study can contribute to consolidating the use of the fractal dimension as a large number of images (400) from a database were used in our methodology. It is worth mentioning that the fractal dimension has also been used recently to detect alterations in other non-cancer lung pathologies. Mancini et al. (2023) provided evidence to support the use of fractal morphometry as a tool for quantifying and determining lung tissue remodeling in idiopathic pulmonary fibrosis. It is also worth highlighting the increase in the fractal dimension of cell nuclei in carcinoma of the oral mucosa (Mincione et al. 2015).
The theoretical characterization of our images as a fractal structure may represent a limitation; but this possible limitation often occurs in the mathematical use of the method in practical situations applied to pathology. According to Lee et al. (2014), it should be noted that actual fractals do not exist in nature because there is a fundamental limitation to the scaling behavior of natural objects. Even real image renderings of mathematical fractals cannot be truly fractal because of the finite resolution of the rendering. Hence, as in previous studies, the concept of fractal dimension refers to the complexity of an object over a very finite range of scales, and thus represents a pragmatic tool to extract clinically meaningful information.
The lacunarity method measures the spatial distribution of gaps in the structure, quantifying the homogeneity of an image or part of it, in order to make it comparable with other images and quantifying the space filled in this structure. Lacunarity describes the heterogeneity of the structure, the higher the value of the lacunarity the greater the heterogeneity (Melo et al. 2022). According to Mambetsariev et al. (2019), lacunarity also analyzes the texture within the image beyond what can be observed with standard measurements, making it ideal to quantify the complexity of stained tissue.
Lacunarity analysis in the present study showed a statistical reduction (41%) in the SqCC; interestingly, even though lacunarity is less studied in lung cancer compared to fractal dimension, it showed a greater change than the fractal dimension (28%) in the SqCC images. The use of lacunarity in lung cancer microscopy is still scarce; however, the present study showed lacunarity as a useful tool applied to lung pathology. Mambetsariev et al. (2019) described lacunarity as a method capable of differentiating benign from malignant tissue and also providing additional information on the biology of the tumor. These authors showed reduced lacunarity in microscopic images and immunohistochemistry (C-MET, P-C-MET, FAK, P-FAK) for small cell lung cancer. Studies analyzing the lacunarity of cell nuclei in microscopic images of lung cancer are scarce, and we found no previous reports using lacunarity to analyze the distribution of cell nuclei in human lung SqCC.
Very few studies have used multifractal to evaluate lung cancer. The principle behind multifractal analysis is that diferent regions of a process may have different fractal properties, providing information on a broad range of heterogenous phenomena (Barbora et al. 2023). These authors used multifractal analysis on acquired FTIR spectra to demonstrate the cancer metastatic potential in B16-f01, B16-f10, SW-480 and SW-620 cancer cell lines. The multifractal describes the structure analyzed at different scales; this method may be more efficient to evaluate biological structures that have different characteristics at different scales and dimensions (Melo et al. 2022). The present study showed a statistical increase of the parameters Dq, α and f(α) for all values of q (-10 to + 10), with a greater increase for more positive q values. Therefore, multifractality was able to detect changes in the spatial distribution of cell nuclei in lung SqCC. It is worth mentioning that the complex shape marker with the greatest change observed in this study was the f(α) for q + 10, which increased by 53% in lung SqCC images. Surprisingly, studies using multifractal methods are even rarer in health sciences, so we found no previous reports using multifractality to analyze the distribution of cell nuclei in human lung SqCC.
Carcinogenesis involves, among other characteristics, impaired cellular proliferation control and the capacity to invade tissue (da Silva et al. 2021). In this sense, the results observed in the present study for traditional measures (number of cells, area and perimeter) corroborate the literature. The changes observed in our results for simpler shape markers: circularity, aspect ratio, roundness and solidity, corroborate the data observed by Wang et al. (2014). Even so, it is worth highlighting that descriptions using these parameters in SqCC images are still few.
Traditional shape markers in microscopy may have limitations depending on the characteristics of the structure. For example, the area is one of the most analyzed parameters in images, however, the area only provides information on the amount of space the structure occupies, but does not inform how this structure occupies the image. In a simple example, the cell can show a circle shape with area 100 in benign tissue and the same area 100 with a shape similar to a star in a pathology. Furthermore, the area of the nucleus (2 dimensions) can vary greatly depending on the shape and the angle of the cross-section because the nucleus in vivo has 3 dimensions. Even so, optical microscopy is still one of the most used methods to evaluate cancer, despite its limitation when analyzing 2-dimensional shape of 3-dimensional structures, resulting in great variability. In this sense, proposing complex markers capable of generating new information about how cell nuclei occupy the image (not just the amount) can bring new possibilities in detecting and understanding the development of cancer.
Given the rising availability and lower pricing of digital cameras and slide scanners, the use of fractal methods could be easily added in the diagnostic work-up of both surgical and small biopsy samples. Moreover, it could be implemented in artificial intelligence-guided pattern recognition, helping to provide a more comprehensive approach for difficult cases (Mancini et al. 2023). The present study analyzed a human database with a large number of images and diverse characteristics. Thus, the complex shape markers applied in the present study showed good robustness and efficiency independent of adverse factors such as sample collection, processing, coloring, intensity, lighting, among other variations present in images from human databases (outside controlled conditions of experiments in laboratory). Studies have been analyzing the LC25000 dataset to automate cancer detection (Al-Jabbar et al. 2023; Dabass et al. 2023; Gabralla et al. 2023; Halder and Dey 2023). However, most use methods involving machine learning, neural networks, deep learning and other AI-related protocols. On the other hand, the present study followed a different path, aiming to use the LC25000 dataset in search of complex shape markers to better describe the spatial distribution of cells in lung cancer. We did not find previous studies in this approach evaluating the database using fractal dimension, lacunarity and multifractality. It is worth mentioning that complex shape markers can be easily integrated into AI protocols.
Methods analyzing complex shape markers for automatic detection of lung SqCC are still rare, and analyzing the fractal dimension, lacunarity and especially multifractality in the spatial distribution of cell nuclei are practically absent. Therefore, the results in the present study may indicate new tools applicable to the development of computational methods and artificial intelligence for SqCC detection. In association with a pathologist to supervise the diagnosis, automated methods using complex shape markers could be useful new tools in the near future to increase the efficiency and speed of analysis, reducing costs by serving as a first or second analyzer in cancer detection. However, further studies are still needed to determine the applicability of these methods in practical routine evaluating different classifications of SqCC and different types of lung cancer.