Measurement of Mouse Head and Neck Tumors by Automated Analysis of CBCT Images

doi:10.21203/rs.3.rs-2871247/v1

Download PDF

Article

Measurement of Mouse Head and Neck Tumors by Automated Analysis of CBCT Images

https://doi.org/10.21203/rs.3.rs-2871247/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 25 Jul, 2023

Read the published version in Scientific Reports →

You are reading this latest preprint version

Animal experiments are often used to determine effects of drugs and other biological conditions on cancer progression, but poor accuracy and reproducibility of established tumor measurement methods make results unreliable. In orthotopic mouse models of head and neck cancer, tumor volumes approximated from caliper measurements are conventionally used to compare groups, but geometrical challenges make the procedure imprecise. To address this, we developed software to better measure these tumors by automated analysis of cone-beam computed tomography (CBCT) scans. This allows for analyses of tumor shape and growth dynamics that would otherwise be too inaccurate to provide biological insight. Monitoring tumor growth by calipers and imaging in parallel, we find that caliper measurements of small tumors are weakly correlated with actual tumor volume and highly susceptible to experimenter bias. The method presented provides a unique window to sources of error in a foundational aspect of preclinical head and neck cancer research and a valuable tool to mitigate them.

Biological sciences/Biological techniques/Imaging/X ray tomography

Physical sciences/Engineering/Biomedical engineering

Physical sciences/Mathematics and computing/Software

Biological sciences/Cancer

Biological sciences/Cancer/Cancer imaging

Biological sciences/Cancer/Cancer models

Biological sciences/Cancer/Head and neck cancer

Biological sciences/Cancer/Oral cancer

Despite advances in the use of organoids ¹ and organ-on-chip technologies ², in vitro systems fall short in reproducing crucial aspects of real tumors, which contain a diverse array of stromal cells, interact with a systemic immune system, and can metastasize to distant organs. Mouse models of cancer overcome these limitations and allow controlled experiments comparing groups of nearly identical animals with pharmacological and genetic manipulations to test precise biological hypotheses. They are therefore of immense value in showing that something affects tumor growth—or determining why in a context where the effect can be directly verified.

Unfortunately, the methods used to measure tumors in mice are often inaccurate, time-consuming, and susceptible to various forms of bias. The volume of an externally palpable tumor is normally approximated using the following formula (or similar) from distances measured with calipers:

where d_long and d_short are the longer and shorter of two roughly orthogonal measurements respectively. Details of the measurement procedure, including how tightly to squeeze the calipers, likely contribute to random errors and inconsistencies between measurers. The extent to which Eq. 1 over- or under-estimates volume is, of course, also dependent on tumor shape. This is particularly pertinent in head and neck cancer (HNC), where tumors can grow exophytically or endophytically through foramens and invade adjacent lymph nodes. Growth of tumors that are not externally palpable, such as lung tumors ³, or brain tumors ⁴, can instead be tracked using medical imaging modalities, including computed tomography (CT) ⁵, magnetic resonance imaging (MRI) ⁶, and ultrasound. Although this can provide useful information, the applicability of image-based tumor measurement to serial monitoring of multiple animals is limited by lengthy image acquisition and analysis times. It has been previously reported that caliper measurement is less accurate than image-based measurement of the same tumors ^7,8. Bioluminescence imaging (BLI) is also notable in this context as a method to compare growth of appropriately labeled tumors, although it suffers from a number of technical issues and is not typically used to estimate tumor volume ⁹.

Measurements of tumor volume by both calipers and manual image analysis, merely through the direct involvement of humans, also allow for conscious or subconscious injection of bias into the results. Despite evidence for such experimenter effects and the importance of blind data recording ^10,11, it is not a common practice in this context, and represents an important advantage of automating tumor volume quantification. We accordingly sought to develop an accurate, cost-effective method of tumor measurement through automated image analysis.

We therefore sought to develop a method of tumor measurement that overcomes these sources of error, while also being sufficiently cost-effective to allow for continual monitoring of tumor growth in many animals over the course of an experiment as a direct substitute for caliper measurement. To this end, we developed software that processes low-dose cone beam computed tomography (CBCT) images of up to five mice with buccal tumors, isolates the individual mice, and segments the images to calculate volumes and additional information about each tumor, without requiring any user input. We present preliminary analysis of data generated by this software and caliper measurements for comparison, demonstrating substantial utility of this approach in pre-clinical head and neck cancer research.

Automatic buccal tumor segmentation

Cone-beam computed tomography (CBCT), a form of CT imaging commonly used in dentistry and radiation therapy, is capable of quickly acquiring 3D images with high resolution in all three dimensions, which is desirable for accurate volume measurement. As a proof of concept for the use of our CBCT images for volume measurement, we scanned a collection of small aliquot tubes containing known volumes of water and used a python script to segment the image (Supplementary Figure 1A, B). Volumes calculated from the number of voxels in each region identified as water were in very good agreement with volumes expected based on weighing the water when is was pipetted into the tubes (Supplementary Figure 1C). In this case, image segmentation was achieved by thresholding (voxel values in water are higher than air or plastic), some processing of the binary image to correct consequences of noise and burring, and separation of the water density voxels into connected components. While similar methods are also applicable to mice, tumors are not readily distinguishable from other soft tissue in CT images, so a somewhat complicated approach was required to segment them (Figure 1A).

Using modified stages and anesthesia setups (Figure 1B, Supplementary Figure 1D), we were able to acquire CBCT scans of multiple mice at a rate comparable to caliper measurement (Supplementary Figure 1E), making this a viable alternative for tumor monitoring at scale. Pre-processing of the resulting scans was then required to separate individual mice (Figure 1C) and remove non-mouse objects (Figure 1D). The method we developed to segment each mouse exploits the symmetry of the head to identify tissue on one side that does not correspond to tissue on the other. This left-right mapping is achieved by fitting a curvilinear coordinate system that bends with the neck and jaw. In contrast to the natural coordinate system of voxel indices (Supplementary Figure 2A), where each voxel represents the same volume of space, the curved coordinates define a grid where the volume of real space represented by each point varies. To account for this, volumes in the curved space were calculated by adding up volumes of 24 tetrahedra per voxel, arranged to count all real space exactly once (Supplementary Figure 2B).

We used various image processing techniques, including thresholding, region growing, registration of small blocks of voxels, and filters to detect specific features, to first segment teeth (Supplementary Figure 2C), then bone (Figure 1E), and label various consistently identifiable points.

One measurer tended to record lower volumes than the others, achieving good agreement with volumes from CBCT segmentation (Figure 2B) and collected all other caliper measurements reported in this paper. Importantly, this results from caliper measurements were less similar to automatic CBCT analysis in other experiments (Supplementary Figure 3A). Differences between measurers is a significant problem with the caliper method, resulting in obvious inconsistencies if one person does not collect all caliper measurements for a given experiment. Variability between caliper measurements of a given tumor tended to be far greater than variability between CBCT segmentation volumes from different scans, especially for small tumors (Figure 2C). It appears that random errors in volume measurement from CT are roughly proportional to volume, whereas a significant component of the variability between caliper measurements is independent of volume (Figure 2D).

We similarly compared results from automatic segmentation to manual segmentation. For this experiment, we acquired contrast-enhanced CBCT scans of 15 large tumors. Three different styles of manual contouring (Figure 2E, F) were performed by two researchers, using ITK-Snap software, and measured distances approximating caliper measurement were included for additional comparisons (Supplementary Figure 3B). We observed poor agreement between these methods (Figure 2E). Despite visible differences between the contouring styles, it is worth noting that all of the segmentations look similar and reasonably accurate when overlayed on the scans. This suggests that for manual (or semi-automatic) image segmentation to give reproducible results would require very rigidly defined procedures. This approach is also too time-consuming to present a viable alternative to automatic segmentation for continual monitoring of many tumors.

Comparison of tumor volumes from image segmentation to caliper measurements

While a linear relationship between tumor volumes from automatic CBCT segmentation and caliper measurements by the primary tumor measurer was observed when tumors were measured at a single timepoint (Figure 2B, “Measurer One”), this was not observed when caliper measurements and CBCT scans were collected in parallel over the course of real experiments (Supplementary Figure 3A). Two such experiments were conducted, and all matched volume pairs (CT and caliper measurements of the same tumor on the same day) are shown in Figure 3A with linear and quadratic fits. It appears that caliper volumes less than 200 mm³ were nearly always overestimates and caliper volumes greater than 200 mm³ were nearly always underestimates. A coefficient of only .57 for the first order term of the linear fit is particularly concerning, as it suggests that caliper measurement substantially underestimates changes in tumor volume.

The same data are shown in Figure 3B, but with linearly interpolated connecting lines between points for each mouse. These trajectories are not entirely random; a tumor measured once to have a low caliper volume relative to CT volume tends to have a low caliper volume at the next measurement as well. Another important observation about these results is the large number of caliper measurements near 100 mm³ that correspond to CT measurements close to zero. From overlayed histograms of these volumes (Figure 3C), we see that the dataset is primarily comprised of such measurements. A Bland-Altman plot reveals strong size-dependence of disagreement between the two methods (Figure 3D). Both methods occasionally reported tumor volumes of exactly zero, although usually not for the same tumors (Figure 3E).

Effects of experimenter expectations on caliper data

One notable feature of the caliper growth curves is that several abruptly drop from about 100 mm³ to zero at day 60, when it was assumed that the surviving mice had been cured. A mouse with a tumor that would be measured as having a volume of 100 mm³ is, in fact, not necessarily distinguishable from a mouse with no tumor at all. At small volumes, the tumor is not superficial but somewhere in the cheek muscle, making caliper measurement unreliable. Furthermore, caliper measurements of small tumors tend to fluctuate together over the course of the experiment, suggesting other contextual human influence on the recorded measurements (Figure 3F, expanded in Supplementary Figure 3C). To investigate this, we fitted a function to predict caliper volume from the CT volume for each mouse (Figure 4A) and used these fits to regress out the effect of the actual tumor volume on the caliper measurement. The resulting residual measured volumes (Figure 4B) tended to be higher on some days than others, and, in fact, correlated strongly with the average total tumor volume (Figure 4C), suggesting a tendency to inflate reported volumes of small tumors when large tumors are present.

It is, in some sense, rational to inform the measurements with outside information, as the presence of large tumors may more accurately predict when the small tumors will start to regrow than observations of the small tumors themselves. However, it is obviously problematic from a data analysis perspective. The range of caliper volumes reported for tumors whose actual volumes were likely near zero (Figure 4A) introduces an opportunity for the measurer to substantially bias the average volume reported for a group of mice, which might falsely confirm an incorrect hypothesis.

Notably, the measurer had direct access to measurements from previous days when recording each new set, as is common practice. This reduces the probability of reporting clearly erroneous measurements but means that the data are not truly independent between timepoints. Tumor growth curves (Figure 3F) are likely smoothed to appear more biologically plausible through the influence of previous measurements. One undesired consequence of this can be seen in the tumor growth rates calculated by comparing consecutive volume measurements from each method (Figure 4D). In this experiment, several tumors spontaneously and unexpectedly shrank. The caliper measurer noticed this but was vocally skeptical of the negative growth rates, remeasured, and ultimately recorded fewer and less dramatic reductions in tumor volume than the automatic segmentation results eventually confirmed.

Effects of tumor shape on caliper data

We considered the tendency to record similar consecutive measurements of the same tumor as a possible contributor to the observed discrepancy between Figure 2C and Figure 3A (plotted together in Supplementary Figure 3A) but chose to investigate an alternative explanation based on differences in tumor shape. The tumors measured for Figure 2 were derived from the MOC2 cell line and mostly untreated, whereas the tumors measured for Figure 3 were derived from the P029 cell line and treated with radiotherapy. It was therefore plausible that caliper measurements tended to overestimate MOC2 tumor volumes and underestimate P029 tumor volumes because of a difference in shape. To investigate this, we computed the eigendecomposition of the covariance matrix of the coordinates of each voxel of tumor relative to its centroid (essentially principal component analysis). A convenient geometric interpretation of this is that, for a perfectly ellipsoidal tumor, the volume would be proportional to the product of the square roots of the eigenvalues. For these tumors, the two largest eigenvalues would roughly correspond to the distances measured by calipers (Figure 4E, Supplementary Figure 3B). Caliper volumes were not more strongly correlated with volumes from the ellipsoid approximation than with the volume from voxel counting overall (Supplementary Figure 4A), but the correlation was somewhat stronger for small tumors (Supplementary Figure 4B), suggesting that reported caliper measurements were not related to the size of the tumor in the dimensions measured, but missing information about the thickness of the tumor makes Eqn. 1 a poor approximation to the volume.

To compare tumor shapes between the two experiments, we “normalized” the eigenvalues by taking the square root, then dividing by the sum of the square roots (Figure 4F, G). MOC2 tumors tended to be more elongated than P029 tumors of similar volumes, having relatively smaller largest eigenvalues and larger middle eigenvalues. This is consistent with the higher caliper volumes reported in Figure 2B, because the middle eigenvalue approximately corresponds to the shorter caliper distance (top panel of Supplementary Figure 3B), which is squared in Eqn. 1. While this application of tumor shape analysis is somewhat mundane, it demonstrates a critical strength of tumor measurement by image segmentation: in an experiment where some tumors were treated or genetically modified in a way that affects their shape, caliper measurements might detect this as a difference in tumor volume, or simply miss the important effect. With segmented scans of each mouse, we can test new hypotheses about tumor shape and location post-hoc. For example, tumor location varies within experiments (Supplementary Figure 4C), as does the fraction of a tumor that could fit within a sphere of the same volume (Supplementary Figure 4D), but the MOC2 and P029 tumors show similar distributions in these metrics (Supplementary Figure 4E, F), suggesting that the elongation of P029 tumors may not be due to increased invasion into the neck.

Automatic image segmentation improves tumor growth-curve analysis

We hypothesized that a primary benefit of more accurate tumor volume data would be more fruitful analysis of tumor growth dynamics. Growth curves for three mice are shown with in Figure 5A with 3D renders of the corresponding tumors, showing deleterious eeffects of tumor shape on caliper measurement. In the first example, the tumor was very flat, causing the caliper measurer to miss it entirely (Figure 5A, left). The next tumor is of typical shape and exemplifies the seemingly random fluctuations in caliper volume at early timepoints, followed by significantly underestimated growth rate at larger at late timepoints, observed for most mice (Figure 5A, center). Finally, the third tumor did not respond as completely as most to radiotherapy, which is apparent from high CT volume as early as day 18, by not from caliper data until day 38 (Figure 5A, right).

To demonstrate the utility of more accurate tumor growth curves, we developed a simple model for the growth dynamics of tumors in these experiments, where all tumors were treated with x-ray radiotherapy (XRT):

This includes a gaussian to model the initial peak around the time of XRT, a logistic curve to model the regrowth phase, and a constant term to improve fitting to the caliper dataset (Figure 5B). This appears to capture the most important characteristics of each curve in a small number of easily interpretable parameters (Figure 5C). We hypothesized that tumor volumes during and shortly after the time of XRT might be predictive of the parameters f (the fractional rate of regrowth) and g (the time of regrowth) characterizing the eventual regrowth phase. In CT-measured curves, we found that the average volume 8-20 days after tumor implantation negatively correlated with f and g (Figure 5D, E). Similar trends may be present in the caliper data as well, but less clearly. Correlation statistics are given in Table 1. Interestingly, there is also a negative correlation between f and both the tumor’s sphericalness (Supplementary Figure 4G) and a (tumor volume at the time of XRT) (Supplementary Figure 5A), but a positive correlation between c (time for the tumor to shrink after XRT) and g (Supplementary Figure 5B). Both a and c contribute positively to the averaged tumor volumes reported (Figure 5D, E), yet the correlation between c and g is in the opposite direction, suggesting that the gaussian fit parameters may tease apart two separate effects: tumors that shrink slowly after XRT also take longer to enter the regrowth phase, and tumors that grow faster before XRT regrow more slowly. Similar correlations are also observed with the tumor shape metrics at early timepoints (Supplementary Figure 4G, Supplementary Figure 5C, D). These findings require validation and explanations beyond the scope of this paper, but demonstrate how more accurate tumor measurement can reveal biologically interesting effects that would otherwise go unnoticed.

Table 1. Statistics for correlations involving growth curve fitting parameters

In the clinic, patients’ response to therapy is often defined radiographically based on fractional changes in tumor volume. Automatic image segmentation therefore provides more comparable assessment of response to therapy in mice. Segmented CT scans are also used extensively in medical physics for Monte-Carlo simulations to determine the spatial distribution of radiation dose. Despite access to the equipment and software to perform these simulations for mice, we typically do not, because treatment planning time limits the number of mice that can be treated. Automating the segmentation could significantly improve this situation, allowing for more accurate radiation delivery tailored to the specific geometry of each tumor.

As most clinical trials fail to impact patient outcomes ¹², the translational value of preclinical models for human disease is increasingly questioned. Particularly in head and neck carcer, where immunotherapy and targeted therapy trials have uniformly failed ¹³, there is a need for preclinical research to better predict the effects of drugs in human patients, which could partially be addressed by increasing the quality and quantity of data obtained from each mouse. The method of buccal tumor measurement described here contributes to this, as does detection of lung metastases, which the same scans were also used for. Software for automatic segmentation of lung tumors in CT images has been described many times in the literature ^3,14−17, suggesting a promising future direction.

Automating tumor measurement not only improves the accuracy and reliability, but reproducibility of results. Attempts to replicate preclinical cancer biology experiments are often undermined by inadequate description of methods, and find smaller effect sizes than were originally published in the vast majority of cases ¹⁸, pointing to widespread problems in the field. Small animal CBCT imaging is available at many universities, either for image-guided radiotherapy or in stand-alone micro-CT machines. Automating the analysis of CBCT images could not only encourage more extensive use of this technology, but standardize quantification for absolute comparisons of tumor growth between experiments conducted at different institutions. This has the potential to reduce the cost in animal lives of preclinical oncology experiments, while improving their translational value to medicine.

While the segmentation method described here can only be used to measure tumors that grow on one side of the head or neck, it is notable that the scanning method and much of the code would also be applicable to measurement of lung and flank tumors and perhaps other locations where tumors can be readily identified in CT images. A simpler extension of the existing code might be to essentially register the image of each mouse with a different scan rather than it’s mirror image to eliminate the need for tumors to be confined to one side of the head.

One limitation of the current manuscript is the small number of tumor models examined. Different implantation sites, cell lines, and treatments may give somewhat different results. Our findings would be applicable to a broader range or preclinical oncology experiments if we had developed an automated method to measure flank tumors, used to study many types of cancer. It is likely that the caliper method is more accurate for measuring tumors in the flank than the buccal, because it is less likely that the tumor will be obscured by normal tissue. Our software could also likely be improved to give more accurate results with further development of various modules. For example, we did not incorporate any information about soft tissue anatomy, which might be used to make better estimates of tumor depth or more accurately locate the midline when the neck is bent. The accuracy of our tumor segmentations is also unavoidably limited by the imaging modality. Fairly low-quality CBCT scans were used to facilitate quickly imaging large numbers of mice, but high-resolution MRI or contrast agents it could, in principle, be used to generate more accurate segmentations by incorporating information from soft tissue contrast.

Despite these technical challenges, our software is already able to dramatically outperform the caliper method, particularly in the measurement of small tumors, and this has several clear practical applications for preclinical HNC experiments. One is to randomize groups after implantation but prior to treatment to ensure initially similar tumor volume distributions. Another is to compute tumor volume fold-changes; it could be useful to compare relative changes in tumor volume from onset of treatment, but this clearly would not be appropriate for the data shown in Fig. 3F, because the volumes from caliper measurement at the time of XRT were unrealistically high and appear to be very weakly correlated with the actual volumes of the tumors. Tumor monitoring by automated image segmentation opens the door to more sophisticate analysis of tumor growth, such as shape comparisons and fitting mathematical models, and has great potential to positively impact how preclinical HNC experiments are conducted.

Animal Models

All methods were performed in accordance with the relevant guidelines and regulations. Animal procedures were conducted in accordance with protocols approved by the University of Colorado, Anschutz Medical Campus institutional animal care and use committee (IACUC). Some mice were euthanized due to tumor volumes exceeding 1000 mm³ or weight loss exceeding 5% of body weight per day over 2–3 days, or abnormal breathing due to lung metastases. Others were euthanized without reaching these endpoints because of tumor ulceration. The euthanasia method employed was carbon dioxide inhalation followed by cervical dislocation, consistent with the American Veterinary Medical Association’s guidelines for the euthanasia of animals. All experiments are reported in accordance with ARRIVE guidelines¹⁹.

The images analyzed for this manuscript were collected for experiments designed to explore roles of ephrinB2/EphB4 signaling in head and neck cancer. The effects of these proteins on tumor growth are, however, beyond the scope of this manuscript, so pooled data from multiple conditions are presented without identification. All analyses include all mice from the experiment or experiments presented. C57BL/6 wild-type or EfnB2^fl/flTie2-Cre-ERT (genetically modified C57BL/6) mice implanted with either 100,000 MOC2 ²⁰ or 50,000 P029 ²¹ cells in the right buccal. All mice were approximately 3 months old at the time of tumor implantation and female except for data from 25 males included in Fig. 3A-E and Fig. 5D, E. The MOC2 cell line was obtained from Dr. Ravindra Uppaluri (Dana-Farber Cancer Institute, Boston, MA) ²², the P029 cell line from the Xiao-Jing Wang lab (University of Colorado, Anschutz Medical Campus). Genetic manipulations (to knock down/out EphB4) of the MOC2 and P029 cell lines were performed by the University of Colorado Cancer Center Functional Genomics Facility. MOC2 cells were transfected with PX458 control plasmid or PX458 containing gRNA targeting EPHB4 and CRISPR knockout (The same cells were used in Bhatia, et al., 2022) ²². P029 cells were transduced with shRNA targeting murine EphB4 or non-specific shRNA. Some mice were also treated with TNYL-RAW-Fc (EphB4 inhibitor) or PCDNA3 (control) plasmids, as in Bhatia et al., 2019 ²³, except that two doses of plasmid were administered prior to tumor implantation. The procedure for implantation of buccal tumors was previously described in Oweida, et al. 2019 ²⁴. All tumors were treated with 8Gy × 3 XRT at 8-, 11-, and 14-days post implantation, except where otherwise noted.

Automatic buccal tumor segmentation

The program begins by locating the teeth, which were easily identified because of their high density, and defining labels and points for front, bottom, and back teeth, which can be reliably identified in nearly all scans (Supplementary Fig. 2C). Using this as a starting point, bone was then similarly segmented to identify structures such as the jaw, cranium, and shoulders (Fig. 1E). A set of line segments connecting points at the midline and corresponding points on the left and right sides of the skeleton in the original scan were then mapped to line segments of a structure that can bend at intersections to fit the position of the mouse while preserving the lengths of the line segments (Supplementary Fig. 2D). A local resampling grid to rotate and translate part of the source scan into the orientation shown in Fig. 1F is calculated for each line segment, then these are stitched together to form a single curvilinear resampling grid. Supplementary Fig. 3E shows center slices of a 3-dimensional checkerboard pattern resampled in the same way to demonstrate how the original image was warped.

Because the voxels of the resampled scan may represent slightly unequal volumes of real space, it was necessary to calculate these volumes. Supplementary Fig. 2F shows map of maximum volume represented by a voxel along three directions through the initial resampling grid. Red areas of these images show that the shoulders were compressed to span a standardized distance. The sampling grid is then adjusted by gradient descent, registering the head with its own mirror image to sub-pixel accuracy using a loss function that also penalizes curvature, but slightly expands or compresses some voxels (Supplementary Fig. 2G). To ensure that all real space was counted exactly once, the volume represented by each voxel of the resampled image was calculated from volumes of a flexible, space-filling arrangement of 24 tetrahedra per voxel (Supplementary Fig. 2B).

Scan Pre-Processing

Once scans were obtained, they were processed to ensure that the measurement of the head was unabetted by other objects in the scan. For both three- and five-mouse scans, a coordinate system was established for consistent analysis (Supplementary Fig. 2A). Processing of the exported scans generated a series of single-mouse scans.

For source scans contain up to three mice in a single row in the XY plane, scan splitting begins by finding the acrylic platform by first taking the means of 100 evenly spaced columns of voxels to determine where mice were not resting on the bed. Along these columns, the “bed point” is identified as the z-value where the largest difference in voxel intensity is observed and annotated in combination with the associated x and y values. From these points, a plane was calculated and the voxels in the area below the plane were assigned the same intensity value as air, effectively removing it from the scan. Next, two measurements were used to separate the mice in the scan. The first was the mean voxel intensity across columns of voxels aligned with the z-axis to model where mice rest along the x-axis. The second found the teeth at the maximum voxel intensity along the same columns, as the teeth are generally the highest density points in the scan. Each of these methods were tested against the number of mice expected to be present in the image to ensure splitting in an appropriate number of locations. If one measurement was unreliable, the scans were split at the minimum values of the mean intensity curve or on either side of the teeth. Otherwise, a combination of these measurements was used to ensure that no excess voxels were included in the final one-mouse scans.

For five-mouse scans, split points were located along the z-axis as well as the x-axis. Bed heights were estimated using the same method as the three mouse scans. However, three different height ranges were found for each of the three platforms, allowing them to be sorted into bottom, middle, and top platform points. These were used to determine each platform location.

The tooth-finding for five-mouse scans also differed from that of three-mouse scans. The maximum voxel intensity of XY slices was obtained along the z-axis, finding the heights of each row of mice to split the scan in the z-axis. Along the z-axis, mice were split either just below the maximum tooth point in each row of mice, or just above the platform supporting the row of mice. Along the x-axis, the lower two rows were split in the middle of the scan, due to the consistency of volume scanning and the ample distance between the two mice in these rows (Fig. 1C).

Once the scans were split, they were processed further to eliminate unwanted objects in each scan (Fig. 1D). The sum of tissue-density voxels in each scan was then summed to determine whether a mouse was present in the split scan. If the scan was empty, it was not processed any further. The Python package Connected Components 3D was used to segment individual mice and remove the areas of mouse that were not connected to the largest mouse in the scan. This method was also used to remove the anesthesia nosecone. One-mouse scans originating from five-mouse scans required additional processing to remove the sides of the mouse-holders. This was done by fitting a plane to points found on the plastic sides above the mouse in each scan, in a process similar to the bed-removal. The isolated scan of each mouse was then automatically adjusted based on the histogram of voxel values to ensure numerically similar values correspond to air and soft tissue before further analysis.

Code Architecture

All code was written in Python, using primarily the standard scientific computing packages NumPy, SciPy, Pandas, and Matplotlib, as well as NiBabel and Pydicom for reading and writing image files, and Connected Components 3D to analyze connected components of binary images. The overall architecture of the program is described below and in Supplementary Fig. 6.

CBCT scans were processed using a series of modules written in Python, which integrate a Microsoft Excel spreadsheet containing scan locations and information as a user interface. These modules can optionally return individual one-mouse scan, aligned scan of just the head, segmentation of structures including tumor, log of any processing errors or warnings, mouse ID numbers, tumor volumes, dates, locations of source and generated files, and debugging information.

The code was separated into 7 modules. Two provided the necessary foundation for working with CT scans. The first of these, “voxelhelp,” contained several functions designed to help with general 3D image processing and visualization. The next, “xradct” defined classes specific to one-, three-, and five-mouse scans obtained using the XRAD SmART and annotation and segmentation objects to create further ease-of-use for these types of scans.

The next two modules performed the bulk of processing for each scan to separate each mouse in the scan and calculate tumor volume. The module “scan_processing” split raw three- and five-mouse scans to produce cleaned-up one-mouse images with components of the mouse holder and anesthesia setup removed, which were used by the “head_segmentation” module to locate anatomical features and segment the image.

The final three modules were for organization and containment of the previous four. The “task_controller” module used the scan information from the input spreadsheet to split each scan and segment the new split scans. The “run” module provided a cleaner interface to run the code from. Finally, and additional module called “process_pool_controller” can act as a wrapper for the “task_controller,” allowing multithreaded execution so that many scans can be analyzed simultaneously, or provide an alternative interface to the other modules.

CBCT Imaging

The XRAD-SmART irradiator is equipped to scan a cylindrical volume up to approximately 10cm x 10cm. We used these capabilities to perform longitudinal monitoring of tumor growth and to obtain qualitative data on tumor size. Mice were anesthetized using 1–2% isoflurane concentration and placed in the XRAD-SmART in the prone position with a small in-machine nosecone to remain under anesthesia. For imaging buccal tumors, the front limbs were normally swept back behind the shoulders, to avoid possible issues with the image segmentation (Fig. 1B, Supplementary Fig. 1D). A brief fluoroscopy was performed in some cases to ensure correct positioning of all mice in the scannable volume, followed by a preliminary, low-resolution “scout” CBCT scan to select the volume to be reconstructed for subsequent images. CBCT scans were acquired using either a high-dose (.1 mm voxels) or low-dose (.2 mm voxels) scanning presets with 80 kVp or 60 kVp x-rays filtered through 0.8mm of Beryllium and 2 mm of aluminum. Each mouse was then removed from the machine and returned to its cage to recover while the computer finished reconstructing the image. Scans were exported from the Pilot XRAD 1.18.3 software as DICOM files for analysis. For contrast-enhanced imaging (Fig. 2F), 200 µL of iohexol were injected by tail vein 15 minutes prior to imaging.

To allow for simultaneous imaging of three mice at a time, we modified the in-irradiator anesthesia system by splitting the isoflurane tube into several smaller ones and attached them to a wide acrylic platform (Supplementary Fig. 1D). However, given that each cage in our vivarium can host up to five mice, we reasoned that scanning five mice at once would significantly increase efficiency for larger studies, as this allows for each cage to be scanned individually. To procure these scans, we designed a device to hold and anesthetize five mice. A 3D-printed frame holds small pieces of plastic in the shape of three half-pentagon slots for three mice. This assembly is then suspended above two more mice below additional 3D-printed objects glued to the acrylic platform. The five-mouse holder was designed to fit within the 10cm diameter of the cylindrical scanning volume and hold all five mice at approximately the same distance from the gantry’s axis of rotation, which is important because image quality varies with this distance. Mice were first anesthetized in the induction chamber, then the lower two were positioned prone with their noses to the anesthesia tubes. The top level of the device is placed over the lower two mice, and the remaining mice are positioned in the three upper holders in the same position as the lower two. Mice were scanned using the lower resolution preset to decrease scan processing time.

Timing of CBCT scans and caliper measurements

CBCT scans were timed using a combination of stopwatch data and timestamps associated with the beginning of each scan. Caliper measurements were timed using a stopwatch. The times associated with the setup of either measurement method were found to be similar, but varied enough that they were omitted from the total time data.

Curve Fitting

Growth curves shown in Fig. 5A were generated by fitting second order splines using scipy.interpolate.UnivariateSpline. To mitigate potential effects of caliper and CT data having been collected on different days, we also used these splines to resample the growth curves at 10 points 8–20 days post implantation, and averaged these values to obtain the tumor volumes reported in Fig. 5D. Fitting of Eq. 2 to volume data was performed using scipy.linalg.lstsq().

Statistical Analysis

All correlation coefficients were calculated using the SciPy functions scipy.stats.pearsonr() and scipy.stats.spearmanr(). One way ANOVA tests comparing tumor volumes were performed using GraphPad Prism (results not shown) for the experiments described in Figs. 3–5, revealing no statistically significant differences between groups.

Acknowledgements

Figure 1B and Supplementary Figure 1D were drawn by BN for this manuscript.

Author contributions:

Conceptualization: BVC, BN, SK

Methodology: BVC, BN

Software: BVC, BN

Investigation: DN, BN, BVC, MK

Visualization: BVC, BN

Supervision: SK

Writing—original draft: BVC, BN

Writing—review & editing: BVC, SK, RR

Data and materials availability:

Image analysis code will be available on GitHub. Tumor volume data are included in supplementary Excel files. CBCT scans are available from the corresponding author upon request.

Competing Interests Statement

Dr. Karam receives clinical funding from Genentech, Ionis, and AstraZeneca and preclinical funding from Roche for work unrelated to this manuscript. All other authors declare they have no competing interests.

Jubelin, C. et al. Three-dimensional in vitro culture models in oncology research. Cell & Bioscience 12, 155 (2022).
Liu, X. et al. Tumor-on-a-chip: from bioinspired design to biomedical application. Microsystems & Nanoengineering 7, 50 (2021).
Holbrook, M. D. et al. Detection of lung nodules in micro-CT imaging using deep learning. Tomography 7, 358-372 (2021).
Schmidt, K. F. et al. Volume reconstruction techniques improve the correlation between histological and in vivo tumor volume measurements in mouse models of human gliomas. Journal of neuro-oncology 68, 207-215 (2004).
Kirsch, D. G. et al. Imaging primary lung cancers in mice to study radiation biology. International Journal of Radiation Oncology* Biology* Physics 76, 973-977 (2010).
Montelius, M., Ljungberg, M., Horn, M. & Forssell-Aronsson, E. Tumour size measurement in a mouse model using high resolution MRI. BMC medical imaging 12, 1-7 (2012).
Brodin, N. P. et al. Semi-automatic cone beam CT segmentation of in vivo pre-clinical subcutaneous tumours provides an efficient non-invasive alternative for tumour volume measurements. The British Journal of Radiology 88, 20140776 (2015).
Jensen, M. M., Jørgensen, J. T., Binderup, T. & Kjær, A. Tumor volume in subcutaneous mouse xenografts measured by microCT is more accurate and reproducible than determined by 18F-FDG-microPET or external caliper. BMC medical imaging 8, 1-9 (2008).
O'Neill, K., Lyons, S. K., Gallagher, W. M., Curran, K. M. & Byrne, A. T. Bioluminescent imaging: a critical tool in pre‐clinical oncology research. The Journal of Pathology: A Journal of the Pathological Society of Great Britain and Ireland 220, 317-327 (2010).
Holman, L., Head, M. L., Lanfear, R. & Jennions, M. D. Evidence of experimental bias in the life sciences: why we need blind data recording. PLoS biology 13, e1002190 (2015).
Macleod, M. R. et al. Evidence for the efficacy of NXY-059 in experimental focal cerebral ischaemia is confounded by study quality. Stroke 39, 2824-2829 (2008).
Seyhan, A. A. Lost in translation: the valley of death across preclinical and clinical divide–identification of problems and overcoming obstacles. Translational Medicine Communications 4, 1-19 (2019).
Hayes, D. N., Gleysteen, J. P. & Schwartz, D. L. Vol. 40 1967-1970 (Wolters Kluwer Health, 2022).
Montgomery, M. K. et al. Mouse lung automated segmentation tool for quantifying lung tumors after micro-computed tomography. PLoS One 16, e0252950 (2021).
van de Worp, W. R. et al. Deep learning based automated orthotopic lung tumor segmentation in whole-body mouse CT-scans. Cancers 13, 4585 (2021).
Namati, E. et al. Longitudinal assessment of lung cancer progression in the mouse using in vivo micro‐CT imaging. Medical physics 37, 4793-4805 (2010).
Barck, K. H. et al. Quantification of tumor burden in a genetically engineered mouse model of lung cancer by micro-CT and automated analysis. Translational oncology 8, 126-135 (2015).
Errington, T. M. et al. Investigating the replicability of preclinical cancer biology. Elife 10, e71601 (2021).
Percie du Sert, N. et al. The ARRIVE guidelines 2.0: Updated guidelines for reporting animal research. Journal of Cerebral Blood Flow & Metabolism 40, 1769-1777 (2020).
Judd, N. P. et al. ERK1/2 regulation of CD44 modulates oral cancer aggressiveness. Cancer Res 72, 365-374, doi:10.1158/0008-5472.CAN-11-1831 (2012).
Aleman, J., Nguyen, K. A., Ke, Y., Young, C. D. & Wang, X. J. in Proceedings: AACR Annual Meeting 2022. (American Association for Cancer Research).
Bhatia, S. et al. EphB4 and ephrinB2 act in opposition in the head and neck tumor microenvironment. Nat Commun 13, 3535, doi:10.1038/s41467-022-31124-7 (2022).
Bhatia, S. et al. Inhibition of EphB4-Ephrin-B2 Signaling Reprograms the Tumor Immune Microenvironment in Head and Neck Cancers. Cancer Res 79, 2722-2735, doi:10.1158/0008-5472.CAN-18-3257 (2019).
Oweida, A. J., Bhatia, S., Darragh, L., Serkova, N. & Karam, S. D. Intramucosal inoculation of squamous cell carcinoma cells in mice for tumor immune profiling and treatment response assessment. JoVE (Journal of Visualized Experiments), e59195 (2019).

Competing interest reported. Dr. Karam receives clinical funding from Genentech, Ionis, and AstraZeneca and preclinical funding from Roche for work unrelated to this manuscript. All other authors declare they have no competing interests.

Download PDF

Journal Publication

published 25 Jul, 2023

Read the published version in Scientific Reports →

Editorial decision: Major revision
19 Jun, 2023
Reviews received at journal
14 Jun, 2023
Reviews received at journal
14 May, 2023
Reviewers agreed at journal
13 May, 2023
Reviewers agreed at journal
10 May, 2023
Reviewers invited by journal
10 May, 2023
Editor assigned by journal
10 May, 2023
Editor invited by journal
10 May, 2023
Submission checks completed at journal
10 May, 2023
First submitted to journal
28 Apr, 2023

You are reading this latest preprint version

Measurement of Mouse Head and Neck Tumors by Automated Analysis of CBCT Images

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Animal Models

Automatic buccal tumor segmentation

Scan Pre-Processing

Code Architecture

CBCT Imaging

Timing of CBCT scans and caliper measurements

Curve Fitting

Statistical Analysis

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1