The impact of deep learning reconstruction in low dose computed tomography on the evaluation of interstitial lung disease

doi:10.21203/rs.3.rs-1951749/v1

To evaluate the effect of the deep learning model reconstruction (DLM) method in terms of image quality and diagnostic efficacy of low-dose computed tomography (LDCT) for interstitial lung disease (ILD), 193 patients who underwent LDCT for suspected ILD were retrospectively reviewed. Datasets were reconstructed using filtered back projection (FBP), adaptive statistical iterative reconstruction Veo (ASiR-V), and DLM. For image quality analysis, the signal, noise, signal-to-noise ratio (SNR), blind/referenceless image spatial quality evaluator (BRISQUE), and visual scoring were evaluated. Also, CT patterns of usual interstitial pneumonia (UIP) were classified according to the 2022 idiopathic pulmonary fibrosis (IPF) diagnostic criteria. The differences between CT images subjected to FBP, ASiR-V 30%, and DLM were evaluated. The image noise and BRISQUE scores of DLM images was lower and SNR was higer than that of the ASiR-V and FBP images (ASiR-V vs. DLM, p < 0.001 and FBP vs. DLR-M, p < 0.001, respectively). The agreement of the diagnostic categorization of IPF between the three reconstruction methods was almost perfect (κ = 0.992, CI 0.990–0.994). Image quality was improved with DLM compared to ASiR-V and FBP. The diagnostic efficacy was maintained in the CT pattern diagnosis of UIP in DLM.

image reconstruction

deep learning model reconstruction

low dose computed tomography

interstitial lung disease

idiopathic pulmonary fibrosis

A precise diagnosis of interstitial lung disease (ILD) is crucial for selecting appropriate treatment candidates^1,2. High spatial resolution computed tomography (CT) with thin sections is ideal for evaluating ILD^3,4; however, due to concerns about the malignancy risk associated with cumulative exposure to radiation, low-dose chest CT (LDCT) is the preferred modality for conducting follow-up examinations and determining disease progression. However, reducing the radiation dose compromises image quality, which may affect the detection of subtle parenchymal changes^5,6 and thus, further improvements are needed.

To overcome the shortcomings of LDCT, image reconstruction methods after image acquisition can be used to reduce noise, thereby improving image quality and potentially enhancing the diagnostic value⁷. Iterative reconstruction (IR) algorithms are the most widely used CT noise-reduction methods as an alternative to the conventional reconstruction mode based on filtered back projection (FBP)^8–10. However, IR algorithms are typically nonlinear and task specific, which may modify the spatial resolution and image noise texture¹¹.

Deep learning technology has been reported to exhibit excellent performance in various fields of medical imaging^10,12−15. Deep learning model reconstruction (DLM) has strength in that it does not require simplification of parameters and can handle millions of parameters¹⁶. A few studies have evaluated image quality and noise in DLM^{10, 13–15,17}; however, to the best of our knowledge, none has specifically investigated its diagnostic impact on ILD.

This study evaluated the effect of the DLM method in terms of quantitative and qualitative image quality and diagnostic impact of DLM in terms of classifying CT pattern in ILD.

Basic characteristics of the participants and radiation dose. Of the 193 patients included in the current study, 141 were men and 52 were women, with a mean age of 68.95 ± 9.39 years (range 36–88 years). A total of 93 patients (48.2%) were diagnosed with IPF based on the diagnostic criteria of the American Thoracic Society and European Respiratory Society¹⁸, 55 patients (28.5%) were diagnosed with connective tissue disease related ILD, and 19 patients (9.8%) were diagnosed with interstitial lung abnormality. Organizing pneumonia was diagnosed in six patients (3.1%), followed by smoking related ILD in six (3.1%), nonspecific interstitial pneumonia in four (2.1%), and pleuroparenchymal fibroelastosis in four patients (2.1%). Remaining six patients (3.1%) had other diagnoses (for example, chronic hypersensitivity pneumonitis, post inflammatory fibrosis, sarcoidosis, hemosiderosis). Lung biopsy was performed on 38 patients, either wedge resection (34 patients) or transbronchial lung biopsy (4 patients). Of these, 17 were pathologically confirmed to have UIP on surgical lung biopsy.

As for the radiation dosage in LDCT, the mean CTDI_vol, DLP, and effective dose were 1.96 ± 0.03 mGy, 70.32 ± 5.82 mGy*cm, and 0.98 ± 0.08 mSv, respectively.

Comparison of imaging reconstruction methods. Signal, noise, and SNR results. A comparison of the image signal, noise, and SNR measurements is shown in Fig. 1A-F. The image signal of the lung parenchyma did not significantly differ across FBP, ASiR-V, and DLM; however, the mean background air signal of ASiR-V images was lower than that of FBP and DLM images (FBP vs. ASiR-V, ASiR-V vs. DLM, all p < 0.001). The other parameters, including noise and SNR, differed significantly (Fig. 2). Image noise was significantly higher in FBP and significantly lower in DLM, both in the lung parenchyma and background air (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). The SNR in the lung parenchyma and background air significantly differed across the three different reconstructions, and it was higher in the DLM and lower in the FBP (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001).

BRISQUE score results. The BRISQUE scores of the three reconstructed images are depicted in Fig. 1G. The BRISQUE scores of both ASiR-V and DLM were lower than those of the FBP images (all p < 0.001). The difference between ASiR-V and DLM was statistically significant (p < 0.001), and DLM showed a lower BRISQUE score. The difference between the ASiR-V and FBP images was approximately 1 point, whereas the difference between the DLM and FBP images was approximately 10 points.

Visual scoring by thoracic radiologist. The results of the qualitative image analysis are presented in Table 1. In the evaluation of the image contrast, DLM had highest mean points (4.49 ± 0.55), showing above average to excellent contrast, followed by ASiR-V (3.93 ± 0.51) and FBP (3.71 ± 0.59). The scores significantly differed among the three methods (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). In terms of image noise, DLM had highest mean points (3.73 ± 0.57), showing average to below average noise, followed by ASiR-V (3.20 ± 0.80), FBP (2.52 ± 0.69), and scores significantly differed between three methods (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). For overall image quality, the DLM images yielded the highest score, which was slightly inferior to the best image quality. The scores between ASiR-V and DLM, ASiR-V and FBP, and DLM and FBP significantly differed (FBP vs. ASiR-V, ASiR-V vs. DLM, and FBP vs. DLM; all p < 0.001).

Table 1

Comparison of image quality by visual scoring
Variables	FBP	ASiR-V	DLM	P FBP vs. ASiR-V	P FBP vs. DLM	P ASiR-V vs. DLM
Image contrast	3.71 ± 0.59	3.93 ± 0.51	4.49 ± 0.55	< 0.01	< 0.01	< 0.01
Image noise	2.52 ± 0.69	3.20 ± 0.80	3.73 ± 0.57	< 0.01	< 0.01	< 0.01
Overall image quality	3.98 ± 0.36	4.26 ± 0.49	4.64 ± 0.48	< 0.01	< 0.01	< 0.01
FBP, Filtered back projection; ASiR-V, adaptive statistical iterative reconstruction-Veo at a level of 30%; DLM, deep learning image reconstruction.

Diagnostic categorization of UIP based on CT patterns. On the CT pattern diagnosis of UIP, there was substantial agreement between the two readers (κ = 0.617). According to the reference standard that consisted of a consensus panel of two radiologists, the UIP was the most common diagnosis (56.5% in FBP [109 of 193], 56.0% in ASiR-V [108 of 193], and 56.0% in DLM [108 of 193]), followed by probable UIP (19.2% in FBP [37 of 193], 17.6% in ASiR-V [34 of 193], and 18.1% in DLM [35 of 193]), alternative diagnosis (16.6% in all FBP, ASiR-V, and DLM [32 of 193]), and indeterminate for UIP pattern (7.8% in FBP [15 of 193], 9.8% in ASiR-V [34 of 193], and 9.3% in DLM [18 of 193]) (Table 2).

Table 2

Diagnostic categorization of usual interstitial pneumonia based on computed tomography patterns
Category	FBP (%)	ASiR-V (%)	DLM (%)
UIP	109 (56.5)	108 (56.0)	108 (56.0)
Probable UIP	37 (19.2)	34 (17.6)	34 (17.6)
Indeterminate for UIP	15 (7.8)	19 (9.8)	19 (9.8)
Alternative diagnosis	32 (16.6)	32 (16.6)	32 (16.6)
FBP, filtered back projection; ASiR-V, adaptive statistical iterative reconstruction-Veo at a level of 30%; DLM, deep learning image reconstruction; UIP, usual interstitial pneumonia.

The agreement in diagnostic categorization between the three reconstruction methods was almost perfect (κ = 0.992, CI 0.990–0.994). In the ASiR-V and DLM images, probable UIP was less diagnosed compared to that of FBP (19.2% in FBP [37 of 193], 17.6% in AsiR-V [34 of 193], and 17.6% in DLM [34 of 193]), and indeterminate for UIP was frequently diagnosed (7.8% in FBP [15 of 194], 9.8% in AsiR-V [19 of 193], and 9.8% in DLM [19 of 193]) compared to that of the FBP images (Figs. 3 and 4). One case was diagnosed as UIP in the FBP image but categorized as probable UIP in both the ASiR-V and DLM images (Fig. 5). There were no discrepant cases in the alternative diagnoses between the three reconstruction methods. Cases with discrepant diagnoses among the three reconstruction methods are presented in Table 3.

Table 3

Cases with discrepant diagnosis between three reconstruction methods
Patient Number	FBP	IR	DLM	HRCT diagnosis	Pathology	Final diagnosis
1	UIP	Probable UIP	Probable UIP	Probable UIP	None	IPF
2	Probable UIP	Probable UIP	Indeterminate for UIP	Probable UIP	UIP	IPF
3	Probable UIP	Indeterminate for UIP	Indeterminate for UIP	Probable UIP	None	Interstitial lung abnormality
4	Probable UIP	Indeterminate for UIP	Probable UIP	Probable UIP	None	Connective tissue disease related ILD
5	Probable UIP	Indeterminate for UIP	Indeterminate for UIP	Probable UIP	None	IPF
6	Probable UIP	Indeterminate for UIP	Indeterminate for UIP	Probable UIP	None	IPF
FBP, filtered back projection; ASiR-V, adaptive statistical iterative reconstruction-Veo at a level of 30%; DLM, deep learning image reconstruction; UIP, usual interstitial pneumonia; HRCT, high resolution computed tomography; IPF, idiopathic pulmonary fibrosis.

This study demonstrated that DLM showed favorable results in terms of image noise, SNR, BRISQUE scores, and visual scoring compared to that of the ASiR-V and FBP images. Furthermore, DLM maintained diagnostic efficacy in CT pattern diagnosis of UIP.

The FBP and IR algorithms are widely used in CT image reconstruction. However, both methods have several drawbacks. FBP can severely degrade image quality by increasing image noise, while IR can modify the spatial resolution and noise texture in different regions of CT images^11,19. IR often requires long reconstruction times^11,20. With recent advances in artificial intelligence technology, DLM has been introduced to overcome the limitations of FBP and IR approaches. The DLM incorporates convolutional neural networks into the image reconstruction process, which can be used to generate high-quality images from low-dose projection data in a short reconstruction time in a clinical environment¹².

Recently, several clinical studies have reported that DLM yields a favorable noise texture with superior image quality on low-dose chest CT^15,17,21,22, which is in accordance with our study results. In this study, we applied the BRISQUE score to assess the image quality of the DLM images with LDCT. Unlike SNR, BRISQUE is a no-reference image quality assessment model based on the principle that natural images possess certain regular statistical properties that are measurably modified by distortions²³. In our study, DLM images yielded significantly better image quality in the BRISQUE score in concordance with other image parameters, including noise, SNR, and visual scoring.

ILD is a rare condition characterized by extensive inflammation and fibrosis mainly involving the lungs and IPF, the most common type of ILD, is associated with poor prognosis^2,24. LDCT is widely used to monitor disease progression^5,6. In our study, DLM in LDCT maintained diagnostic efficacy in terms of CT pattern diagnosis of UIP compared to that of IR and FBP. To the best of our knowledge, this study specifically investigated the diagnostic impact of DLM on ILD.

Although diagnostic agreement between three reconstruction methods was almost perfect, there were six cases (3.1%, 6 of 193) of discrepant diagnoses between the three reconstruction methods (Table 3). Four patients were diagnosed with indeterminate UIP in DLM, while probable UIP was diagnosed on HRCT. Of these, three patients were finally diagnosed with IPF based on multidisciplinary diagnosis. While DLM showed better image quality than that of other reconstruction images, the denoising process may have produced some degradation in the small peripheral airway dilatations or subtle reticulation, which may have caused underestimation of lung fibrosis. Similar results were also observed in IR, with four cases being diagnosed as indeterminate for UIP but probable for UIP on HRCT. Although IR has shown favorable results in the detection and assessment of ILD^8,9 it is known to have limitations in that the noise texture often differs from that of traditional FBP images and can negatively affect subjective acceptance and diagnostic confidence, altering noise texture^20,25. In our study, there were cases in which both IR and DLM were suggestive of such results (Figs. 2 and 3). Another possible reason for the discrepant diagnoses is that the reported interobserver agreement in diagnosing UIP pattern is not high ranging 0.40-0.69^3,6,26. The diagnostic agreement between the two radiologists in this study was substantial; however, variability may have influenced the diagnosis.

It is noteworthy that the diagnostic agreement of UIP between the three different reconstructions was almost perfect, except in one case (Fig. 4). Diagnosing the UIP pattern by CT is important, since the radiologic diagnosis is sufficient to secure a diagnosis without lung biopsy in an appropriate clinical contest^27,28. In addition, there were no discrepant cases of alternative diagnoses that may require further diagnostic interventions.

The current study has some limitations. Our investigation was retrospective. Moreover, the study was conducted at a single institution, which may have caused selection bias. Second, complete blinding of the image reconstruction method was not possible because of the unique visual appearance of the FBP, ASiR-V, and DLM images despite the radiologists being blinded to the reconstruction methods and may have caused bias.

In conclusion, the image noise, SNR, BRISQUE, and visual scoring of chest LDCT scan images improved with DLM compared to that with ASiR-V and FBP. DLM may be feasible in clinical practice for evaluating ILD, since diagnostic efficacy is maintained in CT pattern diagnosis of UIP compared to that of ASiR-V and FBP.

Patients. The institutional review board (IRB) of Samsung Medical Center approved this retrospective study (IRB, file number 2021-06-092), and the requirement of informed consent was waived for the use of patient medical data. All methods were performed in accordance with the Declaration of Helsinki. In total, 369 consecutive patients were included between August 2021 and September 2021. A CT scan was performed because of clinical or radiological suspicion of ILD on chest radiography. All patients underwent routine CT evaluation for ILD, which included helical LDCT and standard-dose non-helical high-resolution CT (HRCT). A total of 170 patients without imaging findings suggestive of ILD (for example, non-dependent ground-glass opacity or reticular abnormalities, non-emphysematous cysts, honeycombing, and traction bronchiectasis)²⁹ were excluded. Six patients were excluded due to suboptimal image quality. Finally, 193 patients were included in this study.

CT acquisition and image reconstruction. All CT images were obtained using a multidetector CT scanner (Revolution Frontier, GE Healthcare) under the LDCT protocol without the use of contrast. The protocols consisted of a fixed tube current of 20 mAs per slice (40 mA with a half-second rotation and 0.984 pitch). Slice thickness of 1.25 mm and high-spatial-frequency algorithm were applied. The chest CT protocol used the helical mode with the following parameter: 1.25 mm × 64 detector configuration. The other parameters were as follows: peak tube voltage of 120 kVp, 40-mm table feed per gantry rotation, pitch of 0.984:1, and z-axis tube current modulation. Supine inspiratory HRCT scans of all patients were obtained without intravenous contrast using the same CT scanner. The protocol consisted of sections reconstructed with a high-spatial-frequency algorithm at 1- or 2-cm intervals under automatic exposure control (142–275 mA with dose modulation) with a slice thickness of 1.25 mm, from apex to base.

Three different reconstructions of the LDCT images were obtained: conventional FBP, adaptive statistical iterative reconstruction-Veo at a level of 30% (ASiR-V), and DLM. All scan data were directly displayed on the picture archiving and communication systems (PACS) (Centricity 3.0, GE Healthcare) workstation monitors. Images were viewed on monitors in lung (width, 1500 HU; level, -700 HU) window settings. To assess radiation exposure in LDCT, we reviewed the CT dose index (CTDIvol) and dose-length product (DLP) recorded as digital imaging and communications in medicine data. The estimated effective dose was calculated as the DLP multiplied by a k-factor of 0.014 mSv·mGy^− 1·cm^− 130.

Deep learning reconstruction model. The deep learning model reconstruction (DLM; ClariCT.AI, ClariPi)¹³ was developed as a denoising solution using a U-Net-based convolutional neural network. Details are summarized in Supplementary material (Supplementary Fig. S1 and S2).

Image quality analysis. The performance of the image reconstruction methods was evaluated both quantitatively and qualitatively for each case, reconstructed using three different methods (FBP, ASiR-V, and DLM). Quantitative analysis was performed using signal, noise, signal-to-noise ratio (SNR), and blind/referenceless image spatial quality evaluator (BRISQUE) score. For qualitative analysis, a thoracic radiologist visually scored the images.

Signal, noise, and signal-to noise ratio. Standardized 20-mm-diameter circular regions of interest were used to record signal and noise, which represented the mean pixel intensity value and standard deviation of pixels for the lung parenchyma, and background air for LDCT scans in FBP, ASiR-V, and DLM image sets²⁵. Lung measurements were obtained from the lower lobes towards the periphery to avoid parenchymal lesions. Background air was obtained from the air external and anterior to the patient at the sternomanubrial junction³¹. The signal-to-noise ratio (SNR) was calculated for all three image sets. SNR was calculated as follows:

𝑆𝑁𝑅_{𝑏𝑎𝑐𝑘𝑔𝑟𝑜𝑢𝑛𝑑 𝑎𝑖𝑟}= |(𝑠𝑖𝑔𝑛𝑎𝑙_{𝑏𝑎𝑐𝑘𝑔𝑟𝑜𝑢𝑛𝑑 𝑎𝑖𝑟})/(𝑛𝑜𝑖𝑠𝑒_{𝑏𝑎𝑐𝑘𝑔𝑟𝑜𝑢𝑛𝑑 𝑎𝑖𝑟})|, 𝑆𝑁𝑅_{lung p𝑎𝑟𝑒𝑛𝑐ℎ𝑦𝑚𝑎}= |(𝑠𝑖𝑔𝑛𝑎𝑙_{lung p𝑎𝑟𝑒𝑛𝑐ℎ𝑦𝑚𝑎)}/(𝑛𝑜𝑖𝑠𝑒_{lung p𝑎𝑟𝑒𝑛𝑐ℎ𝑦𝑚𝑎)}|³².

BRISQUE. BRISQUE is a no-reference image quality assessment model that uses natural scene statistics in the spatial domain²³. This model is composed of three steps: 1) extraction of natural scene statistics, 2) calculation of feature vectors, and 3) prediction of the image quality score. We utilized a pre-trained prediction model provided by Mittal et al.²³ to predict the image quality score. The minimum and maximum image scores are 0 and 100, respectively, with a lower image score indicating a better image quality. The potential of BRISQUE as an indicator of medical image quality has been reported previously^23,33,34.

Visual scoring. A radiologist with six years of experience in thoracic imaging performed a qualitative image analysis on a chest CT scan. The radiologist was blinded to the patients’ data and the image reconstruction techniques and examined the images in a random order using PACS. CT scans were graded on axial images with datasets displayed on standard windows, and windowing was allowed, as in routine reporting conditions. The reader randomly assessed the subjective image contrast, noise, and image quality using a five-point visual scoring system²⁰ (Table 4).

Table 4

Visual scoring system used to evaluate image quality
Scale	Contrast	Noise	Overall image quality
5	Excellent	Minimal	Best
4	Above average	Below average	Slight inferior (no influence on diagnosis)
3	Acceptable	Average	Mildly inferior (possible influence on diagnosis)
2	Suboptimal	Above average	Moderately inferior (probable influence on diagnosis)
1	Poor	Unacceptable	Markedly inferior (impairing diagnosis)

Evaluating diagnostic efficacy of ILD. Two thoracic radiologists (reader 1 had 6 years of experience; reader 2 had 15 years of experience) who were blinded to clinical data and image reconstruction techniques independently assessed CT images and determined the radiologic features of usual interstitial pneumonia (UIP), which is a hallmark of idiopathic pulmonary fibrosis (IPF). A classification of ‘UIP’, ‘Probable UIP’, ‘Indeterminate for UIP’, and ‘Alternative diagnosis’ was assigned for each case using the 2022 American thoracic society and Fleischner Society guidelines^4,18,24. The UIP pattern was defined as subpleural, basal predominance of reticular abnormalities, honeycombing with, or without traction bronchiectasis; the absence of findings was suggestive of alternative diagnosis, including extensive ground-glass opacity, micronodules, discrete cysts, mosaic attenuation, or segmental/lobar consolidation²⁴. The two readers formed a consensus diagnosis for each case reconstructed using three different methods (FBP, ASiR-V, and DLM) after independent assessment. After the final diagnosis was made using the three reconstruction methods, cases showing discrepant diagnoses were selected and compared with HRCT findings of the patient, which was considered a reference standard.

Statistical analysis. Image quality of the three reconstruction methods (FBP, ASiR-V, and DLM) were compared using one-way analysis of variance, and post-hoc pairwise comparisons were adjusted for multiple comparisons using the Bonferroni correction. Cohen’s kappa statistics were used to evaluate the agreement between the two readers and the diagnostic agreement on the three reconstruction methods. A kappa statistic of 0.81–1.00 indicates an excellent agreement; 0.61–0.80, substantial agreement; 0.41–0.60, moderate agreement; 0.21–0.40, fair agreement; and 0.00–0.20, poor agreement³⁵. Statistical significance was set at p < 0.05. All statistical calculations were performed using SAS (version 9.4; SAS Institute, Cary, NC, USA) and R (version 3.3.1; Vienna, Austria; http://www.R-project.org/³⁶) software.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Acknowledgement

None

Author contributions

Conceptualization: Myung Jin Chung, Chu hyun Kim
Data curation: Chu hyun Kim, Seok Oh
Formal analysis: Chu hyun Kim, Yoon Ki Cha, Seok Oh
Investigation: Chu hyun Kim, Kwang gi Kim
Methodology: Kwang gi Kim, Myung Jin Chung
Project administration: Myung Jin Chung
Resources: Hongseok Yoo, Myung Jin Chung, Kwang gi Kim
Software: Kwang gi Kim
Supervision: Hongseok Yoo, Kwang gi Kim
Validation: Myung Jin Chung
Visualization: Chu hyun Kim, Seok Oh
Writing-original draft: Chu hyun Kim
Writing-review & editing: Chu hyun Kim, Myung Jin Chung

Competing interests

The authors declare no competing interests.

Funding

none

Petnak, T., Lertjitbanjong, P., Thongprayoon, C. & Moua, T. Impact of Antifibrotic Therapy on Mortality and Acute Exacerbation in Idiopathic Pulmonary Fibrosis: A Systematic Review and Meta-Analysis. Chest 160, 1751–1763. http://doi.org/10.1016/j.chest.2021.06.049 (2021).
Canestaro, W. J., Forrester, S. H., Raghu, G., Ho, L. & Devine, B. E. Drug Treatment of Idiopathic Pulmonary Fibrosis: Systematic Review and Network Meta-Analysis. Chest 149, 756–766. http://doi.org/10.1016/j.chest.2015.11.013 (2016).
Mayo, J. R. CT evaluation of diffuse infiltrative lung disease: dose considerations and optimal technique. J. Thorac. Imaging 24, 252–259. http://doi.org/10.1097/RTI.0b013e3181c227b2 (2009).
Lynch, D. A. et al. Diagnostic criteria for idiopathic pulmonary fibrosis: a Fleischner Society White Paper. Lancet Respir. Med. 6, 138–153. http://doi.org/10.1016/S2213-2600(17)30433-2 (2018).
Zwirewich, C. V., Mayo, J. R. & Muller, N. L. Low-dose high-resolution CT of lung parenchyma. Radiology 180, 413–417. http://doi.org/10.1148/radiology.180.2.2068303 (1991).
Christe, A., Charimo-Torrente, J., Roychoudhury, K., Vock, P. & Roos, J. E. Accuracy of low-dose computed tomography (CT) for detecting and characterizing the most common CT-patterns of pulmonary disease. Eur. J. Radiol. 82, e142-150. http://doi.org/10.1016/j.ejrad.2012.09.025 (2013).
Padole, A., Ali Khawaja, R. D., Kalra, M. K. & Singh, S. CT radiation dose and iterative reconstruction techniques. AJR Am. J. Roentgenol. 204, W384-392. http://doi.org/10.2214/AJR.14.13241 (2015).
Pontana, F. et al. Effect of Iterative Reconstruction on the Detection of Systemic Sclerosis-related Interstitial Lung Disease: Clinical Experience in 55 Patients. Radiology 279, 297–305. http://doi.org/10.1148/radiol.2015150849 (2016).
Lim, H. J., Chung, M. J., Shin, K. E., Hwang, H. S. & Lee, K. S. The Impact of Iterative Reconstruction in Low-Dose Computed Tomography on the Evaluation of Diffuse Interstitial Lung Disease. Korean J. Radiol. 17, 950–960. http://doi.org/10.3348/kjr.2016.17.6.950 (2016).
Park, S. et al. Image quality in liver CT: low-dose deep learning vs standard-dose model-based iterative reconstructions. Eur. Radiol. 32, 2865–2874. http://doi.org/10.1007/s00330-021-08380-0 (2022).
Mohammadinejad, P. et al. CT Noise-Reduction Methods for Lower-Dose Scanning: Strengths and Weaknesses of Iterative Reconstruction Algorithms and New Techniques. Radiographics 41, 1493–1508. http://doi.org/10.1148/rg.2021200196 (2021).
Hosny, A., Parmar, C., Quackenbush, J., Schwartz, L. H. & Aerts, H. Artificial intelligence in radiology. Nat. Rev. Cancer 18, 500–510. http://doi.org/10.1038/s41568-018-0016-5 (2018).
Choi, H. et al. Dose reduction potential of vendor-agnostic deep learning model in comparison with deep learning-based image reconstruction algorithm on CT: a phantom study. Eur. Radiol. 32, 1247–1255. http://doi.org/10.1007/s00330-021-08199-9 (2022).
Kolb, M. et al. Effect of a novel denoising technique on image quality and diagnostic accuracy in low-dose CT in patients with suspected appendicitis. Eur. J. Radiol. 116, 198–204. http://doi.org/10.1016/j.ejrad.2019.04.026 (2019).
Nam, J. G. et al. Image quality of ultralow-dose chest CT using deep learning techniques: potential superiority of vendor-agnostic post-processing over vendor-specific techniques. Eur. Radiol. 31, 5139–5147. http://doi.org/10.1007/s00330-020-07537-7 (2021).
Kambadakone, A. Artificial Intelligence and CT Image Reconstruction: Potential of a New Era in Radiation Dose Reduction. J. Am. Coll. Radiol. 17, 649–651. http://doi.org/10.1016/j.jacr.2019.12.025 (2020).
Kim, J. H. et al. Validation of Deep-Learning Image Reconstruction for Low-Dose Chest Computed Tomography Scan: Emphasis on Image Quality and Noise. Korean J. Radiol. 22, 131–138. http://doi.org/10.3348/kjr.2020.0116 (2021).
Raghu, G. et al. Idiopathic Pulmonary Fibrosis (an Update) and Progressive Pulmonary Fibrosis in Adults: An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am. J. Respir. Crit. Care Med. 205, e18-e47. http://doi.org/10.1164/rccm.202202-0399ST (2022).
Nagayama, Y. et al. Deep Learning-based Reconstruction for Lower-Dose Pediatric CT: Technical Principles, Image Characteristics, and Clinical Implementations. Radiographics 41, 1936–1953. http://doi.org/10.1148/rg.2021210105 (2021).
Goodenberger, M. H. et al. Computed Tomography Image Quality Evaluation of a New Iterative Reconstruction Algorithm in the Abdomen (Adaptive Statistical Iterative Reconstruction-V) a Comparison With Model-Based Iterative Reconstruction, Adaptive Statistical Iterative Reconstruction, and Filtered Back Projection Reconstructions. J. Comput. Assist. Tomogr. 42, 184–190. http://doi.org/10.1097/RCT.0000000000000666 (2018).
Lee, S. et al. Noise reduction approach in pediatric abdominal CT combining deep learning and dual-energy technique. Eur. Radiol. 31, 2218–2226. http://doi.org/10.1007/s00330-020-07349-9 (2021).
Yeoh, H. et al. Deep Learning Algorithm for Simultaneous Noise Reduction and Edge Sharpening in Low-Dose CT Images A Pilot Study Using Lumbar Spine CT. Korean J. Radiol. 22, 1850–1857. http://doi.org/10.3348/kjr.2021.0140 (2021).
Mittal, A., Moorthy, A. K. & Bovik, A. C. No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process 21, 4695–4708. http://doi.org/10.1109/TIP.2012.2214050 (2012).
Raghu, G. et al. Diagnosis of Idiopathic Pulmonary Fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am. J. Respir. Crit. Care Med. 198, e44-e68. http://doi.org/10.1164/rccm.201807-1255ST (2018).
Lin, S., Lin, M. & Lau, K. K. Image quality comparison between model-based iterative reconstruction and adaptive statistical iterative reconstruction chest computed tomography in cystic fibrosis patients. J. Med. Imaging Radiat. Oncol. 63, 602–609. http://doi.org/10.1111/1754-9485.12895 (2019).
Choe, J. et al. Diagnostic and prognostic implications of 2018 guideline for the diagnosis of idiopathic pulmonary fibrosis in clinical practice. Sci. Rep. 11, 16481. http://doi.org/10.1038/s41598-021-95728-7 (2021).
Hunninghake, G. W. et al. Radiologic findings are strongly associated with a pathologic diagnosis of usual interstitial pneumonia. Chest 124, 1215–1223. http://doi.org/10.1378/chest.124.4.1215 (2003).
Hunninghake, G. W. et al. Utility of a lung biopsy for the diagnosis of idiopathic pulmonary fibrosis. Am. J. Respir. Crit. Care Med. 164, 193–196. http://doi.org/10.1164/ajrccm.164.2.2101090 (2001).
Putman, R. K. et al. Imaging Patterns Are Associated with Interstitial Lung Abnormality Progression and Mortality. Am. J. Respir. Crit. Care Med. 200, 175–183. http://doi.org/10.1164/rccm.201809-1652OC (2019).
Trattner, S. et al. Cardiac-Specific Conversion Factors to Estimate Radiation Effective Dose From Dose-Length Product in Computed Tomography. JACC Cardiovasc. Imaging 11, 64–74. http://doi.org/10.1016/j.jcmg.2017.06.006 (2018).
Winklehner, A. et al. Raw data-based iterative reconstruction in body CTA: evaluation of radiation dose saving potential. Eur. Radiol. 21, 2521–2526. http://doi.org/10.1007/s00330-011-2227-y (2011).
Kuo, Y. et al. Comparison of image quality from filtered back projection, statistical iterative reconstruction, and model-based iterative reconstruction algorithms in abdominal computed tomography. Medicine (Baltimore) 95, e4456. http://doi.org/10.1097/MD.0000000000004456 (2016).
Zhang, Z. et al. Can signal-to-noise ratio perform as a baseline indicator for medical image quality assessment. IEEE Access 6, 11534–11543. http://doi.org/10.1109/ACCESS.2018.2796632 (2018).
Chow, L. S. & Rajagopal, H. Modified-BRISQUE as no reference image quality assessment for structural MR images. Magn. Reson. Imaging 43, 74–87. http://doi.org/10.1016/j.mri.2017.07.016 (2017).
Svanholm, H. et al. Reproducibility of histomorphologic diagnoses with special reference to the kappa statistic. APMIS 97, 689–698. http://doi.org/10.1111/j.1699-0463.1989.tb00464.x (1989).
Ripley, B. D. The R project in statistical computing. MSOR Connections 1, 23–25 (2001).

No competing interests reported.

supplementarymaterial.docx

The impact of deep learning reconstruction in low dose computed tomography on the evaluation of interstitial lung disease

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1