Phantom data
A NEMA IEC body phantom was examined using a GE Discovery MI PET scanner (GE Healthcare, General Electric, Boston, MA, USA) with a 3-ring detector with silicon photomultipliers (SiPM) and a reported sensitivity of 7.3 cps/kBq (27). Total activity in field of view was approximately 35 MBq. The absolute activities were measured in a certified dose calibrator (ISOMED 2010, MED Dresden GmbH, Germany), which was also used for regular cross calibration of the PET scanner (every 6 months). Sphere inserts (inner diameter, 10, 13, 17, 22, 28, and 37 mm) were filled with 24.4 kBq/ml F18-fluoride while the background was filled with 3.1 kBq/ml (sphere-to-background ratio, approx. 8:1). Acquisition time was 3 min per bed position (transaxial field of view, 70 cm; matrix size, 256 × 256; voxel size, 2.73 × 2.73 × 2.78 mm3). CT data of the phantom were used for attenuation correction. Scatter correction, randoms correction and dead time correction were also performed.
PET raw data were reconstructed using OSEM with time of flight (TOF; GE “VUE Point FX”) with 4 iterations and 16 subsets (i.e., TOF4/16). This reconstruction was defined as the reference algorithm for subsequent analyses and used either a 2.0 mm, 6.4 mm or 9.5 mm in-plane Gaussian filter (i.e., TOF4/16/2, TOF4/16/6.4 or TOF4/16/9.5). Further reconstruction was performed with OSEM and TOF with 4 iterations, 8 subsets and either 2.0 mm, 6.0 mm or 9.5 mm in-plane filter (TOF4/8/2, TOF4/8/6 or TOF4/8/9.5).
Additionally, data were reconstructed using OSEM with TOF and point spread function (OSEM+PSF+TOF, hereafter referred to as PSF+TOF; GE “VUE Point FX” with “SharpIR”) with 2 iterations and 17 subsets and either 2.0 mm, 7.0 mm or 10.0 mm in-plane filter (PSF+TOF2/17/2, PSF+TOF2/17/7 or PSF+TOF2/17/10), respectively. TOF and PSF+TOF reconstructions always included a “standard” z-axis filter.
All data were also reconstructed using Bayesian penalized likelihood reconstruction (GE “Q.Clear”) with a penalization factor β of 600, 1750 or 4000 (Q.Clear600, Q.Clear1750 or Q.Clear4000), respectively.
Reconstructed spatial resolution was assessed as the full width at half maximum (FWHM) of the PSF in the reconstructed phantom images. PSF was modeled by a 3D Gaussian, and FWHM was determined by applying the method described in detail by Hofheinz et al. (28). This method is based on fitting the analytic solution for the radial activity profile of a homogeneous sphere convolved with a 3D Gaussian to the reconstructed data. In this process, the full 3D vicinity of each sphere is evaluated by transforming the data to spherical coordinates relative to the respective sphere's center. A summary of the used reconstructions, resulting spatial resolution and image noise (patient data) is given in Table 1. Representative radial profiles are shown in Figure 1.
To study effects of different acquisition time per bed position, PET list mode data were retrospectively rebinned to reconstruct further datasets representing an acquisition time of 120s, 90s or 60s, respectively. Reconstruction was then performed with the algorithms that resulted in a reconstructed spatial resolution of 7 mm (i.e., TOF4/8/6, TOF4/16/6.4, PSF+TOF2/17/7 and Q.Clear1750).
Patients and scans
Fifty patients (female, 20; median age, 69 years; range, 46 to 83 years) with histologically proven NSCLC underwent pretherapeutic FDG-PET/CT between 07/2018 and 02/2019 using the same scanner. Patients were required to fast for at least 6 hours prior to tracer administration, and a blood glucose level of ≤150 mg/dl was ensured. A median activity of 249 MBq (interquartile range [IQR], 238 to 257 MBq; range, 209 to 274 MBq) or 3.7 MBq/kg (IQR, 3.1 to 4.2 MBq/kg; range, 2.0 to 5.7 MBq/kg) was administered intravenously. Static PET data were acquired after a median uptake time of 65 minutes (IQR, 61 to 70 min; range, 55 to 96 min) from base of skull to the proximal femora in 3D acquisition mode (acquisition time, 180s per bed position; bed overlap, approx. 25%). Attenuation correction was based on a non-enhanced low-dose CT (automated tube current modulation “Smart mA”; maximum tube current-time product, 100 mAs; tube voltage, 120 kV; gantry rotation time, 0.5 s) or non-enhanced diagnostic CT (maximum tube current-time product, 200 mAs).
PET raw data were reconstructed as described above (patient example in Figure 2). Furthermore, data with 5 mm FWHM resolution were smoothed with a Gaussian filter (5 mm FWHM). According to

this results in a target spatial resolution of approximately 7 mm. Altogether, 25 image data per patient with different spatial resolution and noise (i.e., acquisition time) were generated.
Data evaluation
Evaluation of the data was performed with a dedicated software (ROVER, version 3.0.34, ABX advanced biochemical compounds GmbH, Radeberg, Germany) by an experienced physician in nuclear medicine. MTV of the primary tumor was delineated in each dataset using the same threshold-based, background-adapted algorithm (30). Delineation was visually inspected and manually corrected if deemed necessary. Tumoral FDG-avid tissue not related to the primary tumor and delineable from the latter (lymph nodes, metastases) was excluded. If the primary tumor was determined to be multifocal (i.e. separate ipsilateral tumor nodules) or the presence of lymphangitic carcinomatosis was diagnosed by interdisciplinary consensus, all tumor nodules and FDG-avid lymphangitic tissue were included in the MTV (see also (15)). SUVmax and ASP (31) of the MTV were derived. SUV were normalized using the body weight in kg.
ASP was calculated identical to its initial definition by the authors (31), which was unaltered in subsequent publications (13-15, 32-38):

S and V are the surface area and the volume of the MTV, respectively. S was computed as the sum of all voxel surfaces that form the outer and inner surface of the MTV multiplied by the factor 2/3. Note that this corresponds to the approximation of the surface area of discrete 3D objects using 6 voxel classes as described by (39).
Please note that this definition of the MTV surface area is distinctly different from the definition by the Image Biomarker Standardization Initiative (IBSI), and compliance of both definitions cannot be assumed. The IBSI estimates the MTV surface area using a mesh-based representation after triangulation of the MTV’s outer surface (26). Supplementary material #1 provides the IBSI checklist for an overview of all methodological aspects of image generation and image processing in the present analysis. Distribution of ASP values in all current 50 patients is illustrated in figure 3.
In each dataset, a spherical volume of interest (VOI) of approx. 19 ml was placed in the unaffected right liver lobe to derive its SUVmean and SUV standard deviation and calculate image noise (SUV standard deviation / SUVmean).
Statistical analysis
Statistical analysis was performed using SPSS 22 (IBM Corporation, Armonk, NY, USA). Descriptive parameters were expressed as median and IQR. Relative differences between any dataset a and the reference dataset b were calculated as follows:

The significance of these differences was assessed with Wilcoxon signed-rank test for paired data. Proportions (%) of discordantly classified cases (high vs. low ASP/SUVmax/MTV) between algorithms were given with their 95% binomial proportion confidence intervals (95%-CI) which included the continuity correction of ±0.5/n (= ±0.5/50 = ±1%). Classification with ASP (>19.5%) was based on a previously identified cut-off in NSCLC patients (15) while cut-offs for SUVmax (>10.5) and MTV (>9.5 ml) were the respective median among the current 50 patients. Proportions between different pairs of algorithms were compared with two-sided McNemar’s test. Correlation between ASP and MTV was examined using the Pearson correlation coefficient r and interpretation criteria based on (40). Statistical significance was generally assumed at p<0.05.