Multi-institutional evaluation of guidance from International Ki67 Working Group vs National Health Commission of China on Immunohistochemistry-based Ki67 assessment

doi:10.21203/rs.3.rs-4064759/v1

Download PDF

Research Article

Multi-institutional evaluation of guidance from International Ki67 Working Group vs National Health Commission of China on Immunohistochemistry-based Ki67 assessment

https://doi.org/10.21203/rs.3.rs-4064759/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Purpose: Recommendations from National Health Commission of China (NHCC) and International Ki67 Working Group (IKWG) were issued respectively to guide immunohistochemistry (IHC)-based Ki67 scoring for breast cancer patients in daily clinical practice. They were evaluated in this multi-institutional study alongside with results from Quantitative Dot Blot (QDB) method.

Method: Three sections each from 40 primary ER+ breast cancer resection blocks were randomly assigned a number from 1 to 120 for Ki67 staining and reviewed by 21 pathologists while the other three alternative sections were sent for QDB analysis of Ki67 protein levels. Ki67 scores were grouped by 5/30% (IKWG), 10/30% (NHCC) and 20/30% (NHCC appendix 9, NHCCa9) respectively while QDB results were grouped by C₅-C₉₅ of 2.31 nmole/g defined in previous study as low, intermediate and high risk groups.

Results: The overall Intraclass Correlation Coefficient (ICC) was 0.785 for IHC evaluations from 21 pathologists, with the Fleiss Kappa at 0.555, 0.628 and 0.480 when Ki67 scores were grouped by the guidance from IKWG, NHCC and NHCCa9 respectively. In comparison, the ICC and Fleiss kappa for QDB analysis were at 0.939 and 0.831. When IHC and QDB results were cross-referenced, more specimens were grouped as high risk by QDB than IHC, and NHCCa9 led to highest percentage of disagreement between two methods.

Conclusion: The IKWG recommendation was harder to achieve categorized agreement among pathologists than that of NHCC, yet it led to best agreement with QDB to define low-risk group. QDB method offered significantly improved consistency over current IHC-based Ki67 assessment.

The nuclear proliferation biomarker Ki67 maybe one of the most used protein diagnostic biomarkers across all types of cancers (1). Its expression level is widely regarded as the reflection of aggressiveness of the tumor. For breast cancer patients, it is also a critical factor to consider for the benefits of chemotherapy in daily clinical practice worldwide. Thus, it is required for every new breast cancer patient by National Health Committee of China (NHCC) by its latest guidance (2, 3).

Current method of Ki67 assessment relies on immunohistochemistry (IHC) in daily clinical practice. In this process, the percentage of positive stained nuclei, or Ki67 score, is evaluated in tumor tissue to reflect the aggressiveness of the tumor. However, this method is clearly far from satisfactory in real-world practice, as intensive efforts have been launched aiming to amend issues associated with this method over the years (1, 2, 4).

The international Ki67 working group, or IKWG, suggested that “In this T1-2, N0-1 patient group, the IKWG consensus is that Ki67 5% or less, or 30% or more, can be used to estimate prognosis” for identifying estrogen receptor (ER) positive and Her2 negative breast cancer patients who may not need adjuvant chemotherapy (2). The NHCC guidance issued in 2022, on the other hand, considered Ki67 scored between 10% and 30% as borderline samples, requiring evaluation of more than 500 invasive breast cancer cells to improve the consistency of the results (3). Yet in the same guidance, appendix 9 (NHCCa9), the guidance suggested that between pathological laboratories, the scores between 20% ~ 30% are used as cutoffs to determine the necessity of adjuvant chemotherapy for breast cancer patients.

We interpreted that in all these guidance, the Ki67 scores are categorized as low, intermediate (or equivocal) and high risk groups, with the intermediate risk group varying between 5 ~ 30% (IKWG), 10 ~ 30% (NHCC), and 20 ~ 30% (NHCCa9). In this study, the three recommendations were compared by inviting 21 pathologists from 18 hospitals across China to evaluate same set of ER-positive breast cancer resections.

Meanwhile, we have demonstrated that absolute quantitation of Ki67 from Formalin Fixed Paraffin Embedded (FFPE) specimens using Quantitative Dot Blot (QDB) method may be used to identify ER-positive patients for adjuvant chemotherapy in a series of studies (5, 6). A putative cutoff of 2.31 nmol/g was developed based on overall survival (OS) analysis, and was validated using another cohort of breast cancer specimens independently.

Unlike IHC, QDB method is an objective and quantitative assay. Accordingly, we categorized the QDB results into low, intermediate and high risk groups based on the C₅ and C₉₅ values of the defined 2.31 nmole/g cutoff as a reflection of the reliability of the assay. In other word, the low risk was defined as ≤ C₅, the intermediate risk as between C₅ and C_95, and the high risk as ≥ C₉₅ of 2.31 nmole/g (the process of identification of C₅ and C₉₅ of the cutoff was included in the Materials and Methods section). Clearly, unlike in the cases of IKWG, NHCC and NHCCa9, this categorization is based on statistically analysis to minimize the influence of random error.

In this study, the organizer (Dr. Hao) chose 40 ER + breast cancer tissue surgical resection blocks for IHC and QDB analysis side by side. Three slices from each block were used for IHC analysis and scanned to be accessible online. The organizer assigned a number from 1 to 120 randomly to each of these 120 IHC slides, without revealing to invited pathologists that there were triplicate for each block until after the finish of the evaluation. The Ki67 scores were categorized as low, intermediate and high risk group by following IKWG, NHCC and NHCCa9 guidance respectively to evaluate the practicability of these three guidance, as well as those of QDB method when categorized using C₅ and C₉₅ of the defined 2.31 nmole/g in previous studies.

The pathological characteristics of all 40 breast cancer surgical resection blocks were listed in Table 1. These patients were all ER positive, with majority of them also PR positive (37 vs 3). More patients were over 50 (25 vs 15). All lymph node statuses were included as well as all pathological tumor sizes and histological grades. However, more patients were at pT1 and pT2 for tumor size, and pN0 and pN1 for lymph node statuses. Majority of them were at histological grade II (29 out of 40). As expected, majority of the patients were Her2- (37 vs 3).

The Ki67 scores for all 40 specimens from all 21 pathologists were shown in boxplot in Fig. 1. Cutoffs used in three guidance were indicated by lines of different color, with green to indicate the 5% cutoff from IKWG, black for the 10% cutoff from NHCC, blue for the 20% cutoff from NHCCa9, and red for the 30% cutoff shared by all three guidance to identify specimens in high risk group. The detailed scores were also reported in Table 2, with the triplicated Ki67 scores for each block listed within the same cell. As shown in Fig. 1, only 1 out of 40 specimens achieved 100% agreement as at low risk when IKWG guidance was followed. This number reached 7 with NHCC guidance and 13 with NHCCa9 guidance respectively. However, none of the specimens achieved 100% agreement as at high risk in this study.

In Table 2, a heat map was generated based on IKWG guidance, with ≤ 5% as green for low risk group, between 5% and 30% in yellow for intermediate risk group, and those ≥ 30% in red for high risk group. In supplemental table 1a and 1b, the same set of colors were applied to indicate low, intermediate and high risk groups based on NHCC and NHCCa9 guidance.

The IHC results were analyzed with Fleiss Kappa analyses. We found that NHCC had the highest overall Kappa at 0.628 (95%CI: 0.628 ~ 0.629). The IKWG guidance led to overall Kappa at 0.555 (95%CI: 0.554 ~ 0.555), and the NHCCa9 guidance led to lowest Kappa at 0.480 (95%CI:0.479 ~ 0.481).

The inter-rater intraclass correlation coefficient (ICC) was calculated at 0.785 (95%CI: 0.708 ~ 0.858). The intra-rater ICC was also investigated among invited pathologists. It should be emphasized that the reading of the triplicate section from each sample were blind to all but the organizer of the study. We found the single measurement of ICC was ranging from 0.639 to 0.982, with 25% percentile at 0.76, median at 0.848 and 75% percentile at 0.9225 respectively (Fig. 2).

The same slices were also used for QDB analysis, as shown in Table 3. The lysates were analyzed by three technicians, each measured in triplicate for three times. The overall CV of all the specimens were at 15.86%. When plotted against mean of Ki67 scores from 21 pathologists, the QDB results were highly correlated with those of IHC assessment, with r = 0.78, p < 0.0001 using Pearson’ correlation analysis (Fig. 3).

We also performed C₅₀ studies to determine the C₅ and C₉₅ of 2.31 nmole/g at 1.793 nmole/g and 2.727 nmole/g respectively (supplemental Fig. 1). The Ki67 levels from QDB analysis were thus categorized as ≤ C₅ as low risk group in green, between C₅ and C₉₅ as intermediate group in yellow, and ≥ C₉₅ as high risk group in red (Table 3).

The overall ICC of QDB was calculated at 0.939 (95%CI: 0.908 ~ 0.963), significantly higher than that of IHC. The categorized consistency of QDB method was also significantly higher than IHC-based method, with the overall Fleiss Kappa at 0.831 (95%CI: 0.827 ~ 0.836). The intra-rater ICC for three technicians were calculated at 0.924, 0.933 and 0.963 respectively. As shown in Fig. 2, all intra-rater ICCs of QDB analysis were above the 75% percentile of that of IHC analysis.

When we compared categorized QDB results with those of IHC, we observed that there were more specimens categorized as high risk by QDB method than IHC method (14/40 vs 6/40) (Table 4). The discordant specimens categorized as high risk group by QDB method was more likely categorized as intermediate risk group by IKWG and NHCC guidance, but as low risk group by NHCCa9 guidance. Unexpectedly, we identified two specimens (#23 and #29) grouped as high risk by QDB method, but as low risk group by any of three IHC guidance. Overall, we found Ki67 scores tend to be conservative when evaluating by IHC method than those by QDB method.

One of the major goal of Ki67 assessment is to identify patients who may be spared of chemotherapy, i.e, those of low risk group. There were 19 specimens in low risk group by IKWG, 23 by NHCC, and 31 by NHCCa9. When QDB results were used as reference, we found 84.21% (16/19), 73.91% (17/23), and 58.06% (18/31) agreement with IKWG, NHCC and NHCCa9 respectively (Table 4). In other word, QDB results were in best agreement with IKWG, and worst with NHCCa9.

In this study, by inviting 21 pathologists to assess Ki67 scores of same set of Luminal-like breast cancer specimens, we were able to evaluate the practicability of three guidance (IKWG, NHCC and NHCCa9), as well as that of Quantitative Dot Blot (QDB)-based Ki67, in daily clinical practice.

Our results demonstrated that the consistency is easier to achieve among pathologists by following NHCC guidance while harder to achieve by NHCCa9 guidance. However, if QDB results may be used as reference, the IKWG guidance offered best guidance to identify patients of low risk group, i.e., those spared of chemotherapy, while NHCCa9 offered significantly more false negative results.

We were also able to investigate the intra-rater ICC in this study by assigning random number from 1 to 120 to the triplicate section of the 40 samples. We believe this design would best reveal the potential subjectivity of IHC analysis in real world practice. Admittedly, a few of the pathologists may be alarmed with the repeated images during the evaluation. However, from the results we got, even if this situation did exist, it made minimum impact on the overall assessment.

The intra-rater ICC was calculated at as low as 0.639, with 25% percentile at 0.76, median at 0.848 and 75% percentile at 0.9225. We interpreted that even for an experienced pathologist in China, there was only 85% chance on average for him/her to score the same IHC slide consistently.

The current study is limited to the evaluation of a set of pre-stained slides. Thus, it was unable to evaluate the potential variations associated with pre-analytical factors in individual institutions. All the invited pathologists were also not through extensive training besides a broad instruction. Thus, we considered this study should reflect faithfully the real-world practice for all these invited pathologists.

We were surprised to find that there were 20% (8/40) specimens categorized as intermediate risk group based on the C₅-C₉₅ of the purposed 2.31 nmole/g identified in the previous study using QDB method. We interpreted that the precision of QDB method remains to be improved, as its improvement should narrow the window of intermediate risk group more in the future. It should also be cautioned that the proposed 2.31 nmole/g remained to be validated in the future with much larger scale of study. However, we expect the possible adjustment of this cutoff should have minimum impact on overall conclusion.

One unexpected observation is that while overall agreement between QDB and IHC method was satisfactory (r = 0.78 by Pearson), there were clear difference with two specimens, #23 and #29. They were grouped as high risk group by QDB method, but as low risk group by IHC method by all three guidance. One putative explanation maybe due to the negative influence of heavy staining on the nuclear antigen, as suggested by Rudbeck (7). The other possibility may be the incorrect staining due to poor pre-staining treatment. However, this point was debatable even among the invited pathologists. The IHC images of these two specimens were provided in supplemental data (Supplemental Fig. 2), warranting further discussion of this clear discrepancy of the two methods.

It also should be pointed out there existed difference in nature of the results from QDB analysis and IHC analysis. In QDB, the total protein lysates were extracted from FFPE slices through disruption of the tissue structure. Thus, QDB measures averaged protein content to minimize the heterogeneity of the tissue slice. In contrast, Ki67 scores reflect the localized Ki67 protein level with the fully preserved tissue structure, thus reflect better the heterogeneity of the tissue slice. The results from these two methods should be highly correlated, as demonstrated by current study, but not identical by any chance.

It is not unclear which method would provide more relevant result for the prognosis and prediction of patients. While some argue that tissue heterogeneity might be better reflected through IHC analysis, it is equally arguable that QDB method might maximally minimize the negative influence of tissue heterogeneity in the prognosis and prediction of the patients. Clearly, the final judgement may be only achieved through properly controlled prospective clinical trials in the future.

Another limitation with current study is that we invited 21 pathologists for IHC analysis, yet only three technicians were requested for QDB analysis. The limited number of technicians for QDB analysis may underestimate the variations among technicians when interpreting the QDB results. On the other hand, QDB analysis is an objective biochemical assay tightly controlled internally. The C₅/C₉₅ analysis also takes full consideration of the variations among technicians at large scale. Thus, we interpreted that potential impact of including more technicians for QDB analysis should not fundamentally change the overall conclusion of the current study.

In conclusion, by inviting 21 experienced pathologists to score the Ki67 levels of the same set of IHC slides from 40 ER + breast cancer specimens, we were able to compare the practicability of three clinical guidance (IKWG, NHCC and NHCCa9) in daily clinical practice. We were also able to compare the Ki67 scores with results from QDB measurements to suggest that QDB may improve the consistency of Ki67 assessment significantly in daily clinical practice. Our results also showed that if QDB results may be used as reference, the IKWG guide was hard to achieve agreement among pathologists, yet give the most trustworthy guide for chemotherapy for Luminal-like patients.

Human subjects

The inclusion criteria were patients diagnosed with invasive breast cancer with FFPE resection specimens available at Yantai Affiliated Hospital of Binzhou Medical University, Yantai, P. R. China 2015 to 2017. The specimens must be ER + based on IHC analysis, and have more than 50% tumor tissues based on H & E staining. All the studies were performed in accordance with the Declaration of Helsinki, and were approved by the Medical Ethics Committee of Yantai Affiliated Hospital of Binzhou Medical University (Approval # 20191127001 to Dr. Hao) with informed consent forms waived for archived specimens.

Sample preparation and distribution

For each of 40 resection blocks, seven adjacent sections were prepared, with the 1st stained with H & E, the 2nd, 4th, and 6th stained with IHC method using MIB1 antibody against Ki67. The 3rd, 5th and 7th sections were used to extract total tissue lysates for QDB measurement using the same MIB1 antibody against Ki67.

The 120 IHC-stained slides were randomly assigned number 1 to 120, and sent out for scoring using the NHCC guidance stated as the following “Our recommendation is that the whole slice be evaluated under a low-power field to determine whether the positive cells are uniformly distributed. If the positive cells are uniformly distributed, three or more high-power fields are to be randomly counted, and an average Ki-67 index is obtained. If the positive cells are unevenly distributed, a prominent “hot spot” of Ki-67 index may exist. (1) If the hot spot appears at the junction of tumors and normal tissues, and the Ki-67 index was relatively low within the tumor, it is recommended that the Ki-67 index in 3 or more highpower fields should be counted in the tumor margin area; (2) If the hot spot appears within the tumor, the Ki-67 index of the whole slice can be evaluated on average 3 or more high-power fields including the hot spot area should be selected. When the Ki-67 index is within the critical range of 10%−30%, it is recommended that more than 500 invasive carcinoma cells should be evaluated as much as possible to improve the accuracy” (3). There were no attempt initiated to standardize the scoring method other than the guidance per se. According, variations in scoring among different participants were expected.

All the invited pathologists are certified pathologist with minimum 10 years of experience in the hospital. The scoring process was blind to all participants except the study coordinator (Dr. Hao). All invited participants considered the 120 IHC slides as independent sections, and has no prior knowledge of QDB results until after submitting the Ki67 scores.

All the 120 IHC slides were also digitally scanned. The IHC slides are available for evaluation during and after the publication of the manuscript upon written request to Dr. Junmei Hao.

General Reagents

Mouse anti-Ki67antibody (clone MIB1) was purchased from ZSGB-BIO (Beijing,China). HRP-labeled Donkey Anti-Mouse IgG secondary antibody was purchased from Jackson Immunoresearch lab (Pike West Grove, PA, USA). All other chemicals were purchased from Sinopharm Chemicals (Beijing, P. R. China). KI67 recombinant protein was prepared by Quanticision Diagnostics, Inc., and the preparation method had been published(5).

QDB analysis

All the QDB analysis was performed by Quanticision Diagnostics, Inc. The detailed method was described elsewhere (5, 6). In brief, sections of all the 40 breast cancer specimens were used to extract total protein lysates. Total of 0.5 µg was loaded on QDB plate together with serially diluted recombinant KI67 purified protein in triplicate. The loaded QDB plate was dried for 4 hours at RT and then blocked in 4% non-fat milk for 1 hour. Anti-Ki67 antibody (MIB1) was diluted at 1:1,000 in blocking buffer, and incubated with QDB plate at 100 µl/well overnight at 4°C, and incubated next with a donkey anti-mouse secondary antibody (diluted at 1:2500 in blocking buffer) on a shaker at 100rpm for 4 hour at RT. At the last wash, invert the QDB plate for 1min, extract TBST waste liquid with the filter pump. The QDB plate was inserted into a white 96-well plate pre-filled with 100 µl/well ECL working solution for 3 mins for quantification with Tecan Infiniti 200pro Microplate reader with the option “plate with cover”.

The consistency of the experiments was ensured by including 293T cell lysate with known Ki67 levels in all the experiments. The result was considered valid when the calculated Ki67 level of 293T was within 20% of known Ki67 level at 12.5 (10.0–15.1) nmole/g, respectively. The absolute Ki67 level was determined based on the dose curve of protein standard. Ki67 level less than 25 pg (about 1.4 nmole/g) was defined as Limit of Quantitation and noted undetectable level (UD).

QDB analysis was performed by three technicians using the same set of total protein lysates in triplicate, and the experiments were repeated three times for the total of 9 independent measurements of Ki67 levels in the 40 breast cancer specimens.

Defining C₅ and C₉₅ of Ki67 cutoff

Multiple breast cancer specimens were screened to identify those with Ki67 levels at or above 5 nmole/g. Lysates from three specimens were mixed and serially diluted at 1:1.3 until at 0.81 nmole/g, supplemented with 0.25 µg/µl IgG free BSA. The prepared lysates were loaded to QDB plate as 56-plicates at each dose, and the Ki67 levels were measured through QDB analysis. The number of samples at each dose with their Ki67 values above 2.31 nmole/g were used to calculate C₅, C₅₀ and C₉₅ using the probit model of SPSS software at 1.793 nmole/g, 2.26 nmole/g and 2.727 nmole/g respectively. This experiment was performed twice with highly consistent results.

Statistical analysis

All the data were analyzed with SPSS 26.0. The overall agreement of the Ki67 scores from 21 pathologists with three independent evaluations of each specimens, as well as that of QDB results from 3 technicians with three independent measurements of each specimen, were assessed using Intraclass Correlation Coefficient (ICC) method. The inter-personal and intra-personal agreement of the Ki67 scores from three independent evaluations of each of the 40 specimens by 21 pathologists, as well as those of the Ki67 levels from three independent measurements of all three technicians, were also analyzed with ICC.

The Ki67 scores from 21 pathologists were also categorized as low, intermediate and high risk groups based on the guidance of International Ki67 working group (IKWG), National Health Committee of People’s Republic of China (NHCC), and National Health Committee of People’s Republic of China, appendix 9 (NHCCa9) respectively, and the overall performance of each guidance was assessed using Fleiss Kappa test. The Ki67 levels from QDB measurements were also categorized as low, intermediate and high risk groups based on the C₅ and C₉₅ of the 2.31 nmole/g cutoff defined in previous studies, and the consistency of QDB measurements from three technicians were also assessed using Fleiss Kappa test.

Data statement：

Ethics declaration:All the studies were conducted in accordance with the Declaration of Helsinki, and was approved by the Medical Ethics Committee of Yantai Affiliated Hospital of Binzhou Medical University (Approval # 20191127001 to Dr. Hao) with informed consent forms waived for archived specimens.

Data availability statement:Data are available from the correspondent author upon reasonable written request.

Conflict of interests: No conflict of financial and non-financial interests.

Authors' contributions:JMH provided clinical samples. CHL and BY performed IHC staining. QHC, GHD, PHF, XG, JMH, JYJ, YQK, CL, CJL, MRL, ZQL, YP, HYS, YHW, BCX, GHY, CPZ, HMZ, JRZ, LTZ and ZLZ performed IHC analyses. JMH, YW and JRZ performed data analysis. YL, JW and ZZ performed all the statistical analysis. JMH designed and supervised the overall study and drafted the manuscript. JMH, YW and JRZ contributed to data interpretation and edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding：This research is funded by Yantai Affiliated Hospital of Binzhou Medical University.

Acknowledgement:The authors wish to thank Ms. Yunyun zhang, Jiahong Lyu, and Fangrong Tang for their expertise in QDB analysis.

Dowsett M, Nielsen TO, A’Hern R, et al. Assessment of Ki67 in Breast Cancer: Recommendations from the International Ki67 in Breast Cancer Working Group. JNCI J Natl Cancer Inst. 2011;103(22):1656–64. 10.1093/jnci/djr393.
Nielsen TO, Leung SCY, Rimm DL, et al. Assessment of Ki67 in Breast Cancer: Updated Recommendations From the International Ki67 in Breast Cancer Working Group. J Natl Cancer Inst. 2021;113(7):808–19. 10.1093/jnci/djaa201.
Health Commission of the People’s Republic of China N. National guidelines for diagnosis and treatment of breast cancer 2022 in China (English version). Chin J Cancer Res. 2022;34(3):151–75. 10.21147/j.issn.1000-9604.2022.03.02.
Leung SCY, Nielsen TO, Zabaglo L, et al. Analytical validation of a standardized scoring protocol for Ki67: phase 3 of an international multicenter collaboration. NPJ Breast Cancer. 2016;2:16014. 10.1038/npjbcancer.2016.14.
Hao J, Lyu Y, Zou J, et al. Improving Prognosis of Surrogate Assay for Breast Cancer Patients by Absolute Quantitation of Ki67 Protein Levels Using Quantitative Dot Blot (QDB) Method. Front Oncol. 2021;11:3673. 10.3389/fonc.2021.737781.
Yu G, Lv J, Zhang Y et al. validation of the roles of Ki67 and cyclin D1 for subtyping of Luminal-like breast cancer patients.
Rudbeck L. Adding quality to your qualitative IHC. MLO Med Lab Obs. 2015;47(12):18–9.

Tables 1 to 4 are available in the Supplementary Files section.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Multi-institutional evaluation of guidance from International Ki67 Working Group vs National Health Commission of China on Immunohistochemistry-based Ki67 assessment

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Materials and Method

Human subjects

Sample preparation and distribution

General Reagents

QDB analysis

Defining C₅ and C₉₅ of Ki67 cutoff

Statistical analysis

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1

Multi-institutional evaluation of guidance from International Ki67 Working Group vs National Health Commission of China on Immunohistochemistry-based Ki67 assessment

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Materials and Method

Human subjects

Sample preparation and distribution

General Reagents

QDB analysis

Defining C5 and C95 of Ki67 cutoff

Statistical analysis

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1

Defining C₅ and C₉₅ of Ki67 cutoff