Background As opioid prescriptions have risen, there has also been a rise in opioid overdose deaths and substance use disorders. Public health systems have tried to improve their ability to detect and intervene in opioid use disorders to prevent death due to overdose. The objective of this study is to compare two approaches to identify opioid use problems (OUP) using electronic health record data- text mining versus diagnostic codes.
Methods Our sample consisted of adults on long-term opioid therapy (LTOT), defined as at least ≥ 70 days of supply within 90 days, and who visited a large multi-hospital network within a two-year period, between 1 January 2013 and 31 December 2014. We excluded patients with active cancer or schizophrenia. Text mining results were validated by a semi-assisted human review process and positive predictive value and level of agreement was reported. Each algorithm sought to identify patients who visited a health care facility due to an opioid poisoning event, opioid abuse, or opioid dependence. Population characteristics for positive OUP identified by text mining and ICD cohorts were compared. Chi-square and Fishers exact test were used for categorical data analysis and independent t-test was used to compare means for continuous variables. We further compared the demographics of the cohorts identified by the two methods.
Results We identified 14,298 eligible LTOT patients. Text mining of relevant electronic clinical notes yielded 127 positive OUP cases compared to 45 cases using International Classification of Disease (ICD)-9 codes for the same population. Just eight OUP patients were identified using both methods. The two cohorts differed significantly with respect to age, gender, and other characteristics.
Conclusions Compared to diagnostic codes, text mining identified more OUP cases with distinct characteristics. Incorporating text-mining techniques into OUP surveillance methods may support better detection of OUP and more accurate estimates of prevalence.

Figure 1

Figure 2
This is a list of supplementary files associated with this preprint. Click to download.
Loading...
Posted 05 Mar, 2020
Posted 05 Mar, 2020
Background As opioid prescriptions have risen, there has also been a rise in opioid overdose deaths and substance use disorders. Public health systems have tried to improve their ability to detect and intervene in opioid use disorders to prevent death due to overdose. The objective of this study is to compare two approaches to identify opioid use problems (OUP) using electronic health record data- text mining versus diagnostic codes.
Methods Our sample consisted of adults on long-term opioid therapy (LTOT), defined as at least ≥ 70 days of supply within 90 days, and who visited a large multi-hospital network within a two-year period, between 1 January 2013 and 31 December 2014. We excluded patients with active cancer or schizophrenia. Text mining results were validated by a semi-assisted human review process and positive predictive value and level of agreement was reported. Each algorithm sought to identify patients who visited a health care facility due to an opioid poisoning event, opioid abuse, or opioid dependence. Population characteristics for positive OUP identified by text mining and ICD cohorts were compared. Chi-square and Fishers exact test were used for categorical data analysis and independent t-test was used to compare means for continuous variables. We further compared the demographics of the cohorts identified by the two methods.
Results We identified 14,298 eligible LTOT patients. Text mining of relevant electronic clinical notes yielded 127 positive OUP cases compared to 45 cases using International Classification of Disease (ICD)-9 codes for the same population. Just eight OUP patients were identified using both methods. The two cohorts differed significantly with respect to age, gender, and other characteristics.
Conclusions Compared to diagnostic codes, text mining identified more OUP cases with distinct characteristics. Incorporating text-mining techniques into OUP surveillance methods may support better detection of OUP and more accurate estimates of prevalence.

Figure 1

Figure 2
This is a list of supplementary files associated with this preprint. Click to download.
Loading...