Application of ML methods in identifying patients with asthma in primary care

doi:10.21203/rs.3.rs-1946315/v1

Background:

Asthma is one of the most prevalent diseases, with approximately 5.4 million patients on prescribed medication in the UK. Poor asthma management is responsible for many preventable deaths in the UK, making the mortality rate the highest in Europe. Identifying asthma patients is time-consuming and requires detailed reviews of individual patients by GPs. In a previous study (awaiting publication), bespoke designed algorithms (Smart-Searches™) were used to identify patients who were not on the Quality Outcome Framework (QOF) asthma register but were likely to have asthma. GPs further reviewed these patients found by the searches to confirm their condition. This study aims to apply machine learning methods to real-world primary care electronic health records (EHRs) and compare their performance in identifying asthma patients with the previously used Smart-Searches™.

Methods:

This is a binary classification problem where patients are identified as asthmatic or non-asthmatic. Data from two practices used in this study comprised around 9000 patients, of whom around 600 were on the asthma register. A set of 40–45 features were extracted from the health records as inputs to the models. The models were trained and tested on datasets in several experiments. Both linear models such as Logistic Regression, Random Forest, Support Vector Model, Naïve Bayes, and deep learning models such as MLP and CNN were evaluated, and compared with the existing traditional methods.

Results:

ML models, on average, got a higher accuracy of about 70% compared to traditional methods at 54%. The Ensemble model obtained the highest accuracy at 77%, followed by MLP at 75%. In addition, the average positive predictive value for the ML methods was 82% compared to the search-based system at 54%. Finally, the Naïve Bayes model obtained the best positive predictive value at 100%.

Conclusions:

ML methods obtained high accuracy and positive predictive values, showing that the ML models could make better asthma identification predictions than the existing system. This also shows that the machine learning models could help clinicians identify more asthma patients in significantly less time while requiring less clinician input than the existing best methods leading to improved efficiency and better patient care.

Asthma diagnosis

ML for Asthma

Asthma primary care

There have been several methods proposed for asthma diagnosis in the past. Machine Learning (ML) methods have been beneficial in applications such as cancer diagnosis (1), detection of Alzheimer's (2), (3), (4), (5).

(6) has looked at artificial intelligence (AI) in asthma diagnosis and prognosis. He has tried several models incorporating patient-filled, doctor-filled questionnaires and doctor observations and obtained a high accuracy of over 90%. The paper uses symptom-based questionnaires, chest sound analysis, and medical reports from lung function tests and CT scans to input models. The answers to the questionnaires and weighting factors make up matrices that are passed into the ML models as features. The author found that the model with the questionnaires, the scoring system, symptoms, and other patient data gave better results than just one kind of input. Alan Kaplan et al. (7) have provided an overview of AI's use in medicine, particularly in respiratory medicine. The authors evaluated lung cancer images, diagnosed fibrotic lung disease, and, more recently, is being developed to aid the interpretation of pulmonary function tests and the diagnosis of a range of obstructive and restrictive lung diseases, specifically asthma and COPD. They suggest that AI has a clear role in providing support to clinicians. They state that asthma and COPD diagnosis depends on the patient's history, physical examinations, and pulmonary function tests. They claim that their AI models could identify different pulmonary conditions with a sensitivity of up to 100% and showed a success rate of 82% in assigning a correct diagnosis, performing better than pulmonologists who made correct diagnoses in only 44.6% +/- 8.7% of the cases.

In another study (8), the authors define asthma as "a combination of variable respiratory symptoms and large changes in lung functions". As they have said, many of the asthma symptoms are not specific to the disease, which increases the complexity involved in the diagnosis. As a result, patients could be offered inappropriate treatments with potentially harmful physical and financial implications. The authors have used deep neural networks and linear models to diagnose asthma with an accuracy of over 98%, and a group of doctors confirmed the outcome. They conclude that using AI techniques can help save time in asthma diagnosis.

A study on the early detection of asthma was carried out by (9). In this paper, the authors have explored the use of ML in predicting asthma persistence in children aged five and below. They claim that the models can predict if the children will have persistent or transient asthma at age 10. They included children diagnosed as asthmatic and had hospital visits, A&E visits or ambulatory visits between the ages of 2 and 5 years in the dataset.

Li S et al (10), studied a cohort of children for asthma symptoms and used various sets of filtered features along with linear models to identify asthma. They show that machine learning algorithms can help in identifying asthma patients.

Most of the previous studies discussed above have used questionnaires and other forms of data collection for their study or have worked on children. Our study resembles (4) in terms of the approach. However, it differs somewhat because we exclusively use pre-existing primary care data as input to our models rather than collect data exclusively for the research by running lung function tests. They also do not use prescription history to help with the study. While (9) have studied persistent asthma in children, they do not study the onset of asthma but use data from a cohort of children already diagnosed with asthma to predict the presence of persistent asthma at a later age. Although (9) use features such as symptoms, blood test results, and lung sounds, they do not use medication history and work with a younger cohort in a hospital setting.

In our study, the input consists of data found within each patient's electronic health record (EHR), which has been collated during their specific medical care history in primary care. No other tests or data collection through questionnaires were conducted.

The number of patients who are identified as asthmatic in our data set is about 6% (11), and are tracked using the Quality Outcome Framework (QOF) asthma register (12). The asthma register contains a list of patients taking asthma medication in the last 12 months and removed if not. The register used as a truth set is a list of patients currently taking asthma medications rather than a list of patients diagnosed with asthma. Although this list helps keep track of patients currently on medications, it does not help identify asthma patients. Furthermore, the primary care dataset is not complete for all patients. In other words, patients have some or other features but not all, leading to a sparse dataset. This emphasizes that if a patient does not have a reading or symptom(s), it could be because they have not been tested for this rather than the value not being present for the patient. There is no data regarding FeNO and spirometry tests, which in (4) is shown significantly to increase the asthma diagnosis accuracy of ML methods. Our study uses the prescription data in the patients' health records. The medication data is available accurately from 2016 onwards, which means that patients who were on the asthma register before this date cannot be identified and used for the analysis. Although Alan et al. (6) claim to have successfully identified cases much better than pulmonologists, they focus on a broader set of respiratory disorders. They use not only patients' physiological history but also images and pulmonary function test results for their study.

A system using SQL searches with specially identified criteria has been in place at the practices. The searches used a combination of several conventional rule-based logic systems to identify suspected asthma patients who were missing a qualifying diagnostic code but receiving inhalers for some reason. These patients were part of the suspected asthma group and meticulously reviewed by the usual GP. This conventional approach was employed in 2017 and improved the prevalence rate from 4.7% to 6.9% over five years (England's average of 6.5% in March 2020), as shown in Figure 1. Patient records that contain symptoms, triggers, diagnosis, findings, reviews of events and treatments or prescriptions, along with free text medical notes written by GPs, attachments, and results from any further test conducted on the patients, were used to study and identify these patients. We aim to use ML methods to analyse existing primary care data to identify patients with asthma more accurately and help improve the efficiency of the asthma diagnosis process.

We used data from two primary care centres with a list size of 6000 and 4000 patients in London to test and train the models. The raw data of coded histories were extracted from the clinical IT system supplier (EMIS) and imported into a secure local SQL database. Patient data was anonymised to ensure further data privacy to comply with GDPR. All identifiers were randomised, and only the GP practice with access to patient identifiable information for reviewing patient records can access data in the usual way.

Initial data exploration and interpretation conducted helped identify data quality issues. For example, each coded entry had two dates; the event date related to the date of the occurrence of the event and the audit date related to the date the code was added. These are often the same but different when retrospective entries are made, indicating that medical codes could have been altered at different time points.

Feature Set:

Asthma is a chronic respiratory disease related to inflammation of the airways. Common asthma symptoms include chest tightness, wheeze, cough and breathlessness. When these symptoms significantly compromise the airway, it can constitute the individual having an asthma exacerbation.

The input data contains registered patients' demographics and events, including diagnosis, findings, observations, and communication information. Prescriptions contain data about medications prescribed to the patients. Prescription data goes back five years, and events data goes back more than 15 years before the study. The following features were extracted from the patients' electronic health records, as shown in S1.

Some common asthma symptoms are cough, wheezing, sleep disorder, shortness of breath and chest tightness (13). These symptoms may occur in isolation or as a combination (8). Cough/ wheezing symptoms are considered bouts of a single episode if they occur within eight weeks. So, the first time a patient visits the surgery for cough or wheeze, it is considered an event, and subsequent visits are put together as a single episode if they fall within an 8-week window. Patients are more likely to have asthma when wheezing and atopic conditions occur. Atopic eczema is a condition that makes the skin red and itchy (14). According to Ellina Gillani et al, atopic dermatitis is considered an entry point for respiratory conditions such as asthma. They suggest that careful management of atopic dermatitis is essential to prevent the development of respiratory allergy or to reduce the severity of asthma and allergic rhinitis, another factor closely associated with asthma. According to Bergeron and Hamid (15), who conducted a study on asthma patients and patients with rhinitis, 40% of patients with allergic rhinitis have asthma, and 94% of patients with allergic asthma have rhinitis. They also state that the incidence of asthma and rhinitis increases with age and have found that keeping a check on rhinitis helps manage asthma better.

Allergens that can cause inflammation in the body can be identified using the immunoglobulins E values. A value of more than 0.35kU/L is considered elevated. Bronchodilator reversibility is a lung function test performed using a spirometer and a bronchodilator. A positive result is considered a strong indication of asthma. Spirometry is conducted to test the restrictiveness or obstructiveness of the lungs. Obstructive lung function can be found in asthma patients who have difficulty exhaling air. A positive result is when the ratio FeV1/FVC is less than 70%. Another feature considered is the frequency of asthma medications administered in a year to patients in the past; in this study, a period of 5 years is considered.

Evaluation:

In this study, several deep learning models – Multi-perceptron Layer (MLP), Convolutional neural networks (CNN), as well as linear models -- Logistic regression (LR), Random Forest (RF), Naïve Bayes (NB), Support vector machines (SVM), Linear ensemble model (ensemble) - were explored to analyse patient data. Both linear and deep models are effective in different ML applications, as seen in (16–22). The problem was a binary classification where patients were required to be classified as asthmatic or non-asthmatic. All models were provided with the same input features so that the outputs could be compared on an equal footing. The metrics used to evaluate the models are accuracy, specificity, sensitivity, AUC-ROC score, and positive predictive value (PPV). Accuracy is defined as the number of correct predictions for all predictions. Specificity measures how well the model identifies asthma patients, and sensitivity measures how well the model identifies negative samples or non-asthma patients. AUC score is used as the metric that indicates how well the model can distinguish between the classes. The higher the AUC score, the better the model's performance distinguishing between the positive and negative classes. As the models are being compared against an existing search-based system that identifies potential asthmatic patients, PPV was used as the appropriate metric, which helps indicate how many patients were correctly predicted as asthmatic. The output of the searches was a set of suspected asthma patients, who GPs reviewed before confirming their asthma status. The number of positive cases obtained from the reviews was compared against the predictions of the models.

The input features were extracted using several SQL queries with appropriate cut-off dates and other constraints and merged in the python environment. The 'cut-off' dates ensure that the model only sees the data before diagnosis and is not biased by subsequent codes assigned after diagnosis. In a previous review conducted between 2017 and 2019, 171 patients who were identified as suspected asthma patients were found using Smart Searches^TM. These searches are a set of intelligent queries built to filter out asthma suspected patients. The queries use asthma-related diagnoses and prescription codes to filter out patients. Of approximately 10,000 patients from 2 practices, 171 were suspected of having asthma. After the review of the case notes by GPs, 104 were identified as asthmatic. From this group of 171, only 114 patients EHR were currently available for the study, of which 66 were confirmed asthmatic and 48 non-asthmatic.

All diagnosis was confirmed after a rigorous review by the GP and the following spirometry where required. The review process occurred between July 2017 and May 2019. Audited patients obtained by the search-based system and reviewed by GPs make up the audited dataset, and non-audited patients were the rest of the patients from the practice. This audited dataset was used as the validation hold-out set for evaluating the models in the current study. Initially, data from one practice was used to train the models. For the non-audited patients' dataset, only about 6% (~410) of the patients on the asthma register are positive samples, and the rest, 94% (~5800) are non-asthmatic patients or negative samples. The reviewed audit dataset of 91 patients from the same practice comprises a reliable training set. Each suspected asthma patient found by the originally structured algorithm searches (Smart Searches^TM) has verified their asthma status by a GP. Unfortunately, as shown by experiments 1, 2 and 3 in Table 3, this reliable dataset was too small to be used as an effective training set. Therefore, the complete set of patients at the GP surgery (excluding a small test set) was used as a training set, with each patient's asthma status recorded in the GP surgeries asthma register used to label the samples.

The asthma register is an administrative ledger that follows the QOF guidelines and contains a list of patients diagnosed with asthma and who have had asthma medications prescribed to them in the last 12 months . Any patient diagnosed with asthma and receives a prescription remains on the asthma register, whereas patients without a prescription in the last 12 months are removed from the register. The asthma diagnosis and medication codes underlying the asthma register are the QOF cluster codes. These codes are assigned to patients to indicate a diagnosis, finding, treatment or procedure and is summarised in the Table 2 below. The asthma codes were provided to the model as required for the analysis. AST_COD cluster indicates asthma diagnosis and related codes, and ASTTRT_COD are codes assigned to the asthma medication/ treatment the patients receive. For all the experiments, ASTSPIR_COD codes were used to indicate if the spirometry test was conducted and the test results recorded.

All the experiments were run with the validation split varied between 20% and 70% to ensure maximum exposure of the training data to the model. The models were first trained with the existing training set before applying the Synthetic Minority Oversampling Technique (SMOTE) to the train set to reduce the class imbalance. The models achieved some accuracy with the original dataset but learned better with the balanced training data. Class weights also helped with improving the predictions. To understand the predictability of the models, explainability techniques were used. Shapely additive explanations (SHAP) are useful to demystify a model, as shown in (23). In addition, they help to understand the 'why' of a decision made by the machine learning models. The SHAP values calculated from models help understand the impact of a feature on the model's decision on a patient. For example, with SHAP values, we could find that one of the features is valid for a few patients; otherwise, 0 was given a high priority by the models. After converting this feature to a categorical variable and retraining, the models showed a slight improvement in their accuracy. The summary plot of SHAP shows the importance of features on average, as shown in Figure 4.

Data from another GP surgery helped to increase the patient list size to 9000 and the number of patients on the asthma register to around 600. The models were tested and retrained with the new data.

The results from all the experiments are summarised in the table 3 below. The first two experiments were conducted with the audit data as the train set and the non-audit as the test set. The models achieved an AUC score of 85% for the MLP model, which was the best model for the first experiment. The test set achieves an AUC of 44%. For experiment 2, the train set achieves an AUC of 65% for the MLP model, but only 37% for the non-audited test set. For experiments 3-5, the non-audited patients' dataset was used as the train set and the audited patients' dataset as the test set. For the 3^rd run, the CNN model achieved the highest AUC score at 66% for the train set and 50% for the test set. The MLP models achieve a high AUC score of 61% for the test set for the 4^th experiment. For the 5^th run, the MLP models achieve the best results at 77% AUC score for the test set.

The models from experiment 7 achieved the best performance out of all of the experiments. The Table 4 shows the results from this experiment in detail and the results for the existing search-based system. The metrics used for this analysis slightly vary by comparing the true and false predictions with the findings from the searches. The results show that the Smart Searches^TM extracted 53 asthma patients from a total of 99, which gives an accuracy of 54% and hence a PPV of 54% against 93% obtained by the Ensemble model. The MLP model obtained a PPV of 88%, whereas the Naïve Bayes obtained 100% PPV.

In the first 2 experiments, the models have been trained on the audit dataset and tested on the rest of the patients. The accuracy of the models is for the test set is about 47% as shown in Table 3. The AUC score is also low for the test set at 44%. The dataset is cut-off on the estimated review date. So, only data before this date is available for the models to use for the training. But, for the test set, the data was cut-off at the review date and asthma codes such as review codes, diagnosis, resolved, and review invite codes were removed from the dataset. The train set did not have a cut-off date for the second experiment, but asthma codes were removed along with medication. The test set also followed the same criteria. The models gave poor results with the test set, with an AUC score of 37%, much lower than the first experiment. For both runs, the model failed to generalize, which could be because the asthma codes were removed, which might indicate a loss of important information. Another reason could be due to the small train set (audited dataset).

The rest of the experiments use the non-audited dataset as the train set and the audit dataset as the test set. For the third, the train set was not cut-off on any particular date, but asthma codes were removed. In other words, the entire dataset was used but with the asthma codes removed. The test accuracy was about 49% and AUC score was 50%. This is much higher than the previous experiment but still very low. The models when trained on a much bigger dataset, performed much better and the more extensive train set might have helped the models train better. A higher accuracy could also be from the fact that medications were added back into the dataset, indicating the significance of having medication information for identifying asthma in patients. For the fourth experiment, a cut-off date was introduced again, but the asthma codes were removed. The cut-off date was the first asthma code entry date, which indicates when the patients might have been first diagnosed with asthma. The models’ test accuracy was about 53% and AUC score for the test set was about 61%. The test performance was slightly better than the previous experiments. The cut-off date used here seemed logical because the model now had information regarding events that occurred before the onset of asthma or in other words, before they were assigned the asthma diagnosis code for the first time. For the fifth experiment, the cut-off date was slightly changed to the first date a patient got onto the asthma register instead of the first time they were assigned an asthma code. The models test accuracy rose to about 72% and the AUC score for the asthma class was about 75% for the MLP models. The models obtained a much better prediction score than the previous experiments on the test set.

The improvement could be because of the more logical cut-off date, where models have access to data until the first asthma register entry date. This could also be because only asthma diagnosis codes were removed for this run instead of all the asthma-related codes. In addition, the models have been provided with information indicating any asthma reviews conducted or invites for reviews, etc. which could be subtle clues to the patients being asthmatic. With the addition of new data, the models, which were trained with the old data, now tested with the new, did not perform well as shown by experiment 6 results. They gave poor results with 49% AUC_ROC score. Hence, the models were retrained with new data and the performance of the models was much higher than the previous experiments at 77% AUC_ROC score. The best model in terms of PPV for experiment 7 is the Naïve Bayes model, which has a PPV of 100%.

The PPV values are calculated using values from the confusion matrices obtained from the test results, as shown in Table 4. The CNN model obtained an accuracy of 72% and a PPV value of 69%, which indicates that the model could predict 69% of asthma patients correctly. The random forest and the logistic regression obtain a PPV value of about 72% and 79%, whereas the ensemble model achieves 93%. The MLP models achieve an accuracy of about 79% and a PPV value of over 88%. From the results above, the MLP model is considered the best, with 43 true positives and an accuracy of 71%. The test dataset is a similar one used in the searches; hence, the results can be compared as shown in Table 4.

The training data used for the searches and the GP reviews combined slightly vary from those provided to the models. Data such as results from further tests or any specific notes obtained from other practitioners during the review process may not be available for the models to use. The data used for the machine learning analysis is as-is from the practice, and no other data has been collected for the analysis. This also includes free text narratives, other typed documents, and scanned attachments contained in the medical notes recorded by the doctors. The features used as inputs to the models have been extracted from the practice data as closely as possible, as shown in Table 1. Although data has been recorded from more than 20 years ago, it is very sparse up until a few years back. The same can be mentioned regarding the medication data. They are only available from 2016 at the moment. This means that the QOF asthma register can be built from 2016 onwards using the medication data and events data. Therefore, the cut-off date used for experiment 5 would be accurately available from 2016 onwards for patients not on the register already, indicating that the amount of data for these patients might not be as complete as provided for the Smart Searches^TM. But despite these data issues, the models can identify asthma patients with an average accuracy of over 70%. The ML models can help identify asthma patients and correctly predict non-asthma patients. This helps with saving time for the GPs reviewing a lesser number of patients. While the searches extract a certain number of patients, there is no indication of the number of patients who could be asthmatic and non-asthmatic. In contrast, the ML models can accurately indicate who could be asthmatic and non-asthmatic. For example, the ensemble model, from the Table 4 above can correctly identify 43 patients as asthmatic and indicate 3 (False Positives) as suspected asthmatics out of 114 patients. So, the GPs now would require to conduct reviews on the 46 patients instead of the entire set of 114 patients. Similarly, the MLP model can identify 43 patients as asthmatic and six as suspected asthmatic patients. The GPs had taken, on average, five minutes to review the case notes from the results generated using Smart Searches^TM. Therefore, the amount of time spent on the reviews with the searches output would be about twice the amount of time compared to when ML methods are used, where fewer reviews are required. For example, 53 out of 99 patients were identified as asthmatics by the searches. In that case, this confirmation was only arrived after conducting 99 reviews. With the ML methods, a smaller but more precise number of patients are extracted, and the GPs can confirm these patients in fewer reviews and hence in less time. This can be seen in the below chart in Figure 2. The chart shows the number of GP review hours required if searches and ML methods are used. The Smart Searches^TM method would require about ~15 hours of reviews, whereas the ML methods would require about 10 hours of review time on average. Figure 3 shows the number of patients identified as asthmatic in one GP day of 7.5 hours using different methods. The Smart Searches^TM would help identify 48 patients in 7.5 hrs of GP time. In contrast, the ML models would, on average, help identify 74 (66-82 CI 95%) patients, that is around 26 (18-34 CI 95%) patients more than the Smart Searches^TM method, considering the amount of time taken to review patients obtained from different methods. This may lead to missing fewer patients and hence help provide better patient care.

This study aimed to understand the benefits of the application of machine learning in identifying asthma patients in primary care. The goal was to find better ways of improving the quality of asthma patient care in primary care settings through reduced GP time in identifying asthma patients. Some patients could be asthmatic but not yet identified, or those that have been identified and not tracked and hence missing out on quality care. Real unmodified data from one practice was used in the study to evaluate the benefits of using ML methods to help identify asthma patients. Previously, Smart Searches™ was used to extract suspected asthma patients whose asthma status was confirmed using GP reviews. With ML methods, asthma patients are being identified, along with non-asthma patients, which eliminates the need to review all extracted patients. The ML methods are found to reduce GP time in identifying 100 asthma patients to approximately 8.3 GP review hours versus the 15.6 hours from Smart Searches™.

Consequently, the results indicate that more patients could be identified in one GP working day when ML methods are used compared to the existing system. The review process could potentially improve the model performance when the results from the reviews are fed back to the data. The information from new patients tested and reviewed could be used to retrain the models as required with appropriate labelling and thus help improve the accuracy of the models.

MLP	Multi-Layer Perceptron
CNN	Convolutional Neural Networks
GP	General Practitioner
ML	Machine Learning
PPV	Positive Predictive Value
AUC	Area Under the Curve
ROC	Receiver Operating Characteristic
SMOTE	Synthetic over-sampling technique
QOF	Quality outcomes framework

Acknowledgements

NHSX

Dan Schofield

Data Care Solutions

Shakespeare Health Centre

Funding

The Health Foundation

Availability of data and materials:

The data that support the findings of this study are available from Shakespeare Health Centre and Heathrow Medical Centre however restrictions apply to the availability of source data, which were used under license for the current study, and so are not publicly available. Anonymised data may however available from the authors upon reasonable request and with permission of Shakespeare Health Centre and Heathrow Medical Centre.

Ethics approval and consent to participate:

This study was a retrospective data analysis by the care team for comparison with established diagnosis. Any discrepancy was managed by the patient’s own GP on a case-by-case basis. The need for NHS Research Ethical Committee approval was assessed independently by the data controllers at Heathrow Medical Centre and Shakespeare Health Centre (patient’s own GP practice). The NHS Research Ethics Committee tool was used http://www.hra-decisiontools.org.uk/ethics/ in this process. The results found this analysis did not require NHS Research Ethical Committee approval.

All procedures were performed in accordance with General Data Protection Regulation (GDPR), NHS Digital, Medical Research Council (MRC), and NHS health research guidelines.

The need for informed consent was waived by the patient’s own GP practice (Heathrow Medical Centre and Shakespeare Health Centre) as the legal basis for processing data was a ‘task in the public interest’.

https://www.hra.nhs.uk/planning-and-improving-research/policies-standards-legislation/data-protection-and-information-governance/gdpr-guidance/what-law-says/consent-research/

Please contact Dr Sukin Natarajan on any further queries with regards to ethics approvals.

Competing interests:

The Health Foundation funded this research project.

All authors also work for Data Care Solutions, a healthcare consultancy company, that supports primary care providers with healthcare data analytics and service provision.

Consent for publication:

The authors provide their consent to publish the paper.

Authors' contributions:

All the authors wrote the manuscript, 2 - Sukin and 4 - Dhruva prepared the figures and tables and all authors reviewed the manuscript.

Authors' information:

Author details:

Name	Email
Bhuvana Dhruva (corresponding author) Shakespeare Health Centre	[email protected]
Dr Sascha Khakshouri Heathrow Medical Centre	[email protected]
Dr Sukin Natarajan Heathrow Medical Centre	[email protected]
Dr Jay Verma Shakespeare Health Centre	[email protected]

Agarwal S, Yadav AS, Dinesh V, Vatsav KSS, Prakash KSS, Jaiswal S. By artificial intelligence algorithms and machine learning models to diagnosis cancer. Materials Today: Proceedings. 2021 Jul 24;
Zhao X, Ang CKE, Acharya UR, Cheong KH. Application of Artificial Intelligence techniques for the detection of Alzheimer’s disease using structural MRI images. Biocybernetics and Biomedical Engineering. 2021 Apr 1;41(2):456–73.
Attia ZI, Kapa S, Lopez-Jimenez F, McKie PM, Ladewig DJ, Satam G, et al. Screening for cardiac contractile dysfunction using an artificial intelligence–enabled electrocardiogram. Nature Medicine. 2019 Jan 7;25(1):70–4.
Tomita K, Nagao R, Touge H, Ikeuchi T, Sano H, Yamasaki A, et al. Deep learning facilitates the diagnosis of adult asthma. Allergology International. 2019 Oct 1;68(4):456–61.
Saglani S, Custovic A. Childhood Asthma: Advances Using Machine Learning and Mechanistic Studies. Am J Respir Crit Care Med. 2019;199(4):414–22.
Kukreja S. A Comprehensive Study on the Applications of Artificial Intelligence for the Medical Diagnosis and Prognosis of Asthma.
Kaplan A, Cao H, FitzGerald JM, Iannotti N, Yang E, Kocks JWH, et al. Artificial Intelligence/Machine Learning in Respiratory Medicine and Potential Role in Asthma and COPD Diagnosis. Journal of Allergy and Clinical Immunology: In Practice. 2021 Jun 1;9(6):2255–61.
He Z, Feng J, Xia J, Wu Q, Yang H, Ma Q. Frequency of signs and symptoms in persons with asthma. Respiratory Care. 2020 Feb 1;65(2):252–64.
Bose SI, Kenyon CC, Masino ID AJ. Personalized prediction of early childhood asthma persistence: A machine learning approach. 2021; Available from: https://doi.org/10.1371/journal.pone.0247784
Yu G, Li Z, Li S, Liu J, Sun M, Liu X, et al. The role of artificial intelligence in identifying asthma in pediatric inpatient setting. Ann Transl Med [Internet]. 2020;8(21). Available from: http://dx.doi.org/10.21037/atm-20-2501a
QOF Database. 2021.
Classification: Official Quality and Outcomes Framework guidance for 2021/22 2 Contents [Internet]. Available from: https://www.gov.uk/government/publications/nhs-primary-medical-services-directions-2013
Scottish Intercollegiate Guidelines Network., British Thoracic Society., Healthcare Improvement Scotland. British guideline on the management of asthma: a national clinical guideline. 208 p.
Galli E, Rossi P, Mancino G, Brunetti E, Auricchio G, Gianni S. Atopic dermatitis and asthma. Allergy and Asthma Proceedings. 2007 Sep;28(5):540–3.
Bergeron C, Hamid Q. Relationship between Asthma and Rhinitis: Epidemiologic, Pathophysiologic, and Therapeutic Aspects.
Abdulaal A, Patel A, Charani E, Denny S, Alqahtani SA, Davies GW, et al. Comparison of deep learning with regression analysis in creating predictive models for SARS-CoV-2 outcomes. BMC Medical Informatics and Decision Making. 2020 Dec 1;20(1).
Ranganathan P, Pramesh C, Aggarwal R. Common pitfalls in statistical analysis: Logistic regression. Perspectives in Clinical Research. 2017 Jul 1;8(3):148–51.
Gholami R, Fakhari N. Support Vector Machine: Principles, Parameters, and Applications. In: Handbook of Neural Computation. Elsevier; 2017. p. 515–35.
Divya R, Shantha Selva Kumari R, the Alzheimer’s Disease Neuroimaging Initiative. Genetic algorithm with logistic regression feature selection for Alzheimer’s disease classification. Neural Comput & Applic. 2021;33:8435–44.
Breiman L. Random Forests. Vol. 45. 2001.
Opitz D, Maclin R. Popular Ensemble Methods: An Empirical Study. Journal of Artificial Intelligence Research. 1999 Aug 1;11:169–98.
Chang AC. Machine and Deep Learning. In: Intelligence-Based Medicine. Elsevier; 2020. p. 67–140.
Lundberg SM, Allen PG, Lee SI. A Unified Approach to Interpreting Model Predictions [Internet]. Available from: https://github.com/slundberg/shap

Table 1: Summary of the datasets and the pre-processing information

Experiment	Training			Testing
	Group	Pre-processing		Group	Pre-processing
Experiment 1	Audited patients	review date as cut-off	None	all non-audited patients	No cut-off date	Asthma codes removed
Experiment 2	Audited patients	no cut-off date	Asthma codes and medications removed	all non-audited patients	No cut-off date	Asthma codes and medications removed
Experiment 3	non-audited patients	No cut-off date	Asthma codes removed	all audited patients	No cut-off date	Asthma codes removed
Experiment 4	non-audited patients	first asthma code date	none	Audited patients	First asthma code date	none
Experiment 5-7	Non-audited patients	First asthma register entry date	Asthma diagnosis codes removed	Audited patients	First asthma register entry date	Asthma diagnosis codes removed

Table 2: QOF Asthma Code Clusters

Cluster ID	Cluster description
AST_COD	Asthma diagnosis codes
ASTCONTASS_COD	Assessment of asthma control using a validated asthma control questionnaire codes
ASTEXACB_COD	Codes indicating the number of asthma exacerbations
ASTINVITE_COD	Invite for asthma care review codes
ASTMONDEC_COD	Codes indicating the patient has chosen not to receive asthma monitoring
ASTPCADEC_COD	Codes indicating the patient has chosen not to receive asthma quality indicator care
ASTPCAPU_COD	Codes for asthma quality indicator care unsuitable for patient
ASTPCASU_COD	Asthma quality indicator service unavailable codes
ASTRES_COD	Asthma resolved codes
ASTSPIR_COD	Spirometry codes for asthma
ASTTRT_COD	Asthma-related drug treatment codes

Table 3: Results from Experiments

Experiment	Train Set	Model	Test Set	Test Accuracy	Test AUC%
Experiment 1	Audit	MLP	Non-Audit	41.58%	44%
Experiment 2	Audit	MLP	Non-Audit	28%	37%
Experiment 3	Non-Audit	CNN	Audit	49%	50%
Experiment 4	Non-Audit	MLP	Audit	53%	61%
Experiment 5	Non-Audit	MLP	Audit	72%	77%
Experiment 6	Non-audit with new data – testing with models only	LR	Audit with new data	44%	49%
Experiment 7	Non-audit (retrained models with new data)	NB	Audit	72%	79%

Competing interest reported. The Health Foundation funded this research project.

All authors also work for Data Care Solutions, a healthcare consultancy company, that supports primary care providers with healthcare data analytics and service provision.

S1FeatureSet.pdf
S1: Feature Set This is available in the supplementary file.

Application of ML methods in identifying patients with asthma in primary care

Status:

Version 1

Abstract

Background:

Methods:

Results:

Conclusions:

Figures

Background

Methodology

Feature Set:

Evaluation:

Results

Discussion

Conclusion

Abbreviations

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1