Background
Mutations in an enzyme target are one of the most common mechanisms whereby antibiotic resistance arises. Identification of the resistance mutations in bacteria is essential for understanding the structural basis of antibiotic resistance and design of new drugs. However, the traditionally used experimental approaches to identify resistance mutations were usually labor-intensive and costly.
Results
We present a machine learning (ML)-based classifier for predicting rifampicin (Rif) resistance mutations in bacterial RNA Polymerase subunit β (RpoB). A total of 66 resistance mutations were gathered from the literature to form positive dataset, while 53 residue variations of RpoB among a series of naturally occurring species were obtained as negative database. The features of the mutated RpoB and their binding energies with Rif were calculated through computational methods, and used as the mutation attributes for modelling. Classifiers based on four ML algorithms, i.e. decision tree, k nearest neighbors, naïve Bayes and supporting vector machine, were developed, which showed accuracy ranging from 0.69 to 0.76. A majority consensus approach was then used to obtain a new classifier based on the classifications of the four individual ML algorithms. The majority consensus classifier significantly improved the predictive performance, with accuracy, precision, recall and specificity of 0.83, 0.84, 0.86 and 0.83, respectively.
Conclusion
The majority consensus classifier provides an alternative methodology for rapid identification of resistance mutations in bacteria, which may help with early detection of antibiotic resistance and new drug discovery.

Figure 1

Figure 1

Figure 2

Figure 2

Figure 3

Figure 3

Figure 4

Figure 4

Figure 5

Figure 5
This is a list of supplementary files associated with this preprint. Click to download.
Loading...
Posted 17 Dec, 2020
On 01 Feb, 2021
Received 30 Dec, 2020
On 19 Dec, 2020
On 17 Dec, 2020
Invitations sent on 16 Dec, 2020
On 15 Dec, 2020
On 15 Dec, 2020
On 15 Dec, 2020
On 11 Dec, 2020
Posted 17 Dec, 2020
On 01 Feb, 2021
Received 30 Dec, 2020
On 19 Dec, 2020
On 17 Dec, 2020
Invitations sent on 16 Dec, 2020
On 15 Dec, 2020
On 15 Dec, 2020
On 15 Dec, 2020
On 11 Dec, 2020
Background
Mutations in an enzyme target are one of the most common mechanisms whereby antibiotic resistance arises. Identification of the resistance mutations in bacteria is essential for understanding the structural basis of antibiotic resistance and design of new drugs. However, the traditionally used experimental approaches to identify resistance mutations were usually labor-intensive and costly.
Results
We present a machine learning (ML)-based classifier for predicting rifampicin (Rif) resistance mutations in bacterial RNA Polymerase subunit β (RpoB). A total of 66 resistance mutations were gathered from the literature to form positive dataset, while 53 residue variations of RpoB among a series of naturally occurring species were obtained as negative database. The features of the mutated RpoB and their binding energies with Rif were calculated through computational methods, and used as the mutation attributes for modelling. Classifiers based on four ML algorithms, i.e. decision tree, k nearest neighbors, naïve Bayes and supporting vector machine, were developed, which showed accuracy ranging from 0.69 to 0.76. A majority consensus approach was then used to obtain a new classifier based on the classifications of the four individual ML algorithms. The majority consensus classifier significantly improved the predictive performance, with accuracy, precision, recall and specificity of 0.83, 0.84, 0.86 and 0.83, respectively.
Conclusion
The majority consensus classifier provides an alternative methodology for rapid identification of resistance mutations in bacteria, which may help with early detection of antibiotic resistance and new drug discovery.

Figure 1

Figure 1

Figure 2

Figure 2

Figure 3

Figure 3

Figure 4

Figure 4

Figure 5

Figure 5
This is a list of supplementary files associated with this preprint. Click to download.
Loading...