3.1 grouping of 262 compounds
A dataset of 262 samples was obtained from the BitterDB website and some literature, which compiles information on the bitterness of various foods from several resources, including Gouda cheese, fennel, beer, whole wheat bread crumbs, oats, red wine, asparagus, carrots, and roasted cocoa nibs. These substances mainly include polypeptides, alkaloids, phenols, terpenoids, and saponins (Table S1). In recent years, many studies have been conducted on Quantitative Structure-Activity Relationship modeling using bitter peptides based on molecular descriptors and the efficacy of some molecular peptides. Bitter peptides are mainly produced by proteins during the food fermentation process and offer significant benefits for cardiovascular, neurological, immune, and nutritional systems[22]. Phenolic compounds, which are organic compounds found abundantly in plants, have gained attention as an emerging field of nutrition in recent decades. A growing body of evidence suggests that the intake of polyphenols may play a vital role in regulating metabolism, body weight, chronic diseases, and cell proliferation[23]. Phenolic compounds contribute to the bitterness and astringency of various beverages and fruits, such as oranges, coffee, and tea[1]. Alkaloids, a class of nitrogenous organic compounds, are present in natural constituents of vegetables and drinks, some food processing additives, and pollutants[24]. Most alkaloids exhibit a bitter taste and possess a potent biological activity, some of which may be toxic to humans. Terpenoids (terpenes and their oxygenated derivatives) are important secondary metabolites found in medicinal plants and some well-known foods and beverages, contributing to their pronounced bitterness. Saponins, on the other hand, are secondary plant metabolites composed of steroids, triterpenes, or alkaloids with one or more sugar chains[25]. They are bitter and astringent substances found in certain foods. Additionally, fatty acids, fatty alcohols and other categories are also important bitter substances.
3.2 Binding mode of bitter protein TAS2R14 with bitter substances
The binding residues of the TAS2R14 protein that have been experimentally identified include Trp66, Leu85, Thr86, Asn87, Trp89, Thr90, Asn93, His94, Thr182, Ser183, Phe186, Ile187, Tyr240, Ala241, Phe243, Phe247, Ile263, Gln266, and Gly269 [18]. Molecular docking showed similar results to the experimental outcome, as shown in Fig. 1. The binding pocket of the TAS2R14 receptor is quite spacious and can tolerate agonist structures that largely exceed the size of unmodified agonists [26]. No residues were found in the TAS2R14 binding pocket where mutations resulted in a complete loss of function of all agonists, whereas studies of other receptors have identified such restricted targets. The Asn24 and Ile27 residues of TAS2R1[27]. Gly281, and Ser285 residues of TAS2R4[28], Trp261 residues of TAS2R16[29] have been found to be important binding sites for activating proteins. Moreover, TAS2R14 provides a large number of agonist-selective contacts that may outweigh all other promiscuous TAS2Rs[18].
After screening substances that respond to the appeal TAS2R14 binding site, the docking site map is shown in Fig. 1 and Table S3. And the binding energies of the 96 successfully docked substances are shown in Table S1. Regarding substance categorization, almost all classes bound a portion of their substances to TAS2R14, further demonstrating the broad coordination ability of TAS2R14 [18, 26, 30]. Nearly all of the 24 fatty acids, fatty alcohols, and glycerolipids were successfully docked, and all three belong to long-chain lipids with a very long carbon chain that allows easier access to the centers of the TM1 and TM2 chains and less site blockage. These three substances demonstrate high LibDockScore (> 100). LibDockScore is a scoring algorithm that rates the degree of ligand-protein stabilization[31]. Therefore, it indicates that long-chain analogs can form more stable complexes with TAS2R14 proteins, and some of them have low thresholds (< 1mg/Kg). In addition, most of Cinnamic acids and derivatives and flavonoids bind to TAS2R14.
The mode of interaction between the TAS2R14 receptor and the ligand has received extensive attention. The results of the binding site reflect two patterns, as shown in Fig. 1 and Table S3. while alkaloids, amino acids and their derivatives, and benzenes do not interact with Lys8 and Trp89 and mainly interact hydrophobically with Ile187, Cinnamic acids and their derivatives, Prenol lipids, and flavonoids interact hydrophobically mainly with Leu85 and Trp89, and they partially bind to Ile187. Interestingly, the first group usually had lower LibDockScore and higher thresholds, suggesting that unstable binding affects the bitterness thresholds. It can be found that substances with binding energies below − 100 have low thresholds (< 50mg/Kg), while some substances with high thresholds, such as amino acids, have high binding energies. The magnitude of the binding energy indicates the strength of the interaction, and substances with low binding energies indicate that they interact more strongly with TAS2R14, consequently exhibiting stronger bitterness intensity and lower bitterness threshold[32]. However, there are some exceptions. Long-chain substances have the lowest threshold, yet their binding energies vary, indicating that the threshold is not solely influenced by binding strength. Long-chain substances might more readily bind with TAS2R14 to activate it, but the binding interactions are not necessarily strong.
Figure 2 shows the binding of the five known TAS2R14 activators. In agreement with some findings, Leu85 and Trp89 are amongst the most important binding residues[33]. Four substances, Lupulone, Adlupulone, cis-isohumulone, and trans-Isoadhumulone, were found to interact hydrophobically with Leu85 and Trp89, with the main interactions being alkyl and pi-alkyl. Residue Trp89 is deep in the binding pocket[34]; thus, the ability to bind to residue Trp89 may be more stable. Residue Thr86 also undergoes hydrogen bonding interactions with trans-Isoadhumulone cis-isohumulone, with Thr86 and Asn93 being the most common hydrogen bonding interactions in the study[26, 33]. In this study, both residues also show binding to a great extent, with residue Thr86 having more binding ability than Asn93, especially since the flavonoid glycosides almost all interact with THR86 by hydrogen bonding. In addition, some studies have concluded that flufenamic acid, mainly pi-pi, interacts with Trp89, Phe186, and Phe247[35]. The mechanism of action of the benzoic acid analog TAS2R14 includes p-p interactions between the ligand and Phe186, Phe243, and His94 as well as hydrogen bonding with Asn93 and Thr86[26]. These binding residues were also found in our study. With further research, we will explore more binding sites and activators, which will deepen our study of the binding mode of TAS2R14 and the relationship between binding energy and threshold.
3.3 Feature selection
The interaction between bitter compounds and bitter receptors is the key factor underlying bitterness, and the strength of this interaction determines the intensity of bitterness[32, 36]. Binding energy as a quantitative measure for evaluating the strength of receptor-ligand binding is closely related to the bitter taste threshold. Since only one receptor's binding energy can be used as a feature for modeling, using receptors that are able to bind a wide range of substances will improve our model's ability to predict substance types.
A total of 25 bitter receptors have been identified to bind various types of bitter substances. However, TAS2R10, TAS2R14, and TAS2R46 are broadly tuned receptors known to respond to a variety of natural and artificial bitter compounds, while TAS2R3, TAS2R5, TAS2R8, TAS2R13, TAS2R41, TAS2R49, and TAS2R50, are activated by very few bitter compounds, and TAS2R42, TAS2R45, TAS2R48, and TAS2R60 are currently unidentified substances[1]. TAS2R10 and TAS2R14 showed less preference among agonists, while TAS2R14 could bind more chemical scaffolds and more agonists[26, 30]. Most receptors embody a high degree of selectivity for structure; for example, TAS2R16 and TAS2R38 show a strong tendency to recognize β-dglucopyranosides and N–C = S group in molecules[1]. In addition, TAS2R7 is suited for receptors for metal cations such as zinc, magnesium, calcium, aluminum, copper and manganese[37]. TAS2R46, on the other hand, tends to detect sesquiterpene lactones and related compounds[18].
Molecular descriptors are numerical features used to represent and describe the structure of a molecule, and the MolecularDescriptorCalculato module calculates 208 molecular descriptors, with 93 descriptors remaining after removing 0 and most of the descriptors with 0 (and adding binding energies) as shown in Table S2. Feature selection was performed on the training set by embedding method decision tree. The top 15 features selected according to gain are shown in Fig. 3. Due to the stochastic nature of decision tree feature selection, to prevent some features from being missed, some of the features were selected from the remaining 78 features that were considered to be important at the time the other bitterness models were built. Weichen Bo et al[38]. used an artificial neural network algorithm to build a bitter-non-bitter classification model, and based on the important properties they derived to classify bitter substances, we chose heavy atom molecular weight (HeavyAtomMolWt), the hydrophobicity feature (BCUT2D_LOGPHI), backbone atom electronegativity (EState_VSA1 and EState_VSA4), molecular shape (Kappa2). MinEStateIndex was added based on the conserved site modeling[33], in addition, BitterIntense, as a model for classification thresholding, also provides two important features, heavyatom and molar refractivity. HeavyAtom is replaced by HeavyAtomMolWt, molar refractivity is replaced by SMR_VSA5. Finally, we added molecular descriptors, as shown in Fig. 3, and them a correlation analysis was performed, removing substances with too much correlation to reduce multicollinearity, and the final results are shown in Fig. 3. The predictive model with higher R-squared bitter taste thresholds was developed through multiple selections, and the feasibility of the binding energy as a feature was demonstrated by the fact that the binding energy was ranked 12 and successfully retained from multiple selections. Since the randomness of the embedding method does not fully represent the importance of the features, they will be further evaluated by the SHAP algorithm in the modeling below.
3.4 Machine learning model analysis
Four regression models, namely RF, XGBoost, CatBoost and GBDT, were employed for the prediction of bitterness thresholds, and the prediction results are shown in Table S4. Figure 4 displays the optimal model results for each algorithm, which demonstrates a good correlation in both the training set and the test set. The R2 values of PT in the training set were 0.916, 0.893, 0.948 and 0.893, respectively. Similarly, the R2 values of PT in the test set were 0.906, 0.889, 0.936, and 0.877, respectively. The slight difference between the R2 of the training set and the test set indicates a low risk of overfitting. The findings demonstrate that all four machine learning models could successfully estimate the PT of a substance, with the CatBoost algorithm showing the best results in both the training and test sets. To further evaluate the models, three metrics, namely MAE, MSE, and RMSE, were utilized. These three indicators assess the degree of error between the true and predicted values, and the closer the three indicators are to 0, the better the model works. Table 1 demonstrates that the MAE, MSE, and RMSE exhibit opposite tendencies to R2 for these four models. Among these models, the three indicators of CatBoost were the smallest (i.e., 0.936, 0.317, 0.174, 0.417, respectively). Hence, the CatBoost model demonstrated the highest effectiveness and can be used to forecast the bitterness threshold in future studies. In addition, models based on RF and GBDT algorithms also displayed good results.
Table 1
Algorithm | R2 | MAE | MSE | RMSE |
GBDT | 0.877 | 0.477 | 0.333 | 0.577 |
XGBoost | 0.889 | 0.423 | 0.299 | 0.546 |
RF | 0.906 | 0.370 | 0.253 | 0.503 |
CatBoost | 0.936 | 0.317 | 0.174 | 0.417 |
Table S4 lists the training and true values for all test sets, and due to the high R-squared of the model itself, most of the substance predictions are almost identical to the true values. However, there are some flaws in the model; the four categories of Chalcones, Flavanols, and Organooxygen compounds appeared less in the training set, the Flavanols category was not trained, and the remaining 2 types had 1–3 substances trained in the training set. As can be observed, the prediction value of Flavanols is too high. At the same time, the other two substances are predicted accurately, indicating that the model is adequately trained for the training set, and it can effectively predict the various structures of the substances appearing in the training set. However, it may have poorer prediction ability of unknown types. This needs to be improved by collecting more data. Secondly, Amino acids, Peptides, Benzoic acids, and derivatives also showed inaccurate predictions, which is a category of substances that the model needs to focus on subsequently. In contrast, the rest of Glycerolipids, Flavonoid glycosides, and Fatty acids were more accurate.
Among the four algorithms used, GBDT, RF, CatBoost, and XGBoost are all integrated algorithms[39]. Ensemble algorithms combine multiple simple models to create a more powerful model, thereby improving prediction performance. An ensemble technique built on a gradient boosting decision tree is called XGBoost or CatBoost, while GBDT and RF are ensemble approaches based on decision trees[39]. The performance of the model heavily relies on the selection of features. Since our feature selection method uses decision trees, our four algorithms are based on decision trees. RF and GBDT are two distinct Decision Tree-based methods. RF constructs numerous randomized decision trees to achieve integration, whereas GBDT iteratively optimizes the model[40]. RF is more suitable for high-dimensional problems. Both CatBoost and XGBoost are GBDT-based algorithms. XGBoost is fast in terms of execution speed and memory efficiency, and it can fit the data better and find the global optimal solution faster, so it is more suitable for large data volumes. CatBoost, on the other hand, employs ordered boosting strategies to prevent prediction bias caused by target leakage [41]. Due to the small sample size, CatBoost has a built-in regularization mechanism that reduces the risk of overfitting and thus performs better when dealing with small samples or noisy data.
3.5 Feature analysis of 4 models
In Fig. 5A, the feature importance of the CatBoost, GBDT, XGBoost and RF model is ranked by SHAP and measured based on the absolute value of the average Shapley value. Among these models, BCUT2D_MRLOW, Chi2v, PEOE_VSA7 are the three highest ranked feature. Figure 5B presents the summary_polt, which illustrates the effect of each feature on the CatBoost model. In the plot, the PEOE_VSA7, Chi2v and SMR_VSA5 and Binding Energy points red and blue areas at the same time on value < 0, Explain that these features have a positive effect on the predicted value of PT (There is some degree of inverse relationship between the eigenvalues and the results predicted by the model). the BCUT2D_MRLOW points are clustered into two regions: the red region with positive SHAP values (> 0) and the blue region with negative SHAP values (< 0). This indicates that higher BCUT2D_MRLOW values have a positive effect on PT, while lower BCUT2D_MRLOW values have a negative effect. Figure 5C shows the force distribution of the CatBoost model (arranged by similarity). Based on the distribution, it can be observed that the predictive pattern of the features has two ways for all the samples of the model, and the main difference is that the BCUT2D_MRLOW feature contributes positively and negatively to the model.
Chi2v is a molecular descriptor based on atomic charge distribution and molecular topology used to encode the electrical, chemical reactivity, and structural characteristics of molecules. The SMR_VSA descriptor explains the molar refractive index of a molecule and its effect within the receptor or on the way to the receptor. SMR_VSA5 is defined as the sum of the vi of all atoms i. pi denotes the contribution to the molar refractive index of atom i calculated in the SMR descriptor, computed in the range of 0.440 to 0.485[42]. The BCUT2D descriptor is based on the Burden matrix, which encodes the bond strengths between atoms in a molecule. It changes the diagonal elements of the matrix to include atomic properties and then performs eigenvalue decomposition to obtain the highest and lowest eigenvalues. PEOE_VSA8 (a molecular surface area descriptor). Mollogp, SlogP_VSA6 is related to the partition coefficient (logP) and is a characterization of the hydrophobicity of the molecule. Binding Energy, as mentioned earlier, is a descriptor for assessing the strength of ligand-receptor binding and is related to hydrophobic interactions, charge interactions, hydrogen bonding interactions, etc. These molecular descriptors and binding energies are important features for constructing bitter taste threshold models. Molecular descriptors are used to quantitatively describe the physical and chemical information of molecules[33]. Thus, the physical properties they respond to influence the magnitude of the bitterness threshold.
Currently, there are several studies on the bittering mechanism, among which the recognized properties are hydrophobicity, which is the main property found in various types of studies, where researchers have suggested that substances with a strong bitter taste are accompanied by a high degree of hydrophobicity [32, 35, 38]. This relationship can be attributed to the influence of hydrophobic molecules on various factors, such as the distribution of molecules in the cell membrane, the ability to bind to bitter taste receptors, and the interaction force between molecules, thereby affecting the perceived intensity of bitter taste. The molar refractivity is a measure of the extent to which the electron density distribution within a molecule can be distorted[16]. Bitter taste prediction models Premexotac [12] and a recent artificial neural network prediction model[38] considered it as an important feature for bitterness classification, while BitterIntense[16] model is a prediction of the bitterness threshold and this model gained a high importance score in molar refractivity, SMR_VSA5 is a molar refractive index descriptor, and it was ranked 6th in the importance of the model, so we got the same conclusion that the distribution of the molecule's electron cloud affects the intensity of bitterness. The binding energy contributed to the construction of CatBoost model and RF model, but its importance score in GBDT and XGBoost model is almost 0. The comparison of the features shows that the importance of the binding energy is the 11th feature for constructing the bitterness threshold after hydrophobicity and Molar refractivity. Although the contribution of binding energy is 0 in GBDT and XGBoost models, the R-square of these two models is low (< 0.9), which suggests that the inability to find the relationship between the binding energy and the bitter taste threshold may be a reason for the low prediction effect of these two algorithms. Finally, this paper concludes that factors such as the electronic environment of the molecule, hydrophobicity, chemical reactivity, and the strength of binding to the TAS2R14 protein (binding energy) influence the magnitude of the bitterness threshold.
3.6 Threshold prediction for bitter substances
Table S5 presents the predictions for 223 compounds, including their names, structure, binding energy, PT, threshold (mg/kg), Chi2v, PEOE_VSA7 and BCUT2D_MRLOW values. Figure 6A shows the distribution of thresholds for 223 substances, and the result shows that the thresholds for most substances are concentrated in the range of 100–300 mg/kg. To further investigate these substances, a three-dimensional scatter plot of three features was drawn based on these substances, as shown in Fig. 6B. From this plot, it can be observed that sample points with close bitterness thresholds are also closer together in the plot and show a gradual change in colour, indicating that the importance of these three features in regression modelling. Secondly, most of the substances with thresholds lower than 1000 are in the region where Chi2v is less than 2.5 and BCUT2D_MRLOW is greater than − 0.5, and the lower the Chi2v value, the higher the BCUT2D_MRLOW value and the lower the threshold. The substances with high thresholds (> 1000mg/Kg) are more likely to be affected by the BCUT2D_MRLOW feature because these substances are all at the bottom of the 3D map space; however, they are also modulated by PEOE_VSA7, which slightly decreases the thresholds of these substances as PEOE_VSA7 increases.