3.1. Sampling optimization
3.1.1. Involved factors and fitting equations
In our study, Box-Behnken design was involved in attempt to optimize the sampling procedure for feces samples using a µ-CTE produced by Markes International. Tested parameters were as follow: extraction temperature, sampling time and gas flow. All of them were selected as coded independent variables (factors) at three levels: minimum (-1), central (0) and maximum (1). The factors were tested by carrying out 17 random runs (each one in duplicate), including five central points. The used values were: temperature: 24°C, 32°C, 40°C; sampling time: 10 min, 20 min and 30 min; gas flow: 20 mL/min, 35 mL/min and 50 mL/min. The effect of factors was observed for: number of total detected peaks, total peaks area and total intensity of the peaks.
Four polynomial models: linear, two-factor interaction (2FI), quadratic and cubic model, were statistically evaluated. The linear and 2FI models were not significant, while the cubic model was aliased. This means that there were not enough unique design points to independently estimate all the coefficients for those models. Thus, the quadratic model was selected to build the response surface in the subsequent optimization process.
To assess the impact of each factor on the observed responses, the second-order polynomial model (Eq. 1) described the relation between independent variables and the obtained responses.
𝑌 = β0 + Σβ𝑖𝑋𝑖 +Σβ𝑖𝑖𝑋𝑖 2 +Σβ𝑖𝑗𝑋𝑖𝑋𝑗 (1)
where, Y is the obtained response, Xi and Xj are the independent factors, β0, βi, βii, and βij are the regression coefficients for intercept, linear, quadratic, and interaction coefficients terms.
The second order polynomial equations (Eq. 2 to 4) were used to express the obtained responses as a function of coded and independent variables, where Y = response; X1: Extraction temperature (°C); X2: extraction time (minutes); X3: flow rate (mL/min);
Y1 [number of the peaks] = 122.60–7.87 X1 + 7.81 X2 + 4.69 X3 + 7.37 X1 X2 + 6.88 X1 X3 – 7.00 X2 X3 – 27.05 X12 – 5.68 X22 + 7.58 X32 (2)
Y2 [peak areas] = 4.61E + 07–1.98E + 06 X1 + 7.31E + 06 X2 + 6.13E + 06 X3 – 3.18E + 06 X1 X2 – 3.22E + 06 X1 X3 – 6.54E + 06 X2 X3 – 1.67E + 07 X12 – 5.82E + 06 X22 – 7.36E + 06 X32 (3)
Y2 [peak intensity] = 1.04E + 07–4.76E + 05 X1 + 1.98E + 06 X2 + 1.45E + 06 X3 – 5.33E + 05 X1 X2 – 4.91E + 05 X1 X3 – 7.14E + 05 X2 X3 – 3.75E + 06 X12 – 3.28E + 05 X22 – 9.42E + 05 X32 (4)
3.1.2. Analysis of variance (ANOVA) for quadric polynomial model
For evaluation of quadratic model criteria such as: assessment of “adjusted R-squared” and “predicted R-squared” were applied. The “Predicted R-Squared” was in reasonable agreement with the “adjusted R-Squared”, while the recorded difference was less than 0.2 in all three cases (see Table S3). However, ANOVA test was used for evaluation of goodness-of-fit of the proposed design and demonstrated that all three models were significant based on R2 (0.9385 to 0.9665), lack of fit (p > 0.05) and p-values (0.0018 to 0.0002). The R2 values indicate that between 93% up to 96% of the changes observed in the responses were obtained under the combination of factors and could be explained by the model equations. Furthermore, low values of coefficient of variance (between 4.7 % and 15.69 %) were obtained, which indicates an adequate precision of the experimental values. The p-values lower than 0.0500 indicates a corresponding model, while smaller p-value are associated with more significant result. Our obtained p-values show that there is only between 0.02% up to 0.18% chance that large F-values could occur due to the noise in experiment. This implies that all designed quadratic models are highly significant. Lack of Fit (and the obtained F values) are other important index in evaluation of model reliability because the regression equation and coefficient of determination are evaluated to test the model fitting. Our obtained F values (between 0.34 and 1.05) indicates that lack of fit is not significant compared to pure error. Detailed ANOVA test results for the quadratic polynomial model are presented in Table S3.
The actual versus predicted responses are plotted in Fig. 2, part I. The actual values are obtained data, while the predicted values are produced by the model. By observing Fig. 2 part I, we can understand that all three models (A, B and C) are able to predict well the actual response. Residuals are defined as deviation between predicted and actual values. A good model is expecting that the obtained responses will follow a normal distribution along the straight line (see Fig. 2 part II). If an evident S-shape curve is formed, this denotes that the residuals are not following a normal distribution; consequently an inappropriate model was obtained. According with Fig. 2 part II, the Studentized residuals follow a normal distribution. The results presented in Table S3 and Fig. 2 indicates that there is not any evidence pointing out possible errors regarding the adequacy of the model.
3.1.3. The influence of investigated factors on the obtained responses
3.1.3.1. Single factor influence
The influence of each investigated factor on the obtained responses is depicted in Fig. 3. Part A is presenting the influence of temperature variation when sampling time and gas flow were kept constant at central points (sampling time 20 min and gas flow 35 mL/min). As highlighted, for all three models the best responses were obtained around the value of 32°C. Presumably, this means that lowest temperature (24°C) in not enough to fully volatilize the components, while the highest (i.e. 40°C) can produce a series of problems that lead to the loos of volatiles, their conversion in more different types of volatiles and/or some other specific issues. For example, it was reported that microbial fermentation processes are increasing at higher temperatures 37,38, while different patterns of amino-acids decomposition by bacteria in biological samples were also referred 16,19. However, since VOCs have specific optimal temperatures of volatilization, optimal temperature choice depends on VOC of interest, which may differ according with desired applications and on system used for analysis. In our case, we could observe in the chromatograms of samples analyzed at 40°C unseparated (overlapped) wide peaks. Nevertheless, in the mentioned chromatograms both peaks intensity and peaks area were lower compared with the chromatograms recorded at lower temperatures. Regarding the targeted VOCs, we were more interested in highly volatile short chain alcohols acids and other components containing nitrogen and sulfur.
In Fig. 3 part B the influence of sampling time is presented when temperature and gas flow were constant (temperature 32°C and gas flow 35 mL/min). It can be observed that in all three cases sampling time has a direct influence on the obtained responses, longer sampling time giving higher responses. In in case of number of the peaks and peaks area the trend line arrived at the plateau, while in case of peaks intensity is still increasing. This presumably can mean that a longer sampling time could have given a higher response.
When gas flow was investigated at constant sampling time (20 min) and temperature (32°C), the trend line was rising with increasingly of flow in case of peaks number (Fig. 3 part C). In case of peaks area, the plateau was reached between 38 and 44 mL/min, followed by a slight decrease of the trend line when the flow was increased more than 45 mL/min. However, continuous increasing tendency was observed in case of peaks intensity, with plateau tentative at the maximum flow. Consequently, we can affirm in this step that longer sampling time as well as higher gas flow and temperature around 30°C will facilitate the obtaining of the best possible results
3.1.3.2. Two factors influence
Preliminary solutions were obtained after single factor influence checking. Nevertheless, the main aim of our study was to develop a method that can provide the best solution to be used by a combination of the mentioned factors, once in the sampling process the variables can be influenced between themself. Consequently, two factors influence, when a third one (named actual factor) is kept constant at the central point is highlighted in Fig. 4 and will be further discussed. In case of peaks number (Fig. 4 part A), when the actual factor was gas flow (kept constant at 35 mL/min) it was observed that a middle temperature (around 30°C) and middle extraction time (approximatively 20 min) provided the higher response (A1). By maintaining the extraction time constant, middle temperature and high gas flow (> 45 mL/min) provided the best solutions (A2). Eventually, at constant temperature (32°C) it was observed that highest gas flow (tending to the maximum value), could lead to the detection of the maximum possible number of peaks, while the extraction time did not present significant influence (A3). These findings highlight that in case of detected number of peaks, the most relevant factor is flow rate. Moreover, by observing the last “umbrella” of Fig. 4, part A (A3) it is suggested that the usage of higher gas flow could have led to a higher number of detected peaks.
For peaks area (Fig. 4, part B) a very good fitted model was obtained, once the figures shape is like well contoured umbrellas, and the color scale in going from the lowest to the highest values (blue to red). It was highlighted that when the actual factor was gas flow, extraction time between 25 and 30 minutes at about 32°C is giving a good response (B1). If the actual factor was considered extraction time, a temperature around 30°C and a gas flow of 40 mL/min provided the highest response (B2). In the third case (B3), when the actual factor was temperature, the highest extraction time (30 min), when using a gas flow between 35 and 40 mL/min provided the highest peaks area. By observing Fig. 4, part B3 it is highlighted that longer extraction time could have led to the recording of higher peak areas. In case of this model, gas flow and extraction time were more influential factors than temperature.
Finally, the influence of factors on peaks intensity is presented in Fig. 4, part C. Part C1 (gas flow = constant at 35 mL/min) is showing that at highest extraction time (30 min) and middle temperature (around 30°C) is leading to the highest intensities. Moreover, it may be observed that in Part C1 that longer extraction time can provide peaks of higher intensity. Nevertheless, high gas flow (50 mL/min) and temperature near to 30°C provided good solution, although a gas flow > 50 mL/min would probably have produced more intense peaks, as shown in Part C2. It can be observed in Part C3 (where the actual factor was temperature) that a gas flow between 35 and 45 mL/min and a sampling time ≥ 30 minutes led to obtaining of maximum intensity, although a higher extraction time could have resulted in more intense peaks. These findings suggest that in case of peaks intensity the most relevant factor was extraction time.
3.1.4. Obtained solutions
The final obtained solutions were predicted by the software as a result of experimental measurements and total interactions between the three selected factors. The predicted solutions, that were verified than experimentally for each model, as presented in Table 1.
Table 1
Obtained solutions as a result of optimization of sampling parameters
Approach | Temp (°C) | Extrac-tion time (min) | Gas flow (mL/ min) | Predicted response | Standard error | Desirability | Measured response |
R1 (No of peaks) | R2 (Area of peaks) | R3 (Intensity of peaks) |
Number of peaks | 30.661 | 19.612 | 48.687 | 132.678 | 2.879 | 1.000 | 137 | 4.60E + 07 | 1.31E + 07 |
Area of peaks | 30.948 | 25.528 | 38.004 | 4.89E + 07 | 2.25E + 06 | 0.936 | 128 | 5.51E + 07 | 1.67E + 07 |
Intensity of peaks | 30.572 | 29.242 | 46.887 | 1.23E + 07 | 5.96E + 05 | 1.000 | 91 | 4.41E + 07 | 1.17E + 07 |
R1 – the obtained response when the solution acquired for peaks number was used |
R2 – the obtained response when the solution acquired for peaks area was used
R1 – the obtained response when the solution acquired for peaks intensity was used
Because the software affords for selection of solutions (in term of goal and importance), thus allowing the prioritization of options according to needs, certain criteria were chosen in attempt to obtain the best solutions, that will fit our model. Thereby, maximum importance was accorded to each factor (temperature, sampling time and gas flow), while the selected goal was “in range”. To the obtained response we assigned also maximum importance and the selected goal was “maximize”, while in case of error, the selected goal was “in range” and the selected importance was medium. According to Table 1, the most suitable parameters to be used are the parameters obtained according with peak numbers (highlighted in bold).
3.2. Testing of developed model
In order to test the validity of the developed model, Correlation Analysis and Network analysis were run. In case of correlation, parametric tests were used once they are “a statistical tool expecting a linear correlation between the investigated variables coming from the same source, and/or they have the same measure unit” 39. Two different approaches were tackled: correlation between all 205 detected components in the whole matrix (including detected components appearing in pool samples, in samples from each volunteer and in blanks) – represented in Fig. 5; and correlation between all 20 included volunteers and pool samples – as presented in Fig. 6. Network analysis highlighted in Fig. 7 is presenting discrimination between the investigated groups, act that confirms the validity of the model.
3.2.1. Correlation based on detected components
A Pearson moment product correlation was used to highlight the correlation that occurs between the detected components. Because the matrix was complex (including 205 VOCs) a snap shot representation only could be presented in Fig. 5 in the form of a heat map. The correlation values significant at 0.01 level were between r(205) = – 446 to 0.999, which pointed out in a low up to very strong positive or negative correlation. Moreover, the correlation values significant at a 0.05 level accounted for r(205) = – 0.560 to 0.426, denotative of a moderate correlation. The strong and very strong correlation that was achieved when p was significant at 0.01 level (p = 0.01), was a positive correlation. All the components detected in blanks presented a positive correlation with each other (r(205) = 0.145, p = 0.05 up to 0.982, p = 0.01). With exceptions the components detected in pooled samples (accounting the number of 146) positively correlated the detected components. An example was for example, ethyl acetate and 1,3,5-trifluorobenzene, that did not present any correlation with the pool samples, but they were however highlighted as components originated from blank tubes.
Some other examples were VOCs detected in case of limited number of volunteers only, in relatively small amounts, reason why they was not detectable in pool samples. Less often than positive correlation, negativecorrelation (r(205) = – 0.56, up to – 0.15, p = 0.05) was also observed. Components that exhibited the most often negative correlation with other present in the matrix were: 2-propanamine 2-methyl-; fumaronitrile; trifluoromethyltrimethylsilane; 2,5-hexanediol; cyclotetrasiloxane, octamethyl-; cyclopentasiloxane, decamethyl-; biphenyl; dibutyl phthalate; cyclohexane, isothiocyanato-; heptasiloxane, 1,1,3,3,5,5,7,7,9,9,11,11,13,13-benzene, 1,1'-(4,4-dimethyl-1-butene-1,4-diyl)bis-tetradecamethyl- and five more that could not be identified, and remained unknown. They were generally associated with components coming from tubes, septa and columns bleeding. These findings suggest that the background may affect the obtained results, however generally low values of negative correlation were observed.
However, the most of the VOCs common between pooled samples and volunteers’ samples presented moderate or low positively correlation. According with these findings we can assume the validity of the model, while we find positive correlation between samples and pools when investigating the detected output (the VOCs). This denotes also that the model worked correct and the pools were representative.
3.2.2. Correlation based on investigated samples
As a second approach, a Pearson moment correlation was conducted also to check the correlation and the level of significance between pool samples and samples coming from each volunteer. The whole VOCs profile of each sample was used in this boarding. The correlation matrix was designed based on the hierarchical clusters’ analyses obtained using the method “average linkage between groups” within the interval “squared Euclidian distance” and presented in Fig. 6. The dendrogram of clusters’ analysis shows the formation of seven main clusters with different levels of significance. The samples coming from 11 volunteers fused together in one cluster with similar distance level in the left part of dendrogram, finding that suggest that they had kindred VOCs composition. The samples #20, #3, #4, #14 and #5 clustered next to them, highlighting less similarities compared with the first 11 mentioned. Nevertheless, positioned at the opposite site, in a rather arbitrarily manner were the clusters corresponding to volunteers #17,#9, #15 and #1, all of them with different levels of significance. These denote that the VOCs composition of the mention samples presented the most different features when compared with others. All five pool samples with the same level of similarity were segregated, between the other clusters, drawing actually the border between those with kindred VOCs profiles and those that presented most different features. Observing pools positioning in Fig. 6, we can assume that they were representative for the batch of analyzed samples, and consequently we presume also the validity of the developed model, when the analyzed samples were considered as variables.
Regarding the heat map it can be observed that the highest correlation occurred, as expected between the pool samples (r(23) = 0.762 up to 0.970, p = 0.01). The pools were also well correlated with the samples. However they were less correlated with the samples coming from the volunteers #1, #3 and #6. These samples were associated with the three cases where the amount of sample was insufficient to add to the pool. Except of this, positively correlation from low to very high was accounted between samples originated form volunteers. As a general conclusion, all these findings indicate that there is relevant correlation between the investigated samples, both when the investigated variables were the detected VOCs and when each sample was investigated based on the entire profile, even if these characteristics are far different and in appearance not linked-up with each other.
3.2.3. Discrimination between the groups based on Network Analysis
The dataset representing distributions of 205 detected VOCs was used to build the network analysis model (Fig. 7). Consequently, based on the obtained profiles and using the incidence of the peaks, a network analysis was created in order to obtain an exploration of the data. R studio with console (version 3.6.3, Boston, MA, USA) was used for network analyses. The model separated the three investigated groups, by leading into the formation of three cluster groups (group of pools, the group of volunteers and the blanks). In addition, VOCs detected just in volunteers’ samples have been dispersed around the mentioned group (purple circles), common VOCs between pools and volunteers were located mostly between the two groups, and the VOCs detected in blanks were scattered into the left-down part (black diamonds). Regarding the number of VOCs used in network analysis, 2 were specific for blanks, 59 for volunteers, while 146 VOCs were common between the pools and volunteers. We presume that most of VOCs presented in Fig. 7 are endogenous generated by the organism, as a normal process of metabolism. However, certainly part of VOCs are exogenous absorbed by the organism from the environment and eliminated through feces. However, variability in VOCs coming from different subjects was observed, and this phenomenon is related to several factors, among which diet, living style, personal habits, etc.