In this study, lithium-titanate battery was discharged with different discharge currents. Using the obtained data set, the discharge capacity of the lithium titanate oxide battery will be estimated using different machine learning methods and the performance results of the used machine learning methods will be compared.
2.1. Lithium Titanate Oxide Batteries
Different battery parameters such as battery voltage, charge/discharge efficiency, energy density, operating temperature, safety, volume, weight, etc. affect consumers' battery preferences. Lithium Titanate Oxide (Li4Ti5O12) is a chemistry that is gaining traction in various markets, unlike the multitude of lithium-ion battery technologies available on the market. Figure 2.1 shows the structure of the LTO cell. In the LTO battery, unlike the traditional lithium ion battery structure, lithium titanate oxide nano material is used instead of graphite in the anode electrode. It is commonly known as Lithium titanate (LTO) or nano material (nLTO). There are different manufacturers such as Leclanche, Altairnano, Microvast and Toshiba, which can produce lithium titanate powder with different chemistries for use in LTO battery cells [10].
Lithium titanate batteries have both longer cycle life and calendar life than commercially available rechargeable battery technologies such as traditional lithium-ion, nickel-metal hydride (NiMH) batteries and nickel cadmium (NiCd) batteries. The energy storage ability of any rechargeable battery will decrease as a result of repeated charge/discharge cycles. When the battery cell is discharged at increasing C levels, an imbalance or deficiency occurs between the anode and cathode materials that enter into a chemical reaction in the battery as a result of overdischarge of the battery. This imbalance in the battery disrupts the ionic mobility of the battery and as a result, the discharge capacity of the battery decreases [11]. A battery's "cycle life" is the number of times it can be charged and discharged without significant reduction in energy storage capacity. Lithium titanate battery is called a zero strain material. The zero strain state means that the material used in battery chemistry does not substantially change shape as a lithium ion moves in and out of the material during charge/discharge processes. Graphite, the most common material in traditional lithium-ion batteries, expands and contracts by up to 8% with each charge/discharge cycle. This continuous change in volume leads to a shorter cycle and calendar life compared to lithium titanate anodes. Due to their zero strain properties, even after 25,000 cycles, LTO cells retain more than 80% of their original charge capacity [30].
One of the main advantages of a lithium titanate battery is its fast charge and discharge rate. Charging rate is the rate at which battery energy is renewed. Discharge rate is the rate at which the energy stored in the battery is transferred. Thanks to the optimization of the materials used in the negative electrodes of lithium titanate battery cells, LTO batteries can charge and discharge quickly. Fast charging/discharging capacity is important for electric vehicles and public transport buses [30].
Figure 2.2 shows the operating voltage and charge state change during the charge/discharge processes of the LTO battery at room temperature. The intersection of the charge, discharge and average curves indicates that the LTO cell has a stable operating voltage.
LTO (Lithium Titanate Oxide) battery offers fast charging, high charge/discharge rate, high safety, high performance and long lifespan. Low cell voltage and high cost are the disadvantages of this battery. It can be used to create power supplies, electric vehicles, solar energy storage and smart grids.
2.2. Equivalent Circuit Model for Lithium Titanate Battery
Different types of equivalent circuit models are available to meet the different needs required by applications. Criteria such as model accuracy, calculation complexity, ease of applicability, etc. are decisive in determining the battery model. Traditionally, the first-order RC parallel network model is used to determine the polarization state of lithium-based batteries that deviate from the equilibrium potential under dynamic conditions. Using the first-order RC parallel network model as the basic model for the lithium titanate battery in this study is also useful in terms of comparison with other different battery models mentioned in the literature [13].
The lithium titanate battery equivalent circuit model in Fig. 2.3 consists of Uocv (open circuit voltage), R0 (ohmic internal resistance) and RC parallel network.
The initial SOC (State of Charge) value is an important parameter for estimating the state of charge and the current power capacity of the battery. The battery model can be expressed by Eq. 2.1. Here CQ indicates the battery nominal capacity and I0 indicates the battery discharge current [13].
\(SOC=SOC\left({t}_{0}\right)–\frac{1}{{C}_{Q}}\underset{{t}_{0}}{\overset{t}{\int }}{I}_{0}\left(t\right)d\left(t\right)\) | (2.1) |
Up (polarization voltage of the battery) can be expressed by Eq. 2.2.
\(\begin{array}{c}{U}_{p}\left(t\right)={U}_{p}{I}_{0}\left(t\right)\cdot [1-{\text{e}}^{-(t-{t}_{0})/\left(\tau \right)}],{t}_{0}<t<{t}_{1}\\ {U}_{p}\left(t\right)={U}_{p}\left({t}_{1}\right)\cdot \left[{\text{e}}^{-(t-{t}_{0})/\left(\tau \right)}\right],{t}_{1}<t<{t}_{2}\end{array}\) | (2.2) |
Here, τ (time constant) is the time constant of the polarization voltage generation process and is defined as Rp*Cp.
Uo(t) (battery terminal voltage) can be expressed by Eq. 2.3.
\({U}_{0}\left(t\right)={U}_{OCV}\left(t\right)–{U}_{p}\left(t\right)–{R}_{0}{I}_{0}\left(t\right)\) | (2.3) |
To determine the dynamic properties of the LTO (lithium titanate oxide) battery in different situations, UOCV, Ro and Cp model parameters can be expressed as in Eq. 2.4.
\(\begin{array}{c}{U}_{OCV}\left[SOC\right(t\left)\right]{\mid }_{T={T}_{1}}={a}_{1}SOC\left(t\right)+{a}_{2}\\ {R}_{0}\left[SOC\right(t\left)\right]{\mid }_{T={T}_{1}}={b}_{1}SOC\left(t\right)+{b}_{2}\\ {C}_{p}\left[SOC\right(t\left)\right]{\mid }_{T={T}_{1}}={c}_{1}SOC\left(t\right)+{c}_{2}\end{array}\) | (2.4) |
The coefficient values a1,a2,b1,b2 and c1,c2 can be obtained by linear interpolation based on the parameters of the characteristic points in the battery charge/discharge experiments.
2.3. Obtaining Experimental Data
The block diagram of the experimental setup established to obtain experimental data of the LTO battery and conduct charge-discharge experiments is as shown in Fig. 2.4.
Figure 2.4. Experimental Setup Block Diagram
In the experimental setup, battery discharge current, battery voltage, discharged capacity from the battery and battery temperature data were measured. These data are transferred to the computer environment via a programmable DC electronic power supply. The discharge capacity of the lithium-based battery will be estimated by running different machine learning models with the obtained battery data.
The picture of the implemented experimental setup is given in Fig. 2.5.
TDK-Lambda brand programmable power supply was used to charge the battery. With this power supply, the battery was charged with a constant charging current value of 1000 mA and the power supply was adjusted to stop the charging process at 2.8 Volts and 10mA. Gw İnstek 300 brand dc current probe was used to measure the discharge current value of the battery. With this current probe, battery current data is transferred to the programmable dc electronic load device. A K type thermocouple was used to measure the temperature values of the battery. Battery temperature data was transferred to the programmable dc electronic load device via a K-type thermocouple. Gw İnstek Pel 3111 brand programmable dc electronic load was used for the discharge process of the battery. By means of this electronic load, the battery was discharged at different current values and 1.5 volt battery voltage was set as the discharge termination voltage. Battery data was transferred to the computer environment with the multi-interface of the programmable dc electronic load device.
Through the established experimental setup, the min-max scaling method was applied to the discharge test data of the lithium-titanate battery. By scaling the values in the data set, the data to be used in machine learning models are scaled with a common scale in the range of 0–1, which is a more regular and easily understandable range, without distorting the differences between them. Thus, data values that are different from each other are represented in the range of 0–1, reducing the processing time of machine learning models and enabling the models to produce more accurate predictions.
Equation 2.5 is used for the Min-Max scaling method [14].
\({X}_{new}=({X}_{old}–{X}_{min})/({X}_{max}-{X}_{min})\) | (2.5) |
In this equality; Xnew : represents the new x data scaled in the data range, Xold : represents the previous value of any x data in the data range, Xmin : represents the smallest x value in the data range, Xmax : represents the largest x value in the data range. The technical specifications of the LTO battery used in this study are as in Table 2.1.
Table 2.1
Technical Data of the Battery Used
Item | General Parameter | Remark |
Rated Capacity | Typical | 500mAh | Standard discharge (1.0C ) after Standard charge |
Nominal Voltage | 2.4V | Mean Operation Voltage |
Standard charge | Constant Current 500mA (1C) end Voltage 2.8V 10mA cut-off | Charge time : Approx 1.5h |
Standard discharge | Constant current 500mA (1C) end voltage 1.5V | Max. Continuous Discharge Current 10A |
Fast charge | Constant Current 1000mA (2C) end Voltage 2.8V 10mA cut-off | Charge time : Approx 0.8h |
Temperature Range | Charge : 0 ~ 45℃ Discharge : -20 ~ 70℃ | 60 ± 25%R.H. Bare Cell |
2.4. Machine Learning Methods Used in Evaluation
Machine learning is the scientific study of the algorithms, statistical models that computer systems use to perform a specific task without being explicitly programmed. It is a branch of artificial intelligence that focuses on the development of algorithms and statistical models that can learn from and make predictions on data. Machine learning is used to teach machines how to use data more efficiently and to extract information from data using computational methods [15]. Machine learning algorithms are responsible for finding patterns in the data set and creating mathematical models of them. These models are then evaluated based on their predictive capacity for measures of variance in the data itself [16]. Machine learning has various application areas such as data mining, image processing, predictive analytics, handwriting and speech recognition, robotics and computer games, natural language processing, brain-machine interface, information technology, statistics, probability, artificial intelligence, neurobiology [15, 16].
In this study, machine learning methods described below were used.
-
Random Forest: Random forest (RF) regression is a supervised learning algorithm and was proposed by Leo Breiman [17]. It is a collection of decision trees trained using the method commonly known as bagging, which can perform both regression and classification tasks through the use of multiple decision trees. The purpose of the bagging method is to combine multiple decision trees in determining the final output rather than relying on individual decision trees. A better result is achieved with a combination of learning models by performing random row sampling and feature sampling from the dataset, which creates sample datasets for each model [31]. Figure 2.6 shows the working principle of RF regression.
RF regression produces many decision trees for regression and its output is calculated by averaging the outputs of all decision trees [18, 19]. A random sample is selected from the entire dataset using the bagging method. By creating a regression tree, the data that are not selected and left out of the bag are used as the test set [20]. A decision tree is a model that does not have any prior tree structure, and the structure of the tree depends on the complexity of the training data in the learning phase. The decision tree consists of two nodes: the decision node and the leaf node. Each sample of training data is evaluated by decision nodes and transferred to different nodes depending on the value of the sample's features. RF regression produces regression trees using training data X given by X = x1, x2, x3…xn. This method produces k outputs T1(x), T2(x), ......, Tk(x) corresponding to each tree. The final result is calculated by averaging all tree predictions with Eq. 2.6 [19] .
RF(X) = \(\frac{1}{k}\)*\(\sum _{k=1}^{k}\left({T}_{k}\left(X\right)\right)\) | (2.6) |
-
K-Nearest Neighbor: The K-nearest neighbor algorithm is a simple, supervised machine learning algorithm that can be used to solve both classification and regression problems [15, 31]. It is a machine learning method in which the class (learning set) of the data point to be subject to classification and its nearest neighbor (element) are determined and classified according to the k value (number of nearest neighbours). It is a machine learning method that classifies the data to be classified according to its close relationship with previous data. The k-nearest neighbor algorithm works by categorizing data by associating inputs with similar outputs. In a sense, the algorithm searches for similar examples in the training set when it encounters unknown data [21]. K-nearest neighbor algorithm is widely used in different fields such as fault diagnosis, power system protection and medical detection. It is easy to implement and understand [22]. It has the major disadvantage of slowing down significantly as the size of data in use grows.
-
Decision Tree: A graph that represents choices and their consequences in the form of a tree. Nodes in the graph represent an event or choice, and edges of the graph represent decision rules or conditions. Every tree consists of nodes and branches. Each node represents attributes in a group to be classified, and each branch represents a value the node can take [15]. Decision trees are used to classify classes by sorting them according to their parameter values. This method is suitable for small data sets but causes delay for large data sets [23].
-
Linear Regression: A type of supervised machine learning algorithm that calculates the linear relationship between a dependent variable and one or more independent features. When the number of independent features is one, it is known as univariate linear regression, and when there is more than one feature, it is known as multivariate linear regression [31]. Multivariate linear regression is created by generalizing linear regression by considering multiple independent variables and limiting the number of dependent variables to one [24]. The purpose of the algorithm is to find the best linear equation that can predict the value of the dependent variable based on the independent variables. The equation provides a straight line representing the relationship between the dependent and independent variables. The slope of the line indicates how much the dependent variable changes for a unit change in the independent variable(s). Linear regression is widely used in estimating energy production [25].
The working process of machine learning methods is shown in Fig. 2.7. The process starts with processing and training the data. After the model evaluation and selection is made, the parameter settings of the selected model are adjusted and finally the model predictions are tested.
The effectiveness and performance of the method used will be determined according to the accuracy of the prediction obtained from machine learning models.
In the performance evaluations of machine learning methods, Explanatory Coefficient (R2), Mean Squared Error (MSE), Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) values were used. Among these performance measures, R2 is the accuracy rate decision coefficient of the model, and a high value of this coefficient indicates that the prediction relationship is good. MSE, RMSE and MAE are error measures, and their convergence to zero indicates that the model performance is with the least error and the model shows high prediction performance. For example, if the RMSE value is equal to zero, it means that the designed model has a good performance [26]. Various error metrics given in Table 2.2 were used to compare the machine learning models used in this study.
Table 2.2
Error metrics used in performance evaluation of machine learning prediction models
Name | Formula |
Mean Square Error (MSE) | MSE =\(\frac{1}{N}*\sum _{İ=1}^{N}{\left({xi}_{rv}-{xi}_{ev}\right)}^{2}\) |
Root Mean Square Error (RMSE) | RMSE =\(\sqrt{\frac{1}{N}*\sum _{İ=1}^{N}{\left({xi}_{rv}-{xi}_{ev}\right)}^{2}}\) |
Mean Absolute Error (MAE) | MAE =\(∣\frac{1}{N}*\sum _{İ=1}^{N}{\left({xi}_{rv}-{xi}_{ev}\right)}^{2}∣\) |
The difference between the actual values in a data set and the estimated values of the data is the error value. Mean square error is a value found by taking the sum of the squares of the error values occurring in a data set and dividing the result by the total number of samples in the data set. MSE provides information on how much the predicted value differs from the actual value and is a measure of accuracy performance.
In these equations;
N : total number of samples in the data range, xirv : real value of any x data in the data range,
MSE : mean square error, xiev : estimated value of any x data in the data range,
RMSE : root mean error, MAE : mean absolute error, represent.
A Q-Q plot was used to visualize how well the best-performing Random Forest model predictions in terms of error metrics (MSE, RMSE, MAE) and R2 value fit the actual values. Q-Q plot [27] is a graphical method used to determine whether two data samples come from the same population [28, 32]. It shows the comparison of theoretical quantile values of a probability distribution with its actual values and is used to evaluate how well the probability distribution fits the normal distribution or how close it is to another distribution [28, 29]. These graphs show how well the model predictions fit the actual values and how close the error distributions are to a normal distribution [28]. Such Q-Q plots are often used to test the assumption of normality or to visualize how well predictions fit actual values [29]. If the points are tightly distributed along the line, the model's predictions are in good agreement with the actual values. When the points move away from the line, it means that the estimates tend to deviate from the actual values.