Forecast of Complex Financial Big Data Using Model Tree Optimized By Bilevel Evolution Strategy

doi:10.21203/rs.3.rs-606920/v1

Download PDF

Research Article

Forecast of Complex Financial Big Data Using Model Tree Optimized By Bilevel Evolution Strategy

https://doi.org/10.21203/rs.3.rs-606920/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

If a decision tree is constructed through a series of locally optimal solutions, such as the Greedy method, overfitting to the data is likely to occur. In order to avoid overfitting, many previous research have attempted to collectively optimize the structure of a decision tree by using evolutionary computation. However, if attributes of each split and their thresholds are searched simultaneously, the evaluation function becomes intermittent; thus, optimization methods assuming continuous distribution cannot be used. In this study, in order to enable efficient search assuming continuous distribution even for complicated data that contains a lot of noise and extraordinary values, such as financial time series data, the inner level search that optimizes each threshold value collectively given a specific attribute for each split in a model tree and the outer level search that optimizes the attributes of each split were performed by separate evolutionary computing. As a result, we obtained high prediction accuracy that far exceeded the performance of the conventional method.

Systems and Networking

Information Retrieval and Management

Evolutionary computing

Noisy function optimization

Financial market

Forecasting

Model tree

A decision tree that recursively learns splits from higher to lower nodes is generally constructed by a series of local judgments and often involves the problem of noise overfitting. To solve this, various proposals have been made to optimize all splits collectively using evolutionary computation. However, many previous studies searched for the best tree by applying the branches of a highly rated tree to another tree. In the case of data with a complicated structure, the split with a high fitness value at one node is peculiar to the data sample at that node; the fitness value does not improve even if this is applied to another data sample.

A method of searching all splits from one tree simultaneously without exchanging branches has also been proposed. However, because the number of combinations of features and their splitting points, or threshold values, is enormous, such a method is impractical for complicated data.

Bilevel GA, which performs feature selection and searches for the order and thresholds of selected features by separate evolutionary computation, has been proposed recently as an efficient method for collectively optimizing the overall structure of a tree. In particular, this method has shown good estimation accuracy using relatively small-scale data [1, 2, 3].

However, the evaluation function of the tree becomes intermittent when a feature used for the split of the upper node changes owing to the simultaneous search of the feature order and the threshold value. Therefore, it is not possible to use a search method that presupposes continuity. To address this, a GA that repeats individual selection, crossover, and mutation is used as the search method because it optimizes splits that become non-deterministic polynomial time (NP)-hard when complex data are used. In addition, when employing such a GA, a split that is effective for an area region divided by specific upper nodes can be applied to a region divided by other upper nodes. Therefore, the fitness value of the tree does not always improve by generations, and the best tree might not be searched.

In this study, we propose a method for searching splits on the premise of continuous distribution by searching for the optimal threshold under a specific feature and determining the optimal combination of features by separate evolutionary computation. In addition, we compare the computational time and prediction accuracy of this method with those using general machine learning and the results of previous studies using financial time series data as an example of complex real data.

Building a tree while avoiding overfitting

As mentioned in Sect. 2, when constructing a decision tree, there is a risk of falling into a locally optimal solution when using the greedy method, which recursively learns the splits from the upper to lower nodes. To avoid this, methods have been developed for determining a solution with multiple decision trees by ensemble learning [4, 5] and for optimizing splits using evolutionary computation [6].

Although both methods have shown higher accuracy than that using the greedy method, numerous attempts have been made to use evolutionary computation to find the optimal decision tree [7, 8, 9, 10, 11, 12] because interpretability, one of the greatest merits of decision trees, is lost in ensemble learning.

The method of constructing a decision tree using evolutionary computation can be divided into two main threads: the evolutionary induction of decision trees and the evolutionary design of decision-tree components. The former method is an approach that optimizes the overall structure of the tree as a whole whereby each individual in evolutionary computation is the decision tree itself. Similar approaches have been used in numerous previous studies [13, 14, 15, 16]. In the latter method, however, each individual is a component optimized and combined to search the optimal tree structure. In the case of complex data, the fitness value of the tree often does not improve when a split that is effective for the data sample created from splits by specific upper nodes is applied to that created from splits by other upper nodes [17, 18].

Attempts to collectively optimize the split of decision trees by genetic programming have been reported in the literature [19, 20, 21, 22]. However, genetic programming is limited when streamlining optimization calculations by searching under continuous distribution. Moreover, it becomes difficult to converge to a global optimal solution when searching a decision tree with a large number of splits.

Building a tree by evolutionary computation

When optimizing the splits of the entire tree collectively by evolutionary computation, the fitness value of the tree changes non-linearly when the features of the splits of the upper node are changed. It is not possible to use evolutionary computation to generate individuals based on continuous distribution.

Optimization methods assuming continuous distribution include the steepest gradient method, Adam optimization, Newton’s method, and Bayesian optimization [23, 24]. Such methods include one-point search, which has a higher probability to result in a locally optimal solution when handling complex data with a large number of dimensions. Under such conditions, approaches considered to be more suitable include the stochastic search method [25], which is a black box optimization method using multipoint search, and real-valued evolutionary computation. Typical methods of real-valued evolutionary computation include real-valued GA [26], evolution strategy [27], differential evolution [28], and particle swarm optimization [29].

Real-valued GA showed high performance in the evaluation function with problems such as bad scale, intervariable dependency, and multimodality through minimal generation gap [30], unimodal distribution crossover (UNDX) [31], and real-coded ensemble crossover star (REXstar) [32]. However, none of these methods can handle multidimensional complex data that with large noise owing to their difficulty in adjusting the step size, population size, number of offspring, and other factors.

The covariance matrix adaptation evolution strategy (CMA-ES) [33] and distance-weighted exponential natural evolution strategies (DX-NES) [34] provide examples of the evolution strategy in which the need to adjust the step size, population size, and number of offspring is relatively small, which enables even a non-linear discontinuous evaluation function to be searched. However, the search performance of DX-NES deteriorates significantly when this strategy is applied to complex data with large noise [35].

CMA-ES, which is relatively resistant to noise, is considered to be desirable for the evaluation function that changes significantly by changing the threshold of the split in a decision tree. Of the many variations of CMA-ES proposed, [36] reported the best performance for data in noisy and uncertain environments, and the [37] model is suitable for searching decision trees.

Data

To compare the performance of a globally optimized model tree with that using general machine learning or a bilevel GA in a previous research, we evaluate the prediction accuracy using multiple benchmark data. Because the model tree constructed in this study aims to unravel a complicated data structure and extract a universal pattern for population, the data used for accuracy evaluation should also be applicable to complicated and noisy conditions.

1. UC Irvine Machine Learning Depository

From the UC Irvine Machine Learning Depository, we select relatively simple classification problems that have been used in many previous studies in addition to relatively complex regression problems meeting the following conditions.

Explanatory variable type: continuous variable
Data type: time series
Number of explanatory variables: more than 10
Number of samples: about 10,000 or more
Low sparseness

Table 1 shows a summary of the data used in this study.

Table 1 Summary of different works pertaining to face and speech fusion

2. Financial market time series data

In addition, time series data of financial markets are used as actual data of complex systems. Financial market data contain many one-off factors and noise and serve as representative data for which high prediction accuracy cannot be obtained even by machine learning. This study uses as an objective variable the intraday return for the TOPIX Futures nearby month from the opening price at 08:45 to the closing price at 15:15. There are many other stock indices which represent the global financial markets, such as S&P 500, Dow Jones Industrial Average, Euro Stoxx 50 and so on; however, most of them have been on a consistent upward trend since 2009. Although TOPIX Futures have a smaller trading volume than other indices, they have no long-term trends and are a better example of complex system data. As explanatory variables, we select indicators that represent the financial markets of the United States and Japan. Unlike the objective variable, the explanatory variables do not necessarily have to be the price of the product that can actually be traded, although they need to reflect the movement of the entire financial market from a different perspective. For this reason, the change in the closing value of the stock index, exchange rate, and interest rate shown in Table 2 are used.

The financial data used to predict the TOPIX Futures in next business day include 4,901 intraday returns between January 04, 2001, and December 30, 2020 as an objective variable.

Table 2. Financial data used to predict TOPIX Futures in next business day

Methods

The purpose of this study is to obtain high prediction accuracy by a globally optimized model tree. For this purpose, a model tree is constructed by splitting a sample space with certain splits of a tree, evaluating the versatility of the pattern recognition model at each final node of the tree, and searching the best splits so the average versatility of the pattern recognition models in all final nodes becomes highest. The splits are not recursively searched individually; instead, they are simultaneously searched from large-scale combination optimization.

1. Bilevel GA

When optimizing the overall structure of a decision tree collectively, the fitness value of the tree changes discontinuously when the features used for the splits of a certain node change; therefore, the search method based on the continuity of functions cannot be used. In this study, we first identify the features to be used for each split and their positions in a tree randomly, and we then optimize the threshold for each feature by inner level search. Finally, we search the best features and their positions in a tree by outer level search. This method, outlined in Fig. 1, is referred to as a bilevel GA.

In the inner level search, the features used for each split and their positions in a tree are given, which enables the threshold value of each feature to be searched by a method using continuous distribution. All of the explanatory variables in this study are continuous; therefore, the threshold value change continuously. Because the data sample under a certain split changes discontinuously when the threshold values change, the evaluation function of the tree becomes discontinuous. However, data samples under certain splits change gradually one by one as the threshold changes, which enables the use of an evolutionary computation method that generates individuals based on continuous distribution.

In previous bilevel GA research, the position of the feature was also changed at the inner level, and the search method that presupposes the continuity of the function could not be used. Conversely, the inner level search in this study can be performed efficiently using continuous distribution.

In the outer level search, multiple trees with different features and their positions are generated randomly, and the optimal tree is searched by repeating the selection, crossover, and mutation. When evaluating each individual tree at the outer level, that in which all thresholds are already optimized according to the inner level is used.

2. Inner level optimization

CMA-ES, an evolutionary computation that performs multipoint search based on normal distribution, is used in this study to search the threshold value at the inner level search. This method updates the covariance matrix based on the evolutionary path that accumulates the previous solutions and generates offspring in the direction of movement of the solutions. It is suitable for the inner level search because it can search non-continuous evaluation functions by assuming normal distribution. In addition, CMA-ES is more resistant to noise than other efficient search methods that can handle discontinuous evaluation functions.

However, for data with particularly high levels of complexity such as financial time series data, the evaluation function becomes steep and multimodal, and the search for a global optimal solution requires a large amount of calculation even when using CMA-ES.

In general CMA-ES applications, the degree of freedom is ; the time complexity is O(n²); and the spatial complexity is O(n3) for the number of the dimension O(n²) of the evaluation function. By limiting the variance–covariance matrix C^(t+1) used for individual generation to diagonal components, the degree of freedom becomes n, and the amount of time and spatial complexity is reduced to O(n):

(1)

where c_cov ∈[0,1] is the learning rate of diagonal element updates; ∈[0,1] is the weighting coefficient of the evolution path ; is the i-th most rated of the z^(t+1); and is the i-th component of .

3. Outer level optimization

In the outer level search, the features used for each split and their positions are optimized. The parameters to be optimized are discrete values. In a decision tree, if the features used in a certain split are changed, the structure below will change significantly. Therefore, in the outer level search, the search method assuming continuous distribution cannot be used. For this reason, we use a GA that searches for individuals with high fitness values by repeating the selection, selection, crossover, and mutation because the shape of the evaluation function is not the issue.

As many trees in which thresholds of all splits are already optimized according to the inner level search as the population size are randomly generated as the initial population. A certain percentage among them (preservation rate) is left for the next generation in the order of the fitness value of the tree, and a pair is randomly created from the next certain percentage (crossover rate) group. Branches at random positions are swapped in a pair, and the rest of the population will not be passed on to the next generation and will instead be replaced by trees with new features and their positions. This process is regarded as one generation, and the generation change is repeated. In this study, we use 20 population sizes, a 20% preservation rate, a 20% crossover rate, and 200 generations, as shown by the outer level search in Fig. 1.

4. Model evaluation method

In the classification problem, we use the weighted accuracy rate of each final node as the fitness value of the individual (model tree) generated by the bilevel GA. In the regression problem, we use the weighted prediction accuracy by linear regression analysis at the final node. The prediction accuracy is the average R^2 obtained by the five-fold cross-validation method. The linear regression model uses lasso regression with a regularization parameter of 0.1.

Large-scale combination optimization is required to select the optimal model tree when using such an evaluation method. Therefore, the Oakbridge-CX supercomputer system at the Information Technology Center, University of Tokyo, iss used for the calculation.

5. Accuracy comparison of each method

To evaluate the prediction accuracy of each method, the results obtained from following approaches are compared: linear discriminant analysis; logistic regression analysis; support vector machine; neural network; classification and regression tree (CART); random forest; XGBoost; a decision tree constructed by bilevel GA proposed in [1], hereinafter referred to as bilevel GA by related work; and a decision tree constructed by bilevel GA proposed in this study, hereinafter referred to as bilevel GA by this study. For comparing the prediction accuracy used for comparison, the average classification accuracy rate of the results of five verifications is used according to the five-fold cross-validation method

For regression problems, the following methods are used: multiple regression analysis, lasso regression analysis, partial minimum error, neural network, XGBoost, bilevel GA by related work, and a model tree constructed by bilevel GA proposed in this study, hereinafter also referred to as bilevel GA by this study. For the prediction accuracy used for comparison, the average R2 according to the five-fold cross-validation method is used.

For problems using financial time series data, the following methods are used: multiple regression analysis, lasso regression analysis, partial minimum error, neural network, XGBoost, bilevel GA by related work, and bilevel GA by this study. For the prediction accuracy used for comparison, the average R2 according to the five-fold cross-validation method is used.

When using the UC Irvine data in a relatively simple classification problem, bilevel GA by this study showed high prediction accuracy, as did the other methods. In regression problems with a large numbers of data, the prediction accuracy of bilevel GA by this study exceeded that of other methods. Moreover, this method showed a much higher estimation accuracy than that of other methods when using financial data.

Prediction accuracy

For the classification problems of UC Irvine, bilevel GA by this study showed high prediction accuracy, as did the other methods (Table 3). Because the other methods also showed relatively high prediction accuracy, the pattern was easily recognized.

For the regression problems of UC Irvine, bilevel GA by this study showed better prediction accuracy than other methods (Table 4). Some of other methods showed relatively low prediction accuracy because the patterns were difficult to recognize in some data, although the bilevel GA by this study was high even for such complicated problems.

Moreover, in the financial time series data, this method showed a prediction accuracy that greatly exceeded that of other methods (Table 5). Although many studies have been conducted on predicting financial market prices by machine learning, no clear conclusion has been reached. However, bilevel GA by this study showed explicitly higher accuracy than that using general machine learning.

In addition, bilevel GA by this study uses elitism for the outer level search; thus, the fitness of trees improves by generations. Figure 2 shows the transition of the fitness of a best model tree constructed by bilevel GA by this study in Table 5 for each generation.

Table 3 Summary of different works pertaining to face and speech fusion

Table 4 Summary of different works pertaining to face and speech fusion

Table 5 Summary of different works pertaining to face and speech fusion

Furthermore, in the model tree optimized at the outer level, which is the best tree selected by using the bilevel GA by this study, we confirmed the fitness of the tree for each generation at the inner level search (Fig. 3). Because the best solution is not preserved at the inner level search, the fitness of the tree does not always increase with each generation. As indicated in Fig. 3, although the shape of the evaluation function exhibits many irregularities, a global optimum solution was obtained. Therefore, CMA-ES has succeeded in searching a global optimum solution even though the fitness fluctuated owing to the influences of noise and extraordinary values.

Impact of crossover

In this study, we used 20 population sizes, a 20% preservation rate, a 20% crossover rate, and 200 alternations of generations for outer level optimization, as discussed in Sect. 3.2.3.

By using the best tree in Table 5, the outer level search was performed using different crossing rates. The optimal fitness of the tree decreased as the crossover rate increased. Moreover, the fitness did not change significantly when the crossover rate was reduced (Table 6), and the change in the prediction accuracy was small even when values other than the crossover rate changed.

As discussed in Sect. 1, the split at the lower node is effective only for the subsample generated by dividing the total sample at the upper level, which explains why the relatively high crossover rate deteriorated the fitness.

Table 6 Impact on performance for each crossover rate

Decision trees have been widely used for data analysis because of their ease of interpretation. However, if the Greedy method, which recursively searches for split from the upper node to the lower node, is used, overfitting is likely to occur because a tree is constructed with a series of locally optimal solutions. Many previous research has attempted to collectively optimize the structure of a decision tree using evolutionary computation; however, many of them searched attributes of each split and their thresholds simultaneously; thus, optimization methods assuming continuous distribution cannot be used. In this study, we proposed bilevel GA that improved the problems in the previous research. As a result, we found that it surpassed the conventional methods in terms of performance from relatively simple problems to complex problems.

We also found that it was still not possible to derive sufficiently high prediction accuracy even by using the proposed bilevel GA, when data with increased complexity, such as financial time series data, was used. The reason for this is thought to be that the tree was constructed based on the fitness of all final nodes of the tree. We may obtain a better prediction result if a tree is constructed so some final nodes with an extremely low fitness are excluded from the evaluation of the tree and if data classified as the final nodes excluded from the evaluation are not subject to prediction.

Also, in this study, we used a binary tree; however, the space in which pattern recognition model perform well is not necessarily all on one side from a certain threshold of the entire sample data. Therefore, it is desirable to use a multi-way tree when dividing the space by a tree. We want to make these issues for the future task.

Authors’ contributions

JS took on the main role performed the literature review, implemented the proposed model, conducted the experiments and wrote manuscript.

Author details

1 Faculty of Computer Science, Tokyo University of Technology, Tokyo, Japan.

Acknowledgements

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

The authors consent for publication.

Availability of data and materials

All data used in this study are publicly available and accessible in the below sources.

Benchmark data : http://archive.ics.uci.edu/ml/index.php

Financial data : https://finance.yahoo.com/

Ethics approval and consent to participate

The authors Ethics approval and consent to participate.

Funding

The authors declare that they have no fundin

Adibi MA. Single and multiple outputs decision tree classification using bi-level discrete-continues genetic algorithm. Pattern Recogn Lett. 2019;128:190–6. https://doi.org/10.1016/j.patrec.2019.09.001.
Dhebar Y, Deb K. Interpretable rule discovery through bilevel optimization of split-rules of nonlinear decision trees for classification problems. IEEE Transactions on Cybernetics, 2020; 1–12. https://doi.org/10.1109/TCYB.2020.3033003.
Hanh PTH, Thanh PD, Binh HTT. Evolutionary algorithm and multifactorial evolutionary algorithm on clustered shortest-path tree problem. Inf Sci. 2021;553:280–304. https://doi.org/10.1016/j.ins.2020.10.024.
Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: Data mining, inference, and prediction. 2nd ed.: Springer Series in Statistics; 2009.
Seni G, Elder J. Ensemble methods in data mining: Improving accuracy through combining predictions. Morgan & Claypool Publishers; 2010.
Freitas AA. Data mining and knowledge discovery with evolutionary algorithms. New York: Springer-Verlag New York, Inc.; 2002.
Safavian SR, Landgrebe D. A survey of decision tree classifier methodology. IEEE Transactions on Systems Man Cybernetics. 1991;21(3):660–74. https://doi.org/10.1109/21.97458.
Murthy SK. Automatic construction of decision trees from data: A multi-disciplinary survey. Data Min Knowl Disc. 1998;2(4):345–89. https://doi.org/10.1023/A:1009744630224.
Freitas AA. A critical review of multi-objective optimization in data mining: A position paper. ACM SIGKDD Explorations Newsl. 2004;6(2):77–86. https://doi.org/10.1145/1046456.1046467.
Rokach L, Maimon O. Top-down induction of decision trees classifiers—A survey. IEEE Transactions on Systems Man Cybernetics Part C. 2005;35(4):476–87. https://doi.org/10.1109/TSMCC.2004.843247.
Espejo PG, Ventura S, Herrera F. A survey on the application of genetic programming to classification. IEEE Transactions on Systems Man Cybernetics Part C. 2010;40(2):121–44. https://doi.org/10.1109/TSMCC.2009.2033566.
Barros RC, Basgalupp MP, de Carvalho ACPLF, Freitas AA. A survey of evolutionary algorithms for decision tree induction. IEEE Transactions on Systems Man Cybernetics Part C. 2012;42(3):291–312. https://doi.org/10.1109/TSMCC.2011.2157494.
Papagelis A, Kalles D. Breeding decision trees using evolutionary techniques, Eighteenth International Conference on Machine Learning. Morgan Kaufmann Publishers, Inc. 2001. p. 393–400.
Fu Z, Golden BL, Lele S, Raghavan S, Wasil EA. A genetic algorithm-based approach for building accurate decision trees. INFORMS Journal on Computing. 2003;15(1):3–22. https://doi.org/10.1287/ijoc.15.1.3.15152.
Shirasaka M, Zhao Q, Hammami O, Kuroda K, Saito K. Automatic design of binary decision trees based on genetic programming, Second Asia-Pacific Conference on Simulated Evolution and Learning. 1998.
Zhao Q, Shirasaka M. A study on evolutionary design of binary decision trees, IEEE Congress on Evolutionary Computation. 1999. p. 1988–1993.
Tanigawa T, Zhao Q. A study on efficient generation of decision trees using genetic programming, Genetic and Evolutionary Computation Conference. 2000. p. 1047–1052.
Aitkenhead MJ. A co-evolving decision tree classification method. Expert Syst Appl. 2008;34(1):18–25. https://doi.org/10.1016/j.eswa.2006.08.008.
Burgess CJ, Lefley M. Can genetic programming improve software effort estimation? a comparative evaluation. Inf Softw Technol. 2001;43(14):863–73. https://doi.org/10.1016/S0950-5849(01)00192-6.
DeLisle RK, Dixon SL. Induction of decision trees via evolutionary programming. J Chem Inf Comput Sci. 2004;44(3):862–70. https://doi.org/10.1021/ci034188s.
Zhao H. A multi-objective genetic programming approach to developing pareto optimal decision trees. Decis Support Syst. 2007;43(3):809–26. https://doi.org/10.1016/j.dss.2006.12.011.
To C, Pham T. Analysis of cardiac imaging data using decision tree based parallel genetic programming, 6th International Symposium on Image and Signal Processing and Analysis. 2009. p. 317–320.
Shahriari B, Swersky K, Wang Z, Adams RP, de Freitas N. Taking the human out of the loop: A review of bayesian optimization. Proceedings of the IEEE. 2016;104(1):148–175. https://doi.org/10.1109/JPROC.2015.2494218.
Adams RP, Stegle O. Gaussian process product models for nonparametric nonstationarity, International Conference on Machine Learning. 2008. p. 1–8.
Larraga RE, Lozano JA, Pena JM. A review of cooperation between evolutionary computation and probabilistic graphical models, Second Symposium on Artificial Intelligence CIMAF. 1999. p. 314–324.
Davis L. The handbook of genetic algorithms. New York: Van Nostrand Reinhold; 1990.
Beyer HG, Schwefel HP. Evolution strategies: A comprehensive introduction. Nat Comput. 2002;1(1):3–52. https://doi.org/10.1023/A:1015059928466.
Storn R, Price K. Differential evolution—A simple and efficient heuristic for global optimization over continuous spaces. J Global Optim. 1997;11(4):341–59. https://doi.org/10.1023/A:1008202821328.
Kennedy J, Eberhart RC. Particle swarm optimization. IEEE international joint conference on neural networks. 1995. p. 1942–1948.
Sato H, Ono I, Kobayashi S. A new generation alternation model of genetic algorithms and its assessment. Journal of Japanese Society for Artificial Intelligence. 1997. p. 734–744.
Ono I, Kobayashi S, Yoshida K. Optimal lens design by real-coded genetic algorithms using UNDX. Comput Methods Appl Mech Eng. 2000;186(2–4):483–97. https://doi.org/10.1016/S0045-7825(99)00398-9.
Kobayashi S. The frontiers of real-coded genetic algorithms. Journal of Japanese Society for Artificial Intelligence. 2009;24(1):128–43.
Hansen N, Ostermeier A. Completely derandomized self-adaptation in evolution strategies. Evol Comput. 2001;9(2):159–95. https://doi.org/10.1162/106365601750190398.
Fukushima N, Nagata Y, Kobayashi S, Ono I. Proposal of distance-weighted exponential natural evolution strategies, IEEE congress on evolutionary computing. 2012. p. 164–170.
Masutomi K, Nagata Y, Ono I. A novel evolution strategy for noisy function optimization. Transaction of the Japanese Society for Evolutionary Computation. 2015;6(1):1–12. https://doi.org/10.11394/tjpnsec.6.1.
Hansen N, Niederberger ASP, Guzzella L, Koumoutsakos P. A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion. IEEE Trans Evol Comput. 2009;13(1):180–97. https://doi.org/10.1109/TEVC.2008.924423.
Richter SN, Schoen MG, Tauritz DR. Evolving mean-update selection methods for CMA-ES, Evolutionary Computation Conference. 2019. p. 1513–1517.

Download PDF

Editorial decision: Major Revision
25 Jul, 2021
Review #4 received at journal
24 Jul, 2021
Review #3 received at journal
24 Jul, 2021
Reviewer #4 agreed at journal
15 Jul, 2021
Review #2 received at journal
14 Jul, 2021
Reviewer #3 agreed at journal
13 Jul, 2021
Reviewer #2 agreed at journal
10 Jul, 2021
Review #1 received at journal
10 Jul, 2021
Reviews received at journal
10 Jul, 2021
Reviewers invited by journal
10 Jul, 2021
Reviewer #1 agreed at journal
09 Jul, 2021
Editor assigned by journal
14 Jun, 2021
Submission checks completed at journal
14 Jun, 2021
Editor invited by journal
14 Jun, 2021
First submitted to journal
08 Jun, 2021

You are reading this latest preprint version

Forecast of Complex Financial Big Data Using Model Tree Optimized By Bilevel Evolution Strategy

Status:

Version 1

Abstract

Figures

Introduction

Related Work

Building a tree while avoiding overfitting

Building a tree by evolutionary computation

Data And Methods

Data

1. UC Irvine Machine Learning Depository

2. Financial market time series data

Methods

1. Bilevel GA

2. Inner level optimization

3. Outer level optimization

4. Model evaluation method

5. Accuracy comparison of each method

Results And Discussion

Prediction accuracy

Impact of crossover

Conclusions

Declarations

References

Status:

Version 1