Portfolio optimization deals with selecting the best portfolios, i.e., those that maximize the wealth resulting from an investment (Köksalan & Şakar 2016; Masmoudi & Abdelaziz 2018). In the 1950s, Markowitz proposed a way to quantify the return and risk of a security and introduced the mathematical mean-variance (MV) model (Markowitz 1952; Markowitz 1959), the foundation of modern portfolio theory, to allocate a given amount of funds among investment alternatives by trading off their returns and risks. The MV model is a bi-objective optimization problem that can be solved by mathematical programming methods.
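To make the bi-objective structure concrete, the following sketch (our own illustration, not taken from any cited work; the asset data, candidate-generation scheme and trade-off grid are all hypothetical) approximates the MV efficient frontier by weighted-sum scalarization over randomly generated feasible portfolios:

```python
# Illustrative sketch only: approximating the MV efficient frontier by
# weighted-sum scalarization. The expected returns, covariance matrix and
# random candidate-generation scheme are hypothetical.
import numpy as np

def mv_objectives(w, mu, sigma):
    """Return (risk, return): portfolio variance w'Σw and expected return μ'w."""
    return float(w @ sigma @ w), float(mu @ w)

def weighted_sum_frontier(mu, sigma, n_points=11, n_candidates=2000, seed=0):
    """Sweep λ in min λ·risk − (1−λ)·return over random feasible portfolios
    (long-only weights summing to 1) and keep each scalarized optimum."""
    rng = np.random.default_rng(seed)
    candidates = rng.dirichlet(np.ones(len(mu)), size=n_candidates)
    frontier = []
    for lam in np.linspace(0.0, 1.0, n_points):
        scores = lam * np.einsum("ij,jk,ik->i", candidates, sigma, candidates) \
                 - (1 - lam) * candidates @ mu
        frontier.append(candidates[int(np.argmin(scores))])
    return frontier

mu = np.array([0.08, 0.12, 0.10])              # hypothetical expected returns
sigma = np.array([[0.04, 0.01, 0.00],
                  [0.01, 0.09, 0.02],
                  [0.00, 0.02, 0.06]])         # hypothetical covariance matrix
frontier = weighted_sum_frontier(mu, sigma)
risks, rets = zip(*(mv_objectives(w, mu, sigma) for w in frontier))
```

Each value of λ yields one trade-off point; λ = 1 recovers the minimum-risk portfolio and λ = 0 the maximum-return one.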
After the seminal work of Markowitz, portfolio optimization has remained a popular research topic in finance. Different portfolio optimization models have been proposed, most of which extend the classical MV model with additional or alternative objectives, e.g., skewness (Li et al. 2010), volatility in the portfolios (Ehrgott et al. 2004), semivariance (Markowitz 1959), mean absolute deviation (Konno & Yamazaki 1991), minimax (Young 1998), value at risk (Jorion 1996; Cui et al. 2018) and conditional value at risk (Rockafellar & Uryasev 2000), or with additional constraints, e.g., boundaries (Speranza 1996), cardinality (Hardoroudi et al. 2017), transaction costs (Paiva et al. 2019), and transaction lots (Golmakani & Fazel 2011). For comprehensive reviews of the MV model and its extensions, the reader is referred to Metaxiotis and Liagkouras (2012), Kolm et al. (2014), and Kalayci et al. (2019).
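As a hedged illustration of two of the alternative risk measures named above, the following sketch computes historical value at risk (VaR) and conditional value at risk (CVaR) from a sample of portfolio returns; the return sample and the 95% level are made up for illustration:

```python
# Hedged illustration of two alternative risk measures: historical value at
# risk (VaR) and conditional value at risk (CVaR, expected shortfall).
# The return sample and the 95% confidence level are hypothetical.
import numpy as np

def value_at_risk(returns, alpha=0.95):
    """Historical VaR: the loss threshold exceeded with probability 1 - alpha."""
    return float(-np.quantile(returns, 1.0 - alpha))

def conditional_value_at_risk(returns, alpha=0.95):
    """CVaR: the mean loss in the worst (1 - alpha) tail of the distribution."""
    losses = -np.asarray(returns, dtype=float)
    var = value_at_risk(returns, alpha)
    return float(losses[losses >= var].mean())

rets = np.array([0.02, 0.01, -0.03, 0.015, -0.08,
                 0.005, -0.01, 0.03, -0.05, 0.01])  # hypothetical returns
var_95 = value_at_risk(rets)
cvar_95 = conditional_value_at_risk(rets)
```

By construction CVaR is never smaller than VaR, which is one reason it is favored as a coherent risk measure in the Rockafellar and Uryasev line of work.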
As additional objectives and constraints are included and the problem size in real applications grows, mathematical programming and other exact methods have proven inefficient at obtaining the Pareto optimal front of the problem (Ertenlice & Kalayci 2018; Altinoz & Altinoz 2019), and researchers have therefore paid particular attention to the development of approximation methods. Owing to their inherent parallel computation ability, evolutionary algorithms (EAs) have shown high performance in solving very complex multiobjective optimization problems; consequently, they have become the method of choice for portfolio optimization problems with two or more objectives (Kalayci et al. 2019). These methods include classical algorithms such as the genetic algorithm (GA) as well as more recent swarm-based algorithms inspired by the collective behaviors of animals living in herds. Representative EAs for solving the MV model and its extensions are briefly summarized as follows. The GA was the first EA used to solve portfolio optimization (Metaxiotis & Liagkouras 2012); afterwards, well-known EAs for multiobjective optimization problems were adapted for portfolio optimization, e.g., the vector evaluated genetic algorithm (VEGA) for portfolio optimization with additional cardinality, floor and round-lot constraints (Skolpadungket et al. 2007); the niched Pareto genetic algorithm II (NPGA-II) for cardinality-constrained portfolio optimization models (Anagnostopoulos & Mamanis 2011b); the nondominated sorting genetic algorithm II (NSGA-II) for portfolio optimization models with various risk measures (Anagnostopoulos & Mamanis 2011a); the strength Pareto evolutionary algorithm 2 (SPEA2) for classical MV models (García et al. 2011); the Pareto archived evolution strategy (PAES) for MV models extended by four constraints (cardinality, quantity, preassignment and round lot) (Lwin et al. 2014); the multiobjective evolutionary algorithm based on decomposition (MOEA/D) for classical MV models (He & Aranha 2020); and the Pareto envelope-based selection algorithm (PESA) for portfolio optimization models with three objectives (risk, return and the number of securities) (Anagnostopoulos & Mamanis 2010).
Recently, an increasing number of swarm-based algorithms have been adopted, e.g., particle swarm optimization (PSO) for robust multiobjective portfolio models with higher moments (skewness and kurtosis) (Chen & Zhou 2018); ant colony optimization (ACO) for MV models with additional cardinality constraints (Deng & Lin 2010); artificial bee colony (ABC) for fuzzy portfolio selection models (Gao et al. 2019); brain storm optimization (BSO) for portfolio optimization problems considering transaction fees with no short sales (Niu et al. 2016); bacterial foraging optimization (BFO) for portfolio optimization models with practical constraints such as minimum buy-in thresholds, maximum limits and cardinality (Mishra et al. 2014); the firefly algorithm (FA) for MV models with additional cardinality and bounding constraints (Tuba & Bacanin 2014); the bat algorithm (BA) for cardinality-constrained portfolio optimization (Kamili & Riffi 2016); the fireworks algorithm (FWA) for MV models extended by additional constraints of cardinality, boundaries, transaction lots, etc. (Bacanin & Tuba 2015); cat swarm optimization (CSO) for cardinality-constrained portfolio optimization (Kamili & Riffi 2015); krill herd (KH) for portfolio optimization models with additional constraints of cardinality, boundaries, transaction lots, etc. (Tuba et al. 2014); cuckoo search (CS) for the MV model and its approximation model with cardinality and boundary constraints (El-Bizri & Mansour 2017); and invasive weed optimization (IWO) for MV-based models incorporating the P/E ratio and market experts' recommendation criteria (Pouya et al. 2016).
With the different EA approaches, a set of nondominated solutions that are close to the Pareto optimal front of the portfolio optimization problem and are evenly distributed over it can be produced. However, confronted with such a large number of portfolios, an investor is usually at a loss as to how to select a subset, or the single one, that best serves his/her interests (Köksalan & Phelps 2007). Thus, to obtain an investor’s most satisfactory portfolio, an evolutionary multiobjective optimization (EMO) method that can incorporate the investor’s preference information into the solution process is appropriate (Köksalan & Şakar 2016). There are three principal methods for incorporating a decision maker’s (DM's) preference information into the solution process (Branke et al. 2008): the a priori, a posteriori and interactive methods, in which the DM articulates his/her preferences before, after and during the solution process, respectively. In the a priori method, the DM gives his/her global preferences in advance, which is very difficult when he/she knows little about the problem (Battiti & Passerini 2010; Guo et al. 2020). The a posteriori method requires the DM to make a selection according to his/her true preferences from a large number of Pareto optimal points. However, this is often impractical because the human mind can handle only a small number of pieces of information simultaneously (Cruz-Reyes et al. 2014; Tomczyk & Kadzinski 2020a). In contrast, the interactive method allows users to learn from the solution process and adjust their preferences progressively, which makes it more suitable for capturing a DM’s preferences (Branke et al. 2008; Cruz-Reyes et al. 2020). Another advantage of the interactive method is that it can greatly reduce computational time, since it focuses the search on a particular region of interest on the Pareto optimal front of the optimization problem (Ojalehto et al. 2016; Ruiz et al. 2020).
An interactive method has three essential aspects (Xin et al. 2018): a search engine, preference information and a preference model. The search engine produces a set of nondominated solutions to a multiobjective optimization problem. The preference information is articulated by the DM in light of the set of solutions produced. The preference model is built from the DM’s preference information and is used to guide the search engine. Many of the aforementioned EAs have been used as search engines due to their excellence in parallel computation (Deb 2020; Meignan et al. 2015). The DM’s preference information can be given in quantitative or qualitative form. Reference points (Deb et al. 2006; Filatovas et al. 2020), reference directions (Li et al. 2017), trade-offs (Branke et al. 2001), weights (Ruiz et al. 2009), preference regions (Hu et al. 2017), quantitative evaluations (Li et al. 2019), etc., are quantitative forms. Extreme point selection (Fowler et al. 2010; Koksalan & Karahan 2010) and pairwise comparison (Phelps & Köksalan 2003; Battiti & Passerini 2010; Branke et al. 2015; Branke et al. 2016) are examples of qualitative forms. Because of the great cognitive effort required, preference information articulated by a DM in quantitative form may contain a high percentage of noise and may thus be counterproductive in aiding decisions (Wang et al. 2015; Branke et al. 2016). In contrast, preference information in qualitative form, especially in the form of holistic pairwise comparisons, requires much less cognitive effort from the DM and has hence become prevalent (Ciomek et al. 2017; Tomczyk & Kadzinski 2020b). There are two main types of preference models used in interactive methods, utility function-based and outranking-based models, which correspond to the American and European schools of thought on multicriteria decision making (MCDM), respectively (Braun et al. 2017).
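The interplay of the three aspects can be sketched as a generic interaction loop (a schematic of our own for illustration, not a published algorithm; the function names are placeholders):

```python
# Schematic sketch of the generic interactive loop built from the three
# aspects above: a search engine produces solutions, the DM supplies
# preference information about them, and a preference model built from that
# information guides the next round of search. Names are placeholders.
def interactive_optimize(search_engine, elicit_preferences, update_model,
                         model, n_interactions=5):
    """Alternate between guided search and preference elicitation."""
    solutions = search_engine(model)              # initial nondominated set
    for _ in range(n_interactions):
        feedback = elicit_preferences(solutions)  # e.g., pairwise comparisons
        model = update_model(model, feedback)     # refine the preference model
        solutions = search_engine(model)          # search biased by the model
    return solutions, model
```

In this framing, any of the EAs cited above can play the role of the search engine, and the preference model can be either utility function-based or outranking-based.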
With the utility function-based preference model, the Pareto optimal solutions are ordered in a complete ranking that represents the DM’s preferences. The utility functions include achievement scalarizing functions (ASFs) (Ruiz et al. 2015; Luque et al. 2020), linear functions (Phelps & Köksalan 2003), polynomial functions (Deb et al. 2010), general polynomial functions (Mukhlisullina et al. 2013), Tchebycheff functions (Ozbey & Karwan 2014) and additive functions (Branke et al. 2015; Branke et al. 2016). The outranking-based preference model is expressed in a relational form that enables solutions to be compared against reference profiles from each category (Doumpos et al. 2009). This model has been widely used in interactive methods (Govindan & Jepsen 2016; Cruz-Reyes et al. 2020).
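For instance, a reference-point-based ASF in the spirit of the works cited above can be sketched as follows; the weights, reference point and candidate objective vectors are hypothetical, and both objectives are treated as minimized (risk, negated return):

```python
# Hedged sketch of a reference-point-based achievement scalarizing function
# (ASF). The weights, reference point and objective vectors are hypothetical.
import numpy as np

def asf(f, q, lam, rho=1e-4):
    """ASF for minimization: smaller values mean the solution better attains
    the DM's reference point q along the weighted directions lam. The small
    rho-augmentation term breaks ties in favor of Pareto optimal points."""
    d = lam * (np.asarray(f, dtype=float) - np.asarray(q, dtype=float))
    return float(d.max() + rho * d.sum())

# Objective vectors (risk, -return) of three hypothetical portfolios:
solutions = [np.array([0.30, -0.08]),
             np.array([0.10, -0.05]),
             np.array([0.18, -0.07])]
q = np.array([0.15, -0.07])   # DM's aspiration levels (reference point)
lam = np.array([1.0, 1.0])
best = min(solutions, key=lambda f: asf(f, q, lam))
```

Minimizing the ASF over the nondominated set projects the DM's reference point onto the Pareto front, which is how the reference-point-based interactive methods mentioned later use it.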
The application of interactive methods to portfolio optimization is a challenging task due to the complexity of the various aspects of the optimization problem that must be considered simultaneously, and published papers addressing this challenge are still scarce (Fernandez et al. 2019). Recently, Fernandez et al. (2019, 2020) investigated an interactive approach to solving a portfolio optimization problem considering the criteria of expected returns and financial and technical indicators. This approach uses an outranking-based preference model and a differential evolution algorithm. The values of the parameters of the preference model are elicited from the DM both directly and indirectly through his/her holistic judgments (pairwise comparisons on a set of solutions). The interactive method for a portfolio optimization problem with four objectives (return on investment, risks as measured by the Sharpe and Treynor indices, and maximum investment) in Zhou-Kangas & Miettinen (2019) utilizes an ASF as the preference model, by which a reference point specified by the DM is incorporated. Ruiz et al. (2020) applied three interactive methods, namely, the weighting achievement scalarizing function genetic algorithm (WASF-GA), the g-dominance-based NSGA-II (g-NSGA-II) and the parallel multiobjective genetic algorithm (P-MOGA), to a portfolio optimization problem with three objectives (expected return, below-mean absolute semideviation and loss aversion), in which the DM’s preference information is represented by a reference point and the preference models are an ASF, the g-dominance relation and the Pareto-dominance relation, respectively. These methods have demonstrated prominent capabilities in helping a DM find his/her satisfactory portfolios.
However, considering the large number of MV models and their extended versions arising from different scenarios in real applications, as well as the many effective interactive methods proposed in the literature, research on interactive portfolio optimization approaches is still far from mature.
Since the MV model has had a major impact on academic research and the financial industry as a whole (Kolm et al. 2014), and the latest findings indicate that it is a more robust bi-objective model than other well-known models (Pavlou et al. 2019), it is employed as the optimization problem in our study on interactive methods for portfolio optimization. For the EA, we select NSGA-II due to its outstanding performance in a wide range of applications for solving multiobjective optimization problems by both interactive and noninteractive means (Xin et al. 2018; Banzhaf et al. 2020). The aforementioned approaches to the other two aspects of an interactive method, the preference information and the preference model, may be adapted for solving the MV model. However, considering that it is often unrealistic to suppose that a DM with high cognitive capability participates in interactive portfolio optimization, we assume a DM with limited cognition. In addition to proposing a learning-based preference model compatible with such a DM, we concentrate on reducing his/her cognitive burden and on pioneering innovative studies in the area of preference information. Our contributions are as follows:
First, the DM’s cooperative preference articulation regarding primary and auxiliary factors is investigated. When a human being is involved in an interactive decision process, whether he/she can articulate his/her preferences correctly is the key to the success of the interactive method. In general, a DM can learn from interactions and adjust his/her preferences (Xin et al. 2018). However, a complex question may cause a high percentage of noise in the preference information a typical DM gives in response; therefore, a large number of interactions may be needed for the adjustment, which is a heavy cognitive burden beyond his/her bounded rationality (Battiti & Passerini 2010). Even though qualitative pairwise comparisons have been adopted as easy questions in many interactive methods, the DM’s cognitive limitations can still induce noise in his/her responses; thus, corrections are needed through interaction (Goulart & Campelo 2016; Li et al. 2019). Another fact that has been ignored in the literature is that a DM with limited cognition may have trouble giving a definite answer even to an easy question in some cases, e.g., when he/she lacks knowledge about the optimization problem at the early stage of interaction, or when the solutions provided for reference become too similar at the later stage of interaction. A human’s cognition is affected by both objective and subjective factors (Settles 2010; Frenay & Verleysen 2014; Greco et al. 2016): the former include the complexity of the question and a lack of information to support a decision, and the latter are the DM’s cognitive limitations. Therefore, based on qualitative pairwise comparison questions, our study aims to provide additional valuable information to support the DM in articulating his/her preferences as quickly and precisely as possible.
In existing interactive methods, a DM’s preferences are articulated only with respect to primary factors, i.e., the objective functions of a multiobjective optimization problem; auxiliary factors, even those closely correlated with the primary ones, are not utilized during the decision process. However, these correlated auxiliary factors can help a DM give a more reliable answer when he/she is not confident in judging a pair of solutions with very similar objective values. For example, in the optimization of a bi-objective traveling salesman problem (Wang et al. 2015), when a DM cannot select the better itinerary considering only the cost and time objectives, whose values are similar, providing him/her with additional city information (encoded in the problem's decision variables) may enable him/her to select an itinerary through cities he/she prefers. A DM’s cognitive burden can be measured by the total number of pairwise comparisons made during the interactive process, and his/her comparison results are then used to learn a preference model. From the perspective of concept learning (Sheng et al. 2008; Ipeirotis et al. 2014), the lower the percentage of noise in the results, the fewer comparisons are needed to learn the correct preference model. The use of valuable information such as auxiliary factors, in addition to the adoption of easy questions, in our interactive method can greatly reduce the noise in a DM’s preference feedback, thus lowering the cognitive burden required to obtain a portfolio in his/her best interests.
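As a hedged illustration of how pairwise-comparison results can be used to learn a preference model, the following sketch fits a linear utility function, perceptron-style, from holistic comparisons generated by a simulated noiseless DM; the update rule, the simulated DM's weights and all data are illustrative assumptions, not the model proposed in this paper:

```python
# Hedged sketch: learning a linear utility model from pairwise comparisons
# with a perceptron-style update. The simulated DM and all data are made up.
import numpy as np

def learn_weights(comparisons, n_obj, epochs=100, lr=0.1):
    """comparisons: (preferred, other) pairs of objective vectors (minimized).
    Learns nonnegative weights w with utility u(f) = -w·f that rank each
    preferred vector above the other one."""
    w = np.ones(n_obj) / n_obj
    for _ in range(epochs):
        for a, b in comparisons:              # DM prefers a over b
            a, b = np.asarray(a, float), np.asarray(b, float)
            if w @ (b - a) <= 0:              # pair violated: -w·a <= -w·b
                w = np.clip(w + lr * (b - a), 0.0, None)
                s = w.sum()
                w = w / s if s > 0 else np.ones(n_obj) / n_obj
    return w

# Simulated DM who weighs the first objective twice as heavily as the second:
true_w = np.array([2.0, 1.0])
rng = np.random.default_rng(1)
pts = rng.uniform(0.0, 1.0, size=(20, 2))
comps = [(a, b) if true_w @ a < true_w @ b else (b, a)
         for a, b in zip(pts[0::2], pts[1::2])]
w_hat = learn_weights(comps, 2)
```

With noiseless answers the learned weights reproduce the comparisons after few passes; noisy answers would require many more comparisons, which is exactly the cognitive-burden argument made above.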
Second, the DM’s decision behavior supported by primary and auxiliary factors during the interactive process is simulated. There are two types of DMs employed in interactive methods, real and artificial ones. The inconsistency of human nature and the variability among humans make it very hard to conduct good-quality experiments with real DMs (Ojalehto et al. 2016; Chen et al. 2017), so we use an artificial DM in our interactive method. At present, a DM’s true preferences are emulated by a value function of the objective functions (Deb et al. 2010; Chen et al. 2017). Such a DM-emulating value function is used only to give his/her preferred solutions at each interaction; however, a DM’s complex decision behavior supported by primary factors is not simulated. In our study, a DM-emulating value function based on primary factors is also formulated for portfolio optimization. This function is used to give the DM’s preferred solutions when he/she is able to make a selection. Furthermore, the DM’s decision behavior is simulated for the case in which he/she is not able to make a selection, showing how he/she becomes able to decide when assisted by auxiliary factors. This simulation is the first attempt to provide an interpretation of a DM’s complex cognitive activity in the interactive decision process.
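The idea of an artificial DM who sometimes cannot decide can be sketched as follows; this is a toy illustration only, not our actual simulation: the Tchebycheff value function, the ideal point, the weights and the indifference threshold tau are all assumptions made for the sketch:

```python
# Toy illustration (not this paper's simulation): an artificial DM whose true
# preferences are emulated by a weighted Tchebycheff value function and who
# abstains when two portfolios' values are too close to tell apart. The ideal
# point, weights and indifference threshold tau are assumptions.
import numpy as np

def dm_value(f, ideal, weights):
    """Lower is better: weighted Tchebycheff distance to the ideal point."""
    return float(np.max(weights * (np.asarray(f, float) - np.asarray(ideal, float))))

def artificial_dm(fa, fb, ideal, weights, tau=0.05):
    """Return 'a', 'b', or 'undecided'; the undecided case is where, in a
    setting like ours, auxiliary factors would be presented as support."""
    va, vb = dm_value(fa, ideal, weights), dm_value(fb, ideal, weights)
    if abs(va - vb) < tau * max(abs(va), abs(vb), 1e-12):
        return "undecided"
    return "a" if va < vb else "b"

ideal = np.array([0.0, -0.15])      # hypothetical ideal point in (risk, -return)
weights = np.array([1.0, 1.0])
clear = artificial_dm([0.10, -0.05], [0.30, -0.08], ideal, weights)      # "a"
close = artificial_dm([0.20, -0.06], [0.201, -0.0605], ideal, weights)   # "undecided"
```

The abstention branch is what distinguishes a cognition-limited artificial DM from the always-deciding value-function DMs used in prior work.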
The remainder of this paper is organized according to the human-machine interaction system depicted in Fig. 1. In section 2, we define auxiliary factors based on the MV model and investigate how they can be presented to a DM in a cooperative manner for preference articulation. The behavior of a DM making a judgment supported by auxiliary factors is simulated in section 3. In section 4, we first present a learning-based preference model compatible with a cognition-limited DM and then describe the implementation steps of our proposed interactive method. The effectiveness of our innovative approaches is demonstrated in section 5. Finally, we conclude the research and suggest directions for future development.