 Research
 Open Access
Portfolio optimization by improved NSGAII and SPEA 2 based on different risk measures
 Massimiliano Kaucic^{1}Email authorView ORCID ID profile,
 Mojtaba Moradi^{2} and
 Mohmmad Mirzazadeh^{3}
 Received: 19 April 2018
 Accepted: 26 May 2019
 Published: 5 June 2019
Abstract
In this study, we analyze three portfolio selection strategies for lossaverse investors: semivariance, conditional valueatrisk, and a combination of both risk measures. Moreover, we propose a novel version of the nondominated sorting genetic algorithm II and of the strength Pareto evolutionary algorithm 2 to tackle this optimization problem. The effectiveness of these algorithms is compared with two alternatives from the literature from five publicly available datasets. The computational results indicate that the proposed algorithms in this study outperform the others for all the examined performance metrics. Moreover, they are able to approximate the Pareto front even in cases in which all the other approaches fail.
Keywords
 Multiobjective portfolio optimization
 Semivariance
 CVaR
 NSGAII
 SPEA 2
 Intermediate crossover
 Gaussian mutation
Introduction
The portfolio selection problem can be defined as the optimal allocation of wealth among a finite number of assets that follows careful processing of all available information about both investors and markets (Meucci 2009). Markowitz’s meanvariance model is by far the most popular procedure in asset allocation (Guerard 2009). Even if it is considered the cornerstone in this field, the meanvariance portfolio optimization model presents two serious drawbacks from a theoretical point of view. First, when asset returns are skewed and fattailed, they tend to include only a limited proportion of stochastically dominant assets in the efficient solutions, and prematurely preclude asset with negatively skewed returns. Second, risk is measured by variance, which treats both the above and below target returns equally, while investors are more concerned about the probability of investment returns falling below the target return. Consequently, risks are underestimated and portfolios that are downside efficient are ruled out.
A solution proposed in the literature to make Markowitz’s approach more effective is to replace the variance with a downside risk measure in order to model the loss aversion of investors properly. In this context, semivariance has been studied extensively (Nawrocki 1999; Sing and Ong 2000). Two main reasons justify these efforts. First, semivariance is an approximation of the skewness for the return distribution, since it measures belowtarget returns. The higher is the value of semivariance, the greater are both the degree of negative skewness and the risk of the investment. Second, semivariance efficient portfolios closely approximate the stochasticdominance efficient set. However, the computation of portfolio semivariance is a difficult task owing to the endogenous nature of the portfolio cosemivariance matrix, which depends on the weights assigned to each asset, that is, changes in the weights affect the periods in which the portfolio underperforms the target level, which, in turn, affects the evaluation of the cosemivariance matrix itself. Some attempts have been proposed to solve this problem directly (Hogan and Warren 1972; Konno et al. 2002; Markowitz 1959; Markowitz et al. 1993). Other studies have focused on the definition of an exogenous cosemivariance matrix that satisfactorily approaches the endogenous one (Ballestero and PlaSantamaria 2005; Cumova and Nawrocki 2011; Estrada 2008; Nawrocki 1991). Nowadays, because of the regulatory importance of quantifying large losses in banking and insurance, another class of downside risk measures, called quantilebased measures, occupies a leading position in the risk management sector. One of the most popular examples is valueatrisk (VaR), defined as the maximum loss occurring over a given period at a given confidence level. Although VaR is apparently easy to use and intuitive, it presents several disadvantages. Primarily, it ignores losses exceeding VaR and is not subadditive, that is, diversification of the portfolio may increase VaR (Artzner et al. 1999). From a computational perspective, VaR is difficult to use when investors want to optimize their portfolios, since it is represented by a nonlinear, nonconvex, and nondifferentiable function with multiple local optima (Gaivoronski and Pflug 2005). Moreover, rational agents wishing to act on their decisions according to expected utility theory may be misled by the information related to the portfolio VaR (Yamai et al. 2002). To deal with these shortcomings, (Rockafellar et al. 2000) introduced the conditional valueatrisk (CVaR), which is defined as the conditional expectation of losses above the VaR (Sarykalin et al. 2008). CVaR is a coherent risk measure in the sense of (Artzner et al. 1999) and, because it is a convex function, optimization problems with CVaR as the minimization objective and/or constraints can be efficiently handled (Krokhmal et al. 2002; Larsen et al. 2002; Rockafellar and Uryasev 2002).
Increasing complexity of practical applications has led researchers to develop heuristic procedures for solving their portfolio optimization problems. These techniques require less domain information to be considered than the standard gradientbased mathematical programming methods do. Moreover, they guarantee satisfactory approximations to solutions in a fair computational time even when they deal with nonconvexity, discontinuity, and integer decision variables. The approaches that have been proposed in the softcomputing literature can be categorized into the following two groups. On one hand, single objective methods optimize a weighted sum of the portfolio objectives. On the other hand, multiobjective evolutionary algorithms (MOEAs) attempt to tackle the allocation problem directly in its multiobjective form by simultaneously optimizing risk and reward. In the first case, the complete set of riskreturn profiles is obtained by varying a parameter that represents the risk aversion of the investor (Chang et al. 2000; Crama and Schyns 2003; Cura 2009; WoodsideOriakhi et al. 2011). In the second case, the complete efficient frontier is represented in a single run (Anagnostopoulos and Mamanis 2011a; Meghwani and Thakur 2017; Mishra et al. 2014). Both categories pay great attention to encoding types and constraint–handling techniques (Liagkouras and Metaxiotis 2015; Meghwani and Thakur 2017; Metaxiotis and Liagkouras 2012; Ponsich et al. 2013). Some reallife situations, which are not considered in Markowitz’s model, have been analyzed recently by (Eftekharian et al. 2017) and (Meghwani and Thakur 2018). (Eftekharian et al. 2017) include as constraints some restrictions on the number of assets in the portfolio, limitations on investing in assets from a given industry, and cardinality, class, and quantity constraints. Furthermore, they developed an improved version of the NSGA II algorithm, called 2Phase NSGA II, to solve the resulting optimization problem. Meghwani and Thakur (2018) focus on the problem of handling equality constraints, like selffinancing constraints, and constraints arising from the inclusion of transaction cost models using MOEAs. Researchers have also focused on socalled swarm intelligence methods to overcome the computational difficulties of realistic portfolio designs. Unlike evolutionary algorithms that utilize the principle of natural selection, these approaches are inspired by the behavior and selforganizing interaction among agents, such as foraging of ant and bee colonies, bird flocking, and fish schooling (see Ertenlice and Kalayci (2018) for a detailed review of the subject).
Over the past few decades, machinelearning algorithms have been widely used to explore financial data and make datadriven predictions (Chao et al. 2019; Kou 2019). For instance, (Huang and Kou 2014) present a kernel entropy manifold learning algorithm to measure the relationships between two financial data points in order to describe the characteristics of a financial system by deriving the dynamic properties of the original data space. Similarly, (Huang et al. 2017) propose an information metricbased manifold learning algorithm to extract the intrinsic manifold of a dynamic financial system and to detect impending crises. (Ergu et al. 2014) focus on the use of the analytical network process in risk assessment and decision analysis of an emergent event. They propose a new consistency index to assess the key factors of risks and analyze the impacts and preferences of decision alternatives. Zhang et al. (2019) develop three soft minimum cost models based on different weighted average operators for resolving consensus decision–making problems. The usefulness of the proposed models is validated in a real–world loan consensus problem, using data from a Chinese P2P platform.
Kalayci et al. (2019) review state–oftheart methods dedicated to meanvariance portfolio optimization.
Evolutionary computations are made by iterations and, in each iteration, the weights of the portfolios are known a priori because they are generated by the algorithm. Thus, it is possible to directly determine in which periods the portfolio underperforms the target level. In this manner, the difficulty of computing the expost portfolio semivariance can be fruitfully resolved by the use of an evolutionary algorithm. One of the first studies to attempt this is (Dueck and Winker 1992), who reformulate the biobjective optimization problem into a singleobjective and solve it through a local search technique, called threshold accepting. In a similar way, (Arnone et al. 1993) propose a solution to a parametric programming problem with the objective of optimizing a convex combination of mean and semivariance. The algorithm is based on Whitley’s GENITOR system (Whitley 1988) and uses a steadystate breeding strategy and elitist selection. Chang et al. (2009) apply a genetic algorithm to portfolio optimization problems in different risk measures, namely variance, semivariance, absolute deviation, and variance with skewness. Recently, (Liagkouras and Metaxiotis 2013) address the meansemivariance portfolio optimization problem from a multiobjective perspective by developing an adhoc evolutionary algorithm. Evidence of the robustness of the algorithm is accomplished in outofsample testing during both bull and bear market conditions on the FTSE 100. Macedo et al. (2017) compare the nondominated sorting genetic algorithm II (NSGAII, Deb et al. (2002)) and the strength Pareto evolutionary Algorithm 2 (SPEA 2, Zitzler et al. (2001)) within the meansemivariance portfolio optimization framework. Numerical experiments indicate that NSGAII outperforms SPEA 2 insample. Senhaji et al. (2016) propose to resolve the problem by combining the continuous Hopfield neural network with NSGAII. The effectiveness of this strategy is proved using a portfolio of 20 assets.
Regarding the quantilebased minimization problems involving many financial instruments, a large number of price scenarios is necessary to estimate risk correctly. As the dimension of the problems increases, this operation can be time consuming (Lim et al. 2010) and the use of heuristics may be advisable or even necessary to detect a solution. An example is provided by the multipurpose datadriven optimization heuristic proposed in Gilli M and Hysi (2006), which deals with different risk functions, such as VaR, CVaR, maximum loss, and Omega, as well as with practical constraints on the portfolio composition. An evolutionary computation approach is developed by (Hochreiter 2007) to solve the general scenariobased riskreturn portfolio optimization problem when standard deviation, VaR, or CVaR are used to represent the risk of the investment. A variant of the NSGAII algorithm based on differential operators is developed by (Krink and Paterlini 2011) for portfolio optimization involving realworld constraints and quantilebased risk measures. Meanwhile, a comparison of the capabilities of different MOEAs to adapt in any addition of new constraints and/or replacement of the risk function is given in (Anagnostopoulos and Mamanis 2011b), and (BaixauliSoler et al. 2012) focus on SPEA 2 performance. Recently, hybrid stock trading systems based on evolutionary metaheuristics and meanCVaR models are proposed in Chen and Wang (2015) and (Qin et al. 2014).
The contribution of this study is twofold. On one hand, based on the promising results of (Subbu et al. 2005) and (BaixauliSoler et al. 2010), we include in the same portfolio optimization framework the lossaverse attitude of investors as well as the capital requirements imposed by the regulator, and we investigate the relationship between semivariance and CVaR in quantifying the downside risk. On the other hand, we extend the research of (Liu et al. 2010) and investigate the effectiveness of using the uniform selection scheme, the extended intermediate crossover operator (Gen and Cheng 2000; Mühlenbein and SchlierkampVoosen 1993), and the Gaussian mutation (Hinterding 1995; Schwefel 1987) in the NSGAII and SPEA 2 algorithms in order to generate the approximated Pareto fronts for the considered downside riskbased portfolio optimization problems. In addition, the performance of the proposed algorithms is compared to that of other variants of the NSGAII and SPEA 2 algorithms that have already been shown to be highly competitive in portfolio optimization problems. The results on five publicly available datasets show that our procedures completely dominate the fronts produced by the counterparts from the literature. Furthermore, our variants of the NSGAII and SPEA 2 algorithms can generate the entire Pareto front for largescale problems for which the other Paretobased approaches are unable to work properly.
The rest of the paper is organized as follows. In “The portfolio selection problem under downside risk measures” section, we introduce three preference relations based on the rewarddownside risk principle and discuss the related portfolio selection problems. “Paretobased evolutionary algorithms” section presents a description of the MOEAs used for solving the resulting optimization problems. “Experimental analysis” section outlines the numerical experiments and “Concluding remarks and future research” section presents concluding remarks and ideas for further research.
The portfolio selection problem under downside risk measures
where x_{i} represents the proportion of capital to be allocated to asset i, with i=1,…,n.
where \(\mathbb {E}(\cdot)\) denotes the expectation and \(\mathbb {E}(R_{i})\) is the expected rate of return of asset i. Given a level z for the rate of return, the cumulative distribution function (cdf) of R(x) is defined as \(F_{R(\mathbf {x})}(z) = P(R(\mathbf {x}) \leq z) = \int _{\{R(\mathbf {x}) \leq z\}} f_{R(\mathbf {x})}(r) dr\). It is also assumed that F_{R(x)} is continuous and strictly increasing with respect to \(z \in \mathbb {R}\)^{1}. The portfolio loss distribution is defined as the negative of the portfolio return distribution, that is, L(x)=−R(x).
To identify the portfolio in \(\mathcal {X}\) that guarantees the “best” rate of return, a model for preferences under uncertainty needs to be defined. We adopt the socalled rewardrisk approach that relates the portfolio selection problem to a multiobjective optimization problem in two steps. First, a set of objectives that the investor perceives as beneficial is identified, and second, a set of objectives he or she considers damaging is identified in relation to R(·). Then, a preference relation is defined based on these criteria as follows.
Definition 1
Let \(f_{1}, \ldots, f_{p} \colon \mathcal {X} \to \mathbb {R}\) and \(g_{1}, \ldots, g_{q} \colon \mathcal {X} \to \mathbb {R}\) be the rewardtype and the risktype objective functions, respectively. Then for all \(\mathbf {x}, \mathbf {y} \in \mathcal {X}\), we say that R(x) dominates (is preferred to) R(y) if and only if f_{i}(x)≥f_{i}(y) for all i=1,…,p and g_{j}(x)≤g_{j}(y) for all j=1,…,q with at least one strict inequality. Alternatively, we can say that portfolio x dominates portfolio y.
The solutions to Problem (3) form the socalled efficient set, or Pareto optimal set, of which image in the rewardrisk space is called efficient frontier, or Pareto front. Under certain smoothness assumptions, it can be induced from the Karush–Kuhn—Tucker conditions that, when the p+q objectives are continuous, the efficient frontier defines a piecewise continuous (p+q−1)dimensional manifold in the decision space (Li and Zhang 2009). Therefore, the efficient frontier of a continuous biobjective portfolio optimization problem is a piecewise continuous 1D curve in \(\mathbb {R}^{2}\), while, in the case of a problem with three objectives, it is a piecewise continuous 2D surface in \(\mathbb {R}^{3}\).
We now specialize this general rewardrisk framework by considering investors that focus on the mean rate of return (2) as the reward criterion and employ different measures to assess the downside risk of their investments.
The meansemivariance model
where [ u]^{−}= max{0,−u} and b represents a given benchmark, for example, the portfolio expected rate of return, the Roy’s safety first criterion, or a Treasury rate of return. In our analysis, we set the target level b equal to 0 in order to estimate the variance of portfolio losses. As proved in Fishburn (1977), this downside risk measure is consistent with utility theory
The meanCVaR model
where \({VaR}_{\alpha }(L(\mathbf {x})) = \sup \left \{z  P\left (L(\mathbf {x}) \geq z\right) \leq \alpha \right \} = F_{L(\mathbf {x})}^{1}(1\alpha)\) is the valueatrisk at the confidence level α of the portfolio x.
The meansemivarianceCVaR model
where V^{−} and CVaR_{α} are the same as the definitions in (4) and (6), respectively.
In this manner, the preferences are modeled as follows.
Definition 2
Portfolio x is preferred to portfolio y if and only if \(\mathbb {E}\left (R(\mathbf {x})\right) \geq \mathbb {E}\left (R(\mathbf {y})\right), {CVaR}_{\alpha }(L(\mathbf {x})) \leq {CVaR}_{\alpha }(L(\mathbf {y}))\) and V^{−}(R(x))≤V^{−}(R(y)), with at least one strict inequality.
This model may produce improved solutions when a meanCVaR efficient portfolio has an excessively large semivariance or when a meansemivariance efficient portfolio has an excessively large CVaR.
Scenariobased framework for portfolio optimization
Now it is possible to reformulate the previously introduced downside riskbased portfolio allocation models in terms of the scenarios conveyed in (9).
where \(\mathbb {I}(\cdot)\) represents the indicator function. In this case, the investor maximizes the expected rate of return as given in Eq. (10) and minimizes the risk calculated according to Eq. (14).
Finally, Problem (8) simultaneously exploits Eqs. (10), (13), and (14) in the optimization process.
Paretobased evolutionary algorithms
The capabilities of MOEAs to generate reasonably good approximations of the Pareto front in a single run and within a limited computational time have already been shown in the literature on the meanvariance portfolio selection problem.
In this study, we focus on two variants of popular algorithms, NSGAII and SPEA 2, and investigate their effectiveness in solving the portfolio optimization problems formulated in the previous section. These algorithms belong to the family of Paretobased MOEAs and include a twolevel ranking scheme to guide the search toward the true Pareto front (Emmerich and Deutz 2018). The first ranking is provided by the Pareto dominance relation, while the second concerns the diversity of the solutions and applies to the individuals that share the same position in the first ranking. However, the methods by which NSGAII and SPEA 2 approximate the true Pareto front differ and the corresponding procedures are described in the following overview^{2}.
NSGAII
The NSGAII procedure is described in Algorithm 1. The crowding distance mechanism is employed to preserve the diversity of solutions. It evaluates the volume of the hyperrectangle defined by two nearest neighbors (Zitzler and Thiele 1999). New solutions, called offsprings, are generated using a selection mechanism and a set of variation operators. Based on the values provided by the ranking scheme, the best individuals from the combination of the current population P_{k} and the offspring pool Q_{k} are detected and those with lower rank and higher crowding distance are saved in the next population P_{k+1}. In case some candidate solutions are of the same rank and not all of them enter the next population, the less crowded individuals from that particular rank are selected to fit the next population, thereby ensuring elitism.
SPEA 2
As reported in Algorithm 2, SPEA 2 first initializes a population of candidate solutions P_{k}, then stores the best solutions in an explicit archive A_{k}, separate from the population. To emphasize nondominated individuals, SPEA 2 uses a combination of the dominance count and the dominance rank methods. Each individual is assigned a raw fitness value depending on both the number of individuals it dominates and the number of individuals by which it is dominated. The density information is expressed as a function of the kth smallest Euclidean distance in the objective space to the kth nearest neighbor. The nondominated individuals from the union of the archive and the current population are then updated. In particular, if the number of nondominated individuals is less than the preestablished archive size, some dominated individual from the current pool form part of the archive. Otherwise, some individuals are removed from the archive using a truncation operator. This procedure recursively removes individuals based on the nearest neighbor Euclidean distance. If there is more than one candidate solution with the same minimum distance, then the decision is made by considering the second nearest neighbor, and so forth. The mating pool used to generate the next population P_{t+1} is filled by the individuals of the updated archive selected on the basis of a given selection mechanism. The offsprings are then generated by a set of variation operators as in the algorithm NSGAII.
Solution approaches
NSGAII and SPEA 2 cannot be directly implemented in their standard format for the solution of realworld portfolio selection problems and special treatments need to be adopted in relation to a number of issues, like the encoding type, genetic operators, and constraint–handling procedures.
Solution representation and initialization

generate n independent unitexponential random variables Y_{1},…,Y_{n} and compute \(T_{n} = \sum \nolimits _{i = 1}^{n} Y_{i}\);

define x_{i}=Y_{i}/T_{n}.
It is evident that, in this manner, all the individuals in the initial population are feasible.
Reproduction process
Two variants of the reproduction process are compared in this study, both employing a selection procedure, a crossover, and a mutation operator.
 1.
Binary tournament selection is used for selecting the parents. In this process, two individuals are chosen randomly from the population and compete against each other. The individual with the highest fitness wins and is included as one parent for the next steps of the reproduction.
 2.
The uniform crossover operator is then applied to the population of parents to produce the offspring population. Two parents generate a single child and its value for each array is selected with equal probability from one or another parent.
 3.The set of children is finally subject to the Gaussian mutation. A percentage P_{mut} of these individuals is selected. Then, each member x of this subset has a probability μ_{m} that a gene x_{i} mutates according to the following rule:where σ_{m} is the mutation step size, and rand_{0},rand_{1} represent two independent random numbers from a standard normal distribution, with i=1,…,n.$$x^{\prime\prime}_{i} = \left\{\begin{array}{ll} x_{i} + \sigma_{m} {rand}_{1}, &\quad \text{if}\ {rand}_{0} < \mu_{m}, \\ x_{i}, &\quad \text{otherwise.} \end{array} \right. $$
 1.
Uniform selection is used to generate two subpopulations of parents. The first of these sets contains the fraction P_{cross} of the original population that is involved into the recombination step. The second set contains the fraction P_{mut} of the original population that is subject to mutation. These sets cannot be separated, in that the same individual can enter the crossover stage and can mutate.
 2.The intermediate crossover is applied to the first subpopulation of parents to generate a first set of offsprings. Differently from the uniform crossover, this operator generates two children for each pair of parents x_{1},x_{2}, as follows$$\begin{array}{*{20}l} x^{\prime}_{1i} & = {csf}_{i} x_{1i}  \left(1{csf}_{i}\right) x_{2i} \\ x^{\prime}_{2i} & = {csf}_{i} x_{2i}  \left(1{csf}_{i}\right) x_{1i} \end{array} $$
where csf_{i}, called the crossover scaling factor, is a random number in the interval \([d, 1+d], d \in \mathbb {R}\) is a parameter to be tuned, and \(\mathbf {x}^{\prime }_{1}, \mathbf {x}^{\prime }_{2}\) are the associated children, with i=1,…,n. The second population of parents is modified by the Gaussian mutation as in the first configuration.
 3.
The subpopulations of children are finally gathered together to form the offspring population.
The variants of the considered MOEAs involving this reproduction design are denoted by NSGAIIa and SPEA 2a.
Constraint–handling procedure
 (a)Each candidate solution \(\mathbf {x} \in \mathbb {R}^{n}\) is first clamped by projecting it onto [0,1]^{n}:with i=1,…,n. In this manner, \(\widetilde {\mathbf {x}} = \left (\widetilde {x}_{1}, \ldots, \widetilde {x}_{n}\right)^{T}\) satisfies the lower bound constraints.$$\widetilde{x}_{i} = \left\{ \begin{array}{ll} 0, & \text{if}\ x_{i} < 0 \\ 1, & \text{if}\ x_{i} > 1 \\ x_{i}, & \text{otherwise} \end{array} \right. $$
 (b)The projected vector \(\widetilde {\mathbf {x}} \) is then normalized through the transformationAfter this step, the individual \(\widetilde {\widetilde {\mathbf {x}}} = \left (\widetilde {\widetilde {x}}_{1}, \ldots, \widetilde {\widetilde {x}}_{n}\right)^{T}\) also verifies the budget constraint.$$\widetilde{\widetilde{x}}_{i} = \frac{\widetilde{x}_{i}}{\sum\nolimits_{j=1}^{n} \widetilde{x}_{j}}, \quad i = 1, \ldots, n\,. $$
This procedure makes all the individuals involved in the search feasible. The corresponding objective function values can now be computed and the MOEAs described above can be applied to identify an approximation of the Pareto set.
Computational complexity
Run time complexity for the algorithms
Algorithm  Runtime 

NSGAIIb  \(\mathcal {O}(G m (2N)^{2})\) 
NSGAIIa  \(\mathcal {O}\left (G m \left (1+2P_{cross}+P_{mut}\right)^{2} N^{2}\right)\) 
SPEA2b  \(\mathcal {O}\left (G (2N)^{3}\right)\) 
SPEA 2a  \(\mathcal {O}\left (G \left (1+2P_{cross}+P_{mut}\right)^{3} N^{3}\right)\) 
Experimental analysis
In this section, we assess the effectiveness of the two variants of NSGAII and SPEA 2 in solving the proposed instances of the portfolio optimization problem.
Description of the datasets
Datasets of weekly returns considered in this study (Bruni et al. 2016)
Dataset name  Stock market index  Assets (n)  Time interval  # of obs. 

DowJones  Dow Jones Industrial Average  28  Feb 1990 – Apr 2016  1363 
FF49Industries  Fama and French 49 Industry  49  Jul 1969 – Jul 2015  2325 
NASDAQ100  NASDAQ 100  82  Nov 2004 – Apr 2016  596 
SP500  S&P 500  442  Nov 2004 – Apr 2016  595 
NASDAQComp  NASDAQ Composite  1203  Feb 2003 – Apr 2016  685 
Classification of the assets belonging to each dataset in terms of skewness (Skew), standard semideviation (Stsd) and standard deviation (Std)
Skew<−0.5  Skew>0.5  

Dataset name  Assets  Stsd>Std  Assets  Stsd<Std 
DowJones  2  2  1  1 
FF49Industries  2  2  6  6 
NASDAQ100  2  2  20  20 
SP500  25  25  109  108 
NASDAQComp  19  15  668  668 
Consequently, lossaverse investors can use standard semideviation (or, equivalently, semivariance) to capture the downside risk conveyed in skewness properly. In this sense, the portfolio selection models designed in the previous section are more appealing than the mean variance model for acting investment choices.
Performance metrics and statistical testing
It is nontrivial to evaluate the quality of the solution sets of MOEAs for two main reasons. First, the presence of multiple conflicting goals makes the definition of a “better algorithm” vague. Second, the stochastic nature of these optimizers suggests that comparisons based on the approximation sets from a single run of each algorithm are not correct.
Regarding the first problem, we can identify three major performance criteria in multiobjective optimization: the capacity of a given algorithm to generate an appropriate number of nondominated solutions, the convergence of the solution set to the true Pareto front, and the diversity of the solutions in the objective space. Accordingly, optimal solution sets with a large number of nondominated solutions, approaching the true Pareto front and even scattering are generally desirable (Jiang et al. 2014).
 (i)
Schott’s spacing metric (S) (Schott 1995)
The spacing metric measures how evenly the solutions are distributed in the approximate efficient front A and it is expressed aswhere d_{i} represents the minimum value of the sum of the absolute difference in objective function values between the ith solution and any other solution in the obtained nondominated set and \(\overline {d}\) is the mean value of these distance measures.$$S(A) = \sqrt{\frac{1}{A}\sum\limits_{i = 1}^{A} \left(d_{i}\overline{d}\right)^{2}} $$When the solutions are equidistantly spaced, the corresponding distance measure is small. Thus, an algorithm finding a set of nondominated solutions with smaller spacing is better.
 (ii)
Generalized spread metric (Δ^{∗}) (Zhou et al. 2006)
The generalized spread metric is a generalization of the well–known Δ metric of (Deb et al. 2002) and simultaneously gauges the distribution and spread of an optimal solution set A for high dimensional multiobjective optimization problems. It takes the form:where d_{i} denotes the Euclidean distance between neighboring solutions with the mean value \(\overline {d}\). The term \(d_{m}^{P}\) is the distance between the extreme (bounding) solutions of A and of the true Pareto front P corresponding to the mth objective function.$$\Delta^{*}(A,P) = \frac{\sum\nolimits_{m = 1}^ M d_{m}^{P} + \sum\nolimits_{i = 1}^{A} d_{i}  \overline{d}}{\sum\nolimits_{m = 1}^ M d_{m}^{e} + A \overline{d}} $$An algorithm finding a smaller value of Δ^{∗} generates a better uniformly distributed set of nondominated solutions.
 (iii)
Inverted generation distance (IGD) (Zhang and Li 2007)
The IGD index has the following formulationwhere d_{i} is the Euclidean distance (in the objective space) between the ith member of the true Pareto front P and the closest solution in A. If P is large enough to represent the Pareto front very well, IGD(A,P) could measure both the diversity and convergence of A.$$IGD(A,P) = \frac{\sqrt{\sum\nolimits_{i = 1}^{P} d_{i}^{2}}}{P} $$A low value of IGD(A,P) means A is very close to the Pareto front and does not miss any part of it. Thus, an algorithm with a lower value of IGD is better.
 (iv)
Hypervolume (HV) (Zitzler and Thiele 1999)
Similar to the IGD index, the hypervolume indicator evaluates both diversity and convergence of an approximation set A. It is defined as the size of the portion of objective space that is dominated by at least one point of A relative to a reference set R, which is formed by points worse than (or equal to) every point in A in every objective. Formally, we define it aswhere v_{i} is the hypercube constructed with the reference set R and the solution a_{i}∈A as the diagonal corners.$$HV(A,R) = volume\left(\bigcup_{i = 1}^{A} v_{i}\right) $$Large values of HV indicate the approximate solutions are closer to the true Pareto front and, at the same time, scattered more evenly in the objective space. Thus, an algorithm with a large value of the HV metric is desirable.
The second question posed at the beginning of this section can be resolved by comparing the algorithms through a sample of approximation sets from multiple runs. In this manner, we obtain a sample of values for each performance metric and for each portfolio optimization problem. A rigorous comparison between the algorithms can then be performed based on nonparametric inference testing (Coello et al. 2007).
Parameter settings
Let us assume that the distribution of historical returns acts as a good proxy of the returns faced over the next holding period. Then, the historical simulation method is used to compute the financial scenarios r_{is}, with i=1,…,n and s=1,…,S, as well as the cosemivariance matrix C^{−}.
Average rankings achieved by the Friedman test for the NSGAIIa algorithm with different parameter settings
NSGAIIa  Rankings 

P_{cross}=0.45,d=0.5,P_{mut}=0.5,μ_{m}=0.1,σ_{m}=0.15  58.89 
P_{cross}=0.45,d=1,P_{mut}=0.3,μ_{m}=0.1,σ_{m}=0.10  5 9 . 3 3 
P_{cross}=0.45,d=1,P_{mut}=0.3,μ_{m}=0.1,σ_{m}=0.15  58.00 
P_{cross}=0.45,d=1,P_{mut}=0.3,μ_{m}=0.3,σ_{m}=0.10  57.89 
P_{cross}=0.45,d=1,P_{mut}=0.5,μ_{m}=0.3,σ_{m}=0.20  59.22 
Average rankings achieved by the Friedman test for the SPEA 2a algorithm with different parameter settings
SPEA 2a  Rankings 

P_{cross}=0.45,d=1,P_{mut}=0.3,μ_{m}=0.1,σ_{m}=0.10  64.89 
P_{cross}=0.45,d=1,P_{mut}=0.3,μ_{m}=0.1,σ_{m}=0.20  63.44 
P_{cross}=0.45,d=1,P_{mut}=0.5,μ_{m}=0.1,σ_{m}=0.10  6 9 . 2 2 
P_{cross}=0.45,d=1,P_{mut}=0.5,μ_{m}=0.1,σ_{m}=0.15  67.89 
P_{cross}=0.45,d=1,P_{mut}=0.5,μ_{m}=0.1,σ_{m}=0.20  67.56 
Overall, the results show that the best choices are higher values for P_{cross} and d and, at the same time, lower values for μ_{m} and σ_{m}. However, SPEA 2a needs a larger sample of mutants to achieve the best rankings with respect to NSGAIIa. The configurations in bold are the best parameter settings and are used in the experiments.
Computational results and discussion
We perform a multiproblem analysis in which the four MOEAs are tested on 15 optimization problems (the three instances of the portfolio selection problem over the five datasets introduced in Table 2). To check the robustness of the results, 20 simulations for each algorithm and for each test problem are used. The algorithms are implemented in MATLAB R2018b and the experiments are carried out on a 2.2 GHz Intel Core i7 laptop with 4 GB RAM.
Average number of nondominated solutions in the approximated Pareto front (Mean) with the corresponding standard deviation (Std) for each algorithm
Problem  Dataset  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  

Mean  std  Mean  Std  Mean  Std  Mean  Std  
MeanSV  DowJones  52.10  5.75  248.06  4.93  55.65  6.12  250  0 
FF49Industries  16.65  4.61  250  0  15.20  3.68  250  0  
NASDAQ100  14.25  2.90  250  0  13.40  2.89  250  0  
SP500  11.55  3.22  250  0  11.50  3.32  250  0  
NASDAQComp  13.60  2.93  202.50  20.97  13.60  2.93  245.20  9.29  
MeanCVaR  DowJones  49.70  6.24  247.33  4.86  51.15  6.66  250  0 
FF49Industries  16.10  3.80  249.95  0.22  17.25  3.19  250  0  
NASDAQ100  16.45  3.27  250  0  16.80  3.86  250  0  
SP500  12.60  2.84  250  0  12.45  2.68  250  0  
NASDAQComp  15.80  3.43  238.60  13.18  16.10  3.91  249.05  2.21  
MeanCVaRSV  DowJones  92.90  14.62  246.33  7.57  91.75  11.96  250  0 
FF49Industries  22.25  7.04  250  0  21.25  6.70  250  0  
NASDAQ100  22.45  5.82  250  0  22.50  3.90  250  0  
SP500  18.20  6.18  250  0  18.85  7.51  250  0  
NASDAQComp  25.65  6.03  250  0  25.65  5.59  250  0 
Mean, standard deviation (Std), median, minimum (Min) and maximum (Max) values of the performance metrics for the compared algorithms for the MeanSV portfolio optimization problem
Dataset  Statistic  S  Δ ^{∗}  IGD  HV  

NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  
DowJones  Mean  0.0064  0.0023  0.0069  0.0034  0.8197  0.6101  0.8047  0.5773  0.4081  0.4695  0.4021  0.3377  0.8638  0.9859  0.8531  1.1709 
Median  0.0057  0.0025  0.0061  0.0031  0.8155  0.5627  0.8076  0.5780  0.4121  0.3381  0.3976  0.3377  0.8619  1.1702  0.8467  1.1709  
Std  0.0019  0.0009  0.0021  0.0011  0.0285  0.1424  0.0302  0.0131  0.0203  0.2699  0.0229  0.0002  0.0393  0.3800  0.0489  0.0001  
Min  0.0041  0.0000  0.0038  0.0029  0.7669  0.5437  0.7452  0.5564  0.3683  0.3375  0.3603  0.3374  0.7963  0.1725  0.7953  1.1707  
Max  0.0113  0.0033  0.0116  0.0080  0.8765  1.0000  0.8622  0.6087  0.4439  0.9994  0.4425  0.3383  0.9405  1.1704  0.9763  1.1710  
FF49Industries  Mean  0.0086  0.0023  0.0089  0.0035  0.7854  0.6003  0.7960  0.6165  0.6558  0.4767  0.6534  0.4765  0.5578  1.0536  0.5555  1.0538 
Median  0.0083  0.0023  0.0085  0.0035  0.7811  0.6008  0.8004  0.6173  0.6575  0.4767  0.6554  0.4763  0.5574  1.0536  0.5509  1.0538  
Std  0.0015  0.0002  0.0018  0.0003  0.0175  0.0117  0.0264  0.0112  0.0178  0.0003  0.0183  0.0004  0.0136  0.0002  0.0097  0.0003  
Min  0.0068  0.0020  0.0060  0.0029  0.7540  0.5828  0.7564  0.5946  0.6208  0.4761  0.6147  0.4759  0.5372  1.0534  0.5410  1.0531  
Max  0.0128  0.0029  0.0131  0.0042  0.8265  0.6220  0.8513  0.6409  0.6837  0.4774  0.6778  0.4771  0.5911  1.0540  0.5763  1.0542  
NASDAQ100  Mean  0.0039  0.0027  0.0040  0.0036  0.8822  0.5199  0.8797  0.5888  0.7326  0.2589  0.7330  0.2588  0.4321  1.2061  0.4311  1.2062 
Median  0.0037  0.0026  0.0039  0.0035  0.8822  0.5226  0.8807  0.5865  0.7338  0.2590  0.7344  0.2588  0.4270  1.2061  0.4297  1.2062  
Std  0.0009  0.0005  0.0008  0.0004  0.0110  0.0204  0.0085  0.0175  0.0151  0.0006  0.0121  0.0005  0.0175  0.0004  0.0248  0.0003  
Min  0.0026  0.0019  0.0027  0.0030  0.8633  0.4904  0.8652  0.5620  0.7021  0.2579  0.7091  0.2581  0.4118  1.2051  0.3986  1.2057  
Max  0.0057  0.0039  0.0057  0.0046  0.8999  0.5507  0.8992  0.6220  0.7536  0.2599  0.7541  0.2599  0.4720  1.2066  0.4793  1.2067  
SP500  Mean  0.0016  0.0048  0.0015  0.0041  0.9547  0.6605  0.9551  0.6318  1.0134  0.2787  1.0135  0.2761  0.1481  1.1828  0.1468  1.1953 
Median  0.0015  0.0047  0.0015  0.0039  0.9547  0.6738  0.9550  0.6289  1.0134  0.2788  1.0135  0.2760  0.1467  1.1830  0.1440  1.1960  
Std  0.0003  0.0015  0.0003  0.0008  0.0049  0.0418  0.0046  0.0237  0.0052  0.0013  0.0053  0.0011  0.0057  0.0060  0.0062  0.0025  
Min  0.0011  0.0025  0.0011  0.0032  0.9459  0.5706  0.9471  0.5942  1.0027  0.2757  1.0025  0.2743  0.1418  1.1727  0.1399  1.1885  
Max  0.0024  0.0077  0.0019  0.0065  0.9643  0.7284  0.9654  0.7074  1.0211  0.2815  1.0217  0.2795  0.1616  1.1917  0.1609  1.1981  
NASDAQComp  Mean  0.0010  0.0045  0.0009  0.0089  0.9714  0.7839  0.9711  0.8766  1.1084  0.2070  1.1085  0.2012  0.0499  1.2214  0.0492  1.2472 
Median  0.0010  0.0036  0.0009  0.0057  0.9721  0.7862  0.9703  0.8578  1.1098  0.2045  1.1099  0.1970  0.0488  1.2219  0.0481  1.2509  
Std  0.0002  0.0019  0.0002  0.0079  0.0020  0.0596  0.0027  0.1232  0.0033  0.0133  0.0033  0.0127  0.0034  0.0133  0.0037  0.0123  
Min  0.0007  0.0030  0.0007  0.0025  0.9678  0.6011  0.9654  0.6346  1.1013  0.1991  1.1013  0.1922  0.0464  1.1886  0.0452  1.2132  
Max  0.0014  0.0107  0.0013  0.0361  0.9740  0.8741  0.9769  1.1636  1.1128  0.2607  1.1129  0.2458  0.0570  1.2395  0.0571  1.2579 
Mean, standard deviation (Std), median, minimum (Min) and maximum (Max) values of the performance metrics for the compared algorithms for the MeanCVaR portfolio optimization problem
Dataset  Statistic  S  Δ ^{∗}  IGD  HV  

NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  
DowJones  Mean  0.0080  0.0021  0.0082  0.0045  0.7888  0.6141  0.7795  0.6066  0.4172  0.4957  0.4120  0.3577  0.8327  0.8893  0.8224  1.1028 
Median  0.0074  0.0023  0.0074  0.0045  0.7922  0.5662  0.7774  0.6054  0.4188  0.3578  0.4108  0.3577  0.8193  1.1041  0.8177  1.1027  
Std  0.0028  0.0008  0.0025  0.0005  0.0293  0.1460  0.0328  0.0102  0.0127  0.2451  0.0115  0.0004  0.0314  0.3845  0.0252  0.0003  
Min  0.0044  0.0000  0.0050  0.0038  0.7340  0.5343  0.7277  0.5956  0.3910  0.3575  0.3955  0.3571  0.7866  0.1725  0.7898  1.1020  
Max  0.0155  0.0029  0.0153  0.0056  0.8532  1.0000  0.8469  0.6372  0.4361  0.9095  0.4360  0.3582  0.9013  1.1046  0.8962  1.1033  
FF49Industries  Mean  0.0085  0.0023  0.0083  0.0035  0.8015  0.6135  0.8026  0.6111  0.6740  0.5055  0.6787  0.5052  0.5184  0.9957  0.5103  0.9953 
Median  0.0083  0.0023  0.0082  0.0034  0.8023  0.6131  0.8030  0.6118  0.6792  0.5056  0.6804  0.5052  0.5171  0.9957  0.5089  0.9954  
Std  0.0018  0.0002  0.0015  0.0005  0.0208  0.0165  0.0255  0.0112  0.0162  0.0002  0.0136  0.0002  0.0134  0.0002  0.0115  0.0003  
Min  0.0061  0.0018  0.0053  0.0028  0.7702  0.5807  0.7529  0.5894  0.6404  0.5051  0.6473  0.5049  0.4992  0.9954  0.4924  0.9946  
Max  0.0115  0.0025  0.0111  0.0046  0.8508  0.6456  0.8556  0.6317  0.6970  0.5059  0.7021  0.5057  0.5592  0.9960  0.5416  0.9958  
NASDAQ100  Mean  0.0044  0.0024  0.0044  0.0047  0.8454  0.5250  0.8410  0.6157  0.7159  0.3011  0.7179  0.3009  0.4227  1.1232  0.4132  1.1225 
Median  0.0044  0.0022  0.0044  0.0047  0.8487  0.5259  0.8399  0.6136  0.7223  0.3010  0.7245  0.3007  0.4169  1.1234  0.4106  1.1225  
Std  0.0007  0.0006  0.0006  0.0006  0.0140  0.0194  0.0137  0.0187  0.0165  0.0007  0.0175  0.0006  0.0154  0.0006  0.0141  0.0007  
Min  0.0031  0.0020  0.0031  0.0037  0.8158  0.4981  0.8131  0.5792  0.6785  0.2999  0.6767  0.2997  0.4024  1.1221  0.3920  1.1215  
Max  0.0060  0.0041  0.0056  0.0057  0.8634  0.5854  0.8716  0.6468  0.7366  0.3025  0.7441  0.3019  0.4523  1.1242  0.4506  1.1239  
SP500  Mean  0.0017  0.0046  0.0017  0.0038  0.9455  0.6555  0.9445  0.6475  0.9658  0.3561  0.9658  0.3528  0.1886  1.0962  0.1877  1.1084 
Median  0.0017  0.0039  0.0017  0.0035  0.9464  0.6576  0.9438  0.6473  0.9660  0.3560  0.9660  0.3529  0.1889  1.0964  0.1868  1.1084  
Std  0.0003  0.0019  0.0003  0.0008  0.0050  0.0306  0.0047  0.0267  0.0028  0.0012  0.0030  0.0008  0.0030  0.0053  0.0038  0.0022  
Min  0.0012  0.0027  0.0013  0.0026  0.9383  0.6110  0.9363  0.5905  0.9606  0.3543  0.9599  0.3513  0.1843  1.0843  0.1828  1.1027  
Max  0.0025  0.0106  0.0026  0.0060  0.9559  0.7413  0.9594  0.6927  0.9709  0.3585  0.9717  0.3541  0.1951  1.1052  0.1952  1.1119  
NASDAQComp  Mean  0.0014  0.0062  0.0014  0.0044  0.9527  0.7833  0.9533  0.7366  0.9476  0.4536  0.9475  0.4497  0.2150  0.9999  0.2143  1.0210 
Median  0.0013  0.0051  0.0014  0.0040  0.9515  0.7882  0.9546  0.7343  0.9482  0.4507  0.9483  0.4504  0.2147  1.0054  0.2139  1.0225  
Std  0.0003  0.0035  0.0002  0.0014  0.0038  0.0381  0.0050  0.0267  0.0027  0.0114  0.0027  0.0053  0.0021  0.0173  0.0024  0.0068  
Min  0.0010  0.0033  0.0011  0.0029  0.9465  0.7014  0.9425  0.6977  0.9412  0.4434  0.9411  0.4388  0.2112  0.9573  0.2111  1.0037  
Max  0.0021  0.0190  0.0018  0.0089  0.9617  0.8544  0.9614  0.8029  0.9509  0.4952  0.9510  0.4585  0.2195  1.0287  0.2197  1.0341 
Mean, standard deviation (Std), median, minimum (Min) and maximum (Max) values of the performance metrics for the compared algorithms for the MeansemivarianceCVaR portfolio optimization problem
Dataset  Statistic  S  Δ ^{∗}  IGD  HV  

NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a  
DowJones  Mean  0.0113  0.1829  0.0122  0.0047  0.7356  0.9073  0.7304  0.5973  0.4643  0.5827  0.4685  0.4243  0.9383  0.9160  0.9349  1.1504 
Median  0.0096  0.0040  0.0111  0.0046  0.7322  0.6091  0.7278  0.5938  0.4655  0.4232  0.4703  0.4243  0.9378  1.1538  0.9353  1.1507  
Std  0.0056  0.5373  0.0051  0.0003  0.0235  0.6294  0.0219  0.0129  0.0106  0.2935  0.0097  0.0013  0.0235  0.4254  0.0213  0.0011  
Min  0.0055  0.0000  0.0068  0.0042  0.7058  0.5683  0.6987  0.5840  0.4371  0.4216  0.4480  0.4225  0.8930  0.0259  0.8970  1.1487  
Max  0.0315  1.7121  0.0269  0.0057  0.8140  2.3622  0.7761  0.6263  0.4836  1.3523  0.4842  0.4274  0.9936  1.1556  0.9650  1.1516  
FF49Industries  Mean  0.0149  0.0048  0.0130  0.0051  0.6899  0.6116  0.6875  0.6264  0.7604  0.5661  0.7592  0.5663  0.4675  0.9934  0.4676  0.9923 
Median  0.0131  0.0048  0.0122  0.0052  0.6973  0.6095  0.6877  0.6293  0.7613  0.5662  0.7615  0.5663  0.4634  0.9935  0.4646  0.9921  
Std  0.0056  0.0004  0.0025  0.0003  0.0188  0.0106  0.0161  0.0110  0.0203  0.0012  0.0213  0.0009  0.0222  0.0006  0.0191  0.0007  
Min  0.0090  0.0040  0.0106  0.0046  0.6452  0.5942  0.6577  0.6071  0.7231  0.5633  0.7126  0.5647  0.4371  0.9924  0.4394  0.9910  
Max  0.0284  0.0056  0.0209  0.0060  0.7161  0.6322  0.7275  0.6430  0.7878  0.5681  0.7861  0.5682  0.5135  0.9944  0.5212  0.9937  
NASDAQ100  Mean  0.0060  0.0065  0.0060  0.0087  0.7885  0.5807  0.7862  0.6090  0.7007  0.3603  0.7013  0.3600  0.5367  1.2166  0.5348  1.2143 
Median  0.0058  0.0051  0.0060  0.0073  0.7846  0.5844  0.7847  0.5979  0.7071  0.3603  0.7065  0.3598  0.5331  1.2167  0.5318  1.2154  
Std  0.0012  0.0045  0.0010  0.0033  0.0131  0.0380  0.0097  0.0326  0.0174  0.0013  0.0187  0.0009  0.0180  0.0009  0.0198  0.0029  
Min  0.0044  0.0041  0.0048  0.0054  0.7720  0.5141  0.7711  0.5572  0.6494  0.3583  0.6499  0.3584  0.5155  1.2148  0.4940  1.2061  
Max  0.0090  0.0249  0.0081  0.0157  0.8123  0.6846  0.8026  0.6820  0.7173  0.3631  0.7353  0.3621  0.5875  1.2177  0.5867  1.2179  
SP500  Mean  0.0025  0.0117  0.0024  0.0131  0.9130  0.6531  0.9114  0.6480  1.0043  0.4065  1.0042  0.4057  0.1793  1.1797  0.1795  1.1822 
Median  0.0024  0.0103  0.0023  0.0114  0.9127  0.6511  0.9104  0.6431  1.0049  0.4059  1.0049  0.4053  0.1784  1.1819  0.1786  1.1841  
Std  0.0004  0.0053  0.0003  0.0070  0.0054  0.0345  0.0050  0.0311  0.0049  0.0038  0.0050  0.0021  0.0041  0.0112  0.0042  0.0072  
Min  0.0018  0.0032  0.0018  0.0049  0.8997  0.5986  0.9060  0.5846  0.9939  0.4017  0.9937  0.4019  0.1718  1.1413  0.1722  1.1611  
Max  0.0033  0.0204  0.0031  0.0325  0.9220  0.7283  0.9257  0.7103  1.0104  0.4183  1.0111  0.4108  0.1880  1.1929  0.1881  1.1913  
NASDAQComp  Mean  0.0027  0.0106  0.0026  0.0123  0.8967  0.6113  0.8968  0.5986  1.0450  0.5133  1.0450  0.5132  0.1612  1.0369  0.1612  1.0417 
Median  0.0027  0.0099  0.0025  0.0100  0.8957  0.6068  0.8967  0.5950  1.0448  0.5119  1.0449  0.5144  0.1615  1.0396  0.1614  1.0425  
Std  0.0004  0.0045  0.0003  0.0051  0.0068  0.0289  0.0068  0.0247  0.0021  0.0083  0.0021  0.0065  0.0017  0.0119  0.0017  0.0080  
Min  0.0021  0.0054  0.0022  0.0063  0.8869  0.5638  0.8850  0.5521  1.0421  0.5025  1.0420  0.5019  0.1581  1.0050  0.1579  1.0245  
Max  0.0038  0.0248  0.0030  0.0233  0.9106  0.6863  0.9140  0.6397  1.0499  0.5377  1.0499  0.5268  0.1644  1.0576  0.1646  1.0537 
Statistics and related pvalues for the Friedman aligned omnibus test for each of the four performance metrics
Performance metric  Statistic  pvalue 

S  0.443  0.9312 
Δ ^{∗}  29.59  1.681e−06 
IGD  30.42  1.124e−06 
HV  33.5  2.528e−07 
Adjusted pvalues for the Friedman aligned posthoc test with Bergmann and Hommel’s correction for multiple comparisons among the four algorithms
Hypothesis  Adj. pvalues  

Δ ^{∗}  IGD  HV  
NSGAIIb versus NSGAIIa  6.276e−05  6.889e−05  9.857e−06 
NSGAIIb versus SPEA 2b  1  1  1 
NSGAIIb versus SPEA 2a  3.098e−05  1.692e−05  8.199e−06 
NSGAIIa versus SPEA 2a  1  1  1 
SPEA 2b versus SPEA 2a  3.098e−05  1.692e−05  8.199e−06 
Contrast estimation results for the performance metric
Performance metric  Estimation  NSGAIIb  NSGAIIa  SPEA 2b  SPEA 2a 

Δ ^{∗}  NSGAIIb  0  0.206  0.0037  0.2158 
NSGAIIa  0.206  0  0.2023  0.0098  
SPEA 2b  0.0037  0.2023  0  0.2121  
SPEA 2a  0.2158  0.0098  0.2121  0  
IGD  NSGAIIb  0  0.4151  0.0001  0.4157 
NSGAIIa  0.4151  0  0.4161  0.0006  
SPEA 2b  0.0001  0.4161  0  0.4167  
SPEA 2a  0.4157  0.0006  0.4167  0  
HV  NSGAIIb  0  0.7012  0.0054  0.7032 
NSGAIIa  0.7012  0  0.7066  0.002  
SPEA 2b  0.0054  0.7066  0  0.7086  
SPEA 2a  0.7032  0.002  0.7086  0 
In summary, balancing the results from the statistical analysis and those concerning the computational complexity, we suggest that the NSGAII algorithm with the proposed configuration should be used to solve the portfolio optimization problems involving downside risk measures. In fact, it produces approximated sets similar to the SPEA 2a algorithm but takes less time to do so.
Concluding remarks and future research
In this study, we described three instances of the portfolio selection problem designed to handle the downside risk of an investment properly. A flexible multiobjective rewardtorisk framework was presented in which expected returns, semivariance, and CVaR of a portfolio can be optimized simultaneously. These problems were tackled using two nondominated sorting algorithms, namely, NSGAII and SPEA 2, which have already showed competitive performance for the mean variance problem. In particular, we proposed a novel combination of operators for the selection and reproduction phases to be included in both algorithms. A comparative analysis was undertaken with respect to a second variant of the same algorithms, involving another configuration design. We used five publicly available datasets ranging from small to largesized portfolio optimization problems. The capabilities of the procedures were assessed in terms of four performance metrics. Finally, a set of statistical tests checked the robustness of these findings. Overall, the numerical experiments showed that the proposed algorithms outperformed the others with respect to all the criteria. Even if the algorithms with the novel variation configuration demanded the use of more computational time as the dimension of the problems increased, they nonetheless yielded reasonable results for the cases in which the other algorithms failed to capture the shape of the Pareto front properly.
These research findings can be put in practice to improve the risk management infrastructure of an investment company. The inclusion of several risk measures in the portfolio optimization process can increase the capabilities of the system to describe the risk, providing more attractive investment opportunities.
Future research work on the topic includes the analysis of outofsample effectiveness for this type of strategy, which is expected to be improved by the incorporation of other simulation techniques to estimate semivariance and CVaR. We are also interested in exploring mechanisms that adaptively exploit several selection schemes and reproduction operators to accelerate convergence and allow the search to be stopped automatically when a suitable level of quality for the approximated set has been attained.
Under these assumptions, the distribution function F_{R(x)} is such that no jumps and no flat parts occur, implying that the equation F_{R(x)}(z)=α has a unique solution for any α∈(0,1), say \(z^{*} = F_{R(\mathbf {x})}^{1}(\alpha)\), where \(F_{R(\mathbf {x})}^{1}\) denotes the inverse of F_{R(x)}.
The terms solution and individual will be used interchangeably, since individuals in the population represent solutions to the problem that is being optimized.
The original study uses a mixedinteger solution encoding and two sets of variation operators to solve the mean variance optimization problem with cardinality constraints. In our paper, we only focus on the procedures concerning the real part of their encoding for the reproduction process.
Declarations
Acknowledgements
The authors would like to thank Gang Kou and Yong Shi for their assistance. The comments and suggestions of fourreviewers are also gratefully acknowledged.
Other declarations
All the authors have approved the manuscript for submission and confirm that the content of the manuscript has notbeen published, or submitted for publication elsewhere.
Funding
No funding was received for this research.
Authors’ contributions
All authors have equally contributed to this work and approved the final manuscript.
Competing interests
The authors declare that they have no competing interests.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Authors’ Affiliations
References
 Acerbi, C, Tasche D (2002) On the coherence of expected shortfall. J Bank Financ 26(7):1487–1503.Google Scholar
 Anagnostopoulos, KP, Mamanis G (2011a) The mean–variance cardinality constrained portfolio optimization problem: An experimental evaluation of five multiobjective evolutionary algorithms. Expert Syst Appl 38(11):14208–14217.Google Scholar
 Anagnostopoulos, KP, Mamanis G (2011b) Multiobjective evolutionary algorithms for complex portfolio optimization problems. Comput Manag Sci 8(3):259–279.Google Scholar
 Arnone, S, Loraschi A, Tettamanzi A, et al (1993) A genetic approach to portfolio selection. Neural Netw World 3(6):597–604.Google Scholar
 Artzner, P, Delbaen F, Eber JM, Heath D (1999) Coherent measures of risk. Math Financ 9(3):203–228.Google Scholar
 BaixauliSoler, JS, AlfaroCid E, FernándezBlanco MO (2010) Several risk measures in portfolio selection: Is it worthwhile?. Span J Financ Acc/Rev Española de Financiación y Contabilidad 39(147):421–444.Google Scholar
 BaixauliSoler, JS, AlfaroCid E, FernandezBlanco MO (2012) A naïve approach to speed up portfolio optimization problem using a multiobjective genetic algorithm. Investigaciones Europeas de Dirección y Economía de la Empresa 18(2):126–131.Google Scholar
 Ballestero, E, PlaSantamaria D (2005) Grading the performance of market indicators with utility benchmarks selected from footsie: a 2000 case study. Appl Econ 37(18):2147–2160.Google Scholar
 Bergmann, B, Hommel G (1988) Improvements of general multiple test procedures for redundant systems of hypotheses In: Multiple Hypothesenprüfung/Multiple Hypotheses Testing, 100–115.. Springer, Berlin.Google Scholar
 Bruni, R, Cesarone F, Scozzari A, Tardella F (2016) Realworld datasets for portfolio selection and solutions of some stochastic dominance portfolio models. Data Brief 8:858–862.Google Scholar
 Calvo, B, Santafé RG (2016) scmamp: Statistical comparison of multiple algorithms in multiple problems. R J 8(1):248–256.Google Scholar
 Chang, TJ, Meade N, Beasley JE, Sharaiha YM (2000) Heuristics for cardinality constrained portfolio optimisation. Comput Oper Res 27(13):1271–1302.Google Scholar
 Chang, TJ, Yang SC, Chang KJ (2009) Portfolio optimization problems in different risk measures using genetic algorithm. Expert Syst Appl 36(7):10529–10537.Google Scholar
 Chao, X, Kou G, Peng Y (2019) Behavior monitoring methods for tradebased money laundering integrating macro and micro prudential regulation: A case from china. Technol Econ Dev Econ. https://doi.org/10.3846/tede.2019.9383.Google Scholar
 Chen, Y, Wang X (2015) A hybrid stock trading system using genetic network programming and mean conditional valueatrisk. Eur J Oper Res 240(3):861–871.Google Scholar
 Coello, CAC, Lamont GB, Van Veldhuizen DA (2007) Evolutionary Algorithms for Solving Multiobjective Problems. Springer, New York.Google Scholar
 Crama, Y, Schyns M (2003) Simulated annealing for complex portfolio selection problems. Eur J Oper Res 150(3):546–571.Google Scholar
 Cumova, D, Nawrocki D (2011) A symmetric lpm model for heuristic mean–semivariance analysis. J Econ Bus 63(3):217–236.Google Scholar
 Cura, T (2009) Particle swarm optimization approach to portfolio optimization. Nonlinear Anal Real World Appl 10(4):2396–2406.Google Scholar
 Deb, K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGAII. IEEE Trans Evol Comput 6(2):182–197.Google Scholar
 Derrac, J, García S, Molina D, Herrera F (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evol Comput 1(1):3–18.Google Scholar
 Dueck, G, Winker P (1992) New concepts and algorithms for portfolio choice. Appl Stoch Model Bus Ind 8(3):159–178.Google Scholar
 Eftekharian, SE, Shojafar M, Shamshirband S (2017) 2phase NSGA II: An optimized reward and risk measurements algorithm in portfolio optimization. Algorithms 10(4):130.Google Scholar
 Emmerich, MT, Deutz AH (2018) A tutorial on multiobjective optimization: fundamentals and evolutionary methods. Nat Comput 17(3):585–609.Google Scholar
 Ergu, D, Kou G, Shi Y, Shi Y (2014) Analytic network process in risk assessment and decision analysis. Comput Oper Res 42:58–74.Google Scholar
 Ertenlice, O, Kalayci CB (2018) A survey of swarm intelligence for portfolio optimization: Algorithms and applications. Swarm Evol Comput 39:36–52.Google Scholar
 Estrada, J (2008) Meansemivaraince optimization: A heuristic approach. J Appl Financ 18(1):57–72.Google Scholar
 Fishburn, PC (1977) Meanrisk analysis with risk associated with belowtarget returns. Am Econ Rev 67(2):116–126.Google Scholar
 Gaivoronski, AA, Pflug G (2005) Valueatrisk in portfolio optimization: properties and computational approach. J Risk 7(2):1–31.Google Scholar
 Garcia, S, Fernández A, Luengo J, Herrera F (2010) Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power. Inf Sci 180(10):2044–2064.Google Scholar
 Gen, M, Cheng R (2000) Genetic Algorithms and Engineering Optimization, vol 7. Wiley, New York.Google Scholar
 Gilli M, KE, Hysi H (2006) A datadriven optimization heuristic for downside risk minimization. J Risk 8(3):1–18.Google Scholar
 Guerard, JB (2009) Handbook of Portfolio Construction: Contemporary Applications of Markowitz Techniques. Springer, New York.Google Scholar
 Hinterding, R (1995) Gaussian mutation and selfadaption for numeric genetic algorithms In: Evolutionary Computation, 1995., IEEE International Conference On, vol. 1, 384–389.. IEEE Press, Piscataway.Google Scholar
 Hochreiter, R (2007) An evolutionary computation approach to scenariobased riskreturn portfolio optimization for general risk measures In: Workshops on Applications of Evolutionary Computation. LNCS, vol. 4448, 199–207.. Springer, Heidelberg.Google Scholar
 Hogan, WW, Warren JM (1972) Computation of the efficient boundary in the es portfolio selection model. J Financ Quant Anal 7(4):1881–1896.Google Scholar
 Huang, Y, Kou G (2014) A kernel entropy manifold learning approach for financial data analysis. Decis Support Syst 64:31–42.Google Scholar
 Huang, Y, Kou G, Peng Y (2017) Nonlinear manifold learning for early warnings in financial markets. Eur J Oper Res 258(2):692–702.Google Scholar
 Jiang, S, Ong YS, Zhang J, Feng L (2014) Consistencies and contradictions of performance metrics in multiobjective optimization. IEEE Trans Cybern 44(12):2391–2404.Google Scholar
 Kalayci, CB, Ertenlice O, Akbay MA (2019) A comprehensive review of deterministic models and applications for meanvariance portfolio optimization. Expert Syst Appl 125:345–368.Google Scholar
 Konno, H, Waki H, Yuuki A (2002) Portfolio optimization under lower partial risk measures. AsiaPac Financ Mark 9(2):127–140.Google Scholar
 Kou, G (2019) Machine learning methods combined with financial systemic risk. Technol Econ Dev Econ. https://doi.org/10.3846/tede.2019.8740.Google Scholar
 Krink, T, Paterlini S (2011) Multiobjective optimization using differential evolution for realworld portfolio optimization. Comput Manag Sci 8(12):157–179.Google Scholar
 Krokhmal, P, Palmquist J, Uryasev S (2002) Portfolio optimization with conditional valueatrisk objective and constraints. J Risk 4:43–68.Google Scholar
 Larsen, N, Mausser H, Uryasev S (2002) Algorithms for optimization of valueatrisk In: Financial Engineering, Ecommerce and Supply Chain, 19–46.. Springer, Boston.Google Scholar
 Li, H, Zhang Q (2009) Multiobjective optimization problems with complicated pareto sets, MOEA/D and NSGAII. IEEE Trans Evol Comput 13(2):284–302.Google Scholar
 Liagkouras, K, Metaxiotis K (2013) The constrained meansemivariance portfolio optimization problem with the support of a novel multiobjective evolutionary algorithm. J Softw Eng Appl 6(07):22–29.Google Scholar
 Liagkouras, K, Metaxiotis K (2015) Efficient portfolio construction with the use of multiobjective evolutionary algorithms: best practices and performance metrics. Int J Inf Technol Dec Making 14(03):535–564.Google Scholar
 Lim, C, Sherali HD, Uryasev S (2010) Portfolio optimization by minimizing conditional valueatrisk via nondifferentiable optimization. Comput Optim Appl 46(3):391–415.Google Scholar
 Liu, JL, Chou CW, Chen CM (2010) Optimising mobile base station placement using an enhanced multiobjective genetic algorithm. Int J Bus Intell Data Mining 5(1):19–42.Google Scholar
 Macedo, LL, Godinho P, Alves MJ (2017) Meansemivariance portfolio optimization with multiobjective evolutionary algorithms and technical analysis rules. Expert Syst Appl 79:33–43.Google Scholar
 Markowitz, H (1959) Portfolio Selection: Efficient Diversification of Investments. Wiley, New York.Google Scholar
 Markowitz, H, Todd P, Xu G, Yamane Y (1993) Computation of meansemivariance efficient sets by the critical line algorithm. Ann Oper Res 45(1):307–317.Google Scholar
 Meghwani, SS, Thakur M (2017) Multicriteria algorithms for portfolio optimization under practical constraints. Swarm Evol Comput 37:104–125.Google Scholar
 Meghwani, SS, Thakur M (2018) Multiobjective heuristic algorithms for practical portfolio optimization and rebalancing with transaction cost. Appl Soft Comput 67:865–894.Google Scholar
 Metaxiotis, K, Liagkouras K (2012) Multiobjective evolutionary algorithms for portfolio management: A comprehensive literature review. Expert Syst Appl 39(14):11685–11698.Google Scholar
 Meucci, A (2009) Risk and Asset Allocation. Springer, Berlin.Google Scholar
 MezuraMontes, E, Coello CAC (2011) Constrainthandling in natureinspired numerical optimization: past, present and future. Swarm Evol Comput 1(4):173–194.Google Scholar
 Mishra, SK, Panda G, Majhi R (2014) A comparative performance assessment of a set of multiobjective algorithms for constrained portfolio assets selection. Swarm Evol Comput 16:38–51.Google Scholar
 Mühlenbein, H, SchlierkampVoosen D (1993) Predictive models for the breeder genetic algorithm i continuous parameter optimization. Evol Comput 1(1):25–49.Google Scholar
 Nawrocki, DN (1991) Optimal algorithms and lower partial moment: ex post results. Appl Econ 23(3):465–470.Google Scholar
 Nawrocki, DN (1999) A brief history of downside risk measures. J Invest 8:9–25.Google Scholar
 Ponsich, A, Jaimes AL, Coello CAC (2013) A survey on multiobjective evolutionary algorithms for the solution of the portfolio optimization problem and other finance and economics applications. IEEE Trans Evol Comput 17(3):321–344.Google Scholar
 Qin, Q, Li L, Cheng S (2014) A novel hybrid algorithm for meancvar portfolio selection with realworld constraints In: Conference in Swarm Intelligence, 319–327.. Springer, Cham.Google Scholar
 Rockafellar, RT, Uryasev S (2002) Conditional valueatrisk for general loss distributions. J Bank Financ 26(7):1443–1471.Google Scholar
 Rockafellar, RT, Uryasev S, et al (2000) Optimization of conditional valueatrisk. J Risk 2:21–42.Google Scholar
 Sarykalin, S, Serraino G, Uryasev S (2008) Valueatrisk vs. conditional valueatrisk in risk management and optimization In: StateoftheArt DecisionMaking Tools in the InformationIntensive Age, Tutor Oper Res, 270–29.. INFORMS, Catonsville.Google Scholar
 Schott, JR (1995) Fault tolerant design using single and multicriteria genetic algorithm optimization. Master’s thesis, Department of Aeronautics and Astronautics, Massachussetts Institute of Technology, Cambridge, MA.Google Scholar
 Schwefel, HP (1987) Collective Phenomena in Evolutionary Systems In: Preprints of the 31st Annual Meeting of the International Society for General System Research, 1025–1033, Budapest.Google Scholar
 Senhaji, K, El Moutaouakil K, Ettaouil M (2016) Portfolio selection problem: New multicriteria approach for the meansemivariance model In: Logistics Operations Management (GOL), 2016 3rd International Conference On, 1–6.. IEEE Press, Piscataway.Google Scholar
 Sing, TF, Ong SE (2000) Asset allocation in a downside risk framework. J Real Estate Portfolio Manag 6(3):213–223.Google Scholar
 Steuer, RE, Qi Y, Hirschberger M (2005) Multiple objectives in portfolio selection. J Financ Dec Making 1(1):5–20.Google Scholar
 Subbu, R, Bonissone PP, Eklund N, Bollapragada S, Chalermkraivuth K (2005) Multiobjective financial portfolio design: A hybrid evolutionary approach In: Evolutionary Computation, 2005. The 2005 IEEE Congress On, vol. 2, 1722–1729.. IEEE Press, Piscataway.Google Scholar
 Whitley, D (1988) Genitor: A different genetic algorithm In: Proc. Rocky Mountain Conference on Artificial Intelligence, 118–130.. Colorado Institute for Artificial Intelligence, Denver, Colorado.Google Scholar
 WoodsideOriakhi, M, Lucas C, Beasley JE (2011) Heuristic algorithms for the cardinality constrained efficient frontier. Eur J Oper Res 213(3):538–550.Google Scholar
 Yamai, Y, Yoshiba T, et al (2002) Comparative analyses of expected shortfall and valueatrisk: their estimation error, decomposition, and optimization. Monet Econ Stud 20(1):87–121.Google Scholar
 Zhang, H, Kou G, Peng Y (2019) Soft consensus cost models for group decision making and economic interpretations. Eur J Oper Res 277(3):964–980.Google Scholar
 Zhang, Q, Li H (2007) MOEA/D: A multiobjective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput 11(6):712–731.Google Scholar
 Zhou, A, Jin Y, Zhang Q, Sendhoff B, Tsang E (2006) Combining modelbased and geneticsbased offspring generation for multiobjective optimization using a convergence criterion In: Evolutionary Computation, 2006. CEC 2006. IEEE Congress On, 892–899.. IEEE Press, Piscataway.Google Scholar
 Zitzler, E, Laumanns M, Thiele L (2001) SPEA 2: Improving the strength Pareto evolutionary algorithm. TIKreport 103. ETH Zurich, Zurich.Google Scholar
 Zitzler, E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength pareto approach. IEEE Trans Evol Comput 3(4):257–271.Google Scholar