8 Other risk measures¶
In the definitions (2.1)-(2.4) of the classical MVO problem the variance (or standard deviation) of portfolio return is used as the measure of risk, making computation easy and convenient. If we assume that the portfolio return is normally distributed, then the variance is in fact the optimal risk measure, because then MVO can take into account all information about return. Moreover, if
Empirical observations suggest however, that the distribution of linear return is often skewed, and even if it is elliptically symmetric, it can have fat tails and can exhibit tail dependence [1]. As the distribution moves away from normality, the performance of a variance based portfolio estimator can quickly degrade.
No perfect risk measure exists, though. Depending on the distribution of return, different measures can be more appropriate in different situations, capturing different characteristics of risk. Return of diversified equity portfolios are often approximately symmetric over periods of institutional interest. Options, swaps, hedge funds, and private equity have return distributions that are unlikely to be symmetric. Also less symmetric are distributions of fixed-income and real estate index returns, and diversified equity portfolios over long time horizons [MM08].
The measures presented here are expectations, meaning that optimizing them would lead to stochastic optimization problems in general. As a simplification, we consider only their sample average approximation by assuming that a set of linear return scenarios are given.
8.1 Deviation measures¶
This class of measures quantify the variability around the mean of the return distribution, similar to variance. If we associate investment risk with the uncertainty of return, then such measures could be ideal.
Let
Positivity:
with only if ,Translation invariance:
,Subadditivity:
,Positive homogeneity:
Note that positive homegeneity and subadditivity imply the convexity of
8.1.1 Robust statistics¶
A group of deviation measures are robust statistics. Maximum likelihood estimators are highly sensitive to deviations from the assumed distribution. Thus if the return is not normally distributed, robust portfolio estimators can be an alternative. These are less efficient than MLE for normal distribution, but their performance degrades less quickly under departures from normality. One suitable class of estimators to express portfolio risk are M-estimators [VD09]:
where
8.1.1.1 Variance¶
Choosing
By defining new variables
where
8.1.1.2 -shortfall¶
Choosing
Portfolio optimization using sample
By defining new variables
A drawback of this model is that the number of constraints depends on the sample size
For elliptically symmetric return distributions the
However when return is symmetric but not normally distributed, the sample portfolio estimators for
8.1.1.3 MAD¶
A special case of
After modeling the absolute value based on Sec. 13.1.1.3 (Absolute value) we arrive at the following LO:
where
For normally distributed returns, the MAD is proportional to the variance:
The
8.1.1.4 Risk measure from the Huber function¶
Another robust portfolio estimator can be obtained for the case of symmetric return distributions using the Huber function
yielding the risk measure
Modeling the absolute value based on Sec. 13.1.1.3 (Absolute value) we get
Note that the size of this problem depends on the number of samples
8.1.2 Downside deviation measures¶
In financial context many distributions are skewed. Investors might only be concerned with negative deviations (losses) relative to the expected portfolio return
A class of downside deviation measures are lower partial moments of order
where
We have the following special cases:
is the probability of loss relative to the target return . is an incomplete measure of risk, because it does not provide any indication of how severe the shortfall can be, should it occur. Therefore it is best used as a constraint while optimizing for a different risk measure. This way can still provide information about the risk tolerance of the investor. is the expected loss, also called target shortfall. is called target semi-variance.
While lower partial moments only consider outcomes below the target
The
If we define the new variables
where
Contrary to the variance,
However, LPMs are only useful for skewed distributions. If the return distribution is symmetric,
8.2 Tail risk measures¶
Tail risk measures try to capture the risk of extreme events. The measures described below are commonly defined for random variables treating loss as positive number, so to apply them on security returns
Let
Monotonicity: If
, then ,Translation invariance:
,Subadditivity:
,Positive homogeneity:
If
8.2.1 Value-at-Risk¶
Denote the
This is the amount of loss (a positive number) over the given time horizon that will not be exceeded with probability
8.2.2 Conditional Value-at-Risk¶
A modification of VaR that is a coherent risk measure is conditional value-at-risk (CVaR). CVaR is the average of the
which is a linear combination of VaR and the quantity called mean shortfall. The latter is also not coherent on its own.
If the distribution function
8.2.2.1 CVaR for discrete distribution¶
Suppose that the loss distribution is described by points
where
It can be seen that
8.2.2.2 Portfolio optimization with CVaR¶
If we substitute portfolio loss scenarios into formula (8.11), we can see that the quantile
Note that problem (8.12) is equivalent to
This convex function (8.13) is exactly the one found in [RU00], where it is also proven to be valid for continuous probability distributions as well.
Now we can substitute the portfolio return into
Because CVaR is represented as a convex function in formula (8.13), we can also formulate an LO to maximize expected return, while limiting risk in terms of CVaR:
The drawback of optimizing CVaR using problems (8.14) or (8.15) is that both the number of variables and the number of constraints depend on the number of scenarios
8.2.3 Entropic Value-at-Risk¶
A more general risk measure in this class is the entropic value-at-risk (EVaR). EVaR is also a coherent risk measure, with additional favorable monotonicity properties; see in [AJ12]. It is defined as the tightest upper bound on VaR obtained from the Chernoff inequality:
where
8.2.3.1 EVaR for discrete distribution¶
Based on the definition (8.16), the discrete version of the EVaR will be
We can make formula (8.17) convex by substituting a new variable
We can transform formula (8.18) into a conic optimization problem by substituting the first term of the objective with a new variable
8.2.3.2 Portfolio optimization with EVaR¶
Now we can substitute the portfolio return into
Because EVaR is represented as a convex function (8.18), we can also formulate a conic problem to maximize expected return, while limiting risk in terms of EVaR:
A disadvantage of the EVaR conic model is that it still depends on
Note that if we assume the return distribution to be a Gaussian mixture, we can find a different approach to computing EVaR. See in Sec. 8.3.2 (EVaR using Gaussian mixture return).
8.2.4 Relationship between risk measures¶
Suppose that
For example,
8.2.5 Practical considerations¶
Many risk measures in this chapter can be computed in practice through scenario based approximations. The relevant optimization problems can then be conveniently formulated as LO or QO problems. The advantage of these models is that they do not need a covariance matrix estimate. The simplification of the problem structure, however, comes at the cost of increasing the problem dimension by introducing a number of new variables and constraints proportional to the number of scenarios
If
If
where
8.3 Expected utility maximization¶
Apart from selecting different risk measures, we can also approach portfolio optimization through the use of utility functions. We specify a concave and increasing utility function that measures the investors preference for each specific outcome. Then the objective is to maximize the expected utility under the return distribution.
Expected utility maximization can take into account any type of return distribution, while MVO works only with the first two moments, therefore it is accurate only for normally distributed return. MVO is also equivalent with expected utility maximization if the utility function is quadratic, because it models the investors indifference about higher moments of return. The only advantage of MVO in this comparison is that it works without having to discretize the return distribution and work with scenarios.
8.3.1 Optimal portfolio using gaussian mixture return¶
In [LB22] an expected utility maximization approach is taken, assuming that the return distribution is a Gaussian mixture (GM). The benefits of a GM distribution are that it can approximate any continuous distribution, including skewed and fat-tailed ones. Also its components can be interpreted as return distributions given a specific market regime. Moreover, the expected utility maximization using this return model can be formulated as a convex optimization problem without needing return scenarios, making this approach as efficient and scalable as MVO.
We denote security returns having Gaussian mixture (GM) distribution with
where
Using definition (8.24), the distribution of portfolio return
where
To select the optimal portfolio we use the exponential utility function
Thus maximizing the function (8.26) is the same as minimizing the moment generating function of the portfolio return, or equivalently, we can minimize its logarithm, the cumulant generating function:
If
The function (8.28) is convex in
Assuming we have the GM distribution parameter estimates
where
8.3.2 EVaR using Gaussian mixture return¶
We introduced Entropic Value-at-Risk (EVaR) in Sec. 8.2.3 (Entropic Value-at-Risk). EVaR can also be expressed using the cumulant generating function
After substituting the return
We can find a connection between EVaR computation (8.31) and maximization of expected exponential utility (8.27). Suppose that the pair
By assuming a GM distribution for security return, we can optimize problem (8.31) without needing a scenario distribution. First, to formulate it as a convex optimization problem, define the new variable
We can observe that the first term in formula (8.32) is the perspective of
Assuming we have the GM distribution parameter estimates
where
The huge benefit of this EVaR formulation is that its size does not depend on the number of scenarios, because it is derived without using a scenario distribution. It depends only on the number of GM components
8.4 Example¶
This example shows how can we compute the CVaR efficient frontier using the dual form of CVaR in MOSEK Fusion.
8.4.1 Scenario generation¶
The input data is again obtained the same way as detailed in Sec. 3.4.2 (Data collection), but we do only the steps until Sec. 3.4.3.3 (Projection of invariants). This way we get the expected return estimate
# Number of scenarios
T = 99999
# Generate logarithmic return scenarios assuming normal distribution
R_log = np.random.default_rng().multivariate_normal(m_log, S_log, T)
Next, we convert the received logarithmic return scenarios to linear return scenarios using the inverse of formula (3.2).
# Convert logarithmic return scenarios to linear return scenarios
R = np.exp(scenarios_log) - 1
R = R.T
We transpose the resulting matrix just to remain consistent with the notation in this chapter, namely that each column of
# Scenario probabilities
p = np.ones(T) / T
8.4.2 The optimization problem¶
The optimization problem we solve here resembles problem (2.12), but we will change the risk measure from portfolio standard deviation to portfolio CVaR:
Applying the dual CVaR formula (8.13), we get:
We know that formula (8.13) equals CVaR only if it is minimal in
Now we model the maximum function, and arrive at the following LO model of the mean-CVaR efficient frontier:
8.4.3 The Fusion model¶
Here we show the Fusion model of problem (8.37).
def EfficientFrontier(N, T, m, R, p, alpha, deltas):
with Model("CVaRFrontier") as M:
# Variables
# x - fraction of holdings relative to the initial capital.
# It is constrained to take only positive values.
x = M.variable("x", N, Domain.greaterThan(0.0))
# Budget constraint
M.constraint('budget', Expr.sum(x) == 1)
# Auxiliary variables.
t = M.variable("t", 1, Domain.unbounded())
u = M.variable("u", T, Domain.unbounded())
# Constraint modeling maximum
M.constraint(u >= - R * x - Var.repeat(t, T))
M.constraint(u >= 0)
# Objective
delta = M.parameter()
cvar_term = t + u.T @ p / (1-alpha)
M.objective('obj', ObjectiveSense.Maximize,
x.T @ m - delta * cvar_term)
# Create DataFrame to store the results.
columns = ["delta", "obj", "return", "risk"] + \
df_prices.columns.tolist()
df_result = pd.DataFrame(columns=columns)
for d in deltas:
# Update parameter
delta.setValue(d)
# Solve optimization
M.solve()
# Save results
portfolio_return = m @ x.level()
portfolio_risk = t.level()[0] + \
1/(1-alpha) * np.dot(p, u.level())
row = pd.Series([d, M.primalObjValue(),
portfolio_return, portfolio_risk] + \
list(x.level()), index=columns)
df_result = df_result.append(row, ignore_index=True)
return df_result
Next, we compute the efficient frontier. We select the confidence level
alpha = 0.95
# Compute efficient frontier with and without shrinkage
deltas = np.logspace(start=-1, stop=2, num=20)[::-1]
df_result = EfficientFrontier(N, T, m, R, p, alpha, deltas)
On Fig. 8.1 we can see the risk-return plane, and on Fig. 8.2 the portfolio composition for different levels of risk.

Fig. 8.1 The CVaR efficient frontier.¶

Fig. 8.2 Portfolio composition
Footnotes