10 Robust optimization¶

In Sec. 4 (Dealing with estimation error) we discussed in detail that inaccurate or uncertain input parameters of a portfolio optimization problem can lead to solutions that are far from optimal for the true parameters. In other words, the solution is very sensitive to the inputs. Robust optimization is another modeling tool to overcome this sensitivity. It handles estimation errors within the optimization problem itself rather than in the input preparation phase. In the most common setup, the uncertain parameters are the estimated mean and estimated covariance matrix of the security returns. In robust optimization, we do not compute point estimates of these, but rather uncertainty sets that contain the true values with a certain confidence. A robust portfolio thus optimizes the worst-case performance over all parameter values within their corresponding uncertainty sets. [CJPT18, VD09]

10.1 Types of uncertainty¶

We can form different types of uncertainty sets around the unknown parameters, depending on the nature of the uncertainty, the sensitivity of the solution, and the available information.

Polytope: If we have a finite number of scenarios, we can form a polytope uncertainty set by taking the convex hull of the scenarios.
Interval: We can compute a confidence interval for each of the parameters.
Ellipsoidal region: For vector variables, we can compute confidence regions.

We can model all of these types of uncertainty sets in conic optimization, allowing us to solve robust optimization problems efficiently.
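As a small illustration of the polytope case, the following sketch checks numerically that the worst-case expected return over the convex hull of a few scenarios is attained at one of the scenarios, because the return is linear in $μ$ . It uses plain NumPy; the scenario data and the helper name `worst_case_return_polytope` are made up for this illustration and are not part of the cookbook code.

```python
import numpy as np

def worst_case_return_polytope(scenarios, x):
    """Worst-case mu^T x over the convex hull of the rows of `scenarios`.

    A linear function attains its minimum over a polytope at a vertex,
    so it is enough to evaluate the return in each scenario.
    """
    return float(np.min(scenarios @ x))

rng = np.random.default_rng(0)
scenarios = rng.normal(0.05, 0.02, size=(5, 3))  # 5 illustrative scenarios, 3 securities
x = np.array([0.5, 0.3, 0.2])                    # an arbitrary long-only portfolio

wc = worst_case_return_polytope(scenarios, x)

# Any point of the convex hull is a convex combination of the scenarios
# and cannot give a lower return than the worst scenario.
weights = rng.dirichlet(np.ones(5), size=1000)
hull_returns = (weights @ scenarios) @ x
```

In an optimization model the same idea would appear as one linear constraint per scenario.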

The size of the uncertainty sets reflects the desired level of robustness. For confidence intervals, it is controlled by the confidence level. There is of course a tradeoff: if the uncertainty sets are chosen to be very large, then the resulting portfolios will be very conservative, and for any given parameter set they will perform much worse than the portfolio designed for that set of parameters. On the other hand, if the sets are chosen too small, the resulting portfolio will not be robust enough.

10.2 Uncertainty in security returns¶

Consider problem (2.3), which we restate here:

(10.1)¶

\begin{array}{r} \begin{array}{lrcl} maximize & μ^{T} x - \frac{δ}{2} x^{T} Σ x \\ subject to & 1^{T} x & = & 1. \end{array} \end{array}

Assume that the vector of mean returns $μ$ belongs to the elliptical uncertainty set

(10.2)¶

U_{μ} = {μ ∣ (μ - μ_{0})^{T} Q^{- 1} (μ - μ_{0}) \leq γ^{2}},

where $Q$ is a known positive definite matrix (so that $Q^{- 1}$ exists) and $γ$ controls the size of the set. Then the worst-case expected portfolio return will be

min_{μ \in U_{μ}} μ^{T} x = μ_{0}^{T} x - γ \sqrt{x^{T} Q x} .

It follows that the robust version of (10.1) becomes

(10.3)¶

\begin{array}{r} \begin{array}{lrcl} maximize & μ_{0}^{T} x - γ \sqrt{x^{T} Q x} - \frac{δ}{2} x^{T} Σ x \\ subject to & 1^{T} x & = & 1. \end{array} \end{array}
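The closed form of the worst-case return used in (10.3) can be checked numerically. The sketch below is an illustration under assumed random data (the values of `mu0`, `Q`, `gamma` and `x` are arbitrary): it evaluates the closed form, constructs the minimizing $μ$ explicitly, and compares against randomly sampled points on the boundary of the set (10.2).

```python
import numpy as np

def worst_case_return_ellipsoid(mu0, Q, gamma, x):
    """Closed-form minimum of mu^T x over the ellipsoid (10.2)."""
    return float(mu0 @ x - gamma * np.sqrt(x @ Q @ x))

rng = np.random.default_rng(1)
N = 4
mu0 = rng.normal(0.05, 0.02, size=N)
L = rng.normal(size=(N, N))
Q = L @ L.T + 0.1 * np.eye(N)      # positive definite by construction
gamma = 0.5
x = np.full(N, 1.0 / N)

wc = worst_case_return_ellipsoid(mu0, Q, gamma, x)

# The minimizer is mu0 shifted against the direction Q x.
mu_star = mu0 - gamma * Q @ x / np.sqrt(x @ Q @ x)

# Random points on the boundary of the set: mu0 + gamma * C v with |v| = 1,
# where Q = C C^T is a Cholesky factorization.
C = np.linalg.cholesky(Q)
v = rng.normal(size=(1000, N))
v /= np.linalg.norm(v, axis=1, keepdims=True)
samples = mu0 + gamma * v @ C.T
```

None of the sampled returns `samples @ x` falls below `wc`, and `mu_star @ x` reproduces it, which is the Cauchy-Schwarz argument behind the formula.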

10.3 Uncertainty in the factor model¶

In the article [GI03], the authors consider a factor model on security returns, treat its parameters as uncertain using ellipsoidal uncertainty sets defined as confidence regions, and formulate robust portfolio optimization problems.

Assume that for a random security return vector $R_{t}$ and factor return vector $F_{t}$ at time $t$ , the factor model takes the form

(10.4)¶

R_{t} = μ + β F_{t} + θ_{t},

where $μ$ is the vector of mean security returns, $β$ is the factor loading matrix, and $θ_{t}$ is the vector of residual returns. We also assume an exact factor model (see Sec. 5 (Factor models)), meaning that the factor returns and residual returns are assumed to be independent, and the residual covariance matrix $D$ of $θ_{t}$ is diagonal. Moreover, $E (F_{t}) = 0$ , so the factors carry no information on the mean returns. A final assumption in this section is that the factor covariance matrix $Q \in R^{K \times K}$ is known exactly. This requirement will be relaxed later.

The portfolio mean return and portfolio variance will be $E (R_{x}) = μ^{T} x$ and $Var (R_{x}) = x^{T} (β Q β^{T} + D) x$ . If we compute estimates $μ$ , $β$ , and $D$ of the above quantities, we can reformulate problem (2.1):

(10.5)¶

\begin{array}{r} \begin{array}{lrcl} minimize & t_{1} + t_{2} \\ subject to & μ^{T} x & \geq & r_{\min}, \\ & x^{T} D x & \leq & t_{2}, \\ & x^{T} β Q β^{T} x & \leq & t_{1}, \\ & x & \in & F . \end{array} \end{array}

10.3.1 Uncertainty sets¶

Instead of solving (10.5) using point estimates, we assume that the parameters $μ$ , $β$ , and $D$ lie in the following uncertainty sets:

The diagonal elements $σ_{θ}^{2}$ of the matrix $D$ can take values in an uncertainty interval $[{\underset{―}{σ}}_{θ}^{2}, {\overset{―}{σ}}_{θ}^{2}]$ :

(10.6)¶ $U_{D} = {D ∣ diag (D) \in [{\underset{―}{σ}}_{θ}^{2}, {\overset{―}{σ}}_{θ}^{2}]}$
The factor loadings matrix $β$ belongs to the elliptical uncertainty set

(10.7)¶ $U_{β} = {β ∣ β = β_{0} + B, ‖ B_{i} ‖_{G} \leq ρ_{i}, i = 1, \dots, N}$

where $B_{i}$ is row $i$ of the matrix $B$ , and $‖ b ‖_{G} = \sqrt{b^{T} G b}$ denotes the elliptic norm of $b$ with respect to the positive definite matrix $G \in R^{K \times K}$ .
The vector $μ$ of mean returns is assumed to lie in the uncertainty interval

(10.8)¶ $U_{μ} = {μ ∣ μ = μ_{0} + m, | m | \leq γ}$

10.3.2 Robust problem formulation¶

The goal of robust portfolio selection is then to select portfolios that perform well for all parameter values in these uncertainty sets. In other words, we are looking for worst-case optimal solutions, formulated as minmax optimization problems. We can then state the robust mean-variance optimization problem:

(10.9)¶

\begin{array}{r} \begin{array}{lrcll} minimize & t_{1} + t_{2} \\ subject to & min_{μ \in U_{μ}} μ^{T} x & \geq & r_{\min}, \\ & max_{D \in U_{D}} x^{T} D x & \leq & t_{2}, \\ & max_{β \in U_{β}} x^{T} β Q β^{T} x & \leq & t_{1}, \\ & x & \in & F . \end{array} \end{array}

To convert the minmax problem (10.9) into a conic optimization problem, we first need to represent the worst case expected portfolio return and portfolio variance using conic constraints:

We can evaluate the worst case expected portfolio return:

$min_{μ \in U_{μ}} μ^{T} x = μ_{0}^{T} x - γ^{T} | x | .$
We can evaluate the worst case residual portfolio variance:

$max_{D \in U_{D}} x^{T} D x = x^{T} Diag ({\overset{―}{σ}}_{θ}^{2}) x .$
We cannot easily evaluate the worst case factor portfolio variance. But we can show that the constraint

$max_{β \in U_{β}} x^{T} β Q β^{T} x \leq t_{1}$

is equivalent to

(10.10)¶ $(ρ^{T} | x |, t_{1}, x) \in H (β_{0}, Q, G) .$

Relation (10.10) is a shorthand notation for the following: Define $H = G^{- 1 / 2} Q G^{- 1 / 2}$ , with spectral decomposition $H = V Λ V^{T}$ , and define $w = V^{T} H^{1 / 2} G^{1 / 2} β_{0}^{T} x$ . Then there exist $τ, s, u \geq 0$ that satisfy the set of conic constraints

$\begin{array}{r} \begin{array}{rcl} τ + 1^{T} u & \leq & t_{1}, \\ s & \leq & 1 / λ_{\max} (H), \\ (ρ^{T} | x |)^{2} & \leq & s τ, \\ w_{i}^{2} & \leq & (1 - s λ_{i}) u_{i}, i = 1, \dots, K \end{array} \end{array}$

There is also a different but equivalent version of this statement, see [GI03].
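The first two worst-case expressions in the list above have simple closed forms, which the following sketch verifies by brute force on random data. All numbers are illustrative assumptions; `gamma`, `sigma2_lo` and `sigma2_hi` stand for the interval bounds in (10.8) and (10.6).

```python
import numpy as np

rng = np.random.default_rng(2)
N = 5
mu0 = rng.normal(0.05, 0.02, size=N)
gamma = rng.uniform(0.0, 0.01, size=N)        # interval half-widths for mu
sigma2_lo = rng.uniform(0.01, 0.02, size=N)   # lower bounds on residual variances
sigma2_hi = sigma2_lo + 0.01                  # upper bounds on residual variances
x = rng.normal(size=N)                        # mixed signs, to exercise |x|

# Closed-form worst cases.
wc_return = mu0 @ x - gamma @ np.abs(x)
wc_resid_var = x @ (sigma2_hi * x)

# The worst-case mean shifts each component against the sign of x_i.
mu_worst = mu0 - gamma * np.sign(x)

# Brute-force check over random points of both uncertainty sets.
m = rng.uniform(-1.0, 1.0, size=(1000, N)) * gamma
d = rng.uniform(sigma2_lo, sigma2_hi, size=(1000, N))
sampled_returns = (mu0 + m) @ x
sampled_vars = (d * x**2).sum(axis=1)
```

The third item has no such closed form, which is why it is handled through the conic representation (10.10).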

10.3.3 Robust conic model¶

Now we can formulate the robust optimization problem (10.9):

(10.11)¶

\begin{array}{r} \begin{array}{lrcll} minimize & t_{1} + t_{2} \\ subject to & μ_{0}^{T} x - γ^{T} | x | & \geq & r_{\min}, \\ & x^{T} Diag ({\overset{―}{σ}}_{θ}^{2}) x & \leq & t_{2}, \\ & (ρ^{T} | x |, t_{1}, x) & \in & H (β_{0}, Q, G), \\ & x & \in & F . \end{array} \end{array}

Finally, we can convert (10.11) into conic form by modeling the absolute value as in Sec. 13.1.1.3 (Absolute value) and the quadratic form as in Sec. 13.1.1.10 (Quadratic form):

(10.12)¶

\begin{array}{r} \begin{array}{lrcll} minimize & t_{1} + t_{2} \\ subject to & μ_{0}^{T} x - γ^{T} z & \geq & r_{\min}, \\ & (t_{2}, \frac{1}{2}, \sqrt{{\overset{―}{σ}}_{θ}^{2}} \circ x) & \in & Q_{r}^{N + 2}, \\ & (ρ^{T} z, t_{1}, x) & \in & H (β_{0}, Q, G), \\ & x & \leq & z, \\ & x & \geq & - z, \\ & x & \in & F, \end{array} \end{array}

and the constraint $(ρ^{T} z, t_{1}, x) \in H (β_{0}, Q, G)$ can be modeled using the hyperbolic constraint in Sec. 13.1.1.8 (Hyperbolic constraint):

(10.13)¶

\begin{array}{r} \begin{array}{rcl} τ + 1^{T} u & \leq & t_{1}, \\ s & \leq & 1 / λ_{\max} (H), \\ (s, \frac{1}{2} τ, ρ^{T} z) & \in & Q_{r}^{3}, \\ (1 - s λ_{i}, \frac{1}{2} u_{i}, w_{i}) & \in & Q_{r}^{3}, i = 1, \dots, K \end{array} \end{array}

where $τ, s, u \geq 0$ are new variables, $w = V^{T} H^{1 / 2} G^{1 / 2} β_{0}^{T} x$ , and $λ_{i}$ are eigenvalues of $H = G^{- 1 / 2} Q G^{- 1 / 2} = V Λ V^{T}$ .
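The quantities $H$ , $λ_{i}$ and $w$ are constant data transformations that can be prepared before building the model. The following NumPy sketch shows one possible way to do this; the helper names `sym_power` and `factor_constraint_data` and the random inputs are assumptions made for illustration. It returns a matrix `W` such that $w = W x$ , so $w$ is a linear expression of $x$ in the conic model.

```python
import numpy as np

def sym_power(M, p):
    """M^p for a symmetric positive definite matrix via eigendecomposition."""
    lam, U = np.linalg.eigh(M)
    return (U * lam**p) @ U.T

def factor_constraint_data(beta0, Q, G):
    """Return (lam, W): eigenvalues of H and the matrix W with w = W @ x."""
    G_half = sym_power(G, 0.5)
    G_inv_half = sym_power(G, -0.5)
    H = G_inv_half @ Q @ G_inv_half
    H = 0.5 * (H + H.T)                 # symmetrize against round-off
    lam, V = np.linalg.eigh(H)          # H = V diag(lam) V^T
    W = V.T @ sym_power(H, 0.5) @ G_half @ beta0.T
    return lam, W

rng = np.random.default_rng(3)
N, K = 6, 2
beta0 = rng.normal(size=(N, K))         # nominal factor loadings
A = rng.normal(size=(K, K))
Q = A @ A.T + 0.1 * np.eye(K)           # factor covariance, positive definite
B = rng.normal(size=(K, K))
G = B @ B.T + 0.1 * np.eye(K)           # matrix of the elliptic norm
x = np.full(N, 1.0 / N)

lam, W = factor_constraint_data(beta0, Q, G)
w = W @ x
s_max = 1.0 / lam.max()                 # upper bound on s in (10.13)
```

A useful sanity check is that $w^{T} w = x^{T} β_{0} Q β_{0}^{T} x$ , the nominal factor variance, which is what (10.13) reduces to when $ρ = 0$ and $s = 0$ .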

10.3.4 Case of unknown factor covariance¶

In this section we cover the case when the factor covariance matrix $Q$ is also uncertain and is only available as an estimate. In this case, the robust portfolio optimization problem can still be converted into conic form and solved efficiently.

We can give an uncertainty structure to either the factor covariance matrix $Q$ or its inverse $Q^{- 1}$ . Both choices lead to the same worst-case portfolio variance constraint. The choice of $Q^{- 1}$ is a bit more restrictive, but allows us to accommodate prior information about the structure of $Q$ . See the details in [GI03]. Here we describe the case of $Q$ .

The matrix $Q$ has the uncertainty structure

(10.14)¶

U_{Q} = {Q ∣ Q = Q_{0} + Δ \succeq 0, Δ = Δ^{T}, ‖ N^{- 1 / 2} Δ N^{- 1 / 2} ‖ \leq ζ},

where $Q_{0} \succeq 0$ , and the norm is the spectral norm or the Frobenius norm.

Then the worst case factor portfolio variance constraint

max_{β \in U_{β}, Q \in U_{Q}} x^{T} β Q β^{T} x \leq t_{1}

is equivalent to

(10.15)¶

(ρ^{T} | x |, t_{1}, x) \in H (β_{0}, Q_{0} + ζ N, G) .

10.4 Parameters¶

In this section we discuss how the parameters of the uncertainty sets introduced in Sec. 10.3.1 (Uncertainty sets) can be computed from market data. Typically the parameters $μ$ , $β$ , $D$ are estimated from the security return and factor return data, using multivariate linear regression. We can also compute multidimensional confidence regions with any desired confidence level around the least-squares estimates. These confidence regions become the uncertainty sets in the robust portfolio optimization problem. The regression procedure also yields natural choices for the matrix $G$ defining the elliptic norm and the bounds $ρ$ , $γ$ , ${\overset{―}{σ}}_{θ}^{2}$ . In [GI03], two methods are discussed for constructing the uncertainty sets from data; here we detail only one of them.

In (10.4) the factor model is written for all securities at one time instant $t$ . Now we write the model for one security $i$ at all time instants:

R_{i} = μ_{i} + β_{i} F + θ_{i}, i = 1, \dots, N

where $β_{i}$ is row number $i$ of $β$ . We can write this in a shorter form as

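R_{i} = A y_{i} + θ_{i}, i = 1, \dots, N

where $y_{i} = [μ_{i}, β_{i}]^{T} \in R^{(K + 1) \times 1}$ , and $A = [1, F^{T}] \in R^{T \times (K + 1)}$ . Here $F \in R^{K \times T}$ collects the factor returns, $T$ is the number of time instants, and $R_{i}$ and $θ_{i}$ are understood as column vectors of length $T$ .

Assuming that the matrix $A$ has rank $K + 1$ , and we have market return series $r_{i}$ , the theory of ordinary least squares leads to the estimate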

{\bar{y}}_{i} = [{\bar{μ}}_{i}, {\bar{β}}_{i}]^{T} = (A^{T} A)^{- 1} A^{T} r_{i}

The elliptical confidence region for $y_{i}$ at confidence level $ω$ is then

U_{y_{i}} (ω) = {y_{i} ∣ ({\bar{y}}_{i} - y_{i})^{T} (A^{T} A) ({\bar{y}}_{i} - y_{i}) \leq (K + 1) (s_{θ}^{2})_{i} c_{K + 1} (ω)},

where $(s_{θ}^{2})_{i} = ‖ r_{i} - A {\bar{y}}_{i} ‖^{2} / (T - K - 1)$ is the estimate of the error variance $(σ_{θ}^{2})_{i}$ , $c_{K + 1} (ω)$ is the $ω$ critical value, that is, the solution of $F_{F} (c_{K + 1}) = ω$ , and $F_{F}$ is the CDF of the F-distribution with degrees of freedom $(K + 1, T - K - 1)$ .

Then the full $ω^{N}$ confidence set for $y$ will be $U_{y} (ω) = U_{y_{1}} (ω) \times \dots \times U_{y_{N}} (ω)$ .
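The regression step for a single security can be sketched as follows. This is a minimal NumPy illustration on synthetic data, not the cookbook's own data preparation code; the function name `ols_factor_fit` and all numerical values are assumptions. The factor returns are stored as a $K \times T$ array, matching $A = [1, F^{T}]$ .

```python
import numpy as np

def ols_factor_fit(F, r):
    """Least-squares fit of r_t = mu + beta F_t + theta_t for one security.

    F : (K, T) array of factor returns, r : (T,) array of security returns.
    Returns (mu_bar, beta_bar, s2), where s2 estimates the error variance.
    """
    K, T = F.shape
    A = np.column_stack([np.ones(T), F.T])        # A = [1, F^T], shape (T, K+1)
    y_bar, *_ = np.linalg.lstsq(A, r, rcond=None)
    resid = r - A @ y_bar
    s2 = resid @ resid / (T - K - 1)
    return y_bar[0], y_bar[1:], s2

# Synthetic data with known parameters (illustrative values only).
rng = np.random.default_rng(4)
K, T = 2, 500
F = rng.normal(size=(K, T))
mu_true, beta_true, sigma_true = 0.01, np.array([0.8, -0.3]), 0.05
r = mu_true + beta_true @ F + sigma_true * rng.normal(size=T)

mu_bar, beta_bar, s2 = ols_factor_fit(F, r)
```

The critical value $c_{K + 1} (ω)$ is not computed here; it could be obtained from any statistics library that provides the inverse CDF of the F-distribution.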

10.4.1 The parameters of $U_{μ}$ ¶

If we project $U_{y} (ω)$ along the vector $μ$ , we get the $ω^{N}$ confidence set (10.8), where

\begin{array}{r} \begin{array}{lrcl} (μ_{0})_{i} & = & {\bar{μ}}_{i}, \\ γ_{i} & = & \sqrt{(K + 1) (A^{T} A)_{1, 1}^{- 1} (s_{θ}^{2})_{i} c_{K + 1} (ω)} \end{array} \end{array}

10.4.2 The parameters of $U_{β}$ ¶

Let $P = [0, I] \in R^{K \times (K + 1)}$ be the matrix projecting $y_{i}$ along $β_{i}$ . If we project $U_{y} (ω)$ along $β$ , then we get the $ω^{N}$ confidence set (10.7), where

\begin{array}{r} \begin{array}{lrcl} (β_{0})_{i} & = & {\bar{β}}_{i}, \\ G & = & (P (A^{T} A)^{- 1} P^{T})^{- 1} \\ & = & F F^{T} - \frac{1}{T} (F 1) (F 1)^{T}, \\ ρ_{i} & = & \sqrt{(K + 1) (s_{θ}^{2})_{i} c_{K + 1} (ω)} \end{array} \end{array}
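The formulas for $γ$ , $G$ and $ρ$ translate directly into a few lines of NumPy. The sketch below is only an illustration: the function name `uncertainty_parameters` is hypothetical, the residual variances are made-up numbers, and the critical value `c` is a placeholder rather than an actual quantile of the F-distribution. It also cross-checks the two expressions given for $G$ .

```python
import numpy as np

def uncertainty_parameters(F, s2, c):
    """Parameters gamma, G, rho of the sets (10.8) and (10.7).

    F  : (K, T) factor returns,
    s2 : (N,) residual variance estimates (s_theta^2)_i,
    c  : critical value c_{K+1}(omega), assumed to be computed elsewhere.
    """
    K, T = F.shape
    A = np.column_stack([np.ones(T), F.T])      # A = [1, F^T]
    AtA_inv = np.linalg.inv(A.T @ A)
    F1 = F.sum(axis=1)                          # the vector F 1
    G = F @ F.T - np.outer(F1, F1) / T
    gamma = np.sqrt((K + 1) * AtA_inv[0, 0] * s2 * c)
    rho = np.sqrt((K + 1) * s2 * c)
    return gamma, G, rho

rng = np.random.default_rng(5)
K, T = 3, 200
F = rng.normal(size=(K, T))
s2 = np.array([0.002, 0.003])   # illustrative residual variances for 2 securities
c = 2.0                         # placeholder, not an actual F critical value

gamma, G, rho = uncertainty_parameters(F, s2, c)

# Cross-check G against its definition (P (A^T A)^{-1} P^T)^{-1}.
A = np.column_stack([np.ones(T), F.T])
P = np.hstack([np.zeros((K, 1)), np.eye(K)])
G_def = np.linalg.inv(P @ np.linalg.inv(A.T @ A) @ P.T)
```

The agreement of `G` and `G_def` reflects that $G$ is the Schur complement of the top-left entry of $A^{T} A$ , so the closed form avoids two matrix inversions.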

10.4.3 The parameters of $U_{D}$ ¶

It would be natural to choose a confidence interval around $(s_{θ}^{2})_{i}$ , the estimate of the error variance $(σ_{θ}^{2})_{i}$ , but the regression yields only a single value. It would be possible to construct an upper bound by bootstrapping, but this can be computationally expensive.

Since we only require an estimate of the worst case error variance $({\overset{―}{σ}}_{θ}^{2})_{i}$ for the robust optimization problem, it is cheaper to use any reasonable estimate for this purpose.

10.4.4 The parameters of $U_{Q}$ ¶

In case the factor covariance matrix is not known, we can also construct its uncertainty region from data. According to [GI03], the $ω^{K}$ confidence set (10.14) can be parameterized in the following way:

\begin{array}{r} \begin{array}{lrcl} Q_{0} & = & Q_{ML}, \\ N & = & Q_{ML}, \\ ζ & = & η / (1 - η), \end{array} \end{array}

where $Q_{ML} = G / (T - 1)$ is the maximum likelihood estimate (MLE) of $Q$ computed from factor return data, and $η$ is the unique solution of

(10.16)¶

F_{Γ} (1 + η) - F_{Γ} (1 - η) = ω,

where $F_{Γ}$ is the CDF of a $Γ (\frac{T + 1}{2}, \frac{T - 1}{2})$ random variable [1], and $ω$ is the desired confidence level. [2] Note that equation (10.16) restricts $ω^{K}$ to be at most $F_{Γ} (2)^{K}$ , which depends on the number of data samples $T$ . However, this limitation is not very restrictive in practice. [3]

10.5 Example¶

Here we show a code example of the robust optimization problem (10.3), which we restate here:

(10.17)¶

\begin{array}{r} \begin{array}{lrcl} maximize & μ_{0}^{T} x - γ \sqrt{x^{T} Q x} - \frac{δ}{2} x^{T} Σ x \\ subject to & 1^{T} x & = & 1. \end{array} \end{array}

We start at the point where data is already prepared, and we show the optimization model. This example considers an elliptical uncertainty region around the expected return vector. If we compute the worst case portfolio return in this case, there will be two terms with quadratic expressions. The first will be $γ \sqrt{x^{T} Q x}$ , where $γ$ controls the size of the uncertainty region. If $γ = 0$ , then we get back the original, non-robust MVO problem. The second quadratic expression $\frac{δ}{2} x^{T} Σ x$ models the portfolio risk, and $δ$ is the risk aversion coefficient.

We can model both terms using second-order cones. For the square-root term, the quadratic cone is more appropriate, while the portfolio variance term can be modeled using the rotated quadratic cone. We substitute the square-root term with the new variable $s_{q} = \sqrt{x^{T} Q x}$ ; then the objective of the problem will be

# Objective
delta = M.parameter()
wc_return = x.T @ mu0 - gamma * sq
M.objective('obj', ObjectiveSense.Maximize, wc_return - delta * s)

Assuming that $Q = G_{Q} G_{Q}^{T}$ , the square root term can be modeled as

# Robustness
M.constraint('robustness', Expr.vstack(sq, GQ.T @ x), Domain.inQCone())

Similarly, we substitute the risk term with $s = \frac{1}{2} x^{T} Σ x$ , and assuming $Σ = G G^{T}$ , we model the risk as

# Risk constraint
M.constraint('risk', Expr.vstack(s, 1, G.T @ x),
                     Domain.inRotatedQCone())
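The factors $G$ and $G_{Q}$ are assumed to be available in the snippets above. One common way to obtain such a factor is the Cholesky factorization, sketched below with NumPy on an arbitrary positive definite matrix (the data is made up for illustration). The sketch also checks that the cone member $G^{T} x$ carries the quadratic form, and evaluates the smallest value of $s$ that the rotated cone constraint allows.

```python
import numpy as np

rng = np.random.default_rng(6)
N = 4
R = rng.normal(size=(N, N))
Sigma = R @ R.T + 0.1 * np.eye(N)   # illustrative covariance, positive definite
G = np.linalg.cholesky(Sigma)       # lower triangular factor, Sigma = G G^T
x = np.full(N, 1.0 / N)

# The squared norm of G^T x equals the quadratic form x^T Sigma x.
quad_form = x @ Sigma @ x
cone_norm_sq = np.sum((G.T @ x) ** 2)

# Smallest s with (s, 1, G^T x) in the rotated cone: 2 * s * 1 >= |G^T x|^2.
s_min = 0.5 * cone_norm_sq
```

The same factorization applied to $Q$ would give $G_{Q}$ for the robustness constraint. Any other factor with $Σ = G G^{T}$ works equally well, for example one coming from a factor model.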

The full model would look like the following:

with Model("Robust") as M:
    # Variables
    # The variable x is the fraction of holdings in each security.
    # It is restricted to be positive, which imposes no short-selling.
    x = M.variable("x", N, Domain.greaterThan(0.0))

    # The variable s models the portfolio risk term.
    s = M.variable("s", 1, Domain.greaterThan(0.0))

    # The variable sq models the robustness term.
    sq = M.variable("sq", 1, Domain.greaterThan(0.0))

    # Budget constraint
    M.constraint('budget', Expr.sum(x) == 1.0)

    # Objective
    delta = M.parameter()
    wc_return = x.T @ mu0 - gamma * sq
    M.objective('obj', ObjectiveSense.Maximize, wc_return - delta * s)

    # Robustness
    M.constraint('robustness', Expr.vstack(sq, GQ.T @ x), Domain.inQCone())

    # Risk constraint
    M.constraint('risk', Expr.vstack(s, 1, G.T @ x),
                         Domain.inRotatedQCone())

    # Collect one result row per risk-aversion value. The columns are
    # the summary statistics followed by the security names.
    columns = ["delta", "obj", "return", "risk"] + \
              df_prices.columns.tolist()
    rows = []
    for d in deltas:
        # Update parameter
        delta.setValue(d)

        # Solve optimization
        M.solve()

        # Save results
        portfolio_return = mu0 @ x.level() - gamma * sq.level()[0]
        portfolio_risk = np.sqrt(2 * s.level()[0])
        rows.append([d, M.primalObjValue(),
                     portfolio_return, portfolio_risk] + \
                    list(x.level()))

    # DataFrame.append was removed in pandas 2.0, so the DataFrame is
    # built once from the collected rows.
    df_result = pd.DataFrame(rows, columns=columns)

Finally, we compute the efficient frontier in the following points:

deltas = np.logspace(start=-1, stop=2, num=20)[::-1]

If we plot the efficient frontier in Fig. 10.1 and the portfolio composition in Fig. 10.2, we can compare the results obtained with and without robust optimization.

Fig. 10.1 The efficient frontier (image: _images/eff_frontier_robust.png).¶

Fig. 10.2 Portfolio composition $x$ with varying level of risk-aversion $δ$ (image: _images/portfolio_composition_robust.png).¶
