4 The power cones

So far we studied quadratic cones and their applications in modeling problems involving, directly or indirectly, quadratic terms. In this part we expand the quadratic and rotated quadratic cone family with power cones, which provide a convenient language to express models involving powers other than 2. We must stress that although the power cones include the quadratic cones as special cases, at the current state-of-the-art they require more advanced and less efficient algorithms.

4.1 The power cone(s)

n-dimensional power cones form a family of convex cones parametrized by a real number 0<α<1:

(4.1)Pnα,1α={xRn : x1αx21αx32++xn2, x1,x20}.

The constraint in the definition of Pnα,1α can be expressed as a composition of two constraints, one of which is a quadratic cone:

(4.2)x1αx21α|z|,zx32++xn2,

which means that the basic building block we need to consider is the three-dimensional power cone

(4.3)P3α,1α={xR3 : x1αx21α|x3|, x1,x20}.

More generally, we can also consider power cones with “long left-hand side”. That is, for m<n and a sequence of exponents α1,,αm with α1++αm=1, we have the most general power cone object defined as

(4.4)Pnα1,,αm={xRn : i=1mxiαii=m+1nxi2, x1,,xm0}.

The left-hand side is nothing but the weighted geometric mean of the xi, i=1,,m with weights αi. In particular when all αi are equal and there is only one element on the right-hand side we obtain as a special case the geometric mean cone:

(4.5)GMn={xRn : (i=1n1xi)1/(n1)|xn|, x1,,xn10}.

As we will see later, both the most general power cone and the geometric mean cone can be modeled as a composition of three-dimensional cones P3α,1α, so in a sense that is the basic object of interest.

There are some notable special cases we are familiar with. If we let α0 then in the limit we get Pn0,1=R+×Qn1. If α=12 then we have a rescaled version of the rotated quadratic cone, precisely:

(x1,x2,,xn)Pn12,12(x1/2,x2/2,x3,,xn)Qrn.

A gallery of three-dimensional power cones for varying α is shown in Fig. 4.1.

_images/powcone.png

Fig. 4.1 The boundary of P3α,1α for α=0.1,0.2,0.35,0.5.

4.2 Sets representable using the power cone

In this section we give basic examples of constraints which can be expressed using power cones.

4.2.1 Powers

For all values of p0,1 we can bound xp depending on the convexity of f(x)=xp.

  • For p>1 the inequality t|x|p is equivalent to t1/p|x| and hence corresponds to

    t|x|p(t,1,x)P31/p,11/p.

    For instance t|x|1.5 is equivalent to (t,1,x)P32/3,1/3.

  • For 0<p<1 the function f(x)=xp is concave for x0 and so we get

    |t|xp, x0(x,1,t)P3p,1p.
  • For p<0 the function f(x)=xp is convex for x>0 and in this range the inequality txp is equivalent to

    txpt1/(1p)xp/(1p)1(t,x,1)P31/(1p),p/(1p).

    For example t1x is the same as (t,x,1)P32/3,1/3.

4.2.2 p-norm cones

Let p1. The p-norm of a vector xRn is xp=(|x1|p++|xn|p)1/p and the p-norm ball of radius t is defined by the inequality xpt. We take the p-norm cone (in dimension n+1) to be the convex set

(4.6){(t,x)Rn+1 : txp}.

For p=2 this is precisely the quadratic cone. We can model the p-norm cone by writing the inequality txp as:

ti|xi|p/tp1

and bounding each summand with a power cone. This leads to the following model:

(4.7)ritp1|xi|p((ri,t,xi)P31/p,11/p),ri=t.

When 0<p<1 or p<0 the formula for xp gives a concave, rather than convex function on R+n and in this case it is possible to model the set

{(t,x) : 0t(xip)1/p,xi0}, p<1,p0.

We leave it as an exercise (see previous subsection). The case p=1 appears in Sec. 3.2.7 (Harmonic mean).

4.2.3 The most general power cone

Not all conic optimization software provides direct access to the most general version of the power cone Pnα1,,αm. In this case it is good to know how to model it using the basic three-dimensional power cones P3α,1α. Clearly it suffices to consider a short right-hand side, that is to model the cone

(4.8)Pm+1α1,,αm={x1α1x2α2xmαm|z|, x1,,xm0},

where iαi=1. Denote s=α1++αm1. We split (4.8) into two constraints

(4.9)x1α1/sxm1αm1/s|t|, x1,,xm10,tsxmαm|z|,xm0,

and this way we expressed Pm+1α1,,αm using two power cones Pmα1/s,,αm1/s and P3s,αm. Proceeding by induction gives the desired splitting.

4.2.4 Geometric mean

The geometric mean cone GMn+1=Pn+11/n,,1/n is a direct way to model the inequality

(4.10)|z|(x1x2xn)1/n,

which corresponds to maximizing the geometric mean of the variables xi0. In case this cone is not directly available to the user, tracing through the splitting (4.9) produces an equivalent representation of (4.10) using three-dimensional power cones as follows:

x11/2x21/2|t3|,t311/3x31/3|t4|,tn111/(n1)xn11/(n1)|tn|,tn11/nxn1/n|z|.

4.2.5 Non-homogenous constraints

Every constraint of the form

x1α1x2α2xmαm|z|β, xi0,

where iαi<β and αi>0 is equivalent to

(x1,x2,,xm,1,z)Pm+2α1/β,,αm/β,s

with s=1iαi/β.

4.3 Power cone case studies

4.3.1 Portfolio optimization with market impact

Let us go back to the Markowitz portfolio optimization problem introduced in Sec. 3.3.3 (Markowitz portfolio optimization), where we now ask to maximize expected profit subject to bounded risk in a long-only portfolio:

(4.11)maximizeμTxsubject toxTΣxγ,xi0, i=1,,n,

In a realistic model we would have to consider transaction costs which decrease the expected return. In particular if a really large volume is traded then the trade itself will affect the price of the asset, a phenomenon called market impact. It is typically modeled by decreasing the expected return of i-th asset by a slippage cost proportional to xβ for some β>1, so that the objective function changes to

maximize(μTxiδixiβ).

A popular choice is β=3/2. This objective can easily be modeled with a power cone as in Sec. 4.2 (Sets representable using the power cone):

maximizeμTxδTtsubject to(ti,1,xi)P31/β,11/β(tixiβ),

In particular if β=3/2 the inequality tixi3/2 has conic representation (ti,1,xi)P32/3,1/3.

4.3.2 Maximum volume cuboid

Suppose we have a convex, conic representable set KRn (for instance a polyhedron, a ball, intersections of those and so on). We would like to find a maximum volume axis-parallel cuboid inscribed in K. If we denote by pRn the leftmost corner and by x1,,xn the edge lengths, then this problem is equivalent to

(4.12)maximizetsubject tot(x1xn)1/n,xi0,(p1+e1x1,,pn+enxn)K,e1,,en{0,1},

where the last constraint states that all vertices of the cuboid are in K. The optimal volume is then v=tn. The geometric mean cone GMn+1 can be used for the volume maximization constraint.

Maximizing the volume of an arbitrary (not necessarily axis-parallel) cuboid inscribed in K is no longer a convex problem. However, it can be solved by maximizing the solution to (4.12) over all sets T(K) where T is a rotation (orthogonal matrix) in Rn. In practice one can approximate the global solution by sampling sufficiently many rotations T or using more advanced methods of optimization over the orthogonal group.

_images/icosahedron.png

Fig. 4.2 The maximal volume cuboid inscribed in the regular icosahedron takes up approximately 0.388 of the volume of the icosahedron (the exact value is 3(1+5)/25).

4.3.3 p-norm geometric median

The geometric median of a sequence of points x1,,xkRn is defined as

argminyRni=1kyxi,

that is a point which minimizes the sum of distances to all the given points. Here can be any norm on Rn. The most classical case is the Euclidean norm, where the geometric median is the solution to the basic facility location problem minimizing total transportation cost from one depot to given destinations.

For a general p-norm xp with 1p< (see Sec. 4.2.2 (p-norm cones)) the geometric median is the solution to the obvious conic problem:

(4.13)minimizeitisubject totiyxip,yRn.

In Sec. 4.2.2 (p-norm cones) we showed how to model the p-norm bound using n power cones.

The Fermat-Torricelli point of a triangle is the Euclidean geometric mean of its vertices, and a classical theorem in planar geometry (due to Torricelli, posed by Fermat), states that it is the unique point inside the triangle from which each edge is visible at the angle of 120 (or a vertex if the triangle has an angle of 120 or more). Using (4.13) we can compute the p-norm analogues of the Fermat-Torricelli point. Some examples are shown in Fig. 4.3.

_images/fermat.png

Fig. 4.3 The geometric median of three triangle vertices in various p-norms.

4.3.4 Maximum likelihood estimator of a convex density function

In [TV98] the problem of estimating a density function that is know in advance to be convex is considered. Here we will show that this problem can be posed as a conic optimization problem. Formally the problem is to estimate an unknown convex density function g:R+R+ given an ordered sample y1<y2<<yn of n outcomes of a distribution with density g.

The estimator g~0 is a piecewise linear function

g~:[y1,yn]R+

with break points at (yi,xi), i=1,,n, where the variables xi>0 are estimators for g(yi). The slope of the i-th linear segment of g~ is

xi+1xiyi+1yi.

Hence the convexity requirement leads to the constraints

xi+1xiyi+1yixi+2xi+1yi+2yi+1,i=1,,n2.

Recall the area under the density function must be 1. Hence,

i=1n1(yi+1yi)(xi+1+xi2)=1

must hold. Therefore, the problem to be solved is

maximizei=1nxisubject toxi+1xiyi+1yixi+2xi+1yi+2yi+10,i=1,,n2,i=1n1(yi+1yi)(xi+1+xi2)=1,x0.

Maximizing i=1nxi or the geometric mean (i=1nxi)1n will produce the same optimal solutions. Using that observation we can model the objective with the geometric mean cone GMn+1. Alternatively, one can use as objective ilogxi and model it using the exponential cone as in Sec. 5.2 (Modeling with the exponential cone).