5 Exponential cone optimization¶
So far we discussed optimization problems involving the major “polynomial” families of cones: linear, quadratic and power cones. In this chapter we introduce a single new object, namely the three-dimensional exponential cone, together with examples and applications. The exponential cone can be used to model a variety of constraints involving exponentials and logarithms.
5.1 Exponential cone¶
The exponential cone is a convex subset of
Thus the exponential cone is the closure in
When working with logarithms, a convenient reformulation of (5.2) is
Alternatively, one can write the same condition as
which immediately shows that
is positive semidefinite for

Fig. 5.1 The boundary of the exponential cone
5.2 Modeling with the exponential cone¶
Extending the conic optimization toolbox with the exponential cone leads to new types of constraint building blocks and new types of representable sets. In this section we list the basic operations available using the exponential cone.
5.2.1 Exponential¶
The epigraph
5.2.2 Logarithm¶
Similarly, we can express the hypograph
5.2.3 Entropy¶
The entropy function
5.2.4 Relative entropy¶
The relative entropy or Kullback-Leiber divergence of two probability distributions is defined in terms of the function
Because of this reparametrization the exponential cone is also referred to as the relative entropy cone, leading to a class of problems known as REPs (relative entropy problems). Having the relative entropy function available makes it possible to express epigraphs of other functions appearing in REPs, for instance:
5.2.5 Softplus function¶
In neural networks the function
and therefore
5.2.6 Log-sum-exp¶
We can generalize the previous example to a log-sum-exp (logarithm of sum of exponentials) expression
This is equivalent to the inequality
and so it can be modeled as follows:
5.2.7 Log-sum-inv¶
The following type of bound has applications in capacity optimization for wireless network design:
Since the logarithm is increasing, we can model this using a log-sum-exp and an exponential as:
Alternatively, one can also rewrite the original constraint in equivalent form:
and then model the right-hand side inequality using the technique from Sec. 3.2.7 (Harmonic mean). This approach requires only one exponential cone.
5.2.8 Arbitrary exponential¶
The inequality
where
and therefore to
5.2.9 Lambert W-function¶
The Lambert function
It is the real branch of a more general function which appears in applications such as diode modeling. The
and so it can be modeled with a mix of exponential and quadratic cones (see Sec. 3.1.2 (Rotated quadratic cones)):
5.2.10 Other simple sets¶
Here are a few more typical sets which can be expressed using the exponential and quadratic cones. The presentations should be self-explanatory; we leave the simple verifications to the reader.
Set |
Conic representation |
---|---|
5.3 Geometric programming¶
Geometric optimization problems form a family of optimization problems with objective and constraints in special polynomial form. It is a rich class of problems solved by reformulating in logarithmic-exponential form, and thus a major area of applications for the exponential cone
5.3.1 Definition and basic examples¶
A monomial is a real valued function of the form
where the exponents
For example, the following functions are monomials (in variables
and the following are examples of posynomials:
A geometric program (GP) is an optimization problem of the form
where
A geometric program (5.14) can be modeled in exponential conic form by making a substitution
Under this substitution a monomial of the form (5.11) becomes
for
where
Example
We demonstrate this reduction on a simple example. Take the geometric problem
By substituting
and using the log-sum-exp reduction from Sec. 5.2.6 (Log-sum-exp) we write an explicit conic problem:
Solving this problem yields
5.3.2 Generalized geometric models¶
In this section we briefly discuss more general types of constraints which can be modeled with geometric programs.
Monomials
If
Monomial inequalities
In similar vein, if
It also means that we can add lower and upper variable bounds:
Products and positive powers
Expressions involving products and positive powers (possibly iterated) of posynomials can again be modeled with posynomials. For example, a constraint such as
can be replaced with
Other transformations and extensions
If
are already expressed by posynomials then is clearly equivalent to and . Hence we can add the maximum operator to the list of building blocks for geometric programs.If
are posynomials, is a monomial and we know that then the constraint is equivalent to .The objective function of a geometric program (5.14) can be extended to include other terms, for example:
where
are monomials. After the change of variables we get a slightly modified version of (5.15):(note the lack of one logarithm) which can still be expressed with exponential cones.
5.3.3 Geometric programming case studies¶
Frobenius norm diagonal scaling
Suppose we have a matrix
Minimizing the last sum is an example of a geometric program with variables
Maximum likelihood estimation
Geometric programs appear naturally in connection with maximum likelihood estimation of parameters of random distributions. Consider a simple example. Suppose we have two biased coins, with head probabilities
The probability of obtaining the given outcome equals
and, up to constant factors, maximizing that expression is equivalent to solving the problem
or, as a geometric problem:
For example, if
An Olympiad problem
The 26th Vojtěch Jarník International Mathematical Competition, Ostrava 2016. Let
Using the tricks introduced in Sec. 5.3.2 (Generalized geometric models) we formulate this problem as a geometric program:
Unsurprisingly, the optimal value of this program is
Power control and rate allocation in wireless networks
We consider a basic wireless network power control problem. In a wireless network with
Maximizing the minimal SINR over all receivers (
In the low-SNR regime the problem of system rate maximization is approximated by the problem of maximizing
For more information and examples see [BKVH07].
5.4 Exponential cone case studies¶
In this section we introduce some practical optimization problems where the exponential cone comes in handy.
5.4.1 Risk parity portfolio¶
Consider a simple version of the Markowitz portfolio optimization problem introduced in Sec. 3.3.3 (Markowitz portfolio optimization), where we simply ask to minimize the risk
where
We call (5.20) the risk parity condition. It indeed models equal risk contribution from all the assets, because as one can easily check
Risk parity portfolios satisfying condition (5.20) can be found with an auxiliary optimization problem:
for any
The conic form of problem (5.21) is:
5.4.2 Entropy maximization¶
A general entropy maximization problem has the form
where
Maximization of the entropy function
Often one has an a priori distribution
which leads to an optimization problem
5.4.3 Hitting time of a linear system¶
Consider a linear dynamical system
where we assume for simplicity that
with the following conic form, where the variables are
See Fig. 5.2 for an example. Other criteria for the target set of the trajectories are also possible. For example, polyhedral constraints
are also expressible in exponential conic form for starting points
For a robust version involving uncertainty on

Fig. 5.2 With
5.4.4 Logistic regression¶
Logistic regression is a technique of training a binary classifier. We are given a training set of examples
and choosing label
By taking logarithms and adding regularization with respect to
Problem (5.27) is convex, and can be more explicitly written as
involving softplus type constraints (see Sec. 5.2 (Modeling with the exponential cone)) and a quadratic cone. See Fig. 5.3 for an example.

Fig. 5.3 Logistic regression example with none, medium and strong regularization (small, medium, large