A set optimization approach to utility maximization under transaction costs
Andreas H. Hamel∗, Sophie Qingzhen Wang†
Abstract
A set optimization approach to multi-utility maximization is presented, and duality results are obtained for discrete market models with proportional transaction costs. The novel approach admits to obtain results for non-complete preferences, where the formulas derived closely resemble but generalize the scalar case.
Keywords. utility maximization, non-complete preference, multi-utility representation, set optimization, duality theory, transaction costs
JEL classification. C61, G11
1 Introduction
In this note, we propose a set-valued approach to utility maximization for market models with transaction costs. For finite probability spaces and a one-period set-up, we derive results which resemble very closely the scalar case as discussed in [4, Theorem 3.2.1]. This is far beyond other approaches in which only scalar utility functions are used as, for example, in [1, 3], where a complete preference for multivariate position is assumed. As far as we are aware of, there is no argument justifying such strong assumption, and it does not seem appropriate for market models with transaction costs.
On the other hand, recent results on multi-utility representations as given, among others, in [5] lead to the question how to formulate and solve an “expected multi-utility” maximization problem. The following optimistic goal formulated by Bosi and Herden in [2] does not seem achievable since, in particular, there is no satisfactory multi-objective duality which matches the power of the scalar version: ‘Moreover, as it reduces finding the maximal elements in a given subset of X with respect to ꢀ to a multi-objective optimization problem (cf. Evren and Ok, 2011), in applications, this approach is likely to be more useful than the Richter-Peleg approach.’
The question remains how duality could work in a multi-utility framework. Using a simple model with finitely many utility functions as an exemplary case, our answer is that it works almost in the same way as in the scalar case if set optimization methods are used.
∗Free University Bozen, Faculty of Economics and Management, [email protected] †Harvard University Department of Economics, [email protected]
1
Our approach is different, and it has several advantages. First, we demonstrate that the
“usual” duality can be established if one allows for a set-valued extension of the problem. This means that the vector-valued problem is replaced by a problem where the objective function maps into a well-defined complete lattice of sets. Secondly, the problem formulation allows to separate the market model, namely the time dependent solvency cone, from the the preference relation expressed by a (vector-valued) utility function and provides a link to no-arbitrage type results for market models with transaction costs as, for example, given in [16].
Our model can be interpreted as follows: The decision maker has a complete preference for random positions of each asset in the market separately which is representable by a utility function (the classic von Neumann-Morgenstern set-up). These preferences work “component-wise,’ i.e. they are determined independently of the other assets. This may seem very restrictive and is indeed not the most general situation. However, there are two arguments in favor of studying it. First, it admits non-complete preferences which goes beyond the above quoted references and actually provides a path for a solution of the corresponding (multi-)utility maximization problem. Secondly, via the market model, an exchange mechanism is built into the model which makes the components of the portfolio dependent on each other when one looks for best alternatives. The underlying idea is that one can exchange assets, but cannot exchange utility of one asset for utility of another.
The main tool is a set-valued Lagrangian, to which recent duality results for set-valued functions from [10] are applied (compare also the survey [8]). The resulting formulas look very much like the scalar ones, a feature that is nearly impossible to achieve if one would only apply multi-objective optimization duality.
The results are based on the senior thesis of the second author written in 2011 at Princeton
University and supervised by the first author and B. Rudloff (now at Vienna University for Economics and Business).
2 Problem formulation
Let (Ω, P) be a finite probability space where the σ-algebra is assumed to be the power set
of Ω = {ω1, ω2, . . . , ωN } with pn := P ({ωn}) > 0 for all n ∈ {1, . . . , N}.
A one-period conical model for a market with d assets is given by a pair (K0, KT = KT (ω)) of finitely generated convex cones satisfying IRd+ ⊆ K0 = IRd and IRd+ ⊆ KT (ω) = IRd for all ω ∈ Ω. The cones K0, KT are called solvency cones and may arise, for example, due to explicit proportional transaction costs (see [13]) or bid-ask spreads (see [16]).
The linear space of IRd-valued random variables X : Ω → IRd is denoted by L0d :=
L0d(Ω, P) which can be identified with IRdN . Further, denote
ꢀ
L0d (KT ) = X ∈ L0d | ∀ω ∈ Ω: X (ω) ∈ KT (ω) ,
- ꢃ
- ꢄ
- ꢁ
- ꢂ
and L0d = L0d IRd+ . Finally, 1I ∈ L0d stands for the function with 1I (ω) = 1 for all ω ∈ Ω.
+
In this note, it is assumed that d utility functions Ui : IR → IR ∪ {−∞}, i = 1, . . . , d are given which are concave and non-decreasing. We define a vector-valued function U : IRd →
2
IRd ∪ {−∞} by
(
(U1 (x1) , U2 (x2) , . . . , Ud (xd))T
::
∀i ∈ {1, 2, . . . , d} : xi ∈ dom Ui
otherwise
U (x) =
−∞
This can be interpreted as follows. The investor has a scalar utility function for each of the d assets which is independent from holdings in the others, and a portfolio is not acceptable for the investor if one asset produces a utility −∞. Of course, this means that the investor has a complete preference for random positions of each asset which admits a von NeumannMorgenstern representation. We note that even this might be a strong assumption and refer to [6, Chap. 2] for further details. However, this assumption is in some respect much less
d
ˆstrong than the existence of a function U : IR → IR ∪ {−∞} which represents a complete
preference (a total order) on the set of all multivariate positions. This is the starting point e.g. in the recent [1, 3]
Since the Ui’s are concave and non-decreasing, the function U is IRd+-concave, that is
∀t ∈ (0, 1) , ∀x, x0 ∈ IRd : U (tx + (1 − t) x0) ∈ tU (x) + U ((1 − t) x0) − IRd+, and U is IRd+-monotone increasing, that is x ∈ x0 + IRd+ ⇒ U (x) ∈ U (x0) + IRd+ where we agree upon x+(−∞) = (−∞)+x = −∞, t (−∞) = −∞ for all t > 0 and −∞ ≤ x for all x ∈ IRd.
The basic problem of this note is
maximize IE [U (X)] subject to X ∈ x1I − K01I − L0d (KT ) where x ∈ IRd is the given initial endowment and the expected value is understood componentwise with IE [−∞] = −∞.
The set x1I − K01I − L0d (KT ) is precisely the set of (random) portfolios which can be obtained by trading at time t = 0 and time t = T starting with the initial endowment x ∈ IRd. The first question is how the maximization is understood. In contrast to (more popular) vector approaches, we will understand the above problem as a set-valued one. The advantage of this approach is that the notions of supremum and infimum make sense, and consequently, that a complete duality theory is available. One may compare the appendix and the survey [8] for details.
We define the set
- n
- ꢃ
- ꢄo
G(IRd, −IRd+) = A ⊆ IRd | A = cl co A − IR+d
- ꢃ
- ꢄ
and introduce an addition on G IRd, −IR+d by
(
cl {a1 + a2 | a1 ∈ A1, a2 ∈ A2}
::
A1, A2 = ∅
otherwise
A1 ⊕ A2 =
,
∅
and a multiplication · with non-negative real numbers by
−IRd+
{ta | a ∈ A}
∅
:::t = 0
- t · A =
- t > 0, A = ∅
t > 0, A = ∅
3
- ꢃ
- ꢄ
On G IRd, −IRd+ , the inclusion ⊆ is a partial order which is compatible with the two
ꢃ ꢃ
- ꢄ
- ꢄ
algebraic operations just defined, thus the quadruple G IRd, −IR+d , ⊕, ·, ⊆ is an ordered conlinear space in the sense of [7, 8]. We shall denote this structure by GO and drop the · for multiplication in most cases.
The infimum and supremum of a set A ⊆ GO are given by
- [
- \
- sup A = cl co
- A, inf A =
A
- A∈A
- A∈A
which are clearly elements of GO, thus GO even is a complete lattice (see [7, 8]). Here, we agree upon sup A = ∅ in GO for A = ∅.
Our basic problem now is to find, in GO,
- n
- o
sup IE [U (X)] − IRd+ | X ∈ x1I − K01I − Ld0 (KT )
(UMAX) where we use the convention IE [U (X)] − IRd+ = ∅ whenever IE [U (X)] = −∞.
The plan of the remainder of the paper is as follows. In the next section, we will give a dual characterization of the constraint of (UMAX). Then, we introduce a Lagrangian function, define primal and dual value functions and establish a strong duality theorem which provides the essentials for a solution of (UMAX). The reader may compare [4, Chapter 3] for a concise presentation for the scalar case d = 1. It is the main purpose of this note to demonstrate that the vector-valued case d > 1 can be dealt with by means of set-valued duality as proposed in [7, 10], and that in doing so one obtains meaningful analogs to the well-known scalar formulas. Such a theory is not really possible in terms of the vector order since the supremum (or infimum) with respect to a vector order usually does not make sense since it produces “utopia” solutions which are not feasible in general.
3 Constraints transformation
Defining
C = K01I + L0d (KT )
we want to describe C by means of dual variables. Note that in our setting C is always closed since it can be identified with a finitely generated cone in IRdN (see [15, Theorem 19.1]). In a more general situation, the closedness of the set of all terminal positions follows from a no-arbitrage type condition, see for example [16, Theorem 2.1].
- ꢅ
- ꢆ
Using the duality pairing (X, Y ) → IE Y T X on Ld0 × Ld0, L0d is turned into a Hilbert space which can be identified with its dual, and for this topological setting, we obtain
- ꢅ
- ꢆ
- ꢁ
- ꢂ
- ꢀ
+
L0d (KT ) = L0d KT+ = Y ∈ Ld0 | ∀ω ∈ Ω: Y (ω) ∈ KT+ (ω)
+
where denotes the (positive) dual cone (see [17, p. 7]). Compare [9, Lemma 3.1].
Lemma 3.1 For x ∈ IRd, X ∈ Ld0,
- ꢅ
- ꢆ
x1I − X ∈ C
⇔
∀ (Y, v) ∈ Yd : IE Y T X ≤ vT x
where
- n
- o
- ꢁ
- ꢂ
Yd = (Y, v) ∈ L0d × IRd | v ∈ K0+\{0}, Y ∈ Ld0 KT+ , IE [Y ] = v .
4
Proof. By a separation argument, one may see that the dual cone of C is
- ꢀ
- ꢁ
- ꢂ
C+ = Y ∈ Ld0 | Y ∈ Ld0 KT+ ∧ IE [Y ] ∈ K0+
.
Since C is a closed convex cone the bipolar theorem produces
- ꢅ
- ꢆ
- ꢅ
- ꢆ
x1I − X ∈ C
⇔
∀Y ∈ C+ : IE Y T (x1I − X) = xT IE [Y ] − IE Y T X ≥ 0.
- ꢁ
- ꢂ
The set Yd is in one-to-one relationship with C+\{0}. Indeed, Y ∈ L0d KT+ implies Y ∈
- ꢁ
- ꢂ
Ld0
and this in turn IE [Y ] ≥ 0 since IRd+ ⊆ KT (ω), hence KT+ (ω) ⊆ IRd+ for all ω ∈ Ω.
+
Thus, v = IE [Y ] = 0 implies Y = 0. The rest is obvious.
ꢀ
By means of Lemma 3.4 in [9] it can be shown that the set Yd also is in one-to-one relationship with the set
- ꢇ
- ꢈ
- ꢁ
- ꢂ
dQ dP
Wd = (Q, w) ∈ M1,d × IRd | w ∈ K0+\{0}, diag (w)
∈ L0d KT+
.
P
- dQ
- P
This can be arranged by Y = diag (w) dP , IE [Y ] = w. Here, M1,d is the set of all vector probability measures with components which are absolutely continuous with respect to P. Thus, we can write the utility maximization problem as
- n
- o
sup IE [U (X)] − IRd+ | ∀ (Q, w) ∈ Wd : wT IEQ [X] ≤ wT x .
Under our assumptions, we can reduce the number of constraints in the above problem to a finite number. This can be done due to the fact that the cones K0 and KT (ω) are assumed to be finitely generated. Therefore, the cone C can be identified with a finitely generated cone in IRdN . Hence (see [15, Corollary 19.2.2]) the cone C+ in IRdN is finitely generated by,
- ꢁ
- ꢂ
- ꢁ
- ꢂ
say, Y 1, Y 2, . . . , Y M ∈ Yd. Let Q1, w1 , . . . , QM , wM ∈ Wd be the pairs corresponding
- ꢁ
- ꢅ
- ꢆꢂ
- ꢁ
- ꢅ
- ꢆꢂ
to Y 1, IE Y 1 , . . . , Y M , IE Y M . Then
Qm
- T
- T
x1I − X ∈ C
⇔
∀m ∈ {1, . . . , M} : (wm) IE
[X] ≤ (wm) x.
We can further simplify the problem by observing that, for (Q, w) ∈ Wd,
N
X
wT IEQ [X] ≤ wT x
⇔
wT qˆ(Tn)ξn ≤ wT x