A Set Optimization Approach to Utility Maximization Under Transaction Costs

A set optimization approach to utility maximization under transaction costs

Andreas H. Hamel^∗, Sophie Qingzhen Wang^†

Abstract

A set optimization approach to multi-utility maximization is presented, and duality results are obtained for discrete market models with proportional transaction costs. The novel approach admits to obtain results for non-complete preferences, where the formulas derived closely resemble but generalize the scalar case.

Keywords. utility maximization, non-complete preference, multi-utility representation, set optimization, duality theory, transaction costs

JEL classiﬁcation. C61, G11

1 Introduction

In this note, we propose a set-valued approach to utility maximization for market models with transaction costs. For finite probability spaces and a one-period set-up, we derive results which resemble very closely the scalar case as discussed in [4, Theorem 3.2.1]. This is far beyond other approaches in which only scalar utility functions are used as, for example, in [1, 3], where a complete preference for multivariate position is assumed. As far as we are aware of, there is no argument justifying such strong assumption, and it does not seem appropriate for market models with transaction costs.
On the other hand, recent results on multi-utility representations as given, among others, in [5] lead to the question how to formulate and solve an “expected multi-utility” maximization problem. The following optimistic goal formulated by Bosi and Herden in [2] does not seem achievable since, in particular, there is no satisfactory multi-objective duality which matches the power of the scalar version: ‘Moreover, as it reduces finding the maximal elements in a given subset of X with respect to ꢀ to a multi-objective optimization problem (cf. Evren and Ok, 2011), in applications, this approach is likely to be more useful than the Richter-Peleg approach.’
The question remains how duality could work in a multi-utility framework. Using a simple model with finitely many utility functions as an exemplary case, our answer is that it works almost in the same way as in the scalar case if set optimization methods are used.

^∗Free University Bozen, Faculty of Economics and Management, [email protected] ^†Harvard University Department of Economics, [email protected]

1

Our approach is different, and it has several advantages. First, we demonstrate that the
“usual” duality can be established if one allows for a set-valued extension of the problem. This means that the vector-valued problem is replaced by a problem where the objective function maps into a well-defined complete lattice of sets. Secondly, the problem formulation allows to separate the market model, namely the time dependent solvency cone, from the the preference relation expressed by a (vector-valued) utility function and provides a link to no-arbitrage type results for market models with transaction costs as, for example, given in [16].
Our model can be interpreted as follows: The decision maker has a complete preference for random positions of each asset in the market separately which is representable by a utility function (the classic von Neumann-Morgenstern set-up). These preferences work “component-wise,’ i.e. they are determined independently of the other assets. This may seem very restrictive and is indeed not the most general situation. However, there are two arguments in favor of studying it. First, it admits non-complete preferences which goes beyond the above quoted references and actually provides a path for a solution of the corresponding (multi-)utility maximization problem. Secondly, via the market model, an exchange mechanism is built into the model which makes the components of the portfolio dependent on each other when one looks for best alternatives. The underlying idea is that one can exchange assets, but cannot exchange utility of one asset for utility of another.
The main tool is a set-valued Lagrangian, to which recent duality results for set-valued functions from [10] are applied (compare also the survey [8]). The resulting formulas look very much like the scalar ones, a feature that is nearly impossible to achieve if one would only apply multi-objective optimization duality.
The results are based on the senior thesis of the second author written in 2011 at Princeton
University and supervised by the first author and B. Rudloff (now at Vienna University for Economics and Business).

2 Problem formulation

Let (Ω, P) be a ﬁnite probability space where the σ-algebra is assumed to be the power set

of Ω = {ω₁, ω₂, . . . , ω_N} with p_n:= P ({ω_n}) > 0 for all n ∈ {1, . . . , N}.

A one-period conical model for a market with d assets is given by a pair (K₀, K_T= K_T(ω)) of ﬁnitely generated convex cones satisfying IR^d₊⊆ K₀= IR^dand IR^d₊⊆ K_T(ω) = IR^dfor all ω ∈ Ω. The cones K₀, K_Tare called solvency cones and may arise, for example, due to explicit proportional transaction costs (see [13]) or bid-ask spreads (see [16]).
The linear space of IR^d-valued random variables X : Ω → IR^dis denoted by L⁰_d:=
L⁰_d(Ω, P) which can be identiﬁed with IR^dN. Further, denote

ꢀ

L⁰_d(K_T) = X ∈ L⁰_d| ∀ω ∈ Ω: X (ω) ∈ K_T(ω) ,

ꢃ

ꢄ

ꢁ

ꢂ

and L⁰_d= L⁰_dIR^d₊. Finally, 1I ∈ L⁰_dstands for the function with 1I (ω) = 1 for all ω ∈ Ω.

+

In this note, it is assumed that d utility functions U_i: IR → IR ∪ {−∞}, i = 1, . . . , d are given which are concave and non-decreasing. We deﬁne a vector-valued function U : IR^d→

2

IR^d∪ {−∞} by

(

(U₁(x₁) , U₂(x₂) , . . . , U_d(x_d))^T

::

∀i ∈ {1, 2, . . . , d} : x_i∈ dom U_i

otherwise
U (x) =

−∞

This can be interpreted as follows. The investor has a scalar utility function for each of the d assets which is independent from holdings in the others, and a portfolio is not acceptable for the investor if one asset produces a utility −∞. Of course, this means that the investor has a complete preference for random positions of each asset which admits a von NeumannMorgenstern representation. We note that even this might be a strong assumption and refer to [6, Chap. 2] for further details. However, this assumption is in some respect much less

d

ˆstrong than the existence of a function U : IR → IR ∪ {−∞} which represents a complete

preference (a total order) on the set of all multivariate positions. This is the starting point e.g. in the recent [1, 3]
Since the U_i’s are concave and non-decreasing, the function U is IR^d₊-concave, that is

∀t ∈ (0, 1) , ∀x, x⁰∈ IR^d: U (tx + (1 − t) x⁰) ∈ tU (x) + U ((1 − t) x⁰) − IR^d₊, and U is IR^d₊-monotone increasing, that is x ∈ x⁰+ IR^d₊⇒ U (x) ∈ U (x⁰) + IR^d₊where we agree upon x+(−∞) = (−∞)+x = −∞, t (−∞) = −∞ for all t > 0 and −∞ ≤ x for all x ∈ IR^d.
The basic problem of this note is

maximize IE [U (X)] subject to X ∈ x1I − K₀1I − L⁰_d(K_T) where x ∈ IR^dis the given initial endowment and the expected value is understood componentwise with IE [−∞] = −∞.
The set x1I − K₀1I − L⁰_d(K_T) is precisely the set of (random) portfolios which can be obtained by trading at time t = 0 and time t = T starting with the initial endowment x ∈ IR^d. The first question is how the maximization is understood. In contrast to (more popular) vector approaches, we will understand the above problem as a set-valued one. The advantage of this approach is that the notions of supremum and infimum make sense, and consequently, that a complete duality theory is available. One may compare the appendix and the survey [8] for details.
We define the set

n

ꢃ

ꢄo

G(IR^d, −IR^d₊) = A ⊆ IR^d| A = cl co A − IR₊^d

ꢃ

ꢄ

and introduce an addition on G IR^d, −IR₊^dby

(

cl {a₁+ a₂| a₁∈ A₁, a₂∈ A₂}

::

A₁, A₂= ∅

otherwise

A₁⊕ A₂=

,

∅

and a multiplication · with non-negative real numbers by



−IR^d₊

{ta | a ∈ A}
∅

:::t = 0



t · A =

t > 0, A = ∅



t > 0, A = ∅

3

ꢃ

ꢄ

On G IR^d, −IR^d₊, the inclusion ⊆ is a partial order which is compatible with the two

ꢃ ꢃ

ꢄ

algebraic operations just defined, thus the quadruple G IR^d, −IR₊^d, ⊕, ·, ⊆ is an ordered conlinear space in the sense of [7, 8]. We shall denote this structure by GÔand drop the · for multiplication in most cases.
The infimum and supremum of a set A ⊆ GÔare given by

[

\

sup A = cl co

A, inf A =

A

A∈A

which are clearly elements of GÔ, thus GÔeven is a complete lattice (see [7, 8]). Here, we agree upon sup A = ∅ in GÔfor A = ∅.
Our basic problem now is to find, in GÔ,

n

o

sup IE [U (X)] − IR^d₊| X ∈ x1I − K₀1I − L_d⁰(K_T)
(UMAX) where we use the convention IE [U (X)] − IR^d₊= ∅ whenever IE [U (X)] = −∞.
The plan of the remainder of the paper is as follows. In the next section, we will give a dual characterization of the constraint of (UMAX). Then, we introduce a Lagrangian function, deﬁne primal and dual value functions and establish a strong duality theorem which provides the essentials for a solution of (UMAX). The reader may compare [4, Chapter 3] for a concise presentation for the scalar case d = 1. It is the main purpose of this note to demonstrate that the vector-valued case d > 1 can be dealt with by means of set-valued duality as proposed in [7, 10], and that in doing so one obtains meaningful analogs to the well-known scalar formulas. Such a theory is not really possible in terms of the vector order since the supremum (or inﬁmum) with respect to a vector order usually does not make sense since it produces “utopia” solutions which are not feasible in general.

3 Constraints transformation

Deﬁning
C = K₀1I + L⁰_d(K_T)

we want to describe C by means of dual variables. Note that in our setting C is always closed since it can be identiﬁed with a ﬁnitely generated cone in IR^dN(see [15, Theorem 19.1]). In a more general situation, the closedness of the set of all terminal positions follows from a no-arbitrage type condition, see for example [16, Theorem 2.1].

ꢅ

ꢆ

Using the duality pairing (X, Y ) → IE Y ^TX on L_d⁰× L_d⁰, L⁰_dis turned into a Hilbert space which can be identiﬁed with its dual, and for this topological setting, we obtain

ꢅ

ꢆ

ꢁ

ꢂ

ꢀ

+

L⁰_d(K_T) = L⁰_dK_T⁺= Y ∈ L_d⁰| ∀ω ∈ Ω: Y (ω) ∈ K_T⁺(ω)

+

where denotes the (positive) dual cone (see [17, p. 7]). Compare [9, Lemma 3.1].

Lemma 3.1 For x ∈ IR^d, X ∈ L_d⁰,

ꢅ

ꢆ

x1I − X ∈ C

⇔

∀ (Y, v) ∈ Y_d: IE Y ^TX ≤ v^Tx

where

n

o

ꢁ

ꢂ

Y_d= (Y, v) ∈ L⁰_d× IR^d| v ∈ K₀⁺\{0}, Y ∈ L_d⁰K_T⁺, IE [Y ] = v .

4

Proof. By a separation argument, one may see that the dual cone of C is

ꢀ

ꢁ

ꢂ

C⁺= Y ∈ L_d⁰| Y ∈ L_d⁰K_T⁺∧ IE [Y ] ∈ K₀⁺
.

Since C is a closed convex cone the bipolar theorem produces

ꢅ

ꢆ

ꢅ

ꢆ

x1I − X ∈ C

⇔

∀Y ∈ C⁺: IE Y ^T(x1I − X) = x^TIE [Y ] − IE Y ^TX ≥ 0.

ꢁ

ꢂ

The set Y_dis in one-to-one relationship with C⁺\{0}. Indeed, Y ∈ L⁰_dK_T⁺implies Y ∈

ꢁ

ꢂ

L_d⁰

and this in turn IE [Y ] ≥ 0 since IR^d₊⊆ K_T(ω), hence K_T⁺(ω) ⊆ IR^d₊for all ω ∈ Ω.

+

Thus, v = IE [Y ] = 0 implies Y = 0. The rest is obvious.

ꢀ

By means of Lemma 3.4 in [9] it can be shown that the set Y_dalso is in one-to-one relationship with the set

ꢇ

ꢈ

ꢁ

ꢂ

dQ dP

W_d= (Q, w) ∈ M_1,d× IR^d| w ∈ K₀⁺\{0}, diag (w)

∈ L⁰_dK_T⁺

.

P

dQ

P

This can be arranged by Y = diag (w) _dP, IE [Y ] = w. Here, M_1,dis the set of all vector probability measures with components which are absolutely continuous with respect to P. Thus, we can write the utility maximization problem as

n

o

sup IE [U (X)] − IR^d₊| ∀ (Q, w) ∈ W_d: w^TIE^Q[X] ≤ w^Tx .
Under our assumptions, we can reduce the number of constraints in the above problem to a finite number. This can be done due to the fact that the cones K₀and K_T(ω) are assumed to be finitely generated. Therefore, the cone C can be identified with a finitely generated cone in IR^dN. Hence (see [15, Corollary 19.2.2]) the cone C⁺in IR^dNis finitely generated by,

ꢁ

ꢂ

ꢁ

ꢂ

say, Y ¹, Y ², . . . , Y ^M∈ Y_d. Let Q¹, w¹, . . . , Q^M, w^M∈ W_dbe the pairs corresponding

ꢁ

ꢅ

ꢆꢂ

ꢁ

ꢅ

ꢆꢂ

to Y ¹, IE Y ¹, . . . , Y ^M, IE Y ^M. Then