Faculty of Science Department of Applied Mathematics, Computer Science and Statistics
Utility based risk measures
Jasmine Maes
Promotor: Prof. Dr. D. Vyncke Supervisor: H. Gudmundsson
Thesis submitted tot obtain the academic degree of Master of Science: Applied Mathematics
Academic year 2015–2016
Acknowledgements
First of all I would like to thank my supervisor Mr. Gundmundson for letting me come by his office whenever I felt like I needed it, for coming up with good ideas and for supporting me throughout the thesis. I also would like to thank my promotor prof. Vyncke for giving me advice when I asked for it, while still allowing me a lot of freedom. Last but not least I would like to thank my friends for listing to all my complaints when things didn’t go as planned and when I got stuck, and my parents for their financial support during my education.
The author gives permission to make this master thesis available for consultation and to copy parts of this master thesis for personal use. In the case of any other use, the limitations of the copyright have to be respected, in particular with regard to the obligation to state expressly the source when quoting results from this master dissertation.
Ghent, 1 June 2016.
2
Contents
Preface 6
1 Mathematical representation of risk 8 1.1 Definitions and properties ...... 8 1.2 The acceptance set of a risk measure ...... 12 1.3 The penalty function ...... 16 1.4 Robust representation of convex risk measures ...... 20
2 An introduction to decision theory 24 2.1 The axioms of von Neumann-Morgenstern ...... 24 2.2 Risk and utility ...... 26 2.3 Certainty equivalents ...... 28 2.3.1 The ordinary certainty equivalent ...... 28 2.3.2 The optimised certainty equivalent ...... 29 2.3.3 The u-Mean certainty equivalent ...... 33 2.4 The exponential utility function ...... 33 2.5 Stochastic dominance ...... 35
3 Value at Risk and Expected shortfall 38 3.1 Value at Risk ...... 38 3.1.1 General properties ...... 39 3.1.2 Consistency with expected utility maximisation ...... 42 3.2 Expected shortfall ...... 45 3.2.1 General properties ...... 46 3.2.2 Consistency with expected utility maximisation ...... 49
4 Utility based risk measures 54 4.1 Utility based shortfall risk measures ...... 54 4.2 Divergence risk measures ...... 60 4.2.1 Construction and representation ...... 60 4.2.2 The coherence of divergence risk measures ...... 71 4.2.3 Examples ...... 75 4.3 The ordinary certainty equivalent as risk measure ...... 76
5 Utility functions 78 5.1 The power utility functions ...... 80 5.2 The exponential utility functions ...... 81 5.3 The polynomial utility functions ...... 84
4 5.4 The SAHARA utility functions ...... 88 5.5 The κ-utility functions ...... 96
Conclusion 102
A Dutch summary 104
B Additional computations 107 B.1 Computations regarding the SAHARA utility class ...... 107 B.1.1 Computation of the utility function ...... 107 B.1.2 Computation of the divergence function ...... 110 B.2 Computations regarding the κ-utility class ...... 112 B.2.1 Determining the asymptotic behaviour ...... 112 B.2.2 Computation of the divergence function ...... 114
5 Preface
When choosing between different investment opportunities it is tempting to select the one which offers the highest expected return. However, this strategy would ignore the risk associated with that investment. Generally speaking we have that the larger the expected return of an investment, the larger the risk associated with it. Taking into account the risk of a particular investment is not only necessary to pick the best investment, but also to set up capital requirements. These capital requirements should create a buffer for potential losses of the investments. But how do we describe and measure this risk? We could of course try to describe the cumulative distribution or the density function of the investment. Although this would give us a lot of information about the risk involved, it could still be very difficult to compare different investment opportunities in terms of risk. But a more important problem is that it gives us too much information in some sense. Therefore it would be useful to summarize the distribution of the investment into a number, which represents the risk. These numbers can then be used to determine the necessary capital requirements. More formally if the stochastic variable X models the returns of an investment, then a risk measure is a mapping ρ : X 7→ ρ(X) such that ρ(X) ∈ R. Because a stochastic variable can be viewed as a function, a risk measure can be interpreted as a functional. We could therefore study risk measures by looking at them as purely mathematical objects. Using techniques and ideas from math- ematical analysis we could then analyse properties of these functionals. This is exactly what we will do in the first chapter of this thesis. Studying risk measures only from a purely mathematical point of view has the downside that it ignores the intuition behind it. The attitude towards risky alter- natives is a subjective matter determined by personal preferences. These personal preferences can be represented using so called utility functions. Utility functions are commonly used in economics to model how people make decisions under un- certainty. In the second chapter we will therefore introduce this decision theoretic framework and explain the necessary concepts of economics. Armed with both a strong mathematical and economic framework we will then apply these concepts to two commonly used risk measures in industry, Value at Risk and Expected Shortfall. This analysis will be the subject of the third chapter. The fourth chapter combines the axiomatic approach from the first chapter and the economic ideas from the second chapter and describes different ways in which utility functions can be used to construct risk measures. We will introduce utility based shortfall risk measures and divergences risk measures. Using ideas from mathematical optimisation we will link different utility based risk measures and discuss different representations of these risk measures.
6 After this discussion the question arises which utility function we should use to construct these utility based risk measures. Because utility functions represent personal preferences we do not believe that there is a straightforward answer to this question. However, the properties of the utility function used in the risk measures do affect this risk measure. The last chapter takes a closer look at different classes of utility functions which appear in literature and asses their properties in the context of utility based risk measures.
7 1 Mathematical representation of risk
In this chapter we will look at risk measures from a solely mathematical point of view. We will define what a risk measure is, and what properties it should have. We will take a closer look at the concepts of the acceptance set and the penalty function. Finally we will introduce the robust representation of a risk measure. The contents of this chapter is largely based on the theorems found in [11].
1.1 Definitions and properties
Consider a probability space (Ω, F,P ). Where Ω represents the set of all possible scenario’s, where F is a σ-algebra and where P is a probability measure. The future value of a scenario is uncertain and can be represented by a stochastic variable X. This is a function on the set of all possible scenario’s to the real numbers, X :Ω → R. Let X denote a given linear space of functions X :Ω → R including the constants. A risk measure ρ is a mapping ρ : X → R. Our goal is to define ρ in such a way that it can quantify the risk of a market position X, such that it can serve as a measure to determine the capital requirement of X. That is the amount of capi- tal needed when invested in a risk-free manner will make the position acceptable. Using this interpretation of ρ(X), we would like to have a risk measure that has some likeable properties.
First of all, if the value of the portfolio X is smaller then the value of the portfolio Y almost surely, then it would be logical that than you need more money to make the position of X acceptable, than to make the position of Y acceptable. This property is called monotonicity. Property 1. (Monotonicity) If X ≤ Y , then ρ(X) ≥ ρ(Y ). Secondly, it is logical to assume that to make the position X +m acceptable, where m is a risk-free amount, we need to have ρ(X) − m. This is precisely the amount
8 ρ(X) to make the position X acceptable reduced by the risk-free amount m we already had. This property is called translation invariance or cash invariance. Property 2. (Translation invariance) If m ∈ R, then ρ(X + m) = ρ(X) − m. Definition 1.1. A mapping ρ : X → R is called a monetary risk measure if ρ satisfies both monotonicity and translation invariance. Here we would like to point out that some authors define a monetary risk such that ρ can also take the values of +∞ and −∞. But then they use the additional property that ρ(0) is finite or even normalized ρ(0) = 0.
In [11] we found the following lemma. Lemma 1.1. Any monetary risk measure ρ is Lipschitz continuous with respect to the supremum norm k · k, we have: |ρ(X) − ρ(Y )| ≤ kX − Y k (1.1) Proof. We have that X − Y ≤ sup |X(ω) − Y (ω)|, ω∈Ω hence X ≤ Y +kX−Y k. Using monotonicity we find that ρ(X) ≥ ρ (Y + kX − Y k). Using translation invariance we get ρ(X) ≥ ρ(Y ) − kX − Y k. This gives us that ρ(X) − ρ(Y ) ≥ −kX − Y k or equivalently, ρ(Y ) − ρ(X) ≤ kX − Y k. We also have that Y − X ≤ sup |Y (ω) − X(ω)|. ω∈Ω Again using monotonicity and translation invariance we find that ρ(Y ) ≥ ρ(X) − kY −Xk. From this we conclude that ρ(X)−ρ(Y ) ≤ kY −Xk = kX −Y k. Hence we have that ρ(Y ) − ρ(X) ≤ kX − Y k and ρ(X) − ρ(Y ) ≤ kX − Y k. This leads us to conclude that |ρ(X) − ρ(Y )| ≤ kX − Y k.
An important subclass of monetary risk measures are the convex risk measures. These risk measures have the extra property of being convex. Property 3. (Convexity) ρ(λX +(1−λ)Y ) ≤ λρ(X)+(1−λ)ρ(Y ), for 0 ≤ λ ≤ 1 We will prove in lemma 1.2 that for monetary risk measures this property is equiv- alent with the property of quasi convexity. Property 4. (Quasi convexity) ρ(λX + (1 − λ)Y ) ≤ max (ρ(X), ρ(Y )) Definition 1.2. A convex risk measure is a monetary risk measure satisfying the convexity property. We can easily interpret the property of quasi convexity. Consider an investor who can invest his resources in such a way that he obtains X, or in another way so that he obtains Y . If he spends only a fraction λ of his resources on the first investment strategy and the rest on Y , he will obtain λX + (1 − λ)Y . This diversification strategy will give him a risk of ρ(λX + (1 − λ)Y ). The property of quasi convexity then states that the risk of this diversified portfolio cannot be greater than the risk of the riskiest investment strategy. In [11, p 178] we find the following statement which we will prove in this thesis.
9 Lemma 1.2. A monetary risk measure is convex if and only if it is quasi convex. Proof. First consider a risk measure satisfying convexity, hence ρ(λX+(1−λ)Y ) ≤ λρ(X) + (1 − λ)ρ(Y ), for 0 ≤ λ ≤ 1. Without loss of generality we can assume that ρ(X) ≥ ρ(Y ) and hence max (ρ(X), ρ(Y )) = ρ(X). We find that
ρ(λX + (1 − λ)Y ) ≤ λρ(X) + (1 − λ)ρ(Y ) ≤ λρ(X) + (1 − λ)ρ(X) = ρ(X) = max (ρ(X), ρ(Y ))
From which we can conclude that convexity of a monetary risk measure implies quasi convexity. Now consider a monetary risk measure satisfying quasi convexity. For all X,Y ∈ X we can define X0 := X+ρ(X) and Y 0 := Y +ρ(Y ). Then it is clear that X0,Y 0 ∈ X . Without loss of generality we can suppose that ρ(Y 0) ≥ ρ(X0). Because ρ is quasi convex we have that ρ(λX0 + (1 − λ)Y 0) ≤ ρ(Y 0). Rewriting this expression in terms of X and Y we find that ρ (λX + λρ(X) + (1 − λ)Y + (1 − λ)ρ(Y )) ≤ ρ (Y + ρ(Y )). Now using the fact that ρ satisfies translation invariance we have that
ρ (λX + (1 − λ)Y ) − λρ(X) − (1 − λ)ρ(Y ) ≤ ρ(Y + ρ(Y )) = ρ(Y ) − ρ(Y ) = 0
We can conclude that ρ(λX + (1 − λ)Y ) ≤ λρ(X) + (1 − λ)ρ(Y ) for all X,Y ∈ X , i.e. ρ is a convex risk measure.
We can define a special subclass of convex risk measures using the notion of posi- tive homogeneity. Consider an investor who invests his wealth using an investment strategy that replicates X, with an associated risk ρ(X). If he only invests a frac- tion λ of his wealth in the same investment strategy he will obtain λX, with an associated risk of ρ(λX). If this risk equals the proportional risk of the initial investment, we say that the risk measure satisfies the property of positive homo- geneity. Property 5. (Positive Homogeneity) If λ ≥ 0, then ρ(λX) = λρ(X) Definition 1.3. A coherent risk measure is a convex risk measure satisfying pos- itive homogeneity Coherent risk measures can also be defined by using the sub-additivity property. If a risk measure is sub-additive, you can decentralize the task of managing the risk of different positions. Consider an investor who has invested his wealth in a contingent claim X + Y . If the risk measure is sub-additive this will never be greater than ρ(X) + ρ(Y ). Property 6. (Sub-additivity) ρ(X + Y ) ≤ ρ(X) + ρ(Y ) It is stated in [11] that a coherent risk measure is a monetary risk measure sat- isfying positive homogeneity and sub-additivity. We now prove this equivalent definition.
10 Lemma 1.3. For a monetary risk measure that satisfies positive homogeneity, the convexity property is equivalent to the sub-additivity property.
Proof. First assume ρ is sub-additive, X,Y ∈ X , and 0 ≤ λ ≤ 1. We find that:
ρ(λX + (1 − λ)Y ) ≤ ρ(λX) + ρ((1 − λ)Y ) = λρ(X) + (1 − λ)ρ(Y ).
The first inequality follows from the fact that ρ is sub-additive, the second equality uses the assumption that ρ satisfies positive homogeneity. Note that λX ∈ X and (1 − λ)Y ∈ X because of the assumed linearity of X . 0 1 Now assume ρ is convex. Then for a fixed λ, 0 < λ < 1, define X := λ X and 0 1 0 0 Y := (1−λ) Y . Notice that X ,Y ∈ X . It follows from the convexity property and the positive homogeneity that
ρ(X + Y ) = ρ(λX0 + (1 − λ)Y 0) ≤ λρ(X0) + (1 − λ)ρ(Y 0) = ρ(λX0) + ρ((1 − λ)Y 0) = ρ(X) + ρ(Y ).
This proves that ρ satisfies the sub-additive property.
So far we have defined a coherent risk measure as a risk measure which satisfies the following four properties:
1. (Monotonicity) If X ≤ Y , then ρ(X) ≥ ρ(Y ).
2. (Translation invariance) If m ∈ R, then ρ(X + m) = ρ(X) − m.
3. (Positive homogeneity) If λ ≥ 0, then ρ(λX) = λρ(X).
4. (Convexity)ρ(λX + (1 − λ)Y ) ≤ λρ(X) + (1 − λ)ρ(Y ), for 0 ≤ λ ≤ 1.
Where the convexity property can be replaced by the subadditivity propery. How- ever some author’s like [1] and [2] use a positivity axiom instead of the monotonicity axiom.
Property 7. (Positivity) ∀X ≥ 0 ⇒ ρ(X) ≤ 0.
In general positivity and monotonicity are not equivalent. However it turns out that when a risk measure satisfies the positive homogeneity property and the sub- additivity property they are. The reason for using the positivity property instead of monotonicity property, is that the positivity property is often easier to prove.
Lemma 1.4. If a risk measure is translation invariant, sub-additive and positive homogeneous, then it is positive if and only if it is monotone.
11 Proof. First suppose the risk measure is positive homogeneous, translation invariant,sub- additive and positive. Because of positivity we have that
(X − Y ) ≥ 0 ⇒ ρ(X − Y ) ≤ 0. (1.2)
Using the sub-additivity property we find that
ρ(X) = ρ(X − Y + Y ) ≤ ρ(X − Y ) + ρ(Y ). (1.3)
Combining equation 1.2 and equation 1.3 we find that
X ≥ Y ⇒ ρ(X) ≤ ρ(Y ). (1.4)
We conclude that the risk measure is monotone. Now suppose the risk measure is positive homogeneous, translation invariant, sub-additive and monotone, we need to show that it is positive. Using monotonicity we find that
X ≥ 0 ⇒ ρ(X) ≤ ρ(0). (1.5)
Using positive homogenity we find that for all λ > 0,
ρ(0) = ρ(λ0) = λρ(0). (1.6)
Because this is true for all λ > 0 we can conclude that ρ(0) = 0. Using equation 1.5 we can conclude that X ≥ 0 ⇒ ρ(X) ≤ 0. (1.7) This proves positivity.
Remark 1.1. Using lemma 1.3 and lemma 1.4 we see that a risk measure is coherent if and only if it satisfies the following properties for X,Y ∈ X .
1. (Positivity) X ≥ 0 ⇒ ρ(X) ≤ 0.
2. (Sub-additivity) ρ(X + Y ) ≤ ρ(X) + ρ(Y ).
3. (Positive homogeneous) ∀λ > 0 ρ(λX) = λρ(X).
4. Ttranslation invariant) ∀m ∈ R ρ(X + m) = ρ(X) − m.
1.2 The acceptance set of a risk measure
In the previous section we interpreted ρ(X) as the amount of capital which, if invested in a risk-free manner, makes the position X acceptable. In this section we will define the acceptance set of a risk measure. This is the set of all positions which do not require surplus capital. We will also demonstrate the relationship between the properties of the risk measure and the corresponding acceptance set.
12 Definition 1.4. The acceptance set induced by a monetary risk measure ρ is de- fined by Aρ := {X ∈ X |ρ(X) ≤ 0}. (1.8) The following theorem was taken from [11] and proves that there is a clear con- nection between the properties of a monetary risk measure and the associated acceptance set. We have worked out the proof.
Theorem 1.1. If ρ is a monetary risk measure with acceptance set A := Aρ then
1. A is non-empty.
2. A is closed in X with respect to the supremum norm k · k.
3. inf{m ∈ R|m ∈ A} > −∞. 4. X ∈ A, Y ∈ X , Y ≥ X ⇒ Y ∈ A.
5. ρ can be recovered from A:
ρ(X) = inf{m ∈ R|m + X ∈ A}. (1.9)
6. If ρ is a convex risk measure, then A is a convex set.
7. If ρ is positively homogeneous, then A is a cone. In particular is ρ is a coherent risk measure, A is a convex cone.
Proof. 1. Consider m = ρ(0), then m ∈ X . We will prove that m ∈ A. m ∈ A ⇔ ρ(m) ≤ 0 ⇔ ρ(0) − m ≤ 0 ⇔ ρ(0) ≤ m.
1 2. Consider a sequence Xn ∈ A such that Xn → X . We need to prove that X ∈ A. Suppose ρ(X) > 0 then ∃c > 0 : |ρ(Xn) − ρ(X)| > c but using lemma 1.1 we have that kXn − Xk ≥ |ρ(Xn) − ρ(X)| > c > 0. If Xn converges to X in the supremum norm the left-hand side goes to 0. This gives us a contradiction. Hence ρ(X) ≤ 0, and therefore ρ(X) ∈ A.
3. ∀m ∈ A we have: m ∈ A ⇔ ρ(m) ∈ A ⇔ ρ(0) − m ≤ 0 ⇔ ρ(0) ≤ m. Hence ρ(0) is a lower bound for all m ∈ A. This concludes the proof since we supposed ρ(0) is finite for a monetary risk measure.
4. We know that X ∈ A ⇒ ρ(X) ≤ 0 and using monotonicity Y ≥ X ⇒ ρ(Y ) ≤ ρ(X). Combining those two facts we find that ρ(Y ) ≤ ρ(X) ≤ 0. Finally we can conclude that Y ∈ A.
5. Notice that inf{m ∈ R|ρ(m + X) ≤ 0} = inf{m ∈ R|ρ(X) ≤ m} = ρ(X). 6. We need to prove that ∀X,Y ∈ A and ∀λ ∈ [0, 1] we have that λX + (1 − λ)Y ∈ A. It is sufficient to prove that ρ (λX + (1 − λ)Y ) ≤ 0. Since X,Y ∈ A, and λ ∈ [0, 1] we have λρ(X) ≤ 0 and (1 − λ)ρ(Y ) ≤ 0. Since ρ is convex we have ρ (λX + (1 − λ)Y ) ≤ λρ(X) + (1 − λ)ρ(Y ) ≤ 0. This is what we needed to prove.
1convergence with respect to the supremum norm k · k
13 7. To prove that A is a cone it is sufficient to prove that ∀X ∈ X and ∀λ ≥ 0, we have that λX ∈ A. Because ρ is positively homogeneous we have that λX ∈ A ⇔ ρ (λX) ≤ 0 ⇔ λρ (X) ≤ 0 ⇔ ρ(X) ≤ 0 ⇔ ρ(X) ∈ A. This proves that A is a cone. From the above proofs it follows directly that if ρ is a coherent risk measure then A is a convex cone.
In 1.4 we defined for each monetary risk measure the associated acceptance set. We can also do the opposite and define for each acceptance set an associated risk measure.
Definition 1.5. ρA(X) := inf{m ∈ R|m + X ∈ A} This is a very intuitive definition for a risk measure. If X is a financial position, then ρA(X) is the minimal amount of money needed to make the position X acceptable. Next theorem will show that the properties of the acceptance set are linked to the properties of the associated risk measure. This theorem was found in [11] and we have worked out the proof. Theorem 1.2. If A is a non-empty subset of X such that properties 3 and 4 from theorem 1.1 are both satisfied, Then the functional ρA has the following properties
1. ρA is a monetary risk measure.
2. If A is a convex set, then ρA is a convex risk measure.
3. If A is a cone, then ρA is positively homogeneous. In particular if A is a convex cone then ρA is a coherent risk measure.
4. A ⊆ AρA , and A = AρA if and only if A is k · k-closed in X .
Proof. 1. To prove that ρA is a monetary risk measure, we need to check that ∀X ∈ X ρA(X) is finite and that, ρA(X) satisfies monotonicity and transla- tion invariance.
(translation invariance) We need to prove that for X ∈ X and m ∈ R ρA(X + m) = ρA(X) − m. This follows almost immediately from the properties of the infimum, since ρA(X) − m = inf{l ∈ R|l + X ∈ A} − m = inf{l ∈ R|l + X + m ∈ A} = ρA(X + m) (monotonicity) X ≤ Y ⇒ m + X ≤ m + Y ∀m ∈ R this implies that inf{m ∈ R|m+X ∈ A} ≥ inf{m ∈ R|m+Y ∈ A}. Using the definition of ρA we conclude that X ≤ Y ⇒ ρA(X) ≥ ρA(Y )
(ρA(X) is finite) Since A= 6 ∅, we can find a Y ∈ A. Fix this Y and let X ∈ X . From the assumptions on X we have that X and Y are both bounded, hence there exists a m ∈ R such that m + X > Y . Using that Y ∈ A, monotonicity and the translation invariance we find 0 ≥ ρA(Y ) ≥ ρA(X + m) = ρA(X) − m. We conclude that ∀X ∈ X ρA(X) ≤ m < +∞. Because we have assumed property 3 from theorem 1.1, we have ρA(0) > −∞. We need to prove that ρA(X) > −∞ ∀X ∈ X . Take m0 ∈ R such that X+m0 ≤ 0. Using translation invariance and 0 0 monotonicity we find that ρA(X + m ) = ρA(X) − m ≥ ρA(0) > −∞. From this we can conclude that for a random X ∈ X ρA(X) > −∞.
14 2. We need to prove that if A is convex, then ∀X1,X2 ∈ X and ∀λ ∈ [0, 1] ρA (λX1 + (1 − λ)X2) ≤ λρA(X1) + (1 − λ)ρA(X2). Because of translation invariance we find ∀i ∈ 1, 2 ρA (Xi + ρA(Xi)) = ρA(Xi) − ρA(Xi) = 0, hence ρA(Xi) + Xi ∈ A. Because A is a convex set we have λ (ρA(X1) + X1) + (1 − λ)(ρA(X2) + X2) ∈ A. Using this we find that
0 ≥ ρA (λ (ρA(X1) + X1) + (1 − λ)(ρA(X2) + X2))
= ρA (λX1 + (1 − λ)X2) − λρA(X1) − (1 − λ)ρA(X2).
From this we can conclude that ∀X1,X2 ∈ X , λ ∈ [0, 1], ρA (λX1 + (1 − λ)X2) ≤ λρA(X1) − (1 − λ)ρA(X2). Which is precisely what we needed to prove.
3. If A is a cone we need to prove that ∀X ∈ X and ∀λ ≥ 0, ρA(λX) = λρA(X). We first prove ρA(λX) ≤ λρA(X). We know that since ρA(X)+X ∈ A and A is a cone that λ (ρA(X) + X) ∈ A. Hence we have 0 ≥ ρA (λ (ρA(X) + X)) = ρA(λX) − λρA(X). This proves that ρA(λX) ≤ λρA(X). To prove the opposite inequality take m such that m < ρA(X). Then m + X/∈ A, which also implies that for λ ≥ 0 λm + λX∈ / A. Which is equivalent with λm < ρA(λX). We have that λm < λρA(X) ⇒ λm < ρA(λX) This can only be true if λρA(X) ≤ ρA(λX). Finally we can conclude that λρA(X) = ρA(λX).
4. First we’ll prove the inclusion A ⊂ AρA . For this take an X ∈ A then it is
clear that ρA(X) = inf{m ∈ R|m + X ∈ A} ≤ 0 and therefore X ∈ AρA .
Secondly from part 2 of theorem 1.1 we know that if A = AρA , then A is k · k-closed in X . Finally assume that A is k · k-closed in X . We need to
prove that AρA ⊂ A, hence we need to prove that X ∈ AρA ⇒ X ∈ A. This
is equivalent with X/∈ A ⇒ X/∈ AρA . Take an X/∈ A it is sufficient to prove that ρA(X) > 0. To prove this we need to take m > kXk. Since A is k · k-closed in X , X\A is k · k-open in X . Because X ∈ X \ A we can find a λ ∈ (0, 1) such that λm + (1 − λ)X/∈ A. Therefore we have
0 ≤ ρA(λm + (1 − λ)X) = ρA((1 − λ)X) − λm.
Because ρA is a monetary risk measure we can apply lemma 1.1. We find that |ρA((1 − λ)X) − ρA(X)| ≤ kX − λX − Xk = λkXk.
Using the two inequalities which we have obtained above, we can conclude that ρA(X) ≥ ρA((1 − λ)X) − λkXk ≥ λ(m − kXk) > 0.
This is precisely what we needed to prove.
We have connected the concepts of monetary risk measures, convex risk measures and coherent risk measures to the concept of the acceptance set. The acceptance contains all acceptable financial positions. But what is an acceptable position? This is subjective and can depend on the risk-aversion of the portfolio-holder. Or it could depend on regulations of a supervisory agency.
15 1.3 The penalty function
In 1921 the economist Frank Knight formulated a distinction between risk and uncertainty. Risk only applies to situations where, although we do not know the outcome of an event, we can accurately assign a probability measure to the differ- ent outcomes. This situation might occur when tossing a fair coin. Although you do not know if the coin will land head’s up or not, you know (with certainty) that 1 this will happen with probability 2 . Uncertainty in Knigth’s work is different. It applies to situations in which we do not have all the information to accurately assign a probability measure to the dif- ferent outcomes. This type of uncertainty, named after Knight, is called Knightian uncertainty. Knightian uncertainty is very common in real world situations. Con- sider for example the future return of a stock. The return of the stock is uncertain and we cannot accurately assign a probability measure to the different returns. From historical returns of the stock we could estimate such a probability measure. But would this be the correct probability measure? Obviously not. In this section we consider the case of Knightian uncertainty where we have a measurable space (Ω, F) but without a fixed probability measure assigned to this space. Let X be the space of all bounded measurable functions on (Ω, F) endowed with the supremum-norm k · k. It is straightforward to show that X is a Banach space. Let M1 := M1 (Ω, F) be the set of all probability measures on (Ω, F) and denote with M1,f the set of all functions Q : F → [0, 1] with are normalized i.e. Q (Ω) = 1 and which are finitely additive. It is clear that M1 ⊂ M1,f and that the elements of M1,f are not necessarily probability measures since it is not guaran- teed that they satisfy σ-additivity. In the next section we use the notation EQ [X] R with Q ∈ M1,f for XdQ, where the integral is understood to be a Lebesgue integral and Q ∈ M1,f .
Definition 1.6. A penalty function for ρ on M1,f is a functional α : M1,f → R ∪ {+∞} such that inf α (Q) ∈ R. (1.10) Q∈M1,f Penalty functions are strongly linked to convex risk measures. Each penalty func- tion defines a convex risk measure and convex risk measures can be represented by using a penalty function. We will prove this in the next two theorems. Theorem 1.3. The functional
ρ(X) := sup (EQ [−X] − α (Q)) (1.11) Q∈M1,f defines a convex risk measure on X , such that ρ(0) = − inf α(Q). Q∈M1,f
Proof. For each Q ∈ M1,f we define for all X in X the functional ρQ(X) := (EQ [−X] − α (Q)). We will first show that ρQ satisfies monotonicity and transla- tion invariance. Monotonicity follows from X ≤ Y ⇒ −X ≥ −Y
⇒ EQ [−X] ≥ EQ [−Y ] ⇒ (EQ [−X] − α(Q)) ≥ (EQ [−Y ] − α(Q)) ⇒ ρQ(X) ≥ ρQ(Y ).
16 To prove that ρQ satisfies translation invariance take X ∈ X and m ∈ R. We have that
ρQ(m + X) = EQ [−(X + m)] − α(Q) = EQ [−X] − α(Q) − m = ρQ(X) − m. Where we have used that Q is normalized. We also want to prove that the functional ρQ is convex. From the proof of 1.2 we know that it is sufficient to prove that ∀X,Y ∈ X and ∀λ ∈ [0, 1] we have that ρQ (λX + (1 − λY ) ≤ max (ρQ(X), ρQ(Y )). We can assume without loss of generality that EQ [−X] ≤ EQ [−Y ], then ρQ(X) ≤ ρQ(Y ) and therefore max (ρQ(X), ρQ(Y )) = ρQ(Y ). We also have that
ρQ (λX + (1 − λ)Y ) = EQ [− (λX + (1 − λ)Y )] − α(Q) = λEQ [−X] + (1 − λ)EQ [−Y ] − α(Q) ≥ λEQ [−Y ] + (1 − λ)EQ [−Y ] − α(Q) = EQ [−Y ] − α(Q) = ρQ(Y ) = max (ρQ(X), ρQ(Y )) .
The properties monotonicity, translation invariance and convexity are satisfied for all Q ∈ M1,f . Hence we have that the functional defined by 1.11 also satisfies these properties since they are preserved when taking the supremum over all Q ∈ M1,f . Because of the definition of a penalty function and the fact that X ∈ X is bounded, we have that ρ(X) only takes finite values. We can conclude that ρ(X) is a convex risk measure. The fact that ρ(0) = − inf α(Q) follows immediately from the Q∈M1,f properties of supremum and infimum.
Next theorem will prove that we can represent each convex risk measure using a penalty function. The proof of this theorem is not easy and uses results from functional analysis. For the ease of the reader we give these results without proof.
Theorem 1.4. (Separating hyperplane theorem) In a topological vector space E, any two disjoint convex sets B and C, one of which has an interior point, can be separated by a non-z´ero continuous linear functional l on E, i.e.,
l(x) ≤ l(y) ∀x ∈ C, ∀y ∈ B. (1.12)
Proof. Without proof, see [11, p.508].
Theorem 1.5. (Riesz representation theorem) There is a one-to-one correspon- dence between the set of functions Q ∈ M1,f and linear continuous functionals l on X such that l(1) = 1 and l(X) ≥ 0 for X ∈ X . The correspondence is defined R by l(X) = EQ [X] = XdQ, ∀Q ∈ M1,f . Proof. Without proof see [11, p.506].
Theorem 1.6. Any convex risk measure ρ on X is of the form