Stochastic Modeling: From Rough Heston Model to Affine Volterra Equatio

Zixuan Wang, Yuanchun Ye

Mentor: Shuoqing Deng

Abstract In the famous Heston model, where the variance satisfies a CIR process, the characteristic function of the log-price can be calculated explicitly by the Riccati equation. When replacing the Brownian motion by the fractional Brownian motion, one now faces the Rough Heston model and difficulties arise because of the lack of Markovian structure. In this paper, we will first study the mechanism of calculating the characteristic function of the Rough Heston model using the convergence of a suitable class of Hawkes processes. Then, we will study the optimal consumption and stochastic control problems in the Heston framework and Affine Volterra framework by some specific examples.

Contents

1 Introduction 1

2 Local Volatility2

3 Heston model4 3.1 Heston Dynamics...... 4 3.2 The Heston PDE...... 4 3.3 The Call Price...... 5

4 Rough Heston Model6 4.1 Relationship between Hawkes process and Rough Heston model...... 6 4.2 The rough limits of Hawkes processes...... 7

5 Merton Problem8

6 Future Work 9

7 References 10

1 Introduction

In financial mathematics, the Black-Scholes Equation is a well-known partial differential equa- tion (PDE) that governs the evolution of price of a European call or put under Black-Scholes

1 model. The assumptions under the model state that the price of assets follows a geometric Brownian motion with constant drift and volatility. The Black-Scholes PDE is

∂U 1 ∂2U ∂U + σ2S2 + rS = rU, (1.1) ∂t 2 ∂S2 ∂S where U is the price of the as a function of stock price S and time t, r is the risk-free interest rate, and σ is the volatility of the stock. The derivation of this PDE can be obtained using pricing by replication, where we have assumed that the market is complete under Black-Scholes model. Here we are going to use 1 option and φ stocks to replicate cash X, which has the risk-free rate r. In continuous time, we have

Xt = u(t, St) + φSt, (1.2)

dXt = rXtdt. (1.3)

Under Black-Scholes model, we also have dSt = rStdt + σStdWt. Applying Ito’s formula to Xt, we have dxt

1 2 2 = utdt + uxdSt + σ St dt + φdSt 2 (1.4) 1 = (u + σ2S2)dt + (u + φ)(rS dt + σS dW ) t 2 t x t t t = rXtdt

Now, we need to match the coefficients before dt and dWt:

• Coefficient before dWt: ux + φ = 0,

1 2 2 • Coefficient before dt: ut + 2 σ St = rXt = r(u(t, St) + φSt).

Solving for φ and plug back to the second equation, we obtain the famous Black-Scholes PDE in equation(1.1). However, due to the stock market crash in 1987, the classical Black-Scholes model began to lose its validity. Traders realized that extreme events could happen and markets have a significant skew. As a result, the possibility for extreme events needed to be factored into option pricing, and volatility smiles started occurring. It shows that increases or decreases as options move more in-the-money (ITM) or out-of-the-money (OTM). In order to incorporate real-world situations like non-constant volatility into our volatility model, Dupire Local Volatility and Heston model was introduced. Moreover, rough Heston model will also be discussed in the following passage.

2 Local Volatility

The concept of a local volatility was developed when Bruno Dupire, Emanuel Derman and Iraj Kani noted that there is a unique diffusion process consistent with the risk neutral densities derived from the market prices of European options. They have observed that volatility could be a function of stock price and time. Thus, the Dupire Local Volatility in its most general form is(assume there is no dividend):

∂C 1 ∂2C ∂C = σ2K2 − rK . (2.5) ∂T 2 ∂K2 ∂K

2 The underlying St follows the process

dSt = rtStdt + σ(St, t)StdWt. (2.6)

We need the following preliminaries: R T • Discount factor P (t, T ) = exp(− t rsds). • Fokker-Planck Equation. Denote f(St, t) the probability density function of the underlying price St at time t. Then f satisfies the following equation: ∂f ∂ ∂2 = − [µSf(S, t)] + σ2S2f(S, t). (2.7) ∂t ∂S ∂S2 The implication of applying Fokker-Planck Equation in our model is that there are lots of liquid options (vanilla options) in the market, and we can use the price of the options to derive the volatility function, which is consistent with the option price. • time-t price of European call with strike K, denoted by C = C(S, K): Z ∞ + C = P (t, T )E[(ST − K) ] = P (t, T ) (ST − K)f(S, T )dS. (2.8) K

We need the following derivatives of the call C(St, t), which can be easily derived using equa- tion(2.8): ∂C Z ∞ = −P (t, T ) f(S, T )dS, (2.9) ∂K K ∂2C = P (t, T )f(K,T ), (2.10) ∂K2 ∂C Z ∞ ∂ = −rT C + P (t, T ) (ST − K) [f(S, T )]dS. (2.11) ∂T K ∂T ∂f Substituting the Fokker-Planck equation for ∂t at t=T, we have: Z ∞ 2 ∂C ∂ ∂  2 2  + rT C = P (t, T ) (ST − K)(− [µT Sf(S, T )] + 2 σ S f(S, T ) )dS. (2.12) ∂T K ∂S ∂S We need to evaluate two integrals from the above equation: Z ∞ ∂ I1 = µT (ST − K) [Sf(S, T )] dS, (2.13) K ∂S Z ∞ 2 ∂  2 2  I2 = (ST − K) 2 σ S f(S, T ) )dS. (2.14) K ∂S In order to solve these two integrals, we first need to utilize two identities, namely Z ∞ C K ∂C ST f(S, T )dS = − , (2.15) K P (t, T ) P (t, T ) ∂K 1 ∂2C f(K,T ) = . (2.16) P (t, T ) ∂K2 These two integrals can be easily evaluated by using these two identities and the integral by parts technique. Finally, we have obtained the solutions of the two integrals: −µ C µ K ∂C I = T + T , (2.17) 1 P (t, T ) P (t, T ) ∂K

3 σ2K2 ∂2C I = . (2.18) 2 P (t, T ) ∂K2 Plugging them back to equation(2.12), we have achieved the Dupire Local Volatility PDE. Moreover, we have solved a function of local volatility. Assuming zero interest rate and zero dividend yield, we have: ∂C σ2(K,T ) = ∂T . (2.19) 1 2 ∂2C 2 K ∂K2 A limitation for local volatility is that it makes the assumption that the volatility must has something to do with the stock price and time. Research works find that the volatility is more likely to move in a random process. So Heston model was introduced, in which the volatility follows .

3 Heston model 3.1 Heston Dynamics

The Heston model assumes that St follows a Black-Sholes type stochastic process, while the stochastic variance vt follows a Cox, Ingersoll, Ross process. Thus √ dSt = rStdt + vtStdW1,t (3.20) √ dvt = κ(θ − vt)dt + σ vtdW2,t

E[dW1,tdW2,t] = ρ dt

3.2 The Heston PDE Under the Heston model, the market is no longer complete as we assumed in the Black-Sholes model. To replicate an option, we need to construct a portfolio consisting of one option V = V (S, v, t), ∆ units of the stock S, and φ units of another option U = U(S, v, t). The value of the portfolio is: Π = V + ∆ S + φ U. Assuming the portfolio is self-financing, the change in the value of the portfolio is: dΠ = dV + ∆ dS + φ dU. (3.21) Apply Itˆo’sLemma to dV and dU, differentiating with respect to the variables t, S and v. Plug the results into (3.21), we get: n∂ V 1 ∂2 V ∂2 V 1 ∂2 V o dΠ = + vS2 + ρσ vS + σ2v dt+ ∂ t 2 ∂ S2 ∂ v∂ S 2 ∂ v2 n∂ U 1 ∂2 U ∂2 U 1 ∂2 U o φ + vS2 + ρσ vS + σ2v dt+ (3.22) ∂ t 2 ∂ S2 ∂ v∂ S 2 ∂ v2 n∂ V ∂ U o n∂ V ∂ U o + φ + ∆ dS + + φ dv. ∂ S ∂ S ∂ v ∂ v

In order for the portfolio to be hedged against movements and volatility in the stock, the last two terms of equation (3.22) containing dS and dv must be zero. Thus the hedge parameters are: ∂ V ∂ v φ = − ∂ U , ∂ v (3.23) ∂ U ∂ V ∆ = −φ − . ∂ S ∂ S

4 Plug the values in equation (3.23) into equation (3.22) gives us a form of dΠ without unknown parameters, which we write in a simplified form as dΠ = (A + φ B)dt. On the other hand, the portfolio must earn the risk free rate, r. Hence dΠ = rΠ dt. Combining the two equations of dΠ, we get: ∂ V ∂ U A − rV + rS ∂ S B − rU + rS ∂ S ∂ V = ∂ U , (3.24) ∂ v ∂ v where ∂ V 1 2 ∂2 V ∂2 V 1 2 ∂2 V • A = ∂ t + 2 vS ∂ S2 + ρσ vS ∂ v∂ S + 2 σ v ∂ v2 , ∂ U 1 2 ∂2 U ∂2 U 1 2 ∂2 U • B = ∂ t + 2 vS ∂ S2 + ρσ vS ∂ v∂ S + 2 σ v ∂ v2 .

By observation, both sides of equation (3.24) can be written as a function f(S, v, t) of S, v, and t. Following Heston assumption, specify the function as f(S, v, t) = −κ(θ − v) + λ(S, v, t), where λ(S, v, t) is the price of volatility risk. Substitute f(S, v, t) into the left-hand side of equation (3.23) and substitute for B. To get the log price, let x = lnS. We get the Heston PDE of the log price version.

∂ U 1 ∂2U 1 ∂ U ∂2U 1 ∂ U ∂U + v + (r − v) + ρσ v + σ2v − rU + [κ(θ − v) − λv] = 0, (3.25) ∂ t 2 ∂ x2 2 ∂ x ∂ v∂ x 2 ∂ v2 ∂v where the market price of the risk is a linear function of the volatility, following Heston. So that λ(S, v, t) = λv.

3.3 The Call Price The form of call price is

xt −rτ Ct(K) = e P1(x, v, τ) − e KP2(x, v, τ), (3.26) where Pj(x, v, τ) each represents the probability of the call expiring in-the-money, conditional on xt and vt at time t, and τ = T − t is the time to expiration. The call price in equation (3.26) follows the PDE in equation (1.1). Compute the derivatives of call price in equation (3.26) and substitute into the PDE for C, Under Heston assumption, the characteristic functions for x = lnST have the following affine structure: f(φ; x, v) = exp(C(τ, φ) + D(τ, φ)v + iφx), (3.27) where C and D are coefficients. Applying two-dimension Feynman-Kac theorem, the characteristic function in (3.27) will follow the PDE equation

∂f 1 ∂f ∂f 1 ∂2f 1 ∂2f ∂2f − + (r − v) + κ(θ − v) + v + σ2v + σρv = 0. (3.28) ∂τ 2 ∂x ∂v 2 ∂x2 2 ∂v2 t ∂x∂v

Compute the derivatives in equation (3.28), drop the f terms, and re-arrange to obtain two differential equations ∂D 1 1 1 = ρσiφD − φ2 + σ2D2 − φ − (κ + λ)D, ∂τ 2 2 2 (3.29) ∂C = riφ + aD. ∂τ

5 The first formula in equation (3.29) fulfills the Riccati equation. This simplifies the problem since Riccati equation is solvable. We solve for the equation and get the call price

a 1 − gedτ C = rφτ + [(b − ρσiφ + d)τ − 2ln( )], (3.30) σ2 1 − g where • a = κθ, p • d = (ρσφ − b)2 − σ2(2uφ − φ2), and b−ρσφ+d • g = b−ρσφ−d .

4 Rough Heston Model

Since Heston model failed to capture the heavy tail property in financial market, we want to calculate the characteristic function of rough heston model. However, due to lack of the Markovian structure of the rough heston model, we are not able to calculate its characteristic function directly. Therefore, we will rely on the characteristic function of Hawkes process, which is a well-known imgrant process, and investigate the correspondance between Hawkes process and rough heston model. We want to find a suitable class of Hawkes process that will converge in law under skorokhod topology to rough heston model. Combining all together, we are able to obtain the characteristic function for the log-price in rough heston model.

4.1 Relationship between Hawkes process and Rough Heston model We work on a sequence of probability spaces(ΩT , F T , PT ), indexed by T > 0 (going to infinity), on which N T = (N T,+,N T,−) is a bidimensional Hawkes process with intensity

 T,+   Z t T λt 1 T T λt = T,− =µ ˆT (t) + φ (t − s)dNs (4.31) λt 1 0

T,+ T,− Note that λt = λt . This is because that the stock price can jump up or down, and we assume that the density of jumping up equals to the desity of jumping down. T T For a given T , the probability space is equipped with the filtration (Ft )t≥0, where (Ft ) is the T σ-algebra generated by (Ns )s≤t. We have made a particular choice for the heavy-tailed functions defining φT as follows. Definition 4.1.1 There exists β > 0,1/2 < α < 1 and γ > 0 such that

−α T T aT = 1 − γT , φ = ϕ χ (4.32) where 1 1 β χ = , ϕT = a ϕ, ϕ = f α,1 (4.33) β + 1 1 β T with f α,1 the Mittag-Leffler density function that captures the heavy-tail property.

Now we spilt Hawkes process into drift and diffusion term, let us write

Z t T T,+ T,− T T Mt = (Mt ,Mt ) = Nt − λs ds (4.34) 0

6 T for martingale associated to the Nt . We easily obtain Z t Z t T,+ T T,+ 1 T T,+ T,− λt =µ ˆT (t) + φ (t − s)λs ds + φ (t − s)(dMs + βdMs ) (4.35) 0 1 + β 0 Now let X ψT = (φT )∗k, (4.36) k≥1

T ∗1 T T ∗k R t T T ∗(k−1) where (φ ) = φ and for k > 1, (φ ) (t) = 0 φ (s)(φ ) (t − s)ds. The point for introducing convolution is to convert it into multiplication by applying Laplace transform. So we get Z t Z t T,+ T 1 T T,+ T,− λt =µ ˆT (t) + ψ (t − s)ˆµT (s)ds + ψ (t − s)(dMs + βdMs ). (4.37) 0 1 + β 0

α−1 The inhomogeneous intensityµ ˆT (t) should be of order µt with µT = µT , where µ is some T 1−aT T,+ positive constant. Define renormalized intensity C = λt . t µT T After some computations, we have Z t Z t q T 1 − aT T µˆT (T s) T T T Ct = µˆT (tT )+ T (1 − aT )ψ (T (t − s)) ds+υ T (1−aT )ψ (T (t−s)) Cs dBs , µT 0 µT 0 (4.38) where s Z tT dM T,+ + βdM T,− 1 + β2 BT = s s , υ = . (4.39) t q 2 0 T,+ 2 T,− γµ(1 + β) T (λs + β λs ) Using Laplace transform, we have the following expression for CT : Z t Z t q T 1 − aT α,γ µˆT (T s) α,γ T T Ct = µˆT (tT ) + aT f (t − s) ds + υ aT f (t − s) Cs dBs , (4.40) µT 0 µT 0

From the computations in the proof of Theorem 4.2.1, we derive the right choice ofµ ˆT . Definition 4.1.2 The baseline intensityµ ˆT is given by Z t Z t 1 T T µˆT (t) = µT + ξµT ( (1 − φ (s)ds) − φ (s)ds), (4.41) 1 − aT 0 0

α−1 with ξ > 0 and µT = µT for some µ > 0.

4.2 The rough limits of Hawkes processes We now state the limiting behavior of our specified bidimensional nearly unstable Hawkes pro- cesses with heavy tails. For t ∈ [0, 1], we define

Z tT r α T 1 − aT T T 1 − aT T T T µ T T Xt = α NtT , Λt = α λs ds, Zt = (Xt − Λt ). (4.42) T µ T µ 0 1 − aT

T T T Here, Xt is the rescaled Hawkes process, Λt is the rescaled drift term, and Zt is the rescaled diffusion term. T T T Theorem 4.2.1 As T −→ ∞, under definition 4.1.1. and 4.1.2., the process (Λt ,Xt ,Zt )t∈[0,1] converges in law for the Skorokhod topology to (Λ,X,Z), where

Z t   Z t  1  1 p dBs Λt = Xt = Ysds ,Zt = Ysds 2 (4.43) 0 1 0 dBs ,

7 and Y is the unique solution of the rough stochastic differential equation s Z t 2 Z t 1 α−1 1 + β 1 α−1p Yt = ξ + (t − s) γ(1 − Ys)ds + γ 2 (t − s) YsdBs, (4.44) Γ(α) 0 γµ(1 + β ) Γ(α) 0 where B1 + βB2 B = (4.45) p1 + β2 and (B1,B2) is a bidimensional Brownian motion. Furthermore, for any  > 0, Y has Holder regularity α − 1/2 − . Thanks to Theorem 4.2.1, we are now able to build such microscopic processes converging to the log-price. More precisely, for θ > 0, let us define: r θ r1 − a θ 1 − a P T = T (N T,+ − N T,−) − T N T,+. (4.46) 2 T αµ T T 2 T αµ T We have the following corollary of Theorem 4.2.1. Corollary 4.2.2 As T −→ ∞, under definition 4.1.1. and 4.1.2., the sequence of processes T (Pt )t∈[0,1] converges in law for the skorokhod topology to

t t Z p 1 Z Pt = VsdW s − Vsds, (4.47) 0 2 0 where V is the unique solution of the rough stochastic differential equation s Z t 2 Z t 1 α−1 θ(1 + β ) 1 α−1p Yt = θξ + (t − s) γ(θ − Vs)ds + γ 2 (t − s) VsdBs, (4.48) Γ(α) 0 γµ(1 + β ) Γ(α) 0

with (W,B) a correlated bidimensional Brownian motion whose bracket satisfies

1 − β d < W, B >t= dt. (4.49) p2(1 + β2)

Thus, we have successfully built a sequence of microscopic process P T , defined by (4.47), which converges to the logarithm of our rough Heston process of interest.

5 Merton Problem

Merton problem is a decision made by an investor to choose how much to consume and also allocates his invest portfolio, given a known amount of wealth at the beginning. The investor gains utility from both the consumption activity and the investment returns. Mathematically, Merton problem is an optimal control problem in which an investor is given the state variable (Xt) and control variables (π, C). The objective for Merton problem is to maximize expected utility.

Here we write out the Supreme expectation (the objective function) for the investor: Z ∞ −βt max E[ e U(ct)dt], (5.50) (π,c) ∈ Π×C 0

x1−γ where the utility function is: U(x) = 1−γ .Π × C represents the product space of admissible strategies for the investor, where Π represents all of the possible investment portfolios, and C

8 represents the possible range of consumption. Note that in our model, we assume that the utility function is of power form. There are also other forms of utility function, including natural log, exponential, etc.

We apply the Heston assumptions in the Merton problem. Under Heston assumption (), the SED for the stock is:

dS0 = rS0dt, t t √ dSt = rStdt + vtStdW1,t, (5.51) √ dvt = κ(θ − vt)dt + σ vtdW2,t. The portfolio process is:

dXt = rXtdt + πt(µ − r)dt + πtσdWt − ctdt. (5.52)

What left for us to do is to apply dynamic programming principle to solve for the HJB equation. Then do the verification part. It is the extension of Feynman-Kac Theorem. Due to the limitation of time, we don’t have time to finish this part. Hopefully we would complete it in the future.

6 Future Work

Our research could also be extended as studied under affine Volterra process. The concept of Affine Volterra process is developed by Jaber, Larsson, and Pulido (2019), which is defined as solutions of certain stochastic convolution equations with affine coefficients. In the model, we need to switch the kernel of fractional Brownian motion to a more general form. Affine Voleterra processes are neither , nor Markov processes, making it im- possible to solve for the exponential function as we did in precious parts. To overcome these obstacles, we would introduce the idea of limit process. Also, because of the time limitation, we can’t finish this part.

9 7 References

[1] Bingyan Han, Hoi Ying Wong, Merton’s portfolio problem under Volterra Heston model, Finance Research Letters, 2020, 101580, ISSN 1544-6123. [2] Dupire, B.. “Pricing with a Smile.” (1994). [3] El Euch, O, Rosenbaum, M. The characteristic function of rough Heston models, Mathemat- ical Finance. 2019; 29: 3– 38. [4] Gatheral, J. (2006). The Volatility Surface: A Practitioner’s Guide. New York: John Wiley Sons. [5] Kamp, R. V. D.. “Local volatility modelling.” (2009). [6] Lewis, A.L.(2000). Option Valuation Under Stochastic Volatility: With Mathematical Code. Finance Press. [7] Steven L. Heston, A Closed-Form Solution for Options with Stochastic Volatility with Ap- plications to Bond and Currency Options, The Review of Financial Studies, Volume 6, Issue 2, April 1993, Pages 327–343.

10