INTRADAY VOLATILITY SURFACE CALIBRATION
Master Thesis
Tobias Blomé & Adam Törnqvist
Master thesis, 30 credits Department of Mathematics and Mathematical Statistics Spring Term 2020
Intraday volatility surface calibration Adam T¨ornqvist,[email protected] Tobias Blom´e,[email protected]
c Copyright by Adam T¨ornqvist and Tobias Blom´e,2020
Supervisors: Jonas Nyl´en Nasdaq Oskar Janson Nasdaq Xijia Liu Department of Mathematics and Mathematical Statistics
Examiner: Natalya Pya Arnqvist Department of Mathematics and Mathematical Statistics
Master of Science Thesis in Industrial Engineering and Management, 30 ECTS Department of Mathematics and Mathematical Statistics Ume˚aUniversity SE-901 87 Ume˚a,Sweden
i Abstract
On the financial markets, investors search to achieve their economical goals while simultaneously being exposed to minimal risk. Volatility surfaces are used for estimating options’ implied volatilities and corresponding option prices, which are used for various risk calculations.
Currently, volatility surfaces are constructed based on yesterday’s market in- formation and are used for estimating options’ implied volatilities today. Such a construction gets redundant very fast during periods of high volatility, which leads to inaccurate risk calculations.
With an aim to reduce volatility surfaces’ estimation errors, this thesis explores the possibilities of calibrating volatility surfaces intraday using incomplete mar- ket information. Through statistical analysis of the volatility surfaces’ historical movements, characteristics are identified showing sections with resembling mo- tion patterns. These insights are used to adjust the volatility surfaces intraday.
The results of this thesis show that calibrating the volatility surfaces intraday can reduce the estimation errors significantly during periods of both high and low volatility. However, these results highly depend on the conducted choices when constructing and analyzing the volatility surfaces which leave room for further reasearch.
ii Sammanfattning
F¨or investerare p˚afinansmarknader v¨arlden ¨over ¨ar m˚alet att n˚asina ekonomiska m˚almed s˚al˚agrisk som m¨ojligt. D¨arf¨or ¨ar korrekta och precisa riskber¨akningar av h¨ogsta prioritet. Volatilitetsytor anv¨ands vid riskber¨akningar f¨or att estimera optionspriser och optioners implicita volatiliteter. Idag konstrueras volatilitet- sytor baserat p˚amarknadsinformation fr˚anen dag och anv¨ands f¨or estimation n¨asta dag. Under perioder av h¨og volatilitet blir denna sorts konstruktion l¨att inaktuell, vilket leder till felaktiga riskber¨akningar.
M˚aletmed detta examensarbete var att reducera en volatilitetsytas estima- tionsfel. Detta genoma att utforska m¨ojligheten att kalibrera en volatilitetsyta baserat p˚al¨opande information under dagen. Genom att analysera hur vola- tilitetsytor r¨or sig ¨over tid identiferades karakt¨arsdrag och m¨onster som kan anv¨andas f¨or att kalibrera volatilitetsytor l¨opande under en dag.
Resultatet i detta examensarbete visar att kalibrering av volatilitetsytor int- ra dag kan reducera volatilitetsytors estimationsfel oavsett perioder av h¨og eller l˚agvolatilitet. Detta resultat ¨ar dock beroende av hur volatilitetsytorna ¨ar ska- pade och analyserade, vilket ger utrymme f¨or vidare studier inom omr˚adet.
iii Acknowledgements
Firstly, we would like to thank Nasdaq for bringing the idea of this thesis to us and letting us do our thesis at the Ume˚aoffice. Under the circumstances of the COVID-19 pandemic, we are deeply thankful for letting us use Nasdaq’s equipment to complete this thesis from home.
Secondly, we thank Jonas Nyl´enand Oskar Janson at Nasdaq for providing valuable insights, discussions and stunning supervision. We would also like to direct our gratitude to Markus Nyberg for useful expertise and assistance on how to write an academical report.
Thirdly, we would like to thank our supervisor Xijia Liu, at Ume˚aUniver- sity, for knowledge on data analysis and remarks on the structure and content in this thesis.
Finally, we direct a thank you to our fiancees Victoria Bertilsson and Melina Ahlenius˚ for your support. Without you we would probably still be analysing the volatility surfaces. A special thanks to you Victoria for letting us redo the apartment to a home office and to you Melina, for your help in report writing and the idea proposal of dividing the volatility surfaces into sections.
//Adam & Tobias
iv Contents
1 Introduction 1 1.1 Background ...... 1 1.2 Problematization ...... 3 1.3 Project goal ...... 4 1.4 Datasets ...... 4 1.4.1 LME Copper ...... 4 1.4.2 WTI NYMEX ...... 4 1.5 Limitations ...... 5 1.6 Literature review ...... 5 1.7 Software ...... 5
2 Theory 6 2.1 Options ...... 6 2.2 Implied volatility ...... 7 2.3 The volatility surface ...... 8 2.4 Moneyness ...... 9 2.5 Total implied volatility and variance ...... 9 2.6 Construction of a volatility surface ...... 10 2.7 Arbitrage ...... 12 2.7.1 Arbitrage conditions for options ...... 13 2.7.2 Arbitrage conditions for implied volatility ...... 14 2.8 K-Means clustering ...... 15 2.9 Transition matrix ...... 16
3 Method 17 3.1 Calculate implied volatility ...... 17 3.2 Arbitrage: tests and how to eliminate it ...... 17 3.2.1 Calendar spread arbitrage ...... 18 3.2.2 Butterfly arbitrage ...... 18 3.3 Construction of a volatility surface ...... 19 3.3.1 Calculate change and error between market data and the surface ...... 25 3.4 Analysis of the surface ...... 25 3.4.1 Part 1: Implied volatility time series analysis ...... 26 3.4.2 Part 2: Magnitude of change analysis ...... 28 3.5 Intraday calibration ...... 30 3.6 The Intraday Calibration Model (ICM) ...... 33 3.6.1 Model description ...... 33 3.6.2 Model evaluation ...... 34 3.7 Parallel shift ...... 35
v 4 Results 36 4.1 Review of assumptions and approach ...... 36 4.2 LME Copper ...... 36 4.3 WTI NYMEX ...... 40 4.4 Comparison and conclusion of results ...... 43 4.5 Volatile periods ...... 45
5 Discussion 47 5.1 Results ...... 47 5.2 Construction of the volatility surfaces ...... 47 5.3 Analysis of the volatility surface ...... 48 5.4 Future studies ...... 50
vi List of Figures
1 S&P 500 from April 28th 2015 to April 28th 2020...... 2 2 S&P 500’s intraday price process from March 23rd to March 25th 2020...... 3 3 Example of a volatility surface...... 9 4 A raw SVI parameterization fitted to market data...... 20 5 Visualization of Durrleman’s condition. Arbitrage opportunities are introduced as g(x) falls below the dashed line...... 21 6 Polynomial regression interpolation in the time to maturity di- rection for log-moneyness = 0...... 22 7 Volatility smiles for different time to maturities T where no cal- endar arbitrage opportunities are introduced...... 22 8 A volatility surface expressed in total implied variance, ω, in (a) and expressed in implied volatility, σimp, in (b)...... 24 9 An example of relative day-to-day changes, δ, within a volatility surface...... 26 10 Example of a regression model of 4th degree fitted to a time series (i,j) dt ...... 27 11 A volatility surface divided into sections using K-means clustering of y(i,j), here using K =6...... 28 (i,j) 12 Relative day-to day changes of the time series dt shown in Figure 10. The solid line represent the mean change over the time period...... 28 13 A section from Figure 11 divided into subsections using K-means based on standard deviation, here using K =4 ...... 29 14 Intraday calibration adjustments of a volatility surface based upon 20% of available data...... 32 15 Relative day-to-day changes within a volatility surface. This fig- ure should be examined as a reference to Figure 14...... 33 16 Daily TSS in the LME Copper volatility surface for the second year...... 37 17 Method performances for the first 30 days in the second year. . . 37 18 Error increases for the parallel shift. The occasions are indicated by red dots...... 38 19 Error increases for the Intraday Calibration Model. The occa- sions are indicated by red dots...... 39 20 Daily TSS in the WTI NYMEX volatility surface for the second year...... 40 21 Method performances for the first 30 days in the second year. . . 41 22 Error increases for the parallel shift. The occasions are indicated by red dots...... 42 23 Error increases for the Intraday Calibration Model. The occa- sions are indicated by red dots...... 42 24 Daily TSS in the volatility surfaces for the second year...... 43
vii 25 Relative day-to-day changes within a volatility surface...... 44 26 Method performances on LME Copper for the six highest volatile days in the second year...... 45 27 Method performances on WTI NYMEX for the six highest volatile days in the second year...... 46 28 Example of a volatility surface divided into sections and subsec- tions...... 50
viii List of Tables
1 Examples of estimated coefficients of the regression models fitted to different implied volatility time series...... 27 2 Example of a transformation matrix...... 30 3 Hyper-parameters of ICM for the LME Copper dataset...... 36 4 Method performances for LME Copper for the second year. . . . 38 5 Results on the occasions when error is increased for LME Copper. 39 6 Hyper-parameters of ICM for the WTI NYMEX dataset. . . . . 40 7 Method performances for WTI NYMEX for the second year. . . 41 8 Results on the occasions when error is increased for WTI NYMEX. 43 9 Method performances for the 95% quantile of the most volatile days of the second year...... 46
ix 1 INTRODUCTION
1 Introduction
This section provides a background story about the financial market and exam- ples of volatility in the market. Further in, problematization, project goal, the data available and limitations of this thesis are presented.
1.1 Background On the financial markets, henceforth the market, participants aim to achieve their economic goals while being exposed to minimal risk. As the rapid evo- lution of technology, there exists a wide ever-increasing set of assets, financial contracts and derivatives available on the market. An example of a financial contracts are options, which gives the owner the right, but not the obligation, to buy or sell an underlying asset at an agreed-upon price and/or date. The underlying asset can be anything from commodities like corn or wheat to a stock index, for example, the S&P 500. An option can be described as a ”financial insurance” since it has a limited downside. Consider buying a stock for $100 and an option giving you the right to sell the stock for $80 in the future. If the stock price falls below $80 in the future, the option limits the loss. Options are therefore often used as a tool for limiting the risk of investments.
In finance, the risk is defined in terms of the volatility σ, which is the degree of variation for an asset’s price process. It is measured by calculating the standard deviation of the returns for a given period of time. Recalling that standard deviation is the square-root of the variance, volatility represents how much an asset’s price swings around its mean price. An asset with high volatility is there- fore considered a higher risk since the price is expected to be less predictable. In Figure 1, the price process of the S&P 500 over the last five years is shown along with its corresponding logarithmic returns. It is clear to see that the volatility is not constant over time but fluctuates and that high volatility implies large movements in the price process. The abnormal movements in the price process and the logarithmic returns during 2020 is the COVID-19 pandemic. If one only examines the price process it seems like the S&P 500’s price decreases steady during this period and one could argue that the price is rather predictable - it will be less than the day before. However, examining the logarithmic returns as well one finds that there exists large positive return during the period, making the price process less predictable.
1 1 INTRODUCTION
(a) S&P 500’s price process. (b) S&P 500’s logarithmic returns.
Figure 1: S&P 500 from April 28th 2015 to April 28th 2020.
Imagine that we are financial experts and our task is to set a fair price on an option where the S&P 500 is the underlying asset, then we are interested in the volatility. The option is valuable only if the underlying asset reaches a certain price within a given period of time. The probability of reaching this level is higher if the volatility is high, which motivates for a higher option price. How- ever, if the price is too high the option will not sell and we will miss out on potential profits. On the other hand, if the price is too low we will lose money to smart investors who take advantage of this opportunity. But how do we de- termine the ”correct” volatility and the corresponding price? We can examine the asset’s historical volatility, ask what the market participants think is a rea- sonable price or examine the price history for similar contracts.
Theoretically, an option’s price is determined using a pricing formula with in- formation available in the option’s contract and the market as input, including the volatility of the underlying asset. The choice of formula to use depends on the option’s style but the most common one is Black-Scholes formula, originally derived for pricing European call options.
In practice, it is not as straightforward since the volatility of the underlying asset is unknown. However, from an existing option contract, one can derive the option’s implied volatility using a pricing formula in reverse. The pricing formula is used for various values of volatility until the theoretical price equal to the option’s market price is found. Implied volatility is called a forward-looking and subjective measure since it is derived from an existing option and current market conditions, being the volatility justifying the option’s market price.
As financial experts, we can now determine the fair price of an option by study- ing similar options’ contracts. But what if similar options don’t exist? The answer to this question is to use the volatility surface. Using existing option contracts we can determine their implied volatilities and then interpolate be- tween these discrete points to construct a surface, which can be used to find any option’s implied volatility and the corresponding price. It sounds simple in
2 1 INTRODUCTION theory, but in reality it is not.
For a volatility surface to be trustworthy, there must exist options for vari- ous contract specifications and these need to be frequently and recently traded. One common way to create a volatility surface is to use option trades for a whole day to represent the next upcoming day’s prices and market conditions. However, during periods of very high volatility, such as the financial crisis or COVID-19 pandemic, yesterday’s market conditions do not hold for today.
In Figure 2, the intraday price process of S&P 500 is shown for March 23rd and 24th, two days during the COVID-19 pandemic with very high volatility. Considering these two days, imagine once again that we are financial experts but our task now is to construct a volatility surface based on the market condi- tions of March 23rd that will be used for estimation on March 24th. If we only examine the price process, we can safely state that something changed in the market conditions between the close on March 23rd and the opening on March 24th. This illustrates how yesterdays prices and market conditions are unable to represent today and the need for methods to adjust the volatility surface intraday using incomplete market information.
Figure 2: S&P 500’s intraday price process from March 23rd to March 25th 2020.
1.2 Problematization The problem with intraday adjustments is two-fold, there are fewer option trades and these are not synchronized. It doesn’t hold to adjust the volatility surface unless the new option trades reflect the current market conditions and that the volatility surface’s market fit is improved.
At Nasdaq, where this thesis work is conducted, the front office and risk man- agement functions require accurate estimates of option portfolios intraday to
3 1 INTRODUCTION properly value options and measure risk from trading activities. Nasdaq has implemented end-of-day volatility surface generation for equity options as part of a product line offering but has not yet extended it with intraday calibration [13]. End-of-day volatility surfaces might not be sufficient during periods of very high volatility and using incorrect volatility surfaces for risk calculations may give rise to significant inaccuracies. One should also remember that dur- ing periods of high volatility it is of most importance that risk calculations are accurate and trustworthy.
1.3 Project goal Under circumstances as in Figure 2, a volatility surface based upon yesterdays information is to a large extent redundant and needs to be adjusted intraday using today’s incomplete market data in order to be useful. The goal of this thesis is to find one or several approaches to calibrate a volatility surface intraday given incomplete market data.
1.4 Datasets This section provides a brief description of the datasets used for this thesis. All datasets are provided by the Nasdaq Ume˚aOffice but due to confidentiality they can not be disclosed in details.
1.4.1 LME Copper This dataset consists of end-of-day volatility surfaces for American options on copper over two years, traded at the London Metal Exchange [4]. It contains implied volatilities and corresponding market information for a uniform set of options with different strike prices and maturities. Due to no information about how the volatility surfaces are constructed, the data is treated as ”real” market data expressed in implied volatility. For more information about contract spec- ifications, visit the London Metal Exchange website at https://www.lme.com.
1.4.2 WTI NYMEX This dataset consists of end-of-day volatility surfaces for American options on Light Sweet Crude Oil Futures over two years, traded at the New York Mer- cantile Exchange [11]. It contains implied volatilities and corresponding market information for a uniform set of options with different strike prices and matu- rities. Due to no information about how the volatility surfaces are constructed, the data is treated as ”real” market data, expressed in implied volatility. For more information about contract specifications, visit the CME Group website at https://www.cmegroup.com.
4 1 INTRODUCTION
1.5 Limitations The data provided by the Nasdaq Ume˚aOffice is not real market data but instead data of very high quality. This should be considered when examining the methods and results presented in this thesis. It has not been possible to access real data to validate the results of this thesis as an effect of the COVID-19 pandemic, which should be kept in mind when examining the results. On the other hand, we have tried to simulate the data as intraday market data to give the models in this thesis a representation as close as possible to reality.
1.6 Literature review Implied volatility and the nature of volatility surfaces has been well studied both by practitioners and scholars ever since the disclosure of the Black-Scholes formula in 1973 [6]. While there exist extensive research on methods for mod- elling implied volatility and constructing volatility surfaces, including both local volatility models and stochastic volatility models [10, 6, 1], the field of calibrat- ing the volatility surfaces intraday have remained rather unexplored. Prac- titioners have studied the possibility of moving volatility surfaces by parallel shifts, which has been proven to improve the volatility surfaces accuracy during periods of high volatility but fail when volatility is low [15]. Models with incon- sistent performances are not suitable for risk calculations, which are one of the volatility surfaces main usages, and it is therefore motivated to investigate the field of intraday calibration of the volatility surfaces further.
1.7 Software The software used for implementations throughout this thesis work was Python. The following packages was used:
(i) NumPy: Calculations. (ii) pandas: Data analysis. (iii) scikit-learn: Predictive data analysis. (iv) matplotlib: Visualization.
(v) SciPy: Statistical tools.
5 2 THEORY
2 Theory
This section provides the theory behind the methods used in this thesis. Start- ing off with how to determine the implied volatility of an option. Then, theory for constructing a volatility surface is given along with necessary arbitrage con- straints.
2.1 Options In Section 1.1 the concept of options was briefly introduced, in this section a more detailed explanation follows. An option gives the owner the right to buy or sell an underlying asset at an agreed-upon price, known as the strike price, at a later point in time, known as the maturity date. Options which give the owner the right to buy an underlying asset for the strike price are known as call options while options which give the owner the right to sell an underlying asset are known as put options. Consider a scenario where the price of the under- lying asset is higher than the strike price, then call option are denoted as ”in the money” and put options as ”out the money”. If the price of the underlying asset is less than the strike price, then call options are out the money and put options are in the money.
It exists various styles of options depending on the contract specifications but the two most common ones are European style or American style. If the options’ contract only allow the owner to exercise the options at the maturity date they are called European options. If it is allowed to exercise the options at any time up until the maturity date, they are called American options. The price of an European call option is determined by Black-Scholes formula. Definition 2.1.1. The price of an European call option with strike price K and time to maturity T is given by the Black-Scholes formula: −rT C = sN[d1] − Ke N[d2] (1) where s is the price of the underlying asset, N[·] is the cumulative distribution function for the normal distribution N(0,1), r is the risk free rate and ln(s/K) + (r − σ2/2)T d1 = √ σ T √ d2 = d1 − σ T. where σ is the volatility of the underlying asset. The payoff for an European call option is: max(s − K, 0), which can be written more compact as: (s − K)+. To price an European put option one can use the put-call-parity.
6 2 THEORY
Definition 2.1.2. Consider an European call option and an European put op- tion with strike price K and time to maturity T. Denoting the pricing functions by c(t,s) and p(t,s), the following relation exists:
p(t, s) = Ke−r(T −t) + c(t, s) − s known as the put-call-parity. The payoff for an European put option is:
max(K − s, 0), which can be written more compact as:
(K − s)+.
While European options have a closed formula for determining their prices, American options don’t. Due to the possibility to exercise the options at any time, the prices of American options change up until the maturity date and can’t be calculated explicitly but need to be approximated. One common method for approximating American option prices’ is by using a Binomial tree, which trace the option prices’ evolution in discrete time steps. This method is fully described in [8] along with other pricing methods.
2.2 Implied volatility The implied volatility of an option can be derived using Black-Scholes formula (1), or any other pricing formula, in reverse for various values of volatility until the theoretical price equals the option’s market price.
The Black-Scholes formula was derived with assumptions on the dynamics of the underlying asset, where one of the assumption is that the volatility of the underlying asset is assumed constant. Under this assumption one would expect the same implied volatility for different strike prices and maturities given the assumption of constant volatility. But when extracting the implied volatility from available market data variations depending on the strike price and time to maturity are often observed. For a detailed review of all assumptions made for the Black-Scholes formula, [2] is recommended. Since the implied volatility of an underlying asset varies for different strike prices and time to maturities, it is defined as below.
Definition 2.2.1. Implied volatility is a function of strike price K and time to maturity T : σimp = f(K,T |C, s, r) (2) where C, s and r are observed in the market.
7 2 THEORY
2.3 The volatility surface Given options available on the market for a certain underlying asset, the implied volatility can be derived for each option. This set of discrete data points can be interpolated to create a surface, known as the implied volatility surface, hence- forth volatility surface, which can be used to determine the implied volatility for any combination of strike price and time to maturity. An example of a volatility surface is shown in Figure 3, here visualized in log-moneyness instead of strike price, see Section 2.4 for further explanation.
Investigations of volatility surfaces have derived a handful general characteristics [5, 14]. In regards of the volatility surface’s general appearance, the following observed profile characteristics can be stated.
(i) The volatility surface has a smile profile in the strike price direction, known as a volatility smile. (ii) The volatility surface has a linear leaning profile in the time to maturity direction known as the term structure. (iii) The magnitude of the volatility smile decreases as time to maturity in- creases. Shorter maturities display pronounced smiles and longer maturi- ties give rise to shallow smiles.
Observations also show that the volatility surface changes over time and there- fore the following time dependent characteristics can be stated.
(i) Implied volatility has high positive auto-correlation and mean-reversion, known as volatility clustering. (ii) Implied volatility and returns in the underlying asset are negatively cor- related. This is known as the leverage effect.
(iii) The variance of daily variations in the surface can be described with very few principal components.
8 2 THEORY
Figure 3: Example of a volatility surface.
2.4 Moneyness When visualizing a volatility surface it is common to substitute the strike price direction to moneyness or log-moneyness, as shown in Figure 3. Definition 2.4.1. For an option with strike price K and underlying price s, moneyness is defined as: K x = s and the corresponding log-moneyness is defined as: K ln (x) = ln ( ). s Using moneyness or log-moneyness instead of strike price is convenient since it allows one to center the volatility surface around 1 or 0. This is more suitable when comparing surfaces of different underlying assets.
2.5 Total implied volatility and variance Implied volatility is often transformed into total implied variance since it allows for simplicity in calculations and expressions.
9 2 THEORY
Definition 2.5.1. For an option with implied volatility σimp and time to ma- turity T, the total implied volatility is defined as: √ Σ = σimp T and the total implied variance as:
2 ω = σimpT. (3)
2.6 Construction of a volatility surface There exist various methods for constructing volatility surfaces and the choice of the method to use is up to the practitioner. One commonly used method is the stochastic volatility inspired parameterization, henceforth denoted as the raw SVI parameterization. This method’s popularity among practitioners is due to its two key properties [7]:
2 (i) For a fixed time to maturity T , the implied variance σimp is linear in the moneyness x as |x| → ∞. (ii) It is easy to fit against options’ market prices whilst ensuring no calendar spread arbitrage.
The raw SVI parameterization can be used for constructing volatility surfaces slice-by-slice in the time to maturity direction. There exist extensions of the raw SVI parameterization method, for example the neutral SVI parameterization or the surface SVI parameterization presented in [7]. However, for this thesis the raw SVI parameterization, proposed in [12] will be used. Definition 2.6.1. The raw SVI parameterization of the total implied variance for a fixed time to maturity T is defined as:
ωSVI (x) = a + b(ρ(x − m) + p(x − m)2 + σ2, where x is moneyness and a, b, σ, ρ, m is the parameter set. Remark: The raw SVI parameter σ is not to be confused with the volatility of the underlying’s price process.
When adjusting the raw SVI parameters, the effects on the volatility smile are: (i) a changes the vertical translation of the volatility smile in the positive direction, (ii) b affects the angle between the put and call wing, (iii) ρ rotates the smile, (iv) m changes the horizontal translation of the smile,
10 2 THEORY
(v) σ reduces the at-the-money (ATM) curvature of the smile. These parameters can be difficult to understand and therefore the SVI-jump- wing (SVI-JW) parameters are introduced. Definition 2.6.2. The SVI-jump-wing (SVI-JW) parameterization expressed in terms of the raw SVI parameters, for a fixed time to maturity T is defined as: √ a + b(−ρm + m2 + σ2) v = , T T 1 b m ψT = √ ρ − √ , ωT 2 m2 + σ2 1 pT = √ b(1 − p), ωT 1 cT = √ b(1 + p), ωT 1 p vˆ = a + bσ 1 − ρ2, T T
where ωT = vT T . This parameterization depends explicitly on the time to maturity T and can be viewed as generalizing the raw SVI parameterization. The SVI-JW parameters have the following interpretations:
(i) vT gives the ATM implied total variance,
(ii) ψT gives the ATM skew,
(iii) pT gives the slope of the left wing (put options),
(iv) cT gives the slope of the right wing (call options),
(v)ˆvT is the minimum implied variance. √ If smiles scaled perfectly as 1/ ωT , these parameters would be constant, hence independent of T . This makes it easy to extrapolate the volatility surface for higher value on T . Also note that by definition, for any T > 0 we have:
∂σ (x, T ) ψ = imp T ∂x x=0 Assume that m 6= 0. For any T > 0, define the (T -dependence) quantities: Lemma 2.6.1. Assume that m 6= 0. For any T > 0, define the (T-dependence) quantities: √ 2ψ ω r 1 β = ρ − T T and α = sign(β) − 1, b β2
11 2 THEORY where we have further assumed that β ∈ [−1, 1]. Then, the raw SVI and SVI-JW parameters are related as follows: √ ωT b = (cT + pT ), 2 √ p ω ρ = 1 − T T , b p 2 a =v ˆT T − bσ 1 − ρ , (v − ψ )T m = T T , b{−ρ + sign(α)pq − α2 − αp1 − ρ2} σ = αm.
If m = 0, then the formula above for b, ρ and a still hold, but σ = (vT T − a)/b. Proof. This lemma is omitted to Gatheral’s work in [7]. The relationships between the raw SVI- and SVI-JW parameters are strong tools for calibrating the parameterization for a better market fit and to elimi- nate arbitrage possibilities. They are also key-components when calibrating the volatility surface intraday.
2.7 Arbitrage Arbitrage is a phenomenon that guarantees a positive payoff with zero proba- bility of a negative payoff for a portfolio [2]. It is defined as: Definition 2.7.1. An arbitrage possibility is a portfolio h with the properties
h Vt = 0, h VT > 0, with probability 1,
where V denotes the payoff and t < T . Arbitrage can be classified as static or dynamic. Dynamic arbitrage refers to a strategy that re-balances the portfolio over time while for static arbitrage, the portfolio remains unchanged [9].
In the aspect of a volatility surface it is important to handle static arbitrage to avoid miss-pricing of options. Gatheral gives in [7] a compact definition of a volatility surface free of static arbitrage: Definition 2.7.2. A volatility surface is free of static arbitrage if and only if the following conditions are satisfied.
(i) it is free of calendar spread arbitrage.
(ii) each volatility smile is free of butterfly arbitrage.
12 2 THEORY
The calendar spread arbitrage concerns arbitrage possibilities in the time to maturity direction of the volatility surface while the butterfly arbitrage con- cerns arbitrage possibilities in the moneyness direction. Before handling these conditions, arbitrage conditions for options and implied volatility are needed.
2.7.1 Arbitrage conditions for options Theorem 2.7.1. Let s > 0 be a constant and denote the price of an European call option by C(K,T ), where K is the strike price and T is the time to maturity. (a) Let C : (0, ∞) × [0, ∞) → R satisfy the following conditions.
(A1) (Convexity in K) C(·,T ) is a convex function, ∀T ≥ 0; (A2) (Monotonicity in T) C(·,K), is non-decreasing, ∀K > 0; (A3) (Large strike limit) lim C(K,T ) = 0, ∀T ≥ 0; K→∞ (A4) (Bounds) (s − K)+ ≤ C(K,T ) ≤ s, ∀K > 0,T ≥ 0; and (A5) (Expiry Value) C(K, 0) = (s − K)+, ∀K > 0. Then (i) the function Cˆ : [0, ∞)×[0, ∞) → R ( s, if K = 0 (K,T ) 7→ C(K,T ), if K > 0 satisfies assumptions (A1)-(A5) but with K ≥ 0 instead of K > 0; and (ii) there exists a non-negative Markov martingale X with the property that + Cˆ(K,T ) = E((Xt − K) |X0 = s) for all K,T ≥ 0. (b) All of the listed conditions in part (a) of this theorem are necessary prop- erties of Cˆ for it to be the conditional expectation of a call option under the assumption that X is a (non-negative) martingale. Remark: (s − K)+ is the payoff-function of an European call option.
13 2 THEORY
2.7.2 Arbitrage conditions for implied volatility The following arbitrage conditions follow the ones stated in Theorem 2.9 in [16]. These conditions handle static arbitrage and they are applied for total implied volatility. √ K Theorem 2.7.2. Let s > 0, x = ln ( s ) and Σ(x, T ) = σimp(x, T ) T satisfy the following conditions:
(i) (Smoothness) for every T > 0, Σ(x, T ) is twice diffentiable with respect to x. (ii) (Positivity) for every x ∈ R and T > 0,
Σ(x, T ) > 0.
(iii) (Durrleman’s Condition) for every T > 0 and x ∈ R, x∂ Σ2 1 0 ≤ 1 − x − Σ2(∂ Σ)2 + Σ∂ Σ, Σ 4 x xx where Σ denotes Σ(x, T ). (iv) (Monotonicity in T) for every x ∈ R, Σ(X, ·) is non-decreasing. (v) (Large moneyness behaviour) for every T > 0
lim d+(x, Σ(x, T )) = −∞. x→∞
(vi) (Value at maturity) for every x ∈ R,
Σ(x, 0) = 0.
Then
Ce : [0, ∞)×[0, ∞) → R ( sB(x, Σ(x, T )), if K > 0 (K,T ) 7→ s, if K = 0
is a call price surface parameterised by s that is free of static arbitrage.
Remark: ∂x and ∂xx denotes the first and second order partial derivatives of Σ(x, T ) with respect to x. d+(·) is a component of B(·), which is a ”scaled Black-Scholes” function. For further explanations, see Roper [16] Definition 2.3 and Definition 2.4.
Now, all theory for constructing a volatility surface free of arbitrage has been presented. The remaining theory are essential concepts used for analyzing and adjusting the volatility surface.
14 2 THEORY
2.8 K-Means clustering K-means clustering is used for partitioning a dataset into K distinct, non- overlapping clusters. K-means algorithm will assign each observation in a dataset to exactly one of the K clusters, after specifying the number of clusters. Let C1, ..., CK denote sets containing the indices of the observation in each cluster. The clustering sets will satisfy two properties:
C1 ∪ C2 ∪ ... ∪ CK = {1, ..., n} 0 Ck ∩ Ck 0 = θ, ∀k 6= k Which says, no observation belongs to more than one cluster and that the clusters are non-overlapping. A good clustering is one for which the within- cluster variation is as small as possible. Let the the within-cluster variation for cluster Ck be denoted by W (Ck), then the problem to solve is:
K X min W (Ck) . (4) C1,...,CK k=1
The formula above says that we want to partition the observations into K clus- ters such that the total within-cluster variation is as small as possible. In order to make this calculation, the within-cluster variation needs to be defined. The most common choice is to use squared Euclidean distance. That is defined:
p 1 X X 2 W (Ck) = (xij − xi 0j) , (5) |Ck| 0 i,i ∈Ck j=1 where |Ck| denotes the number of observations in the kth cluster. Combining (4) and (5) gives the optimization problem that defines K-means clustering:
K p X 1 X X 2 min (xi,j − xi 0j) . (6) C1,...,C K |Ck| 0 k=1 i,i ∈Ck j=1
To solve equation (6) and determine the clusters, the step-by-step implementa- tion below can be used.
(i) Randomly assign a number , from 1 to K, to each of the observations. This serves as initial cluster assignments for the observations. (ii) Iterate until the cluster assignments stop changing: (a) For each of the K clusters, compute the cluster centroid. The kth cluster centroid is the vector of the p feature means for the observa- tions in the kth cluster. (b) Assign each observation to the cluster whose centroid is the closest, where closest is defined using Euclidean distance.
15 2 THEORY
2.9 Transition matrix A transition matrix describes the movements of a Markov chain over a finite state space S with cardinality S. Let the probability of moving from i to j in one time step be defined as P r(j|i) = Pi,j and that the total probability of moving from state i to all other states equals to 1, then transition matrix P is defined as: P1,1 P1,2 ...P1,j ...P1,S P2,1 P2,2 ...P2,j ...P2,S ...... ...... P = . Pi,1 Pi,2 ...Pi,j ...Pi,S ...... ...... PS,1 PS,2 ...PS,j ...PS,S
16 3 METHOD
3 Method
This section provides methods for calculating implied volatility of an option, construction of a volatility surface, elimination of arbitrage, measure errors and finally how to analyze the volatility surface. Figures and examples in this section are constructed using the LME Copper dataset.
3.1 Calculate implied volatility Implied volatility can be determined using various approaches and one common way is to use an iterative search. Having the market price C of an European call option with strike price K, time to maturity T , the risk free rate r and Black-Scholes formula as the pricing formula, Newton’s method can be used. Below follows a step-by-step implementation to determine the implied volatility (2) using Newton’s method.
(i) Let {C, s, K, T, r} be known and constant.
q 2π C (ii) Set σn = σ0 = T s as the initial guess of the implied volatility.
(iii) Determine the theoretical price Cˆ using Black-Scholes formula (1) with input parameters {s, K, T, r, σn}. (iv) If |Cˆ − C| < , STOP, where is the accepted estimation error.
Cˆ−C (v) Set σn+1 = σn − , where ν(σn) is the derivative of C w.r.t. the ν(σn) implied volatility. (vi) Go to step (iii).
Remark: The initial guess σ0 is a closed form estimate of the implied volatility derived by Menachem Brenner and Marti G. Subrahmanyan in [3].
3.2 Arbitrage: tests and how to eliminate it The arbitrage conditions for implied volatility stated in Theorem 2.7.2 can be linked to either calendar spread arbitrage or butterfly arbitrage and the two conditions stated in Definition 2.7.2 can therefore be extended.
Extension of Definition 2.7.2 (i): A volatility surface is free of calendar spread arbitrage if and only if conditions (iv) and (vi) in Theorem 2.7.2 are satisfied.
Extension of Definition 2.7.2 (ii): Each volatility smile is free of butter- fly arbitrage if and only if conditions (iii) and (v) in Theorem 2.7.2 are satisfied.
Using these extensions it is possible to construct tests for detecting arbitrage possibilities in the volatility surface.
17 3 METHOD
3.2.1 Calendar spread arbitrage To determine if a volatility surface is free of calendar spread arbitrage, the following test can be used: Procedure 3.2.1 (Calendar spread arbitrage test). For each maturity slice Ti and log-moneyness xj, determine the total implied variance ω(xj,Ti), where i = {1, 2, . . . , n}, j = {1, 2, . . . , m}. If for all xj,
ω(xj,Ti) ≤ ω(xj,Ti+1), the volatility surface is free of calendar spread arbitrage. Remark: This test was originally constructed by Ohman¨ [17].
For visualization of this test one can plot the total implied variance against the log-moneyness for each maturity slice in the same figure. If non of the lines intersect, the volatility surface is free of calendar spread arbitrage.
To eliminate calendar spread arbitrage, Gatheral and Jacquier [7] propose Lemma 5.1, which states that it is possible to interpolate between two maturity slices T1 and T2 without risk of introducing static arbitrage in the interpolated region. Here follows this lemma:
Lemma 3.2.1 (Lemma 5.1 in [7]). Given two volatility smiles ω(x, T1) and ω(x, T2) with T1 < T2 where the two smiles are free of butterfly arbitrage and such that ω(x, T2) > ω(x, T1) for all x, there exists an interpolation such that the interpolated region is free of static arbitrage for T1 < T < T2. Proof. This lemma is omitted to Gatheral and Jacquier [7].
3.2.2 Butterfly arbitrage To determine if a volatility surface is free of butterfly arbitrage, the following test can be used: Procedure 3.2.2 (Butterfly arbitrage test). For a fixed maturity slice T , de- termine condition (iii) in Theorem 2.7.2 for each xj, j = {1, 2, . . . , m}. In other words, let
x∂ Σ2 1 g(x) = 1 − x − Σ2(∂ Σ)2 + Σ∂ Σ Σ 4 x xx and calculate g(xj) for j = {1, 2, . . . , m}. If for all xj
g(xj) > 0 the volatility surface is free of butterfly arbitrage.
18 3 METHOD
To eliminate butterfly arbitrage Gatheral and Jacquier [7] propose to fix the SVI- JW parameters vT , ψT och pT , for a maturity slice T and choose the remaining parameters as:
0 0 0 4pT cT cT = pT + 2ψT , and vˆT = vT 0 . (7) (pT + cT ) It follows by continuity of the parameterization in all of the SVI-JW parame- ∗ ∗ ∗ 0 ters that there must exist a pair of parameters (cT , vˆT ), where cT ∈ (cT , cT ) and ∗ 0 vˆT ∈ (ˆvT , vˆT ), such that the new volatility smile is free of butterfly arbitrage and as close as possible to the original one in some sense.
According to Gatheral and Jacquier [7], this set of parameters should insure no butterfly arbitrage but it is proven that in some particular cases when arbi- trage is introduced in the left wing of the smile, pT can’t be fixed but also has to be chosen [17]. The choice of pT is not restricted to a closed interval as the other two parameters.
The optimal choice of parameters can be found using least squares with an objective function set to the differences between the total implied variances given from the new parameterization and the original plus a large penalty P for butterfly arbitrage. The expression to minimize is then:
m X SVI 0 2 min (ωj − ωj) + P, cT ,vˆT ,pT j=1
SVI where ωj is the total implied variance given from the original parameteriza- 0 tion and ωj is the total implied variance given from the new parameterization for log-moneyness j.
Remark: The size of P should be large enough to insure no butterfly arbi- trage. From our empirical findings, P = 10000 is sufficient.
3.3 Construction of a volatility surface In this section we will describe how to use the raw SVI parameterization for con- structing a volatility surface. When finished, the volatility surface is represented by an uniformly distributed discrete set of implied volatilities. Henceforth total implied variance is used since the raw SVI parameterization is defined in total implied variance.
For a fixed time to maturity T , the raw SVI parameterization, Definition 2.6, is a function to determine the total implied variance for any choice of moneyness x. The goal is to determine the set of parameters {a, b, ρ, m, σ} allowing the best fit to the market data. This is an optimization problem which can be solved using least squares where the expression to minimize is defined as:
19 3 METHOD
m X SVI 2 min wj(ωj − ωˆj) (8) a,b,ρ,m,σ j=1
SVI where wj is a weight, ωj is the raw SVI-parameterization andω ˆj is the ob- served total implied variance obtained using Newton’s method described in 2.2 to determine the implied volatility and transform it into total implied variance using (3).
Remark: Various methods exist for solving (8), for example the Levenberg- Marquardt algorithm or the Trust Region Reflective algorithm, which was used throughout this thesis work since it is robust and the default method in the SciPy package.
In Figure 4, a raw SVI parameterization is displayed together with correspond- ing market data.
Figure 4: A raw SVI parameterization fitted to market data.
Having a raw SVI parameterization, we must insure that no butterfly arbitrage opportunities are introduced. In [7], the authors give an example of when butter- fly arbitrage is introduced which is re-created in Figure 5. The set of parameters used for this example are {a, b, ρ, m, σ} = {−0.0410, 0.1331, 0.3060, 0.3586, 0.4153}.
20 3 METHOD
Figure 5: Visualization of Durrleman’s condition. Arbitrage opportunities are introduced as g(x) falls below the dashed line.
To eliminate butterfly arbitrage we transform the raw SVI parameters into SVI- JW parameters and then apply least squares. The objective function is set to the differences between the total implied variances given from the new param- eterization and the original plus a large penalty for butterfly arbitrage.
When all butterfly arbitrage opportunities are eliminated we can safely inter- polate between a pair of raw SVI parameterizations without risk of introducing new butterfly arbitrage. We therefore choose a uniformly distributed set of log-moneyness and use the raw SVI parameterization to determine the corre- sponding set of total implied variances for each time to maturity.
The interpolation between the sets of total implied variances can be done in various ways. Since we know that the surface has a linear leaning profile in the time to maturity direction, linear- or polynomial regression are suitable approaches. Using such models we can predict a uniformly distributed set of maturities for each log-moneyness. In Figure 6, an interpolation of market data using linear regression and the predicted set of maturities are shown.
21 3 METHOD
Figure 6: Polynomial regression interpolation in the time to maturity direction for log-moneyness = 0.
Now, the calendar spread arbitrage test, defined in Procedure 3.2.1, can be performed. In Figure 6, a visualization of volatility smiles for time to maturity 40, 76 and 112 days are shown. Here, no calendar arbitrage opportunities are introduced since each volatility smile are strictly greater than the other with longer time to maturity. In the case of intersection between volatility smiles, Lemma 3.2.1 is used to eliminate the calendar arbitrage.
Figure 7: Volatility smiles for different time to maturities T where no calendar arbitrage opportunities are introduced.
22 3 METHOD
With no calendar spread arbitrage, we now have an arbitrage free set of discrete total implied variances to represent the surface. The volatility surface is shown in Figure 8 where plot (a) displays the surface expressed in total implied variance and (b) displays it in implied volatility. The surface is constructed using total 2500 discrete points in the range [−0.5, 0.5] for log-moneyness and [4, 300] for time to maturity with 50 steps in each range.
23 3 METHOD
(a) A volatility surface expressed in total implied variance, ω.
(b) A volatility surface expressed in implied volatility, σimp.
Figure 8: A volatility surface expressed in total implied variance, ω, in (a) and expressed in implied volatility, σimp, in (b).
24 3 METHOD
3.3.1 Calculate change and error between market data and the sur- face To measure change in the volatility surface, relative change is used. For a fixed time to maturity Ti and log-moneyness xj, a relative change is defined as:
σt+1 − σt δi,j = + 1 (9) σt where σt is the implied volatility σimp in the volatility surface for day t and σt+1 is the implied volatility σimp in the volatility surface for day t + 1 at time to maturity Ti and log-moneyness xj.
To measure a volatility surface’s fit against the market and available options, the total squared relative error is used. For a certain option’s time to maturity Ti and log-moneyness xj, a squared relative error is defined as: