. Introduction to Pricing .

Liuren Wu

Zicklin School of Business, Baruch College

Options Markets

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 1 / 78 Outline

1. General principles and applications

2. Illustration of hedging/pricing via binomial trees

3. The Black-Merton-Scholes model

4. Introduction to Ito’s lemma and PDEs

5. Real (P) v. risk-neutral (Q) dynamics

6. implied surface

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 2 / 78 Review: Valuation and investment in primary securities

The securities have direct claims to future cash flows.

Valuation is based on forecasts of future cash flows and risk: DCF (Discounted Cash Flow Method): Discount time-series forecasted future cash flow with a discount rate that is commensurate with the forecasted risk. We diversify as much as we can. There are remaining risks after diversification — market risk. We assign a premium to the remaining risk we bear to discount the cash flows.

Investment: Buy if market price is lower than model value; sell otherwise.

Both valuation and investment depend crucially on forecasts of future cash flows (growth rates) and risks (, credit risk).

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 3 / 78 Compare: securities

Payoffs are linked directly to the price of an “underlying” .

Valuation is mostly based on replication/hedging arguments. Find a portfolio that includes the underlying security, and possibly other related derivatives, to replicate the payoff of the target derivative security, or to away the risk in the derivative payoff. Since the hedged portfolio is riskfree, the payoff of the portfolio can be discounted by the riskfree rate. No need to worry about the risk premium of a “zero-beta” portfolio. Models of this type are called “no-” models.

Key: No (little) forecasts are involved. Valuation is based, mostly, on cross-sectional comparison. It is less about whether the underlying security price will go up or down (forecasts of growth rates /risks), but about the relative pricing relation between the underlying and the derivatives under all possible scenarios...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 4 / 78 Readings behind the technical jargons: P v. Q

P: Actual probabilities that earnings will be high or low, estimated based on historical data and other insights about the company. Valuation is all about getting the forecasts right and assigning the appropriate price for the forecasted risk — fair wrt future cashflows (and your risk preference). Q: “Risk-neutral” probabilities that one can use to aggregate expected future payoffs and discount them back with riskfree rate. It is related to real-time scenarios, but it is not real-time probability. Since the intention is to hedge away risk under all scenarios and discount back with riskfree rate, we do not really care about the actual probability of each scenario happening. We just care about what all the possible scenarios are and whether our hedging works under all scenarios. Q is not about getting close to the actual probability, but about being fair/consistent wrt the prices of securities that you use for hedging.

Associate P with time-series forecasts and Q with. cross-sectional...... Liurencomparison. Wu (Baruch) Option Pricing Introduction Options Markets 5 / 78 A Micky Mouse example

Consider a non-dividend paying in a world with zero riskfree interest rate. Currently, the market price for the stock is $100. What should be the for the stock with one year maturity? The forward price is $100.

Standard forward pricing argument says that the forward price should be equal to the cost of buying the stock and carrying it over to maturity. The buying cost is $100, with no storage or interest cost or dividend benefit.

How should you value the forward differently if you have inside information that the company will be bought tomorrow and the stock price is going to double? Shorting a forward at $100 is still safe for you if you can buy the stock at $100 to hedge.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 6 / 78 Investing in derivative securities without insights

If you can really forecast (or have inside information on) future cashflows, you probably do not care much about hedging or no-arbitrage pricing.

What is risk to the market is an opportunity for you. You just lift the market and try not getting caught for insider trading.

If you do not have insights on cash flows and still want to invest in derivatives, the focus does not need to be on forecasting, but on cross-sectional consistency.

No-arbitrage pricing models can be useful. Identifying the right hedge can be as important (if not more so) as generating the right valuation.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 7 / 78 What are the roles of an option pricing model?

1. Interpolation and extrapolation: Broker-dealers: Calibrate the model to actively traded option contracts, use the calibrated model to generate option values for contracts without reliable quotes (for quoting or book marking). Criterion for a good model: The model value needs to match observed prices (small pricing errors) because market makers cannot take big views and have to passively take order flows. Sometimes one can get away with interpolating (linear or spline) without a model. The need for a cross-sectionally consistent model becomes stronger for extrapolation from -dated options to long-dated contracts, or for interpolation of markets with sparse quotes. The purpose is less about finding a better price or getting a better prediction of future movements, but more about making sure the provided quotes on different contracts are “consistent” in the sense that no one can buy and sell with you to lock in a profit on you...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 8 / 78 What are the roles of an option pricing model?

2. generation Investors: Hope that market price deviations from the model “fair value” are mean reverting. Criterion for a good model is not to generate small pricing errors, but to generate (hopefully) large, transient errors. When the market price deviates from the model value, one hopes that the market price reverts back to the model value quickly. One can use an error-correction type specification to test the performance of different models. However, alpha is only a small part of the equation, especially for investments in derivatives.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 9 / 78 What are the roles of an option pricing model?

3. Risk hedge and management: Hedging and managing risk plays an important role in derivatives. Market makers make money by receiving bid-ask spreads, but options order flow is so sparse that they cannot get in and out of contracts easily and often have to hold their positions to . Without proper hedge, market value fluctuation can easily dominate the spread they make. Investors (hedge funds) can make active derivative investments based on the model-generated alpha. However, the convergence movement from market to model can be a lot smaller than the market movement, if the positions are left unhedged. — Try the forward example earlier. The estimated model dynamics provide insights on how to manage/hedge the risk exposures of the derivative positions. Criterion of a good model: The modeled risks are real risks and are the only risk sources. Once one hedges away these risks, no other risk is left...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 10 / 78 What are the roles of an option pricing model?

4. Answering economic questions with the help of options observations Macro/corporate policies interact with investor preferences and expectations to determine financial security prices. Reverse engineering: Learn about policies, expectations, and preferences from security prices. At a fixed point in time, one can learn a lot more from the large amount of option prices than from a single price on the underlying. Long time series are hard to come by. The large cross sections of derivatives provide a partial replacement. The Breeden and Litzenberger (1978) result allows one to infer the risk-neutral return distribution from option prices across all strikes at a fixed maturity. Obtaining further insights often necessitates more structures/assumptions/an option pricing model. Criterion for a good model: The model contains intuitive, easily interpretable, economic meanings.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 11 / 78 Forward pricing

Terminal Payoff:(ST − K), linear in the underlying security price (ST ). No-arbitrage pricing: Buying the underlying security and carrying it over to expiry generates the same payoff as signing a forward. The forward price should be equal to the cost of the replicating portfolio (buy & carry). This pricing method does not depend on (forecasts) how the underlying security price moves in the future or whether its current price is fair. but it does depend on the actual cost of buying and carrying (not all things can be carried through time...), i.e., the implementability of the replicating strategy.

(r−q)(T −t) Ft,T = St e , with r and q being continuously compounded rates of carrying cost (interest, storage) and benefit (dividend, foreign interest, convenience yield). When arbitrage trading does not work, q can be traded as a “convenience ...... yield,” or a prediction of future spot movements...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 12 / 78 Option pricing

+ + Terminal Payoff: Call — (ST − K) , put — (K − ST ) . No-arbitrage pricing:

Can we replicate the payoff of an option with the underlying security so that we can value the option as the cost of the replicating portfolio? Can we use the underlying security (or something else liquid and with known prices) to completely hedge away the risk? If we can hedge away all risks, we do not need to worry about risk premiums and one can simply set the price of the contract to the value of the hedge portfolio (maybe plus a premium).

In most practical situations, options cannot be replicated by the underlying — Options are useful and informative about (i) expectations of the future (ii) risk premiums. The difficulty lies in how to separate the two.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 13 / 78 Another Mickey Mouse example: A one-step binomial tree

Observation: The current stock price (St ) is $20. The current continuously compounded interest rate r (at 3 month maturity) is 12%. (crazy number from Hull’s book). Model/Assumption: In 3 months, the stock price is either $22 or $18 (no dividend for now).

ST = 22  P St = 20 PPP Comments: ST = 18 Once we make the binomial dynamics assumption, the actual probability of reaching either node (going up or down) no longer matters. We have enough information (we have made enough assumption) to price options that expire in 3 months. Remember: For idealistic derivative no-arbitrage pricing, what matters is the list of possible scenarios, but not the actual probability of each scenario

happening...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 14 / 78 A 3-month

Consider a 3-month call option on the stock with a strike of $21.

Backward induction: Given the terminal stock price (ST ), we can compute + the option payoff at each node, (ST − K) . The zero-coupon bond price with $1 par value is: 1 × e−.12×.25 = $0.9704. B = 1  T  ST = 22 Bt = 0.9704 Q cT = 1 St = 20 Q Q BT = 1 ct = ? Q ST = 18 cT = 0 Two angles: Replicating: See if you can replicate the call’s payoff using stock and bond. ⇒ If the payoffs equal, so does the value. Hedging: See if you can hedge away the risk of the call using stock. ⇒ If the hedged payoff is riskfree, we can compute the present value using

riskfree rate...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 15 / 78 Replicating

B = 1  T  ST = 22 Bt = 0.9704 Q cT = 1 St = 20 Q Q BT = 1 ct = ? Q ST = 18 cT = 0 Assume that we can replicate the payoff of 1 call using ∆ share of stock and D par value of bond, we have

1 = ∆22 + D1, 0 = ∆18 + D1.

Solve for ∆: ∆ = (1 − 0)/(22 − 18) = 1/4. ∆ = Change in C/Change in S, a sensitivity measure. − 1 − Solve for D: D = 4 18 = 4.5 (borrow). Value of option = value of replicating portfolio 1 − × = 4 20 4.5 0.9704 = 0.633...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 16 / 78 Hedging

B = 1  T  ST = 22 Bt = 0.9704 Q cT = 1 St = 20 Q Q BT = 1 ct = ? Q ST = 18 cT = 0 Assume that we can hedge away the risk in 1 call by shorting ∆ share of stock, such as the hedged portfolio has the same payoff at both nodes: 1 − ∆22 = 0 − ∆18.

Solve for ∆: ∆ = (1 − 0)/(22 − 18) = 1/4 (the same): ∆ = Change in C/Change in S, a sensitivity measure. − 1 − 1 − The hedged portfolio has riskfree payoff: 1 4 22 = 0 4 18 = 4.5.

Its present value is: −4.5 × 0.9704 = −4.3668 = 1ct − ∆St . − − 1 ct = 4.3668 + ∆St = 4.3668 + 4 20 = 0.633.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 17 / 78 One principle underlying two angles

If you can replicate, you can hedge: Long the option contract, short the replicating portfolio. The replication portfolio is composed of stock and bond. Since bond only generates parallel shifts in payoff and does not play any role in offsetting/hedging risks, it is the stock that really plays the hedging role. The optimal hedge ratio when hedging options using is defined as the ratio of option payoff change over the stock payoff change — Delta. Hedging derivative risks using the underlying security (stock, currency) is called delta hedging. To hedge a long position in an option, you need to short delta of the underlying. To replicate the payoff of a long position in option, you need to long delta of the underlying.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 18 / 78 The limit of delta hedging

Delta hedging completely erases risk under the binomial model assumption: The underlying stock can only take on two possible values. Using two (different) securities can span the two possible realizations. We can use stock and bond to replicate the payoff of an option. We can also use stock and option to replicate the payoff of a bond. One of the 3 securities is redundant and can hence be priced by the other two.

What will happen for the hedging/replicating if the stock can take on 3 possible values three months later, e.g., (22, 20, 18)? It is impossible to find a ∆ that perfectly hedge the option position or replicate the option payoff. We need another instrument (say another option) to do perfect hedging or replication.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 19 / 78 Introduction to risk-neutral valuation

p BT = 1   ST = 22 B = 0.9704 Q t Q St = 20 Q Q B = 1 1 − p T ST = 18 Compute a set of artificial probabilities for the two nodes (p, 1 − p) such that the stock price equals the expected value of the terminal payment, discounted by the riskfree rate. − ⇒ 20/0.9704−18 20 = 0.9704 (22p + 18(1 p)) p = 22−18 = 0.6523 Since we are discounting risky payoffs using riskfree rate, it is as if we live in an artificial world where people are risk-neutral. Hence, (p, 1 − p) are the risk-neutral probabilities. These are not really probabilities, but more like unit prices:0.9704p is the price of a security that pays $1 at the up state and zero otherwise. 0.9704(1 − p) is the unit price for the down state. To exclude arbitrage, the same set of unit prices (risk-neutral probabilities) ...... should be applicable to all securities, including options...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 20 / 78 Risk-neutral probabilities

Under the binomial model assumption, we can identify a unique set of risk-neutral probabilities (RNP) from the current stock price. The risk-neutral probabilities have to be probability-like (positive, less than 1) for the stock price to be arbitrage free. What would happen if p < 0 or p > 1? If the prices of market securities do not allow arbitrage, we can identify at least one set of risk-neutral probabilities that match with all these prices. When the market is, in addition, complete (all risks can be hedged away completely by the assets), there exist one unique set of RNP. Under the binomial model, the market is complete with the stock alone. We can uniquely determine one set of RNP using the stock price alone. Under a trinomial assumption, the market is not complete with the stock alone: There are many feasible RNPs that match the stock price. In this case, we add an option to fully determine a unique set of RNP. That is, we use this option (and the trinomial) to complete the market...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 21 / 78 Market completeness

Market completeness (or incompleteness) is a relative concept: Market may not be complete with one asset, but can be complete with two... To me, the real-world market is severely incomplete with primary market securities alone (stocks and bonds). To complete the market, we need to include all available options, plus model structures. Model and instruments can play similar rules to complete a market.

RNP are not learned from the stock price alone, but from the option prices. Risk-neutral dynamics need to be estimated using option prices. Options are not redundant. Caveats: Be aware of the reverse flow of information and supply/demand shocks.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 22 / 78 Muti-step binomial trees

A one-step binomial model looks very Mickey Mouse: In reality, the stock price can take on many possible values than just two. Extend the one-step to multiple steps. The number of possible values (nodes) at expiry increases with the number of steps. With many nodes, using stock alone can no longer achieve a perfect hedge in one shot. But locally, within each step, we still have only two possible nodes. Thus, we can achieve perfect hedge within each step. We just need to adjust the hedge at each step — dynamic hedging. By doing as many hedge rebalancing at there are time steps, we can still achieve a perfect hedge globally. The market can still be completed with the stock alone, in a dynamic sense. A multi-step binomial tree can still still very useful in practice in, for example, handling American options, discrete dividends, interest rate term

structures ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 23 / 78 Constructing the binomial tree

 S u  t St  fu Q ft Q QQ St d fd

One way to set up the tree is to use (u, d) to match the stock return volatility (Cox, Ross, Rubinstein): √ u = eσ ∆t , d = 1/u.

σ— the annualized return volatility (standard deviation). a−d The risk-neutral probability becomes: p = u−d where a = er∆t for non-dividend paying stock. a = e(r−q)∆t for dividend paying stock. − a = e(r rf )∆t for currency. a = 1 for futures...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 24 / 78 Estimating the return volatility σ

To determine the tree (St , u, d), we set √ u = eσ ∆t , d = 1/u.

The only un-observable or “free” parameter is the return volatility σ.

Mean expected return (risk premium) does not matter for derivative pricing under the binomial model because we can completely hedge away the risk. St can be observed from the .

We can√ estimate the return volatility using the time series data: [ ∑ ] b 1 1 N − 2 ⇐ 1 σ = ∆t N−1 t=1(Rt R) . ∆t is for annualization.

Implied volatility: We can estimate the volatility (σ) by matching observed option prices.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 25 / 78 The Black-Merton-Scholes (BMS) model

Black and Scholes (1973) and Merton (1973) derive option prices under the following assumption on the stock price dynamics,

dSt = µSt dt + σSt dWt

The binomial model: Discrete states and discrete time (The number of possible stock prices and time steps are both finite). The BMS model: Continuous states (stock price can be anything between 0 and ∞) and continuous time (time goes continuously). BMS proposed the model for stock option pricing. Later, the model has been extended/twisted to price currency options (Garman&Kohlhagen) and options on futures (Black). I treat all these variations as the same concept and call them indiscriminately the BMS model.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 26 / 78 Primer on continuous time process

dSt = µSt dt + σSt dWt

The driver of the process is Wt , a Brownian motion, or a Wiener process. The sample paths of a Brownian motion are continuous over time, but nowhere differentiable. It is the idealization of the trajectory of a single particle being constantly bombarded by an infinite number of infinitessimally small random forces. Like a shark, a Brownian motion must always be moving, or else it dies. If you sum the absolute values of price changes over a day (or any time horizon) implied by the model, you get an infinite number. If you tried to accurately draw a Brownian motion sample path, your pen would run out of ink before one second had elapsed.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 27 / 78 Properties of a Brownian motion

dSt = µSt dt + σSt dWt

The process Wt generates a random variable that is normally distributed with mean 0 and variance t, ϕ(0, t). (Also referred to as Gaussian.) Everybody believes in the normal approximation, the experimenters because they believe it is a mathematical theorem, the mathematicians because they believe it is an experimental fact!

The process is made of independent normal increments dWt ∼ ϕ(0, dt). “d” is the continuous time limit of the discrete time difference (∆). ∆t denotes a finite time step (say, 3 months), dt denotes an extremely thin slice of time (smaller than 1 milisecond). It is so thin that it is often referred to as instantaneous. Similarly, dWt = Wt+dt − Wt denotes the instantaneous increment (change) of a Brownian motion. By extension, increments over non-overlapping time periods are − ∼ − independent: For (t1 > t2 > t3), (Wt3 Wt2 ) ϕ(0, t3 t2) is independent of (Wt − Wt ) ∼ ϕ(0, t2 − t1)...... 2 1 ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 28 / 78 Properties of a normally distributed random variable

dSt = µSt dt + σSt dWt

If X ∼ ϕ(0, 1), then a + bX ∼ ϕ(a, b2). If y ∼ ϕ(m, V ), then a + by ∼ ϕ(a + bm, b2V ).

Since dWt ∼ ϕ(0, dt), the instantaneous price change ∼ 2 2 dSt = µSt dt + σSt dWt ϕ(µSt dt, σ St dt). dS ∼ 2 The instantaneous return S = µdt + σdWt ϕ(µdt, σ dt). Under the BMS model, µ is the annualized mean of the instantaneous return — instantaneous mean return. σ2 is the annualized variance of the instantaneous return — instantaneous return variance. σ is the annualized standard deviation of the instantaneous return — instantaneous return volatility.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 29 / 78 Geometric Brownian motion

dSt /St = µdt + σdWt

The stock price is said to follow a geometric Brownian motion. µ is often referred to as the drift, and σ the diffusion of the process. Instantaneously, the stock price change is normally distributed, 2 2 ϕ(µSt dt, σ St dt). Over longer horizons, the price change is lognormally distributed. The log return (continuous compounded return) is normally distributed over all horizons:( ) − 1 2 d ln St = µ 2 σ dt + σdWt . (By Ito’s lemma) ∼ − 1 2 2 d ln St ϕ(µdt 2 σ dt, σ dt). ∼ − 1 2 2 ln St ϕ(ln S0((+ µt 2)σ t, σ t). ) ∼ − 1 2 − 2 − ln ST /St ϕ µ 2 σ (T t), σ (T t) .

− 1 2 µt 2 σ t+σWt − 1 2 Integral form: St = S0e , ln St = ln S0 + µt 2 σ t + σWt

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 30 / 78 Normal versus lognormal distribution

dSt = µSt dt + σSt dWt , µ = 10%, σ = 20%, S0 = 100, t = 1.

2 0.02

1.8 0.018 S =100 1.6 0.016 0 1.4 0.014 ) 0 t /S

t 1.2 0.012

1 0.01

0.8 PDF of S 0.008 PDF of ln(S 0.6 0.006

0.4 0.004

0.2 0.002

0 0 −1 −0.5 0 0.5 1 50 100 150 200 250 ln(S /S ) S t 0 t

The earliest application of Brownian motion to finance is by Louis Bachelier in his dissertation (1900) “Theory of Speculation.” He specified the stock price as following a Brownian motion with drift:

dSt = µdt + σdWt

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 31 / 78 Simulate 100 stock price sample paths

dSt = µSt dt + σSt dWt , µ = 10%, σ = 20%, S0 = 100, t = 1.

0.05 200

0.04 180 0.03 160 0.02

0.01 140

0 120 Stock price Daily returns −0.01 100 −0.02 80 −0.03

−0.04 60 0 50 100 150 200 250 300 0 50 100 150 200 250 300 Days days

− 1 2 Stock with the return process: d ln St = (µ 2 σ )dt + σdWt . Discretize to daily intervals dt ≈ ∆t = 1/252. Draw standard normal random variables ε(100 × 252) ∼ ϕ(0, 1). √ − 1 2 Convert them into daily log returns: Rd = (µ 2 σ )∆t + σ ∆tε. ∑ 252 R Convert returns into stock price sample paths: St = S0e d=1 d ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 32 / 78 Ito’s lemma on continuous processes

Given a dynamics specification for xt , Ito’s lemma determines the dynamics of any functions of x and time, yt = f (xt , t). Example: Given dynamics on St , derive dynamics on ln St /S0. For a generic continuous process xt ,

dxt = µx dt + σx dWt , the transformed variable y = f (x, t) follows, ( ) 1 dy = f + f µ + f σ2 dt + f σ dW t t x x 2 xx x x x t You can think of it as aTaylor expansion up to dt term, with (dW )2 = dt

Example 1:( dSt = µ)St dt + σSt dWt and y = ln St . We have 1 2 d ln St = µ − σ dt + σdWt 2 √ √ Example 2: dvt = κ (θ − vt ) dt + ω vt dWt , and σt = vt . ( ) 1 1 1 1 dσ = σ−1κθ − κσ − ω2σ−1 dt + ωdW t 2 t 2 t 8 t 2 t Example 3: dx = −κx dt + σdW and v = a + bx 2. ⇒ dv =?. t t t t . . t...... t...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 33 / 78 Ito’s lemma on jumps

For a generic process xt with jumps, ∫ dxt = µx dt + σx dWt + g(z)(µ(dz∫ , dt) − ν(z, t)dzdt) , = (µx − Et [g]) dt + σx dWt + g(z)µ(dz, dt),

where the random counting measure µ(dz, dt) realizes to a nonzero value for a given z if and only if x jumps from xt− to xt = xt− + g(z) at time t. The process ν(z, t)dzdt compensates the jump process so that the last term is the increment of a pure jump martingale. The transformed variable y = f (x, t) follows, ( ) 1 2 dyt = ft∫+ fx µx + 2 fxx σx dt + fx σx dWt ∫ (+ (f (xt− + g(z), t) − f (xt−,)t)) µ(dz, dt) − fx g(z)ν(z, t)dzdt, − E 1 2 = ft∫+ fx (µx t [g]) + 2 fxx σx dt + fx σx dWt + (f (xt− + g(z), t) − f (xt−, t)) µ(dz, dt)

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 34 / 78 Ito’s lemma on jumps: Merton (1976) example

( ) 1 2 dyt = ft∫+ fx µx + 2 fxx σx dt + fx σx dWt ∫ (+ (f (xt− + g(z), t) − f (xt−, t)))µ(dz, dt) − fx g(z)ν(z, t)dzdt, − E 1 2 = ft∫+ fx (µx t [g(z)]) + 2 fxx σx dt + fx σx dWt + (f (xt− + g(z), t) − f (xt−, t)) µ(dz, dt) Merton (1976)’s jump-diffusion process: ∫ z dSt = µSt−dt + σSt−dWt + St− (e − 1) (µ(z, t) − ν(z, t)dzdt) , and y = ln St . We have ( ) ∫ ∫ 1 d ln S = µ − σ2 dt + σdW + zµ(dz, dt) − (ez − 1) ν(z, t)dzdt t 2 t z f (xt− + g(z), t) − f (xt−, t) = ln St − ln St− = ln(St−e ) − ln St− = z. The specification (ez − 1) on price guarantees that the maximum downside jump is no larger than the pre-jump price level St−.

− 2 − (z µJ ) 1 2σ2 In Merton, ν(z, t) = λ √ e J . σJ 2π ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 35 / 78 The key idea behind BMS option pricing

Under the binomial model, if we cut time step ∆t small enough, the binomial tree converges to the distribution behavior of the geometric Brownian motion. Reversely, the Brownian motion dynamics implies that if we slice the time thin enough (dt), it behaves like a binomial tree. Under this thin slice of time interval, we can combine the option with the stock to form a riskfree portfolio, like in the binomial case. Recall our hedging argument: Choose ∆ such that f − ∆S is riskfree. The portfolio is riskless (under this thin slice of time interval) and must earn the riskfree rate. Since we can hedge perfectly, we do not need to worry about risk premium and expected return. Thus, µ does not matter for this portfolio and hence does not matter for the option valuation. Only σ matters, as it controls the width of the binomial branching.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 36 / 78 The hedging proof

The stock price dynamics: dS = µSdt + σSdWt . Let f ((S, t) denote the value of) a derivative on the stock, by Ito’s lemma: 1 2 2 dft = ft + fS µS + 2 fSS σ S dt + fS σSdWt . At time t, form a portfolio that contains 1 derivative contract and −∆ = fS of the stock. The value of the portfolio is P = f − fS S.

The instantaneous uncertainty of the portfolio is fS σSdWt − fS σSdWt = 0. Hence, instantaneously the delta-hedged portfolio is riskfree. Thus, the portfolio( must earn the riskfree rate: dP)= rPdt. − 1 2 2 − − dP = df fS dS = ft + fS µS + 2 fSS σ S fS µS dt = r(f fS S)dt This lead to the fundamental partial differential equation (PDE): 1 2 2 ft + rSfS + 2 fSS σ S = rf . If the stock pays a continuous dividend yield q, the portfolio P&L needs to be modified as dP = df − fS (dS + qSdt), where the stock investment P&L includes both the capital gain (dS) and the dividend yield (qSdt). − 1 2 2 The PDE then becomes: ft + (r q)SfS + 2 fSS σ S = rf .

No where do we see the drift (expected return) of. the. . . price. . . . dynamics...... (. µ). in. . . the PDE...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 37 / 78 Partial differential equation

The hedging argument leads to the following partial differential equation: ∂f ∂f 1 ∂2f + (r − q)S + σ2S 2 = rf ∂t ∂S 2 ∂S 2 The only free parameter is σ (as in the binomial model). Solving this PDE, subject to the terminal payoff condition of the derivative + (e.g., fT = (ST − K) for a European call option), BMS derive analytical formulas for call and value. Similar formula had been derived before based on distributional (normal return) argument, but µ was still in. More importantly, the derivation of the PDE provides a way to hedge the option position. The PDE is generic for any derivative securities, as long as S follows geometric Brownian motion. Given boundary conditions, derivative values can be solved numerically from the PDE...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 38 / 78 How effective is the BMS delta hedge in practice?

If you sell an option, the BMS model says that you can completely remove the risk of the call by continuously rebalancing your stock holding to neutralize the delta of the option. We perform the following experiment using history S&P options data from 1996 - 2011: Sell a call option Record the P&L from three strategies H Hold: Just hold the option. S Static: Perform delta hedge at the very beginning, but with no further rebalancing. D Dynamic: Actively rebalance delta at the end of each day. “D” represents a daily discretization of the BMS suggestion. Compared to “H” (no hedge), how much do you think the daily discretized BMS suggestion (“D”) can reduce the risk of the call?

[A] 0, [B] 5%, [C] 50%, [D] 95% ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 39 / 78 Example: Selling a 30-day at-the-money call option

1.4 100 Hold Static 95 1.2 Dynamics 90

1 85

80 0.8 75 0.6 70

0.4 65 Mean Squared Hedging Error

Percentage Variance Reduction 60 0.2 55 Static Dynamic 0 50 0 5 10 15 20 25 30 0 5 10 15 20 25 30 Days Days

Unhedged, the PL variance can be over 100% of the option value. Un-rebalanced, the delta hedge deteriorates over time as the delta is no longer neutral after a whole. Rebalanced daily, 95% of the risk can be removed over the whole sample period.

The answer is “D.” ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 40 / 78 Hedging options at different maturities and time periods

Different maturities Different years 100 100

95 95

90 90

85 85

80 80

75 75

70 70

65 65 1 month 1 month Percentage Variance Reduction 60 Percentage Variance Reduction 60 2 month 2 month 55 5 month 55 5 month 12 month 12 month 50 50 0 5 10 15 20 25 30 1996 1998 2000 2002 2004 2006 2008 2010 Days Year

Dynamic delta hedge works equally well on both short and long-term options. Hedging performance deteriorates in the presence of large moves.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 41 / 78 The BMS formula for European options

−q(T −t) −r(T −t) ct = St e N(d1) − Ke N(d2), −q(T −t) −r(T −t) pt = −St e N(−d1) + Ke N(−d2), where ln(S /K)+(r−q)(T −t)+ 1 σ2(T −t) d = t √ 2 , 1 σ T −t √ ln(S /K)+(r−q)(T −t)− 1 σ2(T −t) d = t √ 2 = d − σ T − t. 2 σ T −t 1 Black derived a variant of the formula for futures (which I like better):

−r(T −t) ct = e [Ft N(d1) − KN(d2)],

ln(F /K) 1 σ2(T −t) with d = t √ 2 . 1,2 σ T −t (r−q)(T −t) Recall: Ft = St e . Once I know call value, I can obtain put value via put-call parity: −r(T −t) ct − pt = e [Ft − Kt ].

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 42 / 78

 1 2 − −r(T −t) ln(Ft /K) 2 σ (T t) ct = e [Ft N(d1) − KN(d2)] , d1,2 = √ σ T − t

Since Ft (or St ) is observable from the underlying stock or futures market, (K, t, T ) are specified in the contract. The only unknown (and hence free) parameter is σ. We can estimate σ from time series return (standard deviation calculation). Alternatively, we can choose σ to match the observed option price — implied volatility (IV). There is a one-to-one correspondence between prices and implied volatilities. Traders and brokers often quote implied volatilities rather than dollar prices. The BMS model says that IV = σ. In reality, the implied volatility calculated from different option contracts (across different strikes, maturities, dates) are usually different.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 43 / 78 Real versus risk-neutral dynamics

Recall: Under the binomial model, we derive a set of risk-neutral probabilities such that we can calculate the expected payoff from the option and discount them using the riskfree rate. Risk premiums are hidden in the risk-neutral probabilities. If in the real world, people are indeed risk-neutral, the risk-neutral probabilities are the same as the real-world probabilities. Otherwise, they are different. Under the BMS model, we can also assume that there exists such an artificial risk-neutral world, in which the expected returns on all assets earn risk-free rate. The stock price dynamics under the risk-neutral world becomes, dSt /St = (r − q)dt + σdWt . Simply replace the actual expected return (µ) with the return from a risk-neutral world (r − q)[ex-dividend return]. We label the true probability measure by P and the risk-neutral measure by Q...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 44 / 78 The risk-neutral return on spots

dSt /St = (r − q)dt + σdWt , under risk-neutral probabilities.

In the risk-neutral world, investing in all securities make the riskfree rate as the total return. If a stock pays a dividend yield of q, then the risk-neutral expected return from stock price appreciation is (r − q), such as the total expected return is: dividend yield+ price appreciation =r.

Investing in a currency earns the foreign interest rate rf similar to dividend yield. Hence, the risk-neutral expected currency appreciation is (r − rf ) so that the total expected return is still r.

Regard q as rf and value options as if they are the same.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 45 / 78 The risk-neutral return on forwards/futures

If we sign a , we do not pay anything upfront and we do not receive anything in the middle (no dividends or foreign interest rates). Any P&L at expiry is excess return. Under the risk-neutral world, we do not make any excess return. Hence, the forward price dynamics has zero mean (driftless) under the risk-neutral probabilities: dFt /Ft = σdWt . The carrying costs are all hidden under the forward price, making the pricing equations simpler.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 46 / 78 Return predictability and option pricing

Consider the following stock price dynamics,

dSt /St = (a + bXt ) dt + σdWt , dXt = κ (θ − Xt ) + σx dW2t , ρdt = E[dWt dW2t ].

Stock returns are predictable by a vector of mean-reverting predictors Xt . How should we price options on this predictable stock? Option pricing is based on replication/hedging — a cross-sectional focus, not based on prediction — a time series behavior. Whether we can predict the stock price is irrelevant. The key is whether we can replicate or hedge the option risk using the underlying stock. Consider a derivative f (S, t), whose( terminal payoff depends) on S but not on 1 2 2 X , then it remains true that dft = ft + fS µS + 2 fSS σ S dt + fS σSdWt .

The delta-hedged option portfolio (f − fS S) remains riskfree. 1 2 2 The same PDE holds for f : ft + rSfS + 2 fSS σ S = rf , which has no dependence on X . Hence, the assumption f (S, t) (instead of f (S, X , t)) is

correct...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 47 / 78 Return predictability and risk-neutral valuation

Consider the following stock price dynamics (under P),

dSt /St = (a + bXt ) dt + σdWt , dXt = κ (θ − Xt ) + σx dW2t , ρdt = E[dWt dW2t ].

Under the risk-neutral measure (Q), stocks earn the riskfree rate, regardless of the predictability: dSt /St = (r − q) dt + σdWt Analogously, the futures price is a martingale under Q, regardless of whether you can predict the futures movements or not: dFt /Ft = σdWt Measure change from P to Q does not change the diffusion component σdW . The BMS formula still applies — Option pricing is dictated by the volatility (σ) of the uncertainty, not by the return predictability.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 48 / 78 Stochastic volatility and option pricing

Consider the following stock price dynamics, √ dSt /St = µdt + vt dWt√, dvt = κ (θ − vt ) + ω vt dW2t , ρdt = E[dWt dW2t ]. How should we price options on this stock with stochastic volatility? Consider a call option on S. Even though the terminal value does not depend on vt , the value of the option at other periods have to depend on vt .

Assume f (S, t) does not depend on vt , and thus P = f − fS S is a riskfree portfolio, we have( ) − 1 2 − − dPt = dft fS dSt = ft + fS µS + 2 fSS vS fS µS dt = r(f fS S)dt. 1 2 ft + rfS S + 2 fSS vS = rf . The PDE is a function of v. Hence, it must be f (S, v, t) instead of f (S, t). For f ((S, v, t), we have (bivariate version of Ito): ) 1 2 − 1 2 dft =√ft + fS µS + √2 fSS vt S + fv κ (θ vt ) + 2 fvv ω vt + fSv ωvt ρ dt + fS σS vt dWt + fv ω vt dW2t .

Since there are two sources of risk (Wt , W2t ), delta-hedging is not going to

lead to a riskfree portfolio...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 49 / 78 Pricing kernel

The real and risk-neutral dynamics can be linked by a pricing kernel. In the absence of arbitrage, in an economy, there exists at least one strictly positive process, Mt , the state-price deflator, such that the deflated gains process associated with any admissible strategy is a martingale. The ratio of Mt at two horizons, Mt,T , is called a stochastic discount factor, or informally, a pricing kernel.

For an asset that has a terminal payoff ΠT , its time-t value is EP pt = t [Mt,T ΠT ]. The time-t price of a $1 par riskfree zero-coupon bond that expires at EP T is pt = t [Mt,T ]. In a discrete-time representative agent economy with an additive CRRA utility, the pricing kernel is equal to the ratio of the marginal utilities of consumption, ( ) ′ −γ u (ct+1) ct+1 Mt,t+1 = β ′ = β u (ct ) ct

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 50 / 78 From pricing kernel to exchange rates

h Let Mt,T denote the pricing kernel in economy h that prices all securities in that economy with its currency denomination. The h-currency price of currency-f (h is home currency) is linked to the pricing kernels of the two economies by, fh Mf ST t,T fh = h St Mt,T

fh fh The log currency return over period [t, T ], ln ST /St equals the difference between the log pricing kernels of the two economies.

Let S denote the dollar price of pound (e.g. St = $2.06), then pound − dollar ln ST /St = ln Mt,T ln Mt,T . If markets are completed by primary securities (e.g., bonds and stocks), there is one unique pricing kernel per economy. The exchange rate movement is uniquely determined by the ratio of the pricing kernels. If markets are not completed by primary securities, exchange rates (and their options) help complete the markets by requiring that the ratio of any two ...... viable pricing kernels go through the exchange rate...... and...... their...... options...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 51 / 78 From pricing kernel to measure change

In continuous time, it is convenient to define the pricing kernel via the following multiplicative decomposition: ( ∫ ) ( ∫ ) T T − E − ⊤ Mt,T = exp rs ds γs dXs t t

r is the instantaneous riskfree rate, E is the stochastic exponential martingale operator, X denotes the risk sources in the economy, and γ is the market price of the risk X . ( ∫ ) ( ∫ ∫ ) E − T − T − 1 T 2 If Xt = Wt , t γs dWs = exp t γs dWs 2 t γs ds . In a continuous time version of the representative agent example, dXs = d ln ct and γ is relative risk aversion. r is usually a function of X .

The exponential martingale E(·) defines the measure change from P to Q.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 52 / 78 Measure change defined by exponential martingales

Consider the following exponential martingale that defines the measure change P Q from to( : ∫ ) dQ E − t dP t = 0 γs dWs ,

P i i i i i i i E i If the dynamics is: dSt = µ St dt + σ St dWt , with ρ dt = [dWt dWt ], i Q i then the dynamics of St under is: dSt = ( ) i i i i i E − i i i i − i i i i i i µ St dt + σ St dWt + [ γt dWt , σ St dWt ] = µt γσ ρ St dt + σ St dWt i i − i i If St is the price of a traded security, we need r = µt γσ ρ . The risk i − i i premium on the security is µt r = γσ ρ .

How do things( change∫ if) the( pricing∫ kernel) is( given∫ by: ) − T E − T ⊤ E − T ⊤ Mt,T = exp t rs ds t γs dWs t γs dZs , i where Zt is another Brownian motion independent of Wt or Wt .

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 53 / 78 Measure change defined by exponential martingales: Jumps

Let X denote a pure jump process with its compensator being ν(x, t) under P. Consider a measure change defined by the exponential martingale: dQ E − dP t = ( γXt ), The compensator of the jump process under Q becomes: ν(x, t)Q = e−γx ν(x, t). Example: Merton (176)’s compound Poisson jump process, − 2 − (x µJ ) 1 2σ2 ν(x, t) = λ √ e J . Under Q, it becomes σJ 2π Q 2 (x−µ )2 (x−µ ) −γx− J − J Q Q √1 2σ2 Q √1 2σ2 2 ν(x, t) = λ e J = λ e J with µ = µJ − γσ σJ 2π σJ 2π J J Q 1 γ(γσ2−2µ ) and λ = λe 2 J J .

If we start with a symmetric distribution (µJ = 0) under P, positive risk aversion (γ > 0) leads to a negatively skewed distribution under Q. Vassilis Polimenis, Skewness Correction for Asset Pricing, wp.

K¨uchler& Sørensen, Exponential Families of Stochastic Processes, Springer, 1997. Reference: ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 54 / 78 Implied volatility

Recall the BMS formula:

Ft,T  1 2 − −r(T −t) ln K 2 σ (T t) c(S, t, K, T ) = e [Ft,T N(d1) − KN(d2)] , d1,2 = √ σ T − t

The BMS model has only one free parameter, the asset return volatility σ. Call and put option values increase monotonically with increasing σ under BMS. Given the contract specifications (K, T ) and the current market observations (St , Ft , r), the mapping between the option price and σ is a unique one-to-one mapping. The σ input into the BMS formula that generates the market observed option price (or sometimes a model-generated price) is referred to as the implied volatility (IV). Practitioners often quote/monitor implied volatility for each option contract instead of the option invoice price...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 55 / 78 The relation between option price and σ under BMS

45 50 K=80 K=80 40 K=100 45 K=100 K=120 K=120 35 40 t t 35 30 30 25 25 20 20

15 Put option value, p Call option value, c 15 10 10

5 5

0 0 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Volatility, σ Volatility, σ An option value has two components: Intrinsic value: the value of the option if the underlying price does not move (or if the future price = the current forward). Time value: the value generated from the underlying price movement. Since options give the holder only rights but no obligation, larger moves generate bigger opportunities but no extra risk — Higher volatility increases the option’s time value. At-the-money option price is approximately linear in implied volatility.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 56 / 78 Implied volatility versus σ

If the real world behaved just like BMS, σ would be a constant. In this BMS world, we could use one σ input to match market quotes on options at all days, all strikes, and all maturities. Implied volatility is the same as the security’s return volatility (standard deviation). In reality, the BMS assumptions are violated. With one σ input, the BMS model can only match one market quote at a specific date, strike, and maturity.

The IVs at different (t, K, T ) are usually different — direct evidence that the BMS assumptions do not match reality. IV no longer has the meaning of return volatility. IV still reflects the time value of the option. The intrinsic value of the option is model independent (e.g., e−r(T −t)(F − K)+ for call), modelers should only pay attention to time value.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 57 / 78 Implied volatility at (t, K, T )

At each date t, strike K, and expiry date T , there can be two European options: one is a call and the other is a put. The two options should generate the same implied volatility value to exclude arbitrage. Recall put-call parity: c − p = er(T −t)(F − K). The difference between the call and the put at the same (t, K, T ) is the forward value. The forward value does not depend on (i) model assumptions, (ii) time value, or (iii) implied volatility. At each (t, K, T ), we can write the in-the-money option as the sum of the intrinsic value and the value of the out-of-the-money option: If F > K, call is ITM with intrinsic value er(T −t)(F − K), put is OTM. Hence, c = er(T −t)(F − K) + p. If F < K, put is ITM with intrinsic value er(T −t)(K − F ), call is OTM. Hence, p = c + er(T −t)(K − F ). If F = K, both are ATM (forward), intrinsic value is zero for both options. Hence, c = p...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 58 / 78 Implied volatility quotes on OTC currency options

At each date t and each fixed time-to-maturity τ = (T − t), OTC currency options are quoted in terms of Delta-neutral implied volatility (ATMV): A straddle is a portfolio of a call & a put at the same strike. The strike here is set to make the portfolio delta-neutral: q(T −t) − q(T −t) − ⇒ 1 ⇒ e N(d1) e N( d1) = 0 N(d1) = 2 d1 = 0.

25-delta : RR25 = IV (∆c = 25) − IV (∆p = 25).

25-delta butterfly spreads: BF25 = (IV (∆c = 25) + IV (∆p = 25))/2 − ATMV . When trading currency options OTC, options and the corresponding BMS delta of the underlying are exchanged at the same time.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 59 / 78 From delta to strikes

Given these quotes, we can compute the IV at the three (strike) levels: IV = ATMV at d1 = 0 IV = BF25 + ATMV + RR25/2 at ∆c = 25% IV = BF25 + ATMV − RR25/2 at ∆p = 25% The three strikes at the three deltas can be inverted as follows: ( ) 1 2 K = F exp ( 2 IV τ √ ) at d1 = 0 1 2 − −1 qτ K = F exp ( 2 IV τ N (∆c e )IV √τ ) at ∆c = 25% 1 2 −1 | | qτ K = F exp 2 IV τ + N ( ∆p e )IV τ at ∆p = 25%

Put-call parity is guaranteed in the OTC quotes: one implied volatility at each delta/strike. These are not exactly right... They are simplified versions of the much messier real industry convention.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 60 / 78 Violations of put-call parities

On the exchange, calls and puts at the same maturity and strike are quoted/traded separately. It is possible to observe put-call parity being violated at some times. Violations can happen when there are market frictions such as short-sale constraints. For American options (on single name stocks), there only exists a put-call inequality. The “effective” maturities of the put and call American options with same strike and expiry dates can be different. The option with the higher chance of being exercised early has effectively a shorter maturity. Put-call violations can predict future spot price movements. One can use the call and put option prices at the same strike to o − −rτ qτ compute an “option implied spot price:” St = (ct pt + e K) e . The difference between the implied spot price and the price from the − o stock market can contain predictive information: St St . Compute implied forward for option pricing model estimation...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 61 / 78 The information content of the implied volatility surface

At each time t, we observe options across many strikes K and maturities τ = T − t. When we plot the implied volatility against strike and maturity, we obtain an implied volatility surface. If the BMS model assumptions hold in reality, the BMS model should be able to match all options with one σ input. The implied volatilities are the same across all K and τ. The surface is flat. We can use the shape of the implied volatility surface to determine what BMS assumptions are violated and how to build new models to account for these violations. For the plots, do not use K, K − F , K/F or even ln K/F as the moneyness measure. Instead, use a standardized measure, such as ln K/√F ATMV τ , d2, d1, or delta. Using standardized measure makes it easy to compare the figures across maturities and assets...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 62 / 78 Implied volatility smiles & skews on a stock

AMD: 17−Jan−2006 0.75

0.7

0.65

Short−term smile 0.6

0.55 Implied Volatility

0.5 Long−term skew

0.45

Maturities: 32 95 186 368 732 0.4 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 Moneyness= ln(K/F ) σ√τ ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 63 / 78 Implied volatility skews on a stock index (SPX)

SPX: 17−Jan−2006 0.22

0.2

0.18 More skews than smiles

0.16

0.14 Implied Volatility

0.12

0.1

Maturities: 32 60 151 242 333 704

0.08 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 Moneyness= ln(K/F ) σ√τ ...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 64 / 78 Average implied volatility smiles on currencies

JPYUSD GBPUSD 14 9.8

9.6 13.5 9.4 13 9.2

12.5 9

8.8 12 Average implied volatility Average implied volatility 8.6 11.5 8.4

11 8.2 10 20 30 40 50 60 70 80 90 10 20 30 40 50 60 70 80 90 Put delta Put delta

Maturities: 1m (solid), 3m (dashed), 1y (dash-dotted)

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 65 / 78 Return non-normalities and implied volatility smiles/skews

BMS assumes that the security returns( (continuously) compounding) are ∼ − 1 2 2 normally distributed. ln ST /St N (µ 2 σ )τ, σ τ . µ = r − q under risk-neutral probabilities. A smile implies that actual OTM option prices are more expensive than BMS model values. ⇒ The probability of reaching the tails of the distribution is higher than that from a normal distribution. ⇒ Fat tails, or (formally) leptokurtosis. A negative skew implies that option values at low strikes are more expensive than BMS model values. ⇒ The probability of downward movements is higher than that from a normal distribution. ⇒ Negative skewness in the distribution. Implied volatility smiles and skews indicate that the underlying security return distribution is not normally distributed (under the risk-neutral measure —We are talking about cross-sectional behaviors, not time series)...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 66 / 78 Quantifying the linkage

( ) Skew. Kurt. ln K/F IV (d) ≈ ATMV 1 + d + d 2 , d = √ 6 24 σ τ

If we fit a quadratic function to the smile, the slope reflects the skewness of the underlying return distribution. The curvature measures the excess kurtosis of the distribution. A normal distribution has zero skewness (it is symmetric) and zero excess kurtosis. This equation is just an approximation, based on expansions of the normal density (Read “Accounting for Biases in Black-Scholes.”) The currency option quotes: Risk reversals measure slope/skewness, butterfly spreads measure curvature/kurtosis. Check the VOLC function on Bloomberg.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 67 / 78 Revisit the implied volatility smile graphics

For single name stocks (AMD), the short-term return distribution is highly fat-tailed. The long-term distribution is highly negatively skewed. For stock indexes (SPX), the distributions are negatively skewed at both short and long horizons. For currency options, the average distribution has positive butterfly spreads (fat tails). Normal distribution assumption does not work well. Another assumption of BMS is that the return volatility (σ) is constant — Check the evidence in the next few pages.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 68 / 78 Stochastic volatility on stock indexes

SPX: Implied Volatility Level FTS: Implied Volatility Level 0.5 0.55

0.5 0.45 0.45 0.4 0.4 0.35 0.35

0.3 0.3

0.25 0.25 Implied Volatility Implied Volatility 0.2 0.2 0.15 0.15 0.1

0.1 0.05 96 97 98 99 00 01 02 03 96 97 98 99 00 01 02 03

At-the-money implied volatilities at fixed time-to-maturities from 1 month to 5 years.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 69 / 78 Stochastic volatility on currencies

JPYUSD GBPUSD 28 12 26 24 11

22 10 20 9 18

16 8

Implied volatility 14 Implied volatility 7 12 6 10 8 5

1997 1998 1999 2000 2001 2002 2003 2004 1997 1998 1999 2000 2001 2002 2003 2004

Three-month delta-neutral straddle implied volatility.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 70 / 78 Stochastic skewness on stock indexes

SPX: Implied Volatility Skew FTS: Implied Volatility Skew 0.4 0.4

0.35 0.35

0.3 0.3 0.25 0.25 0.2 0.2 0.15 0.15 0.1

0.1

Implied Volatility Difference, 80%−120% Implied Volatility Difference, 80%−120% 0.05

0.05 0 96 97 98 99 00 01 02 03 96 97 98 99 00 01 02 03

Implied volatility spread between 80% and 120% strikes at fixed time-to-maturities from 1 month to 5 years.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 71 / 78 Stochastic skewness on currencies

JPYUSD GBPUSD

50 10 40

30 5

20 0

10 −5 RR10 and BF10 RR10 and BF10 0 −10 −10

−20 −15 1997 1998 1999 2000 2001 2002 2003 2004 1997 1998 1999 2000 2001 2002 2003 2004

Three-month 10-delta risk reversal (blue lines) and butterfly spread (red lines).

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 72 / 78 What do the implied volatility plots tell us?

Returns on financial securities (stocks, indexes, currencies) are not normally distributed. They all have fatter tails than normal (most of the time). The distribution is also skewed, mostly negative for stock indexes (and sometimes single name stocks), but can be of either direction (positive or negative) for currencies.

The return distribution is not constant over time, but varies strongly. The volatility of the distribution is not constant. Even higher moments (skewness, kurtosis) of the distribution are not constant, either.

A good option pricing model should account for return non-normality and its stochastic (time-varying) feature.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 73 / 78 Abnormal implied volatility shapes

Sometimes, the implied volatility plot against moneyness can show weird shapes: Implied volatility frown — This does not happen often, but it does happen. It is more difficult to model than a smile.

The implied volatility shape around corporate actions such as mergers, takeovers, etc. Although most of our models are time homogeneous, calendar day effects are important considerations in practice. How to reconcile the two? Build a business calendar and modify the option time to maturity based on business activities: Treat non-trading days as zero (or a small fraction) of one day. Treat each earnings announcement date as two days. Pre-process the data as much as possible before applying to a model. Most deterministic components should be pre-processed.

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 74 / 78 Institutional applications of options:

⇒ gain volatility exposures Buy/sell an out-of-the-money option and delta hedge. Daily delta hedge is a requirement for most institutional volatility investors and options market makers. Call or put is irrelevant. What matter is whether one is long/short vega.

Delta hedged P&L∫ is equal to dollar-gamma weighted variance T 2 r(T −t) 2 2 St n(d1(St√,t,IV0)) difference: PLT = e (σ − IV ) dt. 0 t 0 2 St IV0 T −t Views/quotes are expressed not in terms of dollar option prices, but rather in terms of implied volatilities (IV). Implied volatilities are calculated from the Black-Merton-Scholes (BMS) model. The fact that practitioners use the BMS model to quote options does not mean they agree with the BMS assumptions. Rather, they use the BMS model as a way to transform/standardize the option price, for several practical benefits...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 75 / 78 Why BMS implied volatility?

There are several practical benefits in transforming option prices into BMS implied volatilities.

1. Information: It is much easier to gauge/express views in terms of implied volatilities than in terms of option prices. Option price behaviors all look alike under different dynamics: Option prices are monotone and convex in strike... By contrast, how implied volatilities behavior against strikes reveals the shape of the underlying risk-neutral return distribution. A flat implied volatility plot against strike serves as a benchmark for a normal return distribution.

Deviation from a flat line reveals deviation from return normality. ⇒ Implied volatility smile — leptokurtotic return distribution ⇒ Implied volatility smirk — asymmetric return distribution

...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 76 / 78 Why BMS implied volatility?

2. No arbitrage constraints: Merton (1973): model-free bounds based on no-arb. arguments: Type I: No-arbitrage between European options of a fixed strike and maturity vs. the underlying and cash: call/put prices ≥ intrinsic; call prices ≤ (dividend discounted) stock price; put prices ≤ (present value of the) ; put-call parity. Type II: No-arbitrage between options of different strikes and maturities: bull, bear, calendar, and butterfly spreads ≥ 0. Hodges (1996): These bounds can be expressed in implied volatilities. Type I: Implied volatility must be positive. ⇒If market makers quote options in terms of a positive implied volatility surface, all Type I no-arbitrage conditions are automatically guaranteed. 3. Delta hedge: The standard industry practice is to use the BMS model to calculate delta with the implied volatility as the input. No delta modification consistently outperforms this simple practice in all

practical situations...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 77 / 78 When to get away from implied volatility?

At very short maturities for out-of-the-money options. The implied volatility is computed from a pure diffusion model, under which out-of-money option value approaches zero with an exponential rate (very fast) as the time to maturity nears zero. The chance of getting far out there declines quickly in a diffusive world. In the presence of jumps, out-of-money option value approaches zero at a much lower rate (linear). The chance of getting far out there can still be significant if the process can jump over. The two difference convergence rates imply that implied volatilities for out-of-money options will blow up (become infinity) as the option maturity shrinks to zero, if the underlying process can jump. One can use the convergence rate difference to test whether the underlying process is purely continuous or contains jumps (Carr & Wu (2003, JF).) Bid-ask spread can also make the implied volatility calculation

difficult/meaningless at very short maturities...... Liuren Wu (Baruch) Option Pricing Introduction Options Markets 78 / 78