<<

Risk and Management Spring 2010

Arbitrage Pricing Theory: Theory and Applications To Financial Data Analysis Basic equation

= Et equity in a trading account at time t (liquidation value) = + Δ Rit return on i from time t to time t t (includes income) = Qit dollars invested in stock i at time t r = interest rate

N N = + Δ + − ⎛ ⎞ Δ ()+ Δ Et+Δt Et Et r t ∑Qit Rit ⎜∑Qit ⎟r t before rebalancing, at time t t i=1 ⎝ i=1 ⎠

N N N () = + Δ + − ⎛ ⎞ Δ + + Δ Et+Δt Et Et r t ∑Qit Rit ⎜∑Qit ⎟r t ∑| Qi(t+Δt) - Qit | after rebalancing, at time t t i=1 ⎝ i=1 ⎠ i=1

ε = transaction cost (as percentage of stock price) ε

N N = + Δ + − ⎛ ⎞ Δ Et+Δt Et Et r t ∑Qit Rit ⎜∑Qit ⎟r t i=1 ⎝ i=1 ⎠

N ∑ Qit Ratio of (gross) i=1 Leverage = to equity Et

≥ Qit 0 `` - only '' N ≥ = = Qit 0, ∑Qit Et Leverage 1, long only position i=1 Reg - T : Leverage ≤ 2 () accounts for retail Day traders : Leverage ≤ 4 Professionals & institutions : - based leverage θ

θ Portfolio Theory Introduce dimensionless quantities and view returns as random variables

Q N = i Leverage = Dimensionless ``portfolio i ∑ i weights’’ Ei i=1 ΔΠ E − E − E rΔt ΔE = t+Δt t t = − rΔt Π E E t θ ~ All investments financed = − Δ θ Ri Ri r t (at known IR) σ ΔΠ N ~ = Rθ i Π ∑ i θ i=1

θ ΔΠ N ~ ΔΠ N ~ ~ N θ σ ⎛ ⎞ ⎛ ⎞ 2 ⎛ ⎞ ⎛ ⎞ σ E = E Ri ; = Cov Ri , R j = ρ ⎜ Π ⎟ ∑ i ⎜ ⎟ ⎜ Π ⎟ ∑ i j ⎜ ⎟ ∑ i j i j ij ⎝ ⎠ i=1 ⎝ ⎠ ⎝ ⎠ ij=1 ⎝ ⎠ ij=1 θ Sharpeθ Ratio θ σ ⎛ ΔΠ ⎞ N ⎛ ~ ⎞ E E R ⎜ Π ⎟ ∑ i ⎜ i ⎟ ⎝ ⎠ i=1 ⎝ ⎠ s = s(),..., = = θ 1 N ⎛ ΔΠ ⎞ N θ σ ⎜ ⎟ σ ρ Π ∑ i j i j ij ⎝ ⎠ i=1

Sharpe ratio is homogeneous of degree zero in the portfolio weights. Hence, it is independent of the leverage.

Investors who prefer large returns & low may use the as a metric to evaluate investment portfolios.

Quoting a fund manager (in 2006): ``I don’t eat volatility’’. (HF managers are compensated as % of returns, not Sharpe ratio ☺.) Pricing Theory (Ross, 1971)

Assumptions:

•There are N •m tradable as factors (e.g. ETFs or baskets, or portfolios) • Linear regression of stock returns on factors has uncorrelated residuals (uncorrelated with the factors and among each other)

~ m ~ = + + Ri i ∑ ik F k i k =1 ~ = − Δ = Ri Ri r t return onα stock i, financed ~ β = − Δ = F k Fk r t return on factor k, financedε ε = i residual of the linear regression of stock ret. on factor returns α = i excess returns Market-neutral portfolios

Consider the following investment portfolio:

⎧Qi dollars in stock i ⎪ ⎨ N − Q β dollars in k ⎪ ∑ i ik ⎩ i=1 β ΔΠ N ~ m ⎛ N ⎞ ~ = Q R − ⎜ Q ⎟ F Π ∑ i i ∑∑ i ik k i=1 kβ==11⎝ i ⎠ ⎛ ⎞ N ⎜ ~ M ~ ⎟ = Q αR − F ∑∑i ⎜ i ik k ⎟ i==11⎜ εk ⎟ ⎝ ⎠α N NN = ()+ = + ε ∑Qi i i ∑∑Qi i Qi i i=1 i1==1 i

Market neutral portfolios are only exposed to residuals θ α σ

θ σ ε θ Diversifiedα Market-neutral portfolio

⎛ ΔΠ ⎞ N ⎛ ΔΠ ⎞ N E = 2 = 2 2 () ⎜ Π ⎟ ∑ i i ⎜ Π ⎟ ∑ i i ⎝ ⎠ αi=1 ⎝ ⎠ i=1 ()σ Choose weights of sign opposite to sign of = i i σ N ε excess return & uniformly distributed

⎛ ΔΠ ⎞ 1 N α ⎛ ΔΠ ⎞ 1 N E = , 2 = 2 () ⎜ ⎟ ∑ i ⎜ ⎟ 2 ∑ i Π N = Π N = ⎝ ⎠ i 1 ⎝ ⎠ i α1 σ α ε ∞ N 1 dF() ∑ i ∫ σ α N = −∞ s = i 1 ≈ N ⋅ σ = N 1 N ∞ σ 2 2 () 2 ()2 2 ∑ i dG N = ∫ i 1 0 No-arbitrage argument (S. Ross, 1971/1974)

If a significant fraction of the excess returns are non-zero, i.e. if α > 0 investing in the diversified MN portfolio would rise to an infinite Sharpe ratio as N tends to infinity (or, for large enough N, a huge Sharpe ratio).

This would represent a clear arbitrage opportunity which would be eventually eliminated by trading.

Conclusion, if the ``factor model’’ holds true with tradable factors and uncorrelated residuals then excess returns should vanish and stocks returns should satisfy the normative model β ~ m ~ = ε + ()= Excess returns are zero Ri ∑ ik Fk i with E i 0 k =1 ε

~ m ~ This can be seen as an ⎛ ⎞ = β ⎛ ⎞ E⎜ Ri ⎟ ∑ ik E⎜ Fk ⎟ ``equilibrium’’ pricing relation ⎝ ⎠ k=1 ⎝ ⎠ of sorts. Capital Asset Pricing Model (Lintner, Sharpe, 1965)

Theβ capital asset-pricing model is a one-factor model to explain stock returnsε and stock prices. ε ε ε

~ ~ = + = = ≠ Ri i F 1 i , i 0, i j 0, if i j

~ β F 1 = return of the (cap - weighted market portfolio) β

~ ~ ()σ ⎛ ⎞ ⎛ ⎞ Cov Ri , F1 ρi i E Ri = E F 1 , = = ⎜ ⎟ i ⎜ ⎟ i ()σ ⎝ ⎠ ⎝ ⎠ Var FM M Remarks & comments

APT introduces a probability measure on the set of returns, which is not specified explicitly. The existence of such probability would imply stationarity of returns, which in practice does not hold.

Technicality: the diversified portfolio constructed here has finite leverage as N tends to infinity. This follows from the fact that there are finitely many factors and that the leverage on the ``stocks’’ before hedging with the factors is 1.

APT, as presented here, makes no assumptions about dynamics, i.e. portfolios that vary in time, or about market-timing strategies, holding periods, etc.

The theory was made before the proliferation of ETFs, which are tradable factors.

ETFs provide an important set of factors (sectors, size, etc) that can be used to make APT ``concrete’’ and applicable. Testing APT with data from Jan 5, 2009 to Jan 29 2010

Using daily returns from Jan 5 2009 to Jan 29, 2010 for the components of the S&P 500 , we explored the ``partition’’ of the eigenvectors/eigenvalues into significant and noise components.

We did this by testing for α = 0 and for the correlations of residuals.

One way to analyze the correlations of residuals is by doing a PCA again and analyzing the corresponding eigenvalues and DOS.

The assumption that residuals are uncorrelated allows comparison with theoretical eigenvalue distributions such as Marcenko-Pastur. One Factor Model (CAPM)

1. Compute correlation matrix of S&P 500

2. Compute the first eigenportfolio

3. Compute residuals for all 500 stocks by regression α ~ β ~ = + + ε Ri i i F 1 i 4. Analyze the vector of alphas

5. Analyze the correlation matrix of the residuals Sorted Excess Returns, 1-factor

AIG is an outlier. Sorted Excess Returns 1-factor, excluding AIG

Max=60 pbs, Min=-40 bps, average=1.9 bps, stdev=11bps Eigenvalues of the correlation matrix of residuals (m=1)

λ = 1 33.59 λ 33.59 1 = = 6.7% N 500

Recall that lambda_1 for the original correlation matrix was ~ 220, so the residuals matrix has ``smaller’’ correlations. Ratio lambda/N is a proxy for the average correlation. λ

First eigenvalue & average correlation

= (1)T ρ ()1 1 V CV ρN = ()(1) 2 + (1) (1) ∑ Vi ∑Vi V j ij i=1 i≠ j = + (1) (1) 1 ∑Vi V j ij i≠ j ρ (1) (1) ∑Vi V j ij ⎛ ⎞ ≠ =1+ ⎜ V (1)V (1) ⎟⋅ i j ⎜∑ i j ⎟ (1) (1) λ ⎝ i≠ j ⎠ ∑Vi V j i≠ j ρ (1) (1) ∑Vi V j ij −1 ≠ 1 N()N −1 1 = i j ∴ V (1) ≈ , V (1)V (1) ≈ = N −1 λ (1) (1) (1) (1) i ∑ i j ∑Vi V j ∑Vi V j N i≠ j N i≠ j ρ i≠ j

−1 ρ λ 1 ≈ ≈ 1 N −1 N

Density of States, or Histogram, of Eigenvalues

Lambda_1 = 33.59 A lot of detached evs suggests that there are additional factors. Sorted excess returns, m=15 (without AIG) Top 100 eigenvalues of the correlation matrix of residuals (m=15)

λ = 1 8.91 λ 8.91 1 = =1.8% N 500

Sorted Excess Returns, m=20 Sorted Eigenvalues, m=20

λ = 1 8.73 λ 8.73 1 = =1.74% N 500 Density of States m=20 λ = 1 7.75 λ 7.75 1 = =1.6% N 500 Density of States m=30 Excess returns () as a function of the number of eigenportfolios (m)

average mmaxmin averageabs stdev 1 0.6283% -0.3941% 0.0196% 0.0776% 0.1129% 15 0.5095% -0.5096% 0.0065% 0.0687% 0.1004% 20 0.5095% -0.5485% 0.0968% 0.0667% 0.0968% 30 0.5095% -0.3957% 0.0049% 0.0664% 0.0960% Marcenko-Pastur Distribution for the DOS of a Random Correlation Matrix

Theoremγ :LetX be a T by N matrix of standardized normal random variables and let C=X’X. Then, the DOS of C approaches the Marcenko Pastur distribution as N,T tend to infinity withλ the ratio N/T held constant. γ λ = λN = ()+ 2 = ()− 2 + 1 − 1γ T γ δ λ λ + πγ ()()−λ λ − = ⎛ − 1 ⎞ ()+ 1 + λ − MP( ) ⎜1 ⎟ ⎝ ⎠ 2 λ Marcenko-Pastur Distribution gamma=500/269= 1.858736 Interpreting the DOS for the residuals in terms of Marcenko Pastur

Delta mass with weight = 1-1/1.8587=0.46 (degeneracy)

Remaining DOS Marcenko Pastur compared to data with m=30

x Evaluating the use of ETFs as factors in APT

We found out how many eigenportfolios are needed approximately to explain the systematic portion of stock returns using panel data for stock returns.

We obtained a matrix of random residuals if we choose m=15 or higher.

Since eigenportfolios are not tradable (except perhaps for the first one), this leaves us with the identification problem.

We perform an analysis of APT using sector ETFs as factors.

Three experiments:

* Multiple regression on 19 ETFs

* Matching pursuit on 19 ETFs

* Association of a single ETF to each stock Sorted Excess Returns, Factors=19 ETFs Multiple Regression

average average abs stdev 0.0390% 0.2755% 0.1167% (AIG not included) Density of States, residuals with 19 ETFs After removing mass at zero (19 etfs, MR) (Red line=Marcenko-Pastur)

Lambda_max=11.9613 Excess Returns: Matching Pursuit, 19 ETFs (w/o AIG)

average average abs stdev 0.0405% 0.0799% 0.0904% Noise Spectrum for Matching Pursuit Residuals

Lambda_max=12.02 Excess Returns after projecting on the corresponding industry ETF for each stock

average average abs stdev 0.0461% 0.0823% 0.1153% 100 top eigenvalues for residuals after removing industry ETF

Lambda_1 = 17.52 Density of States for Correlation matrix of residuals after removing the sector ETF for each stock Dynamics

So far, the analysis that we made assumes a fixed window.

We should ask ourselves how these relationships change across time, i.e. if the factor count and the R-squared that we obtained are stable across time.

This is particularly important if the factor model is used for hedging or for relative of stocks with respect to ETFs.

The following charts show some results of the PCA analysis viewed as time passes, using a moving window to calculate the eigenvalues and eigenvectors. Number of significant eigenvectors at 55% level

55% explained variance 35

30

25

20

15

10

5

0

Number of Eigen vectors Fixed number of eigenvectors (factors)

Explained Variance for 15 Eigenvalues 0.6

0.58

0.56

0.54

0.52

0.5

0.48

0.46

0.44

0.42

0.4

Explained Variance Number of factors explaining 55% of the variance versus VIX volatility index (2002-2008)