Super-Replication of the Best Pairs Trade in Hindsight
Alex Garivaltis∗ March 18, 2019
Abstract This paper derives a robust on-line equity trading algorithm that achieves the greatest possible percentage of the final wealth of the best pairs rebalancing rule in hindsight. A pairs rebalancing rule chooses some pair of stocks in the market and then perpetually executes rebalancing trades so as to maintain a target fraction of wealth in each of the two. After each discrete market fluctu- ation, a pairs rebalancing rule will sell a precise amount of the outperforming stock and put the proceeds into the underperforming stock. Under typical conditions, in hindsight one can find pairs rebalancing rules that would have spectacularly beaten the market. Our trading strategy, which extends Ordentlich and Cover’s (1998) “max-min universal portfolio,” guaran- tees to achieve an acceptable percentage of the hindsight-optimized wealth, a percentage which tends to zero at a slow (polynomial) rate. This means that on a long enough investment horizon, the trader can enforce a compound-annual growth rate that is arbitrarily close to that of the best pairs rebalancing rule in hindsight. The strategy will “beat the market asymptotically” if there turns out to exist a pairs rebalancing rule that grows capital at a higher asymptotic rate than the market index. The advantages of our algorithm over the Ordentlich and Cover (1998) strat- egy are twofold. First, their strategy is impossible to compute in practice. Sec- ond, in considering the more modest benchmark (instead of the best all-stock arXiv:1810.02444v3 [q-fin.PM] 14 Mar 2019 rebalancing rule in hindsight), we reduce the “cost of universality” and achieve a higher learning rate.
Keywords: Super-replication, Pairs trading, Correlation options, Constant- rebalanced portfolios, Universal portfolios, Kelly criterion, Robust procedures, Minimax
JEL Classification: C44, D81, D83, G11, G12, G13
∗Assistant Professor of Economics, Northern Illinois University, 514 Zulauf Hall, DeKalb IL 60115. E-mail: [email protected]. ORCID: 0000-0003-0944-8517.
1 1 Introduction
1.1 Literature Review
The theory of asymptotic portfolio growth was initiated by Kelly (1956), who con- sidered repeated bets on horse races with odds that diverge from the true win prob- abilities. Kelly set forth the natural goal of optimizing the asymptotic growth rate of one’s capital. This implies that one should act each period so as to maximize the expected log of his capital. By the Law of Large Numbers, the realized per-period continuously-compounded growth rate converges to the expected growth rate. The Kelly rule was used by Beat the Dealer author Edward O. Thorp (1966) to properly size his bets at the Nevada blackjack tables. For example, imagine a situ- ation where you have a 50.5% chance of winning the next hand. What percentage of your net worth should you bet? The classical mean-variance (Markowitz 1952) theory has no answer, except to say that it depends on your particular appetite for risk. For instance, the extreme choices of betting 0 percent or betting 100 percent are both undominated in the mean-variance plane. The Kelly criterion gives a much more satisfactory answer: bet 50.5% − 49.5% = 1% of your wealth. This achieves the (optimum) capital growth rate of 0.005% per hand played in this (favorable) situa- tion. By the rule of 72, you would expect to double your wealth after approximately 72/0.005 = 14, 400 hands. Thus, it became clear to many people that the log-optimal portfolio theory should replace mean-variance as the dominant decision criterion. Breiman (1961) proved that the Kelly rule outperforms any “essentially different strategy” by an exponential factor, and it has the shortest mean waiting time to reach a distant wealth goal. Thorp’s (2017) biography discusses his use of log-optimal portfolios in his money management career on Wall Street. Cover’s (1987) survey and his information theory textbook (2006) are excellent primers of the theory of asymptotic growth. Cover and Gluss (1986) were the first to exhibit an on-line trading algorithm that could achieve the Kelly growth rate even when starting in total ignorance of the return process. Assuming finitely-supported returns, they applied Blackwell’s (1956) approachability theorem to get a trading strategy that grows wealth at the same asymptotic rate as the best rebalancing rule (or fixed-fraction betting scheme) in hindsight. Thus began a whole host of so-called “universal trading strategies” that, under mild conditions, “beat the market asymptotically” for highly arbitrary (e.g. nonstationary or serially correlated) return processes. Cover (1991) gave the first simple and intuitive universal portfolio, at the same time removing the restriction to finitely-supported returns. Jamshidian (1992) trans- planted Cover’s (1991) idea into a continuous-time market with several correlated stocks whose Itˆoprocesses have unknown, time-varying parameters that satisfy some asymptotic stability conditions. Ordentlich and Cover (1996) gave the “universal portfolio with side information,” along with more perspicuous proofs of the main (1991) regret bounds. For example, Thorp’s infamous “count” in Blackjack is a canonical source of side information. Ordentlich and Cover (1998) super-replicated the final wealth of the best rebal- ancing rule in hindsight at time-0, although they did not use the terminology of financial derivatives so thoroughly. It seems that their paper was not inspired so much by derivative pricing as it was by Shtarkov’s (1987) “universal source code” in information theory. Properly interpreted, the universal source code amounts to a robust scheme for betting on repeated horse races with unknown (and perhaps nonstationary) win probabilities. More recently, Iyengar (2005) has studied universal investment for discrete-time markets with two assets and proportional transaction costs. Stoltz and Lugosi (2005)
2 extended the game-theoretic notion of internal regret to the case of on-line portfolio selection problems. DeMarzo, Kremer, and Mansour (2006) used discrete-time on- line trading algorithms to derive no-arbitrage bounds for the prices of derivative securities. Gy¨orfi,Lugosi, and Udina (2006) gave universal procedures that find and exploit hidden complicated dependences of asset prices on the past evolution of the market. Kozat and Singer (2011) investigated semiconstant rebalanced portfolios that may (to avoid transaction costs) opt out of rebalancing altogether in selected investment periods.
1.2 Contribution
This paper offers a workaround for two practical problems encountered by would-be practitioners of Ordentlich and Cover’s (1998) max-min universal portfolio. First, for markets with many assets, the practitioner must wait a tremendously long time for his bankroll to “pull away” from the market averages. Second, the on-line portfolio weights are impossible to calculate in practice, since the Ordentlich-Cover algorithm requires large-scale computation of multilinear forms. The cleverest methods of com- putation either exhaust the computer’s memory or else they require eons of CPU time. Ordentlich and Cover’s (1998) max-min universal portfolio is only viable for markets with two or three stocks, at best. Naturally, we want procedures that work for a market with, say, 500 assets. Accordingly, we take up the more modest goal of performing well (at the end of the investment horizon) relative to the best pairs rebalancing rule in hindsight. Our notion of a “pairs rebalancing rule” allows for the degenerate possibility of keeping 100% of wealth in either of the two stocks. Thus, the best pairs rebalancing rule in hindsight will do at least as well as the best performing stock in the market. Our use
3 of a less aggressive benchmark leads to a computable trading strategy that “learns” more quickly, although in the long run its “understanding” of market dynamics will be somewhat less subtle than that of the original Ordentlich and Cover (1998) strategy.
1.3 Motivating Example
To motivate the paper, we use a continuous-time version of “Shannon’s Demon” (Poundstone 2010) to illustrate the fact that the possibility of “beating the market asymptotically” is no contradiction to the random walk model of stock prices. For simplicity, consider two stocks i ∈ {1, 2} that follow independent geometric Brownian motions. Suppose that the price processes Si(t) evolve according to
2 dSi(t) σ = dt + σ dWi(t), (1) Si(t) 2
where W1(t),W2(t) are independent unit Brownian motions. In Shannon’s original lecture (Poundstone 2010), at each (discrete) time step the stock price either doubled or got cut in half, each with equal probability. To match this tradition, we put σ = log 2 = 0.7. We have
Si(t) = eσWi(t) = 2Wi(t). (2) Si(0)
Note that lim log{Si(t)/Si(0)}/t = 0. This means that the stocks themselves have t→∞ zero asymptotic growth; they trade “sideways.” Now, consider a gambler who continuously maintains half his wealth in each stock. This is not a buy-and-hold strategy: the rebalancing rule dictates that he sell some shares of whichever stock performed better over [t, t + dt]. He puts the proceeds into the underperforming stock. If the trader starts with a dollar, his wealth V (t) evolves
4 according to
2 dV (t) 1 dS1(t) 1 dS2(t) σ σ = · + · = dt + [dW1(t) + dW2(t)]. (3) V (t) 2 S1(t) 2 S2(t) 2 2
Applying Itˆo’sLemma for functions of several diffusion processes (Wilmott 2001), we get σ V (t) = exp σ2t/4 + [W (t) + W (t)] . (4) 2 1 2
We thus have lim log[V (t)]/t = σ2/4 = 12%. From two dead-money substrates, the t→∞ gambler has manufactured continuous growth at a rate of 12% per unit time, leaving the market portfolio in the dust. Notice that this growth is merely the result of “volatility harvesting” (Poundstone 2010) or “volatility pumping” (Luenberger 1998). Note that the gambler has not attempted to guess which stock will outperform over the interval [t, t+dt] — rather, he just rebalances his portfolio after the fact. A sample path for Shannon’s Demon has been simulated in Figure 1. For a pair of correlated stocks, the dynamics will be substantially the same, albeit with a lower growth rate. It is an axiom of capital growth theory that one should seek out a pair of volatile, uncorrelated stocks. But which ones? For the Dow Jones (30) stocks, there are