Serializable Isolation for Snapshot Databases

Michael J. Cahill∗   Uwe Röhm   Alan D. Fekete
[email protected]   [email protected]   [email protected]
School of Information Technologies, University of Sydney
NSW 2006 Australia

∗ The author is also an employee of Oracle Corporation. This work was done while at the University of Sydney.

ABSTRACT

Many popular database management systems offer snapshot isolation rather than full serializability. There are well-known anomalies permitted by snapshot isolation that can lead to violations of data consistency by interleaving transactions that individually maintain consistency. Until now, the only way to prevent these anomalies was to modify the applications by introducing artificial locking or update conflicts, following careful analysis of conflicts between all pairs of transactions.

This paper describes a modification to the concurrency control algorithm of a database management system that automatically detects and prevents snapshot isolation anomalies at runtime for arbitrary applications, thus providing serializable isolation. The new algorithm preserves the properties that make snapshot isolation attractive, including that readers do not block writers and vice versa. An implementation and performance study of the algorithm are described, showing that the throughput approaches that of snapshot isolation in most cases.

Categories and Subject Descriptors

H.2.4 [Database Management]: Systems—Transaction processing

General Terms

Algorithms, Performance, Reliability

Keywords

Multiversion Concurrency Control, Serializability Theory, Snapshot Isolation

1. INTRODUCTION

Serializability is an important property when transactions execute because it ensures that integrity constraints are maintained even if those constraints are not explicitly declared to the DBMS. If a DBMS enforces that all executions are serializable, then developers do not need to worry that inconsistencies in the data might appear as artifacts of concurrency or failure. It is well known how to use strict two-phase locking (and various enhancements such as escrow locking and multigranularity locking) to control concurrency so that serializable executions are produced [11]. Some other concurrency control algorithms are known that ensure serializable execution, but these have not been adopted in practice, because they usually perform worse than a well-engineered implementation of strict two-phase locking (S2PL).

Snapshot isolation (SI) [3] is an alternative approach to concurrency control, taking advantage of multiple versions of each data item. In SI, a transaction T sees the database state as produced by all the transactions that committed before T starts, but no effects are seen from transactions that overlap with T. This means that SI never suffers from Inconsistent Reads. In a DBMS using SI for concurrency control, reads are never delayed because of concurrent transactions' writes, nor do reads cause delays in a writing transaction. In order to prevent Lost Update anomalies, SI does abort a transaction T when a concurrent transaction commits a modification to an item that T wishes to update. This is called the "First-Committer-Wins" rule.

Despite the nice properties of SI, it has been known since SI was formalized in [3] that SI allows non-serializable executions. In particular, it is possible for an SI-based concurrency control to interleave some transactions, where each transaction preserves an integrity constraint when run alone, but where the final state after the interleaved execution does not satisfy the constraint. This occurs when concurrent transactions modify different items that are related by a constraint, and it is called the Write Skew anomaly.
Example 1: Suppose that a table Duties(DoctorId, Shift, Status) represents the status ("on duty" or "reserve") for each doctor during each work shift. An undeclared invariant is that, in every shift, there must be at least one doctor on duty. A parametrized application program that changes a doctor D on shift S to "reserve" status can be written as in Figure 1.

This program is consistent, that is, it takes the database from a state where the integrity constraint holds to another state where the integrity constraint holds.


However, suppose there are exactly two doctors D1 and D2 who are on duty in shift S. If we run two concurrent transactions, which run this program for parameters (D1,S) and (D2,S) respectively, we see that using SI as concurrency control will allow both to commit (as each will see the other doctor's status for shift S as still unchanged, at "on duty"). However, the final database state has no doctor on duty in shift S, violating the integrity constraint.

    BEGIN TRANSACTION
    UPDATE Duties SET Status = 'reserve'
        WHERE DoctorId = :D
        AND Shift = :S
        AND Status = 'on duty'

    SELECT COUNT(DISTINCT DoctorId) INTO tmp
        FROM Duties
        WHERE Shift = :S
        AND Status = 'on duty'

    IF (tmp = 0) THEN ROLLBACK ELSE COMMIT

Figure 1: Example parameterized application exhibiting write skew

Despite the possibility of corrupting the state of the database, SI has become popular with DBMS vendors. It often gives much higher throughput than strict two-phase locking, especially in read-heavy workloads, and it also provides users with transaction semantics that are easy to understand. Many popular and commercially important database engines provide SI, and some in fact use SI when serializable isolation is requested [13].

Because SI allows data corruption, and is so common, there has been a body of work on how to ensure serializable executions when running with SI as concurrency control. The main techniques proposed so far [9, 8, 14] depend on doing a design-time static analysis of the application code, and then modifying the application if necessary in order to avoid the SI anomalies. For example, [9] shows how one can introduce write-write conflicts into the application, so that all executions will be serializable even on SI.

Making SI serializable using static analysis has a number of limitations. It relies on an education campaign so that application developers are aware of SI anomalies, and it is unable to cope with ad-hoc transactions. In addition, this must be a continual activity as an application evolves: the analysis requires the global graph of transaction conflicts, so every minor change in the application requires renewed analysis, and perhaps additional changes (even in programs that were not altered). In this paper, we instead focus on guaranteeing serializability for every execution of arbitrary transactions, while still having the attractive properties of SI, in particular much better performance than is allowed by strict two-phase locking.

1.1 Contributions

We propose a new concurrency control algorithm, called Serializable Snapshot Isolation, with the following innovative combination of properties:

• The concurrency control algorithm ensures that every execution is serializable, no matter what application programs run.

• The algorithm never delays a read operation; nor do readers cause delays in concurrent writes.

• Under a range of conditions, the overall throughput is close to that allowed by SI, and much better than that of strict two-phase locking.

• The algorithm is easily implemented by small modifications to a system that provides SI.

We have made a prototype implementation of the algorithm in the open source data management product Oracle Berkeley DB [16], and we evaluate the performance of this implementation compared to the product's implementations of Strict Two-Phase Locking (S2PL) and SI.

The key idea of our algorithm is to detect, at runtime, distinctive conflict patterns that must occur in every non-serializable execution under SI, and abort one of the transactions involved. This is similar to the way serialization graph testing works; however, our algorithm does not operate purely as a certification at commit-time, but rather aborts transactions as soon as the problem is discovered; also, our test does not require any cycle-tracing in a graph, but can be performed by considering conflicts between pairs of transactions, and a small amount of information which is kept for each of them. Our algorithm is also similar to optimistic concurrency control [15] but differs in that it only aborts a transaction when a pair of consecutive conflict edges is found, which is characteristic of SI anomalies. This should lead to significantly fewer aborts than optimistic techniques that abort when any single conflict edge is detected. Our detection is conservative, so it does prevent every non-serializable execution, but it may sometimes abort transactions unnecessarily.

The remainder of the paper is structured as follows: in Section 2 we give an introduction to snapshot isolation and current approaches to ensuring serializable isolation with SI; in Section 3 we describe the new Serializable SI algorithm; in Section 4 we describe the implementation in Oracle Berkeley DB and in Section 5 we evaluate its performance. Section 6 concludes.

2. BACKGROUND

2.1 Snapshot Isolation

SI is a concurrency control approach that uses multiple versions of data to provide non-blocking reads. When a transaction T starts executing, it gets a conceptual timestamp start-time(T); whenever T reads a data item x, it does not necessarily see the latest value written to x; instead T sees the version of x which was produced by the last to commit among the transactions that committed before T started and also modified x (there is one exception to this: if T has itself modified x, it sees its own version). Thus, T appears to execute against a snapshot of the database, which contains the last committed version of each item at the time when T starts.

SI also enforces an additional restriction on execution, called the "First-Committer-Wins" rule: it is not possible to have two concurrent transactions which both commit and both modify the same data item. In practice, implementations of SI usually prevent a transaction from modifying an item if a concurrent transaction has already modified it.
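The snapshot read rule and the First-Committer-Wins check can be made concrete with a small sketch. The following Python fragment is our own illustration, not code from any real SI implementation; names such as SnapshotStore and Version are invented for this example, and real-engine concerns such as durability, locking and garbage collection are ignored.

    # Toy illustration of SI reads and the First-Committer-Wins rule.
    class Version:
        def __init__(self, value, commit_time):
            self.value = value
            self.commit_time = commit_time   # time the creating transaction committed

    class SnapshotStore:
        def __init__(self):
            self.versions = {}   # item -> list of Versions, oldest first
            self.clock = 0

        def begin(self):
            self.clock += 1
            return {"start": self.clock, "writes": {}}   # transaction record

        def read(self, txn, item):
            # A transaction sees its own writes first...
            if item in txn["writes"]:
                return txn["writes"][item]
            # ...otherwise the last version committed before it started.
            visible = [v for v in self.versions.get(item, [])
                       if v.commit_time <= txn["start"]]
            return visible[-1].value if visible else None

        def write(self, txn, item, value):
            txn["writes"][item] = value

        def commit(self, txn):
            # First-Committer-Wins: abort if a concurrent transaction has
            # already committed a newer version of an item we modified.
            for item in txn["writes"]:
                for v in self.versions.get(item, []):
                    if v.commit_time > txn["start"]:
                        raise RuntimeError("update conflict (First-Committer-Wins)")
            self.clock += 1
            for item, value in txn["writes"].items():
                self.versions.setdefault(item, []).append(Version(value, self.clock))

In this sketch, read never blocks and never sees a version committed after the transaction began, while commit raises an update conflict if a concurrent transaction has already installed a newer version of an item this transaction modified.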


[Figure 2: Serialization graph for transactions exhibiting write skew: T1 and T2 joined by rw-dependency edges on x and y]

[Figure 3: Generalized dangerous structure in the MVSG: a pivot T0 with an incoming rw-dependency from TN and an outgoing rw-dependency to T1]

SI was introduced in the research literature in [3], and it has been implemented by the Oracle RDBMS, PostgreSQL, SQL Server 2005, and Oracle Berkeley DB. It provides significant performance improvements over serializability implemented with two-phase locking (S2PL) and it avoids many of the well-known isolation anomalies such as Lost Update or Inconsistent Read. In some systems that do not implement S2PL, including the Oracle RDBMS and PostgreSQL, SI is provided when serializable isolation is requested.

2.2 Write Skew

As noted in [3], SI does not guarantee that all executions will be serializable, and it can allow corruption of the data through interleaving between concurrent transactions which individually preserve the consistency of the data. Here is an execution that can occur under SI:

    r1(x=50,y=50) r2(x=50,y=50) w1(x=-20) w2(y=-30) c1 c2

This sequence of operations represents an interleaving of two transactions, T1 and T2, withdrawing money from bank accounts x and y, respectively. Each of the transactions begins when the accounts each contain $50, and each transaction in isolation maintains the constraint that x + y > 0. However, the interleaving results in x + y = -50, so consistency has been violated. This type of anomaly is called a write skew.

We can understand these situations using a multiversion serialization graph (MVSG). There are a number of definitions of this in the literature, because the general case is made complicated by uncertainty over the order of versions (which indeed renders it NP-hard to check for serializability of a multiversion schedule). For example, there are definitions in [5, 12, 17, 1].

With snapshot isolation, the definitions of the serialization graph become much simpler, as versions of an item x are ordered according to the temporal sequence of the transactions that created those versions (note that First-Committer-Wins ensures that among two transactions that produce versions of x, one will commit before the other starts). In the MVSG, we put an edge from one committed transaction T1 to another committed transaction T2 in the following situations: T1 produces a version of x, and T2 produces a later version of x (this is a ww-dependency); T1 produces a version of x, and T2 reads this (or a later) version of x (this is a wr-dependency); T1 reads a version of x, and T2 produces a later version of x (this is a rw-dependency).

In Figure 2 we show the MVSG for the history with write skew, discussed above. In drawing our MVSG, we will follow the notation introduced in [1], and use a dashed edge to indicate a rw-dependency.

As usual in transaction theory, the absence of a cycle in the MVSG proves that the history is serializable. Thus it becomes important to understand what sorts of MVSG can occur in histories of a system using SI for concurrency control. Adya [1] showed that any cycle produced by SI has two rw-dependency edges. This was extended by Fekete et al. in [9], which showed that any cycle must have two rw-dependency edges that occur consecutively, and further, each of these edges is between two concurrent transactions.

We adopt some terminology from [9], and call an rw-dependency between concurrent transactions a vulnerable edge; we call the situation where two consecutive vulnerable edges occur in a cycle a dangerous structure. It is illustrated in Figure 3. We refer to the transaction at the junction of the two consecutive vulnerable edges as a pivot transaction. The theory of [9] shows that there is a pivot in any non-serializable execution allowed by SI.

We take an interesting example from [10] to illustrate how a dangerous structure may occur at runtime. Consider the following three transactions:

    T0: r(y) w(x)
    T1: w(y) w(z)
    TN: r(x) r(z)

These three transactions can interleave such that TN, a read-only transaction, sees a state that could never have existed in the database had T0 and T1 executed serially. If TN is omitted, T0 and T1 are serializable because there is only a single anti-dependency from T0 to T1.

Two of the possible non-serializable interleavings of these three transactions are illustrated in Figure 4. These diagrams should be read from left to right; the arrows indicate the rw-dependencies between transactions. In Figure 4(a), both reads occur after the writes. In Figure 4(b), TN reads x before it is written by T0.

[Figure 4: SI anomalies at runtime: two interleavings of b0 r0(y) w0(x) c0, b1 w1(y) w1(z) c1 and bN rN(x) rN(z) cN. (a) Pivot commits last; (b) Reader commits last]
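To illustrate the terminology, the following sketch (our own simplification, not code from [1] or [9]) computes vulnerable edges and pivots for a set of committed SI transactions, each described by its read set, write set, start time and commit time.

    # Illustrative sketch: identify vulnerable edges and pivot transactions.
    def concurrent(t, u):
        # Two transactions overlap if neither committed before the other started.
        return t["start"] < u["commit"] and u["start"] < t["commit"]

    def vulnerable_edges(txns):
        # rw-dependency between concurrent transactions: t read an item that u,
        # running concurrently, wrote, so u's version is later than t's snapshot.
        edges = set()
        for t in txns:
            for u in txns:
                if t is not u and concurrent(t, u) and (t["reads"] & u["writes"]):
                    edges.add((t["name"], u["name"]))
        return edges

    def pivots(txns):
        edges = vulnerable_edges(txns)
        incoming = {dst for (_, dst) in edges}
        outgoing = {src for (src, _) in edges}
        return incoming & outgoing   # junctions of consecutive vulnerable edges

    # The write-skew history of Section 2.2.
    T1 = {"name": "T1", "reads": {"x", "y"}, "writes": {"x"}, "start": 1, "commit": 4}
    T2 = {"name": "T2", "reads": {"x", "y"}, "writes": {"y"}, "start": 2, "commit": 5}
    print(pivots([T1, T2]))   # {'T1', 'T2'}

Run on the write-skew transactions of Section 2.2, as in the last lines, it reports both T1 and T2, matching the two-edge cycle of Figure 2.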


Notice that there are no constraints on the commit order: when an anomaly occurs under SI, the transactions can go on to commit in any order. This observation is one of the challenges that an algorithm to detect SI anomalies at runtime must overcome: we cannot always know when a transaction commits whether it will have consecutive vulnerable edges.

2.3 Phantoms

Throughout the discussion so far, we have followed typical concurrency control theory and assumed that a transaction is a sequence of reads and writes on named data items. In general, a relational database engine must also deal with predicate operations (such as SQL "where" clauses). These mean that a concurrency control algorithm must also consider phantoms, where an item created or deleted in one transaction would change the result of a predicate operation in a concurrent transaction if the two transactions executed serially. The solution used in traditional two-phase locking is multigranularity locking [6], where a predicate operation takes intention locks on larger units, such as pages or tables, to prevent insertion of records that might match the predicate.

2.4 Related Work

An extensive line of research has considered, not the serializability of particular executions allowed by SI, but rather the question of whether a given set of application programs is guaranteed to generate serializable executions when run on a system with SI as the concurrency control mechanism. This problem was addressed in [7] and the techniques were refined in [9]. The key to this work is to consider a static analysis of the possible conflicts between application programs. Thus a static dependency graph, or SDG, is drawn, with an edge from program P1 to P2 if there can be an execution where P1 generates a transaction T1, P2 generates T2, and there is a dependency edge from T1 to T2. It was shown how a dangerous structure in the MVSG can be related to a similar structure in the SDG, and this justified the intuition of experts, who had previously decided that every execution of the TPC-C benchmark [20] is serializable on a platform using SI. As well as showing how to prove that certain programs generate only serializable executions, [9] proposed that one could modify transaction programs so that they fall into this class. The modifications typically involve introducing extra write-write conflicts, between transaction programs that might give rise to transactions joined by a vulnerable edge within a dangerous structure.

The theory of [9] was extended in [8] to the case where some transactions use SI and others use S2PL (as is possible with Microsoft SQL Server 2005 or Oracle Berkeley DB). Performance studies [2] indicate that modifying applications to ensure serializability under SI can be done without significant cost when the appropriate technique is used.

In [14], a system is described to automate the analysis of program conflicts using syntactic features of program texts, such as the names of the columns accessed in each statement. One important finding of their work is that snapshot isolation anomalies do exist in applications developed using tools and techniques that are common throughout the software industry.

An alternative approach to ensuring correctness when running on platforms with SI is in [4], where conditions are given to ensure all executions preserve given integrity constraints, without necessarily being serializable.

Others have previously suggested ways to alter the SI concurrency control in order to avoid non-serializable executions at run-time. Proposals related to certification through serialization graph testing are in [18] and [21]. These suggestions have not focused on feasibility of implementation within a DBMS. In particular, the space required to represent complete conflict graphs and the overhead required to maintain them may be prohibitive.

3. SERIALIZABLE SNAPSHOT ISOLATION

The essence of our new concurrency control algorithm is to allow standard SI to operate, but to add some book-keeping so we can dynamically detect cases where a non-serializable execution could occur, and then we abort one of the transactions involved. This makes the detection process a delicate balance: if we detect too few cases, some non-serializable execution may emerge (counter to our goal of having true serializability guarantees for the applications), but if we detect too many cases, then performance might suffer as unnecessary aborts waste resources. As a third factor in designing an algorithm, we also need to keep the overhead cost of detection low. One can imagine a concurrency control algorithm which aborts a transaction exactly when an operation will result in a non-serializable execution; this would be a serialization-graph-testing algorithm (using the appropriate multiversion serialization graph). Serialization graph testing, however, requires expensive cycle detection calculations on each operation, and would be very expensive. Thus we accept a small chance of unnecessary aborts, in order to keep the detection overhead low.

The key design decision in our new algorithm is thus the situations in which potential anomalies are detected. We do this based on the theory of [1] and its extension from [9], where some distinctive conflict patterns are shown to appear in every non-serializable execution of SI. The building block for this theory is the notion of a rw-dependency (also called an "anti-dependency"), which occurs from T1 to T2 if T1 reads a version of an item x, and T2 produces a version of x that is later in the version order than the version read by T1. In [1] it was shown that in any non-serializable SI execution, there are two rw-dependency edges in a cycle in the multiversion serialization graph. [9] extended this, to show that there were two rw-dependency edges which form consecutive edges in a cycle, and furthermore, each of these rw-edges involves two transactions that are active concurrently.

Our proposed serializable SI concurrency control algorithm detects a potential non-serializable execution whenever it finds two consecutive rw-dependency edges in the serialization graph, where each of the edges involves two transactions that are active concurrently. Whenever such a situation is detected, one of the transactions will be aborted. To support this algorithm, the DBMS maintains, for each transaction, two boolean flags: T.inConflict indicates whether there is an rw-dependency from another concurrent transaction to T, and T.outConflict indicates whether there is an rw-dependency from T to another concurrent transaction. Thus a potential non-serializability is detected when T.inConflict and T.outConflict are both true.
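A minimal sketch of this flag-based detection follows. It is our illustration only; the actual pseudocode appears in Section 3.1, and the names used here are hypothetical.

    # Minimal sketch of the inConflict/outConflict bookkeeping (illustrative).
    class SSITransaction:
        def __init__(self, name):
            self.name = name
            self.in_conflict = False    # some concurrent txn has an rw-dependency to us
            self.out_conflict = False   # we have an rw-dependency to some concurrent txn
            self.aborted = False

    def record_rw_dependency(reader, writer):
        # Called when an rw-dependency reader -> writer between two concurrent
        # transactions is noticed (by either mechanism described below).
        reader.out_conflict = True
        writer.in_conflict = True
        for txn in (reader, writer):
            # Both flags set: txn sits at the junction of two consecutive
            # vulnerable edges, so it is a potential pivot and a victim is aborted.
            if txn.in_conflict and txn.out_conflict:
                txn.aborted = True

The full algorithm defers part of this check to commit time and chooses the victim more carefully, preferring the pivot unless it has already committed, as described in the rest of this section.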


We note that our algorithm is conservative: if a non-serializable execution occurs, there will be a transaction with both T.inConflict and T.outConflict set. However, we do sometimes make false positive detections; for example, an unnecessary detection may happen because we do not check whether the two rw-dependency edges occur within a cycle. It is also worth mentioning that we do not always abort the particular pivot transaction T for which T.inConflict and T.outConflict are true; this is often chosen as the victim, but sometimes the victim is the transaction that has an rw-dependency edge to T, or the one that is reached by an edge from T.

How can we keep track of situations where there is an rw-dependency between two concurrent transactions? There are two different ways in which we notice such a dependency. One situation arises when a transaction T reads a version of an item x, and the version that it reads (the one which was valid at T's start time) is not the most recent version of x. In this case the writer U of any more recent version of x was active after T started, and so there is a rw-dependency from T to U. When we see this, we set the flags T.outConflict and U.inConflict (and we check for consecutive edges and abort a transaction if needed). This allows us to find rw-dependency edges for which the read occurs in real-time after the write that is logically later. However, it does not account for edges where the read occurs first, and at a later real-time, a version is created by a concurrent transaction.

To notice these other rw-dependency cases, we use a lock management infrastructure. A normal WRITE lock is taken when a new version is created; note that many SI implementations do keep such write-locks anyway, as a way to enforce the First-Committer-Wins rule. We also introduce a new lock mode called SIREAD. This remembers the fact that an SI transaction has read a version of an item. However, obtaining the SIREAD lock does not cause any blocking, even if a WRITE lock is held already, and similarly an existing SIREAD lock does not delay granting of a WRITE lock; instead, the presence of both SIREAD and WRITE locks on an item is a sign of an rw-dependency, and so we set the appropriate inConflict and outConflict flags on the transactions which hold the locks. One difficulty, which we discuss later, is that we need to keep the SIREAD locks that T obtained, even after T is completed, until all transactions concurrent with T have completed.

3.1 The Algorithm

We now present in pseudocode the Serializable SI concurrency control algorithm. The main data structure needed by the algorithm is two boolean flags in each transaction record: T.inConflict indicates whether or not there is a rw-dependency from a concurrent transaction to T, and T.outConflict indicates whether there is a rw-dependency from T to a concurrent transaction. As well, we need a lock manager that keeps both standard WRITE locks, and also special SIREAD locks.
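Before the pseudocode in Figures 5 to 8, the following sketch (ours, with hypothetical names; latching, version checks and lock cleanup are ignored) shows how such a lock table can expose rw-dependencies in both directions:

    # Illustrative lock table with WRITE and non-blocking SIREAD modes.
    from collections import defaultdict

    class Txn:
        # Mirrors the SSITransaction flags sketched earlier.
        def __init__(self, name):
            self.name = name
            self.in_conflict = False
            self.out_conflict = False

    class LockTable:
        # item -> list of (mode, owner); SIREAD entries must be kept until all
        # transactions concurrent with the owner have finished.
        def __init__(self):
            self.locks = defaultdict(list)

        def _mark(self, reader, writer):
            # Record the rw-dependency reader -> writer on both transactions.
            reader.out_conflict = True
            writer.in_conflict = True

        def acquire_siread(self, txn, item):
            # SIREAD never blocks; it only remembers that txn read item.
            self.locks[item].append(("SIREAD", txn))
            for mode, owner in self.locks[item]:
                if mode == "WRITE" and owner is not txn:
                    self._mark(reader=txn, writer=owner)    # txn -> owner

        def acquire_write(self, txn, item):
            # Handling of conflicting WRITE locks (First-Committer-Wins) is the
            # unchanged SI code and is omitted here.
            self.locks[item].append(("WRITE", txn))
            for mode, owner in self.locks[item]:
                if mode == "SIREAD" and owner is not txn:
                    self._mark(reader=owner, writer=txn)    # owner -> txn

Whenever _mark leaves both flags set on one transaction, a victim is aborted as sketched above; the pseudocode below adds the remaining details, including the version check on reads and the rule for which transaction to abort.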


In describing the algorithm, we make some simplifying assumptions:

1. For any data item x, we can efficiently get the list of locks held on x.

2. For any lock l in the system, we can efficiently get l.owner, the transaction object that requested the lock.

3. For any version xt of a data item in the system, we can efficiently get xt.creator, the transaction object that created that version.

4. When finding a version of item x valid at some given timestamp, we can efficiently get the list of other versions of x that have later timestamps.

These assumptions are true for the initial target system (Berkeley DB). We discuss in Section 4.1 how to implement the algorithm if these assumptions do not hold.

The concurrency control layer processes each operation as shown in Figures 5 to 8. In each case, the processing includes the usual processing of the operation by the SI protocol as well as some extra steps. For simplicity, in this description we do not show all the cases where we could check whether to abort T because both T.inConflict and T.outConflict hold; we have written the check once, in the commit(T) operation, and beyond that we only show the extra cases where an abort is done for a transaction that is not the pivot (because the pivot has already committed). In the implementation, we actually abort an active transaction T as soon as any operation of T discovers that both T.inConflict and T.outConflict are true. Likewise, conflicts are not recorded against transactions that have already aborted or that will abort due to both flags being set.

    modified begin(T):
        existing SI code for begin(T)
        set T.inConflict = T.outConflict = false

Figure 5: modified begin(T)

    modified read(T, x):
        get lock(key=x, owner=T, mode=SIREAD)
        if there is a WRITE lock(wl) on x
            set wl.owner.inConflict = true
            set T.outConflict = true

        existing SI code for read(T, x)

        for each version (xNew) of x
                that is newer than what T read:
            if xNew.creator is committed
                    and xNew.creator.outConflict:
                abort(T)
                return UNSAFE_ERROR
            set xNew.creator.inConflict = true
            set T.outConflict = true

Figure 6: modified read(T, x)

    modified write(T, x, xNew):
        get lock(key=x, locker=T, mode=WRITE)
        if there is a SIREAD lock(rl) on x
                with rl.owner is running
                or commit(rl.owner) > begin(T):
            if rl.owner is committed
                    and rl.owner.inConflict:
                abort(T)
                return UNSAFE_ERROR
            set rl.owner.outConflict = true
            set T.inConflict = true

        existing SI code for write(T, x, xNew)
        # do not get WRITE lock again

Figure 7: modified write(T, x, xNew)

    modified commit(T):
        if T.inConflict and T.outConflict:
            abort(T)
            return UNSAFE_ERROR

        existing SI code for commit(T)
        # release WRITE locks held by T
        # but do not release SIREAD locks

Figure 8: modified commit(T)

When a conflict between two transactions leads to both conflict flags being set on either one, without loss of correctness either transaction could be aborted in order to break the cycle and ensure serializability. Our prototype implementation follows the algorithm as described above, and prefers to abort the pivot (the transaction with both incoming and outgoing edges) unless the pivot has already committed. If a cycle contains two pivots, whichever is detected first will be aborted. However, for some workloads, it may be preferable to apply some other policy to the selection of which transaction to abort, analogous to deadlock detection policies. For example, aborting the younger of the two transactions may increase the proportion of complex transactions running to completion. We intend to explore this idea in future work.

For the Serializable SI algorithm, it is important that the engine have access to information about transaction T (its transaction record, including inConflict and outConflict, as well as any SIREAD locks it obtained) even after T has completed. This information must be kept as long as any transaction U is active which overlaps T; that is, we can only remove information about T after the end of every transaction that had already started when T completed. In Section 4 we describe how this information is managed in the Berkeley DB implementation, in particular how the space allocated to transaction objects is reclaimed.

3.2 Correctness

The Serializable SI algorithm ensures that every execution is serializable, and thus that data integrity is preserved (under the assumption that each transaction individually is coded to maintain integrity). This subsection gives the outline of the argument that this is so. By Theorem 2.1 from [9], which shows that in any non-serializable execution there is a dangerous structure, we are done provided that we can establish the following: whenever an execution contains a dangerous structure (transactions TN, a pivot T0, and T1, such that there is a rw-dependency from TN to T0 and TN is concurrent with T0, and also there is a rw-dependency from T0 to T1 and T0 is concurrent with T1), then one of the transactions is aborted. In this situation, we must consider the possibility that TN=T1, which is the classic example of Write Skew.

Our algorithm has an invariant, that whenever the execution has a rw-dependency from T to U, and the transaction record for both T and U exists, then T.outConflict and U.inConflict are both set to true. By definition, the rw-dependency comes from the existence of a read by T that sees some version of x, and a write by U which creates a version of x that is later in the version order than the version read by T.

One of these operations (read(T, x) and write(U, x)) will happen first because the database engine will perform some latching during their execution, and the other will happen later. The rw-dependency is present in the execution once the second of these operations occurs. If this second operation is read(T, x), then at the time that operation is processed, there will already be the version of x created by U; the pseudocode in Figure 6 shows that we explicitly set both flags as required. On the other hand, if the write(U, x) occurs after read(T, x), then at that time T will hold a SIREAD lock on x, and the pseudocode in Figure 7 shows that both flags are set.

Based on the invariant just described, we now must argue that one of the transactions in any dangerous structure is aborted. If both rw-dependencies exist at the time the pivot T0 completes, then the code in Figure 8 will notice that T.inConflict and T.outConflict are set (because of the invariant), and so T will be aborted when it requests to commit. If, however, one or both rw-dependencies appears after the pivot has committed, then we look at the first event in which both dependencies are true; in the pseudocode for this event, the flag for the other dependency will already be set in T2's transaction record, and so the transaction executing this event will be aborted.

In summary, the argument for correctness is as follows:

1. Non-serializable executions under SI consist of a cycle including two consecutive rw-dependencies.

2. Our algorithm detects every rw-dependency.

3. When two consecutive rw-dependencies are detected, at least one transaction is aborted, which breaks the cycle.

The exhaustive testing of the implementation that we describe below in Section 4.2 further supports this argument for the algorithm's correctness.

3.3 False positives

Our algorithm uses a conservative approximation to cycle detection in the graph of transaction conflicts, and as such may cause some benign transactions to abort.

In particular, the interleaving of transactions in Figure 9 will set outConflict on T0 when it executes w0(x) and finds the SIREAD lock from TN. Then inConflict will be set on T0 when T1 executes w1(y) and finds T0's SIREAD lock. During the commit of T0, the two flags will be checked and since both are set, T0 will abort.

[Figure 9: False positive: no path from T1 to TN. The interleaving of bN rN(x) rN(z) cN, b1 w1(y) w1(z) c1 and b0 r0(y) w0(x) c0 in which TN commits before T1 begins while T0 overlaps both]


However, this interleaving is equivalent to the serial history {TN, T0, T1} because TN precedes T1 and hence there is no path of dependencies from T1 to TN.

The issue is that the two flags we have added to each transaction cannot indicate anything about the relative ordering of the incoming and outgoing conflicts. It may be possible to reduce the number of false positives by keeping a reference to the conflicting transactions for each edge rather than a single bit, but in general a transaction may have multiple incoming and outgoing edges and it is not clear whether the overhead of maintaining a more complex data structure would justify the reduction in false positives.

3.4 Detecting phantoms

As described in Section 2.3, a concurrency control algorithm for a relational DBMS must also consider phantoms, where an item is created in one transaction, and is incorrectly not seen in a predicate operation in a concurrent transaction. The solution used in traditional two-phase locking is multigranularity locking [6], where a predicate operation takes intention locks on larger units, such as pages or tables, to prevent insertion of records that might match the predicate.

To prevent phantoms in a system with row-level locking and versioning, the algorithm described here would need to be extended to take SIREAD locks on larger granules analogously to multigranularity intention locks in traditional two-phase locking systems. If a conflicting write operation occurs after the predicate read, this would detect the predicate-rw-conflict between transactions correctly.

In the case where a predicate read is interleaved after the conflicting write operation, such a system would find a data item with a creation or deletion timestamp greater than the read timestamp of the predicate read. In other words, a row will be skipped because no version of the row was visible when the transaction performing the predicate read began, or a row used by the predicate read has since been deleted. Such rows can be used to detect this kind of conflict as described in Figure 6 without further modification.

We have not pursued the details in this paper because the phantom issue does not arise in our prototype implementation, since Oracle Berkeley DB does all locking and versioning at page granularity.

4. IMPLEMENTATION

The algorithm described above was implemented in Oracle Berkeley DB version 4.6.21 [16]. Berkeley DB is an embedded database that supports SI as well as serializable isolation with S2PL. Locking and multi-version concurrency control are performed at the granularity of database pages. This can introduce unnecessary conflicts between concurrent transactions, but means that straightforward read and write locking is sufficient to prevent phantoms.

We added the following to Berkeley DB:

1. New error returns DB_SNAPSHOT_CONFLICT, to distinguish between deadlocks and update conflicts, and DB_SNAPSHOT_UNSAFE, to indicate that committing a transaction at SI could lead to a non-serializable execution.

2. A new lock mode, SIREAD, that does not conflict with any other lock modes. In particular, it does not introduce any blocking with conflicting WRITE locks. The code that previously avoided locking for snapshot isolation reads was modified to instead get an SIREAD lock.

3. Code to clean old SIREAD locks from the lock table. This code finds the earliest read timestamp of all active transactions, and removes from the lock table any locks whose owning transactions have earlier commit timestamps. The cleanup code is executed if the lock table becomes full (a complete sweep of all lock objects), and also whenever a request for a WRITE lock finds an old SIREAD lock, in order to spread out the work of cleaning up old locks.

4. When a WRITE lock request for transaction T finds an SIREAD lock already held by T, the SIREAD lock is discarded before the WRITE lock is granted. This is done for two reasons: firstly, whenever possible we would prefer to return an error indicating an update conflict if that is the primary cause of the failure. Secondly, there is no need to hold those SIREAD locks after the transaction commits: the new version of the data item that T creates will cause an update conflict with concurrent writers.

Making these changes to Berkeley DB involved only modest changes to the source code. In total, only 692 lines of code (LOC) were modified out of a total of over 200,000 lines of code in Berkeley DB. Approximately 40% (276 LOC) of the changes related to detecting lock conflicts and a further 17% (119 LOC) related to cleaning obsolete locks from the lock table. Of the code comprising the locking subsystem of Berkeley DB, 3% of the existing code was modified and the total size increased by 10%.

4.1 Generalizing to other database engines

In Berkeley DB, it is reasonable for transaction objects and locks to remain in the system for some time after commit. In fact, transaction objects already have a reference count for the existing implementation of SI, and transaction objects are deleted when the reference count goes to zero after the transaction commits.

If this were not the case, we would need to maintain a table containing the following information: for each transaction ID, the begin and commit timestamps together with an inConflict flag and an outConflict flag. For example:

    txnID   beginTime   commitTime   inConf   outConf
    100     1000        1100         N        Y
    101     1000        1500         N        N
    102     1200        N/A          Y        N

Rows can be removed from the table when the commit time of a transaction is earlier than the begin times of all active transactions. In this case, only transaction 102 is still running (since its commit timestamp is not set). The row for transaction 100 can be deleted, since it committed before the only running transaction, 102, began.

Likewise, a table can be constructed to track SIREAD locks, if the lock manager in the DBMS cannot easily be modified to keep locks for a transaction after it completes. Rows in the lock table become obsolete when the owning transaction becomes obsolete, that is, when all concurrent transactions have completed.
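The pruning rule just described can be sketched as follows (our illustration; the table layout mirrors the example above):

    # Illustrative pruning of the per-transaction table: a committed transaction's
    # row can be dropped once every transaction concurrent with it has finished.
    def prune(txn_table):
        # txn_table maps txnID -> dict with beginTime, commitTime (None if still
        # active), inConflict and outConflict.
        active_begins = [row["beginTime"] for row in txn_table.values()
                         if row["commitTime"] is None]
        if not active_begins:
            return txn_table          # nothing active: conservatively keep everything
        oldest_active_begin = min(active_begins)
        return {tid: row for tid, row in txn_table.items()
                if row["commitTime"] is None
                or row["commitTime"] >= oldest_active_begin}

    table = {
        100: {"beginTime": 1000, "commitTime": 1100, "inConflict": False, "outConflict": True},
        101: {"beginTime": 1000, "commitTime": 1500, "inConflict": False, "outConflict": False},
        102: {"beginTime": 1200, "commitTime": None, "inConflict": True,  "outConflict": False},
    }
    # Transaction 100 committed before the only active transaction (102) began,
    # so its row is removed; 101 overlaps 102 and must be kept.
    print(sorted(prune(table)))   # [101, 102]

An analogous rule retires rows of the separate SIREAD-lock table once the owning transaction's row has been removed.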


4.2 Testing

We have also done an exhaustive analysis of the implementation, by testing it with all possible interleavings of some set of transactions known to cause write skew anomalies. For example, one test set was as follows:

    T1: b1 r1(x) c1
    T2: b2 r2(y) w2(x) c2
    T3: b3 w3(y) c3

The implementation was tested by generating the test code, where each test case was a different interleaving of a given set of transactions. In all cases where the transactions were executed concurrently, one of the transactions aborted with the new "unsafe" error return. These results were manually checked to verify that no non-serializable executions were permitted, although all interleavings committed without error at SI.
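The interleavings themselves can be enumerated mechanically. The sketch below is our own illustration (not the actual test generator): it produces every schedule of the three transactions above that preserves each transaction's internal operation order.

    T1 = ("b1", "r1(x)", "c1")
    T2 = ("b2", "r2(y)", "w2(x)", "c2")
    T3 = ("b3", "w3(y)", "c3")

    def interleavings(txns, prefix=()):
        # Yield every total order of the operations that keeps each
        # transaction's own operations in program order.
        if all(len(t) == 0 for t in txns):
            yield prefix
            return
        for i, t in enumerate(txns):
            if t:
                rest = txns[:i] + (t[1:],) + txns[i + 1:]
                yield from interleavings(rest, prefix + (t[0],))

    schedules = list(interleavings((T1, T2, T3)))
    print(len(schedules))   # 10!/(3!*4!*3!) = 4200 candidate schedules

Each generated schedule is then executed against the engine and its outcome checked, which is how the results reported above were obtained.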

5. EVALUATION

To evaluate the effects of making SI serializable, we need a benchmark that is not already serializable under SI. The SmallBank benchmark [2] was designed to model a simple banking application involving checking and savings accounts, with transaction types for balance (Bal), deposit-checking (DC), withdraw-from-checking (WC), transfer-to-savings (TS) and amalgamate (Amg) operations. Each of the transaction types involves a small number of simple read and update operations. The static dependency graph for SmallBank is given in Figure 10, where the double arrows represent write-write conflicts and the dashed arrows represent read-write conflicts. It can be seen by inspection that there is a dangerous structure Balance → WriteCheck → TransactSavings → Balance, so the transaction WriteCheck is a pivot.

[Figure 10: Static dependency graph for the SmallBank benchmark, relating the transaction types Bal, Amg, DC, TS and WC]

5.1 Evaluation Setup

Sullivan [19] designed and implemented a tool called db_perf that can execute and measure arbitrary workloads against Berkeley DB. Using db_perf, we simulated the access patterns of the SmallBank benchmark. We compared the built-in S2PL and SI isolation levels in Berkeley DB with our implementation of the new Serializable SI algorithm.

The five transaction types of SmallBank were executed using db_perf in a uniform random mix of the different transaction types. There was no sleep or think time: each thread executed transactions continually, as quickly as Berkeley DB could process them.

A single set of Berkeley DB binaries was used for all measurements, with parameters to control the isolation level, the number of threads (the multi-programming level, or MPL), the data volume, the duration of each run and various Berkeley DB parameters such as the page size and deadlock detection policy.

The experiments were run on an AMD Athlon64 3200+ CPU with 1GB RAM running openSUSE Linux 10.2 with kernel version 2.6.18, glibc version 2.5 and GCC version 4.1.2. All data was stored using the XFS filesystem on a set of four Western Digital Caviar SE 200 GB SATA hard disks using software RAID5. All graphs include 95% confidence intervals.

5.2 Performance with Short Transactions

Results are given in Figure 11 for measurements where the system was configured so that commit operations do not wait for a physical disk write before completing. This configuration is common on high-end storage systems with redundant power sources or in solid state drives based on Flash memory.

[Figure 11: Results without flushing the log during commit. (a) Relative throughput of SI, S2PL and Serializable SI without log flushes (TPS vs. MPL); (b) relative error rates of SI, S2PL and Serializable SI at MPL 20 without log flushes (deadlocks, update conflicts and unsafe errors)]


A small data size was configured here to model moderate contention: the savings and checking tables both consisted of approximately 100 leaf pages. All data fitted in cache.

In this configuration, transaction durations are very short, with response times typically under 1ms. When there is no contention (such as in the MPL=1 case), the CPU was 100% busy throughout the tests.

It can be seen that in this configuration, Serializable SI performs significantly better than S2PL (by a factor of 10 at MPL 20). This is due to blocking in S2PL between read and write operations, and also because the conflicts in S2PL are found via deadlock detection, which introduces further delays.

Figure 11(b) shows that Serializable SI has a slightly higher total rate of aborts than either S2PL or SI at MPL 20. Interestingly, a high proportion of errors are reported as "unsafe" errors rather than update conflicts. This is because in this benchmark, several transactions execute a read operation followed by a write. In between those two operations, a rw-conflict can be detected, leading to an unsafe error.

5.3 Performance with Long Transactions

Figure 12 shows results for the same set of experiments, changing only the commit operations to wait for a physical write to disk. This significantly increased the duration of transactions, increasing response times by at least an order of magnitude to 10-20ms. In this configuration, I/O rather than the CPU is the bottleneck at MPL 1, and throughput increases as MPL is increased for all isolation levels, because increasing numbers of transactions can be committed for each I/O operation due to group commit.

Up to MPL 5, there is little to separate the three concurrency control algorithms, but at MPL 10, the rate of deadlocks at S2PL begins to have an impact on its throughput. The error rate at Serializable SI is significantly higher with longer duration transactions, due to an increased number of rw-conflicts. However, since this test is primarily I/O bound and transactions are simple, the impact on throughput compared with SI is small.

[Figure 12: Results when the log is flushed during commit. (a) Relative throughput of SI, S2PL and Serializable SI with log flushes (TPS vs. MPL); (b) relative error rates of SI, S2PL and Serializable SI at MPL 20 with log flushes]

5.4 Overhead Evaluation

The results so far have concentrated on experiments with high contention in order to highlight the differences between the isolation levels. In Figure 13, we present results from running the benchmark including log flushes with a data set ten times larger. The increased volume of data results in far fewer conflicts between transactions but still fits in RAM, so the only I/O operations are log writes. For S2PL, this produces less blocking, and under SI and Serializable SI the rates of update conflicts are also lower than in the earlier experiments. In this case, the performance of S2PL and SI are almost identical, and we can see the overhead in the Serializable SI implementation at between 10-15%.

[Figure 13: Throughput with low contention (TPS vs. MPL for SI, SSI and S2PL)]


This overhead is due in part to CPU overhead from managing the larger lock and transaction tables, but primarily to a higher abort rate due to false positives with Serializable SI. One issue in particular contributed to the false positive rate. As mentioned earlier, Berkeley DB performs locking and versioning at page-level granularity. As a consequence, our Serializable SI implementation also detects rw-dependencies at page-level granularity. All of our experiments use Berkeley DB's Btree access method, so whenever any transaction needs to update the Btree root page (for example, as a result of a page split), it will register a conflict with every concurrent transaction, as all transaction types need to read the root page. With standard SI, concurrent transactions simply read the older version of the root page, and with S2PL, concurrent transactions are blocked temporarily but not aborted unless a deadlock occurs. These conflicts and the resulting higher abort rate would not occur in a record-level implementation of Serializable SI.

6. CONCLUSIONS AND FUTURE WORK

This paper presents a new method for implementing serializable isolation based on a modification of snapshot isolation. A prototype of the algorithm has been implemented in Oracle Berkeley DB and shown to perform significantly better than two-phase locking in a variety of cases, and often comparably with snapshot isolation.

One property of Berkeley DB that simplified our implementation was working with page level locking and versioning. In databases that version and lock at row-level granularity (or finer), additional effort would be required to avoid phantoms, analogous to standard two-phase locking approaches such as multigranularity locking.

The Serializable SI algorithm is conservative, and in some cases leads to significantly higher abort rates than SI. One observation about SI anomalies is that no cycle can exist if the transaction causing the incoming anomaly precedes the transaction causing the outgoing anomaly. We intend to investigate whether this observation can lead to a more efficient version of the algorithm that has fewer false positives.

7. REPEATABILITY ASSESSMENT RESULT

Figures 11a and 13 have been verified by the SIGMOD repeatability committee. Code and/or data used in the paper are available at http://www.sigmod.org/codearchive/sigmod2008/.

8. REFERENCES

[1] A. Adya. Weak Consistency: A Generalized Theory and Optimistic Implementations for Distributed Transactions. PhD thesis, Laboratory for Computer Science, Massachusetts Institute of Technology, March 1999.

[2] M. Alomari, M. Cahill, A. Fekete, and U. Röhm. The cost of serializability on platforms that use snapshot isolation. In ICDE '08: Proceedings of the 24th International Conference on Data Engineering, 2008.

[3] H. Berenson, P. Bernstein, J. Gray, J. Melton, E. O'Neil, and P. O'Neil. A critique of ANSI SQL isolation levels. In Proceedings of ACM SIGMOD International Conference on Management of Data, pages 1–10. ACM Press, June 1995.

[4] A. Bernstein, P. Lewis, and S. Lu. Semantic conditions for correctness at different isolation levels. In Proceedings of IEEE International Conference on Data Engineering, pages 57–66. IEEE, February 2000.

[5] P. A. Bernstein and N. Goodman. Multiversion concurrency control - theory and algorithms. ACM Trans. Database Syst., 8(4):465–483, 1983.

[6] K. P. Eswaran, J. Gray, R. A. Lorie, and I. L. Traiger. The notions of consistency and predicate locks in a database system. Commun. ACM, 19(11):624–633, 1976.

[7] A. Fekete. Serializability and snapshot isolation. In Proceedings of Australian Database Conference, pages 201–210. Australian Computer Society, January 1999.

[8] A. Fekete. Allocating isolation levels to transactions. In PODS, 2005.

[9] A. Fekete, D. Liarokapis, E. O'Neil, P. O'Neil, and D. Shasha. Making snapshot isolation serializable. ACM Transactions on Database Systems, to appear.

[10] A. Fekete, E. O'Neil, and P. O'Neil. A read-only transaction anomaly under snapshot isolation. SIGMOD Rec., 33(3):12–14, 2004.

[11] J. Gray and A. Reuter. Transaction Processing: Concepts and Techniques. Morgan Kaufmann, 1993.

[12] T. Hadzilacos. Serialization graph algorithms for multiversion concurrency control. In PODS, pages 135–141, 1988.

[13] K. Jacobs, R. Bamford, G. Doherty, K. Haas, M. Holt, F. Putzolu, and B. Quigley. Concurrency control, transaction isolation and serializability in SQL92 and Oracle7. Oracle White Paper, Part No A33745, 1995.

[14] S. Jorwekar, A. Fekete, K. Ramamritham, and S. Sudarshan. Automating the detection of snapshot isolation anomalies. In C. Koch, J. Gehrke, M. N. Garofalakis, D. Srivastava, K. Aberer, A. Deshpande, D. Florescu, C. Y. Chan, V. Ganti, C.-C. Kanne, W. Klas, and E. J. Neuhold, editors, VLDB, pages 1263–1274. ACM, 2007.

[15] H. T. Kung and J. T. Robinson. On optimistic methods for concurrency control. In A. L. Furtado and H. L. Morgan, editors, VLDB, page 351. IEEE Computer Society, 1979.

[16] M. A. Olson, K. Bostic, and M. I. Seltzer. Berkeley DB. In USENIX Annual Technical Conference, FREENIX Track, pages 183–191, 1999.

[17] Y. Raz. Commitment ordering based distributed concurrency control for bridging single and multi version resources. In Proceedings of Third International Workshop on Research Issues in Data Engineering: Interoperability in Multidatabase Systems (RIDE-IMS), pages 189–198. IEEE, June 1993.

[18] V. T.-S. Shi and W. Perrizo. A new method for concurrency control in centralized database systems. In R. E. Gantenbein and S. Y. Shin, editors, Computers and Their Applications, pages 184–187. ISCA, 2002.

[19] D. G. Sullivan. Using Probabilistic Reasoning to Automate Software Tuning. PhD thesis, Harvard University, Cambridge, MA, USA, 2003. Adviser: Margo I. Seltzer.

[20] Transaction Processing Performance Council. TPC-C Benchmark Specification. http://www.tpc.org/tpcc, 2005.

[21] Y. Yang. The adaptive serializable snapshot isolation protocol for managing database transactions. Master's thesis, University of Wollongong, NSW Australia, 2007.
