Back to Resources

Storage Engine Research Resources

From theoretical foundations to practice

Theoretical Resources

Scalable and Robust Snapshot Isolation for High-Performance Storage Engines
Adnan Alhomssi and Viktor Leis, 2023 Intermediate
4.0
A study on scalable and robust snapshot isolation techniques for high-performance storage engines.
Download PDF
ARIES: A transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging
C. Mohan et al., 1992 Intermediate
5.0
A seminal paper on transaction recovery using write-ahead logging.
Download PDF
Staring into the Abyss: An Evaluation of Concurrency Control with One Thousand Cores
Xiangyao Yu et al., 2014 Intermediate
5.0
In particular, as the number of cores increases, the problem of concurrency control becomes extremely challenging. With hundreds of threads running in parallel, the complexity of coordinating competing accesses to data will likely diminish the gains from increased core counts.
Download PDF
Avoiding Scalability Collapse by Restricting Concurrency
Dave Dice et al., 2019 Intermediate
5.0
Saturated locks often degrade the performance of a multithreaded application, leading to a so-called scalability collapse problem.
Download PDF
Rethinking Logging, Checkpoints, and Recovery for High-Performance Storage Engines
Michael Haubenschild et al., 2020 Intermediate
5.0
In this work, we propose a new logging and recovery design that supports incremental and fuzzy checkpointing, index recovery, out-of-memory workloads, and low-latency transaction commits.
Download PDF
Rethink Data Anomalies In Databases
Haixiang Li et al., 2021 Advanced
5.0
A general framework called Coo with the capability to formally define all data anomalies and quantitatively describe them.
Download PDF

Practical Resources

Deep Dive: InnoDB Transactions and Write Paths
2018 Beginner
5.0
InnoDB transactions and write paths From the client connection to physical storage.
Download PDF
Scalability of write-ahead logging on multicore and multisocket hardware
Johnson et al., 2012 Intermediate
5.0
A study on the scalability of write-ahead logging in modern hardware environments.
Download PDF
Building a Database on S3
Matthias Brantner et al., 2008 Intermediate
5.0
The purpose of this paper is to demonstrate the opportunities and limitations of using S3 as a storage system for general-purpose database applications which involve small objects and frequent updates.
Download PDF
Diving Deep on S3 Consistency
2021 Beginner
5.0
We leveraged that experience to add strong consistency at the high availability that S3's customers have come to appreciate.
View Article
PolarDB Serverless: A Cloud Native Database for Disaggregated Data Centers
Wei Cao et al., 2021 Intermediate
5.0
In this paper, we describe the novel architecture of PolarDB Serverless, which follows the disaggregation design paradigm: the CPU resource on compute nodes is decoupled from remote memory pool and storage pool. Each resource pool grows or shrinks independently, providing on-demand provisoning at multiple dimensions while improving reliability.
Download PDF
LavaStore: ByteDance's Purpose-built, High-performance, Cost-effective Local Storage Engine for Cloud Services
Hao Wang et al., 2024 Advanced
5.0
Presents the design and implementation of LavaStore, a high-performance cost-effective local storage engine purpose-built for ByteDance's needs.
Download PDF
High Performance Transactions via Early Write Visibility
Jose M. Faleiro et al., 2017 Advanced
5.0
This delayed write visibility can signi?cantly impact the performance of serializable database systems by reducing concurrency among con?icting transactions.
Download PDF
Contention-Aware Lock Scheduling for Transactional Databases
Hao Wang et al., 2018 Advanced
4.0
Contention-aware lock scheduling for transactional databases.
Download PDF