A list of papers I have read:
- Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores
- Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
- WiscKey: Separating Keys from Values in SSD-conscious Storage
- MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph
- PebblesDB: Building Key-Value Stores using Fragmented Log-Structured Merge Trees
- Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store
- What’s Really New with NewSQL?
- Spanner: Google’s Globally-Distributed Database
- Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
- Time, Clocks, and the Ordering of Events in a Distributed System
- The Google File System
- MapReduce: Simplified Data Processing on Large Clusters
- Bigtable: A Distributed Storage System for Structured Data
- The Design of a Practical System for Fault-Tolerant Virtual Machines
- In Search of an Understandable Consensus Algorithm (Extended Version)
- ZooKeeper: Wait-free coordination for Internet-scale systems
- Strong and Efficient Consistency with Consistency-Aware Durability