This is an introduction to relational and non-relational database management systems. We will learn how to query database systems via languages such as SQL (the structured query language). Then, we will see how database systems work internally, how they store and index data, how they process and optimize queries, and how they process transactions while providing guarantees such as isolation, atomicity, and durability (ACID guarantees).
We will address questions of database design and discuss novel approaches to data management via NoSQL or NewSQL systems. Finally, we will discuss about graph, stream, and spatial data, and about systems that are specialized for those data types. The following recordings are taken from the Fall 2020 online lecture at Cornell by Prof. Immanuel Trummer (www.itrummer.org).
Click on the following topics to access videos and slides:
|Intro to Query Processing; Buffer Management; Slides|
|Estimating Processing Costs; Simple Nested-Loops Join Algorithms; Slides|
|Advanced Join Algorithms: Hash Join, Sort-Merge Join; Slides|
|Join Wrap Up; Other Relational Operators; Connecting Operators; Slides|
|Query Optimization; Cost and Cardinality Estimation; Slides|
|Plan Enumeration; Query Optimization Demo in Postgres; Slides|
About the first two thirds of the course are based on the textbook "Database Management Systems" by Raghu Ramakrishnan and Johannes Gehrke. The last third of the course discusses recently proposed systems, more details about them can be found in the corresponding research papers.