CS 561

Data Systems Architectures


Class at a glance

Class: Tue/Thu 11:00am-12:15pm (CDS 364)
Instructor: Manos Athanassoulis 

Lab: Fri 1:25-2:15pm (MCS B33)
Teaching Fellow: Teona Bagashvili 

Office: CDS 928
Office Hours: Posted on Piazza

Discussion on Piazza / Grades on Gradescope
TF Office Hours: Posted on Piazza

Announcements

  • Semester starts on Jan 21 - stay tuned for updates.


Class Milestones - Important Dates

Keep in mind the Official Semester Dates.

  • January 31, submit project 0
  • February 14, submit project 1
  • February 23, submit your project proposal
  • February 25, last day to drop (without a "W")
  • March 3 - March 7, meet with your assigned mentor (graded)
  • March 6, Student-led discussion
  • March 22, submit your mid-semester progress report
  • March 27, Student-led discussion
  • March 31 - April 4, meet with your assigned mentor (graded)
  • April 4, last day to drop (with a "W")
  • April 8, Student-led discussion
  • April 26, final submission of project code and report


Class Schedule (tentative)

Here you can find the tentative schedule of the class (which might change as the semester progresses).

Class : Introduction to Data Systems and CS561

Readings

Class : Data Systems Architectures Essentials – Part 1

Readings

Class : Data Systems Architectures Essentials – Part 2

Readings

Class : LSM intro and Class Project Overview

Readings

A. Storage Layouts

Class : Row-Stores vs. Column-Stores

Readings

Class : Log-Structured Merge (LSM) Trees

Readings

Class : Scans in Key-Value Stores

Readings

B. Indexing

Class : Introduction to Indexing Design

Readings

Class : The design space of data structures

Readings

Class : Guest Lecture on Robust and Learned Tuning: Andy Huynh

Readings

Class : Guest Lecture on Sortedness-Aware Indexing: Aneesh Raman

Readings

Class : Adaptive Radix Trees

Readings

Class : Bitmap indexing (Student Discussion)

Readings

C. Modern Hardware

Class : Modern hardware trends

Readings

Class : Guest Lecture on SSD Design Elements: Teona Bagashvili

Readings

Class : ACE Storage Management

Readings

Class : Serverless Computing (Student Discussion)

Readings

D. Query Processing

Class : Skew-aware Optimal Joins

Readings

Class : Guest Lecture: TBD

Class : BMI-based Query Optimization (Student Discussion)

Readings

Class : Guest Lecture on OSDB: George Neville-Neil

E. ML For Data Systems

Class : ML for Systems and Learned Query Evaluation

Readings

Class : Guest Lecture on Cloud Data Lakes: Andrew Lamb

Class : Learned Indexes

Readings

Class : Exam

Project Presentation

Class : Project Presentations A

Project Presentation - I

Class : Project Presentations B

Project Presentation - II

Project Awards (by popular vote)

Awards

  • Most Engaging Presentation: “Benchmark Compression With Near Sortedness” by Harshitha Tumkur Kailasa Murthy, Vishwas Bhaktavatsala
  • Project with Highest Technical Depth: “Query-Driven Compaction in LSM-Trees” by Karatsenidis Konstantinos, Shubham Kaushik, Nishil Agrawal
  • Best Overall Project: “Range Deletes in LSM-Trees” by Jingyi Li, Ming-Han Hsieh, Yu-Cheng Huang
  • Honorable Mention: “Exploring the Performance of Data Compression Algorithms with Varying Data Sortedness” by Shivangi and Vani Singhal