Classes will be held on Zoom - check your inbox for the meeting URL.
Date | Topic | Presenter | Papers |
---|---|---|---|
18/5 | Introduction | Jonathan - class introduction Roberta - MapReduce Vaastav - GFS |
|
25/5 | Big Data | (Guido) - RDDs Thomas - Naiad |
|
1/6 | Data Stores | No class due to public holiday but reviews are still due! | |
8/6 | Microservices | (Reyhaneh) - Bigtable Marlene - Benchmark |
|
15/6 | Orchestration | Mei - Borg Mazen - Autopilot |
|
22/6 | Serverless 1 | Guido - OpenLambda Safya - Shuffling |
|
29/6 | Serverless 2 | Gerd - Serverless Forward, Back Jannis - Serverless in the Wild |
|
6/7 | Workloads & Performance | Sepehr - Resource Central Thomas - Performance Analysis |
|
13/7 | Tracing | Vaastav - Pivot Tracing Gerd - Canopy |
|
20/7 | Machine Learning | Kevin - Parameter Server Sepehr - Ray |
|
27/7 | Geo-Distribution | Reyhaneh - Spanner Pengqiu - TAO |
|
3/8 | When Hardware Changes | Mehrshad - Disaggregation Ahmed - KVell |
Before you start reading papers and writing reviews, take a look at the following resources:
Paper reviews are due 11:59pm the evening before class. Some papers are optional and do not require a review (but are highly recommended reading!).
The URL next to each paper will take you to the review website. You'll find a PDF of each paper on the review website. The review form provides some guidance on writing a paper review.
Each paper is presented by a student, who then also leads the in-class discussion.
In total you will be expected to present once or twice throughout the semester.
We will spend approximately 40 minutes for each paper: about 15 minutes each for the presentation and discussion, with some extra time for wiggle room.
If you are assigned to present, then you do not need to submit reviews for that week's papers.
Your presentation is an oral presentation - no powerpoint slides!
Your presentation should last approximately 15 minutes. You should not take longer than this unless people are asking questions during the presentation.
Try to structure your presentation around the following questions:
Notes. You should make bulleted notes to go along with your presentation that you share with the class ahead of time. Your bullet points should be a guide to your oral presentation, that everyone can use to follow along with. Don't include figures - everybody will have a copy of the paper with them, so you can just refer to figures directly in the paper.
Structure. If you're having difficulty structuring your presentation, a good starting point is to search online for existing presentations. For many papers you can find the original conference presentation of the paper. You can see how the paper's authors structured their presentation, and you are allowed to use this to structure your own presentation.
Discussion. At the end of your presentation you should provide your personal opinions about the paper. This is the starting point for the discussion. You should also prepare several open-ended questions that you can pose to the class to drive the discussion.
How do systems like Google and Facebook actually work? What does it take to build a planet-scale system? What can we do today using cloud computing that wasn't possible a decade ago?
In this course we will look at the research behind modern cloud and datacenter systems. We will read recent research papers in operating systems, computer networks, and distributed systems. We will look at modern application trends including big data analytics, microservices, serverless architectures, and machine learning. Along the way we will learn how to design, implement, and evaluate robust, effective, and efficient systems.
The course is intended for Masters and Ph.D. students in Computer Science, but enterprising Bachelors students are also welcome to participate.
There are no formal prerequisites beyond a basic knowledge of how computer systems (operating systems, databases, networks, distributed systems) work internally.
To register for the seminar you must apply by e-mailing the instructor (Jonathan Mace), explaining why you wish to take the seminar and what your background in computer systems is. Places in the seminar are limited. If you wish to take the seminar without credit, please say so in your e-mail.
The class has two main components:
Grading for the class will be 50% paper reviews, 25% presentation, and 25% class participation.
When you present, no reviews are required for that week's papers. You may also skip 4 reviews, no questions asked.