ECE/CS 6960/5960 Fundamentals of Cloud Systems

Project Ideas

Lecture 1 (01/08/2019) (slides, syllabus)

  1. What is cloud computing

  2. Examples on cloud computing (Amazon Web Services (AWS))

  3. What we will/won't cover

  4. Grading Polices

Part I: Cloud Computing

Lecture 2 (01/10/2019) (Lecture notes, Lecture Scribe 1, Lecture Scribe 2)

  1. MapReduce and Spark

  2. Coded MapReduce

Lecture 3 (01/15/2019) (Lecture notes, Lecture Scribe 1, Lecture Sribe 2)

  1. Coded Distributed Computing (CDC) and its constraints

Lecture 4 (01/17/2019) (Lecture notes part 1, Lecture notes part 2, Lecture Scribe 1, Lecture Scribe 2)

  1. General Scheme for CDC

  2. Coded Distributed Computing (CDC) with a hypercube design

  3. Cascaded Coded Distributed Computing

Lecture 5 (01/21/2019) (Slides)

  1. Overview of error control codes

Lecture 6 (01/23/2019) (Lecture notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Straggler Mitigation using Codes

    1. MDS Codes

    2. Prouct Codes

    3. Polynomial Codes

Lecture 7 (01/28/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Straggler Mitigation using Codes

    1. Polynomial Codes

    2. MatDot Codes

Lecture 8 (01/31/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Straggler Mitigation using Codes

    1. Polydot Codes

    2. Entangled Polynomial Codes/Generalized Polydot Codes

  2. Overview of Probability and Random Processes

Lecture 9 (02/05/2019) (Slides Probablity and Random Processes Overview, Slides)

  1. Overview of Probability and Random Processes

  2. Using Efficient Redundancy to Reduce Latency in Cloud Systems

Lecture 10 (02/07/2019) (Slides)

  1. Using Efficient Redundancy to Reduce Latency in Cloud Systems

No Class on 02/12 due to ITA

Lecture 11 (02/14/2019) (Slides)

  1. Overview of Queueing Theory

Lecture 12 (02/19/2019) (Slides)

  1. Tasks Assignment Policies in Server Farms

Lecture 13 (02/21/2019) (Slides)

  1. Using Efficient Redundancy of Queued Tasks to Reduce Latency and Computing Cost in Cloud Systems

Lecture 14 (02/26/2019) (Slides)

  1. Overview of Lyapunov Optimizaiton Framework

Lecture 15 (02/28/2019) (Slides)

  1. Optimal Dynamic Cloud Network Control

Part II: Distributed Storage Theory

Lecture 16 (03/05/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Introduction to Distributed Storage Theory

  2. Network Coding Theory

Lecture 17 (03/07/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Information Flow Graph and Applications

Lecture 18 (03/19/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Storage-Bandwidth Tradeoff

    1. Proofs

    2. Minimum Bandwidth Regenerating (MBR) codes

    3. Minimum Storage Regerating (MSR) codes

Lecture 19 (03/21/2019) (Lecture Notes, guest lecture by Nicholas Woolsey , Lecture Scribe 1, Lecture Scribe 2)

  1. Private Information Retrieval

Lecture 20 (03/26/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2, Lecture Scribe 3)

  1. Exact Repair: Minimum Bandwidth Regenerating Codes

    1. Fractional Repetition Code

Lecture 21 (03/28/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Exact Repair: Minimum Bandwidth Regenerating Codes

    1. DRESS code

    2. Product-Matrix Code

Lecture 22 (04/02/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Exact Repair: Minimum Storage Regenerating Codes

    1. Product-Matrix Code

    2. Exact Repair MDS Codes using Interference Alignment

    3. Piggyback-RS Codes

  2. Locally Repairable Codes

Part III: Distributed Machine Learning

Lecture 23 (04/04/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Distributed Stochastic Gradient Descent with Mini-batches

    1. Parameter Server-Worker architecture

    2. Convergence Analysis

Lecture 24 (04/09/2019) (Lecture Notes, Lecture Scribe 1)

  1. Distributed Stochastic Gradient Descent with Mini-batches

    1. Gradient Diversity and applications

  2. Asynchronous Parallel Sparse Gradient Descent

    1. Problem Formulation

    2. Examples

    3. Hogwild

Lecture 25 (04/11/2019) (Lecture Notes, Lecture notes (Asyn. Mini-batch SGD (didn't cover in class)), Lecture Scribe 1, Lecture Scribe 2)

  1. Asynchronous Parallel Sparse Gradient Descent

    1. Hogwild and convergence analysis

  2. Asynchronous Mini-batch SGD (didn't cover in class)

Lecture 26 (04/16/2019) (Slides)

  1. Federated Learning

    1. System Design

    2. Detailed Implementations using Raspberry Pi's and Sochket Programming

Lecture 27 (04/18/2019) (Lecture Notes, Lecture Notes from Xiang Zhang)

  1. Federated Learning

    1. Convergence Analysis

Lecture 28 (04/23/2019) (Lecture Notes, Lecture Scribe 1, Lecture Scribe 2)

  1. Communication Bottleneck and Gradient Quantization

Downloads and Useful Links

  1. Syllabus

  2. Lecture Scribe Template

  3. Google doc sheet

  4. Paper List

  5. Teaching