Reliability and Performance Challenges at Cloud Scale
Date and Time
Friday, March 3, 2017 - 12:30pm to 1:30pm
Location
Computer Science Small Auditorium (Room 105)
Type
CS Department Colloquium Series
Speaker
Asaf Cidon
Host
Prof. Kyle Jamieson
Cloud computing is quickly becoming the backbone for critical societal applications, such as self-driving cars, telemedicine, and the Internet of Things. However, the unprecedented scale of cloud infrastructure, with millions of servers spread across hundreds of data centers, introduces fundamental challenges that past computer systems did not face. In my talk, I will present two systems, Copysets and Cliffhanger, which take an analytical approach to tackling novel challenges introduced by the scale of the cloud. Copysets is a replication framework that reduces the probability of data loss by a factor of more than 10,000 in the common scenario of simultaneous server failures. Cliffhanger is a key-value cache that dynamically adapts to changing cloud application workloads, reducing the number of cache misses by 35% or more. Both systems exemplify my research approach: building practical systems that tackle novel, previously unstudied problems introduced by the scale of the cloud.
Asaf Cidon is the Vice President, Content Security Services at Barracuda Networks. He also currently leads the cloud caching research project at Stanford. Asaf completed his PhD at Stanford under Mendel Rosenblum and Sachin Katti. His research on cloud and mobile systems has been published at NSDI and SIGCOMM, and received the Best Student Paper Award at USENIX ATC. Asaf was also the founder and CEO of Sookasa, a cloud storage security startup that was acquired by Barracuda Networks. His research focuses on how to provide reliability and performance guarantees in large-scale cloud environments, and has been adopted by several companies, including Facebook, NetApp, and Chartbeat.