One of the biggest challenges of building distributed systems is dealing with failure. In this talk we'll explore how distributed systems fail and then once we're good and scared, we'll cover a number of approaches and tools to help you deal with failure.