Failing With Grace
One of the biggest challenges of building distributed systems is dealing with failure. In this talk, we'll explore how distributed systems fail and then, once we're good and scared, we'll cover a number of approaches and tools to help you deal with failure.
Thinking about jumping on board the microservices train? Need to scale your system past a few hosts?
Awesome! You’re about to embark on a great adventure.
As part of that adventure, you will face a great many challenges, not the least of which is failure. Failed hosts, failed networks, inconsistent views of the world: these and more become things you now have to deal with and plan for.
In this talk, we’ll explore some of the most common forms of failure you will face and then from there, we’ll go over a number of approaches and tools that allow you to deal with failure in a graceful and understandable way.
distributed systems, microservices, NSQ, go, python, messaging, async, failure
Sean has spoken at a number of conferences and meetups including Ricon, PyGotham, DevopsDays, CodeMash and more.
You can view an earlier version of this talk at http://seanoc.co/1F02Vh9 and another distributed systems talk at http://seanoc.co/1pNfply.
Sean O’Connor is the Lead Application Developer at Bitly. Day to day he builds systems, reviews code, and works with the awesome crew at Bitly to create powerful new tools to help people understand the connected world. Outside of Bitly, Sean spends most of his time biking around NYC, brewing cider, and keeping his dog out of trouble.
- Title: Failing With Grace
- Track: Cooking
- Room: B204
- Time: 3:45 – 4:30pm
- Speakers: Sean O'Connor