In our last podcast on chaos engineering with Tammy Butow , we learned about the basic concepts of chaos engineering, failure injection, and "game days." This time, Derrick Harris interviews Karun Chennuri (@karunchennuri) and Ramesh Krishnaram (@RKrishnaram) of T-Mobile about how they are applying this at T-Mobile, where they are running about 3,000 applications and nearly 40,000 containers on Pivotal Cloud Foundry-based platform. What they learned is that chaos engineering tools are not one-sized fits all.
What's different about chaos engineering in a PCF environment? For one thing, the BOSH layer that powers PCF already has a lot of resiliency built in, so the team set out to build a toolkit that understood BOSH. There are also a lot of microservices running on PCF, and we hear about why microservices require a special approach to chaos engineering. Finally, we hear about how teams are being onboarded to chaos engineering at T-Mobile and how they've started to run "game days."
Ramesh and Karun first presented about their work back at SpringOne Platform 2018, and delivered a deep dive, as well as a keynote, at Cloud Foundry Summit 2019 in Philadelphia. In October, their teammates Matthew Conover and Savani Tatake, will present the "part 2" of the T-Mobile journey with chaos engineering on PCF at SpringOne Platform 2019. Want to hear it? Register before August 20th for $200 off and use discount code S1P200_DDREWITZ for *another* $200 off.
Show Notes
- Visit http://pivotal.io/podcasts for show notes and other episodes.
- Download the episode and check us out on Soundcloud, subscribe to the feed directly, or on iTunes to have it automatically downloaded for you.
- Twitter: @derrickharris, @karunchennuri, @RKrishnaram
- Feedback: podcast@pivotal.io
About the Author
Follow on Twitter Visit Website More Content by Dormain Drewitz