Making Chaos Engineering Real for PCF at T-Mobile

August 8, 2019 Dormain Drewitz

In our last podcast on chaos engineering with Tammy Butow, we learned about the basic concepts of chaos engineering, failure injection, and "game days." This time, Derrick Harris interviews Karun Chennuri (@karunchennuri) and Ramesh Krishnaram (@RKrishnaram) about how they are applying these concepts at T-Mobile, where they run about 3,000 applications and nearly 40,000 containers on a Pivotal Cloud Foundry-based platform. What they learned is that chaos engineering tools are not one-size-fits-all.

What's different about chaos engineering in a PCF environment? For one thing, the BOSH layer that powers PCF already has a lot of resiliency built in, so the team set out to build a toolkit that understood BOSH. There are also a lot of microservices running on PCF, and we hear why microservices require a special approach to chaos engineering. Finally, we hear how teams are being onboarded to chaos engineering at T-Mobile and how they've started to run "game days."

Ramesh and Karun first presented their work at SpringOne Platform 2018, and delivered a deep dive, as well as a keynote, at Cloud Foundry Summit 2019 in Philadelphia. In October, their teammates Matthew Conover and Savani Tatake will present "part 2" of the T-Mobile journey with chaos engineering on PCF at SpringOne Platform 2019. Want to hear it? Register before August 20th for $200 off and use discount code S1P200_DDREWITZ for *another* $200 off.

Show Notes

About the Author

Dormain Drewitz

Dormain leads Product Marketing and Content Strategy for VMware Tanzu. Before VMware, she was Senior Director of Pivotal Platform Ecosystem, including RabbitMQ, and Customer Marketing. Previously, she was Director of Product Marketing for Mobile and Pivotal Data Suite. Prior to Pivotal, she was Director of Platform Marketing at Riverbed Technology. Prior to Riverbed, she spent over 5 years as a technology investment analyst, closely following enterprise infrastructure software companies and industry trends. Dormain holds a B.A. in History from the University of California at Los Angeles.

More Content by Dormain Drewitz
Hello, Spring Security 5.2 with Ria Stein

This week, Mark Heckler talks with Ria Stein about her talk at SpringOne Platform 2019, "Hello, Spring Secu...

Not Standing Still in Secure DevOps with Jai Schniepp of Liberty Mutual

In this episode, we learned how the Liberty Mutual Secure DevOps Platforms team keeps iterating to solve mo...