Humans of Reliability

Humans of Reliability
Podcast Description
Behind every reliable software system, there are people working hard to keep it online. Humans of Reliability is a series that spotlights the engineers, leaders, and innovators at the heart of incident management and system reliability. Through candid conversations, we explore the challenges, lessons, and personal journeys of those navigating complex technical landscapes to ensure the systems we rely on run smoothly. From unforgettable incident stories to favorite tools, workflows, and hobbies, Humans of Reliability uncovers the human side of technology—offering insights and inspiration for anyone passionate about building and maintaining resilient systems.https://rootly.com/humans-of-reliability
Podcast Insights
Content Themes
The show focuses on themes such as incident management, reliability engineering, and personal journeys within the tech industry, with episode examples including insights on SRE practices from Google’s Steve McGee and the impact of mentorship in tech leadership from Hannah Hammonds, highlighting key challenges and tools used in the field.

Behind every reliable software system, there are people working hard to keep it online.
Humans of Reliability is a series that spotlights the engineers, leaders, and innovators at the heart of incident management and system reliability. Through candid conversations, we explore the challenges, lessons, and personal journeys of those navigating complex technical landscapes to ensure the systems we rely on run smoothly.
From unforgettable incident stories to favorite tools, workflows, and hobbies, Humans of Reliability uncovers the human side of technology—offering insights and inspiration for anyone passionate about building and maintaining resilient systems.
https://rootly.com/humans-of-reliability
AI is transforming reliability work—from reactive firefighting to proactive engineering. In this episode, Ryan Lockard, VP of Platform Engineering and AI Enablement at CVS Health, joins Sylvain Kalache to break down how AI is showing up on the frontlines of healthcare infrastructure and operations.
From LLM copilots to cultural shifts in ownership, Ryan walks us through:
- How AI tools help troubleshoot legacy systems and assist during real-time incidents
- Why proactive reliability is finally within reach thanks to AI-enhanced tooling and workflows
- What MCP servers are, and how natural language interfaces are streamlining cloud operations
- How engineering culture and on-call models shift when teams truly own their reliability posture
- What AI means for the next generation of developers—and why prompt engineering matters
Whether you're managing incidents, building platforms, or scaling team culture, this conversation offers a pragmatic lens on how AI is changing the work of reliability from the inside out.

Disclaimer
This podcast’s information is provided for general reference and was obtained from publicly accessible sources. The Podcast Collaborative neither produces nor verifies the content, accuracy, or suitability of this podcast. Views and opinions belong solely to the podcast creators and guests.
For a complete disclaimer, please see our Full Disclaimer on the archive page. The Podcast Collaborative bears no responsibility for the podcast’s themes, language, or overall content. Listener discretion is advised. Read our Terms of Use and Privacy Policy for more details.