Humans of Reliability
Podcast Description
Behind every reliable software system, there are people working hard to keep it online.

Humans of Reliability is a series that spotlights the engineers, leaders, and innovators at the heart of incident management and system reliability. Through candid conversations, we explore the challenges, lessons, and personal journeys of those navigating complex technical landscapes to ensure the systems we rely on run smoothly.

From unforgettable incident stories to favorite tools, workflows, and hobbies, Humans of Reliability uncovers the human side of technology—offering insights and inspiration for anyone passionate about building and maintaining resilient systems.

https://rootly.com/humans-of-reliability
Podcast Insights
Content Themes
The show focuses on themes such as incident management, reliability engineering, and personal journeys within the tech industry. Episode examples include insights on SRE practices from Google’s Steve McGee and the impact of mentorship in tech leadership from Hannah Hammonds, highlighting the key challenges and tools used in the field.

Only 50% of companies monitor their ML systems. Building observability for AI is not simple: it goes beyond 200 OK pings. In this episode, Sylvain Kalache sits down with Conor Brondsdon (Galileo) to unpack why observability, monitoring, and human feedback are the missing links to making large language models (LLMs) reliable in production.
Conor dives into the shift from traditional test-driven development to evaluation-driven development, where metrics like context adherence, completeness, and action advancement replace binary pass-fail checks. He also shares how teams can blend human-in-the-loop feedback, automated guardrails, and small language models to keep AI accurate, compliant, and cost-efficient at scale.
