Container Driven Architecture

In this episode, we continue our dive into the changing architecture of IT infrastructure and look at how containers and container platforms are changing. We also look at the fundamental shift in what people want to buy, accelerated by Broadcom’s acquisition of VMware, which is making virtualization platforms much less attractive. This episode is based on a presentation that I’ve been giving on the shift toward OpenShift Virtualization and Kubernetes in general.

Transcript: otter.ai/u/BnYKzI0zzOLqqWi45v…?utm_source=copy_url

HA Troubleshooting [Tech Ops]

This episode of the TechOps series goes into high availability troubleshooting. Not just high availability, not just troubleshooting, but what it actually takes to manage, maintain, and fix HA systems. This is part of a longer discussion we’ve been having, so there are some really interesting ideas in the middle of these discussions that I hope will shape your thinking as you build high availability systems, along with diagnostics and troubleshooting for people who work in very complex high availability environments.

Transcript: otter.ai/u/wM__4w1YIzZnhVdgLu…?utm_source=copy_url

References:
status.openai.com/incidents/ctrsv3lwd797

Logging [TechOps Series]

We dive deep into logging, tracing, metrics, and observability, with a specific focus on automation, systems, and infrastructure.

The real challenge here is how to capture information from a running system in a way that provides the right information at the right time. Fundamentally, that is the question we work to answer throughout this fascinating discussion about logging.

Transcript: otter.ai/u/msNO2gn1b0FP2lK7rS…?utm_source=copy_url

Reading Logs and Events [TechOps]

This TechOps episode explores the challenges of processing events and logs in technical operations.

The discussion covers the importance of understanding the intent and purpose of building systems downstream from eventing and logging systems. Key topics include the trade-offs between real-time and delayed event processing, the principle of least privilege, and strategies for handling event buffering and dropping. The conversation also touches on security concerns related to event and log data.

The episode concludes with plans for future discussions on adding events and logging to scripts to make them more useful.
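
As a rough illustration of the buffering and dropping trade-off mentioned above, here is a minimal Python sketch. It is not taken from the episode: the `BoundedEventBuffer` class, its `push` and `drain` methods, and the drop-oldest policy are all assumptions chosen for the example.

```python
from collections import deque


class BoundedEventBuffer:
    """Hypothetical fixed-size buffer that drops the oldest event when full."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._events = deque(maxlen=capacity)
        self.dropped = 0  # number of events discarded so far

    def push(self, event):
        # A full deque with maxlen evicts its oldest item on append,
        # so record the drop before appending.
        if len(self._events) == self._events.maxlen:
            self.dropped += 1
        self._events.append(event)

    def drain(self):
        """Return all buffered events, oldest first, and empty the buffer."""
        batch = list(self._events)
        self._events.clear()
        return batch


# Push five events into a buffer that only holds three.
buf = BoundedEventBuffer(capacity=3)
for i in range(5):
    buf.push({"seq": i})
batch = buf.drain()
```

Dropping the oldest events favors fresh data, which tends to suit near-real-time consumers. A delayed or audit-style consumer might instead reject new events or spill to disk, and in either case the `dropped` counter gives downstream systems a way to know that data was lost.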

Advanced SSH [TechOps]

SSH (Secure Shell) is one of those topics that people take for granted because it is a ubiquitous way to log in and access systems. True to form for the TechOps series, though, we break it down into much more detailed and granular components.

We talk about how to secure it and what the best practices are. We also discuss how to use it for tunneling, or, more specifically, how not to use it for tunneling, and why all of this matters to your operations environment. Listen to hear about the new things we’re doing that avoid the need for network access at all.

Transcript: otter.ai/u/XSRBfnifZOF0-nlNU5…?utm_source=copy_url