This post thumbnail

14 May 2026 09:00 AM

A practical introduction to why LLM serving breaks the usual web-app scaling playbook: requests become token streams, latency splits into TTFT and TPOT, replicas may span GPUs or nodes, memory becomes KV cache, and autoscaling needs workload-aware signals instead of CPU alone.

This post thumbnail

7 May 2026 07:49 AM

The best use of AI in DevOps isn't autonomous agents with production access. It's reducing cognitive toil: reading docs, summarizing release notes, comparing configs, and giving engineers enough context to make better decisions.

This post thumbnail

1 May 2026 08:00 AM

Platform engineering isn't rebranded DevOps. Here's what actually changes in your day job, your skills, and your salary when you make the shift, from someone living it.

This post thumbnail

26 April 2026 07:49 AM

In the previous part, we set up cert-manager on a Kubernetes cluster and issued SSL certif...

This post thumbnail

24 October 2025 07:49 AM

Learn how to enable Basic auth for Prometheus ingress with ALB in Kubernetes by adding an NGINX proxy sidecar in kube-prometheus-stack Helm chart. Full guide with YAML examples, annotations, and best practices.

This post thumbnail

24 November 2021 03:04 PM

Kubernetes is an open-source system for automating deployment, scaling and management of c...