Writing

Notes, thoughts, and technical musings — 14 posts.

2026

What Is an Agentic Runtime? My Mental Model (And Why I Care)

I have been building agents without a runtime. It works until something breaks and you have no idea why. Here is the runtime model I wish I had from the start.

3 Jul 20266 min read #agentic-runtime #ai-agent #architecture

Autonomous Remediation: Where I Think Agents Belong (And Where I Learned They Don't)

I built an autonomous remediation agent. It worked for two weeks, then it restarted a production database pod during a backup window. Here is what I learned about where agents actually belong.

23 Jun 20266 min read #autonomous-remediation #agentic-devops #kubernetes

Self-Hosted AI vs API: What I Actually Pay (And What I Actually Get)

I ran the numbers on my own setup. Self-hosted AI is not always cheaper, and the hidden costs are not where you think they are.

6 Jun 20266 min read #self-hosted-ai #cost-analysis #openai-api

How I Actually Think About Security for AI Automation (After a Near Miss)

I almost sent customer data to a public LLM API because my pipeline was not checking the data flow. Here is the security model I use now.

28 May 20267 min read #ai-automation-security #webhook-security #kubernetes-security

n8n vs Temporal: Why I Actually Run Both (And the Migration That Hurt)

I lost a day's worth of data because n8n does not replay state. That is why I now run Temporal for anything that matters.

22 Apr 20265 min read #n8n #temporal #workflow-orchestration

Agentic DevOps: Why I Think MCP Servers Are the Right Abstraction

I gave an AI agent kubectl access once. It deleted the wrong namespace. Here is why I now believe MCP servers are the only safe way to let agents touch infrastructure.

7 Apr 20266 min read #agentic-devops #ai-infrastructure #kubernetes

How I Actually Deploy Ollama on Kubernetes (And the GPU Headaches I Fixed)

I spent a weekend getting Ollama to actually see my GPU on Kubernetes. Here is what broke and what I learned.

26 Mar 20265 min read #ollama #kubernetes #self-hosted-ai

How I Actually Build Event-Driven AI Pipelines (And the Queue That Saved Me)

I built an AI pipeline without a queue. It worked until a marketing campaign sent 500 events in an hour. Here is what I learned about queues, durability, and why n8n alone is not enough.

17 Mar 20266 min read #ai-automation #event-driven-architecture #kubernetes

Temporal on Kubernetes: Why I Actually Reach for It (And the Time It Saved Me)

I had a workflow that processed AI-generated content. It failed mid-run and I lost half a day's work. Temporal would have prevented that. Here is why I now use it for anything that matters.

8 Mar 20266 min read #temporal #ai-workflows #kubernetes

Why I Actually Centralize My LLM Traffic Through LiteLLM

I had five API keys scattered across laptops, env files, and one unfortunate screenshot. LiteLLM fixed that. Here is what I actually run and what it actually costs me.

24 Feb 20265 min read #litellm #kubernetes #ai

How I Actually Deployed vLLM on Kubernetes (And the Reality of a GTX 1080)

I thought vLLM would be a simple upgrade from Ollama. I was wrong. Here is what actually happened when I tried to run it on my GTX 1080.

6 Feb 20265 min read #vllm #kubernetes #ai

AI Coding Tools: How I Actually Use OpenRouter, Kimi Code, and Continue.dev

I use multiple AI coding tools through OpenRouter and Kimi Code. Here is what each one actually does for me, what it cannot do, and why I keep using them.

25 Jan 20265 min read #openrouter #kimi-code #continue-dev

Ollama vs vLLM: Why I Actually Run Both (And the Migration That Wasn't Free)

I moved from Ollama to vLLM expecting a simple upgrade. I had to re-download models, learn new quantization formats, and debug NCCL errors. Here is what the migration actually cost me.

13 Jan 20266 min read #ollama #vllm #llm-inference

Building an AI SRE Agent with MCP: What I Actually Built

I built an AI SRE agent. It can read logs and list pods. It cannot fix anything yet. Here is why that is exactly the right scope.

4 Jan 20264 min read #mcp #ai-agent #sre