Blog

System architecture, performance, security, and the engineering decisions that matter at scale.

Forget the Output. Watch What Your Agent Reads Before It Writes.
LatestMay 7, 20268 min read

Forget the Output. Watch What Your Agent Reads Before It Writes.

234,760 tool calls revealed a 70% drop in read-to-edit ratio weeks before anyone noticed output quality declining. Behavioral fingerprints catch what output checks miss.

AI AgentsDeveloper ToolingObservabilityClaude Code
Read article
83 posts~9 hours of reading
AllAIEngineeringArchitectureAI AgentsDeveloper ToolsSecurityOpen SourceMCPaiLLMsai-agentssoftware-engineeringDeveloper ToolingInfrastructureSoftware EngineeringNext.jsSupply ChainMemory Systemsinfrastructuredeveloper-toolsCI/CDClaude CodeAutomationGoDevOpsClaudeNode.jsPerformanceCoding AgentsWeb Developmentarchitectureproductivityclaude-codedeveloper-experienceagentscode-reviewAI InfrastructureKnowledge GraphsCode ReviewProduction EngineeringTypeScriptPrivacyMachine LearningCultureAWSCareerCloudflareServerlessAnthropicGeminiStrategyOpenAIFuture of WorkReactsystem-designengineering-leadershipcareermetricsopen-sourceai-code-generationlicensingsoftware-lawdeveloper-productivityplanningautonomous-agentsbullmqouijamemory-systemsengramanthropicobservabilityindustrydeveloper-toolingverificationGraphRAGRAGProduction SystemsEngramGovernanceDistributed SystemsEncryptionCompilersTerminalCloud InfrastructureAzureEngineering LeadershipnpmRLHFCostRuby on RailsFramework ArchitectureDeveloper ProductivityOn-Device MLAI SecurityDevSecOpsApplication SecurityPersonalEidEngineering ManagementTechnical DebtDynamoDBPostgresGitHubAI SafetyEnterpriseAgent SkillsSEOObject StorageProductionGoogle CloudAPI SecurityWeb PerformanceHot TakeEdge ComputingLambdaCloudJavaScriptXSSRustBrowsersProductivityWorkflowDependency ManagementHardwaredevopscloudEthicsPolicyBusinessSaaSResearchChrome ExtensionsInfoSecCode QualityTechnologyQuantum ComputingFutureTailwind CSS
AI Made Microservices More Expensive. Nobody Wants to Admit It.

AI Made Microservices More Expensive. Nobody Wants to Admit It.

AI workloads punish the exact things microservices add: network hops, duplicated caches, serialization overhead. The monolith argument isn't about simplicity anymore. It's about money.

system-designarchitectureaiinfrastructure
May 5, 2026
5 min read
The Productivity Panic Is Coming for Senior Engineers Next

The Productivity Panic Is Coming for Senior Engineers Next

Three waves of productivity metrics. Story points, DORA, AI output. Each one optimized for a proxy until it broke. Each one made senior engineers less visible.

engineering-leadershipproductivitycareermetrics
May 2, 2026
6 min read
Someone Used Claude to Rewrite a Library and Strip Its License. The Legal Question Applies to All of Us.

Someone Used Claude to Rewrite a Library and Strip Its License. The Legal Question Applies to All of Us.

chardet has 130 million monthly downloads. A developer used Claude to rewrite it in five days and switched the license from LGPL to MIT. The original author came back after 15 years to object. I read the issue and realized I do this every day.

open-sourceai-code-generationlicensingsoftware-law
May 1, 2026
7 min read
I Spent 8 Hours Planning with an AI. The Code Took 75 Minutes.

I Spent 8 Hours Planning with an AI. The Code Took 75 Minutes.

Everyone says AI makes coding faster. Nobody publishes the actual hours. My planning-to-execution ratio was 6.4:1. That ratio is why the overnight build worked.

ai-agentsdeveloper-productivityplanningclaude-code
April 30, 2026
7 min read
The Flag Is Called --dangerously-skip-permissions. I Run It Every Night.

The Flag Is Called --dangerously-skip-permissions. I Run It Every Night.

Overnight autonomous agents are real, productive, and running on a flag with 'dangerously' in the name. Nobody has solved the trust problem. We've agreed to skip it.

ai-agentsclaude-codeautonomous-agentsdeveloper-experience
April 29, 2026
7 min read
Why My Agent Pipeline Still Runs on BullMQ

Why My Agent Pipeline Still Runs on BullMQ

Vercel Workflows ships crash recovery, step isolation, and durable state for agents. My pipeline uses BullMQ on a $7 VPS instead. Here's the trade-off.

ai-agentsbullmqinfrastructureouija
April 28, 2026
7 min read
Anthropic Shipped Agent Memory to Production While I Was Still Debugging Mine

Anthropic Shipped Agent Memory to Production While I Was Still Debugging Mine

Anthropic launched persistent memory for Claude Managed Agents. I've been building my own memory engine for months. Here's what their version solves, what it doesn't, and why the hard half of agent memory isn't storage.

ai-agentsmemory-systemsengramanthropic
April 27, 2026
8 min read
Context Engineering Replaced Prompt Engineering and Nobody Noticed

Context Engineering Replaced Prompt Engineering and Nobody Noticed

I've been doing context engineering for months without calling it that. A 547-line CLAUDE.md, subagent isolation, strategic compaction, six MCP servers. The term just caught up to the practice.

aisoftware-engineeringdeveloper-toolsagents
April 26, 2026
6 min read
AI Agents Don't Crash. They Succeed at the Wrong Thing.

AI Agents Don't Crash. They Succeed at the Wrong Thing.

78% of AI failures are invisible. But the research only covers wrong outputs. The failure mode nobody's monitoring for is correct output at the wrong time, in the wrong scope, for the wrong reason.

aisoftware-engineeringagentsobservability
April 25, 2026
8 min read
I Asked My AI to Fix One Line. It Rewrote the Entire Function.

I Asked My AI to Fix One Line. It Rewrote the Entire Function.

GPT-5.4 scores 0.395 on edit faithfulness while Claude Opus 4.6 scores 0.060. Over-editing isn't a bug. It's a training incentive. Models are rewarded for solving the task, not for touching the least code.

aisoftware-engineeringcode-reviewdeveloper-tools
April 24, 2026
8 min read
SpaceX Just Paid $60 Billion for an IDE. The Coding Tool War Is Over.

SpaceX Just Paid $60 Billion for an IDE. The Coding Tool War Is Over.

SpaceX secured the right to acquire Cursor for $60B. That's more than GitHub, Slack, and Figma combined. But the real story isn't the price tag. It's what happens when your coding tool runs Grok instead of Claude.

aideveloper-toolssoftware-engineeringindustry
April 23, 2026
8 min read
I Use AI to Write Code 10 Hours a Day. Vibe Coding Is Still a Terrible Idea.

I Use AI to Write Code 10 Hours a Day. Vibe Coding Is Still a Terrible Idea.

The METR study found experienced developers are 19% slower with AI tools while believing they're 20% faster. I've lived that 39-point perception gap. Here are three failures that proved it.

aideveloper-experiencesoftware-engineeringproductivity
April 22, 2026
7 min read
GitHub Is Using GPT to Review Claude's Work. That's Either Brilliant or the Most Expensive Code Review Ever.

GitHub Is Using GPT to Review Claude's Work. That's Either Brilliant or the Most Expensive Code Review Ever.

GitHub's Rubber Duck ships GPT-5.4 as a reviewer for Claude Sonnet's code. The cross-model pattern is real, backed by ICLR 2026 research. But 'second opinion' is the wrong frame. The hardest agent failures need structured verification, not another model guessing.

ai-agentscode-reviewdeveloper-toolingverification
April 21, 2026
8 min read
Your CI Pipeline Depends on a Model That Ships Breaking Changes Without a Changelog

Your CI Pipeline Depends on a Model That Ships Breaking Changes Without a Changelog

Opus 4.7 shipped Tuesday. It removed temperature, killed budget_tokens, changed the tokenizer by 35%, and shifted how agents spawn subprocesses. My pipeline didn't break. I also didn't test for it. Neither did you.

AI InfrastructureCI/CDLLMsDeveloper ToolingSupply ChainClaude Code
April 20, 2026
8 min read
GraphRAG Pilots Succeed. Production Deployments Fail Quietly.

GraphRAG Pilots Succeed. Production Deployments Fail Quietly.

Entity resolution errors compound exponentially. Graph decay runs 15-20% per quarter. Gradient Flow says they barely know of any production deployments offering real business value. The most hyped retrieval pattern of 2026 has a production problem nobody wants to own.

GraphRAGRAGKnowledge GraphsAI InfrastructureProduction Systems
April 19, 2026
8 min read
I'm Wiring Graph Memory Into Code Review. Here's What Vectors Miss.

I'm Wiring Graph Memory Into Code Review. Here's What Vectors Miss.

Your AI code reviewer gives the same feedback your team rejected three weeks ago. It can't know. I'm building the fix: two graphs, one structural and one cognitive, wired together through spreading activation.

AI AgentsCode ReviewEngramKnowledge GraphsDeveloper ToolingMCP
April 18, 2026
7 min read
Your Company Already Has AI Agents. You Just Don't Govern Them Yet.

Your Company Already Has AI Agents. You Just Don't Govern Them Yet.

The most dangerous AI agent in your org isn't the one leadership is planning to deploy. It's the one a developer shipped last quarter with operator-level permissions and no review process.

AI AgentsGovernanceSecurityProduction Engineering
April 17, 2026
7 min read
I Told My Agent Not to Do That. It Did It Anyway.

I Told My Agent Not to Do That. It Did It Anyway.

My CLAUDE.md said 'NEVER publish without internal links.' The agent published with zero. The fix wasn't better rules. It was structural enforcement: eval harnesses, separate verifiers, and hooks that don't ask permission.

AI AgentsClaude CodeDeveloper ToolingAutomation
April 16, 2026
6 min read
Multi-Agent Development Is a Distributed Systems Problem. I Learned This the Hard Way.

Multi-Agent Development Is a Distributed Systems Problem. I Learned This the Hard Way.

I built a multi-agent pipeline with BullMQ, hit every distributed systems failure in the book, and learned most tasks don't need multi-agent.

AI AgentsDistributed SystemsTypeScriptArchitecture
April 15, 2026
10 min read
I Decrypted My Own Notes to Feed an AI. Privacy and Memory Can't Coexist Yet.

I Decrypted My Own Notes to Feed an AI. Privacy and Memory Can't Coexist Yet.

I set up end-to-end encryption to protect my Obsidian notes from everyone. Then I wrote 814 lines of TypeScript to decrypt them on the server and pipe them into an AI memory engine. I am the threat model I was protecting against.

AIPrivacyMemory SystemsEncryption
April 14, 2026
9 min read
TypeScript's Compiler Is Being Rewritten in Go. I Have Mixed Feelings.

TypeScript's Compiler Is Being Rewritten in Go. I Have Mixed Feelings.

The 10x speed promise is real for VS Code-scale projects. For my 7-package monorepo it's more like 3x. And nobody's talking about what we lose: the plugin API that Angular, Vue, and hundreds of tools depend on has no replacement timeline.

TypeScriptDeveloper ToolingGoCompilers
April 12, 2026
8 min read
I Run Six MCP Servers Daily. Here's What Breaks.

I Run Six MCP Servers Daily. Here's What Breaks.

MCP won the standard war. But running six servers in production every day exposes failure modes no demo will show you: context bloat, silent auth failures, and tool selection that falls apart at scale.

MCPAI AgentsDeveloper ToolingInfrastructure
April 10, 2026
9 min read
The Quiet Death of the IDE: Why 46% of Developers Fell in Love with a Terminal

The Quiet Death of the IDE: Why 46% of Developers Fell in Love with a Terminal

The Pragmatic Engineer's 2026 survey says Claude Code is the most loved AI dev tool at 46%. Cursor sits at 19%. Copilot at 9%. I switched six months ago. The terminal won, and it wasn't even close.

Developer ToolsClaude CodeAI AgentsTerminal
April 9, 2026
7 min read
I Compared Three AI Memory Systems. They Can't Even Agree on What Memory Means.

I Compared Three AI Memory Systems. They Can't Even Agree on What Memory Means.

SimpleMem compresses conversations into atoms. MemPalace stores every word in a spatial hierarchy. Engram forgets on purpose. After a week with all three, I think they're solving different problems.

AIMemory SystemsOpen SourceAI Agents
April 8, 2026
7 min read
I Closed the AI Agent Loop. They Stopped Making the Same Mistakes.

I Closed the AI Agent Loop. They Stopped Making the Same Mistakes.

My AI agent applied seven patches to the same bug. Each one a fresh attempt. No accumulation. After connecting dispatch with memory, the same class of bug gets caught on the first try.

AIMemory SystemsOpen SourceAI AgentsAutomation
April 5, 2026
7 min read
I Built an AI Memory System That Forgets on Purpose. It Remembers Better Than Yours.

I Built an AI Memory System That Forgets on Purpose. It Remembers Better Than Yours.

My AI agent wrote a perfect architecture spec for dual-storage memory. Then it ignored the whole thing and built a flat table. Seven patches later, I threw it all away and built Engram from neuroscience papers instead.

AIMemory SystemsOpen SourceAI Agents
April 5, 2026
8 min read
AI Agents Can Write Code. Nobody Is Managing Them. I Built the Missing Layer.

AI Agents Can Write Code. Nobody Is Managing Them. I Built the Missing Layer.

Two of my AI agents opened conflicting PRs on the same repo. I caught it at 11 PM. That's when I realized the control plane was already sitting on my screen: the kanban board.

AIDevOpsOpen SourceAI AgentsAutomation
April 5, 2026
6 min read
oh-my-codex Hit 3K Stars in 24 Hours. I Run an Agent Orchestrator. Here's Why This Matters.

oh-my-codex Hit 3K Stars in 24 Hours. I Run an Agent Orchestrator. Here's Why This Matters.

The viral growth of oh-my-codex isn't about Codex. It's about who owns the orchestration layer. While Anthropic locks down, the community is building the real platform.

AIDeveloper ToolsOpen Source
April 4, 2026
7 min read
173 Agents, Zero Owners: Azure's Structural Rot

173 Agents, Zero Owners: Azure's Structural Rot

A former Azure Core engineer just published the most detailed infrastructure failure account since the Knight Capital postmortem. 173 unowned agents, 500K monthly crashes, and a trillion dollars gone.

Cloud InfrastructureAzureEngineering Leadership
April 3, 2026
7 min read
I Analyzed Claude Code 4 Times From the Outside. The Source Leak Proved Me Right and Wrong.

I Analyzed Claude Code 4 Times From the Outside. The Source Leak Proved Me Right and Wrong.

512,000 lines of TypeScript leaked from a missing .npmignore. The architecture I reverse-engineered was spot-on. The implementation quality made me rethink what 'good code' means at $2.5B ARR.

AIDeveloper ToolsEngineeringClaude
April 2, 2026
9 min read
I Audited My Lockfile After the Axios Compromise. You Should Too.

I Audited My Lockfile After the Axios Compromise. You Should Too.

Someone hijacked an Axios maintainer's npm account and published two versions with a RAT that deletes itself after install. 50 million weekly downloads. The dropper leaves no trace. Here's exactly what happened and what to check.

SecuritynpmSupply ChainNode.js
March 31, 2026
6 min read
My AI Agent Wrote the Spec, Then Ignored It. Here's the Gradient That Explains Why.

My AI Agent Wrote the Spec, Then Ignored It. Here's the Gradient That Explains Why.

I watched an AI coding agent write a perfect architecture doc, then ship 15 patches that violated every decision in it. When I confronted it, the explanation was more honest than anything I've read about RLHF.

AISoftware EngineeringAI AgentsRLHF
March 29, 2026
10 min read
The $7 VPS Running an AI Agent That Proves Infrastructure Minimalism Works

The $7 VPS Running an AI Agent That Proves Infrastructure Minimalism Works

George Larson put an AI agent on a $7/month VPS. It handles real traffic with a 678 KB binary and 1 MB of RAM. The 'AI needs scale' narrative just broke.

InfrastructureAICost
March 27, 2026
8 min read
Karpathy's 630 Lines Won't Replace Researchers. They'll Replace Research.

Karpathy's 630 Lines Won't Replace Researchers. They'll Replace Research.

AutoResearch got 56,000 stars for the wrong reason. Everyone focused on the AI agent. The real engineering is the four constraints that make unsupervised velocity safe.

AIMachine LearningArchitectureDeveloper Tools
March 26, 2026
6 min read
PROMPT to IPO: Why Rails Won the Agent Framework War Nobody Knew Was Happening

PROMPT to IPO: Why Rails Won the Agent Framework War Nobody Knew Was Happening

DHH rewrote the Rails homepage to target AI agents instead of developers. The data says he's onto something, even if Rails isn't the most token-efficient framework in benchmarks.

Ruby on RailsAI AgentsFramework ArchitectureDeveloper Productivity
March 25, 2026
9 min read
Your Phone's SSD Is the New VRAM

Your Phone's SSD Is the New VRAM

A 397B parameter model running on 12GB of RAM. The trick isn't new ML theory. It's demand paging, the same architecture pattern we've used since the 1960s.

AIPerformanceArchitectureOn-Device ML
March 24, 2026
8 min read
Cursor Just Open-Sourced the Agents That Review AI-Generated Code

Cursor Just Open-Sourced the Agents That Review AI-Generated Code

Cursor's PR velocity went up 5x. DryRun says 87% of AI PRs have vulnerabilities. The solution? More agents. Four autonomous security agents review every PR, scan for forgotten vulns, and auto-patch dependencies. Templates are public. The meta-layer is here.

AI SecurityDevSecOpsCoding AgentsApplication Security
March 23, 2026
9 min read
Next.js 16.2 Isn't a Framework Update. It's an Agent Platform.

Next.js 16.2 Isn't a Framework Update. It's an Agent Platform.

Next.js 16.2 shipped AGENTS.md by default, bundled docs in node_modules, browser logs piped to terminal, and a CLI that gives agents DevTools via shell commands. Vercel isn't improving DX. They're building for a new user: the coding agent.

Next.jsAI AgentsDeveloper ToolsWeb Development
March 22, 2026
8 min read
Tomorrow Is Eid. Here's What That Actually Means.

Tomorrow Is Eid. Here's What That Actually Means.

A software engineer's window into Eid al-Fitr. Not a lecture on Islam. Just what tonight and tomorrow actually look like from the inside.

CulturePersonalEid
March 20, 2026
6 min read
Ollama Just Became an OpenClaw Provider

Ollama Just Became an OpenClaw Provider

Ollama 0.18 shipped with native OpenClaw integration. Local models now get tool calling, multi-agent workflows, and permission boundaries. No API costs, no data leaving your network.

AIArchitectureDevOps
March 19, 2026
10 min read
Verification Debt: The Hidden Org Cost of AI-Generated Code

Verification Debt: The Hidden Org Cost of AI-Generated Code

Amazon.com went down for six hours because of AI-assisted code changes. A week later, they required senior engineer sign-offs. LinearB analyzed 8.1 million pull requests and found AI code waits 4.6x longer for review and ships 19% slower. The productivity gains were a mirage.

AICode ReviewEngineering ManagementTechnical Debt
March 17, 2026
12 min read
I Chose DynamoDB for a Social App. Here's Why I'd Undo It.

I Chose DynamoDB for a Social App. Here's Why I'd Undo It.

I picked DynamoDB through Amplify for a professional networking app with swipeable cards. The first two weeks were magic. Then we needed queries DynamoDB was never built to answer.

ArchitectureAWSDynamoDBPostgres
March 15, 2026
10 min read
The MCP vs CLI Debate Is Missing the Point

The MCP vs CLI Debate Is Missing the Point

Everyone's arguing whether AI agents should use MCP or CLI tools. The answer depends on a question nobody's asking: does the model already know how to use the tool, or did your team build it last Tuesday?

MCPAIArchitectureDeveloper Tools
March 14, 2026
7 min read
GitHub Built a Threat Model for Coding Agents. It's Missing a Layer.

GitHub Built a Threat Model for Coding Agents. It's Missing a Layer.

GitHub published the most sophisticated platform security for AI agents I've seen. Isolation, token quarantine, constrained outputs, audit trails. It doesn't stop the attacks that actually happened this month.

SecurityAICI/CDGitHub
March 13, 2026
8 min read
I Built a Prompt Injection Firewall for MCP Servers

I Built a Prompt Injection Firewall for MCP Servers

MCP servers have no input sanitization layer. Every JSON-RPC request flows straight from AI client to tool server, unfiltered. So I built one.

SecurityMCPAI SafetyOpen Source
March 12, 2026
8 min read
McKinsey's AI Got Hacked by an AI. The Vulnerability Was From 1998.

McKinsey's AI Got Hacked by an AI. The Vulnerability Was From 1998.

An autonomous AI agent breached McKinsey's internal AI platform in two hours. No credentials. No insider access. The entry point was SQL injection through JSON field names, a bug class older than most junior developers.

SecurityAIEnterprise
March 11, 2026
9 min read
A GitHub Issue Title Compromised 4,000 Developer Machines

A GitHub Issue Title Compromised 4,000 Developer Machines

Someone put a prompt injection payload in a GitHub issue title. An AI triage bot executed it, poisoned the build cache, stole npm credentials, and pushed a rogue package to 4,000 developers. The full chain took five steps.

SecurityAICI/CDSupply Chain
March 10, 2026
9 min read
Skills, MCP, and the Orchestration Gap Nobody's Fixing

Skills, MCP, and the Orchestration Gap Nobody's Fixing

Agent skills became an open standard. MCP connects everything. But the layer between them, the one that keeps agents from failing catastrophically in production, barely exists.

AI AgentsMCPAgent SkillsProduction Engineering
March 8, 2026
8 min read
I Had Zero Pages Indexed for Three Months. Here's the One-Line Fix.

I Had Zero Pages Indexed for Three Months. Here's the One-Line Fix.

A canonical URL mismatch between www and non-www kept my entire blog invisible to Google for three months. Six files, twelve line changes, and a sitemap resubmission fixed it. Here's how to check yours.

SEOWeb DevelopmentEngineeringNext.js
March 4, 2026
6 min read
Inside Claude Code's Context Machine

Inside Claude Code's Context Machine

Claude Code manages your context through three systems: microcompaction, auto-compaction, and structured rehydration. Here's how the machinery actually works, and why most developers burn tokens without realizing it.

AIDeveloper ToolsEngineeringClaude
March 3, 2026
8 min read
AI Made Writing Code Easier. It Made Engineering Harder.

AI Made Writing Code Easier. It Made Engineering Harder.

AI accelerates code production but expands scope, raises expectations, and shifts the bottleneck from implementation to judgment. Engineers are doing 2x the work and feeling 10x the burnout.

EngineeringAICareer
March 2, 2026
7 min read
MinIO Is Dead. Here's What Your Infrastructure Team Should Do Next.

MinIO Is Dead. Here's What Your Infrastructure Team Should Do Next.

60,000 GitHub stars. One billion Docker pulls. Officially archived. MinIO's five-year wind-down from Apache 2.0 to AGPL to dead is the most dramatic open-source infrastructure collapse in years. Here's the migration playbook.

InfrastructureOpen SourceObject StorageArchitecture
March 1, 2026
7 min read
Building Production-Ready MCP Servers

Building Production-Ready MCP Servers

MCP servers are everywhere. Production-ready ones aren't. Here's the architecture I use after running MCP in real workloads: error boundaries, state isolation, security hardening, and scaling patterns that actually hold up.

MCPArchitectureSecurityNode.jsProduction
March 1, 2026
12 min read
Your Google API Keys Just Became Gemini Credentials (And Nobody Told You)

Your Google API Keys Just Became Gemini Credentials (And Nobody Told You)

Google told developers API keys aren't secrets. Then Gemini changed the rules. Truffle Security found 2,863 live keys on public websites that now access private Gemini endpoints, including keys belonging to Google itself. The attack is a single curl command.

SecurityGoogle CloudAIAPI Security
February 28, 2026
7 min read
What Claude Code Actually Chooses (And Why Tool Vendors Should Pay Attention)

What Claude Code Actually Chooses (And Why Tool Vendors Should Pay Attention)

Amplifying.ai ran 2,430 prompts against Claude Code and found it builds custom solutions in 12 of 20 categories. The tools it picks are becoming the default stack for a growing share of new projects.

AIDeveloper ToolsArchitecture
February 27, 2026
6 min read
Cloudflare Just Replaced Next.js in a Week. Kind Of.

Cloudflare Just Replaced Next.js in a Week. Kind Of.

One engineer, $1,100 in tokens, and 94% API coverage. Vinext is either the future of framework development or the most impressive demo that will never matter. I think it's both.

Next.jsCloudflareWeb PerformanceAI
February 26, 2026
6 min read
MCP Server Benchmarks Are Asking the Wrong Question

MCP Server Benchmarks Are Asking the Wrong Question

3.9 million requests across Java, Go, Node.js, and Python. Go wins on memory, Java on latency. But after running MCP servers in production for months, I think the benchmark misses what actually matters.

MCPPerformanceArchitectureGoNode.js
February 26, 2026
8 min read
Software engineers aren't going extinct. The job just got harder.

Software engineers aren't going extinct. The job just got harder.

Boris Cherny says the software engineer title disappears in 2026. He's wrong about the title, right about the shift. Here's what 9 years of production engineering taught me about surviving it.

AIEngineeringCareerSoftware EngineeringHot Take
February 26, 2026
6 min read
Durable Objects: The Primitive AWS Doesn't Have

Durable Objects: The Primitive AWS Doesn't Have

Cloudflare's Durable Objects give you single-threaded, globally unique compute with embedded SQLite. AWS has no equivalent. Here's how they change backend architecture.

CloudflareServerlessArchitectureEdge Computing
February 25, 2026
8 min read
Lambda Durable Functions Are Not Step Functions Replacements

Lambda Durable Functions Are Not Step Functions Replacements

AWS Lambda Durable Functions look like Step Functions killers. They're not. Here's when each one wins, what the checkpoint-and-replay model actually costs, and the architectural patterns I'd use in production.

AWSServerlessLambdaArchitectureCloud
February 25, 2026
8 min read

Firefox 148's setHTML: The XSS Protection We Should Have Had Years Ago

Firefox just shipped setHTML in version 148, replacing the notorious innerHTML with something that actually sanitizes by default. Here's why this matters and what it means for your security posture.

SecurityWeb DevelopmentJavaScriptXSS
February 24, 2026
3 min read

Ladybird Just Picked Rust. Here's Why That Changes Everything.

When an independent browser engine switches from C++ to Rust mid-flight, it's not just a language choice. It's a bet on maintenance burden, contributor velocity, and long-term survival.

RustBrowsersArchitectureEngineeringOpen Source
February 23, 2026
4 min read

AI Can't Audit Your Binaries Yet

The best AI model finds 49% of backdoors in compiled binaries. With a 22% false positive rate. Here's what that means for your supply chain security strategy.

SecurityAIEngineeringSupply ChainArchitecture
February 22, 2026
6 min read

The planning-execution split: why your AI coding workflow is probably wrong

Most developers are using AI assistants inefficiently. Here's how separating planning from execution can 10x your productivity.

AIProductivityEngineeringWorkflow
February 22, 2026
4 min read

Local AI Just Got Serious

GGML.ai joined Hugging Face this week, creating a complete stack for running AI locally. The assumption that AI requires the cloud is already obsolete—we're just waiting for everyone to notice.

AIInfrastructureOpen Source
February 21, 2026
3 min read

Your Dependency Bot Is a Liability

Dependabot opened thousands of PRs for a vulnerability that affected nobody. The real fix isn't more automation - it's smarter automation.

EngineeringSecurityDevOpsDependency ManagementGo
February 21, 2026
4 min read

Custom Silicon is Coming for Your Inference Stack

A startup just hit 17K tokens/sec on a single chip by hard-wiring Llama into silicon. The GPU monoculture in AI inference has an expiration date.

AIInfrastructureArchitectureHardware
February 20, 2026
5 min read

The hidden cost of 'right' decisions: what 4 years of infrastructure teaches about trade-offs

Every infrastructure decision is a bet on the future. After watching teams make the same mistakes across multiple startups, here's what actually matters when choosing your stack.

infrastructurearchitecturedevopscloud
February 20, 2026
6 min read

Anthropic said no to the Pentagon. Now they're a 'supply chain risk.'

The Pentagon wants AI labs to allow 'all lawful use' of their models. Anthropic pushed back. Now the DoD is threatening to blacklist them. Here's why engineers should care.

AIEthicsPolicyAnthropicEngineering
February 19, 2026
5 min read

Gemini 3.1 Can Solve Puzzles. It Still Can't Use a Screwdriver.

Google's Gemini 3.1 Pro just dropped with a 77% on ARC-AGI-2 - up from 31%. The benchmarks are staggering. But the people actually building with it keep saying the same thing: it can't call tools.

AILLMsGeminiEngineeringDeveloper Tools
February 19, 2026
4 min read

Sonnet Is the New Opus: Why Mid-Tier Models Keep Eating the Premium Tier

Claude Sonnet 4.6 just dropped and developers with early access prefer it over Opus 4.5. This isn't an accident. It's a pattern that should change how you pick models.

AIEngineeringLLMsAnthropicDeveloper Tools
February 18, 2026
4 min read

Your coding agent is a slot machine. You're already pulling the lever.

There's a new name for something engineers have been feeling for a year: token anxiety. The compulsive urge to always be prompting, always shipping. This is what that actually is.

AIEngineeringCoding AgentsDeveloper ToolsCulture
February 17, 2026
6 min read

Claude Code Hid the File Names. The Dev Community Noticed.

Anthropic collapsed Claude Code's file output in v2.1.20. Devs pushed back immediately — and they were right. This isn't a UX preference. It's about catching AI mistakes before they cost you.

AIDeveloper ToolsEngineeringClaudeLLMs
February 16, 2026
4 min read

When Building Is Free, Who Gets to Charge?

Apple's App Store got 557,000 new submissions last year, up 24%. Building an app went from a $50K project to a weekend. When development costs disappear, subscription pricing follows. The businesses that survive know exactly why.

AIBusinessSoftware EngineeringSaaSStrategy
February 15, 2026
4 min read

GPT-5.2 Proved Physicists Wrong. Here's Why Engineers Should Care.

OpenAI's GPT-5.2 conjectured a new formula in theoretical physics that humans missed for decades. A concrete data point on where AI reasoning actually stands.

AIResearchMachine LearningEngineeringOpenAI
February 14, 2026
6 min read

Depth vs. Speed: What This Week's AI Drops Tell You About the Next Two Years

Google and OpenAI both shipped major AI releases this week — one betting on deeper reasoning, one on faster inference. These aren't just product launches. They're two different theories about where the real bottleneck is.

AIEngineeringLLMsDeveloper ToolsGeminiOpenAI
February 13, 2026
4 min read

You're Blaming the Model. The Harness Did It.

Everyone's arguing GPT-5 vs Opus while the real bottleneck in LLM coding agents is something nobody talks about: the edit tool format.

AIEngineeringLLMsCoding AgentsDeveloper Tools
February 12, 2026
5 min read

Your Browser History Is For Sale: 287 Chrome Extensions Are Watching You

287 Chrome extensions with 37.4 million installs are quietly exfiltrating browsing history to data brokers. Here's what was found, who's behind it, and what you can do about it.

SecurityPrivacyChrome ExtensionsInfoSec
February 11, 2026
3 min read

The Death of the Writer, The Rise of the Editor

Anthropic's AI writes nearly 100% of their code, but Microsoft research shows devs miss 40% more bugs reviewing AI code. The essential skill of 2026 is code cynicism.

AISoftware EngineeringFuture of WorkCode Quality
February 10, 2026
2 min read

The Quiet Shift: How AI is Rewriting Software Economics in 2026

The cost of testing an idea has dropped to zero. In 2026, we don't build MVPs to test tech feasibility anymore. We build them to test market feasibility.

AISoftware EngineeringStrategyFuture of Work
February 9, 2026
2 min read

Why Silicon Brains Are Starting to Look Like Ours

A look at the shift from brute-force AI to bio-inspired efficiency and quantum computing breakthroughs.

AITechnologyQuantum ComputingFuture
February 8, 2026
2 min read

Building Scalable Next.js Apps

Learn the key principles and patterns I've used to build Next.js applications that scale to millions of users, with insights from real-world production systems.

Next.jsArchitecturePerformanceReact
February 7, 2026
4 min read

The Modern Stack: Why I Bet on Next.js 16, React 19, and Tailwind 4

A technical deep dive into the stack used for this portfolio. Highlighting React Server Components, Tailwind's new engine, and performance benefits.

Next.jsReactTailwind CSS
February 7, 2026
2 min read

Get new posts in your inbox

Architecture, performance, security. No spam.