Blog

engineering-leadershipproductivitycareermetrics

The Productivity Panic Is Coming for Senior Engineers Next

Three waves of productivity metrics. Story points, DORA, AI output. Each one optimized for a proxy until it broke. Each one made senior engineers less visible.

May 2, 2026

open-sourceai-code-generationlicensingsoftware-law

Someone Used Claude to Rewrite a Library and Strip Its License. The Legal Question Applies to All of Us.

chardet has 130 million monthly downloads. A developer used Claude to rewrite it in five days and switched the license from LGPL to MIT. The original author came back after 15 years to object. I read the issue and realized I do this every day.

May 1, 2026

ai-agentsdeveloper-productivityplanningclaude-code

I Spent 8 Hours Planning with an AI. The Code Took 75 Minutes.

Everyone says AI makes coding faster. Nobody publishes the actual hours. My planning-to-execution ratio was 6.4:1. That ratio is why the overnight build worked.

April 30, 2026

ai-agentsclaude-codeautonomous-agentsdeveloper-experience

The Flag Is Called --dangerously-skip-permissions. I Run It Every Night.

Overnight autonomous agents are real, productive, and running on a flag with 'dangerously' in the name. Nobody has solved the trust problem. We've agreed to skip it.

April 29, 2026

ai-agentsbullmqinfrastructureouija

Why My Agent Pipeline Still Runs on BullMQ

Vercel Workflows ships crash recovery, step isolation, and durable state for agents. My pipeline uses BullMQ on a $7 VPS instead. Here's the trade-off.

April 28, 2026

ai-agentsmemory-systemsengramanthropic

Anthropic Shipped Agent Memory to Production While I Was Still Debugging Mine

Anthropic launched persistent memory for Claude Managed Agents. I've been building my own memory engine for months. Here's what their version solves, what it doesn't, and why the hard half of agent memory isn't storage.

April 27, 2026

aisoftware-engineeringdeveloper-toolsagents

Context Engineering Replaced Prompt Engineering and Nobody Noticed

I've been doing context engineering for months without calling it that. A 547-line CLAUDE.md, subagent isolation, strategic compaction, six MCP servers. The term just caught up to the practice.

April 26, 2026

aisoftware-engineeringagentsobservability

AI Agents Don't Crash. They Succeed at the Wrong Thing.

78% of AI failures are invisible. But the research only covers wrong outputs. The failure mode nobody's monitoring for is correct output at the wrong time, in the wrong scope, for the wrong reason.

April 25, 2026

aisoftware-engineeringcode-reviewdeveloper-tools

I Asked My AI to Fix One Line. It Rewrote the Entire Function.

GPT-5.4 scores 0.395 on edit faithfulness while Claude Opus 4.6 scores 0.060. Over-editing isn't a bug. It's a training incentive. Models are rewarded for solving the task, not for touching the least code.

April 24, 2026

aideveloper-toolssoftware-engineeringindustry

SpaceX Just Paid $60 Billion for an IDE. The Coding Tool War Is Over.

SpaceX secured the right to acquire Cursor for $60B. That's more than GitHub, Slack, and Figma combined. But the real story isn't the price tag. It's what happens when your coding tool runs Grok instead of Claude.

April 23, 2026

aideveloper-experiencesoftware-engineeringproductivity

I Use AI to Write Code 10 Hours a Day. Vibe Coding Is Still a Terrible Idea.

The METR study found experienced developers are 19% slower with AI tools while believing they're 20% faster. I've lived that 39-point perception gap. Here are three failures that proved it.

April 22, 2026

ai-agentscode-reviewdeveloper-toolingverification

GitHub Is Using GPT to Review Claude's Work. That's Either Brilliant or the Most Expensive Code Review Ever.

GitHub's Rubber Duck ships GPT-5.4 as a reviewer for Claude Sonnet's code. The cross-model pattern is real, backed by ICLR 2026 research. But 'second opinion' is the wrong frame. The hardest agent failures need structured verification, not another model guessing.

April 21, 2026

AI InfrastructureCI/CDLLMsDeveloper ToolingSupply ChainClaude Code

Your CI Pipeline Depends on a Model That Ships Breaking Changes Without a Changelog

Opus 4.7 shipped Tuesday. It removed temperature, killed budget_tokens, changed the tokenizer by 35%, and shifted how agents spawn subprocesses. My pipeline didn't break. I also didn't test for it. Neither did you.

April 20, 2026

GraphRAGRAGKnowledge GraphsAI InfrastructureProduction Systems

GraphRAG Pilots Succeed. Production Deployments Fail Quietly.

Entity resolution errors compound exponentially. Graph decay runs 15-20% per quarter. Gradient Flow says they barely know of any production deployments offering real business value. The most hyped retrieval pattern of 2026 has a production problem nobody wants to own.

April 19, 2026

AI AgentsCode ReviewEngramKnowledge GraphsDeveloper ToolingMCP

I'm Wiring Graph Memory Into Code Review. Here's What Vectors Miss.

Your AI code reviewer gives the same feedback your team rejected three weeks ago. It can't know. I'm building the fix: two graphs, one structural and one cognitive, wired together through spreading activation.

April 18, 2026

AI AgentsGovernanceSecurityProduction Engineering

Your Company Already Has AI Agents. You Just Don't Govern Them Yet.

The most dangerous AI agent in your org isn't the one leadership is planning to deploy. It's the one a developer shipped last quarter with operator-level permissions and no review process.

April 17, 2026

AI AgentsClaude CodeDeveloper ToolingAutomation

I Told My Agent Not to Do That. It Did It Anyway.

My CLAUDE.md said 'NEVER publish without internal links.' The agent published with zero. The fix wasn't better rules. It was structural enforcement: eval harnesses, separate verifiers, and hooks that don't ask permission.

April 16, 2026

AI AgentsDistributed SystemsTypeScriptArchitecture

Multi-Agent Development Is a Distributed Systems Problem. I Learned This the Hard Way.

I built a multi-agent pipeline with BullMQ, hit every distributed systems failure in the book, and learned most tasks don't need multi-agent.

April 15, 2026

AIPrivacyMemory SystemsEncryption

I Decrypted My Own Notes to Feed an AI. Privacy and Memory Can't Coexist Yet.

I set up end-to-end encryption to protect my Obsidian notes from everyone. Then I wrote 814 lines of TypeScript to decrypt them on the server and pipe them into an AI memory engine. I am the threat model I was protecting against.

April 14, 2026

TypeScriptDeveloper ToolingGoCompilers

TypeScript's Compiler Is Being Rewritten in Go. I Have Mixed Feelings.

The 10x speed promise is real for VS Code-scale projects. For my 7-package monorepo it's more like 3x. And nobody's talking about what we lose: the plugin API that Angular, Vue, and hundreds of tools depend on has no replacement timeline.

April 12, 2026

MCPAI AgentsDeveloper ToolingInfrastructure

I Run Six MCP Servers Daily. Here's What Breaks.

MCP won the standard war. But running six servers in production every day exposes failure modes no demo will show you: context bloat, silent auth failures, and tool selection that falls apart at scale.

April 10, 2026

Developer ToolsClaude CodeAI AgentsTerminal

The Quiet Death of the IDE: Why 46% of Developers Fell in Love with a Terminal

The Pragmatic Engineer's 2026 survey says Claude Code is the most loved AI dev tool at 46%. Cursor sits at 19%. Copilot at 9%. I switched six months ago. The terminal won, and it wasn't even close.

April 9, 2026

AIMemory SystemsOpen SourceAI Agents

I Compared Three AI Memory Systems. They Can't Even Agree on What Memory Means.

SimpleMem compresses conversations into atoms. MemPalace stores every word in a spatial hierarchy. Engram forgets on purpose. After a week with all three, I think they're solving different problems.

April 8, 2026

AIMemory SystemsOpen SourceAI AgentsAutomation

I Closed the AI Agent Loop. They Stopped Making the Same Mistakes.

My AI agent applied seven patches to the same bug. Each one a fresh attempt. No accumulation. After connecting dispatch with memory, the same class of bug gets caught on the first try.

April 5, 2026

AIMemory SystemsOpen SourceAI Agents

I Built an AI Memory System That Forgets on Purpose. It Remembers Better Than Yours.

My AI agent wrote a perfect architecture spec for dual-storage memory. Then it ignored the whole thing and built a flat table. Seven patches later, I threw it all away and built Engram from neuroscience papers instead.

April 5, 2026

AIDevOpsOpen SourceAI AgentsAutomation

AI Agents Can Write Code. Nobody Is Managing Them. I Built the Missing Layer.

Two of my AI agents opened conflicting PRs on the same repo. I caught it at 11 PM. That's when I realized the control plane was already sitting on my screen: the kanban board.

April 5, 2026

AIDeveloper ToolsOpen Source

oh-my-codex Hit 3K Stars in 24 Hours. I Run an Agent Orchestrator. Here's Why This Matters.

The viral growth of oh-my-codex isn't about Codex. It's about who owns the orchestration layer. While Anthropic locks down, the community is building the real platform.

April 4, 2026

Cloud InfrastructureAzureEngineering Leadership

173 Agents, Zero Owners: Azure's Structural Rot

A former Azure Core engineer just published the most detailed infrastructure failure account since the Knight Capital postmortem. 173 unowned agents, 500K monthly crashes, and a trillion dollars gone.

April 3, 2026

AIDeveloper ToolsEngineeringClaude

I Analyzed Claude Code 4 Times From the Outside. The Source Leak Proved Me Right and Wrong.

512,000 lines of TypeScript leaked from a missing .npmignore. The architecture I reverse-engineered was spot-on. The implementation quality made me rethink what 'good code' means at $2.5B ARR.

April 2, 2026

SecuritynpmSupply ChainNode.js

I Audited My Lockfile After the Axios Compromise. You Should Too.

Someone hijacked an Axios maintainer's npm account and published two versions with a RAT that deletes itself after install. 50 million weekly downloads. The dropper leaves no trace. Here's exactly what happened and what to check.

March 31, 2026

AISoftware EngineeringAI AgentsRLHF

My AI Agent Wrote the Spec, Then Ignored It. Here's the Gradient That Explains Why.

I watched an AI coding agent write a perfect architecture doc, then ship 15 patches that violated every decision in it. When I confronted it, the explanation was more honest than anything I've read about RLHF.

March 29, 2026

The $7 VPS Running an AI Agent That Proves Infrastructure Minimalism Works

George Larson put an AI agent on a $7/month VPS. It handles real traffic with a 678 KB binary and 1 MB of RAM. The 'AI needs scale' narrative just broke.

InfrastructureAICost

March 27, 2026

AIMachine LearningArchitectureDeveloper Tools

Karpathy's 630 Lines Won't Replace Researchers. They'll Replace Research.

AutoResearch got 56,000 stars for the wrong reason. Everyone focused on the AI agent. The real engineering is the four constraints that make unsupervised velocity safe.

March 26, 2026

Ruby on RailsAI AgentsFramework ArchitectureDeveloper Productivity

PROMPT to IPO: Why Rails Won the Agent Framework War Nobody Knew Was Happening

DHH rewrote the Rails homepage to target AI agents instead of developers. The data says he's onto something, even if Rails isn't the most token-efficient framework in benchmarks.

March 25, 2026

AIPerformanceArchitectureOn-Device ML

Your Phone's SSD Is the New VRAM

A 397B parameter model running on 12GB of RAM. The trick isn't new ML theory. It's demand paging, the same architecture pattern we've used since the 1960s.

March 24, 2026

AI SecurityDevSecOpsCoding AgentsApplication Security

Cursor Just Open-Sourced the Agents That Review AI-Generated Code

Cursor's PR velocity went up 5x. DryRun says 87% of AI PRs have vulnerabilities. The solution? More agents. Four autonomous security agents review every PR, scan for forgotten vulns, and auto-patch dependencies. Templates are public. The meta-layer is here.

March 23, 2026

Next.jsAI AgentsDeveloper ToolsWeb Development

Next.js 16.2 Isn't a Framework Update. It's an Agent Platform.

Next.js 16.2 shipped AGENTS.md by default, bundled docs in node_modules, browser logs piped to terminal, and a CLI that gives agents DevTools via shell commands. Vercel isn't improving DX. They're building for a new user: the coding agent.

March 22, 2026

Tomorrow Is Eid. Here's What That Actually Means.

A software engineer's window into Eid al-Fitr. Not a lecture on Islam. Just what tonight and tomorrow actually look like from the inside.

CulturePersonalEid

March 20, 2026

Ollama Just Became an OpenClaw Provider

Ollama 0.18 shipped with native OpenClaw integration. Local models now get tool calling, multi-agent workflows, and permission boundaries. No API costs, no data leaving your network.

AIArchitectureDevOps

March 19, 2026

AICode ReviewEngineering ManagementTechnical Debt

Verification Debt: The Hidden Org Cost of AI-Generated Code

Amazon.com went down for six hours because of AI-assisted code changes. A week later, they required senior engineer sign-offs. LinearB analyzed 8.1 million pull requests and found AI code waits 4.6x longer for review and ships 19% slower. The productivity gains were a mirage.

March 17, 2026

12 min read

I Chose DynamoDB for a Social App. Here's Why I'd Undo It.

I picked DynamoDB through Amplify for a professional networking app with swipeable cards. The first two weeks were magic. Then we needed queries DynamoDB was never built to answer.

ArchitectureAWSDynamoDBPostgres

March 15, 2026

MCPAIArchitectureDeveloper Tools

The MCP vs CLI Debate Is Missing the Point

Everyone's arguing whether AI agents should use MCP or CLI tools. The answer depends on a question nobody's asking: does the model already know how to use the tool, or did your team build it last Tuesday?

March 14, 2026

GitHub Built a Threat Model for Coding Agents. It's Missing a Layer.

GitHub published the most sophisticated platform security for AI agents I've seen. Isolation, token quarantine, constrained outputs, audit trails. It doesn't stop the attacks that actually happened this month.

SecurityAICI/CDGitHub

March 13, 2026

SecurityMCPAI SafetyOpen Source

I Built a Prompt Injection Firewall for MCP Servers

MCP servers have no input sanitization layer. Every JSON-RPC request flows straight from AI client to tool server, unfiltered. So I built one.

March 12, 2026

McKinsey's AI Got Hacked by an AI. The Vulnerability Was From 1998.

An autonomous AI agent breached McKinsey's internal AI platform in two hours. No credentials. No insider access. The entry point was SQL injection through JSON field names, a bug class older than most junior developers.

SecurityAIEnterprise

March 11, 2026

SecurityAICI/CDSupply Chain

A GitHub Issue Title Compromised 4,000 Developer Machines

Someone put a prompt injection payload in a GitHub issue title. An AI triage bot executed it, poisoned the build cache, stole npm credentials, and pushed a rogue package to 4,000 developers. The full chain took five steps.

March 10, 2026

AI AgentsMCPAgent SkillsProduction Engineering

Skills, MCP, and the Orchestration Gap Nobody's Fixing

Agent skills became an open standard. MCP connects everything. But the layer between them, the one that keeps agents from failing catastrophically in production, barely exists.

March 8, 2026

SEOWeb DevelopmentEngineeringNext.js

I Had Zero Pages Indexed for Three Months. Here's the One-Line Fix.

A canonical URL mismatch between www and non-www kept my entire blog invisible to Google for three months. Six files, twelve line changes, and a sitemap resubmission fixed it. Here's how to check yours.

March 4, 2026

AIDeveloper ToolsEngineeringClaude

Inside Claude Code's Context Machine

Claude Code manages your context through three systems: microcompaction, auto-compaction, and structured rehydration. Here's how the machinery actually works, and why most developers burn tokens without realizing it.

March 3, 2026

AI Made Writing Code Easier. It Made Engineering Harder.

AI accelerates code production but expands scope, raises expectations, and shifts the bottleneck from implementation to judgment. Engineers are doing 2x the work and feeling 10x the burnout.

EngineeringAICareer

March 2, 2026

InfrastructureOpen SourceObject StorageArchitecture

MinIO Is Dead. Here's What Your Infrastructure Team Should Do Next.

60,000 GitHub stars. One billion Docker pulls. Officially archived. MinIO's five-year wind-down from Apache 2.0 to AGPL to dead is the most dramatic open-source infrastructure collapse in years. Here's the migration playbook.

March 1, 2026

MCPArchitectureSecurityNode.jsProduction

Building Production-Ready MCP Servers

MCP servers are everywhere. Production-ready ones aren't. Here's the architecture I use after running MCP in real workloads: error boundaries, state isolation, security hardening, and scaling patterns that actually hold up.

March 1, 2026

12 min read

Your Google API Keys Just Became Gemini Credentials (And Nobody Told You)

Google told developers API keys aren't secrets. Then Gemini changed the rules. Truffle Security found 2,863 live keys on public websites that now access private Gemini endpoints, including keys belonging to Google itself. The attack is a single curl command.

SecurityGoogle CloudAIAPI Security

February 28, 2026

AIDeveloper ToolsArchitecture

What Claude Code Actually Chooses (And Why Tool Vendors Should Pay Attention)

Amplifying.ai ran 2,430 prompts against Claude Code and found it builds custom solutions in 12 of 20 categories. The tools it picks are becoming the default stack for a growing share of new projects.

February 27, 2026

Next.jsCloudflareWeb PerformanceAI

Cloudflare Just Replaced Next.js in a Week. Kind Of.

One engineer, $1,100 in tokens, and 94% API coverage. Vinext is either the future of framework development or the most impressive demo that will never matter. I think it's both.

February 26, 2026

MCPPerformanceArchitectureGoNode.js

MCP Server Benchmarks Are Asking the Wrong Question

3.9 million requests across Java, Go, Node.js, and Python. Go wins on memory, Java on latency. But after running MCP servers in production for months, I think the benchmark misses what actually matters.

February 26, 2026

AIEngineeringCareerSoftware EngineeringHot Take

Software engineers aren't going extinct. The job just got harder.

Boris Cherny says the software engineer title disappears in 2026. He's wrong about the title, right about the shift. Here's what 9 years of production engineering taught me about surviving it.

February 26, 2026

CloudflareServerlessArchitectureEdge Computing

Durable Objects: The Primitive AWS Doesn't Have

Cloudflare's Durable Objects give you single-threaded, globally unique compute with embedded SQLite. AWS has no equivalent. Here's how they change backend architecture.

February 25, 2026

AWSServerlessLambdaArchitectureCloud

Lambda Durable Functions Are Not Step Functions Replacements

AWS Lambda Durable Functions look like Step Functions killers. They're not. Here's when each one wins, what the checkpoint-and-replay model actually costs, and the architectural patterns I'd use in production.

February 25, 2026

SecurityWeb DevelopmentJavaScriptXSS

Firefox 148's setHTML: The XSS Protection We Should Have Had Years Ago

Firefox just shipped setHTML in version 148, replacing the notorious innerHTML with something that actually sanitizes by default. Here's why this matters and what it means for your security posture.

February 24, 2026

3 min read

Ladybird Just Picked Rust. Here's Why That Changes Everything.

When an independent browser engine switches from C++ to Rust mid-flight, it's not just a language choice. It's a bet on maintenance burden, contributor velocity, and long-term survival.

RustBrowsersArchitectureEngineeringOpen Source

February 23, 2026

SecurityAIEngineeringSupply ChainArchitecture

AI Can't Audit Your Binaries Yet

The best AI model finds 49% of backdoors in compiled binaries. With a 22% false positive rate. Here's what that means for your supply chain security strategy.

February 22, 2026

AIProductivityEngineeringWorkflow

The planning-execution split: why your AI coding workflow is probably wrong

Most developers are using AI assistants inefficiently. Here's how separating planning from execution can 10x your productivity.

February 22, 2026

AIInfrastructureOpen Source

Local AI Just Got Serious

GGML.ai joined Hugging Face this week, creating a complete stack for running AI locally. The assumption that AI requires the cloud is already obsolete—we're just waiting for everyone to notice.

February 21, 2026

3 min read

Your Dependency Bot Is a Liability

Dependabot opened thousands of PRs for a vulnerability that affected nobody. The real fix isn't more automation - it's smarter automation.

EngineeringSecurityDevOpsDependency ManagementGo

February 21, 2026

AIInfrastructureArchitectureHardware

Custom Silicon is Coming for Your Inference Stack

A startup just hit 17K tokens/sec on a single chip by hard-wiring Llama into silicon. The GPU monoculture in AI inference has an expiration date.

February 20, 2026

infrastructurearchitecturedevopscloud

The hidden cost of 'right' decisions: what 4 years of infrastructure teaches about trade-offs

Every infrastructure decision is a bet on the future. After watching teams make the same mistakes across multiple startups, here's what actually matters when choosing your stack.

February 20, 2026

AIEthicsPolicyAnthropicEngineering

Anthropic said no to the Pentagon. Now they're a 'supply chain risk.'

The Pentagon wants AI labs to allow 'all lawful use' of their models. Anthropic pushed back. Now the DoD is threatening to blacklist them. Here's why engineers should care.

February 19, 2026

AILLMsGeminiEngineeringDeveloper Tools

Gemini 3.1 Can Solve Puzzles. It Still Can't Use a Screwdriver.

Google's Gemini 3.1 Pro just dropped with a 77% on ARC-AGI-2 - up from 31%. The benchmarks are staggering. But the people actually building with it keep saying the same thing: it can't call tools.

February 19, 2026

AIEngineeringLLMsAnthropicDeveloper Tools

Sonnet Is the New Opus: Why Mid-Tier Models Keep Eating the Premium Tier

Claude Sonnet 4.6 just dropped and developers with early access prefer it over Opus 4.5. This isn't an accident. It's a pattern that should change how you pick models.

February 18, 2026

AIEngineeringCoding AgentsDeveloper ToolsCulture

Your coding agent is a slot machine. You're already pulling the lever.

There's a new name for something engineers have been feeling for a year: token anxiety. The compulsive urge to always be prompting, always shipping. This is what that actually is.

February 17, 2026

AIDeveloper ToolsEngineeringClaudeLLMs

Claude Code Hid the File Names. The Dev Community Noticed.

Anthropic collapsed Claude Code's file output in v2.1.20. Devs pushed back immediately — and they were right. This isn't a UX preference. It's about catching AI mistakes before they cost you.

February 16, 2026

AIBusinessSoftware EngineeringSaaSStrategy

When Building Is Free, Who Gets to Charge?

Apple's App Store got 557,000 new submissions last year, up 24%. Building an app went from a $50K project to a weekend. When development costs disappear, subscription pricing follows. The businesses that survive know exactly why.

February 15, 2026

AIResearchMachine LearningEngineeringOpenAI

GPT-5.2 Proved Physicists Wrong. Here's Why Engineers Should Care.

OpenAI's GPT-5.2 conjectured a new formula in theoretical physics that humans missed for decades. A concrete data point on where AI reasoning actually stands.

February 14, 2026

AIEngineeringLLMsDeveloper ToolsGeminiOpenAI

Depth vs. Speed: What This Week's AI Drops Tell You About the Next Two Years

Google and OpenAI both shipped major AI releases this week — one betting on deeper reasoning, one on faster inference. These aren't just product launches. They're two different theories about where the real bottleneck is.

February 13, 2026

AIEngineeringLLMsCoding AgentsDeveloper Tools

You're Blaming the Model. The Harness Did It.

Everyone's arguing GPT-5 vs Opus while the real bottleneck in LLM coding agents is something nobody talks about: the edit tool format.

February 12, 2026

SecurityPrivacyChrome ExtensionsInfoSec

Your Browser History Is For Sale: 287 Chrome Extensions Are Watching You

287 Chrome extensions with 37.4 million installs are quietly exfiltrating browsing history to data brokers. Here's what was found, who's behind it, and what you can do about it.

February 11, 2026

3 min read

The Death of the Writer, The Rise of the Editor

Anthropic's AI writes nearly 100% of their code, but Microsoft research shows devs miss 40% more bugs reviewing AI code. The essential skill of 2026 is code cynicism.

AISoftware EngineeringFuture of WorkCode Quality

February 10, 2026

AISoftware EngineeringStrategyFuture of Work

The Quiet Shift: How AI is Rewriting Software Economics in 2026

The cost of testing an idea has dropped to zero. In 2026, we don't build MVPs to test tech feasibility anymore. We build them to test market feasibility.

February 9, 2026

AITechnologyQuantum ComputingFuture

Why Silicon Brains Are Starting to Look Like Ours

A look at the shift from brute-force AI to bio-inspired efficiency and quantum computing breakthroughs.

February 8, 2026

Next.jsArchitecturePerformanceReact

Building Scalable Next.js Apps

Learn the key principles and patterns I've used to build Next.js applications that scale to millions of users, with insights from real-world production systems.

February 7, 2026

The Modern Stack: Why I Bet on Next.js 16, React 19, and Tailwind 4

A technical deep dive into the stack used for this portfolio. Highlighting React Server Components, Tailwind's new engine, and performance benefits.

Next.jsReactTailwind CSS

February 7, 2026