Words
Writing
Essays and posts on agents, retrieval, and research, gathered from Medium, the Sourcegraph blog, and here.
Essays
- I used two multi-agent pipelines for everything I built this week. Here's what happened.
Running two multi-agent workflows — one for research and planning, one for execution and review — across a week of software projects, and what actually held up.
- Applying creativity research to agentic workflows
Borrowing divergent-thinking techniques from creativity research to push agent workflows toward structurally diverse solutions instead of the first obvious one.
- Why your coding agent keeps failing in ways you can't predict
Coding agents fail unpredictably because they are a complex system layered on another complex system — the codebase — and prompt engineering alone can't tame the emergent failures.
- Why "Agent Advocate" exists
On the Agent Advocate role at Sourcegraph — advocating for the infrastructure that lets coding agents work reliably in production, not just better prompts.
- I couldn't find a good enough benchmark for large-scale software development, so I built one
Building CodeScaleBench — task design, verification, QA, and the infrastructure orchestration behind a benchmark for coding agents on large, multi-repo codebases.
- Rethinking coding agent benchmarks
Where today's coding-agent benchmarks fall short, and how to measure agent performance with stronger context retrieval at enterprise scale.
- How I use Amp (after 4 months and 6000 threads)
A working Amp practice after four months of heavy use — planning, context management, and verifying code across many projects.
- How I code with Amp on my phone
A secure iPhone-to-Mac setup with Blink Shell and Tailscale to run the Amp CLI and manage coding projects from a phone.
Engineering blog
- How we're using Sourcegraph and a Slack bot to detect vulnerabilities and react quickly
An automated Slack bot that triages security advisories, generates detection queries, and produces content artifacts to speed up vulnerability response.
- Why coding agents fail in large codebases (and what to do about it)
Five repeatable failure patterns for coding agents in large codebases — and why the fix is retrieval and code-search infrastructure, not just better models.
- What it actually takes to run code intelligence in-house
The real engineering requirements and costs of building an internal code-intelligence platform equivalent to a commercial offering.
- Detecting supply chain attacks at scale with Deep Search
Using Deep Search and Code Search to find repositories exposed to poisoned LiteLLM packages, separating version-pinned code from genuinely at-risk dependencies.
- CodeScaleBench: Testing coding agents on large codebases and multi-repo software engineering tasks
A benchmark for evaluating how coding agents perform on enterprise-scale, multi-repo software development tasks across the full SDLC.
- Episode III: Revenge of the React vulnerabilities
New high-severity React Server Components vulnerabilities surfacing after incomplete initial patches — denial of service and source-code exposure — with remediation guidance.
- Fixing the React2Shell vulnerability in large and complex enterprise codebases (part 2)
Automating remediation of CVE-2025-55182 across many repositories with Batch Changes, the MCP server, and Deep Search — applying patches, tracking progress, and validating fixes at scale.
Guides & ebooks
- Running coding agents in enterprise codebases
An operational guide to deploying AI coding agents in large enterprise environments — context engineering, code retrieval, review pipelines, and human oversight.
- The Sourcegraph guide to surviving Big Code
Why universal code search across every repo, branch, and code host — on a unified platform — is essential to stay productive as codebases grow.
- CodeScaleBench: Benchmarking AI coding agents on real-world, large-scale codebases
A report on how AI coding agents perform against enterprise-scale codebases, finding that code understanding and context-retrieval quality are the primary drivers of success.