Block’s BuilderBot Ships 1,500 Pull Requests a Week — 15% of All Production Code Is Now AI-Written

Block (Square, Cash App) just rolled out BuilderBot — an AI agent that autonomously writes code, opens pull requests, and monitors CI across Block’s entire codebase. It’s already responsible for 15% of all production code changes. This isn’t a demo. It’s Harness Theory in production at enterprise scale.

BuilderBot — Production Metrics

200K+

Operations daily

1,500

PRs merged weekly

15%

Of all production code changes

Months → Days

Feature delivery acceleration

How It Works

An engineer tags @builderbot in Slack with a task description. The agent then autonomously:

1

Researches requirements and plans implementation

2

Writes code across the entire codebase — an engineer on Cash App can modify Square services they’ve never touched

3

Creates branches, opens PRs, monitors CI

4

Iterates based on feedback — picks up tickets from Linear and Jira

The key differentiator: BuilderBot understands Block’s complete technical context — every service, API, and coding convention. It’s not a generic code assistant. It’s an agent with institutional knowledge of the company.

This is Harness Theory at scale. The human frames (Slack message with task description). The agent executes (research → code → PR → CI → iterate). 70/80 split — exactly what Anthropic’s 400K-session data showed. Block just proved it works at 200K operations per day.

The MCP + Goose Connection

BuilderBot is built on Goose — Block’s open-source AI agent framework, contributed to the Linux Foundation’s Agentic AI Foundation. Block also co-developed the Model Context Protocol (MCP) with Anthropic.

This matters because MCP is becoming the standard interface between agents and tools — and Block is one of its architects. BuilderBot isn’t just using the harness layer. It’s building the infrastructure that the harness layer runs on.

The Agent Week — Updated

Salesforce → Fin ($3.6B, customer agent)

OpenAI → Partner Network ($150M, 300K consultants)

SpaceX → Cursor ($60B, developer agent)

Microsoft → Copilot Cowork ($0.01/task, enterprise agent)

Pinterest → Ask Pinterest (shopping agent)

Databricks → Genie One (enterprise data agent)

Block → BuilderBot (code agent, 15% of production code)

Seven agent launches in five days. The harness layer isn’t emerging. It’s arrived.

Business Engineer

My Life in the Harness + The Dynamo Doctrine

The seven harness principles. The primitive inversion. The 70/80 split between framing and execution — now validated by Block’s 200K daily operations.

Read: Life in the Harness →

The Bottom Line

15% of all production code at a major fintech company is now written by an AI agent. 1,500 PRs merged per week. Tasks that took months take days. And it started with a Slack message. This is what the harness looks like when it’s not a framework — it’s a shipping machine. The 70/80 split is real. The human frames. The agent ships.

Source: Block

Scroll to Top

Discover more from FourWeekMBA

Subscribe now to keep reading and get access to the full archive.

Continue reading

FourWeekMBA