Block (Square, Cash App) just rolled out BuilderBot — an AI agent that autonomously writes code, opens pull requests, and monitors CI across Block’s entire codebase. It’s already responsible for 15% of all production code changes. This isn’t a demo. It’s Harness Theory in production at enterprise scale.
How It Works
An engineer tags @builderbot in Slack with a task description. The agent then autonomously:
Researches requirements and plans implementation
Writes code across the entire codebase — an engineer on Cash App can modify Square services they’ve never touched
Creates branches, opens PRs, monitors CI
Iterates based on feedback — picks up tickets from Linear and Jira
The key differentiator: BuilderBot understands Block’s complete technical context — every service, API, and coding convention. It’s not a generic code assistant. It’s an agent with institutional knowledge of the company.
This is Harness Theory at scale. The human frames (Slack message with task description). The agent executes (research → code → PR → CI → iterate). 70/80 split — exactly what Anthropic’s 400K-session data showed. Block just proved it works at 200K operations per day.
The MCP + Goose Connection
BuilderBot is built on Goose — Block’s open-source AI agent framework, contributed to the Linux Foundation’s Agentic AI Foundation. Block also co-developed the Model Context Protocol (MCP) with Anthropic.
This matters because MCP is becoming the standard interface between agents and tools — and Block is one of its architects. BuilderBot isn’t just using the harness layer. It’s building the infrastructure that the harness layer runs on.
The Agent Week — Updated
Salesforce → Fin ($3.6B, customer agent)
OpenAI → Partner Network ($150M, 300K consultants)
SpaceX → Cursor ($60B, developer agent)
Microsoft → Copilot Cowork ($0.01/task, enterprise agent)
Pinterest → Ask Pinterest (shopping agent)
Databricks → Genie One (enterprise data agent)
Block → BuilderBot (code agent, 15% of production code)
Seven agent launches in five days. The harness layer isn’t emerging. It’s arrived.
The Bottom Line
15% of all production code at a major fintech company is now written by an AI agent. 1,500 PRs merged per week. Tasks that took months take days. And it started with a Slack message. This is what the harness looks like when it’s not a framework — it’s a shipping machine. The 70/80 split is real. The human frames. The agent ships.
Source: Block





![This Week In AI Business [Week #3-2025] This Week In AI Business [Week #3-2025]](https://i0.wp.com/fourweekmba.com/wp-content/plugins/contextual-related-posts/default.png?resize=150%2C150&ssl=1)


