Long-Running Autonomy: How AI Agents Work for Hours and Days
Early agents handled one-shot tasks in minutes. By late 2025, agents were producing full feature sets for hours. This shift in autonomy duration changes the economics of entire categories of work.
Key Components
The Rakuten Case
Claude Code implemented a complex activation vector extraction method across a 12.5-million-line codebase in seven hours of autonomous work, achieving 99.9% numerical accuracy.
The Viability Expansion Effect
When agents work autonomously for extended periods, the viability threshold for projects drops dramatically:
Broader Translation
The principle: long-running autonomy doesn't just make existing work faster — it makes previously uneconomic work possible.
Real-World Examples
Anthropic
Key Insight
Claude Code implemented a complex activation vector extraction method across a 12.5-million-line codebase in seven hours of autonomous work, achieving 99.9% numerical accuracy. No human could maintain that level of precision across that codebase for that duration.
Exec Package + Claude OS Master Skill | Business Engineer Founding Plan
FourWeekMBA x Business Engineer | Updated 2026
Early agents handled one-shot tasks in minutes. By late 2025, agents were producing full feature sets for hours. This shift in autonomy duration changes the economics of entire categories of work.
The Rakuten Case
Claude Code implemented a complex activation vector extraction method across a 12.5-million-line codebase in seven hours of autonomous work, achieving 99.9% numerical accuracy. No human could maintain that level of precision across that codebase for that duration.
The Viability Expansion Effect
When agents work autonomously for extended periods, the viability threshold for projects drops dramatically:
Technical debt accumulated over years is systematically eliminated
Nice-to-have improvements that were always deprioritized become feasible
Exploratory projects that nobody had time for get executed
Anthropic found that ~27% of AI-assisted work consists of tasks that wouldn’t have been done otherwise — scaling — as explored in the emerging fifth paradigm of scaling — projects, building exploratory tools, fixing “papercuts” that improve quality of life.
Broader Translation
Marketing: Campaigns that required weeks of cross-team coordination become single-session efforts
Finance: Reconciliation processes that nobody had time to automate get addressed
Operations: Workflow optimizations that were always “too small to prioritize” get executed
The principle: long-running autonomy doesn’t just make existing work faster — it makes previously uneconomic work possible.
What is Long-Running Autonomy: How AI Agents Work for Hours and Days?
Early agents handled one-shot tasks in minutes. By late 2025, agents were producing full feature sets for hours. This shift in autonomy duration changes the economics of entire categories of work.
What is the rakuten case?
Claude Code implemented a complex activation vector extraction method across a 12.5-million-line codebase in seven hours of autonomous work, achieving 99.9% numerical accuracy. No human could maintain that level of precision across that codebase for that duration.
What is the viability expansion effect?
When agents work autonomously for extended periods, the viability threshold for projects drops dramatically:
What is Broader Translation?
The principle: long-running autonomy doesn't just make existing work faster — it makes previously uneconomic work possible.
Gennaro is the creator of FourWeekMBA, which reached about four million business people, comprising C-level executives, investors, analysts, product managers, and aspiring digital entrepreneurs in 2022 alone | He is also Director of Sales for a high-tech scaleup in the AI Industry | In 2012, Gennaro earned an International MBA with emphasis on Corporate Finance and Business Strategy.
Scroll to Top
Discover more from FourWeekMBA
Subscribe now to keep reading and get access to the full archive.