📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

The latest Google whitepaper emphasizes that AI models constitute only about 10% of system behavior. The majority of performance depends on harness design and context engineering, shifting the focus from model development to system configuration.

A new Google whitepaper titled The New SDLC With Vibe Coding states that the AI model accounts for only about 10% of the overall system behavior, emphasizing that harness design and context engineering are the primary drivers of effective AI systems. This shifts the industry focus away from developing larger models toward optimizing how models are integrated and controlled, which has significant implications for AI strategy and investment.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, highlights that in practice, most failures and inefficiencies in AI systems stem from configuration issues, missing tools, vague rules, and poor context management. Studies cited in the paper show that changing only the harness — including prompts, tools, and observability — can dramatically improve performance, even when using the same underlying model. For example, a team improved their coding agent’s ranking from outside the Top 30 to Top 5 by focusing solely on harness adjustments.

The authors argue that costs associated with AI are heavily influenced by token economy, with vibe coding (minimal structure, rapid prompts) appearing cheap initially but incurring high long-term costs due to inefficiency, maintenance, and security vulnerabilities. Conversely, disciplined engineering — involving schema design, testing, and structured context — offers a lower marginal cost per feature over time.

At a glance

reportWhen: published early 2026

The developmentA new Google whitepaper introduces a paradigm shift in AI development, highlighting that the model itself is only 10% of system performance, with the rest driven by harness and context engineering.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development and Investment Strategies

This shift means organizations should prioritize harness development, context management, and system configuration over solely focusing on acquiring or training larger models. It challenges the prevailing industry narrative that bigger models are the primary source of AI progress. Instead, it underscores that most of the value and reliability in AI applications comes from how systems are assembled and controlled, which can lead to more cost-effective and secure AI deployment.

MUCAR 892BT AI Bi-Directional OBD2 Scanner, Full System OBD2 Scanner Diagnostic Tool,35 Services,ECU Coding,FCA Autoauth,CANFD&DOIP,for Car Owners, DIYer, Technicians, Inspectors, Trainees and Others

Powerful Performance for busy days in the shop or your home garage: an 8-inch ultra-large display paired with…

As an affiliate, we earn on qualifying purchases.

Background on AI System Design and Industry Trends

Until early 2026, the industry largely equated AI progress with the development of larger, more powerful models. Companies invested heavily in training massive neural networks, expecting that size alone would yield better results. However, recent research and practical experiments, including those cited in the Google whitepaper, indicate that system configuration, prompt engineering, and context management are more critical to performance than raw model size. This realization is reshaping AI development priorities across the industry.

“The biggest shift in software engineering isn’t a new language or framework — it’s moving from writing code to expressing intent and trusting machines to implement it.”
— Addy Osmani

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

As an affiliate, we earn on qualifying purchases.

Unresolved Questions About Industry Adoption

While the whitepaper presents compelling evidence that harness and context engineering are crucial, it remains unclear how quickly organizations will adopt this paradigm shift at scale. Specific best practices for harness design are still emerging, and the long-term impact on AI research investments and model development strategies is yet to be fully understood.

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Practitioners and Leaders

Organizations should evaluate their current AI workflows, emphasizing system configuration, testing, and context management. Industry leaders are likely to invest more in developing robust harnesses and standards for context engineering. Further research and case studies are expected to clarify best practices and optimize cost-efficiency in AI deployment.

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

According to the whitepaper, most of the system’s performance depends on how the model is integrated, controlled, and guided through harness design and context management, rather than the model itself.

How does this shift affect AI development costs?

Focusing on harness and context engineering can reduce long-term costs by improving efficiency, security, and maintainability, despite higher upfront investment in system design.

Will this change industry-wide AI strategies?

Yes, many organizations are likely to reallocate resources from model training toward system configuration, testing, and infrastructure to maximize AI effectiveness and cost-efficiency.

What are the main challenges in adopting this new approach?

Developing expertise in harness design, establishing best practices for context engineering, and integrating these processes into existing workflows are key challenges that organizations will need to address.

Is bigger always better in AI models?

No, the whitepaper argues that size is less important than how the model is integrated and controlled within a system, with system design playing a more significant role in performance.

Source: ThorstenMeyerAI.com

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

Simple Mondays Team

Share article

The model is only 10%

Implications for AI Development and Investment Strategies

MUCAR 892BT AI Bi-Directional OBD2 Scanner, Full System OBD2 Scanner Diagnostic Tool,35 Services,ECU Coding,FCA Autoauth,CANFD&DOIP,for Car Owners, DIYer, Technicians, Inspectors, Trainees and Others

Background on AI System Design and Industry Trends

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

Unresolved Questions About Industry Adoption

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory

Next Steps for AI Practitioners and Leaders

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

Key Questions

Why is the model only 10% of the system’s behavior?

How does this shift affect AI development costs?

Will this change industry-wide AI strategies?

What are the main challenges in adopting this new approach?

Is bigger always better in AI models?

A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them

Why AI Will Be Central To Your Tech In 2026

The NVIDIA Earnings Preview: What Q1 FY27 Will Reveal About the AI Cycle

The deployment. How the AI labs verticallyintegrated into the serviceslayer — the Palantir modelat scale.

Track Competitor Prices To Optimize Your TikTok Shop Listings

Siemens’ Factory Floor Revolution Powered By AI

Licensed Summerween Collections

How to Strengthen Trust Through Small Behaviors

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

Simple Mondays Team

Share article

The model is only 10%

Implications for AI Development and Investment Strategies

MUCAR 892BT AI Bi-Directional OBD2 Scanner, Full System OBD2 Scanner Diagnostic Tool,35 Services,ECU Coding,FCA Autoauth,CANFD&DOIP,for Car Owners, DIYer, Technicians, Inspectors, Trainees and Others

Background on AI System Design and Industry Trends

AI-Powered Web Design Mastery: Harness the Power of Framer AI to Build, Customize, and Launch Stunning Websites—A Step-by-Step Guide for Beginners and Professionals

Unresolved Questions About Industry Adoption

AI Context Engineering: Architecting Intelligence Through Prompt Structures, Tools, and Memory

Next Steps for AI Practitioners and Leaders

Observability in the AI-Native Era: Leveraging AIOps to build, observe, and operate resilient systems

Key Questions

Why is the model only 10% of the system’s behavior?

How does this shift affect AI development costs?

Will this change industry-wide AI strategies?

What are the main challenges in adopting this new approach?

Is bigger always better in AI models?

You May Also Like