David's Saturday AI Thoughts

The bill and the harness


Listen Later

David builds the case that flat-rate AI pricing is dying and that the buyer's question is no longer 'how much will this cost' but 'where does the spending compound'. He opens at a Las Vegas buffet that closed on 31st May, then moves to the supplier-side news: three of the four biggest AI vendors switched pricing in the last few weeks (Anthropic stripped bundled tokens out of Enterprise seats in mid-April, OpenAI took Codex pay-as-you-go a fortnight earlier, GitHub moves every Copilot plan to usage-based billing on 1st June, and an Anthropic manager admitted Pro and Max tiers have been outgrown). He brings in two friends' worried voice notes from the buyer side: a friend in Tokyo asking what happens when bills go up five or ten times, and a partner at a professional services firm naming the outsourcing trap. He explains the supplier maths (unit prices falling roughly tenfold a year, his $200-a-month Max plan delivering $500 a day of equivalent API use, unsustainable) and the buyer maths (Jevons Paradox: cheaper energy made coal use rise, not fall). The radiologist is the modern Jevons: Hinton's 2016 'stop training radiologists' was right about the models and wrong about the radiologists. Ten years on the US has six thousand more of them and pay is up roughly seventy per cent. Punchline: the bill rises either way, the question is whether the spending compounds in the model (a utility cost) or in the harness, the layer of instructions, context and workflows that wraps the model (an asset nobody else can buy). Intercom doubled engineering velocity in nine months on exactly that bet.

What happened this week
  • AI adoption stalls one layer below the executive sponsor, at the line manager: Gallup data (Q4 2025) finds AI use correlates more strongly with managerial endorsement than with tool access. In firms where the manager actively supports AI, 80% of staff use it weekly; where they don't, that drops to 44%. In the public sector: 65% versus 37%. Procurement and licences are the easy part. Adoption lives or dies one layer below the top.
  • The frontier-model leaderboard is now refreshing in weeks, not quarters: The Epoch Capabilities Index now shows GPT-5.5 Pro and Gemini 3.1 Pro above 155, up from GPT-4o's 128 in mid-2024. Seventeen frontier releases compressed into under two years with no visible plateau. Epoch researcher Greg Burnham: 'I don't know when Opus 8.2 will be shipped, but GPT-9.1 will be shipped that afternoon.' Build workflows and judgment around capabilities, not named models.
  • Six VC firms, one investment thesis: Linas Beliunas read the published 2026 investment theses of six of the biggest venture firms side by side and found the same handful of AI bets in all of them: AI-native enterprise software, AI agents for physical and industrial work, vertical AI software in legal, finance, healthcare and construction, multi-agent orchestration, and health AI. His sharper conclusion: the most valuable software companies of the next decade won't look like software companies. They'll look like law firms, factories and hedge funds run by teams of ten.
  • What to try
    • Pick one tool, get fluent, then refine your harness: A leader David spoke to had spent weeks running the same task through ChatGPT and Claude side by side, then asking each to review the other. Genuinely interesting and occasionally useful, but they couldn't decide which to use when. The advice from someone who has been at this longer: pick one, become very good with it, learn its quirks, then build the layer above it that reflects how you actually work. Most of the value is in fluency with one tool, not coverage across two.
    • Force yourself to change something on every AI output before you ship it: Came up at a senior training session this week, as the room debated when the 'check, edit, own' model breaks down. Increasingly the edit feels optional because the output looks good. Whatever the AI gives you, a paragraph, a slide, a summary, add something, remove something, or reorder something before it leaves your hands. The rule of 'always change something' is a forcing function against brain rot.
    • Skip the slides, build the page: In a senior strategy session this week, the most-praised artefact in the room was not a deck. It was a web page someone had built to walk teams through their thinking. Building it took a clear thought, a text file, and a small Claude skill they had set up once. Their harness paying off in front of the room.
    • Read the full edition with all links and sources

      ...more
      View all episodesView all episodes
      Download on the App Store

      David's Saturday AI ThoughtsBy David Boyle