Daily Tech Feed: From the Labs

The Bitter Lesson



Show Notes — The Bitter Lesson (DTFFTL-0027)
Why it matters.

Rich Sutton published a 1,200-word essay in 2019, and it was largely dismissed. Then the past five years vindicated every word of it. Now Sutton is making a second claim: that the LLM paradigm (train in a lab, freeze weights, deploy) is structurally limited in the same way as the knowledge-based approaches he criticized. He and John Carmack are building the alternative at Keen Technologies, targeting a genuine AGI prototype by 2030. Whether they succeed or fail, the argument deserves serious examination. The track record says so.

Primary Sources
  • The Bitter Lesson - Rich Sutton (2019) - The original essay. 1,200 words. Read it.
  • Rich Sutton's homepage - University of Alberta - Papers, essays, and research archive
  • Reinforcement Learning: An Introduction - Sutton & Barto (2nd ed., 2018) - Free online; the canonical RL textbook
  • Sutton's 2025 talk: Toward Greater Generality and Autonomy in AI - Post-LLM critique, continual learning, the training/deployment divide
  • Turing Award 2024: Sutton & Barto citation - ACM announcement for RL foundations

Keen Technologies
  • Keen Technologies - company site - Carmack's AGI startup
  • Carmack leaves Meta announcement - Dec 2022 - Carmack's Facebook post explaining his departure
  • Carmack / Sutton partnership announcement - Sep 2023 - Carmack tweet on joining forces with Sutton
  • Keen Technologies $20M seed round coverage - TechCrunch on investors and goals
  • Carmack on AGI: Lex Fridman podcast #309 - Long-form on leaving Meta, AGI approach, and timelines

Historical Cases from the Essay
  • Deep Blue vs. Kasparov - IBM research archive - The 1997 match and the brute-force vs. knowledge debate
  • AlphaGo paper - Silver et al., Nature 2016 - "Mastering the game of Go with deep neural networks and tree search"
  • Deep learning vs. handcrafted features in speech - Hinton et al., 2012 - "Deep Neural Networks for Acoustic Modeling in Speech Recognition"
  • ImageNet and the end of handcrafted vision features - Krizhevsky et al., 2012 - AlexNet. The paper that broke the SIFT era.

Reinforcement Learning Foundations
  • Temporal Difference Learning - Sutton (1988) - TD learning paper; the foundation of the value function approach
  • Policy gradient methods - Sutton et al. (1999) - Foundations of modern deep RL
  • World models - Ha & Schmidhuber (2018) - Agents that model their environment and plan inside the model

LLM Critique Context
  • RLHF paper - Christiano et al. (2017) - "Deep Reinforcement Learning from Human Preferences" - the RLHF origin paper
  • Limitations of LLMs as reasoners - Marcus & Davis (2019) - Pre-GPT-4 critique; useful historical context

By Daily Tech Feed