June 23, 2026

GLM 5.2 Clearly Explained (and how to set it up)

Listen Later

22 minutes

In this episode I sit down with Amir to get tactical about running local AI models as part of a daily workflow. We center on GLM 5.2 from ZAI, how it stacks up against frontier models like Opus 4.8, and how a fusion approach lets you sequence a heavy thinking model with a lighter execution model for the best output at the lowest cost. Amir walks through setup in Cursor and Codex via OpenRouter, shares real token-cost math, and demos GLM 5.2 refining a live app. By the end you will know how to start today, where local models shine, and how model chaining keeps spend in check.

Timestamps

00:00 – Intro

02:09 – GLM 5.2 and Z AI

04:01 – Specs: 1M context and Terminal Bench 2.1

05:22 – Making sense of benchmark scores

06:42 – Setup in Cursor or Codex with OpenRouter

10:18 – Local model upside: buy a machine, run tasks

11:42 – Token cost: 44 cents versus $2.38

13:36 – Future-proofing with an upfront hardware bet & The Uber subsidy analogy

16:49 – Model chaining and the vision workaround

19:23 – Token maxing vs routing tasks to the right model

20:54 – Answering the "cost is irrelevant" crowd

21:59 – Closing thoughts

Key Points

GLM 5.2 ships with a 1M-token context window and scores 81 on Terminal Bench 2.1, landing about four points behind Opus 4.8.
A fusion approach (a term OpenRouter coined) sequences models: plan with Opus, execute with GLM 5.2, review with Composer 2.5 or Codex 5.5.
Running GLM 5.2 in the cloud through OpenRouter costs roughly 44 cents for a task that runs about $2.38 on Opus 4.8 — close to a 5X saving.
You can start today with credit-based access: load $20 in OpenRouter and route tasks to the right model.
For images, Amir uses Opus 4.8 to read screenshots and describe them, then hands the layout to GLM 5.2 to act on.
Teams are shifting from token-maxing to output-maxing, making model governance and chaining the smart play

The #1 tool to find startup ideas/trends - https://www.ideabrowser.com

LCA helps Fortune 500s and fast-growing startups build their future - from Warner Music to Fortnite to Dropbox. We turn 'what if' into reality with AI, apps, and next-gen products https://latecheckout.agency/

The Vibe Marketer - Resources for people into vibe marketing/marketing with AI: https://www.thevibemarketer.com/

FIND ME ON SOCIAL

X/Twitter: https://twitter.com/gregisenberg

Instagram: https://instagram.com/gregisenberg/

LinkedIn: https://www.linkedin.com/in/gisenberg/

FIND AMIR ON SOCIAL

Humblytics: https://humblytics.com/?via=community

X/Twitter: https://x.com/amirmxt

Youtube: https://www.youtube.com/@amirmxt

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

The Startup Ideas Podcast

By Greg Isenberg

4.7

204204 ratings

June 23, 2026

GLM 5.2 Clearly Explained (and how to set it up)

Listen Later

22 minutes

In this episode I sit down with Amir to get tactical about running local AI models as part of a daily workflow. We center on GLM 5.2 from ZAI, how it stacks up against frontier models like Opus 4.8, and how a fusion approach lets you sequence a heavy thinking model with a lighter execution model for the best output at the lowest cost. Amir walks through setup in Cursor and Codex via OpenRouter, shares real token-cost math, and demos GLM 5.2 refining a live app. By the end you will know how to start today, where local models shine, and how model chaining keeps spend in check.

Timestamps

00:00 – Intro

02:09 – GLM 5.2 and Z AI

04:01 – Specs: 1M context and Terminal Bench 2.1

05:22 – Making sense of benchmark scores

06:42 – Setup in Cursor or Codex with OpenRouter

10:18 – Local model upside: buy a machine, run tasks

11:42 – Token cost: 44 cents versus $2.38

13:36 – Future-proofing with an upfront hardware bet & The Uber subsidy analogy

16:49 – Model chaining and the vision workaround

19:23 – Token maxing vs routing tasks to the right model

20:54 – Answering the "cost is irrelevant" crowd

21:59 – Closing thoughts

Key Points

GLM 5.2 ships with a 1M-token context window and scores 81 on Terminal Bench 2.1, landing about four points behind Opus 4.8.
A fusion approach (a term OpenRouter coined) sequences models: plan with Opus, execute with GLM 5.2, review with Composer 2.5 or Codex 5.5.
Running GLM 5.2 in the cloud through OpenRouter costs roughly 44 cents for a task that runs about $2.38 on Opus 4.8 — close to a 5X saving.
You can start today with credit-based access: load $20 in OpenRouter and route tasks to the right model.
For images, Amir uses Opus 4.8 to read screenshots and describe them, then hands the layout to GLM 5.2 to act on.
Teams are shifting from token-maxing to output-maxing, making model governance and chaining the smart play

The #1 tool to find startup ideas/trends - https://www.ideabrowser.com

LCA helps Fortune 500s and fast-growing startups build their future - from Warner Music to Fortnite to Dropbox. We turn 'what if' into reality with AI, apps, and next-gen products https://latecheckout.agency/

The Vibe Marketer - Resources for people into vibe marketing/marketing with AI: https://www.thevibemarketer.com/

FIND ME ON SOCIAL

X/Twitter: https://twitter.com/gregisenberg

Instagram: https://instagram.com/gregisenberg/

LinkedIn: https://www.linkedin.com/in/gisenberg/

FIND AMIR ON SOCIAL

Humblytics: https://humblytics.com/?via=community

X/Twitter: https://x.com/amirmxt

Youtube: https://www.youtube.com/@amirmxt

...more

More shows like The Startup Ideas Podcast

The Smart Passive Income Online Business and Blogging Podcast by Pat Flynn

The Smart Passive Income Online Business and Blogging Podcast

3,449 Listeners

This Week in Startups by Jason Calacanis

This Week in Startups

1,290 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

537 Listeners

The Side Hustle Show by Nick Loper of Side Hustle Nation | YAP Media

The Side Hustle Show

1,581 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,093 Listeners

Marketing School - Digital Marketing and Online Marketing Tips by Eric Siu and Neil Patel

Marketing School - Digital Marketing and Online Marketing Tips

1,258 Listeners

Founders by David Senra

Founders

2,171 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

228 Listeners

The Game with Alex Hormozi by Alex Hormozi

The Game with Alex Hormozi

4,469 Listeners

My First Million by Hubspot Media

My First Million

2,654 Listeners

Marketing Against The Grain by Hubspot Media

Marketing Against The Grain

361 Listeners

Moneywise by Hampton

Moneywise

656 Listeners

The Koerner Office - Business Ideas and Deep Dives with Chris Koerner by Chris Koerner

The Koerner Office - Business Ideas and Deep Dives with Chris Koerner

269 Listeners

The Next Wave - AI and The Future of Technology by Mindstream (Hubspot Media)

The Next Wave - AI and The Future of Technology

54 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners