COEY Cast

GLM 4.7 Flash: Open Weights, Local Dreams, Real Constraints


Listen Later

Zhipu’s GLM 4.7 Flash drops with open weights and big claims about local and edge friendly workflows. This episode breaks down what Mixture of Experts and MLA attention actually mean for VRAM, context windows, and running agents at home without becoming a full time inference engineer. Hear how agentic coding fits into real human in the loop workflows, where SWE Bench hype meets production reality, and why review beats raw generation in AI audio and video. Plus a quick dive on Google’s AI Mode Direct Offers, Universal Commerce Protocol, and why product feeds and schema are now creative assets for performance marketers.
...more
View all episodesView all episodes
Download on the App Store

COEY CastBy COEY