Big Models, Bigger Benchmarks: Qwen3-Next’s Leap Forward. vLLM Changes the Inference Game. Scaling the Edge: Qwen3-Next-80B on Blackwell, Windows, WSL2, Docker. Leaner Chat Clients, DIY Coding Agents, and Opencode Hacks. Tool Use: GPT-OSS-20B’s “Simulated” Shortfall Sparks Debate