
Sign up to save your podcasts
Or


I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design quality, user experience improvements, and SEO optimization capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow—design, planning, back-end, or something else—this episode will save you a lot of trial and error.
What you’ll learn:
—
Blog design: https://www.chatprd.ai/blog
—
Brought to you by:
Lovable—Build apps by simply chatting with AI
—
Where to find Claire Vo:
ChatPRD: https://www.chatprd.ai/
Website: https://clairevo.com/
LinkedIn: https://www.linkedin.com/in/clairevo/
X: https://x.com/clairevo
—
In this episode, we cover:
(00:00) Introduction to the AI design challenge
(01:25) The question: Which model is the better designer?
(03:08) The prompt used for all three models
(04:10) Gemini 3 Pro’s approach and results
(06:00) Opus 4.5’s approach and results
(10:54) Codex 5.1’s approach and disappointing results
(14:51) Comparing the three designs side by side
(16:03) Analyzing the change logs and SEO improvements from each model
(22:43) Final verdict
(23:00) Conclusion and next steps
—
Tools referenced:
• Gemini 3 Pro: https://deepmind.google/models/gemini/pro/
• Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5
• OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex
• Cursor: https://cursor.com/
—
Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [email protected].
By Claire Vo4.8
143143 ratings
I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design quality, user experience improvements, and SEO optimization capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow—design, planning, back-end, or something else—this episode will save you a lot of trial and error.
What you’ll learn:
—
Blog design: https://www.chatprd.ai/blog
—
Brought to you by:
Lovable—Build apps by simply chatting with AI
—
Where to find Claire Vo:
ChatPRD: https://www.chatprd.ai/
Website: https://clairevo.com/
LinkedIn: https://www.linkedin.com/in/clairevo/
X: https://x.com/clairevo
—
In this episode, we cover:
(00:00) Introduction to the AI design challenge
(01:25) The question: Which model is the better designer?
(03:08) The prompt used for all three models
(04:10) Gemini 3 Pro’s approach and results
(06:00) Opus 4.5’s approach and results
(10:54) Codex 5.1’s approach and disappointing results
(14:51) Comparing the three designs side by side
(16:03) Analyzing the change logs and SEO improvements from each model
(22:43) Final verdict
(23:00) Conclusion and next steps
—
Tools referenced:
• Gemini 3 Pro: https://deepmind.google/models/gemini/pro/
• Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5
• OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex
• Cursor: https://cursor.com/
—
Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [email protected].

538 Listeners

1,085 Listeners

226 Listeners

209 Listeners

146 Listeners

207 Listeners

136 Listeners

209 Listeners

595 Listeners

36 Listeners

60 Listeners

35 Listeners

38 Listeners

64 Listeners

52 Listeners