(A)bsolutely (I)ncredible Podcast

'Insane': OpenAI Introduces GPT-4o Native Image Generation, Already Wowing Users


Listen Later

We’re coming up on the one year anniversary since OpenAI released its first omni or multimodal model, GPT-4o back in May 2024, but that old standby still has some tricks up its sleeve. Case-in-point, today OpenAI finally turned on the native multimodal image generation capabilities of GPT-4o for users of its hit chatbot ChatGPT on the Plus, Pro, Team, and Free usage tiers, though the company said it would also soon be made available for Enterprise, Edu, and through its application programming interface (API).

Unlike the previous generative AI image model available in ChatGPT — OpenAI’s DALL-E 3, a classic diffusion transformer model that was trained to reconstruct images from text prompts by removing noise from pixels — this new image generator is part of the same model that spits out text and code, as OpenAI trained the entire model to understand all these forms of media at once.

OpenAI president Greg Brockman had long ago previewed this native capability of GPT-4o back in May 2024, but for reasons that still remain unknown publicly, the company held onto it until now — following the public release of what many AI power users saw as a similar feature from Google AI Studio with its Gemini 2 Flash Experimental model.

This has resulted in a much higher quality image generator that produces far more lifelike images and accurate text baked in, and it’s already impressing users — one of whom calls the quality insane. By the same token (pun intended), OpenAI still hasn’t said precisely what data GPT-4o’s image generation capabilities were trained on — and given the history of the company and other model providers, it likely includes many artworks scraped from the web, some of which are presumably copyrighted, which is likely to anger the artists behind them.

More on OpenAI’s GTP 4o native image generation on VentureBeat

Agentic AI for the Enterprise | Inbound and Outbound AI Voice Agents

I’ve partnered with DBC Technologies and I am now consulting with companies who are interested in automating inbound and outbound messaging with AI Voice Agents.

Web AI Chatbots, Inbound / Outbound AI Voice Agents, AI Marketing & Consulting

If you are interested in AI Voice Agents for your business or organization message me.

Thats all for today, but AI is moving fast - subscribe and follow for more Neural News.



Get full access to Neural News Network at remunerationlabs.substack.com/subscribe
...more
View all episodesView all episodes
Download on the App Store

(A)bsolutely (I)ncredible PodcastBy Bruce Burke