Two Voice Devs

Episode 221 - AI Holiday Gift Guide: Amazon, Meta, Google, OpenAI, and MORE!


Listen Later

It's the holiday season, and the AI world has been showering us with gifts! Join Mark and Allen on Two Voice Devs as they unwrap a mountain of new announcements and releases from Amazon, Meta, Google, and OpenAI. From groundbreaking new models to developer-friendly tools, this episode is packed with insights on the latest advancements in AI. We'll explore the features and potential of each new "present" and discuss what it means for you, the developer.


[00:00:00] Intro and Holiday Greetings: Mark and Allen kick off the show, reflecting on the recent flurry of AI releases.

[00:00:15] The AI Gift Giving Season: A lighthearted introduction on the sheer volume of new AI tools being released.

[00:01:41] Amazon Nova Models: Amazon's surprising release of multiple new models, including Micro, Lite, and Pro, with a peek at Canvas (image generation) and Reel (video generation).

[00:04:42] Meta's Llama 3.3: The focus on multilingual capabilities and open-source nature of Llama 3.3.

[00:05:38] OpenAI vs. Google Announcement Showdown: The back-and-forth between Google and OpenAI with a focus on developer-related announcements.

[00:06:40] Google's Imagen 3 & Veo: Google's new advancements in image and video generation available on Vertex AI, including image editing via prompting.

[00:07:28] OpenAI's Sora Release: OpenAI makes their impressive video generation model available, but notably, not yet via API.

[00:08:34] OpenAI's Canvas for Code: Explore how you can interact with code as a chatbot on a virtual canvas.

[00:09:21] Microsoft's Expanded Copilot Free Tier: A note about Microsoft expanding access to their code tool.

[00:09:38] Google's Jules: The AI Bug Detective: An introduction to Google's automated bug-fixing system which proposes fixes in a version control branch.

[00:11:09] OpenAI's O1 Model: The official release of the O1 model with function calling, structured output, and image input capabilities.

[00:11:42] Gemini 2.0 API: Google's improved Gemini API, now in public preview, offering better performance with optimized tools.

[00:14:01] OpenAI's Real-Time API & WebRTC: Details about real time APIs, including WebRTC support for simplified browser-to-server connections.

[00:16:15] Google's Gemini 2.0 Live API: Real-time streaming API using WebSockets for multimodal input and output, with demos available on AI Studio.

[00:17:01] Google's New SDKs: A deep dive into the unified libraries for AI Studio and Vertex AI, simplifying things for developers.

[00:18:10] OpenAI's new Java and Go Libraries: OpenAI ups their game by adding libraries to match Google's supported development platforms.

[00:19:49] Google's PaliGemma 2 and Android XR: Vision-enabled open model, and a new Android platform for headsets and smart glasses.

[00:22:04] Wrapping Up: Mark and Allen discuss which tools they're most excited about for the break and what's in store for the future.


Let us know in the comments what you're most excited about, or if you noticed anything we missed. We’ll discuss it on future episodes.


#AI #ArtificialIntelligence #MachineLearning #GenerativeAI #LLM #LargeLanguageModels #AmazonNova #Llama3 #Gemini2 #OpenAI #GoogleAI #VertexAI #AIStudio #ChatGPT #GPT #O1 #Reasoning #ImageGeneration #VideoGeneration #DeveloperTools #Coding #Programming #WebRTC #AndroidXR #TechNews #TwoVoiceDevs

...more
View all episodesView all episodes
Download on the App Store

Two Voice DevsBy Mark and Allen

  • 1
  • 1
  • 1
  • 1
  • 1

1

1 ratings


More shows like Two Voice Devs

View all
Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

354 Listeners

The Daily AI Show by The Daily AI Show Crew - Brian, Beth, Jyunmi, Andy, Karl, and Eran

The Daily AI Show

3 Listeners