
Sign up to save your podcasts
Or
It's the holiday season, and the AI world has been showering us with gifts! Join Mark and Allen on Two Voice Devs as they unwrap a mountain of new announcements and releases from Amazon, Meta, Google, and OpenAI. From groundbreaking new models to developer-friendly tools, this episode is packed with insights on the latest advancements in AI. We'll explore the features and potential of each new "present" and discuss what it means for you, the developer.
[00:00:00] Intro and Holiday Greetings: Mark and Allen kick off the show, reflecting on the recent flurry of AI releases.
[00:00:15] The AI Gift Giving Season: A lighthearted introduction on the sheer volume of new AI tools being released.
[00:01:41] Amazon Nova Models: Amazon's surprising release of multiple new models, including Micro, Lite, and Pro, with a peek at Canvas (image generation) and Reel (video generation).
[00:04:42] Meta's Llama 3.3: The focus on multilingual capabilities and open-source nature of Llama 3.3.
[00:05:38] OpenAI vs. Google Announcement Showdown: The back-and-forth between Google and OpenAI with a focus on developer-related announcements.
[00:06:40] Google's Imagen 3 & Veo: Google's new advancements in image and video generation available on Vertex AI, including image editing via prompting.
[00:07:28] OpenAI's Sora Release: OpenAI makes their impressive video generation model available, but notably, not yet via API.
[00:08:34] OpenAI's Canvas for Code: Explore how you can interact with code as a chatbot on a virtual canvas.
[00:09:21] Microsoft's Expanded Copilot Free Tier: A note about Microsoft expanding access to their code tool.
[00:09:38] Google's Jules: The AI Bug Detective: An introduction to Google's automated bug-fixing system which proposes fixes in a version control branch.
[00:11:09] OpenAI's O1 Model: The official release of the O1 model with function calling, structured output, and image input capabilities.
[00:11:42] Gemini 2.0 API: Google's improved Gemini API, now in public preview, offering better performance with optimized tools.
[00:14:01] OpenAI's Real-Time API & WebRTC: Details about real time APIs, including WebRTC support for simplified browser-to-server connections.
[00:16:15] Google's Gemini 2.0 Live API: Real-time streaming API using WebSockets for multimodal input and output, with demos available on AI Studio.
[00:17:01] Google's New SDKs: A deep dive into the unified libraries for AI Studio and Vertex AI, simplifying things for developers.
[00:18:10] OpenAI's new Java and Go Libraries: OpenAI ups their game by adding libraries to match Google's supported development platforms.
[00:19:49] Google's PaliGemma 2 and Android XR: Vision-enabled open model, and a new Android platform for headsets and smart glasses.
[00:22:04] Wrapping Up: Mark and Allen discuss which tools they're most excited about for the break and what's in store for the future.
Let us know in the comments what you're most excited about, or if you noticed anything we missed. We’ll discuss it on future episodes.
#AI #ArtificialIntelligence #MachineLearning #GenerativeAI #LLM #LargeLanguageModels #AmazonNova #Llama3 #Gemini2 #OpenAI #GoogleAI #VertexAI #AIStudio #ChatGPT #GPT #O1 #Reasoning #ImageGeneration #VideoGeneration #DeveloperTools #Coding #Programming #WebRTC #AndroidXR #TechNews #TwoVoiceDevs
1
11 ratings
It's the holiday season, and the AI world has been showering us with gifts! Join Mark and Allen on Two Voice Devs as they unwrap a mountain of new announcements and releases from Amazon, Meta, Google, and OpenAI. From groundbreaking new models to developer-friendly tools, this episode is packed with insights on the latest advancements in AI. We'll explore the features and potential of each new "present" and discuss what it means for you, the developer.
[00:00:00] Intro and Holiday Greetings: Mark and Allen kick off the show, reflecting on the recent flurry of AI releases.
[00:00:15] The AI Gift Giving Season: A lighthearted introduction on the sheer volume of new AI tools being released.
[00:01:41] Amazon Nova Models: Amazon's surprising release of multiple new models, including Micro, Lite, and Pro, with a peek at Canvas (image generation) and Reel (video generation).
[00:04:42] Meta's Llama 3.3: The focus on multilingual capabilities and open-source nature of Llama 3.3.
[00:05:38] OpenAI vs. Google Announcement Showdown: The back-and-forth between Google and OpenAI with a focus on developer-related announcements.
[00:06:40] Google's Imagen 3 & Veo: Google's new advancements in image and video generation available on Vertex AI, including image editing via prompting.
[00:07:28] OpenAI's Sora Release: OpenAI makes their impressive video generation model available, but notably, not yet via API.
[00:08:34] OpenAI's Canvas for Code: Explore how you can interact with code as a chatbot on a virtual canvas.
[00:09:21] Microsoft's Expanded Copilot Free Tier: A note about Microsoft expanding access to their code tool.
[00:09:38] Google's Jules: The AI Bug Detective: An introduction to Google's automated bug-fixing system which proposes fixes in a version control branch.
[00:11:09] OpenAI's O1 Model: The official release of the O1 model with function calling, structured output, and image input capabilities.
[00:11:42] Gemini 2.0 API: Google's improved Gemini API, now in public preview, offering better performance with optimized tools.
[00:14:01] OpenAI's Real-Time API & WebRTC: Details about real time APIs, including WebRTC support for simplified browser-to-server connections.
[00:16:15] Google's Gemini 2.0 Live API: Real-time streaming API using WebSockets for multimodal input and output, with demos available on AI Studio.
[00:17:01] Google's New SDKs: A deep dive into the unified libraries for AI Studio and Vertex AI, simplifying things for developers.
[00:18:10] OpenAI's new Java and Go Libraries: OpenAI ups their game by adding libraries to match Google's supported development platforms.
[00:19:49] Google's PaliGemma 2 and Android XR: Vision-enabled open model, and a new Android platform for headsets and smart glasses.
[00:22:04] Wrapping Up: Mark and Allen discuss which tools they're most excited about for the break and what's in store for the future.
Let us know in the comments what you're most excited about, or if you noticed anything we missed. We’ll discuss it on future episodes.
#AI #ArtificialIntelligence #MachineLearning #GenerativeAI #LLM #LargeLanguageModels #AmazonNova #Llama3 #Gemini2 #OpenAI #GoogleAI #VertexAI #AIStudio #ChatGPT #GPT #O1 #Reasoning #ImageGeneration #VideoGeneration #DeveloperTools #Coding #Programming #WebRTC #AndroidXR #TechNews #TwoVoiceDevs
354 Listeners
3 Listeners