Tech News Now

Gemini 2.0: The Agentic Era


Listen Later

Google DeepMind introduces Gemini 2.0, a new AI model designed for the agentic era.

Gemini 2.0 boasts impressive capabilities, including native tool use, image and speech generation, and enhanced performance in various benchmarks.

This episode will explore Gemini 2.0's key features, such as:

Agent Capabilities:

  • Taking action and following instructions under user supervision
  • Tool use, including Google Search, code execution, and more
  • Real-time streaming, responding to live audio and video input
  • Multimodal understanding
  • Hands-on Applications:

    • Spatial understanding within images
      • Video understanding, including outlining key moments and summarization
      • Function calling with the Maps API6○Multimodal Live API for developers
      • Starter apps like Boilerplate, GenExplainer, and GenWeather
      • Performance Improvements:

        • Enhanced capabilities across a range of benchmarks, including MMLU-Pro, Natural2Code, Bird-SQL, LiveCodeBench, FACTS Grounding, MATH, HiddenMath, GPQA, MRCR, MMMU, Vibe-Eval, CoVoST2, and EgoSchema.
        • Responsible Development:

          • Prioritizing safety and security in the development of these new technologies
          • Join us as we delve into the potential of Gemini 2.0 to revolutionize human-agent interaction and unlock a new era of possibilities.

            ...more
            View all episodesView all episodes
            Download on the App Store

            Tech News NowBy Andre Sampaio