Two Voice Devs

Episode 205 - Gemini + LangGraph Agents + Google Sheets = Vodo Drive


Join us as we explore Vodo Drive, an innovative project that leverages Google's Gemini AI to revolutionize how we interact with spreadsheets. Creator Allen Firstenberg takes us behind the scenes, revealing the architecture, challenges, and breakthroughs of building an agentic system that understands and manipulates data like never before.


Discover how Vodo Drive:

* Empowers natural language interaction: Say goodbye to rigid formulas and hello to conversational commands.

* Integrates image recognition: Effortlessly input data by simply taking pictures.

* Provides real-time feedback: Experience transparent processing with live updates on your requests.

* Prioritizes security and user control: Maintain data privacy and manage permissions seamlessly.


More Info:

* Vodo Drive: https://vodo-drive.com/

* Gemini API on Vertex AI: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models

* LangChain: https://www.langchain.com/langchain

* LangGraph: https://www.langchain.com/langgraph

* Google Sheets: https://workspace.google.com/products/sheets/

* Firebase: https://firebase.google.com/


Timestamps:

* (0:00:00) Introduction and Project Overview: Discover the inspiration and goals behind Vodo Drive's participation in the Gemini API competition.

* (0:03:30) Reimagining Spreadsheet Control: Explore the evolution of Vodo Drive from voice-controlled spreadsheets to an AI-powered agentic system.

* (0:07:45) The Power of Visual Input: Learn how Vodo Drive seamlessly integrates image recognition to extract and input data from pictures.

* (0:11:55) Contextual Awareness and Conversational Flow: Delve into the importance of contextual awareness and how Vodo Drive maintains the flow of information.

* (0:14:30) Optimizing Tasks with the Right Tools: Understand the strategic use of spreadsheets as the computational backbone for Vodo Drive's data processing.

* (0:15:30) System Design and Architecture Breakdown: Get a detailed look at the core components of Vodo Drive, including Firebase Cloud Functions, Firestore, and Authentication.

* (0:22:55) Addressing Security Concerns: Explore the safety measures implemented to protect user data and prevent unauthorized actions.

* (0:26:35) Real-Time Updates and User Experience: Discover how Vodo Drive leverages Firestore to provide real-time feedback and enhance user experience.

* (0:32:30) Behind the Scenes: The AI's Internal Dialogue: Uncover the hidden conversations happening between the agent and the LLM during data processing.

* (0:38:05) Firebase Authentication and Authorization: Learn how Vodo Drive ensures secure access to user spreadsheets and leverages Google's authorization system.

* (0:40:45) Firebase Cloud Storage and Media Handling: Explore the role of cloud storage in managing user-uploaded photos and audio files.

* (0:43:35) Gemini's Role in Image Processing and Agentic Logic: Discover how Gemini powers both image recognition and the decision-making process of the agentic system.


Don't miss this insightful discussion on the future of AI-powered data management and how Vodo Drive is paving the way for a more intuitive and efficient user experience.


#GeminiAPI #LLM #AgenticSystems #VoiceControl #Spreadsheets #Firebase #WebDevelopment #AndroidDevelopment #AI #Innovation

Two Voice Devs, by Mark and Allen
