Weaviate Podcast

ChatArena with Yuxiang Wu - Weaviate Podcast #47!


Listen Later

Hey everyone, thank you so much for watching the Weaviate podcast! I am so excited about this episode! ChatArena is a software framework for multi-agent chat games. There are quite a few interesting applications of this, firstly we can use this kind of system to evaluate the intelligence of an LLM based on how intelligent it sounds in conversation with another LLM! Another interesting idea is to have the LLM impersonate people such as Lex Fridman or Sam Altman and simulate conversations between these people -- retrieving from their digital content to facilitate the impersonation. I thought there was so many interesting ideas in this podcast, please let us know what you think!

Links:
ChatArena on GitHub (please give it a star!) - https://github.com/chatarena/chatarena
Twitter thread from Yuxiang describing the launch of ChatArena - https://twitter.com/YuxiangJWu/status/1643633046208249856
Chapters
0:00 Welcome Yuxiang!
0:38 What is ChatArena?
2:38 Impersonating People with LLMs
4:58 Weaviate and ChatArena
8:14 Generative Feedback Loops
11:10 Chat Games
16:30 Scientific Peer Review Discussions
20:05 Code Repos and Multi-Agent LLMs
23:05 Scaling Multi-Agent LLMs
25:16 Role Evolution in Startups
26:00 Evolution of Multi-Agent RL Research
29:22 AlphaGo and MCTS Text Generation
36:55 Hallucination in Role Maintenance
41:15 Evaluating LLMs with ChatArena
45:40 ChatGPT Marketplace and Tool Use
50:30 Upcoming work from Yuxiang and ChatArena!

...more
View all episodesView all episodes
Download on the App Store

Weaviate PodcastBy Weaviate

  • 4
  • 4
  • 4
  • 4
  • 4

4

4 ratings


More shows like Weaviate Podcast

View all
Fareed Zakaria GPS by CNN

Fareed Zakaria GPS

3,420 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,063 Listeners

Acquired by Ben Gilbert and David Rosenthal

Acquired

4,159 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

293 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

223 Listeners

DataFramed by DataCamp

DataFramed

268 Listeners

Practical AI by Practical AI LLC

Practical AI

192 Listeners

Last Week in AI by Skynet Today

Last Week in AI

296 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,304 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

434 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

129 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

89 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

464 Listeners

AI + a16z by a16z

AI + a16z

31 Listeners

OpenAI Podcast by OpenAI

OpenAI Podcast

33 Listeners