November 25, 2024

Ep5: Multimodal AI Agents: Benchmarking, Adapting, and Adversarial attacks

48 minutes

In this episode, we dive into multimodal AI agents, starting with the recent release of Runner H and moving on to groundbreaking research, including:

04:15 VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks by Jing Yu Koh et al.

19:18 AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations by Gaurav Verma et al.

32:32 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast by Xiangming Gu et al.

By The Agents of Tomorrow Show
