The paper "To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning" explores the computational trade-offs between two primary strategies for scaling Large Language Model (LLM) test-time compute: sequential search (explicit backtracking within a chain-of-thought) and parallel sampling (generating multiple independent solutions and using best-of-n selection).
By evaluating these approaches on two strategic games, CountDown and Sudoku, the authors discovered that backtracking is not universally beneficial. Under a fixed compute budget, the backtracking model significantly underperformed parallel sampling on CountDown, but outperformed it on Sudoku.
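The contrast between the two strategies can be made concrete with a minimal sketch. The `generate` and `score` functions below are hypothetical stand-ins (not from the paper) for the model's sampler and a solution verifier; the point is only the control flow of best-of-n selection:

```python
import random

def generate(seed):
    """Hypothetical stand-in for sampling one independent solution."""
    rng = random.Random(seed)
    return [rng.random() for _ in range(4)]

def score(solution):
    """Hypothetical verifier: higher is better."""
    return sum(solution)

def best_of_n(n):
    """Parallel sampling: draw n independent solutions, keep the best."""
    candidates = [generate(seed) for seed in range(n)]
    return max(candidates, key=score)

best = best_of_n(8)
```

Sequential search, by contrast, would spend the same budget on one long trace that revises itself, so the comparison under a fixed budget is n independent short attempts versus one long self-correcting attempt.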
The study identifies two main reasons why teaching a model to backtrack via supervised fine-tuning can inadvertently degrade its performance:
- Prescribed search bias: Training models on fixed backtracking traces forces them to imitate specific, often suboptimal search paths, preventing them from independently discovering more efficient strategies.
- Excessive verbosity: Explicit chain-of-thought supervision encourages models to generate lengthy but ineffective reasoning steps, while simultaneously discouraging internal "thinking" (implicit reasoning without verbalization).
The authors also highlight that task characteristics impact these results. Specifically, backtracking is more compute-efficient for tasks with deeper search trees, which explains why it succeeds in Sudoku (a deep trial-and-error game) but struggles against parallel sampling in CountDown (which has a shallower search tree).
Finally, the paper demonstrates how Reinforcement Learning (RL) via Group Relative Policy Optimization (GRPO) interacts differently with these two training strategies. When fine-tuned with RL, backtracking models gain the ability to discover novel, highly effective search strategies that surpass their original training data. In contrast, direct-solution models trained with RL improve their one-shot (pass@1) accuracy but lose their ability to generate diverse solutions, sharply limiting their effectiveness in parallel search scenarios.
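GRPO's core mechanism, the part relevant to the distinction above, is that it scores each sampled response relative to its own group rather than a learned value network. A minimal sketch of the group-relative advantage (the clipped policy-gradient loss built on top of it is omitted, and the 0/1 reward is an illustrative choice):

```python
import statistics

def grpo_advantages(rewards, eps=1e-8):
    """Group Relative Policy Optimization baseline: normalize each
    response's reward by its group's mean and standard deviation."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]

# One group of 4 sampled solutions to the same puzzle; reward = solved or not.
advs = grpo_advantages([1.0, 0.0, 0.0, 1.0])
```

Because advantages are zero-sum within a group, a direct-solution model is pushed hard toward whichever single answer style wins most often, which is consistent with the diversity collapse the paper reports.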