AI Post Transformers

CacheSlide: Position-Aware KV Cache Reuse for Agent LLMs



This episode examines CacheSlide from USENIX FAST26, a system that enables LLMs to reuse cached key-value pairs across shifting prompt positions in agentic workflows. The paper introduces chunked contextual position encoding and priority-based eviction to solve the position mismatch problem that prevents KV cache reuse when prompt segments shift in multi-turn agent conversations.
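To make the position mismatch problem concrete, here is a minimal toy sketch (illustrative names only, not the paper's actual design): chunks are cached by content hash with positions stored relative to the chunk start, so a cached chunk can be "slid" to a new absolute offset when earlier prompt segments change, and the lowest-priority chunk is evicted when the cache is full.

```python
import hashlib


class ChunkKVCache:
    """Toy sketch of position-aware KV reuse with priority eviction.

    All names here are assumptions for illustration; the real CacheSlide
    system operates on tensor KV pairs inside an LLM serving stack.
    """

    def __init__(self, capacity=4):
        self.capacity = capacity
        # chunk_hash -> (list of (relative_position, kv_entry), priority)
        self.cache = {}

    @staticmethod
    def _key(chunk_tokens):
        # Content-based key: the same chunk text hits the cache even if
        # it appears at a different position in a later prompt.
        return hashlib.sha256(" ".join(chunk_tokens).encode()).hexdigest()

    def put(self, chunk_tokens, relative_kv, priority):
        if len(self.cache) >= self.capacity:
            # Priority-based eviction: drop the least-important chunk.
            victim = min(self.cache, key=lambda k: self.cache[k][1])
            del self.cache[victim]
        self.cache[self._key(chunk_tokens)] = (relative_kv, priority)

    def get(self, chunk_tokens, new_offset):
        """Reuse a cached chunk at a new absolute position by re-basing
        its stored relative positions onto new_offset."""
        entry = self.cache.get(self._key(chunk_tokens))
        if entry is None:
            return None
        relative_kv, _ = entry
        return [(new_offset + rel, kv) for rel, kv in relative_kv]
```

For example, a tool-output chunk cached at the start of one turn can be retrieved with `get(tokens, new_offset=10)` after a later turn pushes it ten positions deeper into the prompt, instead of being recomputed from scratch.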

AI Post Transformers, by mcgrof