May 31, 2026

LT2: Linear-Time Looped Transformers

42 minutes

Replaces quadratic softmax attention in looped architectures with linear/sparse mechanisms for iterative memory refinement, achieving parity with standard looped transformers at much lower cost.

...more

View all episodes

By Shaoqing Tan

May 31, 2026

LT2: Linear-Time Looped Transformers

42 minutes

Replaces quadratic softmax attention in looped architectures with linear/sparse mechanisms for iterative memory refinement, achieving parity with standard looped transformers at much lower cost.

...more

Share LT2: Linear-Time Looped Transformers

Sign up to save your podcasts

LT2: Linear-Time Looped Transformers

LT2: Linear-Time Looped Transformers