Misreading Chat

#131: FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness


Listen Later

CUDA で書かれた PyTorch 用カーネルに森田が玉砕しました。
...more
View all episodesView all episodes
Download on the App Store

Misreading ChatBy Hajime Morrita, Jun Mukai

  • 5
  • 5
  • 5
  • 5
  • 5

5

6 ratings


More shows like Misreading Chat

View all
Rebuild by Tatsuhiko Miyagawa

Rebuild

51 Listeners

耳で学ぶAI、ロボシンク by 矢野 哲平

耳で学ぶAI、ロボシンク

1 Listeners