airhacks.fm podcast with adam bien

TornadoVM: The Need for GPU Speed


Listen Later

An airhacks.fm conversation with Michalis Papadimitriou (@mikepapadim) about:
starting with Java 8,
first computer experiences with Pentium 2,
doom 2 and Microsoft Paint,
university introduction to Object-oriented programming using Objects First and bluej IDE,
Monte Carlo simulations for financial portfolio optimization in Java,
porting Java applications to OpenCL for GPU acceleration achieving 20x speedup,
working at Huawei on GPU hardware,
writing unit tests as introduction to TornadoVM,
working on FPGA integration and Graal compiler optimizations,
experience at OctoAI startup doing AI compiler optimizations for TensorFlow and PyTorch models,
understanding model formats evolution from ONNX to GGUF,
standardization of LLM inference through Llama models,
implementing GPU-accelerated Llama 3 inference in pure Java using TornadoVM,
achieving 3-6x speedup over CPU implementations,
supporting multiple models including Mistral and working on qwen 3 and deepseek,
differences between models mainly in normalization layers,
GGUF becoming quasi-standard for LLM model distribution,
TornadoVM's Consume and Persist API for optimizing GPU data transfers,
challenges with OpenCL deprecation on macOS and plans for Metal backend,
importance of developer experience and avoiding python dependencies for Java projects,
runtime and compiler optimizations for GPU inference,
kernel fusion techniques,
upcoming integration with langchain4j,
potential of Java ecosystem with Graal VM and Project Panama FFM for high-performance inference,
advantages of Java's multi-threading capabilities for inference workloads

Michalis Papadimitriou on twitter: @mikepapadim

...more
View all episodesView all episodes
Download on the App Store

airhacks.fm podcast with adam bienBy Adam Bien

  • 5
  • 5
  • 5
  • 5
  • 5

5

7 ratings


More shows like airhacks.fm podcast with adam bien

View all
The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

289 Listeners

WSJ Tech News Briefing by The Wall Street Journal

WSJ Tech News Briefing

1,647 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,096 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

625 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

Bits und so by Undsoversum GmbH

Bits und so

25 Listeners

c’t uplink - der IT-Podcast aus Nerdistan by c’t Magazin

c’t uplink - der IT-Podcast aus Nerdistan

6 Listeners

heiseshow by heise online

heiseshow

2 Listeners

REWORK by 37signals

REWORK

210 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

204 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

141 Listeners

LANZ & PRECHT by ZDF, Markus Lanz & Richard David Precht

LANZ & PRECHT

328 Listeners

Der KI-Podcast by ARD

Der KI-Podcast

12 Listeners

Foojay.io, the Friends Of OpenJDK! by Foojay.io

Foojay.io, the Friends Of OpenJDK!

0 Listeners

The Economics Show by Financial Times

The Economics Show

147 Listeners