airhacks.fm podcast with adam bien

Revolutionizing AI with Java: From LLMs to Vector APIs



An airhacks.fm conversation with Alfonso Peterssen (@TheMukel) about:
Alfonso previously appeared on "#294 LLama2.java: LLM integration with A 100% Pure Java file",
discussion of llama2.java and llama3.java projects for running LLMs in Java,
performance comparison between Java and C implementations,
use of Vector API in Java for matrix multiplication,
challenges and potential improvements in Vector API implementation,
integration of various LLM models such as Mistral, Phi, Qwen, or Gemma,
differences in model sizes and capabilities,
tokenization and chat format challenges across different models,
potential for Java Community Process (JCP) standardization of GGUF parsing,
quantization techniques and their impact on performance,
plans for integrating with LangChain4j,
advantages of pure Java implementations for AI models,
potential for GraalVM and native image optimizations,
discussion on the future of specialized AI models for specific tasks,
challenges in training models with language capabilities but limited world knowledge,
importance of SIMD instructions and vector operations for performance optimization,
potential improvements in Java's handling of different float formats like float16 and bfloat16,
discussion on the role of smaller, specialized AI models in enterprise applications and development tools
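
To make the quantization topic above more concrete, here is a minimal sketch of 8-bit block quantization in plain Java. It is loosely modeled on the 8-bit block formats used in GGUF files and is not taken from llama2.java or llama3.java. The class name `BlockQuantSketch`, the record `QuantizedVector`, the block size of 32, and the use of a 32-bit float scale per block are all assumptions made for illustration. The inner loop of `dot` is the kind of code that SIMD instructions and the Vector API are meant to speed up; it is written as a scalar loop here so that it runs without incubator modules.

```java
final class BlockQuantSketch {

    // Assumed block size: every 32 consecutive values share one scale.
    static final int BLOCK_SIZE = 32;

    // Hypothetical container: one float scale per block, one signed byte per value.
    record QuantizedVector(float[] scales, byte[] values) {}

    // Quantizes x block by block so that x[i] is approximately scale * values[i].
    static QuantizedVector quantize(float[] x) {
        if (x.length % BLOCK_SIZE != 0) {
            throw new IllegalArgumentException("length must be a multiple of " + BLOCK_SIZE);
        }
        int blocks = x.length / BLOCK_SIZE;
        float[] scales = new float[blocks];
        byte[] values = new byte[x.length];
        for (int b = 0; b < blocks; b++) {
            int start = b * BLOCK_SIZE;
            // The largest magnitude in the block determines the scale.
            float maxAbs = 0f;
            for (int i = start; i < start + BLOCK_SIZE; i++) {
                maxAbs = Math.max(maxAbs, Math.abs(x[i]));
            }
            float scale = maxAbs / 127f;
            scales[b] = scale;
            // An all-zero block gets scale 0 and all-zero bytes.
            float inverse = scale == 0f ? 0f : 1f / scale;
            for (int i = start; i < start + BLOCK_SIZE; i++) {
                int rounded = Math.round(x[i] * inverse);
                // Clamp to guard against floating-point rounding at the edges.
                values[i] = (byte) Math.max(-127, Math.min(127, rounded));
            }
        }
        return new QuantizedVector(scales, values);
    }

    // Dot product of a quantized vector with a float vector, dequantizing per block.
    static float dot(QuantizedVector q, float[] y) {
        if (q.values().length != y.length) {
            throw new IllegalArgumentException("length mismatch");
        }
        float sum = 0f;
        for (int b = 0; b < q.scales().length; b++) {
            int start = b * BLOCK_SIZE;
            float blockSum = 0f;
            for (int i = start; i < start + BLOCK_SIZE; i++) {
                blockSum += q.values()[i] * y[i];
            }
            // Apply the scale once per block instead of once per element.
            sum += q.scales()[b] * blockSum;
        }
        return sum;
    }

    public static void main(String[] args) {
        float[] x = new float[64];
        float[] y = new float[64];
        for (int i = 0; i < x.length; i++) {
            x[i] = (i - 32) / 8f;
            y[i] = 0.5f;
        }
        float exact = 0f;
        for (int i = 0; i < x.length; i++) {
            exact += x[i] * y[i];
        }
        System.out.println("exact dot product:     " + exact);
        System.out.println("quantized dot product: " + dot(quantize(x), y));
    }
}
```

In this sketch each value takes one byte plus a shared scale per block, which is roughly a quarter of the memory of 32-bit floats, at the cost of a rounding error bounded by half the block's scale per value.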

Alfonso Peterssen on Twitter: @TheMukel

airhacks.fm podcast with adam bien, by Adam Bien
