Tokenization: The Building Blocks of Natural Language Processing
Hosted by Nathan Rigoni (no guest)
In this first half of the NLP mini‑series, Nathan breaks down how computers turn raw text into numbers that machines can manipulate. He explains the evolution from naïve “split‑by‑space” word indexing to modern sub‑word tokenization, shows why tokens are both the engine and the bottleneck of today’s large language models, and highlights the numeric and linguistic challenges that still limit AI performance. How can we redesign tokenization so models can understand numbers and rare words without exploding in size?
What you will learn
- The basic “word‑to‑integer” tokenization method and why it fails at web‑scale vocabularies (see the sketch after this list).
- Sub‑word tokenization (syllable‑like prefixes, suffixes, and character‑level tokens) and its typical vocabulary size (~100 k).
- How context‑window limits (roughly 2 million tokens for today’s state‑of‑the‑art models) constrain how much text a model can attend to and how much memory it consumes.
- The impact of token granularity on numeric handling (e.g., different token splits of “100 000”) and on counting characters (the classic “how many R’s in strawberry?” problem); both appear in the sketch after this list.
- Why the bits‑per‑parameter metric (≈ 3–4 bits) is tied to preserving every token across the model’s forward pass; a back‑of‑envelope calculation follows the sketch below.
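The following is a minimal, self-contained Python sketch of the ideas above. The vocabularies and the greedy longest-match splitter are invented purely for illustration and are not the BPE/WordPiece algorithms discussed in the episode; real tokenizers learn their sub-word pieces from data and use vocabularies of roughly 100k entries.

```python
# A minimal sketch of word-level vs. sub-word tokenization.
# Vocabularies and the greedy longest-match splitter are toy examples,
# not the learned merges of a real BPE or WordPiece tokenizer.

def word_level_ids(vocab, text):
    """Naive word-to-integer indexing: split on spaces, unseen words become <UNK>."""
    unk = vocab["<UNK>"]
    return [vocab.get(word, unk) for word in text.split()]

def greedy_subword_split(pieces, text):
    """Greedy longest-match split of `text` into sub-word pieces from `pieces`."""
    out, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):   # try the longest substring first
            if text[i:j] in pieces:
                out.append(text[i:j])
                i = j
                break
        else:                               # no piece matched:
            out.append(text[i])             # fall back to a single character
            i += 1
    return out

if __name__ == "__main__":
    # 1) Word-level indexing: any word outside the vocabulary collapses to <UNK>.
    word_vocab = {"<UNK>": 0, "the": 1, "model": 2, "reads": 3, "text": 4}
    print(word_level_ids(word_vocab, "the model reads text"))          # [1, 2, 3, 4]
    print(word_level_ids(word_vocab, "the model reads tokenization"))  # [1, 2, 3, 0]

    # 2) Sub-word splitting: a small inventory of pieces still covers rare words.
    pieces = {"token", "ization", "straw", "berry", "100", "000", "0", " ", ","}
    print(greedy_subword_split(pieces, "tokenization"))   # ['token', 'ization']

    # 3) Numeric handling: the same quantity, written differently, tokenizes differently.
    for spelling in ("100000", "100 000", "100,000", "1000000"):
        print(spelling, "->", greedy_subword_split(pieces, spelling))

    # 4) Character counting: the model sees ['straw', 'berry'], not individual letters,
    #    which is why questions like "how many r's in strawberry?" are hard.
    print(greedy_subword_split(pieces, "strawberry"))      # ['straw', 'berry']
```

Running it shows the three failure modes at a glance: the rare word collapses to <UNK> under word-level indexing, the same quantity tokenizes differently depending on how it is written, and “strawberry” arrives as two pieces rather than letters the model can count.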
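For the bits‑per‑parameter point, here is a back‑of‑envelope calculation only: it multiplies parameter count by the ~3–4 bits quoted in the episode to show how total storable information scales with model size. The model sizes are arbitrary examples, not figures from the show.

```python
# Back-of-envelope arithmetic: total capacity grows linearly with parameter count
# if each parameter stores roughly 3-4 bits of information.
BITS_PER_PARAM = 3.5  # midpoint of the ~3-4 bits range mentioned in the episode

for params in (1e9, 7e9, 70e9):                # arbitrary illustrative model sizes
    total_bits = params * BITS_PER_PARAM
    total_gigabytes = total_bits / 8 / 1e9     # bits -> bytes -> gigabytes
    print(f"{params / 1e9:4.0f}B parameters ≈ {total_gigabytes:5.1f} GB of storable information")
```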
Resources mentioned
- Original word‑level tokenization (space‑split) – basic concept.
- Sub‑word tokenization methods such as Byte‑Pair Encoding (BPE) and WordPiece (used by GPT, BERT, etc.).
- Papers on bits‑per‑parameter efficiency in large language models (to be covered in future “paper review” episodes).
- Example numeric tokenization challenges (e.g., different token splits for 100 000).
Why this episode matters
Tokenization is the foundation of every downstream NLP task—from document classification to chatbots. Understanding its limits explains why models hallucinate, struggle with math, or miscount characters, and points to research directions (better token schemes, dynamic chunking, or byte‑level models) that could unlock longer contexts and more accurate reasoning. For anyone building or fine‑tuning language models, mastering tokenization is the first step toward more reliable AI.
Subscribe for more AI deep‑dives, visit www.phronesis‑analytics.com, or email nathan.rigoni@phronesis‑analytics.com.
Keywords: tokenization, sub‑word tokenization, BPE, WordPiece, NLP basics, large language model limits, token length, numeric tokenization, bits‑per‑parameter, contextual AI.