Super Data Science: ML & AI Podcast with Jon Krohn

626: Subword Tokenization with Byte-Pair Encoding

11.11.2022 - By Jon KrohnPlay

Download our free app to listen on your phone

Download on the App StoreGet it on Google Play

Word tokenization, character tokenization and subword tokenization go head-to-head this week as Jon Krohn delivers a mini-bootcamp on the NLP-related process.

Additional materials: www.superdatascience.com/626

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

More episodes from Super Data Science: ML & AI Podcast with Jon Krohn