626: Subword Tokenization with Byte-Pair Encoding

Nov 11, 2022, 7 min

Episode description

Word, character, and subword tokenization go head-to-head this week as Jon Krohn delivers a mini-bootcamp on the NLP process of tokenization. Additional materials: www.superdatascience.com/626. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.