My Weird Prompts

AI's Data Kitchen: From Hoovering to Fine-Tuning


Listen Later

Everyone talks about the magic of AI, but the real war is over data. This episode pulls back the curtain on the messy, multi-billion-dollar process of finding, cleaning, and filtering the information that trains large language models. We explore why the era of simply "hoovering" the internet is over, how deduplication and quality filtering work, and why the "well of high-quality data" might be running dry.
...more
View all episodesView all episodes
Download on the App Store

My Weird PromptsBy Daniel Rosehill