Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
Welcome to the confusion matrix where we have lively and candid discussions about data, data science and AI in day to day life, business and beyond.... more
FAQs about The Confusion Matrix:How many episodes does The Confusion Matrix have?The podcast currently has 33 episodes available.
February 23, 2026Will AI steal your job?Alex and Pete examine whether AI will replace jobs within 18 months, discussing automation history, current limitations, and Microsoft’s predictions regarding workplace displacement....more1h 1minPlay
February 09, 2026Agent coding an LLM chat client – The aftermathAlex and Pete discuss AI coding agents’ limitations after Alex’s LLM client build revealed extensive bugs and missing features. They explore testing challenges, agent reliability metrics, and corporate liability concerns preventing enterprise adoption. This episode will make a lot more sense if you listen to the previous episode first....more1h 8minPlay
January 26, 2026Agent coding an LLM chat client step-by-stepAlex and Pete discuss Alex’s rebuild of his perma-regenerating LLM chat client. This time he did it properly! No messing, just hardcore, disciplined, systematic, agent-coding goodness. Join them as he walks Pete through the process in excruciating, step-by-step detail. The Vanishing Gradients episode Alex mentions....more1h 16minPlay
December 03, 2025How the norms use LLMsAlex and Pete examine OpenAI’s comprehensive report on ChatGPT usage patterns, analysing classification methodologies and user behaviour across consumer and workplace contexts. They discuss the shift from advice-seeking to task execution, the dominance of writing and information-seeking functions, and implications for future AI adoption and market opportunities....more1h 1minPlay
November 17, 2025Evals and Aliens – How model testing is not a binary affairPete and Alex examine AI model evaluation methodologies, comparing traditional machine learning metrics with the qualitative assessment challenges of large language models. They discuss the collaborative requirements between technical and business teams to establish evaluation criteria for generative AI systems, highlighting the subjective nature of testing conversational outputs versus binary classification tasks. With the help […]...more1h 6minPlay
November 03, 2025I suppose a hack’s out of the question? – Adventures in LLM Cyber-securityPete and Alex dig into cybersecurity risks with AI agents and generative AI systems. They cover two main problems: people coding dodgy applications without security knowledge, and hackers directly exploiting AI agents that have access to tools and data. Despite the scary possibilities, they reckon most vulnerabilities are manageable with decent security practices. Practical AI […]...more1h 5minPlay
October 27, 2025GenAI, the state of it! Returns!Pete and Alex recap their recent AI discussions, covering why language model “hallucinations” are actually normal behaviour, how most AI proof-of-concepts fail due to poor ideas rather than technical issues, the impact on jobs and graduate recruitment, and the rise of AI coding agents that are reshaping software development. Practical AI – Dealing with increasingly […]...more1h 9minPlay
October 13, 2025No Surprises – Analysis of The GenAI Divide MIT ReportAlex and Pete discuss MIT’s study revealing 95% of GenAI projects fail despite massive enterprise investment. They explore why companies struggle to scale beyond pilots, the “shadow AI economy” of employees secretly using personal AI tools, and practical strategies for successful implementation including adaptive systems and treating AI procurement like business process outsourcing....more1h 9minPlay
October 03, 2025Terminal Velocity – LLMs and The Inexorable March to Text First UIsAlex explains his slow but unstoppable gravitation to text based interfaces and an “everything via the terminal” mentality. After all, who needs graphical operating systems? To do this, Pete and Alex take a ramble through the history of nerd-first user interfaces, discuss why keyboard layouts are stupid, how this relates to window based OSes and […]...more1h 18minPlay
September 22, 2025Peak LLM = Peak Swiss CheesePete and Alex ponder whether we are at the point where LLMs are as good as they are going to get, and what the implications of this are. This requires a dip into the murky depths of what businesses exist to do and how the randomness that the LLMs generate is antithetical to how businesses […]...more1h 2minPlay
FAQs about The Confusion Matrix:How many episodes does The Confusion Matrix have?The podcast currently has 33 episodes available.