
A deep dive into Meta's new Llama 4 AI models (Scout and Maverick) and the upcoming Behemoth. Plus, OpenAI's PaperBench reveals the limits of current AI at replicating research papers, with a success score of just 21%.
Sources:
[1] https://stadt-bremerhaven.de/meta-llama-4-neue-ki-modelle-vorgestellt/
[2] https://medium.com/@cognidownunder/paperbench-openais-new-benchmark-reshapes-how-we-evaluate-ai-research-capabilities-b6220e5a070e
By Matthias Lau