
Sign up to save your podcasts
Or


Deep dive into continual learning challenges, Microsoft's Code Researcher breakthrough, Apple's new foundation models, and debates on reward hacking. Plus, insights on PyTorch's DCP update and Ivanka Trump's AI endorsement.
Sources:
[1] https://thezvi.substack.com/p/dwarkesh-patel-on-continual-learning
[2] https://pytorch.org/blog/huggingface-safetensors-support-in-pytorch-distributed-checkpointing/
[3] https://huggingface.co/blog/screensuite
[4] https://metr.org/blog/2025-06-05-recent-reward-hacking/
[5] https://www.microsoft.com/en-us/research/publication/code-researcher-deep-research-agent-for-large-systems-code-and-commit-history/
[6] https://www.greaterwrong.com/posts/pCMmLiBcHbKohQgwA/i-replicated-the-anthropic-alignment-faking-experiment-on
[7] https://machinelearning.apple.com/research/apple-foundation-models-2025-updates
By Matthias LauDeep dive into continual learning challenges, Microsoft's Code Researcher breakthrough, Apple's new foundation models, and debates on reward hacking. Plus, insights on PyTorch's DCP update and Ivanka Trump's AI endorsement.
Sources:
[1] https://thezvi.substack.com/p/dwarkesh-patel-on-continual-learning
[2] https://pytorch.org/blog/huggingface-safetensors-support-in-pytorch-distributed-checkpointing/
[3] https://huggingface.co/blog/screensuite
[4] https://metr.org/blog/2025-06-05-recent-reward-hacking/
[5] https://www.microsoft.com/en-us/research/publication/code-researcher-deep-research-agent-for-large-systems-code-and-commit-history/
[6] https://www.greaterwrong.com/posts/pCMmLiBcHbKohQgwA/i-replicated-the-anthropic-alignment-faking-experiment-on
[7] https://machinelearning.apple.com/research/apple-foundation-models-2025-updates