
Sign up to save your podcasts
Or
With the last episode in 2024, I dare to release an solo episode, summarizing my christmas research on the topics of
- Small Language models
- Agentic Systems
- Advanced Reasoning / Test time compute paradigm
I hope you find it interesting and useful!
All the best for 2025!
## AAIP Community
Join our discord server and ask guest directly or discuss related topics with the community.
https://discord.gg/5Pj446VKNU
## TOC
00:00:05 Intro
00:01:52 Part 1 - Small Language Models
00:20:16 Part 2 - Agentic Systems
00:36:16 Part 3 - Advanced Reasoning
00:58:08 Outro
## References
- Testing Qwen2.5 - https://huggingface.co/spaces/Qwen/Qwen2.5
- Qwen2.5 Technical report - https://arxiv.org/pdf/2412.15115
- Agents: https://www.superannotate.com/blog/llm-agents
- Scaling Test-time compute: https://arxiv.org/html/2408.03314v1
- Test time compute: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute
- O3 achieving 88% on ARC-AGI https://arcprize.org/blog/oai-o3-pub-breakthrough
- https://arxiv.org/html/2409.01374v1 - Human performance on ARC-AGI 76%
With the last episode in 2024, I dare to release an solo episode, summarizing my christmas research on the topics of
- Small Language models
- Agentic Systems
- Advanced Reasoning / Test time compute paradigm
I hope you find it interesting and useful!
All the best for 2025!
## AAIP Community
Join our discord server and ask guest directly or discuss related topics with the community.
https://discord.gg/5Pj446VKNU
## TOC
00:00:05 Intro
00:01:52 Part 1 - Small Language Models
00:20:16 Part 2 - Agentic Systems
00:36:16 Part 3 - Advanced Reasoning
00:58:08 Outro
## References
- Testing Qwen2.5 - https://huggingface.co/spaces/Qwen/Qwen2.5
- Qwen2.5 Technical report - https://arxiv.org/pdf/2412.15115
- Agents: https://www.superannotate.com/blog/llm-agents
- Scaling Test-time compute: https://arxiv.org/html/2408.03314v1
- Test time compute: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute
- O3 achieving 88% on ARC-AGI https://arcprize.org/blog/oai-o3-pub-breakthrough
- https://arxiv.org/html/2409.01374v1 - Human performance on ARC-AGI 76%
17 Listeners