
Sign up to save your podcasts
Or
The NovaSky team at UC Berkeley presents Sky-T1-32B-Preview, an open-source reasoning model achieving performance comparable to o1-preview at a significantly lower cost. This was accomplished by training a 32B parameter model using readily available open-source resources and a carefully curated dataset, including data from math and coding benchmarks. The researchers openly share their data, code, and model weights to foster collaboration and further development within the open-source community. Their findings highlight the importance of model size and data diversity in achieving strong reasoning capabilities, paving the way for more accessible and affordable advanced AI models.
Send us a text
Support the show
Podcast:
https://kabir.buzzsprout.com
YouTube:
https://www.youtube.com/@kabirtechdives
Please subscribe and share.
4.7
3333 ratings
The NovaSky team at UC Berkeley presents Sky-T1-32B-Preview, an open-source reasoning model achieving performance comparable to o1-preview at a significantly lower cost. This was accomplished by training a 32B parameter model using readily available open-source resources and a carefully curated dataset, including data from math and coding benchmarks. The researchers openly share their data, code, and model weights to foster collaboration and further development within the open-source community. Their findings highlight the importance of model size and data diversity in achieving strong reasoning capabilities, paving the way for more accessible and affordable advanced AI models.
Send us a text
Support the show
Podcast:
https://kabir.buzzsprout.com
YouTube:
https://www.youtube.com/@kabirtechdives
Please subscribe and share.
5,426 Listeners