AI Radio FM - Technology Channel: GShard and Giant Models
A deep dive into GShard, a module for scaling giant neural networks, focusing on its application to multilingual machine translation and its impact on training efficiency and model quality.
AI Radio FM - Technology Channel: GShard and Giant Models
A deep dive into GShard, a module for scaling giant neural networks, focusing on its application to multilingual machine translation and its impact on training efficiency and model quality.