Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
FAQs about AI Podcast:How many episodes does AI Podcast have?The podcast currently has 399 episodes available.
January 06, 2025Flow Matching for Generative ModelingA podcast discussing the new paradigm for generative modeling using Continuous Normalizing Flows (CNFs) called Flow Matching (FM). FM offers a simulation-free approach for training CNFs by regressing vector fields of fixed conditional probability paths, which enables training CNFs at unprecedented scale and allows for the use of different probability paths....more7minPlay
January 05, 2025Swin Transformer: A New Vision TransformerA podcast discussing the Swin Transformer, a hierarchical vision transformer using shifted windows for computer vision tasks....more8minPlay
January 05, 2025ConvNeXt: A Modern ConvNet for the 2020sA podcast discussing the architecture and performance of ConvNeXt, a modern ConvNet model that challenges the dominance of Vision Transformers....more7minPlay
January 05, 2025AI Vision Podcast: Masked Autoencoders for Scalable Vision LearningA deep dive into Masked Autoencoders (MAE) and their impact on computer vision, discussing their architecture, training efficiency, and performance on ImageNet and downstream tasks....more6minPlay
January 04, 2025AI Radio FM - Technology Channel, Your Personal Generative AI PodcastA podcast discussing the auxiliary-loss-free load balancing strategy for mixture-of-experts models....more6minPlay
January 04, 2025混合专家模型(MoE)技术综述本播客深入探讨了混合专家模型(MoE)的最新进展、算法设计、系统实现以及实际应用。从稀疏和密集MoE的背景知识开始,我们提出了一个创新的MoE分类法,并探讨了选通函数、专家网络、训练方案和系统设计方面的复杂性,从而全面了解MoE。...more6minPlay
January 04, 2025零气泡流水线并行本期播客深入探讨了零气泡流水线并行技术,这是一种旨在提高大规模分布式训练效率的创新方法。我们分析了传统流水线并行方法中的气泡问题,并介绍了如何通过精细化调度和优化器同步绕过技术来实现零气泡。此外,我们还讨论了自动调度算法、内存优化策略以及实验结果,旨在为听众提供一个全面而深入的技术解析。...more7minPlay
January 04, 2025GShard: Scaling Giant Models with Conditional Computation and Automatic ShardingA podcast discussion about GShard, a module for scaling neural networks using conditional computation and automatic sharding, focusing on its application to multilingual machine translation....more7minPlay
January 04, 2025AI Radio FM - Technology Channel: GShard and Giant ModelsA deep dive into GShard, a module for scaling giant neural networks, focusing on its application to multilingual machine translation and its impact on training efficiency and model quality....more9minPlay
January 04, 2025混合张量专家数据并行方法优化混合专家训练深入探讨 DeepSpeed-TED,一种新颖的三维混合并行框架,用于训练具有大型基础模型的混合专家模型。我们讨论了内存优化、通信优化以及与现有方法的性能比较。...more6minPlay
FAQs about AI Podcast:How many episodes does AI Podcast have?The podcast currently has 399 episodes available.