
Sign up to save your podcasts
Or


PaLM: Scaling Language Modeling with Pathways introduces the Pathways Language Model (PaLM), a 540-billion parameter, densely activated Transformer model. The researchers trained PaLM across 6144 TPU v4 chips using a new, highly efficient machine learning system called Pathways.
Key highlights from the paper include:
By Yun WuPaLM: Scaling Language Modeling with Pathways introduces the Pathways Language Model (PaLM), a 540-billion parameter, densely activated Transformer model. The researchers trained PaLM across 6144 TPU v4 chips using a new, highly efficient machine learning system called Pathways.
Key highlights from the paper include: