
Sign up to save your podcasts
Or


Bloomberg Intelligence Head of Technology Research Mandeep Singh is joined by Nicole Hu, a Silicon Valley technology veteran and GLG expert, to explore the implications of Google’s TurboQuant paper and the evolving economics of AI infrastructure. As hyperscalers look to improve the efficiency of AI workloads, advances in quantization are redefining the tradeoffs between memory and compute, with far-reaching implications for cost, latency, and datacenter architecture. They examine how new approaches to model optimization and inference could reshape hardware requirements, deployment strategies, and the next wave of AI investment.
By Bloomberg4.6
1414 ratings
Bloomberg Intelligence Head of Technology Research Mandeep Singh is joined by Nicole Hu, a Silicon Valley technology veteran and GLG expert, to explore the implications of Google’s TurboQuant paper and the evolving economics of AI infrastructure. As hyperscalers look to improve the efficiency of AI workloads, advances in quantization are redefining the tradeoffs between memory and compute, with far-reaching implications for cost, latency, and datacenter architecture. They examine how new approaches to model optimization and inference could reshape hardware requirements, deployment strategies, and the next wave of AI investment.

1,290 Listeners

543 Listeners

402 Listeners

2,185 Listeners

422 Listeners

1,096 Listeners

2,379 Listeners

970 Listeners

342 Listeners

196 Listeners

63 Listeners

30 Listeners

561 Listeners

507 Listeners

349 Listeners

5 Listeners

58 Listeners

233 Listeners

233 Listeners

66 Listeners

145 Listeners

76 Listeners

86 Listeners

401 Listeners

18 Listeners

12 Listeners

7 Listeners

2 Listeners

115 Listeners