cloud2030

Back After a Break


Listen Later

In this episode, we discuss the rising cost of using AI and how usage-based pricing, model changes, and capacity limits are affecting daily work as AI moves from experimentation into operational use. We also talk about multi-model workflows, hybrid infrastructure, and examples of using hosted models alongside open models locally for tasks such as writing and named entity resolution. We get into the need for enterprises to run their own AI infrastructure, including questions around GPU pooling, routing, reservation, data sovereignty, and service levels.
Transcript: https://otter.ai/u/pihJkUzDWWcqBnWyM24CIyxX4Qs?utm_source=copy_url
...more
View all episodesView all episodes
Download on the App Store

cloud2030By the2030.cloud Podcast

  • 4.5
  • 4.5
  • 4.5
  • 4.5
  • 4.5

4.5

4 ratings