Machine Learning Archives - Software Engineering Daily

Data Science at Spotify with Boxun Zhang


Listen Later

“I normally try to sit together or very close to a product team or engineering team. And by doing so, I get very close to the source of all kinds of challenging problems.”

Spotify is a streaming music service that uses data science and machine learning to implement product features such as recommendation systems and music categorization, but also to answer internal questions.

Boxun Zhang is a data scientist at Spotify where he focuses on understanding user behavior within the product.

Questions
  • What is the overlap between distributed systems and data science?
  • How has Spotify’s big data architecture evolved over time?
  • As a data scientist do you need to understand this big data architecture well?
  • What were the benefits for starting to use Kafka?
  • What kinds of data science problems do you tackle at Spotify?
  • Could you describe what a random forest is?
  • Why are there so many streaming systems, and what do you use at Spotify?
  • How will data science change moving towards the future?
  • Links
    • The Evolution of Big Data at Spotify
    • Luigi
    • Project Jupyter
    • XGBoost
    • Automatic Statistician
    • Skytree
    • Sponsors

      Hired.com is the job marketplace for software engineers. Go to hired.com/softwareengineeringdaily to get a $600 bonus upon landing a job through Hired.

      Digital Ocean is the simplest cloud hosting provider. Use promo code SEDAILY for $10 in free credit.

      The post Data Science at Spotify with Boxun Zhang appeared first on Software Engineering Daily.

      ...more
      View all episodesView all episodes
      Download on the App Store

      Machine Learning Archives - Software Engineering DailyBy Machine Learning Archives - Software Engineering Daily

      • 4.4
      • 4.4
      • 4.4
      • 4.4
      • 4.4

      4.4

      69 ratings


      More shows like Machine Learning Archives - Software Engineering Daily

      View all
      The Changelog: Software Development, Open Source by Changelog Media

      The Changelog: Software Development, Open Source

      284 Listeners

      Data Skeptic by Kyle Polich

      Data Skeptic

      480 Listeners

      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      590 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      621 Listeners

      The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

      The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

      441 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      297 Listeners

      Python Bytes by Michael Kennedy and Brian Okken

      Python Bytes

      215 Listeners

      NVIDIA AI Podcast by NVIDIA

      NVIDIA AI Podcast

      322 Listeners

      Machine Learning Guide by OCDevel

      Machine Learning Guide

      763 Listeners

      Practical AI by Practical AI LLC

      Practical AI

      192 Listeners

      Google DeepMind: The Podcast by Hannah Fry

      Google DeepMind: The Podcast

      198 Listeners

      Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

      Machine Learning Street Talk (MLST)

      87 Listeners

      AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning by Jaeden Schafer

      AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

      141 Listeners

      This Day in AI Podcast by Michael Sharkey, Chris Sharkey

      This Day in AI Podcast

      201 Listeners

      The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

      The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

      462 Listeners