Machine Learning Archives - Software Engineering Daily

Data Science at Spotify with Boxun Zhang


Listen Later

“I normally try to sit together or very close to a product team or engineering team. And by doing so, I get very close to the source of all kinds of challenging problems.”

Spotify is a streaming music service that uses data science and machine learning to implement product features such as recommendation systems and music categorization, but also to answer internal questions.

Boxun Zhang is a data scientist at Spotify where he focuses on understanding user behavior within the product.

Questions
  • What is the overlap between distributed systems and data science?
  • How has Spotify’s big data architecture evolved over time?
  • As a data scientist do you need to understand this big data architecture well?
  • What were the benefits for starting to use Kafka?
  • What kinds of data science problems do you tackle at Spotify?
  • Could you describe what a random forest is?
  • Why are there so many streaming systems, and what do you use at Spotify?
  • How will data science change moving towards the future?
  • Links
    • The Evolution of Big Data at Spotify
    • Luigi
    • Project Jupyter
    • XGBoost
    • Automatic Statistician
    • Skytree
    • Sponsors

      Hired.com is the job marketplace for software engineers. Go to hired.com/softwareengineeringdaily to get a $600 bonus upon landing a job through Hired.

      Digital Ocean is the simplest cloud hosting provider. Use promo code SEDAILY for $10 in free credit.

      The post Data Science at Spotify with Boxun Zhang appeared first on Software Engineering Daily.

      ...more
      View all episodesView all episodes
      Download on the App Store

      Machine Learning Archives - Software Engineering DailyBy Machine Learning Archives - Software Engineering Daily

      • 4.4
      • 4.4
      • 4.4
      • 4.4
      • 4.4

      4.4

      69 ratings


      More shows like Machine Learning Archives - Software Engineering Daily

      View all
      The Changelog: Software Development, Open Source by Changelog Media

      The Changelog: Software Development, Open Source

      289 Listeners

      6 Minute English by BBC Radio

      6 Minute English

      1,756 Listeners

      Data Skeptic by Kyle Polich

      Data Skeptic

      479 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      625 Listeners

      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      585 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      302 Listeners

      Python Bytes by Michael Kennedy and Brian Okken

      Python Bytes

      214 Listeners

      NVIDIA AI Podcast by NVIDIA

      NVIDIA AI Podcast

      334 Listeners

      Machine Learning Guide by OCDevel

      Machine Learning Guide

      773 Listeners

      Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

      Syntax - Tasty Web Development Treats

      988 Listeners

      DataFramed by DataCamp

      DataFramed

      269 Listeners

      Practical AI by Practical AI LLC

      Practical AI

      211 Listeners

      AWS Podcast by Amazon Web Services

      AWS Podcast

      203 Listeners

      Google DeepMind: The Podcast by Hannah Fry

      Google DeepMind: The Podcast

      201 Listeners

      This Day in AI Podcast by Michael Sharkey, Chris Sharkey

      This Day in AI Podcast

      227 Listeners