The Real Python Podcast

Preparing Data Science Projects for Production


How do you prepare your Python data science projects for production? What are the essential tools and techniques to make your code reproducible, organized, and testable? This week on the show, Khuyen Tran from CodeCut discusses her new book, “Production Ready Data Science.”

Khuyen explains how she got into blogging and what motivated her to write a book. She shares tips on creating repeatable workflows. We also delve into modern Python tools that help you bring your projects to production.

Topics:

  • 00:00:00 – Introduction
  • 00:01:27 – Recent article about top six visualization libraries
  • 00:02:19 – How long have you been blogging?
  • 00:03:55 – What do you cover in your book?
  • 00:07:07 – Potential issues with notebooks
  • 00:11:40 – Structuring data science projects
  • 00:15:12 – Reproducibility and sharing notebooks
  • 00:20:33 – Using Polars
  • 00:26:03 – Advantages of marimo notebooks
  • 00:34:21 – Video Course Spotlight
  • 00:35:44 – Shipping a project in data science
  • 00:42:10 – Advice on testing
  • 00:49:50 – Creating importable parameter values
  • 00:53:55 – Seeing the commit diff of a notebook
  • 00:55:12 – What are you excited about in the world of Python?
  • 00:56:04 – What do you want to learn next?
  • 00:56:52 – What’s the best way to follow your work online?
  • 00:58:28 – Thanks and goodbye

Show Links:

  • Production Ready Data Science by Khuyen Tran - CodeCut
  • CodeCut
  • Top 6 Python Libraries for Visualization: Which One to Use? - CodeCut
  • Ruff
  • uv
  • Cookiecutter
  • marimo - a next-generation Python notebook
  • Episode #230: marimo: Reactive Notebooks and Deployable Web Apps in Python
  • Polars — DataFrames for the new era
  • Episode #260: Harnessing the Power of Python Polars
  • Narwhals
  • Episode #224: Narwhals: Expanding DataFrame Compatibility Between Libraries
  • pytest documentation
  • nbdime: Tools for diffing and merging of Jupyter notebooks.
  • LangChain
  • Build Production-Ready LLM Agents with LangChain 1.0 Middleware - CodeCut
  • Build an LLM RAG Chatbot With LangChain
  • Khuyen Tran - LinkedIn
  • Khuyen Tran (@KhuyenTran16) - X

Level up your Python skills with our expert-led courses:

  • Working With Python Polars
  • Getting Started With marimo Notebooks
  • Python Project Management With uv

Support the podcast & join our community of Pythonistas
