The Real Python Podcast

Automate Tasks With Python & Building a Small Search Engine


Listen Later

What are the typical computer tasks you do manually every week? Could you automate those tasks with a Python script? Christopher Trudeau is back on the show this week, bringing another batch of PyCoder’s Weekly articles and projects.

We discuss a recent Hacker News thread about frequently used automation scripts. We share the kinds of tasks we’ve automated with Python in our work and personal lives.

Christopher shares a tutorial about building a micro-search engine from scratch using Python. The post takes you through coding the components of a crawler, index, and ranker. The finished engine is designed to search the posts of the blogs you follow.

We also share several other articles and projects from the Python community, including a news roundup, how a Polars query works under the hood, using Python for data analysis, understanding open-source licensing, summarizing the significant changes between Python versions, a robust TUI hex editor, and a lightweight dataframe library with a universal interface for data wrangling.

This week’s episode is brought to you by Intel.

Course Spotlight: Building Command Line Interfaces With argparse

In this step-by-step Python video course, you’ll learn how to take your command line Python scripts to the next level by adding a convenient command line interface that you can write with argparse.

Topics:

  • 00:00:00 – Introduction
  • 00:02:23 – uv: Python Packaging in Rust
  • 00:02:43 – Rye Grows With uv
  • 00:03:20 – Python 3.13.0 Alpha 4 Is Now Available
  • 00:03:45 – A Bird’s Eye View of Polars
  • 00:07:28 – Polars: Why We Have Rewritten the String Data Type
  • 00:09:33 – A Search Engine in 80 Lines of Python
  • 00:13:14 – Using Python for Data Analysis
  • 00:18:22 – Sponsor: Intel
  • 00:18:53 – Understanding Open Source Licensing
  • 00:21:54 – Summary of Major Changes Between Python Versions
  • 00:23:19 – What Python automation scripts do you reuse frequently at work?
  • 00:34:21 – Video Course Spotlight
  • 00:35:52 – hexabyte: A modern, modular, and robust TUI hex editor
  • 00:39:56 – ibis: The Flexibility of Python With the Scale of Modern SQL
  • 00:43:31 – Thanks and goodbye
  • News:

    • uv: Python Packaging in Rust – uv is an extremely fast Python package installer and resolver, designed as a drop-in alternative to pip and pip-tools. This post introduces you to uv and shows some of its performance numbers.
    • Rye Grows With uv - Armin Ronacher’s Thoughts and Writings
    • Python 3.13.0 Alpha 4 Is Now Available
    • Show Links:

      • A Bird’s Eye View of Polars – This post on the Polars blog introduces you to how Polars works, showing the steps from queries, plans, optimizations, and then the final execution.
      • Polars: Why We Have Rewritten the String Data Type – A large refactor on the string data type is underway in Polars. This deep dive explains why and what is changing.
      • A Search Engine in 80 Lines of Python – In this post Alex explains how he built a micro-search engine from scratch using Python. The resulting search engine is used to search in the posts of the blogs he follows.
      • Using Python for Data Analysis – In this tutorial, you’ll learn the importance of having a structured data analysis workflow, and you’ll get the opportunity to practice using Python for data analysis while following a common workflow process.
      • Understanding Open Source Licensing – This article discusses the importance of open-source licensing in software development and its implications for stakeholders.
      • Summary of Major Changes Between Python Versions – This article is a quick reference covering the major changes introduced with each new version of Python. Can’t remember when the walrus operator was introduced? This is the place to look that up.
      • Discussion:

        • What Python automation scripts do you reuse frequently at work? - Hacker News
        • Projects

          • hexabyte: A modern, modular, and robust TUI hex editor
          • ibis: The Flexibility of Python With the Scale of Modern SQL
          • Additional Links:

            • Episode #193: Wes McKinney on Improving the Data Stack & Composable Systems – The Real Python Podcast
            • Open Source Licensing: Software Freedom and Intellectual Property Law
            • What’s in which Python - Ned Batchelder
            • Automate the Boring Stuff with Python - Al Sweigart
            • Working With Files in Python – Real Python
            • Build Command-Line Interfaces With Python’s argparse – Real Python
            • Level up your Python skills with our expert-led courses:

              • Defining Python Functions With Optional Arguments
              • Practical Recipes for Working With Files in Python
              • Building Command Line Interfaces With argparse
              • Support the podcast & join our community of Pythonistas

                ...more
                View all episodesView all episodes
                Download on the App Store

                The Real Python PodcastBy Real Python

                • 4.7
                • 4.7
                • 4.7
                • 4.7
                • 4.7

                4.7

                136 ratings


                More shows like The Real Python Podcast

                View all
                Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

                Software Engineering Radio - the podcast for professional software developers

                272 Listeners

                The Changelog: Software Development, Open Source by Changelog Media

                The Changelog: Software Development, Open Source

                283 Listeners

                Data Skeptic by Kyle Polich

                Data Skeptic

                481 Listeners

                Talk Python To Me by Michael Kennedy

                Talk Python To Me

                592 Listeners

                Software Engineering Daily by Software Engineering Daily

                Software Engineering Daily

                624 Listeners

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                443 Listeners

                Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                Super Data Science: ML & AI Podcast with Jon Krohn

                296 Listeners

                Python Bytes by Michael Kennedy and Brian Okken

                Python Bytes

                213 Listeners

                Data Engineering Podcast by Tobias Macey

                Data Engineering Podcast

                142 Listeners

                Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                Syntax - Tasty Web Development Treats

                982 Listeners

                CoRecursive: Coding Stories by Adam Gordon Bell - Software Developer

                CoRecursive: Coding Stories

                189 Listeners

                DataFramed by DataCamp

                DataFramed

                266 Listeners

                Practical AI by Practical AI LLC

                Practical AI

                189 Listeners

                The Stack Overflow Podcast by The Stack Overflow Podcast

                The Stack Overflow Podcast

                64 Listeners

                Latent Space: The AI Engineer Podcast by swyx + Alessio

                Latent Space: The AI Engineer Podcast

                77 Listeners