DataTalks.Club

Building an Open-Source NLP Tool - Johannes Hötter



We talked about:

  • Johannes’s background
  • Johannes’s Open Source Spotlight demos – Refinery and Bricks
  • The difficulties of working with natural language processing (NLP)
  • Incorporating ChatGPT into a process as a heuristic
  • What is Bricks?
  • The process of starting a startup – Kern
  • Making the decision to go with open source
  • Pros and cons of launching as open source
  • Kern’s business model
  • Working with enterprises
  • Johannes as a salesperson
  • The team at Kern
  • Johannes’s role at Kern
  • How Johannes and Henrik separate responsibilities at Kern
  • Working with very niche use cases
  • The short story of how Kern got its funding
  • Johannes’s resource recommendation

Links:

    • Refinery's GitHub repo: https://github.com/code-kern-ai/refinery
    • Bricks' GitHub repo: https://github.com/code-kern-ai/bricks
    • Bricks Open Source Spotlight demo: https://www.youtube.com/watch?v=r3rXzoLQy2U
    • Refinery Open Source Spotlight demo: https://www.youtube.com/watch?v=LlMhN2f7YDg
    • Discord: https://discord.com/invite/qf4rGCEphW
    • Kern's website: https://www.kern.ai

    • Free data engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp

      Join DataTalks.Club: https://datatalks.club/slack.html

      Our events: https://datatalks.club/events.html
