Data Engineering Weekly

DEW #123: Generative AI at BuzzFeed, Building OnCall Culture & Dimensional Modeling at WhatNot


Listen Later

Welcome to another episode of Data Engineering Weekly Radio. Ananth and Aswin discussed a blog from BuzzFeed that shares lessons learned from building products powered by generative AI. The blog highlights how generative AI can be integrated into a company's work culture and workflow to enhance creativity rather than replace jobs. BuzzFeed provided their employees with intuitive access to APIs and integrated the technology into Slack for better collaboration.

Some of the lessons learned from BuzzFeed's experience include:

  1. Getting the technology into the hands of creative employees to amplify their creativity.

  2. Effective prompts are a result of close collaboration between writers and engineers.

  3. Moderation is essential and requires building guardrails into the prompts.

  4. Demystifying the technical concepts behind the technology can lead to better applications and tools.

  5. Educating users about the limitations and benefits of generative AI.

  6. The economics of using generative AI can be challenging, especially for hands-on business models.

  7. The conversation also touched upon the non-deterministic nature of generative AI systems, the importance of prompt engineering, and the potential challenges in integrating generative AI into data engineering workflows. As technology progresses, it is expected that the economics of generative AI will become more favorable for businesses.

    https://tech.buzzfeed.com/lessons-learned-building-products-powered-by-generative-ai-7f6c23bff376

    Moving on, We discuss the importance of on-call culture in data engineering teams. We emphasize the significance of data pipelines and their impact on businesses. With a focus on communication, ownership, and documentation, we highlight how data engineers should prioritize and address issues in data systems.

    We also discuss the importance of on-call rotation, runbooks, and tools like PagerDuty and Airflow to streamline alerts and responses. Additionally, we mention the value of having an on-call handoff process, where one engineer summarizes their experiences and alerts during their on-call period, allowing for improvements and a better understanding of common issues.

    Overall, this conversation stresses the need for a learning culture within data engineering teams, focusing on building robust systems, improving team culture, and increasing productivity.

    https://towardsdatascience.com/how-to-build-an-on-call-culture-in-a-data-engineering-team-7856fac0c99

    Finally, Ananth and Aswin discuss an article about adopting dimensional data modeling in hyper-growth companies. We appreciate the learning culture and emphasize balancing speed, maturity, scale, and stability.

    We highlight how dimensional modeling was initially essential due to limited computing and expensive storage. However, as storage became cheaper and computing more accessible, dimensional modeling was often overlooked, leading to data junkyards. In the current landscape, it's important to maintain business-aware domain-driven data marts and acknowledge that dimensional modeling still has a role.

    The conversation also touches upon the challenges of tracking slowly changing dimensions and the responsibility of data architects, engineers, and analytical engineers in identifying and implementing such dimensions. We discuss the need for a fine balance between design thinking and experimentation and stress the importance of finding the right mix of correctness and agility for each company.

    https://medium.com/whatnot-engineering/same-data-sturdier-frame-layering-in-dimensional-data-modeling-at-whatnot-5e6a548ee713


    ...more
    View all episodesView all episodes
    Download on the App Store

    Data Engineering WeeklyBy Ananth Packkildurai

    • 2.7
    • 2.7
    • 2.7
    • 2.7
    • 2.7

    2.7

    3 ratings


    More shows like Data Engineering Weekly

    View all
    Software Engineering Radio - the podcast for professional software developers by team@se-radio.net (SE-Radio Team)

    Software Engineering Radio - the podcast for professional software developers

    271 Listeners

    The Changelog: Software Development, Open Source by Changelog Media

    The Changelog: Software Development, Open Source

    290 Listeners

    Software Engineering Daily by Software Engineering Daily

    Software Engineering Daily

    623 Listeners

    Talk Python To Me by Michael Kennedy

    Talk Python To Me

    585 Listeners

    Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

    Super Data Science: ML & AI Podcast with Jon Krohn

    301 Listeners

    Data Engineering Podcast by Tobias Macey

    Data Engineering Podcast

    146 Listeners

    Y Combinator Startup Podcast by Y Combinator

    Y Combinator Startup Podcast

    226 Listeners

    DataFramed by DataCamp

    DataFramed

    269 Listeners

    Tech Brew Ride Home by Morning Brew

    Tech Brew Ride Home

    960 Listeners

    Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

    Kubernetes Podcast from Google

    181 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    207 Listeners

    The Real Python Podcast by Real Python

    The Real Python Podcast

    141 Listeners

    Big Technology Podcast by Alex Kantrowitz

    Big Technology Podcast

    500 Listeners

    The Data Engineering Show by The Firebolt Data Bros

    The Data Engineering Show

    8 Listeners

    Latent Space: The AI Engineer Podcast by swyx + Alessio

    Latent Space: The AI Engineer Podcast

    93 Listeners