Screaming in the Cloud

Episode 15: Nagios was the Original Call of Duty


Listen Later

Let’s chat about the Cloud and everything in between. The people in this world are pretty comfortable with not running physical servers on their own, but trusting someone else to run them. Yet, people suffer from the psychological barrier of thinking they need to build, design, and run their own monitoring system. Fortunately, more companies are turning to Datadog.

Today, we’re talking to Ilan Rabinovitch, Datadog’s vice president of product and community. He spends his days diving into container monitoring metrics, collaborating with Datadog’s open source community, and evangelizing observability best practices. Previously, Ilan led infrastructure and reliability engineering teams at various organizations, including Ooyala and Edmunds.com. He’s active in the open source and DevOps communities, where he is a co-organizer of events, such as SCALE and Texas Linux Fest.

Some of the highlights of the show include:

  • Datadog is well-known, especially because it is a frequent sponsor
  • More organizations know their core competency is not monitoring or managing servers
  • Monitoring/metrics is a big data problem; Datadog takes monitoring off your plate
  • Alternate ways, other than using Nagios, to monitor instances and regenerate configurations
  • Datadog is first to identify patterns when there is a widespread underlying infrastructure issue
  • Trends of moving from on-premise to Cloud; serverless is on the horizon
  • How trends affect evolution of Datadog; adjusting tools to monitor customers’ environments
  • Datadog’s scope is enormous; the company tries to present relevant information as the scale of what it’s watching continues to grow
  • Datadog’s pricing is straightforward and simple to understand; how much Cloud providers charge to use Datadog is less clear
  • Single Pane of Glass: Too much data to gather in small areas (dashboards)  
  • Why didn’t monitoring catch this? Alerts need to be actionable and relevant
  • How to use Datadog’s workflow for setting alerts and work metrics
  • Datadog’s first Dash user conference will be held in July in New York; addresses how to solve real business problems, how to scale/speed up your organization
  • Links:

    • Ilan Rabinovitch on Twitter
    • Datadog
    • Docker Adoption Survey Results  
    • Rubric for Setting Alerts/Work Metrics
    • Dash Conference
    • re:Invent
    • Nagios
    • .
      ...more
      View all episodesView all episodes
      Download on the App Store

      Screaming in the CloudBy Corey Quinn

      • 4.7
      • 4.7
      • 4.7
      • 4.7
      • 4.7

      4.7

      92 ratings


      More shows like Screaming in the Cloud

      View all
      Software Engineering Radio by se-radio@computer.org

      Software Engineering Radio

      271 Listeners

      Hanselminutes with Scott Hanselman by Scott Hanselman

      Hanselminutes with Scott Hanselman

      383 Listeners

      The Changelog: Software Development, Open Source by Changelog Media

      The Changelog: Software Development, Open Source

      289 Listeners

      The a16z Show by Andreessen Horowitz

      The a16z Show

      1,092 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      622 Listeners

      The Cloudcast by Massive Studios

      The Cloudcast

      151 Listeners

      Thoughtworks Technology Podcast by Thoughtworks

      Thoughtworks Technology Podcast

      43 Listeners

      Y Combinator Startup Podcast by Y Combinator

      Y Combinator Startup Podcast

      225 Listeners

      Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

      Syntax - Tasty Web Development Treats

      987 Listeners

      AWS Podcast by Amazon Web Services

      AWS Podcast

      202 Listeners

      AWS Morning Brief by Corey Quinn

      AWS Morning Brief

      79 Listeners

      The Stack Overflow Podcast by The Stack Overflow Podcast

      The Stack Overflow Podcast

      63 Listeners

      Dwarkesh Podcast by Dwarkesh Patel

      Dwarkesh Podcast

      517 Listeners

      Oxide and Friends by Oxide Computer Company

      Oxide and Friends

      62 Listeners

      The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

      The AI Daily Brief: Artificial Intelligence News and Analysis

      616 Listeners