The Azure Podcast

Episode 504 - Azure Reliability SRE


Listen Later

Sadaf Khan joins Evan and Russell to explain and talk about Service Reliability Engineering in the Azure engineering group.

 

Media file: https://azpodcast.blob.core.windows.net/episodes/Episode504.mp3

YouTube: https://www.youtube.com/watch?v=QNGdTnb1W90&t=1684s

 

  • Public Preview: Customer managed planned failover for Azure Storage
  • Public Preview: Instance Mix on Virtual Machine Scale Sets
  • Generally Available: Workspaces in Azure API Management
  • Generally Available: Azure NetApp Files storage with cool access for all service levels
  • Generally Available: Larger Enterprise tier cache instances for Azure Cache for Redis
  • Generally Available: Azure Red Hat OpenShift Now Supports Clusters Up to 250 Nodes
  • Key Topics:

    • Azure Reliability SRE: Evan introduced the episode's focus on Azure reliability SRE and mentioned a special guest, Sadaf, who would provide insights on the topic. 0:19
    • Azure Storage Public Preview Feature: Russell discussed a new public preview feature for Azure storage that allows customers to manage planned failovers, enhancing the service's reliability. 1:10
    • Virtual Machine Scale Set Update: Russell highlighted an update to virtual machine scale sets that allows mixing different instances, improving flexibility and scalability. 1:38
    • Azure API Management Workspace: Russell introduced a new feature in Azure API management that enables teams to have more autonomy in managing and publishing APIs. 2:08
    • NetApp Files Storage Update: Russell mentioned the general availability of cool access for NetApp files storage, allowing for more cost-effective data storage based on access patterns. 2:40
    • Redis Cache Update: Russell discussed a new tier for Redis Cache that supports larger enterprises with increased memory and compute capabilities. 3:02
    • Azure Red Hat Openshift Update: Russell shared an update on Azure Red Hat Openshift, which now supports up to 250 nodes, significantly increasing scalability. 3:29
    • SRE Role and Impact: Sadaf explained the role of SRE in improving service reliability and quality, detailing their engagement model with various Azure services. 4:52
    • SRE Engagement and Resistance: Sadaf shared insights on the initial resistance faced from service teams during SRE engagements and how trust is built over time to allow for more impactful changes. 7:49
    • SRE's Approach to Service Improvement: Sadaf outlined the SRE team's structured approach to service improvement, focusing on fundamentals, service health, operational efficiency, and scalability. 10:51
    • AI Initiatives in SRE: Sadaf discussed the SRE team's initiatives in leveraging AI to analyze incident data and generate insights, aiming to reduce the cognitive load on engineers. 30:27
    • ...more
      View all episodesView all episodes
      Download on the App Store

      The Azure PodcastBy Cynthia Kreng, Kendall Roden, Cale Teeter, Evan Basalik, Russell Young and Sujit D'Mello

      • 4.6
      • 4.6
      • 4.6
      • 4.6
      • 4.6

      4.6

      44 ratings


      More shows like The Azure Podcast

      View all
      Hanselminutes with Scott Hanselman by Scott Hanselman

      Hanselminutes with Scott Hanselman

      377 Listeners

      Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

      Software Engineering Radio - the podcast for professional software developers

      272 Listeners

      .NET Rocks! by Carl Franklin and Richard Campbell

      .NET Rocks!

      244 Listeners

      The Cloudcast by Massive Studios

      The Cloudcast

      153 Listeners

      a16z Podcast by Andreessen Horowitz

      a16z Podcast

      1,030 Listeners

      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      592 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      625 Listeners

      The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

      The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

      444 Listeners

      AWS Podcast by Amazon Web Services

      AWS Podcast

      202 Listeners

      Python Bytes by Michael Kennedy and Brian Okken

      Python Bytes

      213 Listeners

      NVIDIA AI Podcast by NVIDIA

      NVIDIA AI Podcast

      323 Listeners

      Kubernetes Podcast from Google by Abdel Sghiouar, Kaslin Fields

      Kubernetes Podcast from Google

      181 Listeners

      Azure & DevOps Podcast by Jeffrey Palermo

      Azure & DevOps Podcast

      20 Listeners

      Ctrl+Alt+Azure by Tobias Zimmergren, Jussi Roine

      Ctrl+Alt+Azure

      12 Listeners

      Last Week in AI by Skynet Today

      Last Week in AI

      288 Listeners