Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
FAQs about Podcast Archives - Software Engineering Daily:How many episodes does Podcast Archives - Software Engineering Daily have?The podcast currently has 1,298 episodes available.
May 08, 2020Dropbox Engineering with Andrew FongDropbox is a consumer storage product with petabytes of data. Dropbox was originally started on the cloud, backed by S3. Once there was a high enough volume of data, Dropbox created its own data centers, designing hardware for the express purpose of storing user files. Over the last 13 years, Dropbox’s infrastructure has developed hardware,...more55minPlay
May 07, 2020Pravega: Storage for Streams with Flavio Junqueira“Data stream” is a word that can be used in multiple ways. A stream can refer to data in motion or data at rest. When a stream is data in motion, an endpoint is receiving new pieces of data on a continual basis. Each new data point is sent over the wire and captured by...more53minPlay
May 06, 2020Advanced Redis with Alvin RichardsRedis is an in-memory object storage system that is commonly used as a cache for web applications. This core primitive of in-memory object storage has created a larger ecosystem encompassing a broad set of tools. Redis is also used for creating objects such as queues, streams, and probabilistic data structures. Machine learning systems also need...more54minPlay
May 05, 2020Multicloud MySQL with Jiten Vaidya and Anthony YehFor many applications, a transactional MySQL database is the source of truth. To make a MySQL database scale, some developers deploy their database using Vitess, a sharding system built on top of Kubernetes. Jiten Vaidya and Anthony Yeh work at PlanetScale, a company that focuses on building and supporting MySQL databases sharded with Vitess. Their...more53minPlay
May 04, 2020Isolation with Courtland Allen and Anurag GoelWe are all living in social isolation due to the quarantine from COVID-19. Isolation is changing our habits and our moods, ravaging the economy, and changing how we work. One positive change is that more people have been reconnecting with their friends and family over frequent calls and video chats. Isolation is not a normal...more57minPlay
May 01, 2020Data Lakehouse with Michael ArmbrustA data warehouse is a system for performing fast queries on large amounts of data. A data lake is a system for storing high volumes of data in a format that is slow to access. A typical workflow for a data engineer is to pull data sets from this slow data lake storage into the...more1hPlay
April 30, 2020JAMStack Content Management with Scott Gallant, Jordan Patterson, and Nolan PhillipsA content management system (CMS) defines how the content on a website is arranged and presented. The most widely used CMS is WordPress, the open source tool that is written in PHP. A large percentage of the web consists of WordPress sites, and WordPress has a huge ecosystem of plugins and templates. Despite the success...more56minPlay
April 29, 2020Prefect Dataflow Scheduler with Jeremiah LowinA data workflow scheduler is a tool used for connecting multiple systems together in order to build pipelines for processing data. A data pipeline might include a Hadoop task for ETL, a Spark task for stream processing, and a TensorFlow task to train a machine learning model. The workflow scheduler manages the tasks in that...more1h 5minPlay
April 28, 2020CockroachDB with Peter MattisA relational database often holds critical operational data for a company, including user names and financial information. Since this data is so important, a relational database must be architected to avoid data loss. Relational databases need to be a distributed system in order to provide the fault tolerance necessary for production use cases. If a...more57minPlay
April 27, 2020Dask: Scalable Python with Matthew RocklinPython is the most widely used language for data science, and there are several libraries that are commonly used by Python data scientists including Numpy, Pandas, and scikit-learn. These libraries improve the user experience of a Python data scientist by giving them access to high level APIs. Data science is often performed over huge datasets,...more1h 2minPlay
FAQs about Podcast Archives - Software Engineering Daily:How many episodes does Podcast Archives - Software Engineering Daily have?The podcast currently has 1,298 episodes available.