FAQs about Podcast Archives - Software Engineering Daily:
How many episodes does Podcast Archives - Software Engineering Daily have?
The podcast currently has 1,332 episodes available.
May 27, 2021 - Data Exploration with a New Python Library with Doris Lee (48 min)
Data exploration uses visual exploration to understand what is in a dataset and the characteristics of the data. Data scientists explore data to understand things like customer behavior and resource utilization. Some common programming languages used for data exploration are Python, R, and Matlab. Doris Jung-Lin Lee is currently a Graduate Research Assistant at the...
May 27, 2021 - Data Management Systems and Artificial Intelligence with Arun Kumar (1 h 4 min)
Arun Kumar is an Assistant Professor in the Department of Computer Science and Engineering and the Halicioglu Data Science Institute at the University of California, San Diego. His primary research interests are in data management and systems for machine learning/artificial intelligence-based data analytics. Systems and ideas based on his research have been released as part...
May 25, 2021 - Firebolt: Data Warehouses with Eldad Farkash (58 min)
Cloud data warehouses are databases hosted in cloud environments. They provide typical benefits of the cloud like flexible data access, scalability, and performance. The company Firebolt provides a cloud data warehouse built for modern data environments. It decouples storage and compute to operate on top of existing data lakes like S3. It computes orders of...
May 24, 2021 - Portainer: Container Management with Neil Cresswell (51 min)
Running applications in containerized environments involves regularly organizing, adding, and replacing containers. This complex job may involve managing clusters of containers in different geographic locations with different configuration requirements. Platforms like Kubernetes are great for managing this complexity, but they come with steep learning curves that make it hard to get anything off the ground efficiently. The company Portainer provides a...
May 20, 2021 - Preset: Visualizing Big Data with Srini Kadamati (54 min)
Apache Superset is an open-source, fast, lightweight, and modern data exploration and visualization platform. It can connect to any SQL-based data source through SQLAlchemy at petabyte scale. Its architecture is highly scalable and it ships with a wide array of visualizations. The company Preset provides a powerful, easy-to-use data exploration and visualization...
May 20, 2021 - BaseTen: Creating Machine Learning APIs with Tuhin Srivastava and Amir Haghighat (53 min)
Application Programming Interfaces (APIs) are interfaces that enable multiple software applications to send and retrieve data from one another. They are commonly used for retrieving, saving, editing, or deleting data from databases, transmitting data between apps, and embedding third-party services into apps. The company BaseTen helps companies build and deploy machine learning APIs and applications....
May 18, 2021 - Skynet Labs: Decentralized Internet with Matthew Sevey (56 min)
The company Skynet Labs provides an open protocol for hosting data and web applications on the decentralized web. Skynet allows for decentralized, censorship-resistant, highly redundant storage and applications that are available around the globe. Developers don’t pay for their application’s storage, can launch apps with access to a user’s data right away, are free from...
May 17, 2021 - ClickHouse: Data Warehousing with Robert Hodges (45 min)
Columnar databases store and retrieve columns of data rather than rows of data. Each block of data in a columnar database stores up to 3 times as many records as a block in row-based storage. This means you can read data with a third of the power needed for row-based storage, among other advantages. The company Altinity is...
May 14, 2021 - Data Mechanics: Data Engineering with Jean-Yves Stephan (47 min)
Apache Spark is a popular open-source analytics engine for large-scale data processing. Applications can be written in Java, Scala, Python, R, and SQL. These applications have flexible deployment options, such as running on Kubernetes or in the cloud. The company Data Mechanics is a cloud-native Spark platform for data engineers. It runs continuously optimized Apache...
May 13, 2021 - Apache Hudi: Large Scale Data Systems with Vinoth Chandar (52 min)
Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development. This framework manages business requirements like data lifecycle more efficiently and improves data quality. Some common use cases for Hudi are record-level insert, update, and delete; simplified file management and near real-time data access; and simplified CDC...