AWS re:Invent 2014

(BDT309) Delivering Results with Amazon Redshift, One Petabyte at a Time | AWS re:Invent 2014


Listen Later

The Amazon Enterprise Data Warehouse team, responsible for data warehousing across all of Amazon's divisions, spent 2014 working with Amazon Redshift on its largest datasets, including web log traffic. The key goals in this project were to provide a viable, enterprise-grade solution that enabled full scans of 2 trillion rows in under an hour at load. Key to success were automation of routine DW tasks that become complicated at scale: backfilling erroneous data, re-calculating statistics, re-sorting daily additions, and so forth. In this session, we discuss the scale and performance of a 100-node 1PB Amazon Redshift cluster, as well as describing some of the technical aspects and best practices of running 100-node clusters in an enterprise environment.
...more
View all episodesView all episodes
Download on the App Store

AWS re:Invent 2014By Amazon Web Services