Epicenter - Learn about Crypto, Blockchain, Ethereum, Bitcoin and Distributed Technologies

Allen Day: Google’s Mission to Provide Open Datasets for Public Blockchains


Listen Later

Public blockchains produce enormous amounts of data. In theory, anyone can access the raw contents of transaction and blocks. In practice, however, querying blockchains can prove to be a daunting task.

The difficulty lies in the fact that blockchains are particular types of distributed databases and thus carry several limitations. Most, if not all, blockchains lack the most basic SQL querying capabilities supported by nearly every off-the-shelf database system.

Take Bitcoin as an example. Its API lacks even the most basic calls which would allow a user to query any address and receive the balance. In order to achieve this, block explorers and alike have developed sophisticated middleware infrastructure that parses the blockchain, normalizes the data, and stores it in a database, where it can be queried. In the best of cases, companies offer API calls for only a limited set of operations. Google hopes to change this by freeing blockchain datasets.

We’re joined by Allen Day, Science Advocate at Google’s Singapore office. Earlier this year, he and his team released the Bitcoin blockchain as a public dataset in Big Query, Google big data IaaS offering. In August, they added Ethereum to their list of freely available public datasets, which includes US census data, cannabis genomes, and the entirety of Reddit and Github. Anyone wishing to query the data can do so in SQL on the Big Query website or via an API. For instance, a relatively simple query would return the daily mean transaction fees since the Genesis Block in just a few seconds.

Coupled with Google’s AI and Machine Learning infrastructure and other open data sets, one can only imagine the potentially groundbreaking insights we could gain from this data.

Topics covered in this episode:

  • Allen’s background as a geneticist
  • The similarities between blockchains and evolution process in lifeforms
  • Google’s cloud platform and its various components
  • Big Query and its publicly available datasets
  • The Bitcoin and Ethereum datasets in Big Query
  • Why this data is useful to the public and for what it may be used
  • The particular challenges in implementing Ethereum as opposed to Bitcoin
  • Insights we may gain by crossing blockchain dataset with other data
  • How machine learning and AI could help us better understand specific transaction patterns
  • Episode links:

    • Bitcoin in BigQuery: blockchain analytics on public data
    • Bitcoin Blockchain Public Dataset
    • Ethereum in BigQuery: a Public Dataset for smart contract analytics
    • Ethereum in BigQuery: how we built this dataset
    • Ethereum Blockchain Public Dataset
    • Change Agent by Daniel Suarez
    • Real-time Ethereum Notifications for Everyone for Free
    • ethjs-abi library, compiled for use in Google BigQuery
    • Kaggle: Your Home for Data Science
    • The Strange Inevitability of Evolution - Issue 20: Creativity - Nautilus
    • Reddit_
    • Google Cloud
    • Thank you to our sponsors for their support:

      • The open, decentralized trading protocol for ERC20 tokens using the Dutch auction mechanism. More at epicenter.tv/dutchx.
      • Deploy enterprise-ready consortium blockchain networks that scale in just a few clicks. More at aka.ms/epicenter.
      • This episode is hosted by Sébastien Couture. Show notes and listening options: epicenter.tv/254

        ...more
        View all episodesView all episodes
        Download on the App Store

        Epicenter - Learn about Crypto, Blockchain, Ethereum, Bitcoin and Distributed TechnologiesBy Epicenter Media Ltd.

        • 4.7
        • 4.7
        • 4.7
        • 4.7
        • 4.7

        4.7

        186 ratings


        More shows like Epicenter - Learn about Crypto, Blockchain, Ethereum, Bitcoin and Distributed Technologies

        View all
        We Study Billionaires - The Investor’s Podcast Network by The Investor's Podcast Network

        We Study Billionaires - The Investor’s Podcast Network

        3,358 Listeners

        Macro Voices by Hedge Fund Manager Erik Townsend

        Macro Voices

        3,072 Listeners

        The a16z Show by Andreessen Horowitz

        The a16z Show

        1,104 Listeners

        Unchained by Laura Shin

        Unchained

        1,205 Listeners

        Hidden Forces by Demetri Kofinas

        Hidden Forces

        1,462 Listeners

        Real Vision: Finance & Investing by Real Vision Podcast Network

        Real Vision: Finance & Investing

        905 Listeners

        CRYPTO 101 by Bryce Paul & Brendan Viehman

        CRYPTO 101

        39 Listeners

        The Breakdown by Blockworks

        The Breakdown

        738 Listeners

        The Pomp Podcast by Anthony Pompliano

        The Pomp Podcast

        1,839 Listeners

        Thinking Crypto News & Interviews by Tony Edward

        Thinking Crypto News & Interviews

        251 Listeners

        Bankless by Bankless

        Bankless

        1,047 Listeners

        The Wolf Of All Streets by Scott Melker

        The Wolf Of All Streets

        243 Listeners

        Coin Stories with Natalie Brunell by Natalie Brunell

        Coin Stories with Natalie Brunell

        444 Listeners

        Raoul Pal: The Journey Man by Real Vision Podcast Network

        Raoul Pal: The Journey Man

        128 Listeners

        Forward Guidance by Blockworks

        Forward Guidance

        277 Listeners