Show Notes
- (1:45) Willem discussed his undergraduate degree in Mechatronic Engineering at Stellenbosch University in the early 2010s.
- (2:34) Willem recalled his entrepreneurial journey founding and selling a networking startup that provided internet access to private residents on campus.
- (5:37) Willem worked for two years as a Software Engineer focusing on data systems at Systems Anywhere in Cape Town after college.
- (6:49) Willem talked about his move to Bangkok to work as a Senior Software Engineer at INDEFF, an industrial control systems company.
- (9:52) Willem went over his decision to join Gojek, a leading Indonesian on-demand multi-service platform and digital payment technology group.
- (12:16) Willem mentioned the engineering challenges associated with building complex data systems for super-apps.
- (14:50) Willem dissected Gojek’s ML platform, including these four solutions for various stages of the ML life cycle: Clockwork, Merlin, Feast, and Turing.
- (19:24) Willem recapped the lessons from designing the ML platform to meet Gojek’s scaling requirements — as delivered at Cloud Next 2018.
- (23:09) Willem briefly went through the key design components to incorporate Kubeflow pipelines into Gojek’s existing ML platform — as delivered at KubeCon 2019.
- (26:21) Willem explained the inception of Feast, an open-source feature store that bridges the gap between data and models.
- (32:20) Willem talked about prioritizing the product roadmap and engaging the community for an open-source project.
- (35:07) Willem recapped the key lessons learned and envisioned Feast's future as a lightweight, modular feature store.
- (37:29) Willem explained the differences between commercial and open-source feature stores (given Tecton’s recent backing of Feast).
- (41:36) Willem reflected on his experience living and working in Southeast Asia.
- (44:33) Closing segment.
Willem’s Contact Info
Mentioned Content
Feast
- Feast Project website: feast.dev
- Feast Slack community: #Feast
- Feast Documentation: docs.feast.dev
- Feast GitHub repository: feast-dev/feast
- Feast on StackOverflow: stackoverflow.com/questions/tagged/feast
- Feast Wiki: wiki.lfaidata.foundation/display/FEAST/Feast+Home
- Feast Twitter: @feast_dev
Articles
- An Introduction to Gojek’s Machine Learning Platform (2019)
- Introducing Feast: An Open-Source Feature Store For Machine Learning (2019)
- A State of Feast (2020)
- Why Tecton is Backing The Feast Open-Source Feature Store (2020)
Talks
- Lessons Learned Scaling Machine Learning at GoJek on Google Cloud (Cloud Next 2018)
- Accelerating Machine Learning App Development with Kubeflow Pipelines (Cloud Next 2019)
- Moving People and Products with Machine Learning on Kubeflow (KubeCon 2019)
People
- David Aronchick (Open-Source ML Strategy at Azure, Ex-PM for Kubernetes at Google, Co-Founder of Kubeflow, Advisor to Tecton)
- Jeremy Lewi (Principal Engineer at Primer.ai, Co-Founder of Kubeflow)
- Felipe Hoffa (Developer Advocate for BigQuery, Data Cloud Advocate for Snowflake)
Book
- Cal Newport’s “Deep Work”
Willem will be a speaker at Tecton’s apply() virtual conference (April 21-22, 2021) for data and ML teams to discuss the practical data engineering challenges faced when building ML for the real world. Participants will share the best-practice development patterns, tools of choice, and emerging architectures they use to successfully build and manage production ML applications. Everything is on the table, from managing labeling pipelines to transforming features in real time and serving at scale. Register for free now at https://www.applyconf.com/