
Sign up to save your podcasts
Or
When you are working on a data pipeline for ML ... you are never dealing with a single table. It always demands different tables for different reasons that all have to be mashed together in order to have something that you can learn from. But if that is the case, why do we spend so much time talking about ML pipelines that only work on a single table? Madelon Hulsebos has a Phd on the topic and so we figured that we might ask her.
As mentioned in the podcast, here is the link to Madelon's homepage. https://www.madelonhulsebos.com/
Some links to interesting articles from Madelon, as well as her homepage, can be found below. https://www.madelonhulsebos.com/assets/dataset_search_survey.pdf
https://dl.acm.org/doi/pdf/10.1145/3654975
https://dl.acm.org/doi/pdf/10.1145/3588710
5
22 ratings
When you are working on a data pipeline for ML ... you are never dealing with a single table. It always demands different tables for different reasons that all have to be mashed together in order to have something that you can learn from. But if that is the case, why do we spend so much time talking about ML pipelines that only work on a single table? Madelon Hulsebos has a Phd on the topic and so we figured that we might ask her.
As mentioned in the podcast, here is the link to Madelon's homepage. https://www.madelonhulsebos.com/
Some links to interesting articles from Madelon, as well as her homepage, can be found below. https://www.madelonhulsebos.com/assets/dataset_search_survey.pdf
https://dl.acm.org/doi/pdf/10.1145/3654975
https://dl.acm.org/doi/pdf/10.1145/3588710
470 Listeners
586 Listeners
439 Listeners
214 Listeners
988 Listeners
268 Listeners
189 Listeners
62 Listeners
137 Listeners
67 Listeners
58 Listeners
27 Listeners
21 Listeners
2 Listeners
54 Listeners