
Sign up to save your podcasts
Or


This is the third in a 4-part series where Anders Larson and Shea Parkes discuss predictive analytics with high cardinality features. In this episode they focus on feature engineering via the hashing trick. The hashing trick is most applicable for extremely high cardinality, and at first glance can seem almost ridiculous. In a lot of ways, it is the same as bucketing values at random. But there are times that it is more valuable to include randomly engineered buckets than to exclude the original high cardinality feature entirely.
By Society of Actuaries (SOA)4.6
3131 ratings
This is the third in a 4-part series where Anders Larson and Shea Parkes discuss predictive analytics with high cardinality features. In this episode they focus on feature engineering via the hashing trick. The hashing trick is most applicable for extremely high cardinality, and at first glance can seem almost ridiculous. In a lot of ways, it is the same as bucketing values at random. But there are times that it is more valuable to include randomly engineered buckets than to exclude the original high cardinality feature entirely.

78,278 Listeners

32,081 Listeners

30,665 Listeners

25,888 Listeners

4,359 Listeners

1,384 Listeners

1,630 Listeners

112,433 Listeners

56,382 Listeners

9,517 Listeners

15 Listeners

11 Listeners

2 Listeners

2,109 Listeners

1,655 Listeners