Welcome to my blog!

Merging categories with small frequencies

It can happen that some categorical features have lot of different values with small frequencies. This can often be a problem during modeling phase.

Typical solution is to merge all categories with small (e.g. 5% , 1% etc… ) frequency of appearance in data set into one category e.g. Other

Below is the pandas example on the sweet data set 🙂



Loading

Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

Leave a Reply

Your email address will not be published. Required fields are marked *