In data pre-processing, Data Binning is a technique to convert continuous values of a feature to categorical ones. For example, sometimes, the values of age feature in datasets are replaced with one of intervals such as:
[10,20),
[20,30),
[30,40].
When is the best time to use Data Binning? Does it (always) lead to a better result in a predication system or it may work as a trial and error?