In data pre-processing, Data Binning is a technique to convert continuous values of a feature to categorical ones. For example, sometimes, the values of age feature in datasets are replaced with one of intervals such as:
[10,25),
[25,40),
[40,55].
When is the best time to use Data Binning? Does it (always) lead to a better result in a predication system or it may work as a trial and error?