Tweedie deviance loss for tree based models #16668


Description

@lorentzenchr

Describe the workflow you want to enable

If the target y is (approximately) Poisson, Gamma or else Tweedie distributed, it would be beneficial for tree based regressors to support Tweedie deviance loss functions as splitting criterion. This partially addresses #5975.
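As a minimal sketch of the intended workflow, assuming a hypothetical `criterion="poisson"` value (the option proposed here, not an existing `DecisionTreeRegressor` parameter value at the time of writing):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(size=(1000, 3))
# Count target whose rate depends on the features -> (approximately) Poisson distributed
y = rng.poisson(lam=np.exp(X[:, 0] + 0.5 * X[:, 1]))

# criterion="poisson" is the feature requested here, not an existing option
reg = DecisionTreeRegressor(criterion="poisson", min_samples_leaf=20)
reg.fit(X, y)
```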

Describe your proposed solution

Ideally, one first implements the Poisson, Gamma and Tweedie deviance loss functions themselves, and then adds the new loss criteria to the tree-based models.
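For reference, a minimal NumPy sketch of the mean Tweedie deviance such a criterion would minimize. It mirrors what `sklearn.metrics.mean_tweedie_deviance` computes, but the function below is an illustrative re-implementation, not the project's code:

```python
import numpy as np
from scipy.special import xlogy

def tweedie_deviance(y, y_pred, power):
    """Mean Tweedie deviance.

    power=0: squared error, power=1: Poisson, power=2: Gamma.
    For 1 <= power < 2 the target y may be zero, but y_pred must be > 0;
    for power >= 2, y must be strictly positive as well.
    """
    y = np.asarray(y, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    if power == 0:
        dev = (y - y_pred) ** 2
    elif power == 1:
        # xlogy treats y * log(y / y_pred) as 0 for y == 0
        dev = 2 * (xlogy(y, y / y_pred) - y + y_pred)
    elif power == 2:
        dev = 2 * (np.log(y_pred / y) + y / y_pred - 1)
    else:
        dev = 2 * (
            np.power(y, 2 - power) / ((1 - power) * (2 - power))
            - y * np.power(y_pred, 1 - power) / (1 - power)
            + np.power(y_pred, 2 - power) / (2 - power)
        )
    return np.mean(dev)
```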

Open for Discussion

For Poisson and Tweedie deviance with 1 <= power < 2, the target y may be zero while the prediction y_pred must be strictly larger than zero. A tree might find a split where one node has y = 0 for all samples in that node, naively resulting in y_pred = mean(y) = 0 for that node, which violates this positivity requirement. I see three different solutions to that:

  1. Use a log-link function, i.e. predict y_pred = np.exp(tree).
    See #16692 (ENH Poisson loss for HistGradientBoostingRegressor). This may not be an option for DecisionTreeRegressor.
  2. Use a splitting rule that forbids splits where one node has sum(y) = 0.
    One might also introduce an option like min_y_weight, such that splits with sum(sample_weight * y) < min_y_weight are forbidden. (Options 2 and 3 are sketched in code after this list.)
  3. Use some form of parent-child average, y_pred = a * mean(y) + (1 - a) * y_pred_parent, and forbid further splits, see [1].
    (Bayesian/credibility theory motivates setting a = sum(sample_weight * y) / (gamma + sum(sample_weight * y)) for some hyperparameter gamma.)
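A minimal node-level sketch of options 2 and 3; min_y_weight and gamma are hypothetical hyperparameters taken from the discussion above, not existing scikit-learn parameters:

```python
import numpy as np

def split_is_allowed(y_left, w_left, y_right, w_right, min_y_weight=1e-3):
    # Option 2: reject splits where a child carries (almost) no positive
    # target mass, i.e. sum(sample_weight * y) < min_y_weight in that child.
    return (np.sum(w_left * y_left) >= min_y_weight
            and np.sum(w_right * y_right) >= min_y_weight)

def shrunken_node_value(y, w, y_pred_parent, gamma=1.0):
    # Option 3: credibility-weighted average of node mean and parent prediction.
    # a -> 1 for nodes with much positive target mass, a -> 0 for all-zero
    # nodes, so the value stays strictly positive if the root's value is.
    wy = np.sum(w * y)
    a = wy / (gamma + wy)
    node_mean = wy / np.sum(w)  # weighted mean of y in the node
    return a * node_mean + (1 - a) * y_pred_parent
```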

There is also a dirty solution that allows y_pred = 0 but uses the value max(eps, y_pred) in the loss function for some tiny value of eps.
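In code, this workaround is a one-line clamp before evaluating the loss (eps being an arbitrary small constant):

```python
import numpy as np

eps = 1e-8  # arbitrary tiny floor
y_pred = np.array([0.0, 0.3, 2.0])  # a node value of exactly 0 would make log(y_pred) diverge
y_pred_safe = np.maximum(y_pred, eps)
```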

References

[1] R rpart package, chapter 8: Poisson regression.
