Switching to float for histogram building #10913

Closed
Shiki-H opened this issue Oct 20, 2024 · 3 comments

Comments

@Shiki-H
Contributor

Shiki-H commented Oct 20, 2024

Hi, I'd like some clarification about the decision to stick with double for GHistRow, which is used in histogram building. @RAMitchell mentioned in this comment that switching to float would result in significant accuracy degradation. I'm wondering how performance was measured in that case. More generally, if, say, we have ways to improve the runtime of model building that would lead to changes in model metrics, what level of variation is acceptable from the development team's perspective?
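
For context, here is a minimal sketch of the kind of per-bin accumulation under discussion. This is not XGBoost's actual GHistRow code; the names and signature are hypothetical, but the pattern (summing per-row gradient/hessian pairs into bins, with `double` as the accumulator type in question) is the one being debated:

```cpp
// Hedged illustration (not XGBoost's real implementation) of histogram
// building: accumulate per-row gradient statistics into feature bins.
#include <cstddef>
#include <vector>

struct GradientPair {
  double grad;  // sum of first-order gradients falling into this bin
  double hess;  // sum of second-order gradients (hessians) in this bin
};

// index[i]: bin index of row i; g[i], h[i]: per-row gradient statistics.
void BuildHistogram(const std::vector<std::size_t>& index,
                    const std::vector<float>& g,
                    const std::vector<float>& h,
                    std::vector<GradientPair>* hist) {
  for (std::size_t i = 0; i < index.size(); ++i) {
    // The accumulators are double: with millions of rows per bin, a float
    // accumulator would eventually stop absorbing small contributions.
    (*hist)[index[i]].grad += g[i];
    (*hist)[index[i]].hess += h[i];
  }
}
```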

@trivialfis
Member

trivialfis commented Oct 20, 2024

Hi, XGBoost GPU used to have this option; I proposed and merged its removal a long time ago. The model wasn't converging when f32 was used for accumulation and the number of samples was large. This is obvious for typical regression, since one sees metrics like rmse increasing on the training dataset. But it's not apparent when the metric is on a log scale, as with logistic regression; in that case, the model can still be garbage even if the metric somewhat improves.
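
To make the failure mode concrete (a minimal sketch, not from the original discussion): a float carries only about 24 significand bits, so once a running sum grows large relative to the per-sample contribution, further additions round to no change at all. A double accumulator keeps about 53 bits and stays accurate at this scale:

```cpp
// Demonstrates float32 accumulation stalling at large sample counts.
#include <cstdio>

int main() {
  const long long n = 100000000LL;  // 1e8 samples, HIGGS-scale
  const float grad = 1e-3f;         // a typical small per-sample value

  float sum_f32 = 0.0f;
  double sum_f64 = 0.0;
  for (long long i = 0; i < n; ++i) {
    sum_f32 += grad;  // stalls near 2^15: half an ulp exceeds grad there
    sum_f64 += grad;  // stays accurate at this scale
  }
  std::printf("float  accumulator: %.1f\n", sum_f32);  // ~32768.0
  std::printf("double accumulator: %.1f\n", sum_f64);  // ~100000.0
  return 0;
}
```

The true sum is about 100000, but the float accumulator is off by roughly a factor of three, which is the kind of silent accumulation error that keeps the model from converging.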

what level of variation is acceptable

As a result, it's not about variation. It's about whether the model converges at all, and whether a user can quickly tell that they are running into trouble.

I don't have the experiment results anymore; I think I was using the HIGGS dataset as a demonstration. The performance gain is negligible for small datasets, whereas for large datasets the model might not converge. The parameter had little practical usefulness.

@trivialfis
Member

Feel free to close if there are no further questions.

@Shiki-H
Contributor Author

Shiki-H commented Oct 21, 2024

@trivialfis thanks, that was very helpful

Shiki-H closed this as completed Oct 21, 2024