Comparing Transformer-based and gradient boosted decision tree (GBDT) Models on Tabular Data: A Rossmann Case Study
dc.contributor.author | Middel, Coenraad | |
dc.contributor.author | Davel, Marelie H. | |
dc.date.accessioned | 2025-05-06T07:43:29Z | |
dc.date.available | 2025-05-06T07:43:29Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Heterogeneous tabular data is a common and important data format. This empirical study investigates how the performance of deep transformer models compares against benchmark gradient boosting decision tree (GBDT) methods, the more typical modelling approach. All models are optimised using a Bayesian hyperparameter optimisation protocol, which provides a stronger comparison than the random grid search hyperparameter optimisation utilized in earlier work. Since feature skewness is typically handled differently for GBDT and transformer-based models, we investigate the effect of a pre-processing step that normalises feature distribution on the model comparison process. Our analysis is based on the Rossmann Store Sales dataset, a widely recognized benchmark for regression tasks. | en_US |
dc.identifier.citation | Middel, C. & Davel M. Comparing Transformer-based and gradient boosted decision tree (GBDT) Models on Tabular Data: A Rossmann Case Study | en_US |
dc.identifier.uri | http://hdl.handle.net/10394/42882 | |
dc.language.iso | en | en_US |
dc.subject | Tabular data | en_US |
dc.subject | Transformer architectures | en_US |
dc.subject | Gradient Boosting Decision Trees | en_US |
dc.subject | Hyperparameter tuning | en_US |
dc.subject | Rossmann Store Sales | en_US |
dc.title | Comparing Transformer-based and gradient boosted decision tree (GBDT) Models on Tabular Data: A Rossmann Case Study | en_US |
dc.type | Article | en_US |
Files
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.61 KB
- Format:
- Item-specific license agreed upon to submission
- Description: