V1
float64
0
816
V2
float64
-2
12.3k
V3
float64
0
32.7k
V4
int64
0
130
V5
float64
0
373k
V6
float64
0
14.8k
V7
float64
0
6.61k
V8
int64
0
53
V9
float64
0
12.8k
V10
float64
0
7
V11
int64
0
136
V13
float64
0
332
V19
int64
0
9
V22
int64
0
2
V30
int64
0
7
V33
int64
0
3
V35
int64
0
9
V36
int64
0
12
V40
float64
0
3.78k
V41
int64
0
3
V42
float64
0
515
V43
float64
0
12.1k
V45
int64
0
8
V47
int64
0
10
V50
float64
0
4.55k
V51
int64
0
136
V52
int64
0
7
V59
float64
0
3.27k
V63
int64
0
13
V72
int64
0
117
V75
int64
0
136
class
int64
0
1
1
2
5
4
1,092
4
3
9
36
1
2
4
7
2
6
2
9
4
1
1
5
50
1
0
6
2
0
0
6
3
2
0
14
0
1
3
247
3
14
14
13
3
3
3
7
2
1
3
1
8
1
0
42
1
1
10
2
0
1
14
7
30
2
0
1
3
12
8
1,080
38
7
38
136
1
2
28
7
2
4
3
9
7
3
3
21
2
1
8
2
1
1
0
8
1
1
0
1
5
9
16
2
5
4
18
88
1
3
5
3
2
1
0
0
6
1
3
23
38
6
5
4
12
0
9
6
2
0
0
1
0
2
9
54
13
1
4
9
1
1
9
4
2
2
3
9
7
32
2
6
21
1
5
0
0
1
5
7
1
2
0
1
0
129
0
2
0
6
2
3
1
2
0
7
2
1
2
9
6
2
2
14
144
1
8
135
2
2
36
7
1
0
0
2
8
1
8
1,039
8
3
10
76
1
2
8
7
2
0
2
9
6
59
1
5
917
0
8
8
3
0
0
7
3
0
0
0
1
3
3
2,416
41
4
17
38
0
1
3
3
2
1
2
9
1
0
3
0
450
1
3
46
0
0
24
6
8
3
0
3
1
8
15
14
3
9
16
154
1
4
3
7
2
3
2
8
8
2
2
1
8
1
3
0
15
1
5
1
6
0
0
2
46
40
1
8
1
2
1
1
1
1
1
7
2
0
0
9
7
18
2
32
6
6
3
16
1
2
3
7
2
1
0
7
41
30
0
17
0
51
0
164
2
15
0
3
2
6
0
9
6
470
2
5
16
6
3
3
9
0
619
6
0
1
0
5
3
6
6
1
5
29
33
144
2
9
5
7
2
1
3
9
6
3
2
3
13
1
7
12
2
0
5
6
0
0
0
3
61
9
15
4
8
12
14
15
1
3
8
4
2
1
3
9
6
1
0
38
20
0
8
3
1
0
20
6
1
1
0
2
123
25
10
4
21
2
9
10
1
1
2
3
2
1
2
9
6
26
3
4
12
7
5
2
1
1
1
6
1
1
0
8
9
4
5
83
5
16
10
56
3
4
5
3
2
1
3
9
7
3
3
0
120
1
0
1
2
0
12
6
0
1
0
4
0
18
12
1,533
20
7
16
16
0
1
12
9
2
1
0
9
6
12
1
2
71
4
3
0
1
0
1
7
34
17
0
0
73
4
5
2,427
33
11
16
344
0
8
5
4
2
1
2
9
6
42
0
28
146
1
10
35
9
1
44
8
1
3
0
24
45
26
25
4
15
47
46
46
2
5
15
4
2
1
1
9
8
0
0
29
6
1
8
0
1
1
8
6
2
1
0
142
193
10
8
2
0
762
16
215
4
51
0
3
2
3
3
8
9
1
3
0
98
1
5
1
1
0
7
6
2
3
0
11
138
1
1
277
1
12
14
35
2
3
1
7
2
3
2
9
8
1
3
3
317
4
3
28
1
0
15
8
10
1
0
1
0
47
6
17
7
1
5
6
1
1
6
7
2
0
2
9
6
48
1
30
24
1
8
67
5
0
26
7
0
1
0
0
2
1
9
14,139
881
2
8
1,974
0
1
9
7
2
1
3
9
8
11
2
2
109
1
0
1
1
2
46
6
1
1
0
3
159
1
1
77
1
4
12
12
1
2
1
6
2
1
1
9
6
8
3
0
48
1
10
7
0
1
2
7
4
2
0
12
0
102
10
0
0
20
32
47
2
4
0
9
2
1
2
9
6
23
3
2
29
1
5
0
1
0
4
6
2
1
0
1
0
2
3
646
7
2
9
110
1
2
3
7
2
6
3
9
6
5
0
28
1
6
10
90
5
0
41
6
0
1
0
3
8
1
8
12
4
3
2
8
1
1
4
7
2
1
3
9
6
4
0
8
26
0
5
7
1
0
41
9
5
5
0
0
9
7
7
2,619
107
1
9
35
0
1
7
7
2
7
3
9
8
24
3
11
273
5
3
2
2
0
20
6
2
2
0
12
3
10
5
116
11
47
15
128
3
8
6
4
2
6
0
9
7
121
0
0
12
7
3
17
1
0
306
6
1
6
0
11
7
1
1
220
1
61
17
105
4
22
1
9
2
1
3
9
6
0
3
2
111
2
3
84
0
0
1
6
1
1
0
5
10
1
7
133
9
8
38
56
2
3
9
7
2
1
2
5
7
116
3
6
0
2
5
3
0
0
9
8
1
6
0
0
70
18
6
6,071
42
3
13
119
0
3
6
9
2
1
1
9
6
2
1
1
1
1
8
19
2
1
1
7
0
0
0
6
5
9
8
940
8
89
18
106
1
13
8
5
2
1
3
9
6
180
0
19
139
1
8
13
1
1
3
8
3
2
0
9
0
14
15
339
15
9
15
15
1
1
15
6
2
1
0
9
6
82
0
0
0
1
8
2
0
0
1
8
0
0
0
2
1,194
3
0
10
0
2
0
0
1
1
0
7
2
1
3
8
7
48
3
2
12
0
5
10
1
1
1
6
0
3
0
2
23
9
7
1,255
7
23
14
88
1
10
7
7
2
4
1
9
8
5
0
0
4,531
2
5
4
1
1
122
4
3
1
0
0
1
11
3
10,039
101
1
3
17
0
1
3
3
2
7
1
9
4
1
2
3
1
1
5
40
1
0
23
6
4
0
0
0
3
2
10
3,941
43
37
5
62
0
3
10
7
2
4
1
9
6
2
3
48
27
6
8
0
4
1
32
6
4
2
0
0
6
1
9
1,558
185
7
29
188
0
1
9
7
2
1
2
8
6
0
2
2
1,664
4
3
5
9
1
1
7
1
0
0
1
125
20
1
8
1
1
1
1
1
1
1
4
2
7
0
8
7
3
0
45
34
1
3
53
2
1
170
7
1
5
0
14
1
1
2
2
2
14
0
2
1
1
2
7
2
1
1
9
7
21
0
2
375
1
8
1
0
0
1
6
3
1
0
2
14
2
1
1
1
2
2
1
1
1
1
7
2
0
3
9
6
1
0
0
96
1
5
77
1
0
117
1
1
1
0
4
39
3
3
169
21
6
21
21
1
2
3
4
2
0
2
9
6
0
3
2
30
1
10
20
13
0
50
6
15
13
0
4
0
7
3
5
3
248
2
10
2
10
3
4
2
1
1
9
6
0
3
19
133
1
8
4
0
0
437
7
2
1
0
34
1
4
4
862
36
34
12
24
1
1
4
7
2
1
0
9
6
7
3
1
60
1
8
0
3
0
0
6
5
0
0
1
161
1
1
8
1
1
1
1
1
1
1
4
2
7
3
8
7
6
1
32
52
1
10
4
0
1
1
4
1
1
0
2
5
1
1
1
1
27
1
2
1
3
1
3
2
4
3
9
8
1
0
20
131
1
3
3
1
0
47
7
0
2
0
4
163
835
24
2
0
5
25
28
1
2
0
7
2
1
2
1
1
3
2
27
65
1
3
0
1
0
1
6
1
2
0
5
21
1
1
761
1
66
1
1
1
9
1
4
2
1
3
1
7
15
2
21
134
1
3
1
0
1
0
2
4
2
0
0
4
5
12
11
18
5
16
259
0
4
5
4
2
1
3
1
7
1
1
27
49
8
8
2
0
1
7
6
1
0
0
3
407
3
1
7
0
11
3
9
1
2
0
7
2
1
2
8
8
1
3
37
2
8
7
1
0
1
6
8
16
6
0
3
-1
5
6
1,026
125
86
49
479
1
17
8
7
2
1
3
9
9
11
3
3
2
5
8
1
1
1
1
6
0
1
0
3
66
6
5
133
6
14
14
47
1
5
5
4
2
1
2
8
1
1
1
9
8
1
8
11
6
0
1
1
0
0
0
14
0
1
0
1
0
14
0
0
2
2
0
9
2
1
2
9
7
3
2
2
64
1
8
14
2
1
3
6
1
5
0
2
0
70
33
73
39
2
47
415
1
1
39
4
2
1
1
9
8
5
3
2
287
8
3
143
1
0
53
1
1
1
0
10
13
2
15
1
2
10
12
15
1
1
2
7
2
1
0
8
6
49
3
0
3
1
8
1
2
1
2
6
6
2
0
7
2
1
10
0
0
132
1
1,996
1
15
0
3
2
6
3
9
6
70
3
10
3
5
3
38
0
1
173
6
1
0
0
4
0
30
8
4
2
32
13
117
1
6
2
3
2
1
3
9
8
68
3
0
11
1
3
15
1
1
4
2
1
1
0
0
0
16
22
2,714
87
313
31
910
0
20
22
7
2
3
3
9
7
0
3
22
11
1
5
81
2
1
28
8
4
30
0
9
67
38
18
2
8
10
15
56
1
2
8
7
2
1
1
9
1
1
0
2
85
1
3
0
0
2
2
6
5
1
0
13
158
6
11
5
11
13
16
21
2
2
11
4
2
1
3
9
6
45
1
14
28
1
10
1
8
0
0
6
8
1
0
14
1
7
7
57
7
222
25
1,862
2
25
7
7
2
6
1
9
8
17
1
3
9
0
8
7
4
1
24
6
2
0
0
3
268
1
1
46
8
4
1
43
2
2
1
4
2
0
3
9
6
4
1
14
5
8
8
1
5
1
2
8
1
3
0
3
0
13
17
69
17
3
14
17
1
1
17
7
2
0
0
9
6
0
0
1
925
1
5
11
1
1
1
6
0
1
0
40
41
3
1
1
1
95
1
15
4
10
1
3
2
1
1
9
8
1
2
28
34
1
3
1
5
0
1
6
0
0
0
1
-2
74
18
738
84
1
31
30
1
1
19
6
2
7
3
9
6
5
1
17
120
8
8
9
8
0
65
7
0
15
0
77
1
2
2
2
2
109
4
18
1
2
2
7
2
1
3
1
6
2
3
35
0
0
7
12
5
0
14
8
2
2
0
1
19
6
7
2
2
1
18
18
1
1
2
7
2
4
1
8
6
6
3
47
37
8
5
1
0
0
0
6
4
2
0
2
-1
3
7
7
4
2
8
7
1
1
4
4
2
3
3
8
7
0
3
0
3
1
8
1
34
0
0
8
2
3
0
2
769
2
4
21
7
2
4
4
1
1
4
3
2
0
3
8
6
0
3
8
10
1
3
14
6
1
13
8
0
9
0
8
-1
2
5
1
2
8
5
5
2
2
2
7
2
1
3
9
8
2
3
3
400
0
8
9
1
1
0
6
0
2
0
1
32
36
3
1,110
114
1
36
42
1
1
3
7
2
1
1
9
7
0
2
21
27
1
8
7
6
2
123
1
0
8
0
0
175
3
6
2,777
124
2
47
72
0
1
6
3
2
0
3
8
6
0
3
48
17
1
3
1
2
1
1
8
1
1
0
3
427
1
1
7
0
11
5
48
2
5
0
0
2
1
3
9
6
1
2
0
3
0
8
13
2
0
12
8
2
0
0
3
3
18
6
570
18
3
25
36
1
1
6
0
2
3
1
9
6
7
1
28
1,022
7
5
1
3
1
2
6
1
1
0
2
1
24
8
286
11
2
8
8
1
1
8
4
2
0
2
9
6
12
1
1
22
2
8
24
0
0
3
4
12
3
0
4
887
6
5
56
4
5
8
7
1
2
4
3
2
0
3
1
7
62
0
0
12
1
5
12
2
0
7
6
1
1
0
6
0
6
24
18
5
7
8
49
2
3
5
7
2
1
1
9
4
29
0
19
217
7
8
0
1
1
7
6
2
5
0
0
38
6
16
2,728
68
12
7
71
0
3
16
4
2
1
1
9
4
4
0
6
89
0
5
2
4
2
2
8
1
4
0
0
11
3
0
1,961
118
14
4
201
0
10
3
0
2
4
3
9
6
0
3
5
6
1
8
0
0
0
2
7
1
1
0
99
135
13
0
0
36
147
47
124
0
4
0
3
2
4
1
8
1
4
1
12
11
0
8
1
1
0
2
7
40
2
0
0
14
1
1
1,203
14
112
1
83
0
29
1
5
2
1
2
8
6
2
0
43
4
7
8
42
1
0
7
6
1
0
0
1
26
55
8
447
68
12
33
469
1
2
34
7
2
6
1
9
6
4
3
48
0
8
8
6
0
1
0
7
0
2
0
5
2
1
5
745
59
5
37
43
1
1
5
7
2
1
3
9
6
4
3
12
15
0
5
4
0
1
3
7
3
5
0
13
669
3
3
8
0
1,022
19
1,048
1
47
0
7
2
6
2
8
8
6
3
16
4
1
5
125
1
2
3
6
1
17
0
1
50
1
1
293
1
1
1
1
1
1
1
9
2
7
3
9
6
1
2
7
15
1
10
5
1
1
0
7
80
1
0
1
0
7
5
112
5
1
7
9
1
1
5
9
2
1
0
9
6
21
3
5
27
6
5
0
0
0
8
6
0
3
0
9
7
10
9
1
0
35
39
484
2
15
0
4
2
1
3
8
7
0
1
7
114
1
10
19
1
3
1
6
9
1
0
0
326
3
5
3,163
38
5
5
162
0
2
5
7
2
6
3
9
6
0
3
45
24
1
5
90
0
1
0
7
1
0
0
15
0
22
19
5
0
15
21
22
3
3
0
3
2
1
0
9
4
2
3
6
0
3
3
8
2
1
18
7
0
1
0
0
117
4
4
2,023
17
21
15
144
0
9
4
7
2
6
3
1
8
9
1
0
59
1
8
3
1
1
20
1
3
1
0
0
53
1
1
4,711
198
4
12
161
0
3
1
4
2
1
3
9
6
2
2
33
31
1
10
8
8
1
0
7
11
9
0
0
0
3
1
4,403
14
19
2
7
0
1
1
0
2
1
1
9
7
0
0
2
203
0
3
1
0
0
8
6
1
16
0
21
93
46
2
1
0
21
5
2
2
2
0
7
2
3
1
8
6
32
2
33
4
5
8
6
65
1
2
7
1
2
0
1
83
1
1
1
0
1
4
70
1
1
0
9
2
7
1
9
6
6
3
7
55
4
8
3
0
1
1
6
0
15
0
30
0
76
8
27
0
192
26
48
1
17
0
7
2
1
2
8
6
2
3
36
7
5
8
14
0
1
3
7
1
0
0
3
55
16
8
1,195
37
32
48
728
0
10
8
4
2
3
3
1
6
9
3
5
2
8
8
23
6
0
3
4
1
0
0
0
623
3
2
783
21
123
25
344
0
37
2
5
2
3
2
9
6
2
1
8
1
6
3
63
1
0
100
6
1
6
0
1
1
2
4
233
34
10
34
465
1
6
4
4
2
6
3
9
6
2
1
28
17
1
8
273
1
0
71
7
1
0
0
2
21
6
7
90
9
2
7
10
1
1
9
3
2
1
3
9
6
22
3
12
1
1
5
49
1
1
58
4
0
4
0
2
548
2
2
17
2
5
5
54
1
2
2
3
2
1
3
9
6
0
3
2
24
0
3
7
16
0
1
7
3
6
0

Tabular Benchmark

Dataset Description

This dataset is a curation of various datasets from openML and is curated to benchmark performance of various machine learning algorithms.

Dataset Summary

Benchmark made of curation of various tabular data learning tasks, including:

  • Regression from Numerical and Categorical Features
  • Regression from Numerical Features
  • Classification from Numerical and Categorical Features
  • Classification from Numerical Features

Supported Tasks and Leaderboards

  • tabular-regression
  • tabular-classification

Dataset Structure

Data Splits

This dataset consists of four splits (folders) based on tasks and datasets included in tasks.

  • reg_num: Task identifier for regression on numerical features.
  • reg_cat: Task identifier for regression on numerical and categorical features.
  • clf_num: Task identifier for classification on numerical features.
  • clf_cat: Task identifier for classification on categorical features.

Depending on the dataset you want to load, you can load the dataset by passing task_name/dataset_name to data_files argument of load_dataset like below:

from datasets import load_dataset
dataset = load_dataset("inria-soda/tabular-benchmark", data_files="reg_cat/house_sales.csv")

Dataset Creation

Curation Rationale

This dataset is curated to benchmark performance of tree based models against neural networks. The process of picking the datasets for curation is mentioned in the paper as below:

  • Heterogeneous columns. Columns should correspond to features of different nature. This excludes images or signal datasets where each column corresponds to the same signal on different sensors.
  • Not high dimensional. We only keep datasets with a d/n ratio below 1/10.
  • Undocumented datasets We remove datasets where too little information is available. We did keep datasets with hidden column names if it was clear that the features were heterogeneous.
  • I.I.D. data. We remove stream-like datasets or time series.
  • Real-world data. We remove artificial datasets but keep some simulated datasets. The difference is subtle, but we try to keep simulated datasets if learning these datasets are of practical importance (like the Higgs dataset), and not just a toy example to test specific model capabilities.
  • Not too small. We remove datasets with too few features (< 4) and too few samples (< 3 000). For benchmarks on numerical features only, we remove categorical features before checking if enough features and samples are remaining.
  • Not too easy. We remove datasets which are too easy. Specifically, we remove a dataset if a simple model (max of a single tree and a regression, logistic or OLS) reaches a score whose relative difference with the score of both a default Resnet (from Gorishniy et al. [2021]) and a default HistGradientBoosting model (from scikit learn) is below 5%. Other benchmarks use different metrics to remove too easy datasets, like removing datasets perfectly separated by a single decision classifier [Bischl et al., 2021], but this ignores varying Bayes rate across datasets. As tree ensembles are superior to simple trees and logistic regresison [Fernández-Delgado et al., 2014], a close score for the simple and powerful models suggests that we are already close to the best achievable score.
  • Not deterministic. We remove datasets where the target is a deterministic function of the data. This mostly means removing datasets on games like poker and chess. Indeed, we believe that these datasets are very different from most real-world tabular datasets, and should be studied separately

Source Data

Numerical Classification

dataset_name n_samples n_features original_link new_link
electricity 38474.0 7.0 https://www.openml.org/d/151 https://www.openml.org/d/44120
covertype 566602.0 10.0 https://www.openml.org/d/293 https://www.openml.org/d/44121
pol 10082.0 26.0 https://www.openml.org/d/722 https://www.openml.org/d/44122
house_16H 13488.0 16.0 https://www.openml.org/d/821 https://www.openml.org/d/44123
MagicTelescope 13376.0 10.0 https://www.openml.org/d/1120 https://www.openml.org/d/44125
bank-marketing 10578.0 7.0 https://www.openml.org/d/1461 https://www.openml.org/d/44126
Bioresponse 3434.0 419.0 https://www.openml.org/d/4134 https://www.openml.org/d/45019
MiniBooNE 72998.0 50.0 https://www.openml.org/d/41150 https://www.openml.org/d/44128
default-of-credit-card-clients 13272.0 20.0 https://www.openml.org/d/42477 https://www.openml.org/d/45020
Higgs 940160.0 24.0 https://www.openml.org/d/42769 https://www.openml.org/d/44129
eye_movements 7608.0 20.0 https://www.openml.org/d/1044 https://www.openml.org/d/44130
Diabetes130US 71090.0 7.0 https://www.openml.org/d/4541 https://www.openml.org/d/45022
jannis 57580.0 54.0 https://www.openml.org/d/41168 https://www.openml.org/d/45021
heloc 10000.0 22.0 "https://www.kaggle.com/datasets/averkiyoliabev/home-equity-line-of-creditheloc?select=heloc_dataset_v1+%281%29.csv" https://www.openml.org/d/45026
credit 16714.0 10.0 "https://www.kaggle.com/c/GiveMeSomeCredit/data?select=cs-training.csv" https://www.openml.org/d/44089
california 20634.0 8.0 "https://www.dcc.fc.up.pt/ltorgo/Regression/cal_housing.html" https://www.openml.org/d/45028

Categorical Classification

Numerical Regression

dataset_name n_samples n_features original_link new_link
cpu_act 8192.0 21.0 https://www.openml.org/d/197 https://www.openml.org/d/44132
pol 15000.0 26.0 https://www.openml.org/d/201 https://www.openml.org/d/44133
elevators 16599.0 16.0 https://www.openml.org/d/216 https://www.openml.org/d/44134
wine_quality 6497.0 11.0 https://www.openml.org/d/287 https://www.openml.org/d/44136
Ailerons 13750.0 33.0 https://www.openml.org/d/296 https://www.openml.org/d/44137
yprop_4_1 8885.0 42.0 https://www.openml.org/d/416 https://www.openml.org/d/45032
houses 20640.0 8.0 https://www.openml.org/d/537 https://www.openml.org/d/44138
house_16H 22784.0 16.0 https://www.openml.org/d/574 https://www.openml.org/d/44139
delays_zurich_transport 5465575.0 9.0 https://www.openml.org/d/40753 https://www.openml.org/d/45034
diamonds 53940.0 6.0 https://www.openml.org/d/42225 https://www.openml.org/d/44140
Brazilian_houses 10692.0 8.0 https://www.openml.org/d/42688 https://www.openml.org/d/44141
Bike_Sharing_Demand 17379.0 6.0 https://www.openml.org/d/42712 https://www.openml.org/d/44142
nyc-taxi-green-dec-2016 581835.0 9.0 https://www.openml.org/d/42729 https://www.openml.org/d/44143
house_sales 21613.0 15.0 https://www.openml.org/d/42731 https://www.openml.org/d/44144
sulfur 10081.0 6.0 https://www.openml.org/d/23515 https://www.openml.org/d/44145
medical_charges 163065.0 5.0 https://www.openml.org/d/42720 https://www.openml.org/d/44146
MiamiHousing2016 13932.0 14.0 https://www.openml.org/d/43093 https://www.openml.org/d/44147
superconduct 21263.0 79.0 https://www.openml.org/d/43174 https://www.openml.org/d/44148

Categorical Regression

dataset_name n_samples n_features original_link new_link
topo_2_1 8885.0 255.0 https://www.openml.org/d/422 https://www.openml.org/d/45041
analcatdata_supreme 4052.0 7.0 https://www.openml.org/d/504 https://www.openml.org/d/44055
visualizing_soil 8641.0 4.0 https://www.openml.org/d/688 https://www.openml.org/d/44056
delays_zurich_transport 5465575.0 12.0 https://www.openml.org/d/40753 https://www.openml.org/d/45045
diamonds 53940.0 9.0 https://www.openml.org/d/42225 https://www.openml.org/d/44059
Allstate_Claims_Severity 188318.0 124.0 https://www.openml.org/d/42571 https://www.openml.org/d/45046
Mercedes_Benz_Greener_Manufacturing 4209.0 359.0 https://www.openml.org/d/42570 https://www.openml.org/d/44061
Brazilian_houses 10692.0 11.0 https://www.openml.org/d/42688 https://www.openml.org/d/44062
Bike_Sharing_Demand 17379.0 11.0 https://www.openml.org/d/42712 https://www.openml.org/d/44063
Airlines_DepDelay_1M 1000000.0 5.0 https://www.openml.org/d/42721 https://www.openml.org/d/45047
nyc-taxi-green-dec-2016 581835.0 16.0 https://www.openml.org/d/42729 https://www.openml.org/d/44065
abalone 4177.0 8.0 https://www.openml.org/d/42726 https://www.openml.org/d/45042
house_sales 21613.0 17.0 https://www.openml.org/d/42731 https://www.openml.org/d/44066
seattlecrime6 52031.0 4.0 https://www.openml.org/d/42496 https://www.openml.org/d/45043
medical_charges 163065.0 5.0 https://www.openml.org/d/42720 https://www.openml.org/d/45048
particulate-matter-ukair-2017 394299.0 6.0 https://www.openml.org/d/42207 https://www.openml.org/d/44068
SGEMM_GPU_kernel_performance 241600.0 9.0 https://www.openml.org/d/43144 https://www.openml.org/d/44069

Dataset Curators

Léo Grinsztajn, Edouard Oyallon, Gaël Varoquaux.

Licensing Information

[More Information Needed]

Citation Information

Léo Grinsztajn, Edouard Oyallon, Gaël Varoquaux. Why do tree-based models still outperform deep learning on typical tabular data?. NeurIPS 2022 Datasets and Benchmarks Track, Nov 2022, New Orleans, United States. ffhal-03723551v2f

Downloads last month
2,048
Edit dataset card
Evaluate models HF Leaderboard

Spaces using inria-soda/tabular-benchmark 2