text
string
"9.000000000000000000e+00 7.000000000000000000e+00 1.431758905067273829e-17 1.280206171201988319e+00 5.061034370019017505e-19"
"3.000000000000000000e+00 3.000000000000000000e+00 1.799513747624317425e-18 5.031704544386760958e+00 7.071075943056076557e-19"
"1.000000000000000000e+00 6.000000000000000000e+00 2.851774047927096091e-18 4.986234560064437105e+00 1.106648087681759389e-19"
"4.000000000000000000e+00 9.000000000000000000e+00 6.061187884331541288e-18 5.010228440980251108e-01 3.107360632328579100e-17"
"6.000000000000000000e+00 8.000000000000000000e+00 3.156748407622208517e-17 3.239482781398461242e+00 7.728381964948306949e-21"
"8.000000000000000000e+00 4.000000000000000000e+00 1.144180535969386525e-18 2.406926622061432042e+00 3.429664846543973254e-18"
"6.000000000000000000e+00 1.000000000000000000e+00 3.501921011621698747e-18 2.859546781161941720e+00 1.016309089973520505e-20"
"4.000000000000000000e+00 5.000000000000000000e+00 5.319155967284645039e-17 9.539658211429957735e-02 9.099686596465644722e-17"
"3.000000000000000000e+00 1.000000000000000000e+00 1.293787265428018181e-17 1.087489792773425723e+00 2.496905631520444419e-21"
"7.000000000000000000e+00 6.000000000000000000e+00 6.756936643226736941e-17 3.109327636906834336e+00 1.285965620353533423e-21"
"3.000000000000000000e+00 5.000000000000000000e+00 1.956391767250055723e-18 2.936693040159271906e+00 1.997137374945423817e-19"
"7.000000000000000000e+00 5.000000000000000000e+00 3.767209185017485848e-18 5.516656028418967850e+00 1.468404622489697379e-17"
"3.000000000000000000e+00 2.000000000000000000e+00 2.651069509578025206e-17 6.008381932034194683e+00 4.841344051769563797e-19"
"8.000000000000000000e+00 9.000000000000000000e+00 9.207846472849559348e-18 3.093423264351731206e-01 3.611169300735629551e-16"
"5.000000000000000000e+00 7.000000000000000000e+00 2.487174959286345972e-18 1.378124156238764719e+00 4.030962402165536957e-18"
"8.000000000000000000e+00 6.000000000000000000e+00 4.201540780739203087e-17 4.688458157491520062e+00 1.656496062972934863e-20"
"3.000000000000000000e+00 6.000000000000000000e+00 7.312116475036373910e-17 1.095954807064002834e+00 2.736777781586602965e-21"
"6.000000000000000000e+00 7.000000000000000000e+00 4.458974005336971723e-18 3.287300893189536222e+00 2.982959913782312218e-19"
"4.000000000000000000e+00 2.000000000000000000e+00 6.812802687642499830e-17 1.834133768444316193e+00 1.155159230929102429e-22"
"4.000000000000000000e+00 9.000000000000000000e+00 7.068689079012983850e-18 3.077581088830496192e+00 8.646125901910363045e-20"
"7.000000000000000000e+00 4.000000000000000000e+00 4.884198013049426451e-18 1.906289125794887074e+00 2.475088319919910362e-19"
"1.000000000000000000e+00 8.000000000000000000e+00 5.154640792767285127e-18 1.910138813786768752e+00 1.804154932027149004e-20"
"7.000000000000000000e+00 3.000000000000000000e+00 3.822431186733833197e-17 4.237310984866286212e+00 1.890869036369245955e-21"
"6.000000000000000000e+00 3.000000000000000000e+00 2.710568708406320278e-18 5.145687988855002004e+00 1.743547344092833975e-18"
"3.000000000000000000e+00 3.000000000000000000e+00 5.314749255494939434e-17 1.480321800871041926e+00 4.611437211102066043e-22"
"3.000000000000000000e+00 4.000000000000000000e+00 4.514643025647030942e-17 5.550028272463169543e+00 1.424081160131288457e-20"
"2.000000000000000000e+00 9.000000000000000000e+00 7.407072108748866991e-17 2.663629482661041603e+00 2.204666012138434556e-22"
"5.000000000000000000e+00 9.000000000000000000e+00 4.087332275922785802e-18 4.065824536291943403e+00 6.281187016320465482e-19"
"2.000000000000000000e+00 1.000000000000000000e+00 4.177123792651497660e-18 4.008390182190091799e+00 1.124189695873582179e-21"
"8.000000000000000000e+00 2.000000000000000000e+00 1.115406179368725909e-18 5.244548126955621115e+00 1.128342100168454327e-17"
"3.000000000000000000e+00 5.000000000000000000e+00 1.548938981293920587e-17 3.645676055134719373e+00 3.547232969893860486e-21"
"3.000000000000000000e+00 7.000000000000000000e+00 9.411726954167716347e-17 4.650107042331787177e+00 5.871075559936433894e-22"
"2.000000000000000000e+00 7.000000000000000000e+00 6.965046687305010595e-17 3.314083799506222783e-01 1.815664533289403499e-19"
"3.000000000000000000e+00 2.000000000000000000e+00 3.722263503962774895e-18 1.647144836295190151e+00 2.984741206596804825e-20"
"6.000000000000000000e+00 8.000000000000000000e+00 2.106574667015577495e-17 7.018579295391728055e-03 1.138832542977202707e-10"
"6.000000000000000000e+00 5.000000000000000000e+00 6.042705609532116719e-17 5.616712236912577261e+00 7.162171819237218559e-20"
"9.000000000000000000e+00 9.000000000000000000e+00 4.244517268759135592e-17 4.699044890413168751e+00 4.719155466956035610e-20"
"3.000000000000000000e+00 6.000000000000000000e+00 2.561803609644232373e-17 6.981280490883462475e-01 1.200216861185763045e-19"
"1.000000000000000000e+00 6.000000000000000000e+00 6.420781556898753231e-18 9.330865500878768315e-02 6.140357027033428999e-16"
"1.000000000000000000e+00 3.000000000000000000e+00 1.953688516880101196e-18 6.061131691872442495e+00 5.204695669891972143e-17"
"7.000000000000000000e+00 2.000000000000000000e+00 8.620863033986324260e-17 1.356582803671346982e+00 5.659850797603739546e-22"
"7.000000000000000000e+00 3.000000000000000000e+00 3.994886513089014973e-18 4.371442939104355774e-01 4.158233268205992155e-17"
"7.000000000000000000e+00 7.000000000000000000e+00 7.420447020111031240e-17 8.156861484532083040e-01 5.861288958190755274e-20"
"6.000000000000000000e+00 1.000000000000000000e+00 2.448166867954044225e-18 5.367268619692191933e+00 5.229655256316721150e-19"
"4.000000000000000000e+00 8.000000000000000000e+00 7.560291652187439385e-17 2.524539890663932695e+00 7.231908921477752126e-22"
"9.000000000000000000e+00 4.000000000000000000e+00 5.828017531173082863e-18 5.127817314153143791e+00 1.427335351603707674e-18"
"6.000000000000000000e+00 7.000000000000000000e+00 2.444925783031345883e-17 2.932679436386334437e+00 1.003381853073512687e-20"
"1.000000000000000000e+00 1.000000000000000000e+00 7.839172941538747769e-18 4.396509198439997768e+00 1.260516558712738636e-22"
"9.000000000000000000e+00 6.000000000000000000e+00 1.015101953146738867e-17 1.739553209842998216e+00 2.760443511176637648e-19"
"5.000000000000000000e+00 4.000000000000000000e+00 5.022532391069483900e-17 1.205527553543750185e+00 5.106550481763106766e-21"
"8.000000000000000000e+00 9.000000000000000000e+00 4.825898335360076640e-18 2.125532334206710949e+00 1.270734668886657672e-18"
"5.000000000000000000e+00 9.000000000000000000e+00 2.174217071457344907e-17 4.466450058833493664e+00 3.686506648379383265e-20"
"7.000000000000000000e+00 2.000000000000000000e+00 3.442616117362001606e-17 9.407637684023267832e-01 1.303830768039655631e-20"
"8.000000000000000000e+00 6.000000000000000000e+00 1.685906871244143496e-17 4.346874610395551564e+00 5.853895878899960278e-20"
"7.000000000000000000e+00 7.000000000000000000e+00 1.638503574705224186e-17 5.080813351120856858e+00 2.906698357424034905e-19"
"8.000000000000000000e+00 4.000000000000000000e+00 2.367629010490722176e-17 1.914709784141975235e-01 7.278462425102487775e-17"
"2.000000000000000000e+00 7.000000000000000000e+00 6.026790667482072890e-18 3.975747261749804196e+00 2.569079037703694079e-20"
"8.000000000000000000e+00 8.000000000000000000e+00 5.577077659926133750e-17 1.222044446752177160e+00 4.043862270487136645e-20"
"7.000000000000000000e+00 5.000000000000000000e+00 7.570433182753319990e-18 3.890246828074699792e+00 9.474572179884738214e-20"
"3.000000000000000000e+00 6.000000000000000000e+00 1.010680171003443255e-17 1.138736663162285501e+00 1.249235016248240067e-19"
"8.000000000000000000e+00 5.000000000000000000e+00 1.966100236560476580e-18 2.893572874074413215e-01 3.186838119324474617e-15"
"9.000000000000000000e+00 5.000000000000000000e+00 3.949961308182787950e-18 2.209782823421562625e+00 6.776882910886078768e-19"
"8.000000000000000000e+00 2.000000000000000000e+00 8.897915606515159557e-17 6.247862642328153804e+00 1.105768607934164858e-15"
"4.000000000000000000e+00 7.000000000000000000e+00 7.050956047545617797e-18 4.453844910424123782e+00 1.330856207956575936e-19"
"7.000000000000000000e+00 9.000000000000000000e+00 2.433625429890016408e-17 4.073766813728716407e+00 3.500446330270757285e-20"
"4.000000000000000000e+00 2.000000000000000000e+00 2.414794316573334737e-18 7.068584049548216619e-01 2.544086901594874881e-18"
"1.000000000000000000e+00 1.000000000000000000e+00 9.452214804216906346e-17 1.806021677018353033e+00 9.795515729680039591e-25"
"4.000000000000000000e+00 2.000000000000000000e+00 9.339492495173154943e-17 2.096920313892032262e+00 4.326640371481772477e-23"
"3.000000000000000000e+00 4.000000000000000000e+00 4.683454704431967404e-18 5.932928113235194978e+00 2.369708638092500055e-17"
"3.000000000000000000e+00 8.000000000000000000e+00 1.622718793298063228e-18 2.069331860894302100e+00 1.332199617002374374e-18"
"2.000000000000000000e+00 9.000000000000000000e+00 9.047585073525408499e-18 8.331321337505753766e-01 4.912286087857879308e-19"
"4.000000000000000000e+00 6.000000000000000000e+00 8.021929318520891582e-18 3.597490167979736420e+00 3.306724509346455230e-20"
"5.000000000000000000e+00 6.000000000000000000e+00 9.147728948222436981e-17 1.170991666786665109e+00 3.836515012817435785e-21"
"2.000000000000000000e+00 4.000000000000000000e+00 4.195201169829914703e-18 5.116240808947232210e-01 2.951094609556839993e-18"
"6.000000000000000000e+00 3.000000000000000000e+00 1.160123729958857406e-18 2.259836262218765768e+00 1.197132623304392431e-18"
"4.000000000000000000e+00 6.000000000000000000e+00 1.463368618846331192e-18 3.625979048329505350e+00 1.007336879945383008e-18"
"3.000000000000000000e+00 2.000000000000000000e+00 4.750673963853375409e-17 3.641185913223601300e+00 6.019607660918929223e-23"
"6.000000000000000000e+00 5.000000000000000000e+00 1.301233504107680349e-17 2.153111726529040482e+00 2.944122605345800466e-20"
"3.000000000000000000e+00 7.000000000000000000e+00 7.414834520011253647e-17 1.844118225218603691e+00 6.618193509203561135e-22"
"9.000000000000000000e+00 6.000000000000000000e+00 4.077019727637265459e-18 2.430058349274822227e+00 7.558652574211988318e-19"
"1.000000000000000000e+00 1.000000000000000000e+00 4.691254133288448370e-18 2.291265149985873162e+00 2.194865391115089752e-22"
"1.000000000000000000e+00 7.000000000000000000e+00 3.177520667089765644e-18 3.099613866876956281e-01 2.843630158965470003e-17"
"7.000000000000000000e+00 1.000000000000000000e+00 6.410980902474251846e-18 3.191292866791270555e+00 3.970890860212438374e-21"
"5.000000000000000000e+00 6.000000000000000000e+00 2.506859471739889933e-18 5.270016984643103974e+00 8.596624021623657320e-18"
"5.000000000000000000e+00 1.000000000000000000e+00 3.464192437788335363e-17 9.592679857283589184e-01 1.528357171087389487e-21"
"8.000000000000000000e+00 7.000000000000000000e+00 6.461211101025578879e-18 7.302962230428601265e-01 1.536927211769523226e-17"
"7.000000000000000000e+00 2.000000000000000000e+00 3.820801219859763639e-17 2.441869992913731302e+00 5.734715950045898954e-22"
"7.000000000000000000e+00 6.000000000000000000e+00 9.769773612460225468e-17 7.597447078316938995e-01 3.252116233025387603e-20"
"8.000000000000000000e+00 9.000000000000000000e+00 2.882321401718943258e-18 6.037717906208601715e-01 2.656319012474321514e-16"
"7.000000000000000000e+00 4.000000000000000000e+00 2.441443924787365756e-17 6.121000031125895191e+00 1.016261828122074275e-16"
"1.000000000000000000e+00 4.000000000000000000e+00 2.981151075239109150e-18 3.896487101347803517e+00 8.019662627290448775e-21"
"8.000000000000000000e+00 4.000000000000000000e+00 1.524859401674680570e-17 1.700636485135086140e+00 4.593577043218005099e-20"
"5.000000000000000000e+00 1.000000000000000000e+00 3.528318342653024379e-18 1.500735854141997727e+00 3.089626235345693668e-20"
"8.000000000000000000e+00 7.000000000000000000e+00 2.093261404084852879e-18 2.967421664739426479e+00 2.417292176945029239e-18"
"9.000000000000000000e+00 2.000000000000000000e+00 3.943650641121725841e-18 1.695591406260611889e+00 2.192379339914917950e-19"
"6.000000000000000000e+00 2.000000000000000000e+00 1.868062263089878497e-17 1.175072379511287401e+00 1.454008359429034264e-20"
"1.000000000000000000e+00 4.000000000000000000e+00 6.114909498272393549e-18 3.988017579489349651e-01 9.246199213969785547e-19"
"5.000000000000000000e+00 9.000000000000000000e+00 7.880174392538101106e-17 3.032667098438696751e+00 1.091278131070368198e-21"
"7.000000000000000000e+00 7.000000000000000000e+00 1.371242523137519271e-17 3.272157037947383884e+00 4.284226003955573781e-20"
"9.000000000000000000e+00 2.000000000000000000e+00 2.508546405163304181e-17 2.508954611431225779e+00 2.099439549906360551e-21"

Dataset Card for SRSD-Feynman (Hard set)

Dataset Summary

Our SRSD (Feynman) datasets are designed to discuss the performance of Symbolic Regression for Scientific Discovery. We carefully reviewed the properties of each formula and its variables in the Feynman Symbolic Regression Database to design reasonably realistic sampling range of values so that our SRSD datasets can be used for evaluating the potential of SRSD such as whether or not an SR method con (re)discover physical laws from such datasets.

This is the Hard set of our SRSD-Feynman datasets, which consists of the following 50 different physics formulas:

Click here to open a PDF file

More details of these datasets are provided in the paper and its supplementary material.

Supported Tasks and Leaderboards

Symbolic Regression

Dataset Structure

Data Instances

Tabular data + Ground-truth equation per equation

Tabular data: (num_samples, num_variables+1), where the last (rightmost) column indicate output of the target function for given variables. Note that the number of variables (num_variables) varies from equation to equation.

Ground-truth equation: pickled symbolic representation (equation with symbols in sympy) of the target function.

Data Fields

For each dataset, we have

  1. train split (txt file, whitespace as a delimiter)
  2. val split (txt file, whitespace as a delimiter)
  3. test split (txt file, whitespace as a delimiter)
  4. true equation (pickle file for sympy object)

Data Splits

  • train: 8,000 samples per equation
  • val: 1,000 samples per equation
  • test: 1,000 samples per equation

Dataset Creation

Curation Rationale

We chose target equations based on the Feynman Symbolic Regression Database.

Annotations

Annotation process

We significantly revised the sampling range for each variable from the annotations in the Feynman Symbolic Regression Database. First, we checked the properties of each variable and treat physical constants (e.g., light speed, gravitational constant) as constants. Next, variable ranges were defined to correspond to each typical physics experiment to confirm the physical phenomenon for each equation. In cases where a specific experiment is difficult to be assumed, ranges were set within which the corresponding physical phenomenon can be seen. Generally, the ranges are set to be sampled on log scales within their orders as 10^2 in order to take both large and small changes in value as the order changes. Variables such as angles, for which a linear distribution is expected are set to be sampled uniformly. In addition, variables that take a specific sign were set to be sampled within that range.

Who are the annotators?

The main annotators are

  • Naoya Chiba (@nchiba)
  • Ryo Igarashi (@rigarash)

Personal and Sensitive Information

N/A

Considerations for Using the Data

Social Impact of Dataset

We annotated this dataset, assuming typical physical experiments. The dataset will engage research on symbolic regression for scientific discovery (SRSD) and help researchers discuss the potential of symbolic regression methods towards data-driven scientific discovery.

Discussion of Biases

Our choices of target equations are based on the Feynman Symbolic Regression Database, which are focused on a field of Physics.

Other Known Limitations

Some variables used in our datasets indicate some numbers (counts), which should be treated as integer. Due to the capacity of 32-bit integer, however, we treated some of such variables as float e.g., number of molecules (10^{23} - 10^{25})

Additional Information

Dataset Curators

The main curators are

  • Naoya Chiba (@nchiba)
  • Ryo Igarashi (@rigarash)

Licensing Information

MIT License

Citation Information

[Preprint]

@article{matsubara2022rethinking,
  title={Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery},
  author={Matsubara, Yoshitomo and Chiba, Naoya and Igarashi, Ryo and Ushiku, Yoshitaka},
  journal={arXiv preprint arXiv:2206.10540},
  year={2022}
}

Contributions

Authors:

  • Yoshitomo Matsubara (@yoshitomo-matsubara)
  • Naoya Chiba (@nchiba)
  • Ryo Igarashi (@rigarash)
  • Yoshitaka Ushiku (@yushiku)
Downloads last month
2
Edit dataset card
Evaluate models HF Leaderboard