Figure out how / when to run / update these. - [ ] Consider the [RUBI benchmark set of problems](https://rulebasedintegration.org/testResults.html) - The visualization could definitely be improved - [ ] Consider the [Sympy setup](https://github.com/sympy/sympy_benchmarks), [visualized here](https://www.asmeurer.com/sympy_benchmarks/#regressions?sort=3&dir=desc) - [ ] Also the [PyPy benchmark](https://speed.pypy.org/)