I agree Criterion is excellent, but its output seems too focused on individual benchmarking. I wrote some more code to provide higher-level overview, how different cases (different crates with comparable functionality in my case) compare with each other. Feel free to pick what suits from you from my benchmarking project, or ideas. Sorry for some Python. I think a good Rust based package that provides overview, and also tracks history, checks for regression, is yet to come.