https://github.com/hanabi1224/Programming-Language-Benchmark...
Here's the measurement code, it appears to be significantly more complicated than a simple fork/exec/wait loop but that could just be all the C# getting in the way: https://github.com/hanabi1224/Programming-Language-Benchmark... Note that we are definitely measuring the C# async runtime to some degree. Nevertheless you are probably right that the bulk of this 1.8ms is in the executable under test, and it truly is just bloat. Running `hyperfine ./empty-main-function` from rustc on my Mac gives 0.8ms.
parallel:
for i in 0..
(https://github.com/hanabi1224/Programming-Language-Benchmark...)It's almost four times as slow as the otherwise identical single-threaded version.
https://github.com/hanabi1224/Programming-Language-Benchmark...
https://github.com/hanabi1224/Programming-Language-Benchmark...