Why is the nix-compiled Python slower?

We are bumping into this issue as well, but I see up to 40% slowdown on a GPU-driven load (basically Amdahl’s law in effect, the less we spend in single-threaded Python, the better). We can override Python to use optimalizations, but this is quite inconvenient – we’d need to have a binary cache to avoid build times. Also a handful of derivations fail to build, probably because our builds have more idle CPU cores available, so we hit more concurrency issues in Python tests (hope to provide some PRs later).

1 Like