What commits belong to which channel

NobbZ · May 16, 2023, 7:51am

I want to optimise bisection workflows. Perhaps with some script even.

Currently, for any bisection over nixpkgs, you usually quickly run into a commit that has a lot of dependencies involved, not built by hydra, because you get a staging-merge or end up on staging itself, due to the way how the bisection went.

So my idea was, to do a 2 pass bisection, to avoid the amount of necessary rebuilds.

In the first pass only consider commits that have been built by hydra and considered to be good enough to be a new channel push. git bisect skip all the commits that are not.

In the second pass consider all the commits between the last known “good” and “bad” commits.

To be able to implement this though, I’d need such a list of commits that hydra created a new channel push from, is there an easy way to get such a list of commits? I think it would still be sufficient for my purposes, if this is only available for the last year of the unstable channels and the last 2 “full” release channels.

Does someone have an idea?

My current approach is to nix eval some outPaths of well known and expensive derivations, which in itself costs some time, extract the hashes, query https://cache.nixos.org/$hash.narinfo for availability and skip if any of them is missing.

This list currently for me contains gcc, llvm, rust in various versions, as well as systemd. I include the very same checks in every bisection I do from the start, ignoring whether an individual from this list would make sense, after python regularly surprised me with 3 LLVM rebuilds regularly because of some cryptography dependency.

I hope to cut a lot of the intermediate evals when I know in advance whether or not a commit has been built by hydra or not.

jtojnar · May 16, 2023, 9:34am

nix-bisect allows you to skip commits that would trigger more rebuilds than the max-rebuilds flag.

And there is also hydrasect, which sounds closer to what you want.

NobbZ · May 16, 2023, 9:44am

Nice feature! Not quite exactly what I want though. This might skip some stuff where the “builds” are basically no-ops but many of them, while the dreaded hour long builds of LLVM or a GCC bootstrap cycle can still happen.

I agree this is unlikely, but possible in general.

Yes, nice tool, I think I will at least take some inspiration from it.

Feels a bit weird, that there is no nix expression in the repo for the build…

Thank you!

wamserma · May 16, 2023, 11:54am

NobbZ:

I want to optimise bisection workflows. Perhaps with some script even.

Currently, for any bisection over nixpkgs, you usually quickly run into a commit that has a lot of dependencies involved, not built by hydra, because you get a staging-merge or end up on staging itself, due to the way how the bisection went.

So my idea was, to do a 2 pass bisection, to avoid the amount of necessary rebuilds.

In the first pass only consider commits that have been built by hydra and considered to be good enough to be a new channel push. git bisect skip all the commits that are not.

In the second pass consider all the commits between the last known “good” and “bad” commits.

To be able to implement this though, I’d need such a list of commits that hydra created a new channel push from, is there an easy way to get such a list of commits? I think it would still be sufficient for my purposes, if this is only available for the last year of the unstable channels and the last 2 “full” release channels.

Does someone have an idea?

github.com

wamserma/flake-programs-sqlite/blob/31aaad2e7851caa2a5bbcff0420a71746e4f8030/sources.json

{
  "2dea0f4c2d6e4603f54b2c56c22367e77869490c": {
    "name": "nixos-23.05pre433819.2dea0f4c2d6",
    "url": "/nixos/unstable/nixos-23.05pre433819.2dea0f4c2d6/nixexprs.tar.xz",
    "nixexprs_hash": "2e5607a1b93b401fc12706b806380541b48c56c7ac555a6648ce7bf0cd0e4ce3",
    "programs_sqlite_hash": "53a713335bae981b1bb15b5ac3df9b645f4676d2a99b8141731f1e27e772a194"
  },
  "850c27142ac01525b751e2c9d97138fe5d8a0ad0": {
    "name": "nixos-22.05.4481.850c27142ac",
    "url": "/nixos/22.05/nixos-22.05.4481.850c27142ac/nixexprs.tar.xz",
    "nixexprs_hash": "27a16434e959ef08de567faa8946c7fa0603e51bd26292d59f19b8961eeb65bf",
    "programs_sqlite_hash": "4837882315934c7f6f8f691fd40ddc04c523cdd4222a0f596398b0d0a14e0b52"
  },
  "e8ec26f41fd94805d8fbf2552d8e7a449612c08e": {
    "name": "nixos-22.11.758.e8ec26f41fd",
    "url": "/nixos/22.11/nixos-22.11.758.e8ec26f41fd/nixexprs.tar.xz",
    "nixexprs_hash": "7a11987e00ef163293176faaad4746965432e820eb531907c92ff5358a340575",
    "programs_sqlite_hash": "afc2bb2d22d3534b1ce075c1953aeb5a4fe7a6a85715479b3786027259fd83da"
  },
  "64e0bf055f9d25928c31fb12924e59ff8ce71e60": {

This file has been truncated. show original

Atemu · May 16, 2023, 5:17pm

I usually just do a first-parent bisect on master. A commit being on master usually means that it’s built. There’s only very rarely >1 rebuild of some package in the couple hour window between hydra evals.

That way the pain only begins when the first pass lands you on a staging-next merge.

NobbZ · May 16, 2023, 5:25pm

I tend to hit them after the second to third bisection step, unless I do semi-intelligent skipping.

vcunat · May 17, 2023, 6:46pm

I didn’t know there was git bisect start --first-parent. @NobbZ: I assume you did not use that one.

NobbZ · May 17, 2023, 7:49pm

I am aware, still hits staging merges quite often

But thanks!

Atemu · May 18, 2023, 12:49pm

Well, of course. If the real root cause went to staging, you will obviously have to bisect staging. No way around that.

What the simple first-parent strategy does is entirely avoid commits of the staging branch if the cause went to master which is why I think it’s super useful.
What it also does is tell you which staging cycle the cause is part of; allowing a narrower frame for the costly staging bisect; the second phase.

Atemu · May 18, 2023, 12:53pm

Re-read your comment and there’s an uncertainty: What exactly do you mean by “hit”? Land on it as the cause or a staging commit being a bisection step?

With first-parent, you should never land on a staging commit as a bisection step, only staging-next merges (as in “merge staging-next into master”) as the root cause and that would still be cached.

NobbZ · May 18, 2023, 2:08pm

I am talking about staging next, and didn’t yet had good experiences them being cached.

Or at least not what I needed from it.

vcunat · May 19, 2023, 4:47am

Yes, most of the combinations aren’t built (on staging and staging-next). That’s the main reason why they were created in the first place.