New Merge Policy for an always-green hydra

jonringer · September 18, 2020, 7:07pm

That would be incredibly useful.

However, people will need to be disciplined about their git history, otherwise there will be false positives.

Ericson2314 · September 18, 2020, 7:39pm

Oh I meant every commit ought to pass. We can use Git’s --first-parent (or a slightly fancier variations that understand special branches like staging-next) to skip the noise from individual PRs.

kevincox · September 21, 2020, 11:57pm

Hi everyone, I’m sorry for the delay but after the discussion settled I have finally found time to write up a requirements document. The idea here is to gather all of the concerns raised here into an easy-to-read list so that further RFCs can reference them. I have read through the comments here a couple of times but please leave a comment on the document if I missed something or you have thought of something else.

If you are interested in participating send me an email at kevincox@kevincox.ca (Warning, I will be sending a message to everyone who responds so the email you send from will be shared. Do we have a better place for organizing these things?) The next step will be attempting to break down the requirements into a set of smaller projects that can be run through the Nix RFC process and it would be helpful if we could share the load of writing those plans.

timokau · September 22, 2020, 8:08am

One more difficulty, assuming something like the “make change, then ping maintainers to fix resulting breakage” approach: Many packages have no formal maintainer. We need to decide who is responsible. One approach would be to go down the graph to the nearest maintained dependency(s).

If you want further input, there is a lot of interesting discussion (including this point) in PLEASE fix ALL packages which were broken by the Qt-wrapping-changes. Its a long thread though.

kevincox · September 22, 2020, 11:56am

This is a good point to bring up. I think we should probably go the other direction though. I think that if a package is broken we should try contacting the maintainers of the dependents next. Because their packages will be transitively broken if the issue isn’t resolved. Also pushing packages upstream is a recipe for having the maintainers of “core” libraries and tools slowly getting more and more packages pushed upon them and them getting overloaded. I don’t want the maintainers of our most critical derivations to become a dumping ground for unmaintained packages.

This does raise the question about leaf packages. We can consider doing the following things:

Don’t let them block and other changes, they quickly get marked as broken.
Notify anyone who has changed the file.
- We might want some systematic way to filter out tree-wide changes.
Notify the maintainers of dependencies or other related packages and ask if they are interested in maintaining the package.

But at the end of the day if the maintainer of a leaf package leaves I think we just need to let it wither (marked as broken) and we should probably periodically sweep broken packages from the repo.

This also doesn’t specify how we determine if a maintainer has abandoned a package. It would be nice to have some sort of automation around this as well. However if we see they have been inactive for an extended period of time (1 month?) we can remove them from packages. Worst case they are on an extended offline period and can revert the PR to claim their packages back.

I think PLEASE fix ALL packages which were broken by the Qt-wrapping-changes - #20 by timokau is the right call. I think that we need to define some policy around this (submitting an RFC is high on my dependency list for the merge queue). I think marking things is broken makes sense and would want to define some sort of standard workflow around this. For example take the following option as a base example for changes that break more dependents than you can, or wish to fix yourself.

The breaking PR is reviewed, but not merged. (The merge queue bot wouldn’t allow it anyways)
- The PR branch is now “the staging branch” for this breaking change.
Once it is approved maintainers of any broken dependents are notified.
- They are expected to respond to the breakage. They should create a fix and merge it to master, if it is backwards compatible, or to the staging branch.
After at least 1 week the any unfixed packages are marked as broken.
At this point the staging branch should be “green” and is merged.

To me this sounds like acceptable overhead for the “parent maintainer” assuming that we can mostly (or completely) automate sending notifications and marking packages broken.

timokau · September 23, 2020, 8:47am

Yes, that’s what I had in mind (though not what I said ). Its still not entirely trivial, since you would have to decide which “reverse maintainers” are responsible (which might be a lot). This is not a blocker though, we would just need to decide on something.

I think this is the way to go. If somebody cares about the package, they can then adopt it after they notice it has been marked as broken and doesn’t have a maintainer.

We need more defined guidelines for package inclusion might interest you.

timokau · September 23, 2020, 8:51am

I’m glad we are in agreement and it would be great to have somebody actually work on this. I would recommend to decide on a minimal, useful subset of the process first and ideally also develop some tooling for it. Then we could trial it on a volunteer basis and set it in stone afterwards. I imagine the success probability of such an incremental approach is a bit higher than an all-out RFC. Your call of course, if you want to work on this.

kevincox · September 24, 2020, 11:55pm

It’s me again.

I have broken down a list of policies that we need to agree upon and changes that need to be made in order for us to get the the state where a merge queue system would be feasible. Right now I basically made a list of documents and put in rough summaries for what I think needs to be done/resolved by each document. I hope to eventually complete the write ups and submit them though the Nix RFC process.

If you are interested in helping you can take up any of the following:

Review started documents: While the biggest review will come from the nix RFC process early feedback would be very helpful. Simply leave comments or send me an email. If you log into Dropbox you should be able to subscribe to changes in the Overview document. This will allow you to know when new documents are ready for review.
Ownership of an RFC: If there is a topic that you are passionate about I would be happy if someone else could take ownership. There is a lot of work required to move us towards a merge queue and I don’t have loads of time so all help is appreciated.
Suggest missing RFCs: Do you think that the proposed list of RFCs is insufficient to resolve the requirements? Let me know so that we can figure out how to address it.

kevincox · September 25, 2020, 12:14am

This is a very interesting post, I wish I was aware of it when it was happening. There are some points that I wish I could have made but they are a bit off topic for this thread. However on topic is that they say that it would be useful to identify idle maintainers. I strongly agree, partially because robots enforcing policy often cause less offense than humans doing the same but mostly because it can be hard for any one person to notice a maintainer is idle. I think that we should have a policy like “a package marked as broken or insecure for 90 days will be deleted”. This is something that I want to address with nixpkgs Merge Queue - Unmaintained Package Removal.

7c6f434c · September 25, 2020, 7:19am

Just in case, I support permanently marking ghostscript as insecure and oppose its deletion.

(It is clearly unlikely to be safe for malicious files in forseeable future, and it is also quite a useful tool for processing files of known-benign origin)

timokau · September 25, 2020, 8:04am

Removing the source code for packages is a whole different can of worms that has also been discussed on discourse a couple of times already. Unfortunately I can’t think of a specific thread right now. Personally I don’t see much gain in it, but some loss (better discoverability of the work, easier to fix up a broken package than creating a new one). In any case, I would tackle one policy change at a time and discuss that separately later

timokau · September 25, 2020, 8:11am

I don’t think you need RFCs for tools. There isn’t really anything to decide/agree/disagree there. If we agree on the policy, then somebody just has to implement the tools to make it feasible. The tools could even be developed without any RFC, as long as their use is optional.

As I said, I’d postpone the “removal” bit for now, since that seems unrelated to the general goal. The “arch specific” and “flaky” policies seem to be necessary (if I’m not missing something?) to come to any conclusion on the general “breaking change policy”, so those should probably be merged.

kevincox · September 25, 2020, 12:17pm

timokau Great contributor
September 25 |

| - |

I don’t think you need RFCs for tools. There isn’t really anything to decide/agree/disagree there. If we agree on the policy, then somebody just has to implement the tools to make it feasible. The tools could even be developed without any RFC, as long as their use is optional.

The thing about these tools is that they will have a lot of policy encoded in them. Like how to notify maintainers and similar. This is why I think it makes sense to discuss beforehand. Maybe they won’t actually be put to the nixos/rfcs repo but I figured they would require some discussion. I also think the most value will come if the tools are not optional (run automatically) so that is another reason we would want some sort of RFC.

As I said, I’d postpone the “removal” bit for now, since that seems unrelated to the general goal.

Fair enough, I don’t think it is actually blocking (once marked broken they aren’t a problem for the merge queue) so we can postpone that discussion to another time.

The “arch specific” and “flaky” policies seem to be necessary (if I’m not missing something?) to come to any conclusion on the general “breaking change policy”, so those should probably be merged.

Do you be merged into one document? The reason that I put them separate is because I think the arch-specific solution will require something technical while the “flaky” will likely be a more social problem. If we be able to solve them with the same solution all the better and we can merge them.

timokau · September 28, 2020, 9:17am

I think the policy (how to notify etc.) belongs into the policy RFC. People will be hesitant to accept an under-specified RFC without those policy details anyway. I agree that those tools would work best if mandatory, but they could still be trialled and proven in an optional manner. If you’re willing to do the work, its your call how to go about it of course

If I understand it correctly, the breaking change policy requires an always-green master as a pre-requisite. If that is true, its not really possible to proceed without covering the flaky/arch-specific problems at the same time.

anka-213 · October 14, 2020, 7:55pm

Regarding the problem of bisecting which commit caused a failure, wouldn’t something like this

cd nixpkgs
git checkout $knownFailing
newHash=$(nix-instantiate . -A $package)
git bisect start $knownFailing $knownWorking
git bisect run sh -c "nix-instantiate . -A $package | grep -v $newHash"

that simply checks which commit modified the package (or its dependencies) at all, cover 99% of all cases? How often are there many changes to a package in a single batch?

On the other hand, if that weren’t the case, I guess the batching wouldn’t gain much?

(Sorry for the slight necro)

kevincox · October 14, 2020, 8:20pm

Yeah. I think something like that is what we would want to do. We could
probably even just check what changes affect the failed derivations.
Then we can treat the commits where they changed as “bisection” points.

I was hoping that we could use Bors but it doesn’t appear to support the
logic to do this. We may need to add the feature or write our own
tooling: Optimized bisection - Support - Bors Forum

kevincox · March 12, 2021, 10:39pm

Sorry for the delay everyone but life got in the way. I would love to pick this back up by submitting the first document to the nix RFC process. In order to do that I need a co-author to help with the RFC. I would appreciate a volunteer.

ryantm · March 13, 2021, 1:32am

You do not need a co-author to submit an RFC.

blaggacao · April 12, 2021, 5:57pm

@nrdxp If you can find time, I’d consider you’ve got the expertise and the vibe.

I very much care, but, due to my dialectic communicative nature, I guess I could be an obstacle rather than an aid.

You need somebody that scores highest on agreeableness.