Out of curiosity, I’ve just taken the top 5000 packages of “Statistics by source packages (max)” sorted by “inst” from https://popcon.debian.org/ ; and grepped the nixpkgs source code for these package names. I then eliminated packages that contained
node, as we probably have them and the name would never match. This is quite obviously a bad evaluation (eg. just in the first page of the csv
slang2 is not found as “in nixpkgs” because nixpkgs calls it
slang), but whatever, it may still be interesting to have a look at.
It left me with 4518 packages, of which 1748 are not in nixpkgs. Obviously there are still a lot of false positives (eg. ripgrep is rust-ripgrep for debian…), and I’m not totally sure what this list precisely is about, but I feel like it gives an overall idea: we probably have around 500-1500 packages missing to have the debian-popcon top 5000.
Note that this comparison is quite dumb though, as popcon is voluntary, my testing is really really simple, I also count debian-specific packages like debconf, etc. But I think it does show that we still have a way to go to have all useful packages.
(FWIW, I’ve opened Introduce some kind of popularity contest/popcon? · Issue #159 · repology/repology-webapp · GitHub to see whether repology might be interested in introducing something like that directly on there, as it has all the normalization logic already they probably could do much better than I could by blindly grepping through nixpkgs)