How to make Nixpkgs more eco-friendly / use fewer resources

How does it compare to nix-casync, a more efficient way to store and substitute Nix store paths?

1 Like

There is work in progress: https://obsidian.systems/blog/nix-ipfs-milestone-1

1 Like

Won’t the binary change every time the store path of one of its dependencies changes? The binary’s rpath must contain those paths.

1 Like

The Solaris IPS packaging system did away with transferring package archives entirely and instead transfers individual files, using a content-addressing scheme. This was in large part because even when a package update changes something, many files are unaffected and can be reused between versions.

While you might lose a little efficiency from single-archive compression (many doc files compressed together, etc.), that loss seems outweighed by avoiding repeated downloads.

Because of reproducibility, it’s hard to skip rebuilds. Guix has a “graft” system for package updates with really minor changes, to avoid recompiling the whole dependency graph, but I suppose it kills reproducibility?

There already is: in typical Nix fashion, it’s called replaceRuntimeDependencies. It works by going through every file in the system and replacing the store path of the original package with that of the replacement.

It won’t work for every package update, though: it assumes the replacement store path has the same length as the original’s. So you could do this for security patches and minor bug fixes.
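
For illustration, a minimal NixOS sketch of what such a graft-style replacement looks like; the package choice and patch file are hypothetical:

```nix
{ pkgs, ... }:
{
  # Swap a patched openssl into every store path that references the
  # original, without rebuilding the reverse dependencies. Note the
  # same-path-length constraint mentioned above: this fits small fixes,
  # not version bumps that change the store path's length.
  system.replaceRuntimeDependencies = [
    {
      original = pkgs.openssl;
      replacement = pkgs.openssl.overrideAttrs (old: {
        patches = (old.patches or [ ]) ++ [ ./security-fix.patch ]; # hypothetical patch
      });
    }
  ];
}
```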

If the goal is to increase NixOS sustainability, I’m not sure using IPFS will be an improvement: the IPFS node software is a big resource hog. I admit it’s been a few years since I last tried it, but an idle node would use a few percent of CPU and several Mb/s of bandwidth with just a handful of pinned files.

IPFS is all nice (at least in theory) and it gives file-level granularity, but we could probably get away with just distributing NARs much more efficiently using plain old torrents.

1 Like

IPS was incredibly slow; I’m curious whether that was due to updating files one by one.

IPFS draws CPU when it’s actively contributing to P2P network routing; that is not mandatory, and not really useful if you only use it locally to access IPFS content. It has also gotten better with respect to resource usage.
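
For what it’s worth, a hedged NixOS sketch of running the daemon in client-only routing mode, which avoids most of that idle cost; Routing.Type = "dhtclient" is a real Kubo (go-ipfs) setting, though the exact module option layout may differ by NixOS release:

```nix
{
  services.ipfs = {
    enable = true;
    # Participate in the DHT as a client only: fetch content without
    # serving routing queries for the rest of the network.
    extraConfig = {
      Routing.Type = "dhtclient";
    };
  };
}
```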

2 Likes

I’m not sure exactly, and honestly my experience with it wasn’t too bad, but I’ll posit some combination of:

  • insufficient download parallelism and/or streaming with earlier HTTP versions
  • IOPS amplification and latency cascades from small files and duplicated metadata updates
    • likely running on early ZFS, which had some pretty high transaction-commit latency
    • likely running on spinning media with short concurrency queues
    • conservative sync writes in the package manager
    • the need to keep two copies of each file (or a hardlink in some cases?): one in the store and one in the system
  • different expectations; it may have seemed slow compared to (say) apt on ext4, but it was already faster and more convenient than the previous Solaris pkg system, so there was plenty of room to start with a conservative implementation and optimise later

I wrote Spongix after evaluating nix-casync. Compared to it, Spongix offers:

  • garbage collection based on LRU/max-cache-size and integrity checking
  • metrics
  • proxying and caching multiple upstream caches behind itself
  • uploading chunks to S3 compatible stores
  • signing narinfos if no signature is present (for automated build farms on top of Cicero)

It probably has a few more features that I forgot about, but we’ve been running it in production at IOG for a few months now, and it performs much better than our previous Hydra->S3 setup.
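
For anyone wanting to try it: clients point at a cache like this the same way as at any other binary cache; a minimal NixOS sketch, where the address and key are placeholder assumptions about your deployment:

```nix
{
  nix.settings = {
    substituters = [
      "http://spongix.internal:7745" # assumed host/port for the local cache
      "https://cache.nixos.org"
    ];
    trusted-public-keys = [
      "spongix.internal:<your-cache-public-key>" # placeholder
      "cache.nixos.org-1:6NCHdD59X431o0gWypbMrAURkbJ16ZPMQFGspcDShjY="
    ];
  };
}
```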

8 Likes

CA derivations are strictly better, but it is really hard to get shit merged in this community; thus Hydra still doesn’t have support for them upstream, and we cannot begin testing things in Nixpkgs.

I remain incredibly frustrated that we have this thing 90% done, yet there is no will to unblock willing maintainers who are ready to do the work of getting the feature out into the real world.
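
For readers who haven’t tried them: a floating CA derivation is just a normal derivation plus three attributes; a minimal sketch (needs experimental-features = ca-derivations in nix.conf, and the builder path is simplified for illustration):

```nix
derivation {
  name = "hello-ca";
  system = builtins.currentSystem;
  builder = "/bin/sh";
  args = [ "-c" "echo hello > $out" ];
  # The output path is derived from the output's content rather than
  # from the inputs, so identical outputs dedupe across rebuilds.
  __contentAddressed = true;
  outputHashMode = "recursive";
  outputHashAlgo = "sha256";
}
```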

7 Likes

Part of the problem is that the roadmap targets flakes for 3.0 and CA for 4.0.

We are not at 3.0 yet, and flakes are far from ready, despite what Eelco says…

And the last time I tried CA, despite having set up the CA-enabled cache, it bootstrapped some compilers, which ultimately caused a world rebuild. Also, Cachix does not yet support CA.

And what annoys me about CA: its design still relies on a single authority per cache to map IA (input-addressed) paths to CA paths as known by that particular cache.

What we need instead is a distributed “trust” network that can point from an IA path not only to the CA path, but also to a “store”/“mirror”/“cache”/IPFS node or whatever from which to download it.

3.0 is already a big giant mess that we cannot review; we need to minimize the scope. Flakes are a huge ball of unaudited complexity that we are in no way ready to stabilize in one go.

3 Likes

I am not saying flakes should be 4.0 and CA 3.0, but we should focus on layering so we can stabilize, e.g., part of the new CLI (stuff like nix show-derivation) without worrying about flakes.

8 Likes

This is an awesome tool!

Could this be modified to be used by end users (people not building projects) as a way to improve Nix efficiency?

I run it on my personal machines, alongside regular automatic Nix GC runs, and it saves a lot of bandwidth. So yes, that’s totally possible.

I currently use an nginx reverse proxy as a substituter, caching packages on my LAN; Spongix could be a great replacement.
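
For reference, a hedged sketch of that kind of LAN caching proxy expressed as NixOS nginx options; the hostname and cache sizes are illustrative assumptions:

```nix
{
  services.nginx = {
    enable = true;
    # On-disk cache for upstream NARs and narinfos.
    appendHttpConfig = ''
      proxy_cache_path /var/cache/nginx/nix levels=1:2
        keys_zone=nixcache:10m max_size=50g inactive=30d use_temp_path=off;
    '';
    virtualHosts."nixcache.lan" = {
      locations."/" = {
        proxyPass = "https://cache.nixos.org";
        extraConfig = ''
          proxy_cache nixcache;
          proxy_cache_valid 200 30d;
          proxy_set_header Host cache.nixos.org;
          proxy_ssl_server_name on;
        '';
      };
    };
  };
}
```

LAN clients then just list http://nixcache.lan in nix.settings.substituters.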

I fail to see how running Spongix on a single machine can help: you still have to download packages. Is it because it allows you to retrieve packages that have been GC’d? If so, there is a design issue here: you have a local package cache used to get packages that were GC’d from the store.

I think phrasing this from an “eco-friendly” standpoint is rather pointless: 130 TB is nothing (it fits on a handful of hard drives).
Phrasing this in terms of build turnaround time is more effective and a better motivation to address these issues.

4 Likes

Hopefully content-addressed packages can lead to the holy grail of build farms…

distributed trustless builds…

I know Adam and co. have been working on Trustix, a distributed build system… imagine your builds being done on machines directly connected to renewable energy sources, or wherever the sun happens to be in the sky on Earth. :-)

If this can somehow be linked to a way for builders (“miners”) to get rewarded for building derivations for others, then centralized building can become a thing of the past.

Hydra goes from 1,000 CPUs to many, many thousands; use IPFS or Hypercore to distribute said builds…

It’s a bit Star Trek, but if it works, it would probably change the course of software building and distribution forever… I mean, you can’t expect to do a nixos-rebuild switch on Mars and fetch everything from Earth over a rather low-bandwidth and “slightly” high-latency TCP connection, can you?

“Nix = $” would never be a truer statement.

2 Likes

Trustless distributed builds are probably a long way off. From my understanding, Trustix is more about verifying builds across independent builders. The problem is that demonstrating that two builders are independent is hard, and you need to trust someone or something to demonstrate that they are independent (though it would certainly still require less trust than trusting a single Hydra provider).

However, safe distribution of files is totally possible with IPFS (which, by the way, stands for InterPlanetary File System) or the like, possibly including a signed “input → CA (and IPFS) hash” mapping (with IPNS possibly serving as that signature). (Though nobody will have interplanetary latencies of more than a few seconds for a long time yet, this may still help with load balancing and performance.)

Also, the centralized architecture as used today probably doesn’t prevent using certain hosts under certain conditions (if we ignore the fact that this might not even be a good idea to begin with due to fabrication costs, though I’m not knowledgeable about that). The Internet is probably fast enough that transferring data to the other side of the world is totally possible and easily achievable (even though the builder still needs to be trusted, sadly).