Cockpit dashboard/management interface package does not work properly

ShockWave-1 · December 14, 2024, 12:25pm

Im new to nix and I got here through frustration with other distributions creating a server environment. So I might not be entirely right by my judgement of the situation but it seems the cockpit nix package doesn’t work properly.

It starts up however some of the subprocesses/helper programs that cockpit uses dont seem to be pathed properly so things like installing extra cockpit modules(the derivations of said modules do work) and things like privilege escalation dont work.

When I try:

find /nix/store -name '*cockpit*'

All the necessary processes seem to be there but cockpit still looks in the old locations for some things. An example like cockpit-askpass is running

find /nix/store -name cockpit-askpass

produces:

./m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/libexec/cockpit-askpass`

but I still get:

after which cockpit promptly logs out of the user.

This is all the configuration related to cockpit:

  environment.systemPackages = with pkgs; [
   [...]
    cockpit
    outputs.cockpit-modules.virtual-machines
  ];
  [...]
  systemd.services.cockpit = {
    environment = {
      PATH = lib.mkDefault "${pkgs.cockpit}/libexec:${pkgs.coreutils}/bin";
    };
  };
  services.cockpit = {
    enable = true;
    port = 9090;
    settings = {
      WebService = {
        AllowUnencrypted = true;
      };
    };
  };

I have tried tying the cockpit paths overriding systemd but Im not sure if the paths or if its setting in general. Since this doesnt work. Im also unsure how to print out variables I have used to check if its working. Using lib.mkForce instead of lib.mkDefault provides the same bad result as in the screenshot so hierarchy isnt the issue I think.

I could also map them to /usr/libexec/cockpit-* but defeats the point of nixos. So thats not a solution I tried yet.

Any ideas?

Thanks in advance.

TLATER · December 14, 2024, 1:10pm

What are you trying to do when you get that message? Cockpit is pretty complex, and the services have all kinds of limitations in their envs for security purposes, so it’s entirely possible that you’re just doing something that the cockpit nix module isn’t intended to do (and not configured to give particularly nice error messages for).

IMHO cockpit generally doesn’t mesh well with NixOS, a lot of what it does is very imperative and just won’t work properly around these parts - but then, it’s been like half a decade since I last properly looked at it.

This stuff is pointless, by the way:

lib.mkDefault will make this be overridden by settings without a priority override, so this will just be ignored because the upstream module sets PATH without priorities AFAICT. Using mkForce sounds like it would have… interesting results, but probably not what you intend. If you want to add stuff, just write packages to the path attribute (without any priority overrides).

That said, the module already has coreutils set in the path, and stuff in libexec is explicitly never intended to be in $PATH, so it’d be very odd for the upstream build scripts not to do that correctly (and the module maintainers somehow not to notice).

I think it’s far more likely that what you’re doing runs up against systemd hardening and is just prohibited by the NixOS module than that the package is broken.

ShockWave-1 · December 14, 2024, 2:15pm

Thanks for the reply.

I dont believe Im doing anything strange although maybe I should try and run a cleaner environment to test. What I did with cockpit is how most ppl have configured it. For them it seems to work and but not in my situation which led me to think it was the package. Im also developing the config in a VM though so maybe its related to that. Although I couldnt think of a reason why. Thats my best guess.

I tried modifying the polkit permissions as well but I might have done something wrong with that. I was thinking cockpit might have been working with its own users/permissions/rules but I didnt pursue this line of thinking.

That might explain why it didnt have any effect regardless of hierarchy. I just did lib.mkDefault bc NixOS would complain otherwise that it was already set.

Any suggestions then?
I would like to have a dashboard and I looked for alternatives for the reason you stated but im using rootless podman for my containers which drastically limits my options. I the first you see if you look up podman server dashboard/manager is the home-dashboard project but that seemed like a lot of work since you had configure a lot of the panels/modules/etc yourself.

TLATER · December 15, 2024, 4:00am

I believe you, I’d just like to know what specifically triggers that message (simple login? viewing logs?..) so I can guess which specific service to look at and think about the implications on cgroup permissions to give you a better debug experience

Personally I use a grafana dashboard, using victoriametrics to gather the specific stats I need. Indeed not very plug-and-play. This post might be helpful: Explaining modern server monitoring stacks for self-hosting with NixOS

There’s also icinga and nagios for more integrated solutions. Basically, any of the alternatives that focus less on doing administrative tasks (since those should only ever be done with nixos-rebuild, which nothing reasonably supports) and more on monitoring.

That said, clearly someone’s maintaining cockpit, and we should figure out why it’s not working for you.

If you heavily rely on containers cockpit probably also works better for you, though I’d also softly suggest thinking about using NixOS modules instead of containers, if this is an option for your infrastructure.

TLATER · December 15, 2024, 4:15am

Note that nixos-rebuild build-vm exists, it might help creating such an env.

ShockWave-1 · December 15, 2024, 6:14pm

##journalctl
`journalctl -eu cockpit` this is just from today after a restart: 
dec 15 18:09:29 nixos systemd[1]: Starting Cockpit Web Service...
dec 15 18:09:29 nixos cockpit-certificate-ensure[11073]: /nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/libexec/.cockpit-certificate-helper->
dec 15 18:09:29 nixos cockpit-certificate-ensure[11074]: .....+...+..+...+.+.....+......+....+++++++++++++++++++++++++++++++++++++++*.+...+.+........>
dec 15 18:09:29 nixos cockpit-certificate-ensure[11074]: ..+........+....+...+........+....+...+.....+.........+....+......+...+...+........+....+..+>
dec 15 18:09:29 nixos cockpit-certificate-ensure[11074]: -----
dec 15 18:09:29 nixos systemd[1]: Started Cockpit Web Service.
dec 15 18:10:59 nixos systemd[1]: cockpit.service: Deactivated successfully.
dec 15 18:10:59 nixos systemd[1]: cockpit.service: Consumed 176ms CPU time, 3.8M memory peak, 1M read from disk, 8K written to disk.

Nothing really strange here sscg isnt found but it uses openssh as fallback as shown by the ssh-pattern image.

##cockpit-bridge

I can see errors from cockpit by trying to run cockpit manually using cockpit-bridge in the terminal.

Traceback (most recent call last):
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 130, in consume_one_frame
    length = int(data[:newline])
             ^^^^^^^^^^^^^^^^^^^
ValueError: invalid literal for int() with base 10: b''

Traceback (most recent call last):
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/bin/.cockpit-bridge-wrapped", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/bridge.py", line 315, in main
    run_async(run(args), debug=args.debug)
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/_vendor/systemd_ctypes/event.py", line 135, in run_async
    asyncio.run(main, debug=debug)
  File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/runners.py", line 194, in run
    return runner.run(main)
           ^^^^^^^^^^^^^^^^
  File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/runners.py", line 118, in run
    return self._loop.run_until_complete(task)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/base_events.py", line 687, in run_until_complete
    return future.result()
           ^^^^^^^^^^^^^^^
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/bridge.py", line 166, in run
    await router.communicate()
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/router.py", line 258, in communicate
    await self._communication_done
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 192, in data_received
    result = self.consume_one_frame(self.buffer)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 132, in consume_one_frame
    raise CockpitProtocolError("frame size is not an integer") from exc
cockpit.protocol.CockpitProtocolError: frame size is not an integer

It might be nothing if I look at it from glance but it might be related to the systemd configuration not find a specific path since before these errors show up there is a warning also systemd is mentioned near the top of the trace:

cockpit.packages-WARNING: Could not detect libexecdir

This is related to executing programs by other programs which explains why no plugins and the escalation privilege porgram(cockpit-askpass) dont seem to work.
libexec also aligns with the error in the screenshot since this is where Rhel places these programs to be executed I think. This is what infer from cockpit-tls

You probably know about libexec already I am just going through my thought process.

I dont have time to go even deeper so I hope this helps. Sorry I cant do more debugging. I also am not sure where they are putting the logs bc I dont see them in /var/*

All this happens on startup.

Ill try and get more info when I have time.

Ill check them out later this week.

whats the exact difference then between a module and a container bc from my understanding right now a module is a separated out section of the configuration. In my mind that would be the same as running it on the os-level. I do have my container configs tied to systemd though, each separated into their own modules.

Thanks for the response though

ShockWave-1 · December 15, 2024, 6:16pm

I didnt know nixos could just do that nice feature. Ill try that later this week.

TLATER · December 16, 2024, 1:47am

Yep, that’s what I mean. NixOS modules generally set up services with cgroups anyway, so you don’t gain much from using containers, but you lose all the integration the module maintainer has done (and a lot of nix’ guarantees because most of your services are basically data now).

That said, this is a moot point if you have to use containers for other reasons.

That’s indeed interesting, I don’t see any explicit setup for that in the systemd services, and that binary is explicitly put in $PATH, so it should probably be expected to work.

This might be one for the maintainer, @lucasew ?

lucasew · December 16, 2024, 2:05am

Maybe it was introduced by the latest bump. I ran the NixOS tests and they passed. I am basically only trusting the test. Builds and passthru.tests run successfully? Merge!

I don’t have the time to cherry-pick cockpit from master into my systems.

EDIT 1: to monkey patch more stuff into the path the systemd.service of the services has the path option. That populates PATH properly.

ShockWave-1 · December 16, 2024, 8:46am

When I have time Ill try and run it off of the nix-unstable version. Im currently trying this on the 24.11 release. Ill also try the 24.11 release in a vm as well to see if its something weird in my environment.

For time Im already working on moving what I have working in my config from my vm to the server.
All I was going to use it for was for monitoring and the shell, so I can try it on a relatively clean bare metal environment as well.

TLATER · December 16, 2024, 11:01am

Might be good to try with 24.05 in a VM, too, in case it’s a regression.

ShockWave-1 · December 20, 2024, 8:05pm

Just tried it on 24.05 and it works as expected, its a regression.

I also tried it on my server running 24.11 and the same issues so its definitely something introduced in the stable release.

I guess Im gonna make a bug report.

1 of two Im gonna make soon. Sops-nix isn’t working properly either but ill first make another forum post before I do.

lucasew · December 23, 2024, 9:58pm

BTW when doing that bug report please show the steps to reproduce. I am guiding the bumps solely by the result of the NixOS test and I am using stable in all my machines.

That way we can reproduce the bug condition in the test and the next bumps will not have this issue.

ShockWave-1 · December 27, 2024, 2:28pm

Sorry for just now replying but is this in response to my bug report?

Are the steps not clear enough or is are you talking just in general and you havent come to it?

The report is right here:

github.com/NixOS/nixpkgs

Regression: in cockpit supporting applications cannot be used in cockpit because libexecdir cannot be found.

opened 09:53AM - 21 Dec 24 UTC

ShockWave-1

0.kind: bug

## Description Cockpit relies on multiple satellite applications to function pr…operly however in 24.11 those no longer work because cockpit cannot find `libexecdir` to use those applications for its purposes. Purposes such as privilege escalation and adding cockpit-Applications to itself. This all used to work in 24.05. ## Steps To Reproduce Steps to reproduce the behavior: ### WebGUI 1. Add to configuration: ``` environment.sessionPackages = pkgs.cockpit; services.cockpit = { enable = true; port = 9090; }; ``` 3. open in browser `127.0.0.1:9090` or `123.456.789.x:9090` on another machine and login. 4. Eith click on `Limited access` or `Turn on Administrative privileges` and watch it either restart or give an error that `libexec/cockpit-askpass` is missing. Also inspect the side menu there should be more there. Things like Networking, Storage, Applications and Software Updates are missing. Although mostly not relevant to nixos they should be there and its a symptom of the problem. ### Terminal 1. with cockpit installed run `cockpit-bridge` ``` cockpit.packages-WARNING: Could not detect libexecdir 1097 { "capabilities": { "explicit-superuser": true }, "command": "init", "os-release": { "ANSI_COLOR": "1;34", "BUG_REPORT_URL": "https://github.com/NixOS/nixpkgs/issues", "BUILD_ID": "24.11.20241210.a0f3e10", "CPE_NAME": "cpe:/o:nixos:nixos:24.11", "DEFAULT_HOSTNAME": "nixos", "DOCUMENTATION_URL": "https://nixos.org/learn.html", "HOME_URL": "https://nixos.org/", "ID": "nixos", "ID_LIKE": "", "IMAGE_ID": "", "IMAGE_VERSION": "", "LOGO": "nix-snowflake", "NAME": "NixOS", "PRETTY_NAME": "NixOS 24.11 (Vicuna)", "SUPPORT_END": "2025-06-30", "SUPPORT_URL": "https://nixos.org/community.html", "VARIANT": "", "VARIANT_ID": "", "VENDOR_NAME": "NixOS", "VENDOR_URL": "https://nixos.org/", "VERSION": "24.11 (Vicuna)", "VERSION_CODENAME": "vicuna", "VERSION_ID": "24.11" }, "version": 1, "packages": { "shell": null, "sosreport": null, "static": null, "metrics": null, "base1": null, "playground": null, "selinux": null, "system": null, "users": null } } ``` note: `cockpit.packages-WARNING: Could not detect libexecdir` 3. press enter: ``` Traceback (most recent call last): File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 130, in consume_one_frame length = int(data[:newline]) ^^^^^^^^^^^^^^^^^^^ ValueError: invalid literal for int() with base 10: b'' The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/bin/.cockpit-bridge-wrapped", line 8, in <module> sys.exit(main()) ^^^^^^ File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/bridge.py", line 315, in main run_async(run(args), debug=args.debug) File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/_vendor/systemd_ctypes/event.py", line 135, in run_async asyncio.run(main, debug=debug) File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/runners.py", line 194, in run return runner.run(main) ^^^^^^^^^^^^^^^^ File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/runners.py", line 118, in run return self._loop.run_until_complete(task) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/nix/store/zv1kaq7f1q20x62kbjv6pfjygw5jmwl6-python3-3.12.7/lib/python3.12/asyncio/base_events.py", line 687, in run_until_complete return future.result() ^^^^^^^^^^^^^^^ File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/bridge.py", line 166, in run await router.communicate() File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/router.py", line 258, in communicate await self._communication_done File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 192, in data_received result = self.consume_one_frame(self.buffer) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/nix/store/m2l2bnrdfj14aymr82ia1jn45rxvyg6f-cockpit-328/lib/python3.12/site-packages/cockpit/protocol.py", line 132, in consume_one_frame raise CockpitProtocolError("frame size is not an integer") from exc cockpit.protocol.CockpitProtocolError: frame size is not an integer ``` ## Expected behavior GUI: a password prompt at the top of the screen and Storage, Applications, etc in the side menu. Terminal: when running cockpit `cockpit-bridge: no option specified` ## Screenshots ![image](https://github.com/user-attachments/assets/eeb815b4-16ca-4dcd-be47-ae5f07be9cb5) ## Additional context extra context here: https://discourse.nixos.org/t/cockpit-dashboard-management-interface-package-does-not-work-properly/57403/8 ## Metadata ``` - system: `"x86_64-linux"` - host os: `Linux 6.6.63, NixOS, 24.11 (Vicuna), 24.11.20241210.a0f3e10` - multi-user?: `yes` - sandbox: `yes` - version: `nix-env (Nix) 2.24.10` - channels(root): `"home-manager-24.11.tar.gz, nixos-24.11"` - nixpkgs: `/nix/store/va0p2i72cm2ljwm084a0g6ji41s5qnyz-source` ``` @lucas-deangelis --- Note for maintainers: Please tag this issue in your PR. --- [reaction]: https://github.blog/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/ [issues you find important]: https://github.com/NixOS/nixpkgs/issues?q=is%3Aissue+is%3Aopen+sort%3Areactions-%2B1-desc

not that im egging you on or anything just a bit confusing considering by that point I already made the report.

TLATER · December 27, 2024, 3:47pm

Looks like a great bug report to me, I think they hadn’t seen it yet, and were afraid that they might have to ask you for the steps since you neglected to tell me what they were several times in a row. It’s why I was suspecting an XY problem

Guess explicitly asking for “reproduction steps” in the bug report template makes it clearer what is meant, though, sorry about the miscommunication!

lucasew · December 29, 2024, 4:12pm

The fix: cockpit: 330 -> 331 by lucasew · Pull Request #368886 · NixOS/nixpkgs · GitHub