I'm not sure I'd blame ZFS for this. I don't see anything in the logs you've provided to indicate that ZFS is involved in the path that leads to the mmu fault. Do you run this system with ECC? If not, can you run a memtest? Have you tried booting a live NixOS image?
mmu fault? I must be missing something, because I don’t see anything about the mmu in those screenshots. What I see is Attempted to kill init!, which happens when PID 1 tries to exit. It’s very odd considering NixOS’s stage-1-init.sh should only ever exit if you have boot.panic_on_fail in your kernel params (and even then you should see the error message of the command that failed).
You might try boot.initrd.systemd.enable = true;, since that switches to an initrd that IMO is much more robust to failures and makes it easier to diagnose them. You can set boot.initrd.systemd.emergencyAccess to a hashed password so that, when the initrd fails, you can enter that password to get an emergency shell; or you can add rd.systemd.debug_shell to the kernel params to get a debug shell on tty9. Then you can start looking at what failed with commands like systemctl status --failed and journalctl.
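A rough sketch of what that looks like in configuration.nix (the $6$… hash is only a placeholder; generate a real one with e.g. mkpasswd -m sha-512):

# switch to the systemd-based stage 1
boot.initrd.systemd.enable = true;
# either allow passwordless emergency access ...
# boot.initrd.systemd.emergencyAccess = true;
# ... or protect the emergency shell with a hashed password
boot.initrd.systemd.emergencyAccess = "$6$<placeholder-hash>";
# and/or get a debug shell on tty9 while the initrd is running
boot.kernelParams = [ "rd.systemd.debug_shell" ];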
I won’t pretend to understand memory management, but I think they are looking at the [ 328.565168] ? handle_mm_fault+0x1bd/0x2c0 line. Not sure if this actually implicates an MMU/hardware issue vs. some other software issue with paging or some such.
Skimming the first hit on Google for this (Page Tables — The Linux Kernel documentation) suggests multiple routes call this code path? Some snippets that other people who also don’t understand memory management might nevertheless find interesting and/or worth learning more about at some point:
…There are several reasons why the MMU can’t find certain translations…
…When these conditions happen, the MMU triggers page faults…
…Additionally, page faults may be also caused by code bugs or by maliciously crafted addresses that the CPU is instructed to access…
… Whatever the routes, all architectures end up to the invocation of handle_mm_fault() which, in turn…
mmu fault? I must be missing something, because I don’t see anything about the mmu in those screenshots. What I see is Attempted to kill init!, which happens when PID 1 tries to exit. It’s very odd considering NixOS’s stage-1-init.sh should only ever exit if you have boot.panic_on_fail in your kernel params (and even then you should see the error message of the command that failed).
Yeah, what people pointed out above me is what I meant. (I was AFK, had a fun weekend, not sarcasm.) My first thought was that our init went to do some syscall (can't tell which one), then in the kernel it faulted, which:
Taints the kernel
Kills init
As such it looks as if init just exited, but it's because the userspace thread was killed in the kernel by an MMU fault.
Not 100% positive on this one, but that's how I'm reading it.
ECC should have caught any HW faults then. Hm, it could theoretically be that ZFS is screwing up something in kernel space which then causes an MMU fault, but with KASLR I find it highly unlikely that it would always manifest in the same way. Is the error you get always the same? Can you reliably reproduce it? Have you by any chance disabled KASLR?
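(If you're not sure: something like grep -o nokaslr /proc/cmdline on the running system should show whether KASLR was explicitly turned off via the kernel command line; no output means it wasn't.)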
When chrooted, I was getting "permission denied" for every command I tried, including shell builtins.
After rebooting I saw this:
[root@nixos:~]# zfs list -o name,exec
NAME            EXEC
root_pool       on
root_pool/home  on
root_pool/nix   off
root_pool/root  on
root_pool/var   on
Somehow my root pool's nix dataset was set to exec=off.
I don't know how that happened.
I don't know how my old derivation was working.
Unfortunately, the garbage collector deleted that revision.
Would there be a way to check whether the nix store is executable and give a more intelligent error?
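For anyone else who ends up here: assuming the same dataset names as in the listing above, the property can be flipped back and verified with

zfs set exec=on root_pool/nix
zfs get exec root_pool/nix

(followed by a remount/reboot if the change doesn't take effect immediately).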
Another data point: I apparently also had exec=off and also ended up with a kernel panic. Other than that, we usually have different setups nowadays. Thankfully I know Richie and was able to call them quickly to figure it out since they don't have , but I don't remember setting exec=off. Something interesting: I tried to run zpool history to find out what the original setting was, and it doesn't appear to have the create statement for the dataset, but it does for the pool. Not sure if zed was having an issue or if this is expected after some time.
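(For reference, the invocation was roughly zpool history root_pool, which is supposed to list every administrative command run against the pool, including dataset creation.)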