I have a remote server with a dead disk. Unfortunately, I will not have physical access for some time. The broken disk cause boot time to be very long with errors like this,
The disk is connected to an SAS2308 controller and the disk symlink (/dev/sge) changes every time. In the screenshot it is /dev/sde, but next boot it shows up as /dev/sda. I want to disable this disk in the system similar to this question.
There is no back plane in the system.
Dead disk
# lsscsi -v
[0:0:3:0] disk ATA WDC WD10EFRX-68J 1A01 /dev/sda
dir: /sys/bus/scsi/devices/0:0:3:0 [/sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0/0000:02:09.0/0000:04:00.0/host0/port-0:3/end_device-0:3/target0:0:3/0:0:3:0]
HBA details
# lspci -nn -v -s 04:00.0
04:00.0 Serial Attached SCSI controller [0107]: Broadcom / LSI SAS2308 PCI-Express Fusion-MPT SAS-2 [1000:0087] (rev 05)
Subsystem: Hewlett Packard Enterprise H220i [1590:0041]
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at c000 [size=256]
Memory at dd540000 (64-bit, non-prefetchable) [size=64K]
Memory at dd500000 (64-bit, non-prefetchable) [size=256K]
Expansion ROM at dd400000 [disabled] [size=1M]
Capabilities: [50] Power Management version 3
Capabilities: [68] Express Endpoint, IntMsgNum 0
Capabilities: [d0] Vital Product Data
Capabilities: [a8] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [c0] MSI-X: Enable+ Count=16 Masked-
Capabilities: [100] Advanced Error Reporting
Capabilities: [1e0] Secondary PCI Express
Capabilities: [1c0] Power Budgeting <?>
Capabilities: [190] Dynamic Power Allocation <?>
Capabilities: [148] Alternative Routing-ID Interpretation (ARI)
Kernel driver in use: mpt3sas
Kernel modules: mpt3sas
No, i have not referenced the disk in my config and is not part of any zfs pool. The system on bootup tried to probe all the present drives and ultimately times out with failure message.
I am looking the kernel parameter to disable this disk at boot similar to libata like this but for scsi.
Hi,
I understand your question, but is there an kind of IPMI available to enter the boot software of the HBA and disable this harddisk. In this way the kernel won´t notice it.
I have educated myself on the subject… there is a software called multipath
You can blacklist specific devices, below is a link to the opensuse documentation on how to blacklist it there
And there is an option in NixOS to configure this
So as I’m still green on this topic, I’m not able to solve your problem
Just a far shot. Is it possible to set the scan id to No of the hard disk during the boot. If this is possible, the host system will not see the device. I saw this in Wayback Machine.
I dont get such details as mentioned on Page 132 of the pdf (7-12 of the document). Perhaps the HBA mode means that such details are not exposed in the BIOS.