Long story short, I was having a great lunch with parents when I came back to my pc and saw all my opened applications randomly crashing. None of my programs opened and I shut down the pc, at that moment I already knew something has happened to my Western Digital nvme ssd. Thankfully I had some partial backup but my few weeks old infrastructure configuration has been lost due to this catastrophic event.
I am quite the heavy user and I abused it for a long time but I have never really reached the TBW (58TB/300TB). My nixos system was LUKS encrypted, and after a quick restart my boot partition seems to work but not my /root which is encrypted.
I have ran a few tests and experiments but I still have no Idea if the ssd is really dead and no data can be recovered anymore.
Smartctl’s output:
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
- available spare has fallen below threshold
- media has been placed in read only mode
SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x09
Temperature: 51 Celsius
Available Spare: 0%
Available Spare Threshold: 10%
Percentage Used: 13%
Data Units Read: 70.837.653 [36,2 TB]
Data Units Written: 43.515.977 [22,2 TB]
Host Read Commands: 1.026.740.170
Host Write Commands: 651.610.322
Controller Busy Time: 8.543
Power Cycles: 2.236
Power On Hours: 3.713
Unsafe Shutdowns: 221
Media and Data Integrity Errors: 0
Error Information Log Entries: 1
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
I tried rebuilding the ext4 filesystem, and I always got a bad superblock
error.
sudo fsck.ext4 -v -f -b 32768 -B 4096 /dev/mapper/crypto
What’s the best I could do to recover any data? I don’t really want to spend $500 for a recovery service.