Troubleshooting

Monitor kernel logs

During the first day or so of setting this machine up, it is recommended to actively monitor the kernel logs, checking for errors.

[bash]: Run this on your workstation:
## Just leave this running in a separate terminal as you keep working..
sudo dmesg -w

Disable power saving on NVME storage

How to disable power saving on the NVME drive

Out of two identical kits, I encountered one error in the kernel log on one machine but not on the other, so you may or may not run into this error:

(stdout)
[  359.477209] nvme nvme0: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0x11
[  359.477218] nvme nvme0: Does your device have a faulty power saving mode enabled?
[  359.477220] nvme nvme0: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off" and report a bug
[  359.545210] nvme 0000:01:00.0: enabling device (0000 -> 0002)
[  359.549032] nvme nvme0: Shutdown timeout set to 10 seconds
[  359.722783] nvme nvme0: 4/0/0 default/read/poll queues

This error seems to indicate there is a faulty power saving feature in hardware or in the NVME firmware or kernel code. It may be fixed by following the advice to turn off the power saving feature of the NVME:

Run this on the Raspberry Pi
sudo nano /boot/firmware/cmdline.txt

This file should contain a single long line of text. You should find the very end of the line, and add the following to the end of it:

nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

The whole line should now look like:

console=serial0,115200 console=tty1 root=PARTUUID=xxxxxxxx-02 rootfstype=ext4 fsck.repair=yes rootwait cfg80211.ieee80211_regdom=US   nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

Press Ctrl+S to save the file. Press Ctrl+X to quit nano.

Reboot the pi:

Run this on the Raspberry Pi
sudo reboot

Check to see if the error goes away, but if not, you probably have a hardware issue.