Fix: DGX Spark Kernel Panic - OS Reinstall via System Recovery
The Issue: Kernel Panic: VFS Unable to Mount Root FS on Unknown-Block(0,0)
This error is one of the more alarming things you can
encounter on a Linux-based system. When the DGX Spark throws a kernel panic
with the message VFS: Unable to mount root fs on unknown-block(0,0), it means
the kernel has booted successfully but cannot find or mount the root filesystem
from the internal SSD. The system essentially has no idea where to look for its
operating system.
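As a rough illustration of how to read the panic line, the sketch below pulls the (major,minor) pair out of the message. The function name describe_root_panic is hypothetical, and the interpretation in the returned strings is a general Linux rule of thumb, not DGX Spark-specific guidance.

```python
import re

# Matches the device reported in the panic line, e.g. "unknown-block(0,0)".
PANIC_RE = re.compile(r"unknown-block\((\d+),(\d+)\)", re.IGNORECASE)


def describe_root_panic(line: str) -> str:
    """Summarize what the unknown-block(major,minor) pair suggests (hypothetical helper)."""
    match = PANIC_RE.search(line)
    if match is None:
        return "no unknown-block(major,minor) pair found in this line"
    major, minor = int(match.group(1)), int(match.group(2))
    if (major, minor) == (0, 0):
        # Assumption: 0,0 generally means the kernel never resolved a root device.
        return "no root device resolved (0,0)"
    # Assumption: a non-zero pair generally means a device number was resolved
    # but the filesystem on it could not be mounted.
    return f"root device {major}:{minor} resolved but not mountable"


if __name__ == "__main__":
    msg = "Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)"
    print(describe_root_panic(msg))
```

In either case the recovery procedure in this article is the same; the distinction is only useful when describing the failure to support.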
Common causes on the DGX Spark include:
Filesystem or partition table corruption on the internal SSD (often from an unclean shutdown or power loss during a write operation)
A botched OS or firmware update that left the bootloader or root partition in an inconsistent state
Fatal misconfiguration (e.g., incorrect GRUB or initramfs entries pointing to the wrong block device)
SSD sectors that have gone bad, making critical filesystem metadata unreadable
Once you're in this state, the system is effectively
unbootable. You cannot recover from within the OS itself — a full reinstall via
NVIDIA's recovery media is the path forward.
Temporary Fix: Full OS Reinstall via Recovery USB
Note: This applies to the DGX Spark Founders Edition only. If you have an OEM variant, contact your manufacturer for recovery procedures.
What You'll Need
A USB flash drive, 16GB or larger
A wired USB keyboard (Bluetooth keyboards may not work in UEFI)
A display connected to the DGX Spark
A secondary computer to download and prepare the recovery media
Step 1: Download the Recovery Media
On another machine, download the recovery archive from
NVIDIA:
Extract the .tar.gz archive once downloaded.
Step 2: Create the Bootable Recovery USB
Insert your USB drive into the secondary machine and run the appropriate script from the extracted folder. This will wipe the USB drive, so back up anything on it first.
On Windows, run CreateUSBKey.cmd as Administrator and select the USB drive.
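Because the script wipes whichever drive you select, it is worth confirming which device is the USB stick first. If the secondary machine runs Linux, a sketch like the one below lists removable block devices and their sizes by reading sysfs. The function name removable_disks is hypothetical, and it relies on the kernel's "removable" flag, which some USB-attached SSDs do not set, so treat the output as a hint and not a guarantee.

```python
from pathlib import Path


def removable_disks(sys_block: Path = Path("/sys/block")) -> list[tuple[str, float]]:
    """Return (device name, size in GB) for removable block devices on Linux."""
    found = []
    for dev in sorted(sys_block.iterdir()):
        flag = dev / "removable"
        size = dev / "size"
        if not (flag.is_file() and size.is_file()):
            continue
        if flag.read_text().strip() != "1":
            continue
        sectors = int(size.read_text().strip())
        # sysfs reports size in 512-byte units regardless of the device's sector size.
        found.append((dev.name, sectors * 512 / 1e9))
    return found


if __name__ == "__main__":
    for name, gigabytes in removable_disks():
        print(f"/dev/{name}: {gigabytes:.1f} GB")
```

Compare the reported size against the 16GB requirement, and against the drive you actually plugged in, before letting the script write to it.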
Step 3: Prepare the DGX Spark for Recovery
Disconnect any external storage from the DGX Spark
Plug the recovery USB into a USB port on the DGX Spark
Power on the device and immediately hold Esc or Del to enter UEFI settings
Step 4: Restore UEFI Defaults
Navigate to the Save & Exit page (Right Arrow key)
Select Restore Defaults → confirm with Yes
Select Save Changes and Reset
As it reboots, hold Esc/Del again to re-enter UEFI
Step 5: Enable Secure Boot
Navigate to the Security tab
Confirm Secure Boot is set to Enabled
Select Restore Factory Keys
Go to Save & Exit → Save Changes and Reset
Hold Esc/Del again on reboot to enter UEFI a third time
Step 6: Boot from the Recovery USB
Go to Save & Exit → scroll down to Boot Override
Select your USB drive and press Enter
The system will reboot into the recovery environment
Step 7: Run the Recovery
Walk through the on-screen prompts:
Welcome screen: Press Enter to continue
Warning screen: Select [START RECOVERY] (this will completely wipe the internal SSD)
Progress screen: Monitor the reflash; this takes several minutes
Completion screen: Review the output, then follow the prompt to continue
Final screen: Confirm factory reset and press Enter to restart
The DGX Spark will reboot into a clean factory state.
Key Things to Know
The kernel panic in this case was a symptom of filesystem-level corruption. The reinstall is the right call once you've ruled out a hardware fault with the SSD.
UEFI defaults and Secure Boot must be properly configured before booting the USB, or the recovery environment may not load correctly. Don't skip Steps 4 and 5.
Use a wired USB keyboard throughout; wireless/Bluetooth keyboards frequently fail to register input during UEFI and early boot.
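One possible way to check the SSD for a hardware fault once the system boots again is to read its overall SMART verdict. The sketch below assumes smartmontools 7.0 or later is installed (it is not confirmed to ship with the recovery image), that the command is run with root privileges, and that the internal drive appears as /dev/nvme0; all three are assumptions to verify on your system. The helper names smart_passed and check_device are hypothetical.

```python
import json
import subprocess
from typing import Optional


def smart_passed(report_json: str) -> Optional[bool]:
    """Return the overall SMART verdict from `smartctl --json` output, or None if absent."""
    if not report_json.strip():
        return None
    report = json.loads(report_json)
    status = report.get("smart_status")
    if not isinstance(status, dict) or "passed" not in status:
        return None
    return bool(status["passed"])


def check_device(device: str) -> Optional[bool]:
    # -H requests the overall health assessment; --json needs smartmontools 7.0+.
    # Raises FileNotFoundError if smartctl is not installed.
    result = subprocess.run(
        ["smartctl", "-H", "--json", device],
        capture_output=True, text=True, check=False,
    )
    return smart_passed(result.stdout)


if __name__ == "__main__":
    # Assumed device path; adjust to match your system.
    print(check_device("/dev/nvme0"))
```

A result of False, or repeated corruption after a clean reinstall, points toward a failing drive and not a one-off filesystem problem.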