General
Run a GPU stress test for RTX 6000 Ada and L40S on Ubuntu Server
You can stress‑test RTX 6000 Ada and L40S GPUs on Ubuntu Server using GPU Burn, a CUDA‑based stress tool commonly used for datacenter validation. The process is the same for both GPUs because they use NVIDIA’s CUDA stack. Core Steps to Run GPU Burn ...
How to use official ThinkParQ script to collect detailed BeeGFS Logs
1. Purpose This document describes how to collect a full BeeGFS diagnostic bundle using the official ThinkParQ script. Applicable for environments running: BeeGFS This procedure is typically requested by: BeeGFS / ThinkParQ Support NetApp (when ...
How to collect diagnostic logs using the NetApp Log Collection Script
1. Purpose This document describes the procedure to collect diagnostic logs using the NetApp Log Collection Script in environments running: BeeGFS NetApp E-Series backend storage HA cluster using Pacemaker and Corosync This script is typically ...
Enabling Desktop Notifications for Emails Moved to Subfolders in Outlook Desktop
1. Purpose To configure Microsoft Outlook (Desktop Client) to trigger desktop notifications for emails that are automatically moved to a subfolder (e.g., Inbox > alerts) using a mail rule. 2. Problem Statement By default, Microsoft Outlook Desktop ...
Configure Date & Time on ASUS HGX Servers via ASMB11-iKVM (BMC)
Purpose This article explains how to configure and synchronize the Date & Time on ASUS HGX servers using the ASMB11‑iKVM (BMC) interface. This ensures that all HGX servers synchronize their time with the NTP servers configured on Head Node 1 and Head ...
NVMe Devices Not Detected During Early Boot with Existing BIOS
Overview This KB document addresses an issue where some NVMe devices are not detected during system startup, causing the operating system to fail to recognize all installed NVMe drives. The issue is related to limitations in PCIe device enumeration ...
BeeGFS Metadata Check Before Large File Ingest
1. Purpose To validate metadata and inode capacity before ingesting a very large number of small files into a BeeGFS filesystem. Step 1 - Check Metadata Inodes Run beegfs-df on any BeeGFS client This checks if the nodes have enough Inode capacity for ...
How to install Ansible
1. Purpose This article provides a focused, crystal-clear procedure strictly for installing Ansible on the control node and validating the installation. It excludes inventories, playbooks, and advanced configuration, making it suitable for baseline ...
How to Collect Logs from NVIDIA Cumulus Linux Switch
Purpose This article describes how to collect diagnostic logs from a switch running NVIDIA Cumulus Linux. These logs are typically required by NVIDIA Networking Support for troubleshooting switch-level issues such as port flaps, routing problems, ...
How to Collect Logs from NVIDIA UFM (UFM System Dump)
Purpose This article explains how to collect diagnostic logs from NVIDIA Unified Fabric Manager (UFM) using the web-based GUI. The UFM system dump is typically required by NVIDIA Support for troubleshooting fabric health, host visibility, alerts, and ...
Collect Logs from NVIDIA QM9700 InfiniBand Switch (Sysdump) - Web GUI
Purpose This article describes the procedure to collect diagnostic logs (sysdump) from an NVIDIA QM9700 InfiniBand switch. The sysdump file is typically requested by NVIDIA Networking Support for troubleshooting fabric, port, firmware, or stability ...
How to Collect NVIDIA Bug Report
Purpose This article provides step-by-step instructions to collect an NVIDIA bug report from servers equipped with NVIDIA GPUs. The NVIDIA bug report is commonly required by NVIDIA Support for troubleshooting GPU driver, CUDA, NVLink, PCIe, and ...
SOS Report collection from NetApp OSS Servers
Purpose This article details the process of generating and collecting SOS Reports from NetApp OSS Servers. These reports are often required by the NetApp Support Team for detailed analysis and troubleshooting. Scope Applicable to: NetApp OSS Servers ...
Installing Cumulus VX on Proxmox VE
This document describes how to deploy NVIDIA Cumulus VX (Cumulus Linux 5.x) on Proxmox VE using the QCOW2 disk image provided by NVIDIA. Cumulus VX allows you to simulate a Cumulus Linux switch using KVM. 1. Requirements Item Details Hypervisor ...
How to update Mellanox ConnectX-7 NICs Firmware on OSS Servers
1. Purpose This article describes the procedure to upgrade the Mellanox ConnectX-7 network adapter firmware on the affected OSS servers to version 28.45.1200 in order to ensure compatibility, stability, and optimal performance. 2. Scope This ...
How to send AutoSupport Dispatch on a NetApp Device via SANtricity System Manager
Purpose This purpose of this article is to provide detailed instructions on how to manually trigger and send an AutoSupport dispatch from a NetApp E-Series or EF-Series storage System using SANtricity System Manager. AutoSupport is a NetApp feature ...
How to trigger a Support Bundle on NetApp Appliance
Purpose This article provides the steps to collect and trigger a full support bundle on a NetApp Storage Appliance. Support Bundles are used to collect diagnostics data for troubleshooting performance, connectivity, or I/O issues. Steps Login to the ...
Enabling Microburst Monitoring on Cisco Nexus Switches
Summary This article explains how to enable and verify microburst detection on Cisco Nexus 9000 series switches. Microbursts are short spikes of traffic that can momentarily exceed interface buffer capacity, leading to output discards even when ...