In NVIDIA UFM (Unified Fabric Manager), OpenSM — the InfiniBand Subnet Manager — is managed internally by UFM rather than as a standalone systemd service. This allows UFM to control the IB fabric, collect telemetry, and maintain HA configurations without requiring manual OpenSM service management.
OpenSM is launched directly from the UFM installation path:
The configuration file used by UFM is located at:
UFM monitors OpenSM, ensuring it runs on the designated primary node and manages failover in HA configurations.
Telemetry and diagnostic processes (like ibdiagnet
) are also invoked by UFM to monitor port health, link utilization, and performance counters.
Note: On UFM-managed systems, OpenSM does not have a systemd service. Attempting
systemctl status opensm
will return "Unit opensm.service could not be found."
All OpenSM-related logs and dumps are stored under the UFM log directory:
Relevant files include:
opensm.log
— main OpenSM runtime log
opensm-statistics.dump
— fabric statistics dump
opensm-smdb.dump
— Subnet Manager database snapshot
opensm-activity.dump
— recorded activity
opensm-virtualization.dump
— virtualization-specific info
Other diagnostic logs and telemetry outputs may also contain OpenSM-related data.
From your local computer, run:
Replace:
<UFM_NODE_IP>
with the UFM node’s IP (e.g., T10PHGXUFMST02
)
/local/path/to/save/
with the local directory where you want to store the logs
Optional: For faster transfer, compress the folder first:
OpenSM in UFM is managed by UFM; no standalone systemd service exists.
Logs and dumps are centralized under /opt/ufm/files/log/
.
Copying logs to your home directory and using scp
allows easy transfer to your workstation for analysis or archival.