Fix: UFM Service Not Starting on UFM Appliance UFM-MUA9502H

Fix: UFM Service Not Starting on UFM Appliance UFM-MUA9502H

NotesNote: This appliance UFM-MUA9502H is EOL as of 2022.

Pre-requisites

Ensure the following conditions are met before attempting to start the UFM service:

  1. Ensure the license is attached. 

  2. Verify bond0 is configured with eth0 and eth1.

    1. Command: show interface bond0
  3. Set the management interface on eth0 or eth1.

    1. Commands:

      config terminal
      ufm mgmt-interface eth0
  4. Ensure eth0/eth1 is up.

    1. Command:

      show interfaces

Starting UFM Service

Once the prerequisites are met, attempt to start the UFM service:

  1. Command:

    ufm start

Troubleshooting: SM Already Running

If the service does not start and you see a message indicating that the Subnet Manager (SM) is already running:

  1. Stop SM on the connected InfiniBand switch.

  2. Run the following command to safely stop the SM running on UFM:

    ufm safe-stop
  3. Retry starting UFM:

    ufm start
  4. Check the status:

    show ufm status

Issue: UFM Shows as Stopped but Other Services are Running

If you see the message "ufm: stopped but other services are running", follow the steps below to update the UFM appliance firmware.

Fix: Update UFM Appliance Firmware to 4.11.1.1

Firmware Version: image-ufm_appliance-x86_64-UFMAPL_4.11.1.1_UFM_6.12.1.1-20230209-111043.img


Downloading the New Firmware

  1. Go to the NVIDIA Networking Portal (NPN).

  2. Click the "Get Help" tab > Click "Enterprise Support" button.

  3. You will be redirected to the Enterprise Support Portal.

  4. Click the "Downloads" tab.

  5. Select "Software" > "UFM Appliance".

  6. You will be redirected to the Licensing Portal.

  7. Click the "Software Downloads" tab.

  8. Search for "UFM Appliance".

  9. Select the version mentioned above and download the firmware.

Updating the UFM Appliance


  1. Log in to the Management Portal of UFM.

  2. Upload the downloaded .img file.

  3. Initiate the update process and reboot the appliance.


Once the update is complete, the UFM service issue should be resolved. Verify that the service is running by executing:

show ufm status


Conclusion: Following the above steps should resolve the issue of the UFM service not starting on the UFM Appliance UFM-MUA9502H. If the problem persists, consider reaching out to enterprise support for further assistance.

    • Related Articles

    • Managing OpenSM within NVIDIA UFM and getting OpenSM Logs from UFM

      Overview In NVIDIA UFM (Unified Fabric Manager), OpenSM — the InfiniBand Subnet Manager — is managed internally by UFM rather than as a standalone systemd service. This allows UFM to control the IB fabric, collect telemetry, and maintain HA ...
    • How to Change Default IB Interface in NVIDIA UFM (e.g., from ib0 to ib1)

      By default, NVIDIA UFM binds to the ib0 InfiniBand interface during startup. If ib0 is down or unavailable, UFM will fail to start. In such cases, you may need to change the default interface to ib1 (or any other available IB interface). This article ...
    • Digital Media Retention Service (HDD/SSD)

      MBUZZ offers a Digital Media Retention Service (DMR) to ensure customers can retain their original HDD/SSD during a hardware replacement process. This service provides enhanced data security and compliance, especially for government entities, data ...
    • How to fix the camera issue in Laptop / Camera not working

      Root cause: Solution 1 Follow the below steps: Go to Settings -> Privacy -> Camera -> Enable - Allow apps to enable camera -> and enable access to required listed apps. check again if the camera is working or not else continue to the below steps. Go ...
    • Can we recover data from damaged Hard drive

      Yes, it is often possible to recover data from a damaged hard drive, though the success depends on the extent and type of damage. Here are the main scenarios and approaches for recovery: Logical Damage This occurs when the file system is corrupted ...