How to update Mellanox ConnectX-7 NICs Firmware

How to update Mellanox ConnectX-7 NICs Firmware

1. Purpose

This article describes the procedure to upgrade the Mellanox ConnectX-7 network adapter firmware on the affected OSS servers to version 28.45.1200 in order to ensure compatibility, stability, and optimal performance.

2. Scope

This procedure applies to all OSS servers equipped with Mellanox ConnectX-7 network interface cards that currently run an earlier firmware version.

3. Prerequisites

  1. Firmware package: Obtain the firmware image file (e.g., fw-ConnectX7-rel-28_45_1200-MCX755106AS-xxx.bin) from NVIDIA/Mellanox official site or internal repository.

  2. Backup current firmware and configuration.

  3. Maintenance window approved.

  4. Ensure server console or iDRAC/iLO access is available.

  5. Network impact: Firmware update requires NIC reset; plan downtime.

  6. Root/sudo privileges.

4. Procedure

Step 1: Verify Current Firmware and Device

  1. To identify the Mellanox device name run: 
    1. mst status
    2. (e.g., mst_device:/dev/mst/mt4125_pciconf0).
  2. Check current firmware version:
    1. mlxconfig -d /dev/mst/mt4125_pciconf0 query | grep -i "fw_version"

Step 2: Back Up Existing Firmware
  1. mstflint -d /dev/mst/mt4125_pciconf0 query > /root/mellanox_fw_backup.txt

Step 3: Load the Mellanox Firmware Update Tool

  1. Make sure the firmware file is accessible, e.g.:
    1. ls /root/fw-ConnectX7-rel-28_45_1200-MCX755106AS-xxx.bin

Step 4: Flash the New Firmware
  1. mstflint -d /dev/mst/mt4125_pciconf0 -i /root/fw-ConnectX7-rel-28_45_1200-MCX755106AS-xxx.bin burn

  1. Expected output should indicate "Firmware was successfully flashed".

Step 5: Reboot the Server
  1. reboot

Step 6: Verify Firmware Version After Reboot

  1. After the system comes back online:
  2. mst status
  3. mlxconfig -d /dev/mst/mt4125_pciconf0 query | grep -i "fw version"
  4. Confirm the version matches 28.45.1200.

Step 7: Post-Upgrade Validation

  1. Confirm NIC interfaces are up:

    1. ip link show

  2. Validate network connectivity:

    1. ping <gateway or peer server>

  3. Check dmesg/syslog for NIC-related errors:

    1. dmesg | grep -i mlx

5. Rollback Plan

If issues arise:

  1. Restore the backup firmware (if available):

    1. mstflint -d /dev/mst/mt4125_pciconf0 -i <old_fw_image.bin> burn

  2. Reboot the server.

  3. Verify previous firmware version is restored.