Warning: Not all InfiniBand HCAs have a PSID

If you read the documents posted by Mellanox about their new 2..0.5 build 4453 InfiniBand drivers you may have noticed the advice to update your firmware. If so, you will need to discover your PSID. This should be pretty straight forward, just install the drivers and then using the HPC Management console Run a Command feature run vstat on the node you wish to update. If you are lucky, you’ll see something like this:

NODE-08 -> Finished

-------------------------------------------------------------------------------------------------

        hca_idx=0

        uplink={BUS=PCI_E, SPEED=2.5 Gbps, WIDTH=x8, CAPS=2.5*x8}

        vendor_id=0x08f1

        vendor_part_id=0x6278

        hw_ver=0xa0

        fw_ver=4.08.0200

        PSID=VLT0040010001

        node_guid=0008:f104:0399:2054

        num_phys_ports=2

               port=1

               port_state=PORT_ACTIVE (4)

               link_speed=5.0 Gbps (2)

               link_width=4x (2)

               rate=20 Gbps

               port_phys_state=LINK_UP (5)

               active_speed=5.0 Gbps (2)

               sm_lid=0x0001

               port_lid=0x0009

               port_lmc=0x0

               max_mtu=2048 (4)

               port=2

               port_state=PORT_DOWN (1)

               link_speed=NA

               link_width=NA

               rate=NA

               port_phys_state=POLLING (2)

               active_speed=2.5 Gbps (1)

               sm_lid=0x0000

               port_lid=0x0000

               port_lmc=0x0

               max_mtu=2048 (4)

 

If, like me, you are unlucky, you will not have a PSID line in the output. Like this:

NODE-07 -> Finished

--------------------------------------------------------------------------------------

       hca_idx=0

       uplink={BUS=PCI_E, SPEED=2.5 Gbps, WIDTH=x8, CAPS=2.5*x8}

       vendor_id=0x066a

       vendor_part_id=0x6274

       hw_ver=0xa0

       fw_ver=0x100020000

       node_guid=0006:6a00:9800:f356

       num_phys_ports=1

             port=1

             port_state=PORT_ACTIVE (4)

             link_speed=5.0 Gbps (2)

             link_width=4x (2)

             rate=20 Gbps

             port_phys_state=LINK_UP (5)

             active_speed=5.0 Gbps (2)

             sm_lid=0x0001

             port_lid=0x000a

             port_lmc=0x0

             max_mtu=2048 (4)

If this happens to you, don’t waste time trying to find the PSID of your HCA. As soon as I find a way to pick the right firmware upgrade for my own HCAs without PSIDs, I’ll post how to do it. Until then, you and I must run on whatever firmware we already have.

 

Sorry,

 

  Frankie