HP iLO stops responding to ping from other subnets in a couple of minutes

SYMPTOMS:

iLO network interface (be it shared or dedicated) is set up with static IP address and a gateway. One can access and ping it from the same LAN subnet with no issues; however when pinged from a different subnet through the router pings stop after random amount of time, but generally within a few minutes. Resetting the iLO resolves the issue for another couple of minutes. This applies to iLO 3, iLO 4, iLO 5 and probably iLO 6 as well.

This problem has been described on the Internet number of times [1] [2] [3] with no clear  resolution provided.

CAUSE: 

 iLO management interface honors ICMP Type 9 - Mobile IP Router Advertisements (RA) for IPv4 even when static gateway IP address has been set in the configuration. When such RA is received by iLO, the gateway being used is replaced with one specified in RA; this, however, is not being reflected in the Web interface. 

iLO then proceeds with sending ARP request for this new gateway address and forwards all non-local traffic through it.

At this moment communication with any IP addresses not on the same LAN subnet ceases until the iLO is reset.

In the example Wireshark capture below, the Cisco SG350 switch with address 192.168.3.4 sends rogue Mobile IP RA that is being picked up by iLO3 with address 192.168.3.6. The switch itself has a defunct default gateway 192.168.3.252 configured that it tries to unsuccessfully resolve.

As this capture has been performed on the router, no further communication is observed from the iLO, as it communicates with the switch directly.

 

RESOLUTION: 

 Find the device sending ICMP Type 9 RAs and either disable this functionality, or make sure that the information provided in RAs is correct and the advertising router does have ability to route the traffic appropriately - e.g. there's a correct default gateway set on the advertising router itself.

If your switch supports multicast filtering, it might be useful to filter all-hosts multicast group 224.0.0.1 or 01:00:5e:00:00:01 on interfaces facing iLOs.

Comments

Popular posts from this blog

HP DL380 G7 won't power up after a power cord removal/power loss

ESXi 7.0/6.7 and MegaRaid on Alder Lake Asrock Z690M-ITX/ax

Accessing MegaRAID BIOS (WebBIOS, Ctrl-H) on consumer motherboards