beautypg.com

Dell POWEREDGE M1000E User Manual

Page 28

background image

14

Fabric OS Troubleshooting and Diagnostics Guide

53-1001769-01

Switch boot issues

2

reboot

haFailover

fastBoot

firmwareDownload

The RRD feature is activated and halts rebooting when an unexpected reboot reason is shown
continuously in the reboot history within a certain period of time. The period of time is switch
dependent. The following are considered unexpected reboots:

Reset
A reset reboot may be caused by one of the following:

-

Power-cycle of the switch or CP.

-

Linux reboot command.

-

Hardware watchdog timeout.

-

Heartbeat loss related reboot.

Software Fault:Kernel Panic

-

If the system upon detecting an internal fatal error from which it cannot safely recover,
generally it will output an error message to the console, dump a stack trace for debugging
and then performs an automatic reboot.

-

After a kernel panic, the system may not have enough time to write the reboot reason
causing the reboot reason to be empty. This is treated as an Unknown/reset case.

Software fault

-

Software Fault:Software Watchdog

-

Software Fault:ASSERT.

Software recovery failure
This is an HA bootup related issue and happens when switch is unable to recover to a stable
state. HASM log contains more detail and specific information on this type of failure, such as
one of the following:

-

Failover recovery failed: This occurs when failover recovery failed and has to reboot the CP.

-

Failover when standby CP unready: Occurs when the active CP has to failover, but the
standby CP is not ready to takeover mastership.

-

Failover when LS trans incomplete: Takes place when a logical switch transaction is
incomplete.

Software bootup failure
This is an HA bootup related issue and happens when a switch is unable to load the firmware
to a usable state. HASM log contains more detail and specific information on this type of
failure, such as one of the following:

-

System bring up timed out: The CP failed to come up within the time allotted.

-

LS configuration timed out and failed: Logical switch configuration failed and timed out.

After RRD is activated, admin level permission is required to login enter the supportShow or
supportSave command to collect a limited amount of data to resolve the issue.

ATTENTION

The limited supportSave used with the RRD feature does not support USB.