Kontron S4600 SEL Troubleshooting User Manual

Miscellaneous Events
System Event Log Troubleshooting Guide for EPSD Platforms Based on Intel® Xeon® Processor E5 4600/2600/2400/1600/1400 Product Families
Intel order number G90620-002, Revision 1.1
11.6 BMC FW Health Sensor
The BMC tracks the health of each of its IPMI sensors and reports failures through a "BMC FW Health" sensor. This sensor is of the IPMI 2.0 sensor type Management Subsystem Health and supports the Sensor Failure offset. Only assertions are logged to the SEL for the Sensor Failure offset. The BMC FW Health sensor asserts for a given sensor when 10 consecutive errors occur while reading that sensor. These are not standard sensor events (that is, threshold crossings or discrete assertions). They are BMC Hardware Access Layer (HAL) errors, such as I²C NAKs or internal errors while attempting to read a register. The counter resets to zero whenever a sensor read completes successfully.
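The counting rule described above can be sketched as follows. This is an illustration only, not the BMC firmware's implementation: the class and method names are hypothetical, and the choice to report the assertion once (on exactly the tenth consecutive error) rather than on every later error is an assumption, since the text does not say what happens after the threshold is reached.

```python
FAILURE_THRESHOLD = 10  # consecutive read errors before assertion (from the text)


class SensorHealthTracker:
    """Hypothetical per-sensor consecutive-error counter (illustration only)."""

    def __init__(self, threshold=FAILURE_THRESHOLD):
        self.threshold = threshold
        self.error_counts = {}  # sensor number -> consecutive read errors

    def record_read(self, sensor_number, ok):
        """Record one read attempt.

        Returns True only on the read that brings the consecutive error
        count up to the threshold (assumed behavior, see lead-in).
        """
        if ok:
            # A successful read resets the counter to zero.
            self.error_counts[sensor_number] = 0
            return False
        count = self.error_counts.get(sensor_number, 0) + 1
        self.error_counts[sensor_number] = count
        return count == self.threshold


tracker = SensorHealthTracker()
# Sensor number 30h is a made-up example value.
results = [tracker.record_read(0x30, ok=False) for _ in range(10)]
```

In this sketch, nine errors followed by one successful read start the count over, so no assertion is reported until ten errors occur in a row.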
Table 83: BMC FW Health Sensor Typical Characteristics

Byte  Field                Description
11    Sensor Type          28h = Management Subsystem Health
12    Sensor Number        10h
13    Event Direction and  [7] Event direction
      Event Type               0b = Assertion Event
                               1b = Deassertion Event
                           [6:0] Event Type = 6Fh (Sensor Specific)
14    Event Data 1         [7:6] 11b = Sensor-specific event extension code in Event Data 2
                           [5:4] 00b = Unspecified Event Data 3
                           [3:0] Event Trigger Offset = 4h = Sensor failure
15    Event Data 2         Sensor number of the failed sensor
16    Event Data 3         Not used
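As a reading aid for Table 83, the following minimal sketch decodes bytes 11 through 16 of a raw 16-byte SEL record. The function name and the returned dictionary keys are hypothetical, and the example record is made up for illustration (including the failed sensor number 30h); it is not taken from a real log.

```python
MGMT_SUBSYSTEM_HEALTH = 0x28  # Sensor Type, Table 83 byte 11
SENSOR_SPECIFIC = 0x6F        # Event Type, Table 83 byte 13 bits [6:0]
SENSOR_FAILURE_OFFSET = 0x4   # Event Trigger Offset, Table 83 byte 14 bits [3:0]


def decode_bmc_fw_health_event(record: bytes) -> dict:
    """Decode bytes 11-16 (1-based, as in Table 83) of a 16-byte SEL record."""
    if len(record) != 16:
        raise ValueError("expected a 16-byte SEL record")
    # Table 83 numbers bytes from 1; Python indexes from 0.
    sensor_type, sensor_number, dir_type, ed1, ed2, _ed3 = record[10:16]
    if sensor_type != MGMT_SUBSYSTEM_HEALTH:
        raise ValueError("not a Management Subsystem Health event")
    return {
        "sensor_number": sensor_number,
        "direction": "deassertion" if dir_type & 0x80 else "assertion",
        "event_type": dir_type & 0x7F,
        "ed2_is_sensor_specific": (ed1 >> 6) == 0b11,
        "offset": ed1 & 0x0F,
        "failed_sensor_number": ed2,  # Event Data 2
    }


# Made-up record: ID 0001h, type 02h, zero timestamp, generator 0020h,
# EvM Rev 04h, then bytes 11-16: 28h 10h 6Fh C4h 30h FFh.
example = bytes.fromhex("0100" "02" "00000000" "2000" "04" "28" "10" "6F" "C4" "30" "FF")
event = decode_bmc_fw_health_event(example)
is_sensor_failure = (
    event["event_type"] == SENSOR_SPECIFIC
    and event["offset"] == SENSOR_FAILURE_OFFSET
)
```

With the field values in Table 83, an assertion event has byte 13 equal to 6Fh and Event Data 1 equal to C4h (11b, 00b, and offset 4h packed into one byte).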
11.6.1 BMC FW Health Sensor – Next Steps
1. Check the SEL for any other events around the time of the failure.
2. Take note of all IPMI activity that was occurring around the time of the failure. Capture a System BMC Debug Log as soon as you can after experiencing this failure. This log can be captured from the Integrated BMC Web Console or by using the Intel® Syscfg utility (syscfg /sbmcdl private filename.zip). Send the log file to your system manufacturer or Intel representative for failure analysis.
3. If the failure continues around a specific sensor, replace the board that contains that sensor.
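To help judge whether failures keep occurring around one sensor (step 3), the failed sensor numbers taken from Event Data 2 of each BMC FW Health event can be tallied. The sketch below is a hypothetical helper, not part of any Intel tool; the cutoff of two occurrences and the sample sensor numbers are assumptions chosen for illustration.

```python
from collections import Counter


def repeated_failures(failed_sensor_numbers, min_count=2):
    """Return {sensor number: count} for sensors that failed at least min_count times.

    failed_sensor_numbers: Event Data 2 values collected from BMC FW Health
    events in the SEL. min_count is an illustrative cutoff, not a documented one.
    """
    counts = Counter(failed_sensor_numbers)
    return {number: count for number, count in counts.items() if count >= min_count}


# Made-up Event Data 2 values from four BMC FW Health events.
suspects = repeated_failures([0x30, 0x21, 0x30, 0x30])
for number, count in suspects.items():
    print(f"Sensor {number:02X}h failed {count} times")
```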