Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc.
is strictly forbidden.
Trademarks used in this text: Dell, the DELL logo and Dell OpenManage are trademarks of Dell Inc.;
VMware is registered trademark or trademark of VMware, Inc. in the United States and/or other
jurisdictions; Microsoft, Windows and Windows Server are either trademarks or registered trademarks
of Microsoft Corporation in the United States and/or other countries; Red Hat and Red Hat Enterprise
Linux are registered trademark of Red Hat, Inc.; SUSE is a registered trademark of Novell, Inc. in the
United States and other countries.
Other trademarks and trade names may be used in this document to refer to either the entities claiming
the marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and
trade names other than its own.
Dell™ OpenManage™ Server Administrator produces event messages
stored primarily in the operating system or Server Administrator event logs.
This document describes the event messages created by Server Administrator
version 6.0.3 and displayed in the Server Administrator Alert log.
Server Administrator creates events in response to sensor status changes and
other monitored parameters. The Server Administrator event monitor uses
these status change events to add descriptive messages to the operating
system event log or the Server Administrator Alert log.
Each event message that Server Administrator adds to the Alert log consists
of a unique identifier called the event ID for a specific event source category
and a descriptive message. The event message includes the severity, cause of
the event, and other relevant information, such as the event location and the
monitored item’s previous state.
Tables provided in this guide list all Server Administrator event IDs in numeric
order. Each entry includes the event ID’s corresponding description, severity level,
and cause. Message text in angle brackets (for example,
event-specific information provided by the Server Administrator.
What’s New in this Release
The following changes have been made for this release:
•Support for the VMware® ESXi version 3.5 Update 4 hypervisor.
•Support for the Server Administrator Web Server.
•Supports Serial Attached SCSI (SAS) controllers only for this release.
<State>
) describes the
Introduction7
•No SNMP trap support for this release.
•No support for LRA numbers for this release.
•Added two new alerts 1013 and 2382 in the “Miscellaneous Messages” and
“Alert Descriptions and Corrective Actions” sections respectively.
•Added the POST Code Errors table in the “BIOS Generated System
Events” section.
•Support for Solid State Drives (SSD). Added new SSD alert 2370 in the
“Storage Management Message Reference” section.
Messages Not Described in This Guide
This guide describes only event messages created by Server Administrator and
displayed in the Server Administrator Alert log. For information on other
messages produced by your system, consult one of the following sources:
•Your system’s
•Operating system documentation
•Application program documentation
Installation and Troubleshooting Guide
Understanding Event Messages
This section describes the various types of event messages generated by
the Server Administrator. When an event occurs on your system, the Server
Administrator sends information about one of the following event types to
the systems management console:
8Introduction
Table 1-1. Understanding Event Messages
IconAlert SeverityComponent Status
OK /Normal /
Informational
War n in g /
Non-critical
Critical /
Failure / Error
An event that describes the successful operation of a unit.
The alert is provided for informational purposes and does
not indicate an error condition. For example, the alert may
indicate the normal start or stop of an operation, such as
power supply or a
An event that is not necessarily significant, but may indicate a
possible future problem.
alert may indicate that a component (such as a temperature
probe in an enclosure) has crossed a warning threshold.
A significant event that indicates actual or imminent loss of
data or loss of function.
threshold or a hardware failure such as
sensor reading returning to normal.
For example, a Warning/Non-critical
For ex am p l e ,
crossing a failure
an array disk.
Server Administrator generates events based on status changes in the
following sensors:
•
Temperature Sensor
— Helps protect critical components by alerting the
systems management console when temperatures become too high inside
a chassis; also monitors a variety of locations in the chassis and in any
attached systems.
•
Fan Sensor
— Monitors fans in various locations in the chassis and in any
attached systems.
•
Voltage Sensor
— Monitors voltages across critical components in various
chassis locations and in any attached systems.
•
Current Sensor
— Monitors the current (or amperage) output from the
power supply (or supplies) in the chassis and in any attached systems.
•
Chassis Intrusion Sensor
— Monitors intrusion into the chassis and any
attached systems.
Introduction9
•
Redundancy Unit Sensor
such as fans, AC power cords, or power supplies) within the chassis;
also monitors the chassis and any attached systems. For example,
redundancy allows a second or
safe temperature when another fan has failed. Redundancy is normal when
the intended number of critical components are operating. Redundancy is
degraded when a component fails, but others are still operating.
Redundancy is lost when there is one less critical redundancy device than
required.
•
Power Supply Sensor
attached systems.
•
Memory Prefailure Sensor
number of Error Correction Code (ECC) memory corrections.
Fan Enclosure Sensor
•
their removal from and insertion into the system, and by measuring how
long a fan enclosure is absent from the chassis. This sensor monitors the
chassis and any attached systems.
•
AC Power Cord Sensor
AC power cord.
•
Hardware Log Sensor
•
Processor Sensor
•
Pluggable Device Sensor
or configuration errors for some pluggable devices, such as memory cards.
•
Battery Sensor
the system.
— Monitors the processor status in the system.
— Monitors the status of one or more batteries in
— Monitors redundant units (critical units
n
th fan to keep the chassis components at a
— Monitors power supplies in the chassis and in any
— Monitors memory modules by counting the
— Monitors protective fan enclosures by detecting
— Monitors the presence of AC power for an
— Monitors the size of a hardware log.
— Monitors the addition, removal,
Sample Event Message Text
The following example shows the format of the event messages logged by
Server Administrator.
EventID: 1000
Source: Server Administrator
Category: Instrumentation Service
Type: Information
10Introduction
Date and Time: Mon Oct 21 10:38:00 2002
Computer:
<computer name>
Description:
Server Administrator starting
Data: Bytes in Hex
Viewing Alerts and Event Messages
NOTE: The Red Hat® Enterprise Linux®, SUSE® Linux Enterprise Server and
Microsoft
the VMware
An event log is used to record information about important events.
Server Administrator generates alerts that are added to the operating system
event log and to the Server Administrator Alert log. To view these alerts in
Server Administrator:
1
Select the
2
Select the
3
Select the
You can also view the event log using your operating system’s event viewer.
Each operating system’s event viewer accesses the applicable operating
system event log.
The location of the event log file depends on the operating system you are using.
•In the Microsoft® Windows® 2000 Advanced Server and Windows
Server
log and optionally to a Unicode text file,
Notepad), that is located in the
The default
•In the Red Hat Enterprise Linux, SUSE Linux Enterprise Server and
VMware ESXi operating systems, messages are logged to the system log
file. The default name of the system log file is
view the messages file using a text editor such as vi or emacs.
®
Windows® content mentioned in the following section do not apply to
®
ESXi version 3.5 Update 4 release.
System
Logs
Alert
®
2003 operating systems, messages are logged to the system event
object in the tree view.
tab.
subtab.
install_path
is
C:\Program Files\Dell\SysMgt
dcsys32.log
install_path
(viewable using
\omsa\log
directory.
.
/var/log/messages
. You can
Introduction11
Logging Messages to a Unicode File
Logging messages to a Unicode text file is optional. By default, the feature is
disabled. To enable this feature, modify the Event Manager section of the
dcemdy32.ini file as follows:
•In Windows, locate the file at <install_path>\dataeng\ini and set
UnitextLog.enabled=True
Files\Dell\SysMgt. Restart the DSM SA Event Manager service.
•In Red Hat Enterprise Linux and SUSE Linux Enterprise Server, locate the file at
<install_path>/dataeng/ini and set
default install_path is /opt/dell/srvadmin. Issue the "/etc/init.d/dataeng restart"
command to restart the Server Administrator event manager service.
. The default <install_path> is C:\Program
UnitextLog.enabled=True
. The
The following subsections explain how to open the Windows 2000 Advanced
Server, Windows Server 2003, Red Hat Enterprise Linux, SUSE Linux
Enterprise Server and VMware ESXi event viewers.
Viewing Events in Windows 2000 Advanced Server and Windows
Server 2003
1
Click the
2
Double-click
3
In the
The
4
To view the details of an event, double-click one of the event items.
Start
Administrative Tools
Event Viewer
System Log
button, point to
window, click the
Settings
, and click
Control Panel
, and then double-click
Tree
tab and then click
Event Viewer
window displays a list of recently logged events.
.
.
System Log
.
NOTE: You can find the event log file dcsys32.log, at <install_path>\omsa\log
directory. The default <install_path> is C:\Program Files\Dell\SysMgt.
Viewing Events in Red Hat Enterprise Linux and SUSE Linux
Enterprise Server
1
Log in as
2
Use a text editor such as vi or emacs to view the file named
messages
12Introduction
root
.
.
/var/log/
The following example shows the Red Hat Enterprise Linux and SUSE Linux
Enterprise Server message log, /var/log/messages. The text in boldface type
indicates the message text.
...
Feb 6 14:20:51 server01 Server Administrator:
Instrumentation Service EventID: 1000
Server Administrator starting
Feb 6 14:20:51 server01 Server Administrator:
Instrumentation Service EventID: 1001
Server Administrator startup complete
Feb 6 14:21:21 server01 Server Administrator:
Instrumentation Service EventID: 1254 Chassis
intrusion detected Sensor location: Main chassis
intrusion Chassis location: Main System Chassis
Previous state was: OK (Normal) Chassis intrusion
state: Open
Feb 6 14:21:51 server01 Server Administrator:
Instrumentation Service EventID: 1252 Chassis
intrusion returned to normal Sensor location: Main
chassis intrusion Chassis location: Main System
Chassis Previous state was: Critical (Failed) Chassis
intrusion state: Closed
Viewing Events in VMware ESXi
1
Log in to the VMware ESXi system with VMware Infrastructure (VI)
Client.
2
Click
3
4
Administration
Select
System Logs
Select
Server Log [/var/log/messages
on the navigation bar.
.
] entry on the drop-down list.
Introduction13
Viewing the Event Information
The event log for each operating system contains some or all of the following
information:
•
Date
— The date the event occurred.
•
Time
— The local time the event occurred.
Ty p e
•
— A classification of the event severity: Information, Warning,
or Error.
•
User
— The name of the user on whose behalf the event occurred.
Computer
•
•
Source
•
Category
Event ID
•
•
Description
— The name of the system where the event occurred.
— The software that logged the event.
— The classification of the event by the event source.
— The number identifying the particular event type.
— A description of the event. The format and contents of
the event description vary, depending on the event type.
Understanding the Event Description
Table 1-2 lists in alphabetical order each line item that may appear in the
event description.
Table 1-2. Event Description Reference
Description Line ItemExplanation
Action performed was:
<Action>
Action requested was:
<Action>
Additional Details:
<Additional details for
the event>
Specifies the action that was performed,
for example:
Action performed was: Power cycle
Specifies the action that was requested,
for example:
Action requested was: Reboot, shutdown OS first
Specifies additional details available for the hot
plug event, for example:
Specifies the status of the previous redundancy
message, for example:
Previous redundancy state was: Lost
Specifies the previous state of the sensor, for
example:
Previous state was: OK (Normal)
Specifies the status of the processor sensor,
for example:
Processor sensor status:
Configuration error
Specifies the location of the redundant power
supply or cooling unit in the chassis, for example:
Redundancy unit: Fan Enclosure
Specifies the location of the sensor in the specified
chassis, for example:
Sensor location: CPU1
Specifies the temperature in degrees Celsius,
for example:
Temperature sensor value
(in degrees Celsius): 30
Specifies the voltage sensor value in volts,
for example:
Voltage sensor value (in Volts):
1.693
Introduction17
18Introduction
Event Message Reference
The following tables lists in numerical order each event ID and its
corresponding description, along with its severity and cause.
NOTE: For corrective actions, see the appropriate documentation.
Miscellaneous Messages
Miscellaneous messages in Table 2-1 indicate that certain alert systems are up
and working.
Table 2-1. Miscellaneous Messages
Event IDDescriptionSeverityCause
0000Log was clearedInformationUser cleared the log from
Server Administrator.
0001Log backup createdInformationThe log was full, copied to
backup, and cleared.
1000Server Administrator
starting
1001Server Administrator
startup complete
1002A system BIOS update
has been scheduled for
the next reboot
1003A previously scheduled
system BIOS update has
been canceled
InformationServer Administrator is
beginning to initialize.
InformationServer Administrator
completed its initialization.
InformationThe user has chosen to
update the flash basic input/
output system (BIOS).
InformationThe user decides to cancel
the flash BIOS update, or an
error occurs during the flash.
Event Message Reference19
Table 2-1. Miscellaneous Messages (continued)
Event IDDescriptionSeverityCause
1004Thermal shutdown
protection has been
initiated
1005SMBIOS data is absentErrorThe system does not
1006Automatic System
Recovery (ASR) action
was performed
Action performed was:
<Action>
Date and time of
action:
time>
1007User initiated host
system control action
Action requested was:
<Action>
<Date and
ErrorThis message is generated
when a system is configured
for thermal shutdown due
to an error event. If a
temperature sensor reading
exceeds the error threshold
for which the system is
configured, the operating
system shuts down and
the system powers off.
This event may also be
initiated on certain systems
when a fan enclosure is
removed from the system for
an extended period of time.
contain the required systems
management BIOS
version 2.2 or higher, or the
BIOS is corrupted.
ErrorThis message is generated
when an automatic system
recovery action is performed
due to a hung operating
system. The action
performed and the time of
action are provided.
InformationUser requested a host system
control action to reboot,
power off, or power cycle the
system. Alternatively the
user had indicated protective
measures to be initiated in
the event of a thermal
shutdown.
20Event Message Reference
Table 2-1. Miscellaneous Messages (continued)
Event IDDescriptionSeverityCause
1008Systems Management
Data Manager Started
1009Systems Management
Data Manager Stopped
1011RCI table is corruptErrorThis message is generated
1012IPMI Status
Interface: <
the IPMI
interface being used
additional
<
information if
available and
applicable
1013System Peak Power
detected new peak
value
Peak value (in Watts):
>
<Reading>
InformationSystems Management
Data Manager services
were started.
InformationSystems Management
Data Manager services
were stopped.
when the BIOS Remote
Configuration Interface
(RCI) table is corrupted
or cannot be read by the
systems management
software.
InformationThis message is generated
to indicate the Intelligent
>,
InformationThe system peak power
Platform Management
Interface (IPMI)) status of
the system.
Additional information,
when available, includes
Baseboard Management
Controller (BMC) not
present, BMC not
responding, System Event
Log (SEL) not present, and
SEL Data Record (SDR) not
present.
sensor detected a new peak
value in power consumption.
The new peak value in Watts
is provided.
Event Message Reference21
Temperature Sensor Messages
Temperature sensors listed in Table 2-2 help protect critical components
by alerting the systems management console when temperatures become
too high inside a chassis. The temperature sensor messages use additional
variables: sensor location, chassis location, previous state, and temperature
sensor value or state.
Table 2-2. Temperature Sensor Messages
Event IDDescriptionSeverityCause
1050 Temperature sensor has failed
Sensor location: <Location in
chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not discrete:
Temperature sensor value
(in degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
1051 Temperature sensor value
unknown
Sensor location: <Location in
chassis>
Chassis location: <Name of
chassis>
If sensor type is not discrete:
Temperature sensor value (in
degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
<Reading>
<Reading>
ErrorA temperature
sensor on the
backplane board,
system board, or
the carrier in the
specified system
failed. The sensor
location, chassis
location, previous
state, and
temperature sensor
value are provided.
Information A temperature
sensor on the
backplane board,
system board, or
drive carrier in the
specified system
could not obtain a
reading. The sensor
location, chassis
location, previous
state, and
a nominal
temperature sensor
value are provided.
22Event Message Reference
Table 2-2. Temperature Sensor Messages (continued)
Event IDDescriptionSeverityCause
1052 Temperature sensor returned
to a normal value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of
chassis>
Previous state was:
If sensor type is not discrete:
Temperature sensor value (in
degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
<Reading>
<State>
1053 Temperature sensor detected
a warning value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of
chassis>
Previous state was:
If sensor type is not discrete:
Temperature sensor value (in
degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
<Reading>
<State>
Information A temperature
sensor on the
backplane board,
system board, or
drive carrier in the
specified system
returned to a valid
range after crossing
a failure threshold.
The sensor
location, chassis
location, previous
state, and
temperature sensor
value are provided.
WarningA temperature
sensor on the
backplane board,
system board,
CPU, or drive
carrier in the
specified system
exceeded its
warning threshold.
The sensor
location, chassis
location, previous
state, and
temperature sensor
value are provided.
Event Message Reference23
Table 2-2. Temperature Sensor Messages (continued)
Event IDDescriptionSeverityCause
1054 Temperature sensor detected
a failure value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of
chassis>
Previous state was:
If sensor type is not discrete:
Temperature sensor value (in
degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
<Reading>
<State>
1055 Temperature sensor detected
a non-recoverable value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of
chassis>
Previous state was:
If sensor type is not discrete:
Temperature sensor value (in
degrees Celsius):
If sensor type is discrete:
Discrete temperature state:
<State>
<Reading>
<State>
ErrorA temperature
sensor on the
backplane board,
system board, or
drive carrier in the
specified system
exceeded its failure
threshold. The
sensor location,
chassis location,
previous state, and
temperature sensor
value are provided.
ErrorA temperature
sensor on the
backplane board,
system board, or
drive carrier in the
specified system
detected an error
from which it
cannot recover.
The sensor
location, chassis
location, previous
state, and
temperature sensor
value are provided.
24Event Message Reference
Cooling Device Messages
Cooling device sensors listed in Table 2-3 monitor how well a fan is
functioning. Cooling device messages provide status and warning information
for fans in a particular chassis.
Table 2-3. Cooling Device Messages
Event IDDescriptionSeverityCause
1100 Fan sensor has
failed
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
1101 Fan sensor value
unknown
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
ErrorA fan sensor in the specified
system is not functioning.
The sensor location, chassis
location, previous state, and fan
sensor value are provided.
ErrorA fan sensor in the specified
system could not obtain a
reading. The sensor location,
chassis location, previous state,
and a nominal fan sensor value
are provided.
Event Message Reference25
Table 2-3. Cooling Device Messages (continued)
Event IDDescriptionSeverityCause
1102 Fan sensor returned
to a normal value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
1103 Fan sensor detected
a warning value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
Information A fan sensor reading on the
specified system returned to a
valid range after crossing a
warning threshold. The sensor
location, chassis location,
previous state, and fan sensor
value are provided.
WarningA fan sensor reading in the
specified system exceeded a
warning threshold. The sensor
location, chassis location,
previous state, and fan sensor
value are provided.
26Event Message Reference
Table 2-3. Cooling Device Messages (continued)
Event IDDescriptionSeverityCause
1104 Fan sensor detected
a failure value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
1105 Fan sensor detected
a
non-recoverable
value
Sensor location:
<Location in
chassis>
Chassis location:
<Name of chassis>
Previous state was:
<State>
Fan sensor value:
<Reading>
ErrorA fan sensor in the specified
system detected the failure of
one or more fans. The sensor
location, chassis location,
previous state, and fan sensor
value are provided.
ErrorA fan sensor detected an error
from which it cannot recover.
The sensor location, chassis
location, previous state, and fan
sensor value are provided.
Event Message Reference27
Voltage Sensor Messages
Voltage sensors listed in Table 2-4 monitor the number of volts across critical
components. Voltage sensor messages provide status and warning information
for voltage sensors in a particular chassis.
Table 2-4. Voltage Sensor Messages
Event IDDescriptionSeverityCause
1150 Voltage sensor has failed
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value (in
Volts):
If sensor type is discrete:
Discrete voltage state:
<State>
1151 Voltage sensor value unknown
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value
(in Volts):
If sensor type is discrete:
Discrete voltage state:
<State>
<Reading>
<Reading>
ErrorA voltage sensor in
the specified system
failed. The sensor
location, chassis
location, previous
state, and voltage
sensor value are
provided.
WarningA voltage sensor in
the specified system
could not obtain
a reading. The sensor
location, chassis
location, previous
state, and a nominal
voltage sensor value
are provided.
28Event Message Reference
Table 2-4. Voltage Sensor Messages (continued)
Event IDDescriptionSeverityCause
1152 Voltage sensor returned to
a normal value
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value
(in Volts):
If sensor type is discrete:
Discrete voltage state:
<Reading>
<State>
1153 Voltage sensor detected a
warning value
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value
(in Volts):
If sensor type is discrete:
Discrete voltage state:
<Reading>
<State>
Information A voltage sensor in
the specified system
returned to a valid
range after crossing
a failure threshold.
The sensor location,
chassis location,
previous state, and
voltage sensor value
are provided.
WarningA voltage sensor in
the specified system
exceeded its warning
threshold. The sensor
location, chassis
location, previous
state, and voltage
sensor value are
provided.
Event Message Reference29
Table 2-4. Voltage Sensor Messages (continued)
Event IDDescriptionSeverityCause
1154 Voltage sensor detected
a failure value
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value
(in Volts):
If sensor type is discrete:
Discrete voltage state:
<Reading>
<State>
1155 Voltage sensor detected a
non-recoverable value
Sensor location: <Location
in chassis>
Chassis location: <Name of
chassis>
Previous state was: <State>
If sensor type is not
discrete:
Voltage sensor value
(in Volts):
If sensor type is discrete:
Discrete voltage state:
<Reading>
<State>
ErrorA voltage sensor in
the specified system
exceeded its failure
threshold. The sensor
location, chassis
location, previous
state, and voltage
sensor value are
provided.
ErrorA voltage sensor in
the specified system
detected an error
from which it cannot
recover. The sensor
location, chassis
location, previous
state, and voltage
sensor value are
provided.
30Event Message Reference
Loading...
+ 194 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.