Dell Poweredge C4130 EMC Servers Troubleshooting Guide User Manual Power Edge C6320p User's Guide15 En Us

User Manual: Dell poweredge-c4130 - Dell EMC PowerEdge Servers Troubleshooting Guide

Open the PDF directly: View PDF PDF.
Page Count: 98

DownloadDell Poweredge-c4130 EMC PowerEdge Servers Troubleshooting Guide User Manual - Power Edge Poweredge-c6320p User's Guide15 En-us
Open PDF In BrowserView PDF
Dell EMC PowerEdge Servers
Troubleshooting Guide

Notes, cautions, and warnings
NOTE: A NOTE indicates important information that helps you make better use of your product.
CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem.
WARNING: A WARNING indicates a potential for property damage, personal injury, or death.

Copyright © 2017 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other
trademarks may be trademarks of their respective owners.

2017 - 12
Rev. A05

Contents
1 Introduction....................................................................................................................................................6
Audience..............................................................................................................................................................................6
Recommended tools.......................................................................................................................................................... 6
Documentation resources................................................................................................................................................. 7
Safety instructions............................................................................................................................................................. 8
2 Diagnostic indicators..................................................................................................................................... 9
Status LED indicators........................................................................................................................................................ 9
System health and system ID indicator codes.............................................................................................................. 10
iDRAC Quick Sync 2 indicator codes............................................................................................................................. 10
iDRAC Direct LED indicator codes.................................................................................................................................. 11
NIC indicator codes........................................................................................................................................................... 11
Power supply unit indicator codes..................................................................................................................................12
Non-redundant power supply unit indicator codes...................................................................................................... 14
Hard drive indicator codes...............................................................................................................................................15
uSATA SSD indicator codes.............................................................................................................................................16
Internal dual SD module indicator codes........................................................................................................................16
3 Running diagnostics..................................................................................................................................... 18
Receiving automated support with SupportAssist ......................................................................................................18
PSA/ePSA Diagnostics.................................................................................................................................................... 18
Running the PSA/ePSA Diagnostics........................................................................................................................ 18
PSA/ePSA Diagnostics error codes......................................................................................................................... 19
Debugging mini crash dump files using by WinDbg in Windows operating system................................................. 35
4 Troubleshooting hardware issues................................................................................................................. 40
Troubleshooting no power issues...................................................................................................................................40
Troubleshooting system startup failure.......................................................................................................................... 41
Troubleshooting external connections............................................................................................................................41
Troubleshooting the video subsystem............................................................................................................................41
Troubleshooting a USB device........................................................................................................................................ 41
Troubleshooting iDRAC Direct (USB XML configuration).....................................................................................42
Troubleshooting iDRAC Direct (Laptop connection)............................................................................................. 42
Troubleshooting a serial I/O device................................................................................................................................42
Troubleshooting a NIC..................................................................................................................................................... 43
Troubleshooting a wet system........................................................................................................................................43
Troubleshooting a damaged system.............................................................................................................................. 44
Troubleshooting the system battery.............................................................................................................................. 44
Troubleshooting cooling problems..................................................................................................................................45
Troubleshooting cooling fans.......................................................................................................................................... 45
Troubleshooting an internal USB key.............................................................................................................................46
Troubleshooting an SD card............................................................................................................................................46

Contents

3

Troubleshooting an optical drive.....................................................................................................................................47
Troubleshooting a tape backup unit............................................................................................................................... 47
Troubleshooting a storage controller............................................................................................................................. 48
Troubleshooting expansion cards................................................................................................................................... 48
Troubleshooting processors............................................................................................................................................ 49
Troubleshooting hard drives............................................................................................................................................49
Checking hard drive status in the PERC BIOS...................................................................................................... 50
FAQs.............................................................................................................................................................................51
Symptoms...................................................................................................................................................................53
Troubleshooting system memory................................................................................................................................... 53
Troubleshooting memory module issues....................................................................................................................... 54
Prerequisites to follow when installing or upgrading memory.............................................................................. 54
Memory population guidelines..................................................................................................................................54
Memory modes.......................................................................................................................................................... 55
Correctable and uncorrectable errors..................................................................................................................... 56
System stops responding during POST after upgrading or installing a memory module..................................56
How to resolve unsupported or mismatched memory module error messages reported during POST
on a PowerEdge system?......................................................................................................................................... 57
Troubleshooting power supply units...............................................................................................................................57
Troubleshooting power source problems.................................................................................................................57
Troubleshooting power supply unit problems......................................................................................................... 57
Troubleshooting RAID......................................................................................................................................................58
RAID configuration using OpenManage Server Administrator.............................................................................58
RAID configuration by using Unified Server Configurator..................................................................................... 61
Downloading and installing the RAID controller log export by using PERCCLI tool on ESXi hosts on
Dell’s 13th generation of PowerEdge servers.........................................................................................................64
Configuring RAID by using Lifecycle Controller......................................................................................................68
Starting and target RAID levels for virtual disk reconfiguration and capacity expansion..................................69
Reconfiguring or migrating virtual disks.................................................................................................................. 70
Foreign Configuration Operations............................................................................................................................ 72
Viewing Patrol Read report....................................................................................................................................... 73
Check Consistency report........................................................................................................................................ 75
Virtual disk troubleshooting ..................................................................................................................................... 75
Troubleshooting memory or battery errors on the PERC controller on Dell PowerEdge servers.................... 77
Slicing.......................................................................................................................................................................... 79
RAID puncture............................................................................................................................................................ 79
Blank or skipped RAID configuration screen during SMTD DVD installation...................................................... 82
Troubleshooting thermal issue........................................................................................................................................83
5 Server management software issues........................................................................................................... 84
How to set up Auto Dedicated NIC feature?............................................................................................................... 84
How do I configure RAID using operating system deployment wizard?................................................................... 85
How to set up e-mail alerts?.......................................................................................................................................... 85
How to export license using iDRAC web interface?....................................................................................................85
How to specify language and keyboard type?............................................................................................................. 86
How to configure network settings using Lifecycle Controller?................................................................................ 86

4

Contents

What are the different types of iDRAC licenses?.........................................................................................................87
What are the differences between iDRAC7 and iDRAC8?..........................................................................................87
Can I upgrade the iDRAC license from express to enterprise and BMC to express?..............................................87
How to activate license on iDRAC?...............................................................................................................................88
How to find out missing licenses?................................................................................................................................. 88
How to update BIOS on 13th generation PowerEdge servers?.................................................................................89
Which are the operating systems supported on PowerEdge servers?.....................................................................89
6 Troubleshooting operating system issues.....................................................................................................90
Troubleshooting blue screen errors (BSODs)...............................................................................................................90
Troubleshooting a Purple Screen of Death (PSOD).....................................................................................................91
Troubleshooting no boot issues for Windows operating systems...............................................................................91
No POST issues (iDRAC)................................................................................................................................................ 91
“First Boot Device cannot be set” error message is displayed when configuring a boot device during
POST............................................................................................................................................................................ 91
“Alert! iDRAC6 not responding.. Power required may exceed PSU wattage...” error message is
displayed at POST during a reboot.......................................................................................................................... 92
Troubleshooting a No POST situation........................................................................................................................... 92
Windows............................................................................................................................................................................93
FAQs............................................................................................................................................................................ 93
Symptoms...................................................................................................................................................................95
VMware.............................................................................................................................................................................95
FAQs............................................................................................................................................................................ 95
Symptoms...................................................................................................................................................................96
Linux.................................................................................................................................................................................. 96
FAQs............................................................................................................................................................................ 96
Symptoms...................................................................................................................................................................96
7 Getting help................................................................................................................................................. 97
Contacting Dell................................................................................................................................................................. 97
Locating Service Tag of your system.............................................................................................................................97

Contents

5

1
Introduction
Use this guide to learn how to identify and troubleshoot the Dell PowerEdge server issues.
In particular, this guide:
•

Provides troubleshooting procedures for issues related to Server Operating System, Server Hardware, and Server Management
Software.

•

Provides an overview of diagnostic indicators and describes how to use the indicator codes to facilitate troubleshooting.

•

Lists Dell PowerEdge server error messages and their probable causes, and provides the actions recommended to correct them.
NOTE: This guide does not cover every possible issue that might occur on Dell PowerEdge servers, however focuses on issues
that are frequently encountered or are frequently asked questions.

Topics:
•

Audience

•

Recommended tools

•

Documentation resources

•

Safety instructions

Audience
The information in this troubleshooting guide is intended primarily for administrators, who are responsible for managing the Dell PowerEdge
servers, however might be useful for all users of Dell servers.

Recommended tools
Lists the basic tools and equipment necessary to perform troubleshooting tasks on the Dell PowerEdge servers.
•

Key to the bezel lock
The key is needed only if your system includes a bezel.

•

Phillips #1 screwdriver

•

Phillips #2 screwdriver

•

Torx #T30 screwdriver

•

1/4 inch flat head screwdriver

•

#4 nut driver

•

Plastic scribe

•

Wrist grounding strap

•

ESD mat

You need the following tools to assemble the cables for a DC power supply unit:
•

AMP 90871-1 hand-crimping tool or equivalent

•

Tyco Electronics 58433-3 or equivalent

•

Wire-stripper pliers to remove insulation from size 10 AWG solid or stranded, insulated copper wire

6

Introduction

NOTE: Use alpha wire part number 3080 or equivalent (65/30 stranding).

Documentation resources
This section describes the troubleshooting documentation related to all components in a PowerEdge system.
Table 1. Additional documentation resources for your system
Task

Document

Location

Setting up your system

For information about installing the system into a
rack, see the Rack documentation included with
your rack solution.

Dell.com/poweredgemanuals
Dell.com/poweredgemanuals

For information about turning on the system and
the technical specifications of your system, see the
Getting Started With Your System document that
is shipped with your system.
Configuring your system

For information about the iDRAC features,
configuring and logging in to iDRAC, and managing
your system remotely, see the Integrated Dell
Remote Access Controller User's Guide.

Dell.com/idracmanuals

For information about installing the operating
system, see the operating system documentation.

Dell.com/operatingsystemmanuals

For information about understanding Remote
Dell.com/idracmanuals
Access Controller Admin (RACADM)
subcommands and supported RACADM interfaces,
see the RACADM Command Line Reference Guide
for iDRAC.
For information about updating drivers and
firmware, see the Methods to download firmware
and drivers section in this document.

Dell.com/support/drivers

Installing and servicing your
system

For information on installing or removing the
components in your system, see the Installation
and Service Manual for your system.

Dell.com/poweredgemanuals

Managing your system

For information about systems management
Dell.com/openmanagemanuals
software offered by Dell, see the Dell OpenManage
Systems Management Overview Guide.
For information about setting up, using, and
troubleshooting OpenManage, see the Dell
OpenManage Server Administrator User’s Guide.

Dell.com/openmanagemanuals

For information about installing, using, and
troubleshooting Dell OpenManage Essentials, see
the Dell OpenManage Essentials User’s Guide.

Dell.com/openmanagemanuals

For information about installing and using Dell
SupportAssist, see the Dell EMC SupportAssist
Enterprise User’s Guide.

Dell.com/SupportAssist Enterprise

For information about installing and using Active
System Manager (ASM), see the Active System
Manager User’s Guide.

Dell.com/asmdocs

Introduction

7

Task

Understanding event and error
messages

Document

Location

For understanding the features of Dell Lifecycle
Controller (LCC), see the Dell Lifecycle Controller
User’s Guide.

Dell.com/idracmanuals

For information about partner programs enterprise
systems management, see the OpenManage
Connections Enterprise Systems Management
documents.

Dell.com/openmanagemanuals

For information about checking the event and error Dell.com/openmanagemanuals > OpenManage
messages generated by the system firmware and
software
agents that monitor system components, see the
Dell Event and Error Messages Reference Guide.

Safety instructions
WARNING: Whenever you need to lift the system, get others to assist you. To avoid injury, do not attempt to lift the system by
yourself.
WARNING: Opening or removing the system cover while the system is powered on may expose you to a risk of electric shock.
CAUTION: Do not operate the system without the cover for a duration exceeding five minutes.
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
CAUTION: Operating the system without the system cover can result in component damage.
NOTE: It is recommended that you always use an antistatic mat and antistatic strap while working on components inside the
system.
NOTE: To ensure proper operation and cooling, all bays in the system and system fans must be populated always with a
component or with a blank.

8

Introduction

2
Diagnostic indicators
The diagnostic indicators on the system indicates operation and error status.

Status LED indicators
The status LED indicators on the system front panel display error status during system startup.
NOTE: No status LED indicators are illuminated when the system is turned off. To start the system, plug it into a working power
source and press the power button.
NOTE: Status LED indicators are always off and only turn solid amber if any error occurs.
Table 2. Status LED indicators
Icon

Description

Condition

Corrective action

Hard drive
indicator

The indicator turns solid amber if
there is a hard drive error.

Check the System Event Log to determine if the hard
drive has an error. Run the appropriate Online Diagnostics
test. Restart the system and run embedded diagnostics
(ePSA). If the hard drives are configured in a RAID array,
restart the system and enter the host adapter
configuration utility program.

Temperature
indicator

The indicator turns solid amber if the
system experiences a thermal error
(for example, the ambient
temperature is out of range or fan
failure).

Ensure that none of the following conditions exist:
•
•
•
•

A cooling fan has been removed or has failed.
System cover, air shroud, EMI filler panel, memory
module blank, or back filler bracket is removed.
Ambient temperature is too high.
External airflow is obstructed.

If the problem persists, see the Getting help section.
Electrical indicator The indicator turns solid amber if the
system experiences an electrical error
(for example, voltage out of range, or
a failed power supply unit (PSU) or
voltage regulator).

Check the System Event Log or system messages for the
specific issue. If it is due to a problem with the PSU, check
the LED on the PSU. Reseat the PSU. If the problem
persists, see the Getting help section.

Memory indicator

The indicator turns solid amber if a
memory error occurs.

Check the System Event Log or system messages for the
location of the failed memory. Reseat the memory module.
If the problem persists, see the Getting help section.

PCIe indicator

The indicator turns solid amber if a
PCIe card experiences an error.

Restart the system. Update any required drivers for the
PCIe card. Reinstall the card. If the problem persists, see
the Getting help section.
NOTE: For more information about the supported
PCIe cards, see the Expansion card installation
guidelines section.

Diagnostic indicators

9

System health and system ID indicator codes

Figure 1. System health and system ID indicators
Table 3. System health and system ID indicator codes
System health and system ID indicator code

Condition

Blue solid

Indicates that the system is turned on, system is healthy and system
ID mode is not active. Press the system health and system ID button
to switch to system ID mode.

Blinking blue

Indicates that the system ID mode is active. Press the system health
and system ID button to switch to system health mode.

Amber solid

Indicates that the system is in fail-safe mode. If the problem persists,
see the Getting help section.

Blinking amber

Indicates that the system is experiencing a fault. Check the System
Event Log or the LCD panel, if available on the bezel, for specific error
message. For more information about error messages, see the Dell
Event and Error Messages Reference Guide at Dell.com/
openmanagemanuals > OpenManage software.

iDRAC Quick Sync 2 indicator codes
NOTE: iDRAC Quick Sync 2 module (optional) is located on the left control panel of your system.

Figure 2. iDRAC Quick Sync 2 indicators
Table 4. iDRAC Quick Sync 2 indicators
Wireless indicator code

Condition

Off (default state)

Indicates that the iDRAC Quick Sync 2
If the LED fails to turn on, reseat the left control panel
feature is turned off. Press the iDRAC Quick flex cable and check again. If the problem persists,
Sync 2 button to turn on the iDRAC Quick
see the Getting help section.
Sync 2 feature.

White solid

Indicates that iDRAC Quick Sync 2 is ready
to communicate. Press the iDRAC Quick
Sync 2 button to turn off.

If the LED fails to turn off, restart the system. If the
problem persists, see the Getting help section.

Blinks white rapidly

Indicates data transfer activity.

If the indicator continues to blink indefinitely, see the
Getting help section.

10

Diagnostic indicators

Corrective action

Wireless indicator code

Condition

Corrective action

Blinks white slowly

Indicates that firmware update is in
progress.

If the indicator continues to blink indefinitely, see the
Getting help section.

Blinks white five times rapidly and
then turns off

Indicates that the iDRAC Quick Sync 2
feature is disabled.

Check if iDRAC Quick Sync 2 feature is configured to
be disabled by iDRAC. If the problem persists, see the
Getting help section. For more information, see
Integrated Dell Remote Access Controller User's
Guide at Dell.com/idracmanuals or Dell
OpenManage Server Administrator User’s Guide at
Dell.com/openmanagemanuals.

Amber solid

Indicates that the system is in fail-safe
mode.

Restart the system. If the problem persists, see the
Getting help section.

Blinking amber

Indicates that the iDRAC Quick Sync 2
hardware is not responding properly.

Restart the system. If the problem persists, see the
Getting help section.

iDRAC Direct LED indicator codes
The iDRAC Direct LED indicator lights up to indicate that the port is connected and is being used as a part of the iDRAC subsystem. iDRAC
Direct LED indicator is located below the iDRAC Direct port on the front panel.
The following table describes iDRAC Direct activity when configuring iDRAC Direct by using your laptop or tablet and USB to micro USB
(type AB) cable:
Table 5. iDRAC Direct LED indicator codes
iDRAC Direct LED
indicator pattern

Condition

Solid green for two seconds Indicates that the laptop or tablet is connected.
Flashing green (on for two
seconds and off for two
seconds)

Indicates that the laptop or tablet connected is recognized.

Turns off

Indicates that the laptop or tablet is unplugged.

NIC indicator codes
Each NIC on the back panel has indicators that provide information about the activity and link status. The activity LED indicator indicates if
data is flowing through NIC, and the link LED indicator indicates the speed of the connected network.

Figure 3. NIC indicators
1

Link LED indicator

2

Activity LED indicator

Diagnostic indicators

11

Table 6. NIC indicators
Status

Condition

Link and activity indicators are off

The NIC is not connected to the network.

Link indicator is green and activity indicator is blinking green The NIC is connected to a valid network at its maximum port speed and
data is being sent or received.
Link indicator is amber and activity indicator is blinking
green

The NIC is connected to a valid network at less than its maximum port
speed and data is being sent or received.

Link indicator is green and activity indicator is off

The NIC is connected to a valid network at its maximum port speed and
data is not being sent or received.

Link indicator is amber and activity indicator is off

The NIC is connected to a valid network at less than its maximum port
speed and data is not being sent or received.

Link indicator is blinking green and activity is off

NIC identify is enabled through the NIC configuration utility.

Power supply unit indicator codes
AC power supply units (PSUs) have an illuminated translucent handle that serves as an indicator and DC PSUs have an LED that serves as
an indicator. The indicator shows whether power is present or a power fault has occurred.

Figure 4. AC PSU status indicator
1

AC PSU status indicator or handle

Table 7. AC PSU status indicator
Power indicator codes

Condition

Green

A valid power source is connected to the PSU and the PSU is operational.

Blinking amber

Indicates a problem with the PSU.

Not illuminated

Power is not connected.

Blinking green

When the firmware of the PSU is being updated, the PSU handle blinks green.
CAUTION: Do not disconnect the power cord or unplug the PSU when updating firmware. If
firmware update is interrupted, the PSUs do not function.

Blinking green and turns off

When hot-plugging a PSU, the PSU handle blinks green five times at a rate of 4 Hz and turns off. This
indicates a PSU mismatch with respect to efficiency, feature set, health status, or supported voltage.
CAUTION: If two PSUs are installed, both the PSUs must have the same type of label. For
example, Extended Power Performance (EPP) label. Mixing PSUs from previous generations of
PowerEdge servers is not supported, even if the PSUs have the same power rating. This results
in a PSU mismatch condition or failure to turn the system on.

12

Diagnostic indicators

Power indicator codes

Condition
CAUTION: When correcting a PSU mismatch, replace only the PSU with the blinking indicator.
Swapping the PSU to make a matched pair can result in an error condition and unexpected
system shutdown. To change from a high output configuration to a low output configuration or
vice versa, you must turn off the system.
CAUTION: AC PSUs support both 240 V and 120 V input voltages with the exception of Titanium
PSUs, which support only 240 V. When two identical PSUs receive different input voltages, they
can output different wattages, and trigger a mismatch.
CAUTION: If two PSUs are used, they must be of the same type and have the same maximum
output power.
CAUTION: Combining AC and DC PSUs is not supported and triggers a mismatch.

Figure 5. DC PSU status indicator
1

DC PSU status indicator

Table 8. DC PSU status indicator codes
Power indicator codes

Condition

Green

A valid power source is connected to the PSU and the PSU is operational.

Blinking amber

Indicates a problem with the PSU.

Not illuminated

Power is not connected.

Blinking green

When hot-plugging a PSU, the PSU indicator blinks green. This indicates that there is a PSU mismatch
with respect to efficiency, feature set, health status, or supported voltage.
CAUTION: When correcting a PSU mismatch, replace only the PSU with the blinking
indicator. Swapping the PSU to make a matched pair can result in an error condition and
unexpected system shutdown. To change from a High Output configuration to a Low Output
configuration or vice versa, you must turn off the system.
CAUTION: If two PSUs are used, they must be of the same type and have the same
maximum output power.
CAUTION: Combining AC and DC PSUs is not supported and triggers a mismatch.

Diagnostic indicators

13

Non-redundant power supply unit indicator codes
Press the self-diagnostic button to perform a quick health check on the non-redundant power supply unit (PSU) of the system.

Figure 6. Non-redundant AC PSU status indicator and self-diagnostic button
1

Self-diagnostic button

2

AC PSU status indicator

Table 9. Non-redundant AC PSU status indicator
Power Indicator Pattern

Condition

Not lit

Power is not connected or PSU is faulty.

Green

A valid power source is connected to the PSU and the PSU is operational.

14

Diagnostic indicators

Hard drive indicator codes
Each hard drive carrier has an activity LED indicator and a status LED indicator. The indicators provide information about the current status
of the hard drive. The activity LED indicator indicates whether the hard drive is currently in use or not. The status LED indicator indicates
the power condition of the hard drive.

Figure 7. Hard drive indicators
1

Hard drive activity indicator

3

Hard drive

2

Hard drive status LED indicator

NOTE: If the hard drive is in the Advanced Host Controller Interface (AHCI) mode, the status LED indicator does not turn on.
Table 10. Hard drive indicator codes
Drive-status indicator pattern

Condition

Flashes green twice per second

Identifying drive or preparing for removal.

Off

Drive ready for insertion or removal.
NOTE: The drive status indicator remains off until all hard
drives are initialized after the system is turned on. Drives are
not ready for removal during this time.

Flashes green, amber, and then turns off

Predicted drive failure.

Flashes amber four times per second

Drive failed.

Flashes green slowly

Drive rebuilding.

Steady green

Drive online.

Flashes green for three seconds, amber for three seconds, and Rebuild stopped.
then turns off after six seconds

Diagnostic indicators

15

uSATA SSD indicator codes

Figure 8. uSATA SSD indicators
1

uSATA SSD activity indicator

3

uSATA SSD

2

uSATA SSD status indicator

NOTE: If the SSD is in the Advanced Host Controller Interface (AHCI) mode, the status indicator (on the right side) does not
function and remains off.
Table 11. Drive status indicator codes
Drive-status indicator pattern

Condition

Flashes green twice per second

Identifying drive or preparing for removal.

Off

Drive ready for insertion or removal.
NOTE: The drive status indicator remains off until all hard
drives are initialized after the system is turned on. Drives are
not ready for insertion or removal during this time.

Flashes green, amber, and turns off

Predicted drive failure.

Flashes amber four times per second

Drive failed.

Steady green

Drive online.

Flashes green for three seconds, amber for three seconds, and Rebuild stopped.
turns off after six seconds

Internal dual SD module indicator codes
The Internal Dual SD module (IDSDM) provides you with a redundant SD card solution. You can configure the IDSDM for storage or as the
OS boot partition. The IDSDM card offers the following features:
•

Dual card operation — maintains a mirrored configuration by using SD cards in both the slots and provides redundancy.
NOTE: When the Redundancy option is set to Mirror Mode in the Integrated Devices screen of System Setup, the information
is replicated from one SD card to another.

•

Single card operation — single card operation is supported, but without redundancy.

The following table describes the IDSDM indicator codes:

16

Diagnostic indicators

Table 12. IDSDM indicator codes
Convention

IDSDM indicator code

Description

A

Green

Indicates that the card is online.

B

Flashing green

Indicates rebuild or activity.

C

Flashing amber

Indicates card mismatch or that the card has failed.

D

Amber

Indicates that the card is offline, has failed, or is write-protected.

E

Not lit

Indicates that the card is missing or is booting.

Diagnostic indicators

17

3
Running diagnostics
Running diagnostics help you to identify the cause for a system issue. The diagnostics test your system hardware without requiring
additional equipment or risking data loss.
Topics:
•

Receiving automated support with SupportAssist

•

PSA/ePSA Diagnostics

•

Debugging mini crash dump files using by WinDbg in Windows operating system

Receiving automated support with SupportAssist
Dell SupportAssist is an optional Dell Services offering that automates technical support for your Dell server, storage, and networking
devices. By installing and setting up a SupportAssist application in your IT environment, you can receive the following benefits:
•

Automated issue detection — SupportAssist monitors your Dell devices and automatically detects hardware issues, both proactively
and predictively.

•

Automated case creation — When an issue is detected, SupportAssist automatically opens a support case with Dell Technical Support.

•

Automated diagnostic collection — SupportAssist automatically collects system state information from your devices and uploads it
securely to Dell. This information is used by Dell Technical Support to troubleshoot the issue.

•

Proactive contact — A Dell Technical Support agent contacts you about the support case and helps you resolve the issue.

The available benefits vary depending on the Dell Service entitlement purchased for your device. For more information about
SupportAssist, go to Dell.com/SupportAssist.

PSA/ePSA Diagnostics
Run the Embedded System Diagnostics (ePSA) if your system does not boot.

Running the PSA/ePSA Diagnostics
1

While the system is booting, press  to enter Boot Manager. Alternatively, press  to enter Lifecycle Controller.

2

Use the up and down arrow keys to select System Utilities → Hardware Diagnostics.

3

Note down the error code.

NOTE: For systems that do not have Lifecycle Controller, press  to launch the Utility Mode (diags) option.
The following table describes the PSA/ePSA diagnostics error messages.

18

Running diagnostics

PSA/ePSA Diagnostics error codes
Error number (PSA and ePSA)

Error message

Description

Steps

PSA NA

CPU - exception occurred

An error occurred during the
tests that may involve the
system board.

1

ePSA 2000-0111

2
3

PSA NA
ePSA 2000-0112

CPU - machine check exception
detected

An error occurred during the
tests that may involve the
system board.

1
2
3

PSA NA
ePSA 2000-0114

CPU - Cache integrity test
discrepancy

An error occurred during the
tests that may involve the
system board.

1
2
3
4

PSA NA
ePSA 2000-0115

CPU - Stress Thermal condition.
Limit (d)C. Actual (d)C

An error occurred during the
tests that may involve the
system board.

1
2
3
4

PSA NA
ePSA 2000-0121

Memory - memory errors were
detected and repaired

An error occurred during the
tests that may involve the
system board or memory of the
system. However, the system
has self repaired.

1
2
3
4

PSA 1000-0122
ePSA 2000-0122

PSA Memory - test initialization
failure
ePSA Memory - memory errors
were detected and excessive
errors were detected

An error occurred during the
tests that may involve the
system board or memory of the
system. However, the system
has self repaired.

1
2
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
Check temperatures in
system health and check
that no airflow obstructed.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
Check temperatures in
system health and check
that no airflow obstructed.
If failure continues, contact
Dell Technical Support
Turn off the system and
reseat the memory
modules.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Turn off the system and
reseat the memory
modules.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.

Running diagnostics

19

Error number (PSA and ePSA)

PSA 1000-0123

Error message

ePSA 2000-0124
PSA NA

If failure continues, contact
Dell Technical Support

An error occurred during the
tests that may involve the
system board or memory of the
system. However, the system
has self repaired.

1

Turn off the system and
reseat the memory
modules.
Repeat the PSA
diagnostics.

System Log - ,


This is information in the system
log to show time and messages
related to system events.

1
2

Clear the system log.
Repeat the PSA
diagnostics.

Event Log

The IPMI system event log is full
for various reasons or logging
has stopped because too many
ECC errors have occurred.

1

Clear the IPMI system
event log.
Repeat the PSA
diagnostics.

Memory - integrity test failed

ePSA 2000-0125

PSA NA

ePSA 2000-0131

2

2

Event Log

The event log(s) must be cleared 1
before testing can continue.
2

Clear the system event log.
Repeat the PSA
diagnostics.

Battery - the battery is not
installed

An error occurred during the
tests that may involve the main
system board or battery of the
system.

Turn off the system and
reseat the system battery.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support.

ePSA 2000-0126
PSA NA

Steps
4

ePSA 2000-0123

PSA NA

Description

1
2
3
4

PSA NA
ePSA 2000-0132

Battery - the battery is reaching
the end of its usable life

An error occurred during the
tests that may involve the main
system board or battery of the
system.

1
2
3
4

PSA NA
ePSA 2000-0133

Battery - the battery cannot
supply sufficient power

An error occurred during the
tests that may involve the main
system board or battery of the
system.

1
2
3
4

PSA 2000-0141
ePSA 2000-0141

20

Running diagnostics

Hard Drive - no drive detected

Your system BIOS is reporting
1
that no Hard Disk Drive is being
reported. If a Portable, reseat
the hard drive, if a Desktop,
2
reseat both ends of the data
cable and reseat the power cable

Turn off the system and
reseat the system battery.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support.
Turn off the system and
reseat battery.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support.
If you don't have a hard
disk drive (HDD), this may
be an automatic message
and requires no action.
If you have an HDD,
reconnect your HDD to the
system board.

Error number (PSA and ePSA)

PSA 1000-0142
ePSA 2000-0142

Error message

PSA Hard Drive - drive self test
failed

Description

Steps

to the drive. Repeat the PSA
diagnostics. If a known good
hard drive is available, see if the
good drive can be detected in
the system or try the suspect
drive in a working system.

3
4

Your hard disk drive has
indicated a failure.

1
2

ePSA Hard Drive - self test
unsuccessful

5

3
PSA 1000-0143
ePSA 2000-0143
PSA 1000-0144
ePSA 2000-0144
PSA 1000-0145
ePSA 2000-0145

Hard Drive - SMART read
command unsuccessful

Hard Drive - no support for drive Your hard disk drive has
self test
indicated a failure.
PSA Hard Drive - timeout
waiting for Drive Self Test to
complete

The hard drive test did not
complete the last test
attempted.

1

2

3

4
5
6

ePSA 2000-0146

Hard Drive - self test log
contains previous errors

Your hard drive has indicated a
failure.

1
2
3
4

PSA 1000-0147
ePSA 2000-0147

Update to the latest BIOS.
Turn off your computer and
reconnect your hard disk
drive (HDD) to the system
board for instructions.)
Repeat the PSA
diagnostics.

Your hard disk drive has
indicated a failure.

ePSA Hard Drive - self test did
not complete

PSA 1000-0146

Update to the latest BIOS.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support

PSA Optical Drive - IDE status
failed.

Your CD or DVD drive has
indicated a failure.

1
2

Check Dell.com/support
for a firmware update for
your hard drive. Update the
firmware if one is available.
Reseat the drive, reseat the
data cable and power
connection at both ends if
it is desktop.
Turn off your computer and
reconnect your hard disk
drive (HDD) to the system
board. For more
information, see your
systems Owners Manual at
Dell.com/
poweredgemanuals.
Update to the latest BIOS.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Run a Chkdsk /r or format
your hard drive and reinstall
your operating system.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Turn off your computer and
reconnect your optical drive
to the system board.

Running diagnostics

21

Error number (PSA and ePSA)

Error message

Description

ePSA Optical Drive - self test -(s)
PSA 1000-0148
ePSA 2000-0148 replaced by
2000-0151, 2000-0152

PSA Optical Drive - BIST --(s)
ePSA Optical Drive - incorrect
status

Your CD or DVD drive has
indicated a failure.

Steps
3

Repeat the PSA
diagnostics.

1

Update to the latest BIOS
version.
Turn off your computer and
reconnect your optical drive
to the system board.
Repeat the PSA
diagnostics.

2
3

PSA NA
ePSA 2000-0149

Optical Drive - no drive detected Your CD or DVD drive has
indicated a failure.

1
2
3

PSA NA

Hard Drive - No drive detected.

ePSA 2000-0150 replaced
2000-0141

PSA NA

Hard Drive - BIST --(s)

ePSA 2000-0151

Your system BIOS is reporting
that no Hard Disk Drive is being
reported. If a Portable, reseat
the hard drive, if a Desktop,
reseat both ends of the data
cable and reseat the power cable
to the drive. Repeat the PSA
diagnostics. If a replacement
working hard drive is available,
see if the working hard drive is
detected by the system or try
the suspect drive in a working
system.

1

2

3
4
5

The hard drive is showing an
1
incorrect status in the
diagnostic. Check for a firmware 2
update for your hard drive.
3
4

PSA NA

Optical Drive - BIST --(s)

ePSA 2000-0152

The CD or DVD Drive is showing
an incorrect status in the
diagnostic.

1
2
3
4

PSA NA
ePSA 2000-0153

Hard Drive - Removable Hard
Drive [d] - Incorrect status = [x]
[s]

Check the installation of the
removable drive, cables, and
connections.

1

2

22

Running diagnostics

Update to the latest BIOS
version.
Turn off your computer and
reconnect your optical drive
to the system board.
Repeat the PSA
diagnostics.
If you don't have a hard
disk drive (HDD), this may
be an automatic message
and requires no action.
If you have an HDD,
reconnect your hard disk
drive (HDD) to the system
board.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
If you have a replacement
hard drive, install it.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS.
Remove and reinstall your
optical drive.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support
If you have an HDD,
reconnect your hard disk
drive (HDD) to the system
board.
Update to the latest BIOS
version.

Error number (PSA and ePSA)

Error message

Description

Steps
3
4

PSA NA
ePSA 2000-0154

Tape Drive - Tape Drive [s] - S/N Check installation of the tape
[d], incorrect status = [d] [d]
drive, cables and connections. If
the error persists, ensure that
the drive firmware is current.

1
2
3
4

PSA NA

Hard Drive - Not Installed

ePSA 2000-0155

This is an error displayed when
HDD is not inserted in the
notebooks.

1
2
3
4

PSA 1000-0212

System board - CMOS, Location An error occurred during the
= (x), Expected = (x), Found =
tests that may involve the main
ePSA 2000-0212 (Not used with
(x)
system board of the system.
UEFI BIOS)

1
2
3

PSA 1000-0213
ePSA 2000-0213 (Not used with
UEFI BIOS)

System board - CMOS battery
failure detected

PSA 1000-0221

PSA System board - Interval
timer Channel 0 (mode 0) is not
ePSA 2000-0221 (Not used with
generating interrupts
UEFI BIOS)
ePSA Timer - Interval timer not
functional

PSA 1000-0222

PSA System board - Interval
timer Channel 0 (mode 0) is not
ePSA 2000-0222 (Not used with
generating interrupts
UEFI BIOS)
ePSA Timer - Interval timer not
functional

An error occurred during the
tests involving the CMOS
battery (This maintains all the
settings in the BIOS when there
is no power to the system) On
desktop systems this is a easily
replaceable watch size battery,
some portable systems may have
a replaceable battery too.

1

An error occurred during the
tests that may involve the main
system board of the system. If a
memory error is detected, try
memory modules individually. If
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

1

An error occurred during the
tests that may involve the main
system board of the system. If a
memory error is detected, try
memory modules individually. If

1

2
3
4

2
3
4

2

Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support
Update to the latest BIOS.
Remove and reinstall your
tape drive.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support
Update to the latest BIOS
version.
Remove and reinstall your
Hard drive.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support.
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
Remove and replace your
CMOS battery.
If failure continues, contact
Dell Technical Support

Update to the latest BIOS
version.
Reseat the CMOS battery.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

Update to the latest BIOS
version.
Repeat the PSA
diagnostics

Running diagnostics

23

Error number (PSA and ePSA)

Error message

Description

Steps

no 2000-0123 memory error & If 3
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

If failure continues, contact
Dell Technical Support

System board - Timer - Interval
timer initial clock output level
ePSA 2000-0223 (Not used with
incorrect
UEFI BIOS)

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

PSA 1000-0224

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

System board - Failure in Interval An error occurred during the
1
timer in mode
tests that may involve the main
ePSA 2000-0231 (Not used with
system board of the system. If a 2
UEFI BIOS)
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

PSA 1000-0223

ePSA 2000-0224 (Not used with
UEFI BIOS)

System board - Interval timer
had wrong time period in mode

PSA 1000-0231

PSA 2000-0232
ePSA 2000-0232 (Not used with
UEFI BIOS)

PSA 2000-0233

System board - the RTC did not
generate periodic ticks

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

PSA System board - RTC
'seconds' count is not updating

An error occurred during the
1
tests that involve the Real Time
Clock (RTC) of the main system 2
board in the system. If a memory
error is detected, try memory
modules individually. If no

Update to the latest BIOS
version.
Repeat the PSA
diagnostics

ePSA 2000-0233 (Not used with
ePSA RTC - 'seconds' count is
UEFI BIOS)
not updating

24

Running diagnostics

Error number (PSA and ePSA)

Error message

Description

Steps

2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

3

If failure continues, contact
Dell Technical Support

PSA System board - timeout
waiting for RTC update flag to
ePSA 2000-0234 (Not used with
set
UEFI BIOS)
ePSA System board - HPET
incorrect time period.

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

PSA 1000-0235

System board - PM timer 1 had
wrong time period.

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

BIOS - A20 gate not enabled

An error occurred during the
1
tests that may involve the main
system board of the system. If a 2
memory error is detected, try
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

PSA System board - no interrupt An error occurred during the
1
detected for IRQ.
tests that may involve the main
ePSA 2000-0242 (Not used with
ePSA- System board - Interrupt system board of the system. If a 2
UEFI BIOS)
controller - IRQ (d) - %s not
memory error is detected, try
detected
memory modules individually. If
3
no 2000-0123 memory error & If
diagnostics fail again after the
BIOS is current, contact
Technical Support to resolve the
problem.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

PSA 1000-0234

ePSA NA

PSA 1000-0241
ePSA 2000-0241 (Not used with
UEFI BIOS)

PSA 1000-0242

PSA NA
ePSA 2000-0243

System board - USB device, IO
board, Daughter Card

An error occurred during the
1
tests that may involve the USB
controller or ports of the main
2
system board of the system.
Disconnect any USB devices and
run the diagnostic again. Test

Update to the latest BIOS
version.
Repeat the PSA
diagnostics

Running diagnostics

25

Error number (PSA and ePSA)

PSA NA

Error message

Description

Steps

USB devices in a different port.
Try a known good USB device.

3

If failure continues, contact
Dell Technical Support

System board - USB device

An error occurred during the
1
tests that may involve the USB
controller or ports of the main
2
system board of the system.
Disconnect any USB devices and 3
run the diagnostic again. Test
USB devices in a different port.
Try a known good USB device.

Update to the latest BIOS
version
Repeat the PSA
diagnostics
If failure continues, contact
Dell Technical Support

System board - USB device

An error occurred during the
1
tests that may involve the USB
controller or ports of the main
2
system board of the system.
Disconnect any USB devices and 3
run the diagnostic again. Test
USB devices in a different port.
Try a known good USB device.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Event Log - the log contains
failing records.

This relates to BIOS events in
servers only.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.

ePSA 2000-0244

PSA NA
ePSA 2000-0245

PSA NA
ePSA 2000-0251

PSA NA

1
2

System board - Data errors.

ePSA 2000-0261

Multiple memory DIMMs failed,
presumed to be caused by
motherboard issues.

1
2
3

PSA NA
ePSA 2000-0313

PSA NA
ePSA 2000-0314

Touchpad - pointing stick/
touchpad not detected

Thermal - the (s) reading (dc)
exceeds the thermal limit.

The mouse, touchpad, or
trackstick is not being detected
by the diagnostic tools. Power
the system off, reseat any cable
connection and check the BIOS
to ensure that the touchpad or
mouse has not been disabled.

1
2
3
4

The system board, heat sink, fan, 1
or processor are failing the
diagnostic tools.
2
3

PSA NA
ePSA 2000-0315

26

Running diagnostics

Sensor - the (s) reading (dc) is
lower than expected

The system board or sensor is
failing the diagnostic tools.

1
2
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
If your mouse, touchpad, or
pointing stick is
disconnected, reconnect it.
For laptops, make sure that
your touchpad is active.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Check the logs, the fan and
for any other signs of
overheating.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version
Check the system logs.
If failure continues, contact
Technical Support

Error number (PSA and ePSA)

Error message

Description

Steps

PSA 1000-0321

PSA LCD EDID - unable to
access EDID EEPROM

LCD Extended Display
Identification Data (EDID) unable to access the EDID
Electrically Erasable
Programmable Read-Only
Memory (EEPROM) in the LCD
display is indicating a data failure.
If there is video on the LCD, then
the Display does not need
replacement.

1

ePSA 2000-0321

PSA 1000-0322
ePSA 2000-0322

PSA NA
ePSA 2000-0323

ePSA Unable to detect LCD

2
3
4

PSA LCD Panel - Error accessing LCD panel - unable to modify
1
the LCD inverter
brightness. Try to adjust the
brightness in Windows using the 2
ePSA LCD panel - unable to
modify brightness
hotkeys. Boot to the BIOS and
see if brightness can be adjusted 3
at that point outside of Windows
4

Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

LCD panel - Unable to detect
inverter lamp status.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Check LCD connector and
cables.

1
2
3

PSA NA
ePSA 2000-0324

PSA NA
ePSA 2000-0325

PSA 1000-0326
ePSA 2000-0326

Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics..
If failure continues, contact
Dell Technical Support

LCD panel - user reported LCD
BIST colors were not displayed

LCD panel - user provided no
input for LCD BIST

LCD panel - unable to turn lamp
on or off

You may get this error if you
answered No to the LCD
BIST test instead of Yes. If you
were able to clearly see red, blue,
green white and white screen
with text without distortion,
lines, or color problems, re-run
the diagnostic and if the screens
appear normal, click Yes

1

You may get this error if you
answered No to the LCD
BIST test instead of Yes. If you
were able to clearly see red, blue,
green white and white screen
with text without distortion,
lines, or color problems, re-run
the diagnostic and if the screens
appear normal, click Yes

1

The backlight lamp was not able
to be turned on or off during the
diagnostic testing

1
2

2
3
4

2
3
4

3
4

Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the LCD BIST
diagnostics.
If failure continues, contact
Dell Technical Support

Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support

Update to the latest BIOS.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support

Running diagnostics

27

Error number (PSA and ePSA)

Error message

Description

Steps

PSA NA

LCD panel - unable to use BIOS
interface

The Liquid Crystal Display (LCD)
panel does not display the BIOS
screen correctly and the BIOS
screen is not visible on the LCD.

1
2

ePSA 2000-0327

3
4

PSA NA
ePSA 2000-0328

LCD panel - unable to detect
variance in ambient light sensor

The Ambient light sensor that
automatically dims the LCD in
low light did not respond during
the diagnostics.

1
2
3
4

PSA NA
ePSA 2000-0331

PSA NA
ePSA 2000-0332

Video controller - no video
controller detected

Video memory - Video memory
integrity test discrepancy

The system is not detecting the
graphics adapter. If you are using
a desktop system and if a PCIe
expansion card is installed, reseat
the card and reconnect any
internal power connections to
the card. Then reconnect the
video cable and repeat the PSA
diagnostic.

1
2

3
4
5

PSA diagnostics detected a
1
video memory failure. Please
reset the system memory and
2
update the BIOS to most current 3
version.
4
5

PSA 1000-0333
ePSA 2000-0333

PSA Video - Graphics test timed PSA diagnostics did not record
out waiting for keyboard
the user input (Y or N) after the
response
video test. Ensure that you
accurately answer queries that
ePSA Video - User provided no
are prompted during the
input for graphics test
diagnostic.

1

2
3
4
5

PSA 1000-0334
ePSA 2000-0334

28

Running diagnostics

Video - user reported the
patterns were not displayed
correctly

You may get this error if you
answered No to the color test
instead of Yes. If you were able
to clearly see both the vertical

1
2

Update to the latest BIOS.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Desktop: Turn off your
computer and if equipped
with PCIe expansion card,
reseat the card.
Reconnect your LCD cable.
Repeat the PSA
diagnostics..
If failure continues,, contact
Dell Technical Support
Update to the latest BIOS
version.
Reseat the system memory
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues,, contact
Dell Technical Support
Error occurs when a
graphics test times out
while waiting for you to
enter a response.
Update to the latest BIOS
version.
Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
You may get this error if
you answered No to the
color test instead of Yes.
Update to the latest BIOS
version.

Error number (PSA and ePSA)

PSA NA

Error message

Cables - not detected

ePSA 2000-0411

Description

Steps

and horizontal color bars without 3
distortion, lines, or color
problems, re-run the diagnostic
4
and if the bar appears normal,
click Yes.
5

Turn off your computer and
reconnect your LCD cable.
Repeat the PSA
diagnostics..
If failure continues, contact
Dell Technical Support

Normally, the cable involved in
1
the error (LCD LVDS CABLE for
2
example) is indicated in the error
message. Reseat the cable
connection and inspect the cable
and connections for damage.
3

Update to the latest BIOS.
Turn off your system and
reconnect the cable,
jumper, or connection
indicated in the error
message.
Replace damaged cables or
devices.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

4
5
PSA NA
ePSA 2000-0412

Cables - auxiliary cable not
detected

Normally, the cable involved in
1
the error (LCD LVDS CABLE for
example) is indicated in the error 2
message. Reseat the cable
connection and inspect the cable
and connections for damage.
3
4
5

PSA NA
ePSA 2000-0413

Cables - LCD cable not detected Normally, the cable involved in
1
the error (LCD LVDS CABLE for
example) is indicated in the error 2
message. Reseat the cable
connection and inspect the cable
and connections for damage.
3
4
5

PSA NA
ePSA 2000-0414

Cables - Invertor cable not
detected

Normally, the cable involved in
1
the error (LCD LVDS CABLE for
example) is indicated in the error 2
message. Reseat the cable
connection and inspect the cable
and connections for damage.
3
4

Update to the latest BIOS
version.
Turn off your system and
reconnect the cable,
jumper, or connection
indicated in the error
message.
Replace damaged cables or
devices.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Turn off your system and
reconnect the cable,
jumper, or connection
indicated in the error
message.
Replace damaged cables or
devices.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Turn off your system and
reconnect the cable,
jumper, or connection
indicated in the error
message.
Replace damaged cables or
devices.
Repeat the PSA
diagnostics.

Running diagnostics

29

Error number (PSA and ePSA)

PSA NA
ePSA 2000-0415

Error message

Cables - Check the following
cables, jumper, connection, or
sensors: [s]

Description

Steps
5

If failure continues, contact
Dell Technical Support

Normally, the cable involved in
1
the error (LCD LVDS CABLE for
example) is indicated in the error 2
message. Reseat the cable
connection and inspect the cable
and connections for damage.

Update to the latest BIOS
version.
Turn off your system and
reconnect the cable,
jumper, or connection
indicated in the error
message.
Replace damaged cables or
devices.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

3
4
5
PSA NA
ePSA 2000-0511

Fan - the (s) fan failed to
respond correctly

The cooling fan is not responding 1
to the diagnostics tool. Please
check nothing is obstructing the 2
fan.
3
4

PSA NA
ePSA 2000-0512

Fan - the (s) fan is running faster This could be an issue with the
than expected.
system board, temperature
sensor, or cooling fan. Please
check everything is firmly
seated.

1
2

3
4
PSA NA
ePSA 2000-0620

PSA NA
ePSA 2000-0621

Network - Network [d] {Vendor supplied error}

Network - Network [d] - Driver
version [x] outdated. Version [x]
or newer required for "[s]"

The Network (Ethernet) device
may be faulty. Depending on the
system, this may be on the
motherboard, an adapter, or a
daughter card. Errors include
Register test failed or
Packet loopback test
failed.

1

The Network (Ethernet) device
Firmware may be out of date.

1

2
3
4

2
3

30

Running diagnostics

Update to the latest BIOS
version.
Check your system for any
obstructions to the air
vents or cooling fan.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Check for any loose
connections, any
obstructed air vents, and
for any other signs of
overheating.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS/
Firmware versions.
Check for any loose
connections or any bent
pins in the connectors.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS/
Firmware versions.
Repeat the PSA
diagnostics.
If failure continues, contact
Technical Support

Error number (PSA and ePSA)

Error message

PSA NA

BIOS - No BIOS support for
The motherboard BIOS revision 1
software interrupt [x] function[x] may not be current. Update the
[x]
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - No BIOS support for SMI
interface function(x) or Sensor
[x] exceeded thermal zone [d].
Peak zone was [d].

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Fan - Unable to set
Manufacturing Mode

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Fan - Unable to
determine fan speeds

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

LCD - LCD BIST not supported

The LCD BIST may not exist on
all systems.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

ePSA 2000-8001

PSA NA
ePSA 2000-8002

PSA NA
ePSA 2000-8003

PSA NA
ePSA 2000-8004

PSA NA
ePSA 2000-8005

Description

Steps

1
2
3

PSA NA
ePSA 2000-8006

PSA NA
ePSA 2000-8007

BIOS - Fan - Unable to set fans
to ([d], [d], or [d]) speed or No
chipset event timer.

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Log contains Fan events
or Timer expected [d] observed
[d]

This message is informational
only - it provides a record of fan
events.

Clear Log.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

1
2
3

PSA NA
ePSA 2000-8008

Diagnostics - A. Out of memory!
fMalloc() Failed!

The system may be unstable and 1
should be rebooted.

B. Unable to allocate memory for
object data. Unable to [s]
testable memory

2
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Running diagnostics

31

Error number (PSA and ePSA)

Error message

Description

Steps

C. Unable to start application
processors
D. Unable to stop all APs
PSA NA

Diagnostics - Software Error

ePSA 2000-8009

The System may be unstable and 1
should be rebooted.
2
3

PSA NA
ePSA 2000-800B

BIOS - Retrieve vendor ID
function error

The system may be unstable.

1
2
3

PSA NA
ePSA 2000-800C

PSA NA
ePSA 2000-800D

PSA NA
ePSA 2000-800E

ePSA 2000-800F

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Set lamp off function
error. Vendor: [s] Revision: [d]

The motherboard BIOS revision 1
may not be current. Update the
BIOS to the most current version 2
and the issue should resolve.
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Set lamp on function
error. Vendor: [s] Revision: [d]

The motherboard BIOS revision
may not be current.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

1
2

BIOS - Restore function error.
Vendor: [s] Revision: [d].

The motherboard BIOS revision
may not be current.

1
2
3

PSA NA

System Board

ePSA 2000-8010

High-Precision event timer not
found.

1
2
3

32

Running diagnostics

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

BIOS - Get/Set inverter mode
function error. Vendor: [s]
Revision: [d]

3
PSA NA

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Error number (PSA and ePSA)

Error message

PSA NA

USB Device Diagnostics - Invalid The attached USB device is
status returned from the device returning an invalid status to the
BIOS.

ePSA 2000-8011

Description

Steps
1
2
3

PSA NA
ePSA 2000-8012

Diagnostics - Invalid parameter
passed to the device. Unknown
test [d] selected

An unexpected parameter was
passed to a device under test.

1
2
3

PSA NA
ePSA 2000-8013

Diagnostics - LCD [s] doesn't
support test commands

Update the BIOS to the most
current version and the issue
should resolve.

1
2
3

PSA NA
ePSA 2000-8014

PSA NA
ePSA 2000-8016

Diagnostics - Battery - unable to The motherboard BIOS revision
retrieve battery health
may not be current.

Reseat the Battery
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

1
2

BIOS - Battery - BIOS has no
support for battery health

This optional feature may not be
supported.

1
2
3

PSA NA
ePSA 2000-8018

Diagnostics - Fatal: The module
reported multiple test results!!

The system may be unstable.

1
2
3

PSA NA
ePSA 2000-8019

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

4

ePSA 2000-8017

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Diagnostics - ADDF module ([s]) The diagnostics program has run 1
device ([s]) failed with error
an external module, which has
2
code [x], number [x]. No EPSA
reported an unusual error.
beep code mapped!
3

3

PSA NA

Reseat the USB Device.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Diagnostics - Unable to log to
NVRAM

The system may be unstable.

1
2
3

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Running diagnostics

33

Error number (PSA and ePSA)

Error message

Description

Steps

PSA NA

Diagnostics - Low memory. [d]k
bytes free!

The system may be unstable.

1

ePSA 2000-8020

2
3

PSA NA
ePSA 2000-8115

Diagnostics - Unable to stop all
APs

The system may be unstable.

1
2
3

PSA NA
ePSA 2000-8154

Tape Drive - Tape Drive [d] - S/N Install the correct tape drive
[s], ULTRIUM [d] media found
media.
but drive. Requires ULTRIUM [s]
for [s]

1
2
3

PSA NA
ePSA 2000-8155

Tape Drive - Tape Drive [d] - S/N Try different tape drive media.
[s], data read does not match
data written

1
2
3

PSA NA
ePSA 2000-8156

Tape Drive - Tape Drive [d] - S/N Insert writable tape drive media.
[s], no media cannot test drive

1
2
3

PSA NA
ePSA 2000-8157

Tape Drive - Tape Drive [d] - S/N Tape drive model is unknown and 1
[s], drive is not a supported drive not supported by diagnostics.
2
3
4

PSA NA
ePSA 2000-8158

PSA NA
ePSA 2000-8160

Use correct tape drive
media.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Use different tape drive
media.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Insert writable media.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Reseat the Drive.
Test with a known good
drive if possible.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

The string indicates the
1
backplane, expander, or
removable hard drive is reporting 2
an incorrect status.
3

Reseat the drives/cables/
connections.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

PERC - PERC Battery [d] incorrect status = [x], [s]

The [s] string may be one of the
following messages:

Reseat parts.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

•

Running diagnostics

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Backplane - [DRIVE] Drive [d] incorrect status = [x], [s]

•

34

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Battery missing or
disconnected
Replace battery pack

1
2
3

Error number (PSA and ePSA)

Error message

Description
•

PSA NA
ePSA 2000-8165

PSA NA

OS - MBR code is unknown,
possibly infected by a virus

ePSA 2000-8170

PSA NA
ePSA 2000-8415

Remaining capacity low

The Master Boot Record on the
Hard Disk Drive has
unrecognized code present
which is an indication of a virus
infecting the system. It is
recommended that you scan
your system with an up-to-date
antivirus application.

1

OS - Detected virus (s)

A virus appears to be infecting
the system. It is recommended
that you scan your system with
an up-to-date antivirus
application.

1

Update and run your antivirus software.

PCIe - Training error on device
PciTag [s] VendorID [x] DeviceID
[x] SVid [x] SDid [x] Link
Degraded, maxLinkWidth = x[d],
negotiatedLinkWidth = x[d]

The system is reporting an error
on a PCIe adapter.

1
2

Reseat the PCIe adapter.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Cables - BIOS reports no
testable cable/jumper/
connector/sensor

An internal failure of the
diagnostic tool, not a hardware
failure.

ePSA 2000-8166

PSA NA

Steps

2

3

1
2
3

PSA NA
ePSA 2000-8611

Audio - User reported not
hearing speaker tones

If the beep was not heard, verify
that the internal speakers are
connected properly. On some
systems, internal speakers are a
purchased option. For a given
system, if the speaker is optional,
it might not exist on that system.
If audio is not heard, make sure
to unplug anything plugged into
an external headphone jack and
re-run the test. External HP
jacks can mute the internal
speakers at the hardware level.

1
2
3
4

Update and run your antivirus software.
Update or reinstall your OS.

Update to the latest BIOS
version.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support
Unplug all external Audio
Jacks.
Reseat the internal speaker
connector.
Repeat the PSA
diagnostics.
If failure continues, contact
Dell Technical Support

Debugging mini crash dump files using by WinDbg in
Windows operating system
Prerequisites
1

Click Start > Control Panel > System.

Running diagnostics

35

Figure 9. Opening the Systems page
2

In the System page, click Advanced system settings in the left pane.

Figure 10. Advanced system settings page
3

36

In the System Properties window, click Settings under the Startup and Recovery section.

Running diagnostics

Figure 11. System Properties window
4

In the Startup and Recovery window, System failure section, do the following:
a

Select Write an event to the system log to ensure that the minidump file is created in the event of a system failure.

b

Select Automatically restart to restart the system after a blue screen of death (BSOD) occurs.
NOTE: For servers, it is recommended that you select the Automatically restart option so that the server can function
if the error is not critical.

c

Verify that the Overwrite any existing file option is not selected. This ensures that a record of failures is maintained if there are
repeated occurrences of system failures.

Running diagnostics

37

Figure 12. Startup and Recovery window
5

When a BSOD occurs, collect the minidump files by navigating to C:\windows\minidump.

6

Download and install the Windows Debugger (WinDbg) tool from https://msdn.microsoft.com/en-us/windows/hardware/
hh852365.aspx.

Steps
1
Open WinDbg.
2

Click File → Symbol File Path or press Ctrl + S on the keyboard.

3

In the Symbol Path field, enter SRV**http://msdl.microsoft.com/download/symbols where,  is the path
where you want to download the symbol files.

4

Click OK.

5

Click File → Open Crash Dump or press Ctrl + D.

6

Navigate to the folder where the mini dump file is saved.

7

Select the mini dump file and click OK.

8

Click No when the system prompts to save information for workspace.
Debugging starts.

9

Click !analyze –v to get detailed debugging information.

10

Note down the values for the following parameters:
a
b

38

DEFAULT_BUCKET_ID
MODULE_NAME

Running diagnostics

c
11

IMAGE_NAME

Call Dell Technical Support for further assistance.

Running diagnostics

39

4
Troubleshooting hardware issues
This section helps you troubleshoot hardware issues in your system.
NOTE: If the issue still persists, contact Dell Technical Support for assistance.

Topics:
•

Troubleshooting no power issues

•

Troubleshooting system startup failure

•

Troubleshooting external connections

•

Troubleshooting the video subsystem

•

Troubleshooting a USB device

•

Troubleshooting a serial I/O device

•

Troubleshooting a NIC

•

Troubleshooting a wet system

•

Troubleshooting a damaged system

•

Troubleshooting the system battery

•

Troubleshooting cooling problems

•

Troubleshooting cooling fans

•

Troubleshooting an internal USB key

•

Troubleshooting an SD card

•

Troubleshooting an optical drive

•

Troubleshooting a tape backup unit

•

Troubleshooting a storage controller

•

Troubleshooting expansion cards

•

Troubleshooting processors

•

Troubleshooting hard drives

•

Troubleshooting system memory

•

Troubleshooting memory module issues

•

Troubleshooting power supply units

•

Troubleshooting RAID

•

Troubleshooting thermal issue

Troubleshooting no power issues
Prerequisite
Ensure all power supplies are firmly seated, power cables are connected, and both power supplies are operating.
Steps
1
Swap the AC power cable with a known-good power cable.
If the system works with a known-good AC power cable, replace the power cable.
2

40

Check the status of the power indicator on the power supply unit (PSU).

Troubleshooting hardware issues

•
•
•

If the power indicator is lit Green, reseat the PSU. If this does not fix the issue, test the system with a good PSU.
If the power indicator is lit Amber, this indicates a PSU fault condition. Replace the PSU with a good PSU and check the indicator
status.
If the issue is resolved, consider replacing the PSU.

Troubleshooting system startup failure
If you boot the system to the BIOS boot mode after installing an operating system from the UEFI Boot Manager, the system stops
responding. To avoid this issue, you must boot to the same boot mode in which you installed the operating system.
For all other startup issues, note the system messages that appear on the screen.

Troubleshooting external connections
Before troubleshooting any external devices, ensure that all external cables are securely attached to the external connectors on your
system before troubleshooting any external devices.

Troubleshooting the video subsystem
Prerequisite
NOTE: Ensure the Local Server Video Enabled option is selected in the iDRAC Graphical User Interface (GUI), under Virtual
Console. If this option is not selected, local video is disabled.
Steps
1
Check the cable connections (power and display) to the monitor.
2

Check the video interface cabling from the system to the monitor.

3

Run the appropriate diagnostic test.

If the tests run successfully, the problem is not related to video hardware.
Next step
If the tests fail, see the Getting help section.

Troubleshooting a USB device
Prerequisite
NOTE: Follow steps 1 to 5 to troubleshoot a USB keyboard or mouse. For other USB devices, go to step 6.
Steps
1
Disconnect the keyboard and/or mouse cables from the system and reconnect them.
2

If the problem persists, connect the keyboard and/or mouse to another USB port on the system.

3

If the problem is resolved, restart the system, enter System Setup, and check if the non-functioning USB ports are enabled.

4

In iDRAC Settings Utility, ensure that USB Management Port Mode is configured as Automatic or Standard OS Use.

5

If the problem is not resolved, replace the keyboard and/or mouse with a known working keyboard or mouse.
If the problem is not resolved, proceed to step 6 to troubleshoot other USB devices attached to the system.
If the problem is not resolved, proceed to troubleshoot other USB devices attached to the system.

6

Turn off all attached USB devices, and disconnect them from the system.

7

Restart the system.

8

If your keyboard is functioning, enter System Setup, verify that all USB ports are enabled on the Integrated Devices screen. If your
keyboard is not functioning, use remote access to enable or disable the USB options.

9

If the system is not accessible, reset the NVRAM_CLR jumper inside your system and restore the BIOS to the default settings. See the
System board jumper setting section

Troubleshooting hardware issues

41

10

In the IDRAC Settings Utility, ensure that USB Management Port Mode is configured as Automatic or Standard OS Use.

11

Reconnect and turn on each USB device one at a time.

12

If a USB device causes the same problem, turn off the device, replace the USB cable with a known good cable, and turn on the
device.

Next step
If all troubleshooting fails, see the Getting help section.

Troubleshooting iDRAC Direct (USB XML configuration)
For information about USB storage device and system configuration, see Integrated Dell Remote Access Controller User's Guide at
Dell.com/idracmanuals.
Steps
1

Ensure that your USB storage device is connected to the front USB Management Port, identified by

icon.

2

Ensure that your USB storage device is configured with an NTFS or an FAT32 file system with only one partition.

3

Verify that the USB storage device is configured correctly. For more information about configuring the USB storage device, see
Integrated Dell Remote Access Controller User's Guide at Dell.com/idracmanuals.

4

In the iDRAC Settings Utility, ensure that USB Management Port Mode is configured as Automatic or iDRAC Direct Only.

5

Ensure that the iDRAC Managed: USB XML Configuration option is either Enabled or Enabled only when the server has default
credential settings.

6

Remove and reinsert the USB storage device.

7

If import operation does not work, try with a different USB storage device.

Next step
If all troubleshooting fails, see the Getting help section.

Troubleshooting iDRAC Direct (Laptop connection)
For information about USB laptop connection and system configuration, see the Integrated Dell Remote Access Controller User's Guide at
Dell.com/idracmanuals.
Steps
1

Ensure that your laptop is connected to the front USB Management Port, identified by

icon with a USB Type A/A cable.

2

On the iDRAC Settings Utility screen, ensure that USB Management Port Mode is configured as Automatic or iDRAC Direct Only.

3

If the laptop is running Windows operating system, ensure that the iDRAC Virtual USB NIC device driver is installed.

4

If the driver is installed, ensure that you are not connected to any network through WiFi or cabled ethernet, as iDRAC Direct uses a
non-routable address.

Next step
If all troubleshooting fails, see the Getting help section.

Troubleshooting a serial I/O device
Steps
1
Turn off the system and any peripheral devices connected to the serial port.
2

Swap the serial interface cable with a known working cable, and turn on the system and the serial device.
If the problem is resolved, replace the interface cable with a known working cable.

3

Turn off the system and the serial device, and swap the serial device with a compatible device.

4

Turn on the system and the serial device.

42

Troubleshooting hardware issues

Next step
If the problem persists, see the Getting help section.

Troubleshooting a NIC
Steps
1
Run the appropriate diagnostic test. For more information, see the Using system diagnostics section for the available diagnostic tests.
2

Restart the system and check for any system messages pertaining to the NIC controller.

3

Check the appropriate indicator on the NIC connector:
•

If the link indicator does not glow, the cable connected might be disengaged.

•

If the activity indicator does not glow, the network driver files might be damaged or missing.
Install or replace the drivers as necessary. For more information, see the NIC documentation.

•

Try another known good network cable.

4

• If the problem persists, use another connector on the switch or hub.
Ensure that the appropriate drivers are installed and the protocols are bound. For more information, see the NIC documentation.

5

Enter System Setup and confirm that the NIC ports are enabled on the Integrated Devices screen.

6

Ensure that all the NICs, hubs, and switches on the network are set to the same data transmission speed and duplex. For more
information, see the documentation for each network device.

7

Ensure that all the NICs and switches on the network are set to the same data transmission speed and duplex. For more information,
see the documentation for each network device.

8

Ensure that all network cables are of the proper type and do not exceed the maximum length.

Next step
If the problem persists, see the Getting help section.

Troubleshooting a wet system
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1
Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.
2

Remove the system cover.

3

Remove the following components (if installed) from the system:
•

Power supply unit(s)

•

Optical drive

•

Hard drives

•

Hard drive backplane

•

USB memory key

•

Hard drive tray

•

Cooling shroud

•

Expansion card risers (if installed)

•

Expansion cards

•

Cooling fan assembly (if installed)

•

Cooling fan(s)

•

Memory modules

Troubleshooting hardware issues

43

•

Processor(s) and heat sink(s)

4

• System board
Let the system dry thoroughly for at least 24 hours.

5

Reinstall the components you removed in step 3 except the expansion cards.

6

Install the system cover.

7

Turn on the system and attached peripherals.
If the problem persists, see the Getting help section.

8

If the system starts properly, turn off the system, and reinstall all the expansion cards that you removed.

9

Run the appropriate diagnostic test. For more information, see the Using system diagnostics section.

Next step
If the tests fail, see the Getting help section.

Troubleshooting a damaged system
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1
Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.
2

Remove the system cover.

3

Ensure that the following components are properly installed:
•

cooling shroud

•

expansion card risers (if installed)

•

expansion cards

•

power supply unit(s)

•

cooling fan assembly (if installed)

•

cooling fan(s)

•

processor(s) and heat sink(s)

•

memory modules

4

• hard drive carriers or cage
Ensure that all cables are properly connected.

5

Install the system cover.

6

Run the appropriate diagnostic test. For more information, see the Using system diagnostics section.

Next step
If the problem persists, see the Getting help section.

Troubleshooting the system battery
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: If the system is turned off for long periods of time (for weeks or months), the NVRAM may lose the system configuration
information. This situation is caused by a defective battery.

44

Troubleshooting hardware issues

NOTE: Some software may cause the system time to speed up or slow down. If the system seems to operate normally except for
the time set in System Setup, the problem may be caused by a software, rather than by a defective battery.
Steps
1
Re-enter the time and date in System Setup.
2

Turn off the system, and disconnect it from the electrical outlet for at least an hour.

3

Reconnect the system to the electrical outlet, and turn on the system.

4

Enter System Setup.
If the date and time displayed in System Setup are not correct, check the System Error Log (SEL) for system battery messages.

Next step
If the problem persists, see the Getting help section.

Troubleshooting cooling problems
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Ensure that the following conditions exist:
•

System cover, cooling shroud, EMI filler panel, memory module blank, or back filler bracket is not removed.

•

Ambient temperature is not higher than the system specific ambient temperature.

•

External airflow is not obstructed.

•

A cooling fan is not removed or has not failed.

•

The expansion card installation guidelines have been followed.

Additional cooling can be added by one of the following methods:
From the iDRAC web GUI:
1

Click Hardware > Fans > Setup.

2

From the Fan Speed Offset drop-down list, select the cooling level required or set the minimum fan speed to a custom value.

From F2 System Setup:
1

Select iDRAC Settings > Thermal, and set a higher fan speed from the fan speed offset or minimum fan speed.

From RACADM commands:
1

Run the command racadm help system.thermalsettings

For more information, see the Integrated Dell Remote Access User’s Guide at Dell.com/idracmanuals.

Troubleshooting cooling fans
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: The fan number is referenced by the management software of the system. In the event of a problem with a particular fan,
you can easily identify and replace it by noting down the fan numbers on the cooling fan assembly.
1

Follow the safety guidelines listed in the Safety instructions section.

Troubleshooting hardware issues

45

2

Follow the procedure listed in the Before working inside your system section.

Steps
1
Reseat the fan or the fan's power cable.
2

Restart the system.

Next steps
1

Follow the procedure listed in the After working inside your system section.

2

If the problem persists, see the Getting help section.

Troubleshooting an internal USB key
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1
Enter System Setup and ensure that the USB key port is enabled on the Integrated Devices screen.
2

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

3

Remove the system cover.

4

Locate the USB key and reseat it.

5

Install the system cover.

6

Turn on the system and attached peripherals, and check if the USB key is functioning.

7

If the problem is not resolved, repeat step 2 and step 3.

8

Insert a known working USB key.

9

Install the system cover.

Next step
If the problem persists, see the Getting help section.

Troubleshooting an SD card
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: Certain SD cards have a physical write-protect switch on the card. If the write-protect switch is turned on, the SD card is
not writable.
Steps
1
Enter System Setup, and ensure that the Internal SD Card Port is enabled.
2

Turn off the system, including any attached peripherals, and disconnect the system from the electrical outlet.

3

Remove the system cover.
NOTE: When an SD card failure occurs, the internal dual SD module controller notifies the system. On the next restart, the
system displayed a message indicating the failure. If redundancy is enabled at the time of SD card failure, a critical alert will
be logged and chassis health will degrade.

4

Replace the failed SD card with a new SD card.

5

Install the system cover.

46

Troubleshooting hardware issues

6

Reconnect the system to its electrical outlet and turn on the system, including any attached peripherals.

7

Enter System Setup, and ensure that the Internal SD Card Port and Internal SD Card Redundancy modes are set to the needed
modes.
Verify that the correct SD slot is set as Primary SD Card.

8

Check if the SD card is functioning properly.

9

If the Internal SD Card Redundancy option is set to Enabled at the time of the SD card failure, the system prompts you to perform a
rebuild.
NOTE: The rebuild is always sourced from the primary SD card to the secondary SD card.

Troubleshooting an optical drive
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1

Try using a different CD or DVD.

2

If the problem is not resolved, enter System Setup and ensure that the integrated SATA controller and the drive's SATA port are
enabled.

3

Run the appropriate diagnostic test.

4

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

5

If installed, remove the bezel.

6

Remove the system cover.

7

Ensure that the interface cable is securely connected to the optical drive and to the controller.

8

Ensure that a power cable is properly connected to the drive.

9

Install the system cover.

Next step
If the problem persists, see the Getting help section.

Troubleshooting a tape backup unit
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1

Use a different tape cartridge.

2

Ensure that the device drivers for the tape backup unit are installed and are configured correctly. See your tape drive documentation
for more information about device drivers.

3

Reinstall the tape-backup software as instructed in the tape-backup software documentation.

4

Ensure that the interface cable of the tape drive is connected to the external port on the controller card.

5

Perform the following steps to ensure that the controller card is properly installed:
a
b
c
d
e

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.
Remove the system cover.
Reseat the controller card in the expansion card slot.
Install the system cover.
Turn on the system and attached peripherals.

Troubleshooting hardware issues

47

6

Run the appropriate diagnostic test. For more information, see the Using system diagnostics.

Next step
If you cannot resolve the problem, see the Getting help section.

Troubleshooting a storage controller
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: When troubleshooting a controller, see the documentation for your operating system and the controller.
1

Run the appropriate diagnostic test. See the Using system diagnostics section.

2

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

3

Remove the system cover.

4

Verify that the installed expansion cards are compliant with the expansion card installation guidelines.

5

Ensure that each expansion card is firmly seated in its connector.

6

Install the system cover.

7

Reconnect the system to the electrical outlet, and turn on the system and attached peripherals.

8

If the problem is not resolved, turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

9

Remove the system cover.

10

Remove all expansion cards installed in the system.

11

Install the system cover.

12

Reconnect the system to the electrical outlet, and turn on the system and attached peripherals.

13

Run the appropriate diagnostic test. See the Using system diagnostics section. If the tests fail, see the Getting help section.

14

For each expansion card you removed in step 10, perform the following steps:
a

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

b

Remove the system cover.

c

Reinstall one of the expansion cards.

d

Install the system cover.

e

Run the appropriate diagnostic test. See the Using system diagnostics section.

If the problem persists, see the Getting help section.

Troubleshooting expansion cards
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: When troubleshooting an expansion card, you also have to see the documentation for your operating system and the
expansion card.
Steps
1
Run the appropriate diagnostic test. See the Using system diagnostics section.
2

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

3

Remove the system cover.

4

Ensure that each expansion card is firmly seated in its connector.

48

Troubleshooting hardware issues

5

Install the system cover.

6

Turn on the system and attached peripherals.

7

If the problem is not resolved, turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

8

Remove the system cover.

9

Remove all expansion cards installed in the system.

10

Install the system cover.

11

Run the appropriate diagnostic test. See the Using system diagnostics section.
If the tests fail, see the Getting help section.

12

For each expansion card you removed in step 8, perform the following steps:
a Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.
b Remove the system cover.
c Reinstall one of the expansion cards.
d Install the system cover.
e Run the appropriate diagnostic test. See the Using system diagnostics section.

Next step
If the problem persists, see the Getting help section.

Troubleshooting processors
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1
Run the appropriate diagnostics test. See the Using system diagnostics section.
2

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

3

Remove the system cover.

4

Ensure that the processor and heat sink are properly installed.

5

Install the system cover.

6

Run the appropriate diagnostic test. See the Using system diagnostics section.

7

If the problem persists, see the Getting help section.

Troubleshooting hard drives
Prerequisites
CAUTION: This troubleshooting procedure can erase data stored on the hard drive. Before you proceed, back up all files on the
hard drive.
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: Ensure that the sleds internal SATA cables are connected correctly.
Steps
1
Run the appropriate diagnostic test. See the Using system diagnostics section.
Depending on the results of the diagnostics test, proceed as required through the following steps.
2

If your system has a RAID controller and your hard drives are configured in a RAID array, perform the following steps:
a Restart the system and press F10 during system startup to run the Dell Lifecycle Controller, and then run the Hardware
Configuration wizard to check the RAID configuration.

Troubleshooting hardware issues

49

See the Dell Lifecycle Controller documentation or online help for information about RAID configuration.
Ensure that the hard drives are configured correctly for the RAID array.
Take the hard drive offline and reseat the drive.
Exit the configuration utility and allow the system to boot to the operating system.

b
c
d
3

Ensure that the needed device drivers for your controller card are installed and are configured correctly. For more information, see the
operating system documentation.

4

Restart the system and enter the System Setup.

5

Verify that the controller is enabled and the drives are displayed in the System Setup.

Next step
If the problem persists, see the Getting help section.

Checking hard drive status in the PERC BIOS
1

Press Ctrl + R during the POST process to open PERC BIOS Configuration Utility.

Figure 13. PERC Configuration Utility VD Mgmt screen
2

To check if the hard drive is participating in the RAID array, perform the following steps:
a

50

Press Ctrl + N to go to the PD Mgmt (Physical Disk Management) screen and check if any hard drives are offline or missing.

Troubleshooting hardware issues

Figure 14. PERC Configuration Utility PD Mgmt screen

Hard drive status

Description

Offline

Hard drive is not part of the RAID array.

Online

Hard drive is part of the RAID array.

Ready

Hard dive is ready to be a part of the RAID array.

FAQs
How do I identify a hard drive failure?
Hard drive failures may occur because of logical, head, or mechanical failures. The following table describes the symptoms of failing hard
drives:
Symptom

Description

Noisy hard drives

Noisy hard drives indicate head crash, noisy bearings, or a failed
motor.

Data and disk errors

Files may not get saved or programs may stop responding.

Hard drive not detected

Operating system does not detect the hard drive.

System crash

Blue screen errors.

Slow read-write process

Delay in reading or writing data on the hard drive.

Perform the following steps if any of these symptoms occur:
1

Check if the hard drives are installed correctly.

2

Check the hard drive indicators. For more information, see Hard drive indicator codes section.

3

For systems with LED indicators, check the diagnostic LEDs. For more information, see Status LED indicators section.

Troubleshooting hardware issues

51

NOTE: The diagnostic LED indicators might be different for different
systems.
4

For systems with LCD panel, check for the following error codes:
Error Code

Error Message

Description

E1810

Hard drive 
fault.

Hard drive  has had a fault as determined by the SAS subsystem.

E1811

Hard drive 
rebuild aborted.

Drive  has had its rebuild aborted.

E1812

Hard drive 
removed.

Drive  has been removed from the system.

5

Check the hard drive status in SupportAssist.

6

Check the hard drive status in the RAID BIOS. For more information, see Checking hard drive status in the PERC BIOS section.

If you have any of the above error conditions, contact Dell Technical Support. When you call, keep the SupportAssist report and the
troubleshooting logs ready.

How do I extract RAID-Controller Logs using MegaCLI for Windows and
Linux?
NOTE: To export information about the status of the RAID controller and the connected hard drives, use the MegaCLI tool by LSI
(Avagotech).
To create the appropriate diagnostic files, perform the following steps:
1

Download the appropriate version of the MegaCli tool for Windows Operating systems from the Avagotech support site.
NOTE:
For the most popular Linux operating systems, you also need the following file to ensure a smooth execution: Lib_Utils-1.00-09.noarch.rpm

2

Unzip the downloaded files and copy them in any directory on the server or host system according your installed operating system.
For example, on Windows the path can be C:\temp\megacli, and on Linux the path can be /tmp/megacli.

3

Open a terminal in Linux or a command prompt in Windows operating systems and navigate to the newly created directory.

4

Now, run the following commands:
•

To create the RAID controller logs in Windows, use the command MegaCli.exe -FwTermLog -Dsply –aALL >
ttylog.txt.

•

To create the Event Log in Windows, use the command MegaCli.exe -AdpEventLog -IncludeDeleted -f
deleted.txt -aALL.

•

To create the RAID controller logs in Linux, use the command ./MegaCli64 -FwTermLog -Dsply –aALL > /tmp/
ttylog.txt.

•

To create the Event Log in Linux, use the command ./MegaCli64 -AdpEventLog -IncludeDeleted -f
deleted.txt -aALL.
NOTE: The deleted.txt file is always stored in the MegaCli folder.

52

Troubleshooting hardware issues

Symptoms
Hard disk drive making noise
Description
Clicking or other loud noise is heard from the hard drive. The noise can be intermittent and may or may not coincide with read or write
operations.
Resolution
1

Run the custom ePSA diagnostics and obtain the error code. For more information on ePSA diagnostics, see PSA/ePSA Diagnostics.

2

If the diagnostics fail, call Dell Technical Support for further assistance.

3

If the diagnostics pass, reseat the hard drive.

4

If the problem persists, call Dell Technical Support for further assistance.

Troubleshooting system memory
Prerequisite
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
Steps
1

If the system is operational, run the appropriate diagnostic test. See the Using system diagnostics section for the available diagnostic
tests.
If the diagnostic tests indicate a fault, follow the corrective actions provided by the diagnostic tests.

2

If the system is not operational, turn off the system and attached peripherals, and unplug the system from the power source. Wait at
least for 10 seconds, and then reconnect the system to the power source.

3

Turn on the system and attached peripherals, and note the messages on the screen.
If an error message is displayed indicating a fault with a specific memory module, go to step 12.

4

Enter System Setup, and check the system memory setting. Make any changes to the memory settings, if needed.
If the memory settings match the installed memory but the problem still persists, go to step 12.

5

Turn off the system and attached peripherals, and disconnect the system from the electrical outlet.

6

Remove the system cover.

7

Check the memory channels and ensure that they are populated correctly.
NOTE: See the system event log or system messages for the location of the failed memory module. Reinstall the memory
device.

8

Reseat the memory modules in their sockets.

9

Install the system cover.

10

Enter System Setup and check the system memory setting.
If the problem is not resolved, proceed with step 11.

11

Remove the system cover.

12

If a diagnostic test or error message indicates a specific memory module as faulty, swap or replace the module with a known working
memory module.

13

To troubleshoot an unspecified faulty memory module, replace the memory module in the first DIMM socket with a module of the
same type and capacity.

Troubleshooting hardware issues

53

If an error message is displayed on the screen, this may indicate a problem with the installed DIMM type(s), incorrect DIMM
installation, or defective DIMM(s). Follow the on-screen instructions to resolve the problem.
14

Install the system cover.

15

As the system boots, observe any error message that is displayed and the diagnostic indicators on the front of the system.

16

If the memory problem persists, repeat step 12 through step 15 for each memory module installed.

Next step
If the problem persists, see the Getting help section.

Troubleshooting memory module issues
Following are guidelines to troubleshoot memory module issues:
•

Check for the LCD status messages. The server’s LCD screen displays text messages that indicate multi-bit and single-bit errors were
detected in the System Event Log (SEL).

•

Run the Dell SupportAssist to identify a faulty DIMM in the memory modules or in the System Event Logs (SEL) logs.

•

Run the MPMemory diagnostics, which checks the logs first and then checks the DIMMs. If there is an issue, the test results display an
error.

•

Check for POST messaging error beep codes.

•

Check for the POST status on the LCD screen.

•

Check for memory and system compatibility.

•

Delete the Single Bit Errors in the SEL log by using the OMSA, SupportAssist and CTRL-E to clear the SEL logs.

•

Clear the Single-Bit Error (SBE) Log and Multi-Bit Error (MBE) Log to resolve the memory issues.

•

Check for CPU socket for any bent pins.

•

Ensure that the system is using the latest BIOS version. If not, update the BIOS and check if the error occurs.

Prerequisites to follow when installing or upgrading memory
•

Identify the maximum memory capacity supported by the processor installed in the system.

•

While installing the memory module, ensure that you follow the memory population rules. For more information, see the Memory
population guidelines section.

•

Ensure that the memory configurations set in the System BIOS matches with the memory modules installed in the system.

Memory population guidelines
Your system supports Flexible Memory Configuration, enabling the system to be configured and run in any valid chipset architectural
configuration. The recommended guidelines for installing memory modules are as follows:
•

Populate DIMM sockets only if a processor is installed in the system.

•

Populate release levers for all sockets in the following order-first white, then black, and then green.

•

Mixing of different DIMM types is not supported. For example, only RDIMMs, only UDIMMs, or only LRDIMMs can be installed in a
system.

•

x4 and x8 DRAM based DIMMs can be mixed.

•

Memory modules of different ranks can be mixed. For example, single rank and dual rank memory modules can be mixed. While mixing
DIMMs with different capacities, ensure that you populate the sockets by highest rank count in the following order—first in sockets
with white release levers, then black, and then green. For example, if you want to mix single-rank and dual-rank memory modules,
populate dual-rank memory modules in the sockets with white release tabs and single-rank memory modules in the sockets with black
release tabs. You can only mix maximum of two different rank DIMMs in the system.

•

Memory modules of different capacities can be mixed. For example, 4 GB and 8 GB memory modules can be mixed. While installing
memory modules of different capacities, ensure that you populate the sockets with memory modules with highest capacity first. For

54

Troubleshooting hardware issues

example, if you want to mix 4 GB and 8 GB memory modules, populate 8 GB memory modules in the sockets with white release tabs
and 4 GB memory modules in the sockets with black release tabs. You can mix maximum of two memory modules of different capacity.
•

In a dual-processor configuration, the memory configuration for each processor should be identical through the first eight slots. For
example, if you populate socket A1 for processor 1, then populate socket B1 for processor 2, and so on.
NOTE: For more information about the system memory guidelines specific to your system, see the Owner’s Manual at Dell.com/
poweredgemanuals.

Memory modes
There are four types of memory modes:
•

Advanced ECC (Lockstep)

•

Memory Optimized (Independent Channel)

•

Memory Sparing

•

Memory Mirroring

You can configure the memory modes in the System BIOS. For more information, see the Configuring memory modes section.

Configuring memory modes
About this task
To configure memory modes, follow the procedure listed below:
Steps
1
Turn on or restart your system.
2

Press F2 immediately after you see the following message:  = System Setup.

3

Click System Setup Main menu > System BIOS > Memory Settings.

4

In the Memory Operating Mode field, select the memory mode you want.

Memory Mirroring
In the memory mirroring mode, half of the installed memory is used to mirror the active DIMMs. When an uncorrectable error is detected,
the system switches over to the mirrored copy. This mode offers SDDC and multi-bit protection and also provides strongest DIMM
reliability mode compared to all other modes.

Memory Sparing
In this mode, one rank per channel is reserved as a spare. If persistent correctable errors are detected on a rank, the data from this rank is
copied to the spare rank and the failed rank is disabled.
With memory sparing enabled, the system memory available to the operating system is reduced by one rank per channel. For example, in a
dual-processor configuration with sixteen 4 GB dual-rank DIMMs, the available system memory is: 3/4 (ranks/channel) × 16 (DIMMs) × 4
GB = 48 GB, and not 16 (DIMMs) × 4 GB = 64 GB.
NOTE: Memory sparing does not offer protection against a multi-bit uncorrectable error.
NOTE: Both Advanced ECC or Lockstep and Optimizer modes support Memory Sparing.

Troubleshooting hardware issues

55

Memory optimized (independent channel)
This mode supports SDDC only for memory modules that use x4 device width, and the mode does not impose any specific slot population
requirements.

Advanced ECC
Advanced ECC mode extends SDDC from x4 DRAM based DIMMs to both x4 and x8 DRAMs. This protects against single DRAM chip
failures during normal operation.

Memory lane failure
The disabling of the memory module in the channel results in memory lane failure.
Memory lane failure may occur due to the following reasons:
•

The system is not using latest BIOS

•

The memory modules are not seated properly on the DIMM sockets in the system.

•

The system has a faulty DIMM socket.

Correctable and uncorrectable errors
Correctable errors can be detected and corrected if the chipset and DIMM support this functionality. Correctable errors are generally single
bit errors (SBE). Most of the Dell servers are capable of detecting and correcting single bit errors. In addition, Dell servers with Advanced
ECC mode support can detect and correct multi-bit errors.
Correctable errors can be classified as hard errors and soft errors. Hard errors typically indicate a problem with the DIMM. Although hard
correctable memory errors are corrected by the system and will not result in system downtime or data corruption, they indicate a problem
with the hardware. Soft errors do not indicate any issue with the DIMM. A soft error occurs when the data or ECC bits or both data and
ECC bits on the DIMM are incorrect, but the error will not continue to occur after the data or ECC bits or both data and ECC bits on the
DIMM have been corrected.
Uncorrectable errors are always multi-bit memory errors. While correctable errors do not affect the normal operation of the system,
uncorrectable memory errors will immediately result in a system crash or shutdown of the system when not configured for Mirroring or
RAID AMP modes. Uncorrectable memory errors can typically be isolated down to a failed Bank of DIMMs, rather than the DIMM itself.

System stops responding during POST after upgrading or
installing a memory module
Issue:
During POST, the system stops responding or displays MEMTEST lane failure error message.
Resolution
Ensure the following:
•

The system is using the latest BIOS.

•

The memory modules are seated properly on the DIMM sockets in the system.

•

The system has a no faulty DIMM socket.

56

Troubleshooting hardware issues

How to resolve unsupported or mismatched memory module
error messages reported during POST on a PowerEdge
system?
Description:
After installing or upgrading a memory module, when you try to boot the system, an error message appears reporting an unsupported or
mismatched memory module.
Resolution:
•

If the memory module was replaced or upgraded, ensure that the maximum memory capacity and the memory type are supported by
the processor installed in the system.

•

While installing the memory module, ensure that you follow the memory population rules. For more information, see the Memory
population guidelines section.

Troubleshooting power supply units
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
The following sections provide information on troubleshooting power source and power supply units problems.

Troubleshooting power source problems
1

Press the power button to ensure that your system is turned on. If the power indicator does not glow when the power button is
pressed, press the power button firmly.

2

Plug in another working power supply unit to ensure that the system board is not faulty.

3

Ensure that no loose connections exist.
For example, loose power cables.

4

Ensure that the power source meets applicable standards.

5

Ensure that there are no short circuits.

6

Have a qualified electrician check the line voltage to ensure that it meets the needed specifications.
NOTE: Few power supply units require 200-240V AC to deliver their rated capacity. For more information, see the system
Technical Specifications section in the Installation and Service Manual available at Dell.com/support/manuals.

Troubleshooting power supply unit problems
1

Ensure that no loose connections exist.
For example, loose power cables.

2

Ensure that the power supply handle LED indicates that the power supply is working properly.

3

If you have recently upgraded your system, ensure that the power supply unit has enough power to support the new system.

4

If you have a redundant power supply configuration, ensure that both the power supply units are of the same type and wattage.
If the LED You may have to upgrade to a higher wattage power supply unit.

5

Ensure that you use only power supply units with the Extended Power Performance (EPP) label on the back.

Troubleshooting hardware issues

57

6

Reseat the power supply unit.
NOTE: After installing a power supply unit, allow several seconds for the system to recognize the power supply unit and
determine if it is working properly.
If the problem persists, see the Getting help section.

Troubleshooting RAID
RAID configuration using OpenManage Server Administrator
For more information about the OpenManage Server Administrator RAID Configuration video, go to https://www.youtube.com/playlist?
list=PLe5xhhyFjDPfL6NsJ29FXLVsP9uE-AFO0.

Create Virtual Disk Express Wizard
The Create Virtual Disk Wizard allows you to select the wizard type and the RAID level. The Create Virtual Disk Express Wizard
calculates the appropriate virtual disk configuration based on the available space and controller requirements. To make your own selections
for the virtual disk configuration, select the Advanced Wizard option.
About this task
This task is not supported on PERC hardware controllers running in HBA mode. To create a virtual disk by using the Create Virtual Disk
Express Wizard, perform the following steps:
Steps
1
On the upper-left corner of the Server Administrator page, expand Storage .
2

Click PERC Controller.

3

Click Virtual Disks.
The Virtual Disk(s) on Controller  page is displayed.

4

Click Go to the Create Virtual Disk Wizard.
The Create Virtual Disk Wizard  page is displayed.

5

Select the Express Wizard option and the RAID level from the drop-down menu.

6

Click Continue.
The Create Virtual Disk Express Wizard -  page displays the summary of attributes of the selected RAID level.
The attributes include Bus Protocol, Stripe Element Size, Read Policy, and the selected physical disk. The default values of the
virtual disk attributes excluding the RAID level are recommended for the selected RAID level.

7

In the Name field, type a name for the virtual disk.
The virtual disk name can contain only alphanumeric characters, spaces, dashes, and underscores. The maximum length of the virtual
disk name depends on the controller. In most cases, the maximum length is 15 characters. The name cannot start with a space or end
with a space.
NOTE: Dell recommends that you specify a unique name for each virtual disk. If you have virtual disks with the same name,
it is difficult to differentiate the generated alerts.

8

In the Size field, type the size of the virtual disk.
The virtual disk size must be within the minimum and maximum values displayed against the Size field.
In some cases, the virtual disk is slightly larger than the size you specify. The Create Virtual Disk Wizard adjusts the size of the virtual
disk to avoid rendering a portion of the physical disk space unusable.

9

If you want to change your selections, click Go Back To Previous Page to return to Create Virtual Disk Express Wizard page.

10

Click Finish to complete the virtual disk creation or click Exit Wizard to cancel the virtual disk creation.

58

Troubleshooting hardware issues

For PERC H700 and PERC H800 controllers, if any of the drives you selected is in the spun down state, the following message is
displayed:
The below listed physical drive(s) are in the Spun Down state. Executing this task on these
drive(s) takes additional time, because the drive(s) need to spun up.
The message displays the ID(s) of the spun down drive(s).
NOTE: When the hard drive is in the spun down state, the drive is not in active use.
The virtual disk is displayed on the Virtual Disk(s) on Controller  page.

Create virtual disk Advanced Wizard
The Create Virtual Disk Advanced Wizard allows you to specify the read, write, cache policy, and parameters such as RAID level, bus
protocol, media type, and encrypted disk for the virtual disk. You can also select the physical disks and the controller connector. You must
have a good knowledge of RAID levels and hardware to use the Advanced Wizard. If you want the wizard to recommend a virtual disk
configuration for you, select the Express Wizard option.
About this task
NOTE: This task is not supported on PERC hardware controllers running in HBA mode.
To create a virtual disk using the Create Virtual Disk Advanced Wizard:
Steps
1
On the upper-right corner of the Server Administrator page, click Storage.
2

Click .

3

Click Virtual Disks.
The Virtual Disk(s) on Controller  page is displayed.

4

Click Go to the Create Virtual Disk Wizard.
The Create Virtual Disk Wizard  page is displayed.

5

Select the Advanced Wizard option.

6

To make sure that only encrypted physical disks are used to create the virtual disk, select Yes from the Create Encrypted Virtual Disk
drop-down list.
The RAID levels are available for selection based on the number of encrypted physical disks.
If you select No, the RAID levels are available based on the total number of physical disks present on the system.

7

Select the required RAID level from the drop-down menu.

8

Select Bus Protocol.
The options are:
•

9

SAS

• SATA
Select Media Type.
The available options are:
•

10

HDD

• SSD
Click Continue.
NOTE: If you are creating an encrypted virtual disk, only the encrypted physical disks are displayed. Else, both encrypted
and unencrypted physical disks are displayed.
Example:
Connector 0

Troubleshooting hardware issues

59

The Connector section of the page displays the connectors on the controller and the disks attached to each connector. Select the
disks you want to include in the virtual disk. In this example, the controller has a single connector with five disks.
•

Physical disk 0:0

•

Physical disk 0:1

•

Physical disk 0:2

•

Physical disk 0:3

•

Physical disk 0:4

Physical Disks Selected
The Physical Disks Selected section of the page displays the disks you have chosen. In this example, two disks are selected.
•

Physical disk 0:0

•

Physical disk 0:1

Each RAID level has specific requirements for the number of disks that must be selected. RAID 10, RAID 50, and RAID 60 also have
requirements for the number of disks that must be included in each stripe or span.
If the controller is a SAS controller with firmware version 6.1 and later, and you selected RAID 10, RAID 50, and RAID 60, the user
interface displays the following:
•

All Disks—Enables you to select all the physical disks in all the enclosures.

•

Enclosure—Enables you to select all physical disks in the enclosure.
NOTE: The All Disks and Enclosure options enable you to edit spans after selecting the physical disks that comprise them.
You can remove a span and specify a span again with different physical disks before proceeding.

•

Number of Disks per Span — Enables you to select the number of disks in each span (default = 2). This option is available only on
SAS controllers with firmware version 6.1 and later.
NOTE: This option is available only if Intelligent Mirroring is selected on the Create Virtual Disk Advanced Wizard page.
NOTE: On a SAS controller with firmware version 6.1 and later, RAID 10 supports only even number of disks per span and a
maximum of 8 spans with 32 disks in each span.
Let us consider that the controller has three enclosures with six physical disks each (total number of available disks = 3 x 6 =18
disks). If you select four disks per span, the controller creates four spans (18 disks/4 disks per span = 4 spans). The last two disks
of the last enclosure are not part of RAID 10.

•

Select the number of disks to create a single spanned virtual disk list box — Enables you to create a single span virtual disk with
22 or 26 physical drives for PERC controllers. This list box option is displayed only if you have selected RAID 10 in step 1 and the
system has 22 or more physical drives.

11

NOTE: Only physical disks that comply with the virtual disk parameters, selected in the Create Virtual Disk Wizard page
are displayed.
Select the required connector and the corresponding physical disk, and click Continue.

12

In the Name field, type a name for the virtual disk.
The virtual disk name can contain only alphanumeric characters, spaces, dashes, and underscores. The maximum length depends on
the controller. In most cases, the maximum length is 15 characters.
The name cannot start with a space or end with a space. If you have virtual disks with the same name, it is hard to differentiate the
generated alerts, Dell recommends that you specify a unique name for each virtual disk.

13

In the Size field, type the size of the virtual disk.
The virtual disk size must be within the minimum and maximum values displayed near the Size field.
In some cases, the virtual disk is slightly larger than the size you specify. The Create Virtual Disk Wizard adjusts the size of the virtual
disk to avoid rendering a portion of the physical disk space unusable.

60

Troubleshooting hardware issues

NOTE:
•

If a physical disk is receiving a SMART alert, it cannot be used in a virtual disk.

•

For a controller that has more than one channel, it may be possible to configure a virtual disk that is channelredundant.

Depending on the RAID level you selected and the virtual disk size, this page displays the disks and connectors (channels or ports)
available for configuring the virtual disk.
NOTE: The Size field displays the default size of the virtual disk depending upon the RAID configuration you selected.
However, you can also specify a different size based on your requirement. The virtual disk size must be within the minimum
and maximum values displayed against the Size field. In some cases, the virtual disk is slightly larger than the size you
specify. The Create Virtual Disk Wizard adjusts the size of the virtual disk to avoid rendering a portion of the physical disk
space unusable.
14

Select a stripe size from the Stripe Element Size drop-down list box. The stripe size refers to the amount of space that each stripe
consumes on a single disk.

15

Select the required read, write, and disk cache policy from the respective drop-down list box. These selections can vary depending on
the controller.
NOTE:
•

There is limited support for write policy on controllers that do not have a battery. The cache policy is not supported on
controllers that do not have a battery.

•

16

If you have selected the Create Encrypted Virtual Disk option in Create Virtual Disk Advanced Wizard, then in the
Summary of Virtual Disk Attributes, an Encrypted Virtual Disk attribute is displayed with a value Yes.
Click Span Edit to edit the spans created in Create Virtual Disk Advanced Wizard.
CAUTION: If you click Span Edit, Intelligent Mirroring that has already been applied becomes invalid.
NOTE: The Span Edit option is available in the following two cases:
•

Only if the controller is a SAS controller with firmware 6.1 and later.

17

• If you selected RAID 10 while using Create virtual disk Advanced Wizard.
If you want to change your selections, click Go Back To Previous Page to return to Create Virtual Disk Advanced Wizard page.

18

Click Finish to complete the virtual disk creation or click Exit Wizard to cancel the virtual disk creation.
The virtual disk is displayed in the Virtual Disk(s) on Controller  page.
For PERC H700 and PERC H800 controllers, if any of the drives you selected is in the spun down state, the following message is
displayed:
The below listed physical drive(s) are in the spun down state. Executing this task on these
drive(s) takes additional time, because the drive(s) need to spun up.
The message displays the ID(s) of the spun down drive(s).
The Create Virtual Disk Advanced Wizard -  page displays a checkbox next to each physical disk that is suitable
as a dedicated hot spare. Select a Physical Disk checkbox if you want to assign a dedicated hot spare.
The Physical Disk checkbox is not available if the controller does not have a physical disk that is a suitable hot spare for the virtual
disk you are creating. For example, the available physical disks may not be sufficient to protect the virtual disk. If the Physical Disk
checkbox is not available, specify a smaller virtual disk, use a different RAID level, or change the disk selection.
NOTE: If you are creating an encrypted virtual disk, then encrypted physical disks are listed as candidates for hot spare.

RAID configuration by using Unified Server Configurator
The Unified Server Configurators (USC)’s main features include operating system deployment, firmware updates, hardware configuration,
and diagnostics. The UEFI specification creates framework for many powerful applications that run without an operating system. The UEFI

Troubleshooting hardware issues

61

specification includes the Trusted Computing Group (TCG) protocol that allows for Trusted Platform Module (TPM) measurements and
secure updates which USC leverages to its fullest extent.
1

Select the RAID configuration tab to create the virtual disk as the boot device.

2

On the View Current Configuration screen, you can view a list of current controllers and virtual disks installed in the system.
a

3

If the virtual disk or disks exist in the system, click Exit to exit the screen and continue with operating system installation. if the
virtual disk or disks do not exist in the system, continue with the new virtual disk creation process.

On the Select RAID Controller screen, select the controller on which to configure a virtual disk.
•

If the selected controller contains a foreign configuration, the Foreign Configuration Detected screen is displayed.

•

Depending on the foreign configuration, you have the option to either Clear, Import or Ignore the foreign configuration.

•

If you select the Clear Foreign Configuration option, all the data gets deleted on the drives containing the foreign configuration,
thereby allowing the drives to be used in a new configuration.

•

Selecting Import Foreign Configuration attempts to import the foreign configuration.

•

Selecting Ignore Foreign Configuration leaves the drives in their current state and uses free drives to create a new configuration.
NOTE: Selecting the Ignore Foreign Configuration causes the system to generate alerts and error messages every time the
system is started.

4

62

Select either the Express Wizard or Advanced Wizard window on the Select a Configuration Option screen.

Troubleshooting hardware issues

Figure 15. Flowchart of Unified Server Configurator’s RAID configuration process
5

The Express option selects the appropriate disks depending upon the RAID type selected for virtual disk creation.
The Summary screen is displayed. You can review the choices selected during the Express wizard.

6

Click Finish to create the virtual disk to be used for operating system installation.

7

The Advanced option takes you to a series of more screens. Select the RAID type on the Basic Settings screen. On the Select
Physical Disks screen, select the physical disks that are the part of the virtual disk.
The Additional Settings screen is displayed to define additional parameters for the virtual disk, such as stripe element size, read and
write policy, and whether or not to assign a hot spare disk. The Summary of virtual disk attributes is displayed.

8

Click Finish to create virtual disk to be used for operating system installation.

Troubleshooting hardware issues

63

Downloading and installing the RAID controller log export by
using PERCCLI tool on ESXi hosts on Dell’s 13th generation of
PowerEdge servers
To export information about the status of the RAID controller and its attached hard drives, you can use the PERCCLI tool. To download and
install the RAID controller log export by using PERCCLI tool on ESXi hosts on Dell’s 13th generation of PowerEdge servers, perform the
following steps:
1

Download the latest verison of PERCCLI for ESX tool from Dell.com/support/home.

2

To upload the PERCCLI tool to Datastore:
a
b

Select the host (1), and then click on the Configuration tab (2).
In the Hardware panel, select Storage (3) and right-click datastore1.

Figure 16. vSphere Client
a

In Datastore Browser, click Upload, and then click Upload File.

Figure 17. Datastore Browser
3

64

To enable Secure Socket Shell (SSH) select host (1), and then click on the Configuration tab (2). In the Software panel, select
Security Profile (3). In Services, click Properties (4) at the upper-right corner of the screen.

Troubleshooting hardware issues

Figure 18. Configuration tab
a

In the Services properties window, select SSH (1), and then click Options... (2)

Troubleshooting hardware issues

65

Figure 19. Open SSH Options
b

66

In the SSH Options window, click Start (1), and then click OK (2) to activate the service.

Troubleshooting hardware issues

Figure 20. Start SSH Service
4

To unzip vmware-esx-perccli, open an SSH connection via PUTTY and run the command: unzip /vmfs/volumes/datastore1/
vmware-esx-perccli-1.05.08.zip
PUTTY is a free and open-source terminal emulator, serial console and network file transfer application. It supports several network
protocols, including SCP, SSH, Telnet, rlogin, and raw socket connection. You can download it from Google. The files vmware-esxperccli-1.05.08.vib and Readme.txt are now extracted to: /vmfs/volumes/datastore1/
WARNING: Please check the correct path (e.g. datastore1) according to your installation package

5

Install PERCCLI by using command: esxcli software vib install -v /vmfs/volumes/datastore1/vmware-esxperccli-1.05.08.vib --no-sig-check

Figure 21. Install PERCCLI
6

Go to directory /opt/lsi/perccli by using command: cd /opt/lsi/perccli

7

Collect the logs by using command: ./perccli /c0 show termlog

Troubleshooting hardware issues

67

The log is created with the name: #### MegaSAS.log

Figure 22. Log creation
8

Copy MegaSAS.log to datastore by using command:cp /opt/lsi/perccli/MegaSAS.log /vmfs/volumes/datastore1/

9

Copy the file to desktop with Datastore Browser.

Figure 23. Log file in Datastore Browser
Now the logs are exported on ESXi hosts on the Dell 13th generation PowerEdge servers.

Configuring RAID by using Lifecycle Controller
Prerequisite
If your system has one or more supported PERC RAID controllers with PERC 8 firmware or later, or software RAID controllers, use the
RAID Configuration wizard to configure a virtual disk as the boot device.
NOTE: Create boot virtual disk only from a disk drive populated across slots 0–3 of the system. For slot information, see the
server Owner’s Manual at Dell.com/poweredgemanuals.

68

Troubleshooting hardware issues

Steps
1

Launch the Lifecycle Controller home page, click Hardware Configuration, and then click Configuration Wizards.
The Configuration Wizards window is displayed.

2

In Storage Configuration Wizards, click RAID Configuration.

3

Select the RAID controller to view its current virtual disk configuration and disk attributes. Click Next.

4

Select the RAID level for the virtual disk that you want to create and click Next.

5

On the Select Physical Disks screen, the default values for Protocol, Media Type, and Encryption capability are displayed.

6

Select the required physical disks that you want to include in the virtual disk, and then click Next.

7

On the Virtual Disk Attributes screen, type the virtual disk name.
NOTE: The virtual disk name should not contain any space or special characters.

8

Type the size of the physical disk.
The values for Stripe Element Size, Read Policy, and Write Policy are displayed by default.

9

If you want to change the values for Stripe Element, Read Policy and Write Policy, select the required values from the respective
drop-down lists.
The Summary of virtual disk attributes is displayed.

10

Click Finish .

11

On the RAID Configuration Warning window, click Yes if you wish to continue with the creation of virtual disk.
The RAID Configuration window is displayed. Note that this operation might take approximately one minute to complete.

12

When the virtual disk is created successfully, the RAID Configuration Success window is displayed. Click OK.
You are taken to the Hardware configuration window. Click Back to go to the home page.

13

You can verify, if the RAID operation was successful by checking the events in the Lifecycle Log.
a
b

Click View Lifecycle Log History. The View Lifecycle Log window is displayed. Note that this operation might take few minutes.
On the View Log History page, you can verify if the RAID operation is successful. For more information about Dell Lifecycle
controller, go toDell.com/ techcenter/LC .

For more information about the Dell Lifecycle Controller RAID Configuration video, go to https://www.youtube.com/watch?
v=JFPmeHNENV4

Starting and target RAID levels for virtual disk reconfiguration
and capacity expansion
After you have created a virtual disk, the possibilities for reconfiguring the virtual disk depend on the controller, RAID level, and available
physical disks.
Table 13. Possible scenarios for reconfiguring a virtual disk
Controller

Starting RAID Level

Target RAID Level

Comments

PERC 6/E, PERC 6/I, PERC
H800 Adapter, PERC H700
Adapter, PERC H700 Integrated,
and PERC H700 Modular, PERC
H310 Adapter, PERC H310 Mini
Monolithic, PERC H310 Mini
Blades, PERC H710 Adapter,
PERC H710 Mini Blades, PERC
H710 Mini Monolithic, PERC
H710P Adapter, PERC H710P
Mini Blades, PERC H710P Mini
Monolithic, PERC H810 Adapter,

RAID 0

RAID 1

Add a single disk

RAID 0

RAID 0, RAID 5

Add at least one additional disk.

RAID 0

RAID 6

RAID 6 requires a minimum of 4
disks.
Reconfiguration from RAID 0 to
RAID 6 requires at least 2
additional disks even when this
exceeds the 4-disk minimum
required by RAID 6

RAID 1

RAID 0

With or without adding
additional disks

Troubleshooting hardware issues

69

Controller

Starting RAID Level

PERC H330 Adapter, PERC
RAID 1
H330 Mini Monolithic, PERC
H330 Mini Blades, PERC H330
Embedded, PERC H730 Adapter,
PERC H730 Mini Monolithic,
RAID 5
PERC H730 Mini Blades, PERC
H730P Adapter, PERC H730P
RAID 5
Mini Monolithic, PERC H730P
Mini Blades, PERC H730P Slim,
PERC H830 Adapter, PERC
FD33xD/FD33xS
RAID 6

Target RAID Level

Comments

RAID 5, RAID 6

Add at least one additional disk.
RAID 6 requires a minimum of 4
disks.

RAID 0

With or without adding
additional disks

RAID 5, RAID 6

Add at least one additional disk.
RAID 6 requires a minimum of 4
disks.

RAID 0, RAID 5

With or without adding
additional disks

RAID 6

RAID 6

Add at least one additional disk

RAID 10

RAID 10

Without adding additional disks

SAS 6/iR

N/A

N/A

N/A

PERC S100, S110, S130, and
S300

RAID 0

RAID 0

With or without additional disks

RAID 1

RAID 1

Without additional disks

RAID 5

RAID 5

With or without additional disks

RAID 10

RAID 10

Without additional disks

NOTE: The order of the controllers displayed on Storage Management may differ from the order of the controllers displayed in
the Human Interface (HII) and PERC Option ROM. The order of the controllers does not cause any limitation.
NOTE: When creating virtual disks using software RAID controllers, the information related to the physical disks linked to the
virtual disk is enumerated or displayed on Storage Management after a short delay. This delay in displaying the information does
not cause any functional limitation. If you are creating partial virtual disks, Dell recommends that you provide Storage
Management adequate time between each partial virtual disk creation process.
NOTE: RAID 10 virtual disk reconfiguration operation is not supported with Intelligent Mirroring.

Reconfiguring or migrating virtual disks
About this task
Reconfiguring or migrating a virtual disk (VDs) enables you to increase the capacity or change the RAID level of the virtual disk.
NOTE: In the PERC BIOS Configuration Utility window, with software RAID S110 and S130 controllers, if a physical disk (SATA
SSD or HDD) is removed from a virtual disk and the same physical disk is reinserted (hot plug) into the virtual disk instantly, then
the virtual disk state is displayed as Ready or Non RAID (for the PERC S110 and S100) and the physical disk state is displayed as
Online. However, if the same physical disk is reinserted after a short delay, then the virtual disk state is displayed as Degraded
and the physical disk state is displayed as Ready.
To reconfigure a virtual disk:
Steps
1

Review the information in the Starting and Target RAID Levels for Virtual Disk Reconfiguration and Capacity Expansion section in this
document.

2

In the tree view, locate the controller on which the virtual disk resides. Expand the controller object until the Virtual Disks object is
displayed.

3

Select the Reconfigure task from the virtual disk’s Tasks drop-down list box and click Execute.

70

Troubleshooting hardware issues

4

Complete the Reconfigure task using the Reconfigure wizard.

Starting and target RAID levels for virtual disk reconfiguration and
capacity expansion
After you have created a virtual disk, the possibilities for reconfiguring the virtual disk depend on the controller, RAID level, and available
physical disks.
Table 14. Possible scenarios for reconfiguring a virtual disk
Controller

Starting RAID Level

Target RAID Level

Comments

PERC 6/E, PERC 6/I, PERC
H800 Adapter, PERC H700
Adapter, PERC H700 Integrated,
and PERC H700 Modular, PERC
H310 Adapter, PERC H310 Mini
Monolithic, PERC H310 Mini
Blades, PERC H710 Adapter,
PERC H710 Mini Blades, PERC
H710 Mini Monolithic, PERC
H710P Adapter, PERC H710P
Mini Blades, PERC H710P Mini
Monolithic, PERC H810 Adapter,
PERC H330 Adapter, PERC
H330 Mini Monolithic, PERC
H330 Mini Blades, PERC H330
Embedded, PERC H730 Adapter,
PERC H730 Mini Monolithic,
PERC H730 Mini Blades, PERC
H730P Adapter, PERC H730P
Mini Monolithic, PERC H730P
Mini Blades, PERC H730P Slim,
PERC H830 Adapter, PERC
FD33xD/FD33xS

RAID 0

RAID 1

Add a single disk

RAID 0

RAID 0, RAID 5

Add at least one additional disk.

RAID 0

RAID 6

RAID 6 requires a minimum of 4
disks.
Reconfiguration from RAID 0 to
RAID 6 requires at least 2
additional disks even when this
exceeds the 4-disk minimum
required by RAID 6

RAID 1

RAID 0

With or without adding
additional disks

RAID 1

RAID 5, RAID 6

Add at least one additional disk.
RAID 6 requires a minimum of 4
disks.

RAID 5

RAID 0

With or without adding
additional disks

RAID 5

RAID 5, RAID 6

Add at least one additional disk.
RAID 6 requires a minimum of 4
disks.

RAID 6

RAID 0, RAID 5

With or without adding
additional disks

RAID 6

RAID 6

Add at least one additional disk

RAID 10

RAID 10

Without adding additional disks

SAS 6/iR

N/A

N/A

N/A

PERC S100, S110, S130, and
S300

RAID 0

RAID 0

With or without additional disks

RAID 1

RAID 1

Without additional disks

RAID 5

RAID 5

With or without additional disks

RAID 10

RAID 10

Without additional disks

NOTE: The order of the controllers displayed on Storage Management may differ from the order of the controllers displayed in
the Human Interface (HII) and PERC Option ROM. The order of the controllers does not cause any limitation.

Troubleshooting hardware issues

71

NOTE: When creating virtual disks using software RAID controllers, the information related to the physical disks linked to the
virtual disk is enumerated or displayed on Storage Management after a short delay. This delay in displaying the information does
not cause any functional limitation. If you are creating partial virtual disks, Dell recommends that you provide Storage
Management adequate time between each partial virtual disk creation process.
NOTE: RAID 10 virtual disk reconfiguration operation is not supported with Intelligent Mirroring.

Foreign Configuration Operations
Foreign configuration is virtual disk data residing on physical disks that have been moved from one controller to another.
The Foreign Configuration Operations task provides a preview of the foreign configurations that you can import.
NOTE: On the PERC BIOS Configuration Utility screen, foreign configuration operations are not supported on PERC hardware
controllers running in HBA mode.
NOTE: Foreign Configuration Operations task is available only on PERC 6 and SAS controllers with firmware versions 6.1 and
later.
NOTE: It is not recommended to remove an external enclosure cable while the operating system is running on the system.
Removing the cable could result in a foreign configuration when the connection is re-established.
The Foreign Configuration Operations task is displayed only when a controller detects a foreign configuration. Select this option and click
Execute to display the Foreign Configuration Preview page.
The Foreign Configuration Preview page provides a preview of the foreign disks and enables you to perform operations such as, importing,
recovering, or clearing the foreign disks. You can also import or clear a locked foreign configuration.
If any foreign configurations that are locked by using Local Key manager (LKM) are detected, the associated Encryption Key Identifier is
displayed prompting you to provide the corresponding passphrase to unlock the drives.
To avoid unlocking foreign configurations and to proceed to preview, import, or clear a foreign configuration that has not been locked, on
the PERC BIOS Configuration Utility Foreign Configuration screen, click Skip or Continue.
If you do not want to import or clear the foreign configurations, or in case of loss of the associated passphrase of the corresponding
Encryption Key Identifier, execute the Instant Encrypt Erase task for the physical disks.
CAUTION: Executing the Instant Encrypt Erase task erases all data on the physical disk.
Some conditions, such as an unsupported RAID level or an incomplete disk group, can prevent the import or recovery of foreign virtual
disks.

Foreign Configuration properties
The following table describes the properties that are displayed on the PERC BIOS Configuration Utility Foreign Configuration screen for
the Foreign Disks and Global Hot Spares.
Table 15. Foreign Configuration Properties
Property

Definition

Status

These icons represent the severity or health of the storage component.

72

•

—Normal/OK

•

—Warning/Non-critical

Troubleshooting hardware issues

Property

Definition
•

—Critical/Failure/Error

Name

Displays the name of the foreign configuration and is available as a link. This link enables you to
access the physical disks that constitute the foreign disk.

State

Displays the current state of the foreign configuration. Possible values are:
•
•
•

Ready — The foreign disk can be imported and functions normally after import.
Degraded — The foreign disk is in degraded state and rebuilds after import.
Failed — The foreign disk has encountered a failure and is no longer functioning. You cannot
import the foreign configuration.

The foreign configuration may be in degraded or failed state due to any of the following reasons:
•
•
•
•
•
•
•

Missing physical disk — One of the physical disk(s) in the potential virtual disk is missing or not
available.
Missing span — One or more spans of a hybrid virtual disk is missing.
Stale physical disks — One or more physical disks in the configuration may contain out-of-date
data relating to other disks of that virtual disk. Hence, the data integrity of the imported virtual
disk is not intact.
Unsupported configuration of the virtual disk — The virtual disk has an unsupported RAID level.
Import and Export — The virtual disks available for import exceed the number of virtual disks
available for export.
Incompatible physical disks — Configuration on the physical disks is not recognized by the RAID
firmware.
Orphan drive — A physical disk in the foreign configuration has configuration information that
matches another physical disk that is already a part of an array (either a foreign or a native
array).

Layout

Displays the RAID level of the foreign configuration.

Remarks

Provides information about the foreign virtual disk. If the virtual disk cannot be imported, the reason
for failure is displayed.
•
•
•
•
•
•

Dedicated Hot Spare

Exceeded maximum — The number of virtual disks selected for import has exceeded the
maximum number of supported disks.
Missing physical disk or Missing span — One or more physical disk(s) or span(s) in the virtual
disk to be imported is missing.
Unsupported — The selected RAID level is not supported on this controller.
Orphan drive — The physical disk has been replaced and is no longer a part of the RAID
volume. The configuration should be cleared.
Stale physical disk — The physical disk to be imported in the virtual disk has outdated data.
Partially foreign — The virtual disk is part of an already existing configuration. Some physical
disks in this virtual disk are foreign.

Displays whether the foreign disk is a dedicated hot spare.

Based on the properties information, you can decide whether you want to import, recover, or clear the foreign configuration.

Viewing Patrol Read report
The patrol read report provides information on all the patrol reads performed on the controller in the chronological order. It provides
information such as last run time and result. If a patrol read fails, the reason for the failure is displayed.

Troubleshooting hardware issues

73

Setting Patrol Read mode
NOTE: This task is not supported on PERC hardware controllers running in HBA mode.
Patrol read identifies disk errors in order to avoid disk failures and data loss or corruption. The Set Patrol Read task is applicable only for
disks used as virtual disks or hot spares.
The Set Patrol Read task runs in the background and corrects the disk errors, when possible. When the Set Patrol Read mode is set to
Auto, patrol read is initiated when the controller is idle for a specific period of time and when no other background tasks are active. In this
scenario, the patrol read enhances the system performance as disk errors can be identified and corrected when there is no input/output
activity on the disk.
The controller adjusts the amount of system resources dedicated for patrol read based on the amount of controller activity that is
competing with the Patrol Read task. When the controller activity is high, fewer system resources are dedicated to the patrol read task.
Patrol Read does not run on a physical disk in the following circumstances:
•

The physical disk is not included in a virtual disk or is assigned as a hot spare.

•

The physical disk is included in a virtual disk that is currently undergoing one of the following:
•

Rebuild

•

Reconfiguration or reconstruction

•

Background initialization

•

Check consistency

In addition, the Patrol Read is suspended during heavy I/O activity and resumes when the I/O is finished.
To set Patrol Read mode, select the desired Patrol Read Mode option. The options available are:
•

Auto — Initiates the Patrol Read task. After the task is complete, it automatically runs again within a specified period. For example, on
some controllers the Patrol Read runs every four hours and on other controllers, the Patrol Read runs every seven days. The Patrol
Read task runs continuously on the system starting again within the specified period after each iteration of the task completes. If the
system reboots while the Patrol Read task is running in Auto mode, the Patrol Read restarts at zero percent (0%). When the Patrol
Read task is set to Auto mode, you cannot start or stop the task. Auto mode is the default setting.

•

Manual — Enables you to start and stop the Patrol Read task using Start and Stop Patrol Read. Setting the mode to Manual does not
initiate the Patrol Read task. If the system reboots while Patrol Read is running in Manual mode, Patrol Read does not restart.

•

Disabled — Prevents the Patrol Read task from running on the system.

NOTE: For more information on how often the Patrol Read task runs when in Auto mode, see your controller documentation.

To set Patrol Read mode
Select the desired Patrol Read Mode option. The options available are:
•

Auto — Initiates the Patrol Read task. After the task is complete, it automatically runs again within a specified period. For example, on
some controllers the Patrol Read runs every four hours and on other controllers, the Patrol Read runs every seven days. The Patrol
Read task runs continuously on the system starting again within the specified period after each iteration of the task completes. If the
system reboots while the Patrol Read task is running in Auto mode, the Patrol Read restarts at zero percent (0%). When the Patrol
Read task is set to Auto mode, you cannot start or stop the task. Auto mode is the default setting.

•

Manual — Enables you to start and stop the Patrol Read task using Start and Stop Patrol Read. Setting the mode to Manual does not
initiate the Patrol Read task. If the system reboots while Patrol Read is running in Manual mode, Patrol Read does not restart.

•

Disabled — Prevents the Patrol Read task from running on the system.

NOTE: For more information on how often the Patrol Read task runs when in Auto mode, see your controller documentation.

74

Troubleshooting hardware issues

Check Consistency report
The check consistency report provides information on all the consistency checks performed on the controller in a chronological order. It
provides information such as last run time and result. If the consistency check fails, it provides the reason for the failure.

Performing a Check Consistency
The Check Consistency task verifies the accuracy of the redundant (parity) information. This task only applies to redundant virtual disks.
When necessary, the Check Consistency task rebuilds the redundant data. If the virtual disk is in a Failed Redundancy state, running a
check consistency may return the virtual disk to a Ready state.

Canceling a Check Consistency
The Cancel Check Consistency task stops a check consistency operation while it is in progress.

Pausing a Check Consistency
The Pause Check Consistency task pauses a check consistency while it is in progress.
NOTE: The Pause Check Consistency task updates the virtual disk State property to Resynching Paused immediately. The
Progress property may continue to increment for up to three seconds. This time delay occurs because the polling task may take
up to three seconds to query the task information and update the display.

Resuming a Check Consistency
The Resume Check Consistency task resumes check consistency after it has been paused.
To locate view Check Consistency report in Storage Management:
1

In the Server Administrator window, under the system tree, click Storage.

2

Select View Check Consistency Report from the Select Report drop-down menu.

3

Click Execute.

To locate view Check Consistency report in Storage Management
1

In the Server Administrator window, under the system tree, click Storage.

2

Select View Check Consistency Report from the Select Report drop-down menu.

3

Click Execute.

Virtual disk troubleshooting
Rebuilding of virtual disk does not work
Rebuilding of virtual disk does not work in the following situations:
•

The virtual disk is non redundant — For example, a RAID 0 virtual disk cannot be rebuilt because RAID 0 does not provide data
redundancy.

•

There is no hot spare assigned to the virtual disk — As long as the virtual disk is redundant, to rebuild it:
•

Remove the failed physical disk and replace it. A rebuild automatically starts on the new disk.

Troubleshooting hardware issues

75

•

Assign a hot spare to the virtual disk and then perform a rebuild.

•

You are attempting to rebuild onto a hot spare that is too small — Different controllers have different size requirements for hot spares.

•

The hot spare is unassigned from the virtual disk — This occurs on some controllers if the hot spare is assigned to more than one
virtual disk and is being used to rebuild a failed physical disk for another virtual disk.

•

The virtual disk includes failed or corrupt physical disks — This situation may generate alert 2083. For information on alert messages,
see the Server Administrator Messages Reference Guide at Dell.com/support/home.

•

The rebuild rate setting is too low — If the rebuild rate setting is quite low and the system is processing a number of operations, then
the rebuild may take an unusual amount of time to complete.

•

The rebuild is canceled — Another user can cancel a rebuild that you have initiated.

Rebuilding of virtual disk completes with errors
About this task
A rebuild completes with errors when a portion of the disk containing redundant (parity) information is damaged. The rebuild process can
restore data from the healthy portions of the disk but not from the damaged portion.
When a rebuild is able to restore all data except data from damaged portions of the disk, it indicates successful completion while also
generating alert 2163. For information on alert messages, see the Server Administrator Messages Reference Guide at Dell.com/support/
home.
The rebuild may also report sense key errors. In this situation, take the following actions to restore the maximum data possible:
Steps
1
Back up the degraded virtual disk onto a fresh (unused) tape drive.
•

If the backup is successful — If the backup completes successfully, then the user data on the virtual disk has not been damaged.
In this case, you can continue with step 2.

•
2

If the backup encounters errors — If the backup encounters errors then the user data has been damaged and cannot be
recovered from the virtual disk. In this case, the only possibility for recovery is to restore from a previous backup of the virtual disk.
Perform Check Consistency on the virtual disk that you have backed up onto a tape drive.

3

Restore the virtual disk from the tape drive onto healthy physical disks.

Cannot create a virtual disk
You may be attempting a RAID configuration that is not supported by the controller. Check the following:
•

How many virtual disks already exist on the controller? Each controller supports a maximum number of virtual disks.

•

Is there adequate available space on the disk? The physical disks that you have selected for creating the virtual disk must have an
adequate amount of free space available.

•

The controller may be performing other tasks, such as rebuilding a physical disk, that must run to completion before the controller can
create the new virtual disk.

A virtual disk of minimum size is not visible to Windows Disk Management
If you create a virtual disk using the minimum allowable size in Storage Management, the virtual disk may not be visible to Windows Disk
Management even after initialization. This occurs because Windows Disk Management is only able to recognize extremely small virtual disks
if they are dynamic. It is advisable to create virtual disks of larger size when using Storage Management.

Virtual disk errors on systems running Linux
About this task
On some versions of the Linux operating system, the virtual disk size is limited to 1TB. If you create a virtual disk that exceeds the 1TB
limitation, your system may experience the following behavior:

76

Troubleshooting hardware issues

•

I/O errors to the virtual disk or logical drive.

•

Inaccessible virtual disk or logical drive.

•

Virtual disk or logical drive size is smaller than expected.

If you have created a virtual disk that exceeds the 1TB limitation, you must:
Steps
1
Back up your data.
2

Delete the virtual disk.

3

Create one or more virtual disks that are smaller than 1TB.

4

Restore your data from backup.
Irrespective of whether your Linux operating system limits the virtual disk size to 1TB, the virtual disk size depends on the version of
the operating system and any updates or modifications that you have implemented. For more information on operating system, see
your operating system documentation.

Problems associated with using the same physical disks for both redundant
and nonredundant virtual disks
When creating virtual disks, you should avoid using the same physical disks for both redundant and nonredundant virtual disks. This applies
to all controllers. Using the same physical disks for both redundant and nonredundant virtual disks can result in unexpected behavior
including data loss.
NOTE: SAS controllers do not allow you to create redundant and nonredundant virtual disks on the same set of physical disks.

Troubleshooting memory or battery errors on the PERC
controller on Dell PowerEdge servers
Interpreting LCD and Embedded Diagnostic event messages
Issue:

The server LCD presents a error message, or an error message is generated when running the Enhanced Pre-Boot
System Assessment (ePSA).

Solution:

For more information about error messages, see the Dell Event and Error Messages Reference Guide at Dell.com/
openmanagemanuals > OpenManage software.
The Event Message Reference contains the error and event information generated by firmware and other agents
that monitor system components. These events might be logged and displayed on one of the system management
consoles, or both logged and displayed.
NOTE: To run the Embedded System Diagnostics (also known as Enhanced Pre-Boot System
Assessment):
1

As the system boots, press F11.

2

Use the up-and-down arrows keys to select System Utilities > Launch Dell Diagnostics.

Troubleshooting hardware issues

77

Troubleshooting conditions that lead to error message
NOTE: Troubleshooting the associated events may also prevent the error message from occurring. Error message can occur
normally when one of the following conditions occur.
•

OS indicates abnormal shutdown.

•

OS indicates error occurred (blue screen occurred in Windows).

•

Spontaneous power loss condition.

Try the following troubleshooting steps:
•

Reboot to OS
If the OS boot is successful, rebooting again should result in no message being displayed.

•

Clear Controller Cache
•

Ctrl + M for SCSI controllers (PERC 3, PERC 4).

•

Ctrl + R for SAS/SATA controllers (PERC 5, PERC 6 and newer controllers).

•

Wait for five minutes to allow contents of cache to purge.

•

Reboot back to controller BIOS.
NOTE: If error persists, the likelihood of a hardware error is increased. Contact Dell Technical Support for further
troubleshooting steps.

•

•

If error is eliminated, boot to OS.

•

If OS boot is still not successful and/or the error persists, this may indicate a problem with the OS. Contact Technical Support for
further troubleshooting steps.

Check the Physical PERC Controller.
a

Inspect the DIMM and DIMM Socket for Damage.
1

Turn off the system and remove the power cable(s) from the system.

2

Let the system sit for 30 seconds to allow any remaining flea power to drain.

3

Remove the PERC controller. For information about removing and replacing parts in this system, refer to the user guide
located at Dell.com/poweredgemanuals.

4

Remove the RAID memory battery. Ensure to reinstall the memory battery after inserting the DIMM.

5

Remove the memory DIMM from the controller, if applicable.

6

Check DIMM socket for any bent pins or other damage. Check the edge connector of the memory DIMM for any damage.

b

If the controller has embedded memory or the memory socket is damaged, contact Dell Technical Support.

c

If the memory is damaged, the controller memory may need to be replaced, contact Dell Technical Support.

d

If there is no damage, replace the memory DIMM and reinstall the controller.

e

Swap the controller memory a with known good memory, if possible.
1

If there is no known good memory available, contact Dell Technical Support.

2

if the error does not occur with the known good memory, contact Dell Technical Support.

3

If the error remains with a known good memory, contact Dell Technical Support.

Additional information for troubleshooting memory or battery errors on the
PERC controller
A RAID Controller error message is displayed during POST to indicate that the controller's cache does not contain all of the expected
information, or it contains data destined for a hard drive that cannot be or has not been written to the drive. The most common reasons
why this error may occur are:
•

78

Server did not perform a normal shutdown process – Power loss and/or spontaneous restarts can result in incomplete or corrupted
data to remain in cache that cannot be written to a drive.

Troubleshooting hardware issues

•

Cache memory is defective – Bad cache memory can cause data to become corrupted. This can cause OS-related issues and
spontaneous reboots.

•

Loss of battery power while server is shutdown – Controllers that do not use NVCACHE (Non-Volatile Cache) memory utilize batteries
that can retain the contents of cache for a limited time (24-72 hours) while the server is not powered on. Once the battery drains, the
entire contents of cache is lost and the controller recognizes that the cache memory does not contain all of the information expected.
Controllers that do utilize NVCache (some H700/H800 controllers and newer controllers such as H710, H710P, H810) are very unlikely
to encounter this issue since the battery only needs to maintain power for 30 seconds or less in most cases.

•

PERC Battery Maintenance
A PERC battery that is suspected to have failed or has a warning symbol displayed in OpenManage Server Administrator should have a
manual Learn Cycle performed. A Learn Cycle causes the battery to discharge and recharge, and restores the battery to a fully
functional condition. In some cases, multiple Learn Cycle procedures may be required to restore the battery to an effectively charged
state. To perform a manual Learn Cycle, select Start Learn Cycle from the Battery Tasks drop-down menu in Open Manage Server
Administrator (OMSA).

•

Cache Use
Hardware RAID controllers utilize cache (a temporary repository of information) for their normal operation. The normal operation cache
comprises DRAM memory, which, like system memory, retains data only when powered on.
Newer controllers utilize NVCache, which is utilized when the server is powered off. NVCache memory contains both DRAM memory
(for normal operation) and flash memory (non-volatile). The controllers battery (if operational) powers the DRAM memory during a
power loss so that the contents can be copied into the flash memory for indefinite storage.

The contents of cache can essentially be broken into three parts:
•

RAID configuration and metadata - Information about the RAID arrays including configuration information, disk members, role of disks,
etc.

•

Controller logs - RAID controllers maintain several log files. Dell technicians rely on the TTY log as the primary log for troubleshooting
various RAID and hard drive issues.

•

RAID data - This is the actual data destined to be written to the individual hard drives. Data is written into the cache of the controller in
both Write Through and Write Back cache policy modes.

Slicing
Configuring multiple RAID arrays across the same set of disks is called Slicing.

RAID puncture
A RAID puncture is a feature of Dell PowerEdge RAID Controller (PERC) designed to allow the controller to restore the redundancy of the
array despite the loss of data caused by a double fault condition. Another name for a RAID puncture is rebuild with errors. When the RAID
controller detects a double fault and there is insufficient redundancy to recover the data in the impacted stripe, the controller creates a
puncture in that stripe and enables the rebuild to continue.

Troubleshooting hardware issues

79

•

Any condition that causes data to be inaccessible in the same stripe on more than one drive is a double fault.

•

Double faults cause the loss of all data within the impacted stripe.

•

All RAID punctures are double faults but all double faults are NOT RAID punctures.

Causes of RAID puncture
Without the RAID puncture feature, the array rebuild would fail, and leave the array in a degraded state. In some cases, the failures may
cause additional drives to fail, and cause the array to be in a non-functioning offline state. Puncturing an array has no impact on the ability
to boot to or access any data on the array.
RAID punctures can occur in one of two situations:
•

Double Fault already exists (Data already lost).
Data error on an online drive is propagated (copied) to a rebuilding drive.

•

Double Fault does not exist (Data is lost when second error occurs).
While in a degraded state, if a bad block occurs on an online drive, that LBA is RAID punctured.

This advantage of puncturing an array is keeping the system available in production till the redundancy of the array is restored. The data in
the affected stripe is lost whether the RAID puncture occurs or not. The primary disadvantage of this method is that while the array has a
RAID puncture in it, uncorrectable errors will continue to be encountered whenever the impacted data (if any) is accessed.
A RAID puncture can occur in the following three locations:
•

In blank space that contains no data. That stripe will be inaccessible, but since there is no data in that location, it will have no significant
impact. Any attempts to write to a RAID punctured stripe by an OS will fail and data will be written to a different location.

•

In a stripe that contains data that isn't critical such as a README.TXT file. If the impacted data is not accessed, no errors are generated
during normal I/O. Attempts to perform a file system backup will fail to backup any files impacted by a RAID puncture. Performing a
Check Consistency or Patrol Read operations will generate Sense code: 3/11/00 for the applicable LBA and/or stripes.

•

In data space that is accessed. In such a case, the lost data can cause a variety of errors. The errors can be minor errors that do not
adversely impact a production environment. The errors can also be more severe and can prevent the system from booting to an
operating system, or cause applications to fail.

An array that is RAID punctured will eventually have to be deleted and recreated to eliminate the RAID puncture. This procedure causes all
data to be erased. The data would then need to be recreated or restored from backup after the RAID puncture is eliminated. The resolution
for a RAID puncture can be scheduled for a time that is more advantageous to needs of the business.
If the data within a RAID punctured stripe is accessed, errors will continue to be reported against the affected bad LBAs with no possible
correction available. Eventually (this could be minutes, days, weeks, months, and so on), the Bad Block Management (BBM) Table will fill up
causing one or more drives to become flagged as predictive failure. As seen in the figure, drive 0 will typically be the drive that gets flagged
as predictive failure due to the errors on drive 1 and drive 2 being propagated to it. Drive 0 may actually be working normally, and replacing
drive 0 will only cause that replacement to eventually be flagged predictive failure as well.
A Check Consistency performed after a RAID puncture is induced will not resolve the issue. This is why it is very important to perform a
Check Consistency on a regular basis. It becomes especially important prior to replacing drives, when possible. The array must be in an
optimal state to perform the Check Consistency.
A RAID array that contains a single data error in conjunction with an additional error event such as a hard drive failure causes a RAID
puncture when the failed or replacement drive is rebuilt into the array. As an example, an optimal RAID 5 array includes three members:
drive 0, drive 1 and drive 2. If drive 0 fails and is replaced, the data and parity remaining on drives 1 and 2 are used to rebuild the missing
information on to the replacement drive 0. However, if a data error exists on drive 1 when the rebuild operation reaches that error, there is
insufficient information within the stripe to rebuild the missing data in that stripe. Drive 0 has no data, drive 1 has bad data and drive 2 has
good data as it is being rebuilt. There are multiple errors within that stripe. Drive 0 and drive 1 do not contain valid data, so any data in that
stripe cannot be recovered and is therefore lost.The result as shown in Figure 3 is that RAID punctures (in stripes 1 and 2) are created
during the rebuild. The errors are propagated to drive 0.

80

Troubleshooting hardware issues

Puncturing the array restores the redundancy and returns the array to an optimal state. This provides for the array to be protected from
additional data loss in the event of additional errors or drive failures.

How to fix a RAID puncture?
Issue:

How to fix RAID arrays that have been subjected to a puncture?

Solution:

Complete the following steps to resolve the issue:
WARNING: Following these steps will result in the loss of all data on the array. Ensure that you are
prepared to restore from backup or other means prior to following these steps. Use caution so that
following these steps does not impact any other arrays.
1

Discard Preserved Cache, if it exists.

2

Clear foreign configurations, if any.

3

Delete the array.

4

Shift the position of the drives by one.
Move Disk 0 to slot 1, Disk 1 to slot 2, and Disk 2 to slot 0.

5

Recreate the array as desired.

6

Perform a Full Initialization of the array (not a Fast Initialization).

7

Perform a Check Consistency on the array.

If the Check Consistency completes without errors, you can safely assume that the array is now healthy and the
puncture is removed. Data can now be restored to the healthy array.

Preventing problems before they happen and solving punctures after they
occur
Dell's RAID controllers contain a number of features to prevent many types of problems and to handle a variety of errors that do occur. The
primary job of a RAID controller is to preserve the integrity of the data contained on its array(s). Even in the more extreme cases of damage
(such as punctures), the array's data is often available and the server can remain in production. Part of any maintenance plan should be the
proactive maintenance of the RAID arrays. Dell's RAID controllers are highly reliable and very good at managing its arrays without user
intervention. Disregarding proper maintenance can cause even the most sophisticated technologies to experience problems over time.
There are a number of things that can help maintain the health of arrays, and prevent the majority of data errors, double faults and
punctures.
It is highly recommended to perform routine and regular maintenance. Proactive maintenance can correct existing errors, and prevent some
errors from occurring. It is not possible to prevent all errors from occurring, but most serious errors can be mitigated significantly with
proactive maintenance. For storage and RAID subsystems these steps are:
•

Update drivers and firmware on controllers, hard drives, backplanes and other devices.

Troubleshooting hardware issues

81

•

Perform routine Check Consistency operations (Dell recommends every 30 days).

•

Inspect cabling for signs of wear and damage and ensure good connections.

•

Review logs for indications of problems.
This doesn’t have to be a high level technical review, but could simply be a cursory view of the logs looking for extremely obvious
indications of potential problems. Contact Dell Technical Support with any questions or concerns.

Blank or skipped RAID configuration screen during SMTD DVD
installation
Issue:

Some Seagate drives used as internal server storage shipped with the incorrect Dell firmware and may experience
the following symptoms. (External storage is not affected)
1

2

During an installation from System Management Tool and Documentation (SMTD) DVD.
•

If there is no Redundant Array of Independent Disks (RAID) configuration from Ctrl + R and the DVD is
booted, it will display a blank white screen where there should be array configuration setup.

•

If there is a RAID configuration from Ctrl + R already present on the drives, it boots to the SMTD DVD but
skip the RAID configuration screen. The menu item for RAID configuration in the left window does not
appear in the action list. Installation can be completed using this method.

Enclosure or server hard drives are being configured or added in an existing Linux installation.
The drives may not be seen in the Linux browser that is using OpenManage Storage Management to
configure the drives or the Vendor ID for the disk drive may appear with garbled characters.

Solution:

Model

Capacity

Part Number

Correct FW

Shipped FW

ST936751SS

36GB

RN829

SM04

SM07

ST973451SS

73GB

XT764

SM04

SM07

ST973402SS

73GB

HT952

S229

S22B

ST9146802SS

146GB

CM318

S229

S22B

ST373355SS

73GB

GY582

T211

T213

ST3146755SS

146GB

TN938

T211

T213

ST3300555SS

300GB

HT954

T211

T213

ST3400755SS

400GB

GY583

NS25

NS27

ST373455SS

73GB

GY581

S528

S529

ST3146855SS

146GB

HT953

S528

S529

ST3300655SS

300GB

TN937

S528

S529

If the shipped version in the above chart matches the current drive firmware version then a firmware flash is
required.
For drives that are installed in a server with an existing operating system, use the latest Dell Update Package
(DUP) to roll-back or update the drives.

82

Troubleshooting hardware issues

For drives that are experiencing the installation problem, use Release R212231 to flash the drives. R212231 will rollback the drives listed above to the correct firmware version without the need for user input. Create the bootable
USB or CD using the dddp.exe included in the release.
NOTE: If a drive has a version that is incrementally newer than the shipped version above then a rollback is not required.
The update will take about 5 minutes plus 1 minute for each additional drive in the server. Restart the server
installation after rolling-back the drives.

Troubleshooting thermal issue
Thermal issues can occur due to malfunctioning ambient temperature sensors, malfunctioning fans, dusty heat sinks, and malfunctioning
thermal sensors and so on.
To resolve the thermal issues:
1

Check the LCD and Embedded System Management (ESM) logs for any additional error messages to identify the faulty component.

2

Ensure that airflow to the machine is not blocked. Placing it in an enclosed area or blocking the air vent, can cause it to overheat. If
installed in a rack, ensure that the rack cooling system is working normally.

3

Check for the ambient temperature is within acceptable levels.

4

Check the internal system fans for obstructions and ensure that all fans are spinning properly. Swap any failing fans with a knowngood fan for testing.

5

Ensure that all the required shrouds and blanks are installed.

6

Check if all the fans are functioning properly, the heat sink is installed correctly, and thermal grease is applied.

Troubleshooting hardware issues

83

5
Server management software issues
This section helps to manage software issues related to the server management.
Topics:
•

How to set up Auto Dedicated NIC feature?

•

How do I configure RAID using operating system deployment wizard?

•

How to set up e-mail alerts?

•

How to export license using iDRAC web interface?

•

How to specify language and keyboard type?

•

How to configure network settings using Lifecycle Controller?

•

What are the different types of iDRAC licenses?

•

What are the differences between iDRAC7 and iDRAC8?

•

Can I upgrade the iDRAC license from express to enterprise and BMC to express?

•

How to activate license on iDRAC?

•

How to find out missing licenses?

•

How to update BIOS on 13th generation PowerEdge servers?

•

Which are the operating systems supported on PowerEdge servers?

How to set up Auto Dedicated NIC feature?
The Auto Dedicated NIC feature provides the option to automatically reroute the iDRAC management traffic for the scenarios such as
connecting a crash cart or reconfiguring network cables. When this feature is enabled, iDRAC automatically and dynamically detects a
system's network mode. It senses the system's network cable configuration and checks if a cable is connected to the system's dedicated
NIC port.
This feature was made available with iDRAC7 starting with the firmware version 1.30.30 with an Enterprise license. For the Dell PowerEdge
rack and tower servers of 600 series and above, the dedicated NIC port is standard on the system, but requires an iDRAC7 Enterprise
license to enable the port. Dell PowerEdge rack and tower servers of series 500 and below comes with an add-in card if ordered with an
Enterprise license at the point-of-sale. If an Enterprise license is purchased later than point-of-sale, the add-in card must also be purchased
to have the dedicated NIC port.
NOTE: This feature is not available on blade servers.
Enable the Auto Dedicated NIC feature using iDRAC web interface:
•

Log on to the iDRAC7 web interface.

•

Click Overview, select iDRAC Setting, and then select Network.

•

In the Network Settings section, select the Auto Dedicated NIC.

NOTE: The system must have a valid Enterprise license.

NOTE: Auto Dedicated NIC is disabled by default, this check box is no selected when you first log in to iDRAC7.

84

Server management software issues

NOTE: For the Dell PowerEdge blade servers, the NIC Selection field is always set at Dedicated which means that Auto
Dedicated NIC is always disabled and not supported. Auto Dedicated NIC is supported on the Dell PowerEdge tower and rack
servers only. A new read-only field Active NIC Interface displays the currently active network interface on the iDRAC7 web
interface.
For more information about Auto dedicated NIC feature, see En.community.dell.com/techcenter/extras/m/white_papers/20275980

How do I configure RAID using operating system
deployment wizard?
You can configure RAID using an operating System deployment wizard on the Dell Lifecycle controller. To configure RAID:
•

Launch the Lifecycle Controller.

•

In the left pane, click OS Deployment.

•

On the OS Deployment page, click Deploy OS.

•

On the Deploy OS page, click Configure RAID First, and then click Next.

•

The storage controllers available for configuration are displayed in the RAID Configuration page.

•

Select a storage controller. The RAID configuration options are displayed.

•

Follow the instruction on the screen, complete the RAID setting tasks, and then click Finish.

NOTE: Make sure that the selected controller is not in a non-RAID mode.

The RAID configuration is applied to the virtual disks.
To create virtual disks using RAID configuration at the preferred RAID level, watch the Dell Lifecycle Controller RAID configuration video on
YouTube https://www.youtube.com/watch?v=JFPmeHNENV4

How to set up e-mail alerts?
You can set up an e-mail alert message which can be sent to one or more e-mail addresses. When iDRAC senses a platform event, such as
an environmental warning or a component failure, an alert message is sent to a designated e-mail address.
To configure e-mail alerts:
•

Log in to the iDRAC web interface.

•

On the left pane, click Alerts.

•

Click SNMP and Email Settings tab.

•

Navigate to the Destination the Email Addresses section.

•

Select the State field, enter the Destination Email Addresses, and click Apply.
NOTE: Ensure that the platform event filters are configured before configuring the e-mail alert settings.
NOTE: If the mail server is Microsoft Exchange 2007, you must configure the iDRAC domain name to receive alerts from the
iDRAC.

How to export license using iDRAC web interface?
To export the license using iDRAC web interface:
•

Log in to the iDRAC web interface.

•

On the left navigation pane of the System Summary screen, click Licenses.

•

If you have a single license, you can initiate the license export process by directly navigating to the License Options drop-down list. If
you have multiple licenses, expand the Licenses table and navigate to the particular license, and select Export from the License
Options drop-down list.

Server management software issues

85

•

Save and click OK.

The license is exported successfully. Check your download folder to verify if the export license process is successful.
For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892

How to specify language and keyboard type?
To specify the language and keyboard type using Lifecycle Controller:
•

Launch the Lifecycle Controller.

•

On the left pane, click Settings.

•

On the Settings pane, click Language and Keyboard.

•

From the Language drop-down menu, select the language.

•

From the Keyboard Type drop-down menu, select the keyboard type.

•

Click Finish to save the new settings.

How to configure network settings using Lifecycle
Controller?
About this task
To configure network settings using Lifecycle Controller:
Steps
1
Launch the Lifecycle Controller.
2

On the left pane, click Settings.

3

On the Settings pane, click Network Settings.

4

From the NIC Card drop-down menu, select the NIC port that you want to configure.

5

From the IPV4 Network Settings→ IP Address Source drop-down menu, select one of the following options:

NOTE: You can use only one NIC at a time to communicate with the network.
•

No Configuration—indicates that the NIC must not be configured.

•

DHCP—indicates that the NIC must be configured using an IP address from a DHCP server. If DHCP is selected, a DHCP IP
address is displayed on the Network Settings page.

•
6

Static IP—indicates that the NIC must be configured using a static IP. Type the IP Address Properties—IP Address, Subnet
Mask, Default Gateway, and DNS Address. If you do not have this information, contact your network administrator.
From the IPV6 Network Settings→ IP Address Source drop-down menu, select one of the following options:
•

No Configuration—indicates that the NIC must not be configured.

•

DHCPv6—indicates that the NIC must be configured using an IP address from a DHCPv6 server. If DHCPv6 is selected, a
DHCPv6 IP address is displayed on the Network Settings page.
NOTE: While configuring DHCP server with IPv6, the configuration fails if you disable forwarding or advertising
options.

•
7

Static IP—indicates that the NIC must be configured using a static IP. Type the IP Address Properties—IP Address, Subnet
Mask, Default Gateway, and DNS Address. If you do not have this information, contact your network administrator.
Click Enabled, and type the VLAN ID and Priority under Lifecycle Controller VLAN Settings.
You cannot configure the VLAN settlings of the following NICs:

86

•

Emulex SeaHawk-2 (FH) PCIe Adapter

•

Emulex SeaHawk-2 (LP) PCIe Adapter

•

Emulex Vindicator-2 rNDC

•

Emulex Sea Stallion-2 Mezzanine Card

•

Emulex Pave Low-2 bNDC

Server management software issues

•

Emulex SeaHawk-2 (FH) NIC Only PCIe Adapter

•

Emulex SeaHawk-2 (LP) NIC Only PCIe Adapter

•

Emulex Vindicator-2 NIC Only rNDC

•

Emulex Sea Stallion-2 NIC Only Mezzanine Card

• Emulex Pave Low-2 NIC Only bNDC
Click Next.

8

NOTE: If the Lifecycle Controller settings are not correctly configured, an error message is
displayed.
NOTE: If you are unable to connect to a network, verify the settings. For information about correct network settings,
contact your network administrator.

What are the different types of iDRAC licenses?
There are three types of the iDRAC licenses:
•

Basic Management — Basic Management which was earlier known as BMC, was the default iDRAC version for the 11th generation of
the servers ranging between 200– 500 series.

•

Express — iDRAC Express is the standard, default offering that is part of the base configuration for 600 series and above. It does not
require license installation, back up, or license management. Express offers embedded tools, console integration, and simplified remote
access.
•

•

Express for Blades — For the 12th and 13th generation blade servers, there is a new Express for Blades iDRAC version. It offers a
single-user virtual console session and virtual media in addition to the standard Express offerings.

Enterprise — There are two types of Enterprise licenses, based on the type and the duration.
•

Evaluation - This license is for a 30-day trial of certain features. It can be used on any system.

•

Perpetual - This license is valid for the life of the product. It does not expire and never needs to be renewed. It must be bound to
only one service tag at a time.

NOTE: This is not recommended for the production environment.

For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892

What are the differences between iDRAC7 and
iDRAC8?
The overview of the key features and the differences between iDRAC7 and iDRAC8 are compiled in the En.community.dell.com/
techcenter/extras/m/white_papers/20440743. It provides the different licenses that are available for the servers and also includes key
features of each license.

Can I upgrade the iDRAC license from express to
enterprise and BMC to express?
You can upgrade the iDRAC license from Express to Enterprise or BMC to either Express or Enterprise without having to open the box or
install any hardware.
Request for the type of the iDRAC license you require by providing the service tag number of your server. Once you receive the license file
in the .xml format, save it in your local drive to replace the license with the existing one.
To upgrade/ replace the existing license:
•

Log in to the iDRAC web interface

•

On the left navigation pane, clickOverview, click Server, and then click Licenses.

Server management software issues

87

•

From the License Options drop-down list, select Replace. Click Browse to navigate to the local folder where the license file is saved.

•

Select the license file and click Open.

•

Click Apply to upgrade/ replace the new license file.

The purchase of a new PowerEdge server with iDRAC8 is the same as it was with iDRAC7. For the 600 series servers and above, all the
hardware required for iDRAC Enterprise is on the motherboard. Not only does this eliminate the possibility of a card unseating during
shipping, but it also allows for an upgrade if available. If Enterprise is ordered from Dell at point of sale on 200-500 series servers, the
necessary hardware is installed in the factory. This piece of hardware is the Dell Ports card, which sits in a PCIe slot and has the RJ-45
connector for the Dedicated NIC and the slot for the vFlash SD card.
For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892

How to activate license on iDRAC?
You can manage your licenses by creating your account and access License Management portal. It enables you to view your digital
licenses, obtain a replacement license, or reassign licenses to different hardware.
To create an account:
•

Go to www.dell.com/support/retail/lkm.

•

Click Create an Account

•

Enter the required information and click Create Account.

•

Click Sign In, the end user license agreement is displayed.

•

Review the Dell Software License Agreement, and click Yes, I Agree button. You are redirected to the License Management portal.

To activate the license:
•

Log in to the License Management portal.

•

Click Activate Licenses tab.

•

To activate license, navigate to the license and click Assign License.
NOTE: All the licenses must be associated with the asset. The Activate Licenses tab lists the licenses that must be bound to
an asset.

For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892

How to find out missing licenses?
If you misplace your original license and need to retrieve the original license, follow the steps to find out missing license:
•

Log in to the License Management portal.

•

Click Manage Licenses tab.
NOTE: All the licenses must be associated with the asset. The Manage Licenses tab lists the licenses that are associated with
an asset.

•

To download license, navigate to the license and click Get Key. Deliver My License Key window is displayed.

•

To download the license directly to your computer, select Download and then click Submit. Select Email if you want the license key on
an email.

For more information on the iDRAC licensing feature, see En.community.dell.com/techcenter/extras/m/white_papers/20067892.

88

Server management software issues

How to update BIOS on 13th generation PowerEdge
servers?
The 13th generation of Dell PowerEdge servers offers various methods namely local or remote, with or without an operating system, to
update the system BIOS. The different methods are listed below. You can choose the method that best suits your need and environment.
•

Executing the BIOS Dell Update Package (DUP) from within the operating system.

•

Using the UEFI-based BIOS flash utility in a preboot environment.

•

Using the Lifecycle Controller Platform Update option—F10.

•

Using the Update and Rollback feature in the iDRAC web GUI.

•

Using the WS-MAN based one to many Remote Update method—Remote Enablement
NOTE: Legacy DOS-based BIOS update utility is no longer supported.

For detailed information about different methods of updating BIOS see En.community.dell.com/techcenter/extras/m/white_papers/
20440526

Which are the operating systems supported on
PowerEdge servers?
Dell collaborates extensively with Microsoft to ensure the consistent and reliable performance of the Microsoft operating systems running
on the Dell PowerEdge servers. For more information on the Microsoft Server operating systems supported on specific PowerEdge
servers, see En.community.dell.com/techcenter/extras/m/white_papers/20438149/download.aspx

Server management software issues

89

6
Troubleshooting operating system issues
This section helps you troubleshoot operating system issues in your system.
NOTE: If the problem persists, contact Dell Technical Support for further assistance.

Topics:
•

Troubleshooting blue screen errors (BSODs)

•

Troubleshooting a Purple Screen of Death (PSOD)

•

Troubleshooting no boot issues for Windows operating systems

•

No POST issues (iDRAC)

•

Troubleshooting a No POST situation

•

Windows

•

VMware

•

Linux

Troubleshooting blue screen errors (BSODs)
1

Identify the stop code or the stop message when the system displays the blue screen of death.

Figure 24. Blue screen of death
2

Run the PSA/ePSA diagnostics. For more information, see PSA/ePSA Diagnostics.

3

If the diagnostics pass and the issue persists, identify the stage in which the blue screen error occurs.

4

If the BSOD occurs during the boot process, check for minimum to POST components. For more information, see Troubleshooting a
No POST situation.
If the issue persists, call Dell Technical Support.

5

90

If the BSOD occurs during pre-logon, boot into Safe Mode and disable all startup entries. Isolate one startup entry at a time until you
find the causing agent.

Troubleshooting operating system issues

NOTE: If you are unable to boot into Safe Mode, call Dell Technical Support.
6

If the issue persists, check for minimum to POST components. For more information, see No POST .
If the issue persists, call Dell Technical Support.

7

If the BSOD occurs intermittently or during post-logon, debug mini crash dump files using WinDbg. For more information, see
Debugging mini crash dump files using by WinDbg in Windows operating system.
After debugging, call Dell Technical Support with the minidump file.

Troubleshooting a Purple Screen of Death (PSOD)
For information on Troubleshooting a PSOD, see Troubleshooting a Purple Screen of Death and Interpreting a purple screen of death.

Troubleshooting no boot issues for Windows operating
systems
1

Check the hard drive status in the PERC BIOS. For more information, see Checking hard drive status in the PERC BIOS.

2

Boot the server in the safe mode.
NOTE: If the server is a domain controller, boot the server in Directory Services Repair Mode (DSRM).
If the server boots successfully, the issue is with an installed driver, application, or service. Proceed to Step 4.

3

For Windows Server, you can perform the following recovery options:
•

Use the bootrec command to troubleshoot startup issues. For more information, see https://support.microsoft.com/en-in/kb/
927392.

•

Use the chkdsk tool to determine whether there is a disk problem. For more information, see https://technet.microsoft.com/enus/library/cc730714.aspx

•

Use DiskPart to verify the status of disk partitions. For more information. see https://technet.microsoft.com/en-in/library/
bb490893.aspx.

•

Use the bcdedit utility to view or modify the boot configuration database (BCD). For more information, see https://
technet.microsoft.com/en-us/library/cc731662.aspx.
NOTE: For additional recovery console commands, see https://support.microsoft.com/en-us/kb/326215.
NOTE: For more troubleshooting steps, see https://support.microsoft.com/en-us/kb/325375.

4

From within the safe mode or DSRM, use the msconfig utility to disable startup applications and non-Microsoft services on the server.

5

Reboot the server in the normal mode.

6

If the server boots, use the msconfig utility to identify the disrupting service or application by enabling services and startup
applications one at a time and rebooting the server until the server fails to boot.

7

If the issue persists, use Dell's Linux-based OMSA LiveCD to boot the system and run diagnostic tests to determine whether any
hardware is malfunctioning.
OMSA LiveCD can also be used to recover data from a server whose operating system does not start.

No POST issues (iDRAC)
This section provides details on troubleshooting iDRAC issues.

“First Boot Device cannot be set” error message is displayed
when configuring a boot device during POST.
Description

Troubleshooting operating system issues

91

The error message “First Boot Device cannot be set. Either the system BIOS is out-of-date, or the server needs a reboot for the settings to
take effect” displays in the POST mode.
Resolution
Allow the server to completely boot to the OS or turn off the server before setting a vFlash partition to the first boot device. This allows
the server to boot to the vFlash partition and the error will no longer be seen.

“Alert! iDRAC6 not responding.. Power required may exceed
PSU wattage...” error message is displayed at POST during a
reboot.
Description
An error message “Alert! iDRAC6 not responding.. Power required may exceed PSU wattage. Alert! Continuing system boot accepts risk
that system may power without warning. Strike the F1 key to continue, F2 to run the system setup program" is displayed at POST during a
reboot.
Resolution
Perform the following steps:
1

Reboot the server to resolve the issue.

2

Remove the AC power for 30 seconds. Once AC power is re-applied, allow two minutes for the iDRAc to complete POST or initialize.

3

Check if the Network Adapter is updated to the latest firmware.

Troubleshooting a No POST situation
Power On Self Test (POST) is a series of diagnostic tests that run automatically when you turn on your system. POST tests the memory,
the keyboard and the disk drivers. If the test is successful, the computer boots itself, else the system displays an LED error or an error
message in the LCD panel. This situation is called No POST.
Prerequisites
CAUTION: Many repairs may only be done by a certified service technician. You should only perform troubleshooting and simple
repairs as authorized in your product documentation, or as directed by the online or telephone service and support team.
Damage due to servicing that is not authorized by Dell is not covered by your warranty. Read and follow the safety instructions
that are shipped with your product.
NOTE: When removing or replacing parts, always turn off the server, remove the power cord, and wait for ten seconds for the
static flea power to drain. Reconnect the power cord, wait for a minute and turn on the server. This allows time for the
Baseboard Management Controller (BMC) to power up. Error messages may not be reported correctly if the static flea power is
not fully drained. For more information on removing and installing hardware components, see your system’s Owner’s Manual at
Dell.com/poweredgemanuals.
NOTE: Ensure that you give enough time for the server to POST. Newer systems may take up to three minutes before any video
appears during the POST. During this period, a message on the LCD screen is displayed, which indicates that the server is
booting.
Steps
1

Check the LCD screen or LED indicators for any error messages.
For more information about the error messages, see the Dell Event and Error Messages Reference Guide at Dell.com/
openmanagemanuals > OpenManage software.

2

Ensure that the server is turned on by verifying that the power supply LED glows green.
If the power LED is lit amber, see Power supply unit indicator codes.

3

92

Remove all the Electrostatic Discharge (ESD) from the server.

Troubleshooting operating system issues

a
b
c
d
e

Turn off the server.
Disconnect all cables from the server including the power cable.
Press and hold the power button for 60 seconds to discharge.
Reconnect the power and video cable only.
Turn on the server.
If the server fails to POST, proceed to the next step.

4

Disconnect all cables from the server including the power cable.

5

Bring the server to the minimum configuration for the POST.
NOTE: Minimum to POST configuration is the config that has the minimum components required to complete POST.
Typically, the minimum to POST configuration for rack servers is PSU1, CPU1, memory module in A1 slot, and the default
riser without expansion cards. For tower servers, the minimum to POST configuration is PSU1, CPU1, and memory module
in A1 slot. For modular servers, the minimum to POST configuration is CPU1 and memory module in A1 slot.

6

Reconnect the power and video cable only.

7

Attempt to POST the server.

8

a

If the server completes the POST, turn off the server and plug the components one at a time until the defective part is found.

b

If you identify the defective part, contact Dell Technical Support with information about the defective part.
If you are unable to identify the defective part, go to the next step.

Disconnect the hard drives, optical drives, and tape drives from the server and attempt to POST the server.
a

If the server completes the POST, plug the hard drives back one at a time until the defective hard drives are found.

b

If you identify the defective part, contact Dell Technical Support with information about the defective part.
If you are unable to identify the defective part, go to the next step.

9

Reseat the control panel connector.

10

Ensure that the processors and heat sinks are seated correctly.

11

If the server does not complete the POST, clear the NVRAM using the jumper.
For more information , see your system’s Owner’s manual at Dell.com/poweredgemanuals.

Next step
If the issue still persists, contact Dell Technical Support for assistance.

Windows
FAQs
How to resolve the yellow bangs in device manager for Dell 12th
Generation Servers?
Description
After installing Windows Server 2012 R2 on Dell PowerEdge 12th Generation Servers, two yellow bangs appear in the Device Manager
under Hidden Devices: PCI SIMPLE COMMUNICATIONS CONTROLLER. These devices do not impact the server.
Resolution
Download and install chipset drivers from Dell.com/support for the respective servers.

Troubleshooting operating system issues

93

Why are the USB keyboard and mouse not detected during the Windows
Server 2008 R2 SP1 installation?
Cause
This issue occurs because Windows Server 2008 R2 SP1 does not have native USB 3.0 driver support.

Resolution
1

In System Setup, ensure that the USB 3.0 option on the Integrated Device Settings screen is set to Disable.
NOTE: By default, the USB 3.0 option is disabled. If enabled, the operating system fails to detect the USB devices such as
keyboard, mouse, and USB DVD. Windows Server 2008 R2 SP1 supports out-of-box drivers for USB 3.0, and are available
at Dell.com/support.

2

Install the drivers after installing the OS

3

Restart the system.

4

In System Setup, ensure that the USB 3.0 option on the Integrated Device Settings screen is set to Enable.
NOTE: Ensure that you first install the drivers, and then restart the system to enter System
Setup,

Why does the installation wizard stop responding during the Windows OS
installation?
Cause
This issue occurs because the Windows 2008 R2 SP1 does not support Secure Boot.

Resolution:
Before installing Windows 2008 R2 SP1, ensure that the Secure Boot option in System Setup is set to Disable.
NOTE: Secure Boot is supported only on Windows 2012 R2 and Windows 2012 R2 with native storage controller and out-of-box
drivers. Secure Boot is supported only in the UEFI mode.

Why does Windows OS installation using Lifecycle Controller, on
PowerEdge Servers fail at times with an error message?
Cause
This issue occurs when the size of the drivers exceed the temporary storage space provided by the OS.

Resolution:
Before installing the OS, ensure that there are no add-on devices installed on the system. To avoid removing the hardware manually, you
can also disable the PCIe slots in the BIOS configuration utility.

94

Troubleshooting operating system issues

Why does Windows Server 2008 R2 SP1 display a blank screen in UEFI
mode after installation?
After installing Windows Server 2008 R2 SP1 in the UEFI mode, either using Lifecycle Controller (LC) or manually, may display a blank
screen while booting to the OS.

Cause
This issue occurs because the Windows Server 2008 R2 does not support Graphics Output Protocol (GOP).

Resolution
In the BIOS configuration utility, ensure that Load Legacy Option Rom on the Miscellaneous Settings screen is set to Enable.
Restart your system and boot to the operating system again.

Symptoms
Booting to iSCSI or FCoE fails
Description
When trying to install Windows Server 2012 R2 operating system on an iSCSI or FCOE LUN, you may see a failure either during the
operating system installation or at first boot.
Resolution
This is a known issue. This issue has been fixed in operating systems pre-installed by Dell and in the recovery media shipped with your
system. For more information, see the knowledge base article KB2894179 at support.microsoft.com.

VMware
FAQs
Why are VMs configured with Fault Tolerance not in a protected state in
ESXi 6.0?
For some PowerEdge systems with AMD 6300 series processor, VMs configured with Fault Tolerance (FT) might not be in a protected
state. Sometimes, secondary VM takes more time to attain the protected state. This is a known issue. Affected systems include PowerEdge
systems R815, R715 and M915.

Troubleshooting operating system issues

95

Symptoms
Dell PowerEdge Express Flash NVMe PCIe SSD device is not detected
during hot-plug in ESXi 6.0
Description
When the Express Flash NVMe PCIe SSD namespace is set as Offline and hot-plug operation is performed in the same slot, the SSD does
not initialize and is not detected.
This is a known issue. Perform one of the following steps:
1

Reconnect the drive to a different drive slot, if available.

2

Restart the sfcbd-watchdog service and reinsert the drive.

Linux
FAQs
Why are network ports displayed as unknown in YaST2 network
configuration in SUSE Linux Enterprise?
While configuring a bonding interface, the configured bond-slaves are listed as Unknown Network Device. The YaST installer is unable to
write the device name in to the ifcfg files. Delete the existing configuration of previously configured network interfaces.

Symptoms
Unable to boot from iSCSI when static IPv6 address is assigned to the
initiator
Description
The SUSE Linux Enterprise Server 12 does not boot from iSCSI over IPv6 address after successful installation, when static IPv6 address is
assigned to the initiator.
Resolution
Use Dynamic Host Configuration Protocol (DHCP) address for the initiator.

96

Troubleshooting operating system issues

7
Getting help
Topics:
•

Contacting Dell

•

Locating Service Tag of your system

Contacting Dell
Dell provides several online and telephone based support and service options. If you do not have an active internet connection, you can find
contact information about your purchase invoice, packing slip, bill, or Dell product catalog. Availability varies by country and product, and
some services may not be available in your area. To contact Dell for sales, technical assistance, or customer service issues:
1

Go to Dell.com/support.

2

Select your country from the drop-down menu on the lower right corner of the page.

3

For customized support:
a
b

Enter your system Service Tag in the Enter your Service Tag field.
Click Submit.
The support page that lists the various support categories is displayed.

4

For general support:
a
b
c

Select your product category.
Select your product segment.
Select your product.
The support page that lists the various support categories is displayed.

5

For contact details of Dell Global Technical Support:
a
b

Click Global Technical Support.
The Contact Technical Support page is displayed with details to call, chat, or e-mail the Dell Global Technical Support team.

Locating Service Tag of your system
Your system is identified by a unique Express Service Code and Service Tag number. The Express Service Code and Service Tag are found
on the front of the system by pulling out the information tag. Alternatively, the information may be on a sticker on the chassis of the
system. The mini Enterprise Service Tag (EST) is found on the back of the system. This information is used by Dell to route support calls to
the appropriate personnel.

Getting help

97

Figure 25. Locating Service Tag of your system
1

Information tag (top view)

2

Information tag (back view)

3

Open Manage Mobile (OMM) label

4

MAC address and secure password label

5

Service tag label

98

Getting help



Source Exif Data:
File Type                       : PDF
File Type Extension             : pdf
MIME Type                       : application/pdf
PDF Version                     : 1.6
Linearized                      : No
Author                          : Dell
Create Date                     : 2018:01:08 16:13:48+00:00
Modify Date                     : 2018:01:08 16:13:48+00:00
Creator                         : AH XSL Formatter V6.3 MR3 for Windows (x64) : 6.3.4.25128 (2016/07/01 17:39JST)
Producer                        : Antenna House PDF Output Library 6.3.815 (Windows (x64))
Keywords                        : manual, guide, documentation, Troubleshooting , Guide, PowerEdge.Servers, SupportAssist, Dell System.E-Support Tool, ePSA, Embedded.System Diagnostics, No boot, No POST, Mini crash dump files, Blue Screen of Death, BSOD, PERC, hard drive.failed, sense keys, predictive.failure, medium error, RAID puncture, BadLBA
Title                           : Dell EMC PowerEdge Servers Troubleshooting Guide
Subject                         : Troubleshooting Guide
Trapped                         : False
Page Count                      : 98
Page Mode                       : UseOutlines
Tagged PDF                      : Yes
Language                        : EN
EXIF Metadata provided by EXIF.tools

Navigation menu