Sun Microsystems, Inc.
Sun Fire X4140, X4240, and X4440 Servers Diagnostics Guide
August 2008, Revision A
Contents
2. Using SunVTS Diagnostic Software
3. Troubleshooting DIMM Problems
Preface
Status Indicator LEDs
Using the ILOM Service Processor GUI to View System Information
Error Handling
A. Event Logs and POST Codes
Handling of Uncorrectable Errors
Handling of Correctable Errors
Handling of Parity Errors (PERR)
Handling of System Errors (SERR)
Preface
Before You Read This Document
Related Documentation
http://docs.sun.com
Typographic Conventions
Third-Party Web Sites
Sun Welcomes Your Comments
Service Troubleshooting Flowchart
“Service Troubleshooting Flowchart”
“Gathering Service Information”
“System Inspection”
Initial Inspection of the Server
Gathering Service Information
1. Collect information about the following items
2. Document the server settings before you make any changes
4. Check for potential device conflicts before you add a new device
System Inspection
Troubleshooting Power Problems
Externally Inspecting the Server
Internally Inspecting the Server
2. Remove the server cover
Using SunVTS Diagnostic Software
Running SunVTS Diagnostic Tests
SunVTS Documentation
Diagnosing Server Problems With the Bootable Diagnostics CD
Requirements
Using the Bootable Diagnostics CD
c. With the three lower buttons you can perform the following actions
a. Click the Log button
Close the Log file window - The window is closed
Troubleshooting DIMM Problems
“DIMM Population Rules”
“DIMM Replacement Policy”
“How DIMM Errors Are Handled by the System”
“Isolating and Correcting DIMM ECC Errors”
How DIMM Errors Are Handled by the System
DIMM Replacement Policy
Uncorrectable DIMM Errors
# ipmitool -H 10.6.77.249 -U root -P changeme -I lanplus sel list
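When the system event log has to be collected from several servers, the command above can be wrapped in a short script. The following is a minimal sketch in Python, not part of the Sun tooling. The function names build_sel_list_command and read_sel are hypothetical, and the address and credentials are the example values from the command above. The sketch assumes that ipmitool is installed on the machine running it and that the service processor is reachable over the network.

```python
import shlex
import subprocess


def build_sel_list_command(host, user, password):
    """Return the argument list for a remote `ipmitool ... sel list` call.

    Hypothetical helper: it only assembles the same options that appear
    in the example command (-H, -U, -P, -I lanplus, sel list).
    """
    return [
        "ipmitool",
        "-H", host,        # address of the service processor
        "-U", user,        # SP user name
        "-P", password,    # SP password
        "-I", "lanplus",   # IPMI v2.0 LAN interface
        "sel", "list",     # list the system event log
    ]


def read_sel(host, user, password, timeout=30):
    """Run the command and return its output as a list of lines.

    Requires ipmitool on the local machine and network access to the SP.
    Raises subprocess.CalledProcessError if ipmitool exits with an error.
    """
    result = subprocess.run(
        build_sel_list_command(host, user, password),
        capture_output=True,
        text=True,
        check=True,
        timeout=timeout,
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # Show the command line that would be run, without contacting any server.
    print(shlex.join(build_sel_list_command("10.6.77.249", "root", "changeme")))
```

Passing the arguments as a list avoids shell quoting problems if a password contains spaces or special characters. Supplying the password on the command line exposes it to other local users through the process list, so treat this as a starting point for lab use only.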
Correctable DIMM Errors
BIOS DIMM Error Messages
DIMM Fault LEDs
DIMM fault LED is off - The DIMM is operating properly
FIGURE 3-1 DIMMs and LEDs on Motherboard
Isolating and Correcting DIMM ECC Errors
4. Disconnect the AC power cords from the server
8. Dust off the DIMMs, clean the contacts, and reseat them
9. If there is no obvious damage, replace any failed DIMMs
10. Reconnect AC power cords to the server
11. Power on the server and run the diagnostics test again
12. Review the log file
Event Logs and POST Codes
“Viewing Event Logs”
“Power-On Self-Test (POST)”
Viewing Event Logs
Main Advanced PCIPnP Boot Security Chipset Exit
The Advanced Menu Event Logging Details screen is displayed
Power-On Self-Test (POST)
How BIOS POST Memory Testing Works
Redirecting Console Output
11. Click the Start Redirection button
Changing POST Options
3. Select Boot Settings Configuration
POST Codes
POST Code Checkpoints
Initializes NUM-LOCK status and programs the KBD typematic rate
Status Indicator LEDs
External Status Indicator LEDs
Front Panel LEDs
Back Panel LEDs
Rear PS LED: Amber - Power supply fault
Internal Status Indicator LEDs
Hard Drive LEDs
FIGURE B-4 DIMMs and LEDs on Motherboard
FIGURE B-5 DIMMs and LEDs on Mezzanine Board
Using the ILOM Service Processor GUI to View System Information
“Making a Serial Connection to the SP”
“Viewing ILOM SP Event Logs”
“Viewing Replaceable Component Information”
Making a Serial Connection to the SP
cd /SP/console
start
Viewing ILOM SP Event Logs
You can select from the following types of events
Interpreting Event Log Time Stamps
Viewing Replaceable Component Information
2. From the System Information tab, select Components
Viewing Sensors
4. Click a sensor to display its thresholds
FIGURE C-3 Sensor Readings Page
FIGURE C-4 Sensor Details Page
Error Handling
Handling of Uncorrectable Errors
“Handling of Uncorrectable Errors”
“Handling of Correctable Errors”
Note the following considerations for this revision
FIGURE D-1 DMI Log Screen, Uncorrectable Error
Handling of Correctable Errors
The BIOS logs an SEL record
The BIOS logs an event in DMI
FIGURE D-2 DMI Log Screen, Correctable Error
EXAMPLE D-1 DMI Log Screen, Correctable Error, Memory Decreased
Handling of Parity Errors (PERR)
FIGURE D-3 DMI Log Screen, PCI Parity Error
Handling of System Errors (SERR)
DMI Log Screen with Error
Handling Mismatching Processors
The BIOS performs a complete POST
No SEL or DMI event is recorded
The system enters Halt mode and the following message is displayed
Hardware Error Handling Summary
sync flood error occurred on last boot, press F1 to continue
The Front Fan Fault, Service Action Required
tach signals
Multiple fan
Index