Sun Microsystems X4240, X4440, X4140 manual Sp Sel

Page 75

TABLE D-1

Hardware Error Handling Summary

(Continued)

 

 

 

 

 

 

 

 

 

Logged (DMI

 

 

 

 

Log or SP

 

Error

Description

Handling

SEL)

Fatal?

 

 

 

 

 

Single-bit

With ECC enabled

DRAM ECC

in the BIOS Setup,

error

the CPU detects

 

and corrects a

 

single-bit error on

 

the DIMM interface.

The CPU corrects the error in hardware. No SP SEL

Normal

interrupt or machine check is generated by

operation

the hardware. The polling is triggered every

 

half-second by SMI timer interrupts and is

 

done by the BIOS SMI handler.

 

The BIOS SMI handler starts logging each

 

detected error and stops logging when the

 

limit for the same error is reached. The BIOS's

 

polling can be disabled through a software

 

interface.

 

Single four-bit

With CHIP-KILL

DRAM error

enabled in the BIOS

 

Setup, the CPU

 

detects and corrects

 

for the failure of a

 

four-bit-wide

 

DRAM on the

 

DIMM interface.

The CPU corrects the error in hardware. No SP SEL

Normal

interrupt or machine check is generated by

operation

the hardware. The polling is triggered every

 

half-second by SMI timer interrupts and is

 

done by the BIOS SMI handler.

 

The BIOS SMI handler starts logging each

 

detected error and stops logging when the

 

limit for the same error is reached. The BIOS's

 

polling can be disabled through a software

 

interface.

 

Uncorrectable

The CPU detects an

The “sync flood” method is used to prevent

SP SEL

Fatal

DRAM ECC

uncorrectable

the erroneous data from being propagated

 

 

error

multiple-bit DIMM

across the Hypertransport links. The system

 

 

 

error.

reboots, the BIOS recovers the machine check

 

 

 

 

register information, maps this information to

 

 

 

 

the failing DIMM (when CHIPKILL is

 

 

 

 

disabled) or DIMM pair (when CHIPKILL is

 

 

 

 

enabled), and logs that information to the SP.

 

 

 

 

The BIOS will halt the CPU.

 

 

Unsupported

Unsupported

The BIOS displays an error message, logs an

DMI Log

Fatal

DIMM

DIMMs are used, or

error, and halts the system.

SP SEL

 

configuration

supported DIMMs

 

 

 

 

are loaded

 

 

 

 

improperly.

 

 

 

HyperTranspor

CRC or link error

Sync floods on HyperTransport links, the

DMI Log

Fatal

t link failure

on one of the

machine resets itself, and error information

SP SEL

 

 

Hypertransport

gets retained through reset.

 

 

 

Links.

The BIOS reports, A Hyper Transport

 

 

 

 

sync flood error occurred on last

 

 

 

 

boot, press F1 to continue.

 

 

 

 

 

 

 

Appendix D Error Handling

65

Image 75
Contents Sun Fire X4140, X4240, Servers Diagnostics Guide Please Recycle Contents Status Indicator LEDs Error HandlingEvent Logs and Post Codes Index Page Before You Read This Document PrefaceRelated Documentation Web Sites Typographic ConventionsThird-PartySun Welcomes Your Comments Initial Inspection of the Server Service Troubleshooting FlowchartGathering Service Information Document the server settings before you make any changesCollect information about the following items Troubleshooting Power Problems System InspectionExternally Inspecting the Server 1X4140 Server Front Panel Internally Inspecting the ServerLocate Button/LED Power Button Page Running SunVTS Diagnostic Tests Using SunVTS Diagnostic SoftwareSunVTS Documentation Diagnosing Server Problems With the Bootable Diagnostics CDUsing the Bootable Diagnostics CD Close the Log file window The window is closed Click the Log buttonDimm Population Rules Troubleshooting Dimm ProblemsDimm Replacement Policy How Dimm Errors Are Handled by the SystemUncorrectable Dimm Errors Troubleshooting Dimm Problems 1Lines in Ipmi Output Correctable Dimm ErrorsDimm Fault LEDs Bios Dimm Error MessagesPage 1DIMMs and LEDs on Motherboard 2DIMMs and LEDs on Mezzanine Board Isolating and Correcting Dimm ECC ErrorsReconnect AC power cords to the server Page Viewing Event Logs Event Logs and Post CodesESC Advanced Menu Event Logging Details screen is displayed ESC How Bios Post Memory Testing Works Power-On Self-Test PostRedirecting Console Output Appendix a Event Logs and Post Codes Changing Post Options Select BootBoot Settings Configuration screen is displayed Select Boot Settings ConfigurationPage Post Codes Post Codes Primary I/O port Post Code CheckpointsPost Code Checkpoints Initialize Int-13 and prepare for IPL detection Save system context for Acpi External Status Indicator LEDs Status Indicator LEDsBack Panel LEDs Front Panel LEDsHard Drive LEDs Internal Status Indicator LEDsFigure B-4DIMMs and LEDs on Motherboard Figure B-5DIMMs and LEDs on Mezzanine Board Page P E N D I X C Making a Serial Connection to the SP To start the serial console, type the following commandsFrom the System Monitoring tab, select Event Logs Viewing Ilom SP Event LogsFigure C-1System Event Logs Table C-1Event Log Fields Interpreting Event Log Time StampsViewing Replaceable Component Information Figure C-2Replaceable Component Information From the System Information tab, select ComponentsViewing Sensors Figure C-3Sensor Readings Figure C-4Sensor Details Handling of Uncorrectable Errors Error HandlingIpmitool sel list Figure D-1DMI Log Screen, Uncorrectable Error Handling of Correctable Errors Appendix D Error Handling Page Handling of Parity Errors Perr NMI Event Handling of System Errors Serr PCI Serr Handling Mismatching Processors Hardware Error Handling Summary Hardware Error Handling SummarySP SEL PCI Serr SP SEL Page Index BiosPost

X4140, X4440, X4240 specifications

Sun Microsystems was a prominent player in the computing industry, known for its innovative and powerful server systems. Among its notable offerings were the Sun Fire X4240, X4440, and X4140 servers, which made significant inroads in the market for high-performance computing solutions.

The Sun Fire X4240 server was designed to meet the demands of data-intensive applications. It offered impressive scalability, supporting up to 64 GB of DDR2 memory across its eight memory slots. This server utilized AMD Opteron processors, which provided excellent performance thanks to their multi-core architecture. The X4240 also featured a flexible I/O architecture, allowing for various configurations tailored to specific workload requirements.

Next in line was the Sun Fire X4440, which expanded on the capabilities of the X4240. This server was particularly valuable for virtualization and consolidation tasks. It featured up to 128 GB of memory and supported more CPU options, with dual- and quad-core Opteron processors available. The X4440 also included a high-density design that enabled increased storage capacity, accommodating up to 12 SFF drives. This made it ideal for databases and enterprise applications needing fast access to large volumes of data.

Finally, the Sun Fire X4140 brought a balance of performance and efficiency. Like its counterparts, it supported AMD's Opteron processors, delivering robust processing power. The X4140 was designed for environments where space and power efficiency were critical. It was notable for its compact form factor, which allowed organizations to pack more servers into less physical space without sacrificing performance. The server architecture included advanced thermal management technologies, ensuring optimal airflow and cooling, which contributed to reliability in demanding environments.

In terms of connectivity, all three servers featured multiple Gigabit Ethernet ports, offering redundant network connectivity essential for enterprise-level applications. The integrated management interfaces simplified server monitoring and maintenance, ensuring that IT administrators could efficiently manage their resources.

In summary, the Sun Fire X4240, X4440, and X4140 were pivotal servers from Sun Microsystems that provided robust performance, scalability, and efficiency. Their features made them suitable for a variety of workloads, from virtualization to data management, cementing their place in the server market during their era.