NEC 5800/1000 manual Features for performance improvement, Compiler

Page 4

Supercomputer-class Performance

Features for performance improvement

Dual-Core Intel® Itanium® processor and high-speed inter/intra Cell cache-to-cache data transfer

At the heart of the Express5800/1000 series server is the 64-bit Dual-Core Intel® Itanium® processor, redesigned for even faster processing of larger data sets.

The system has been equipped with the NEC designed chipset, “A3 ”, in order to improve performance by utilizing, to its full extent, the massive 24MB of cache memory that has been built into the Dual-Core Intel® Itanium® processor

Technologies to increase cache-to-cache data transfer, such as the VLC architecture and CCI, have been implemented

to maximize the performance for enterprise mission critical[1080Rf] computing.

Increased Memory Bandwidth

Improved Inter/Intra-Cell memory data transfer

Very Large Cache (VLC) Architecture

High-speed/low latency Intra-Cell cache-to-cache data transfer

Dedicated Cache Coherency Interface (CCI)

High-speed/low latency Inter-Cell cache-to-cache data transfer

Crossbar-less configuration

Improved data transfer latency between Cell/Cell and Cell/IO

[1320Xf]

[1160Xf]

High processing power of the Dual-Core Intel® Itanium® processor

Dual-Core, massive L3 cache and EPIC (Explicitly Parallel Instruction Computing) architecture

The Dual-Core Intel® Itanium® processor is Intel’s first production in the Itanium® processor family with two complete 64-bit cores on one processor and also the first member of the Intel® Itanium® processor family to include Hyper-Threading Technology, which provides four times the number of application threads provided by earlier single-core implementations.

With a maximum of 24MB of On-Die L3 cache, the Dual-Core Intel®

Itanium® processor excels at high volume data transactions.

EPIC architecture provides a variety of advanced implementations of parallelism, predication, and speculation, resulting in superior Instruction-Level Parallelism (ILP) to help address the current and future requirements of high-end enterprise and technical workloads.

Conventional Superscalar RISC Processor

Parallel processing with EPIC architecture

Original Source Code

Compiler

Sequential

Machine Code

Some level of parallelization is achieved however, it is not maximized nor efficient

Partial HW

Parallelization

Original Source Code

Intel® Itanium® processor supported compiler

Intel® Itanium® processor

 

source is parallelized at

 

compile time

 

Efficient parallel processing

 

is made possible due to the

 

thorough parallelization.

Hardware

In the EPIC architecture, parallelization is run at compile time, allowing for maximum parallelization with minimal scheduling.

4

Image 4
Contents NEC Express5800/1000 Series Crossbar-less configuration Available only on 1080Rf Very Large Cache VLC ArchitectureDedicated Cache Coherency Interface CCI Resource virtualization through Floating IOA3 Chipset CellDual-Core Intel Itanium processor Compiler Features for performance improvementIntel Itanium processor supported compiler Crossbar-less configuration VLC ArchitectureDedicated Cache Coherency Interface CCI Directory Based Cache CoherencyFramework for hardware, firmware and OS error handling RAS Design PhilosophyMemory Mirroring Partial Chipset degradationHighly Available Center Plane Complete modularization and redundancyDiagnostics of the error detection circuits Substantial strengthening of data integrityExpress5800/1000 Series Cell cardEnhanced error detection of the high-speed interconnect Two independent power sourcesRealization of a mainframe-class platform serviceability InternetSuperior standard chassis configuration Investment ProtectionResource virtualization through floating I/O Multi OS support / Rich application lineupProcessor Dual Core Intel Itanium processor L1 Cache/core 16KB I / 16KB DL2 Cache/core 1MB I / 256KB D L3 Cache/core 12MB