Metrics for Integrity Servers Dual-Core Itanium 2 and Itanium 9300 Quad-Core Processor Systems

The following CPU events are directly measured:

BACK_END_BUBBLE.ALL — The number of cycles when the back end of the pipeline was stalled. This is the number of cycles lost (stall cycles) due to any of five possible events (FPU/L1D, RSE, EXE, branch/exception, or the front end).

BE_EXE_BUBBLE.GRALL — The number of Full Pipe Bubbles in Main Pipe due to GR/GR or GR/load dependency stalls. This is the number of cycles lost (stall cycles) due to GR/GR or GR/load dependency.

BE_FLUSH_BUBBLE.ALL — The number of Full Pipe Bubbles in Main Pipe due to pipeline flushes. This is the number of cycles lost (stall cycles) due to branch misprediction or exception/interruption flush.

BE_L1D_FPU_BUBBLE.L1D — The number of Full Pipe Bubbles in Main Pipe due to L1D cache. This is the number of cycles lost (stall cycles) due to L1D cache and L1/L2 DTLB.

CPU_OP_CYCLES.ALL — The number of elapsed CPU operating cycles.

When HyperThreading is on, this is the number of elapsed CPU operating cycles used by only this process's hyperthread.

CPU_OP_CYCLES.ALL:all_threads=true

The number of elapsed CPU operating cycles used by both hyperthreads. Available only when HyperThreading is on.

IA64_INST_RETIRED — The number of retired IA-64 instructions. The count includes predicated on and predicated off instructions and nops, but excludes hardware-inserted RSE operations.

NOPS_RETIRED — The number of retired nop.i, nop.m, or nop.b instructions. The count excludes predicated off nop instructions.

PREDICATE_SQUASHED_RETIRED — The number of predicated off instructions retired.

THREAD_SWITCH_EVENTS.ALL — The number of hardware thread switches.

THREAD_SWITCH_EVENTS.L3MISS — The number of hardware thread switches due to L3 cache miss.

The following CPU metrics are derived (calculated) from the above CPU events:

% of Cycles lost due to stalls (lower is better) — The percentage of cycles lost due to all stalls.

% of Cycles lost due to stalls caused by L1D (L1D and L1/L2 DTLB) — The percentage of cycles lost due to L1D cache and L1/L2 DTLD.

% of Cycles lost due to GR/GR or GR/load dependency stalls — The percentage of cycles lost due to GR/load or GR/GR dependency stalls.

% of Cycles lost due to branch misprediction or interruption flush stalls — The percentage of cycles lost due to branch misprediction or interruption flush.

Effective instructions retired — The number of effective instructions retired excluding nop and predicated off instructions.

Raw CPI (lower is better) — The cycles per instruction, including nop and predicated off instructions.

Effective CPI (lower is better) — The cycles per effective instruction, excluding nop and predicated off instructions.

Effective CPI during unstalled execution (lower is better) — The cycles per effective instruction, excluding stall cycles, nop, and predicated off instructions.

198 Descriptions of Measurement Reports