Metrics for Integrity Servers Dual-Core Itanium 2 and Itanium 9300 Quad-Core Processor Systems

The following CPU events are directly measured:

BACK_END_BUBBLE.ALL — The number of cycles when the back end of the pipeline was stalled. This is the number of cycles lost (stall cycles) due to any of five possible events (FPU/L1D, RSE, EXE, branch/exception, or the front end).

BE_EXE_BUBBLE.GRALL — The number of Full Pipe Bubbles in Main Pipe due to GR/GR or GR/load dependency stalls. This is the number of cycles lost (stall cycles) due to GR/GR or GR/load dependency.

BE_FLUSH_BUBBLE.ALL — The number of Full Pipe Bubbles in Main Pipe due to pipeline flushes. This is the number of cycles lost (stall cycles) due to branch misprediction or exception/interruption flush.

BE_L1D_FPU_BUBBLE.L1D — The number of Full Pipe Bubbles in Main Pipe due to L1D cache. This is the number of cycles lost (stall cycles) due to L1D cache and L1/L2 DTLB.

CPU_OP_CYCLES.ALL — The number of elapsed CPU operating cycles.

When HyperThreading is on, this is the number of elapsed CPU operating cycles used by only this process's hyperthread.

CPU_OP_CYCLES.ALL:all_threads=true

The number of elapsed CPU operating cycles used by both hyperthreads. Available only when HyperThreading is on.

IA64_INST_RETIRED — The number of retired IA-64 instructions. The count includes predicated on and predicated off instructions and nops, but excludes hardware-inserted RSE operations.

NOPS_RETIRED — The number of retired nop.i, nop.m, or nop.b instructions. The count excludes predicated off nop instructions.

PREDICATE_SQUASHED_RETIRED — The number of predicated off instructions retired.

THREAD_SWITCH_EVENTS.ALL — The number of hardware thread switches.

THREAD_SWITCH_EVENTS.L3MISS — The number of hardware thread switches due to L3 cache miss.

The following CPU metrics are derived (calculated) from the above CPU events:

% of Cycles lost due to stalls (lower is better) — The percentage of cycles lost due to all stalls.

% of Cycles lost due to stalls caused by L1D (L1D and L1/L2 DTLB) — The percentage of cycles lost due to L1D cache and L1/L2 DTLD.

% of Cycles lost due to GR/GR or GR/load dependency stalls — The percentage of cycles lost due to GR/load or GR/GR dependency stalls.

% of Cycles lost due to branch misprediction or interruption flush stalls — The percentage of cycles lost due to branch misprediction or interruption flush.

Effective instructions retired — The number of effective instructions retired excluding nop and predicated off instructions.

Raw CPI (lower is better) — The cycles per instruction, including nop and predicated off instructions.

Effective CPI (lower is better) — The cycles per effective instruction, excluding nop and predicated off instructions.

Effective CPI during unstalled execution (lower is better) — The cycles per effective instruction, excluding stall cycles, nop, and predicated off instructions.

198 Descriptions of Measurement Reports

Page 198
Image 198
HP UX Caliper Software manual Descriptions of Measurement Reports