MT_BE_THRSW_ACTUAL_OUT.ANY — The number of events that switched the foreground thread into a background thread (also called as switching out) or background thread switching in.

MT_BE_THRSW_ACTUAL_OUT.MLD_USE — The number of thread switches from foreground to background due to wait on middle level data cache (MLD).

CPU_OP_CYCLES.ALL — The number of elapsed CPU operating cycles. When HyperThreading is on, this is the number of elapsed CPU operating cycles used by the hyperthread of a specific process.

CPU_OP_CYCLES.ALL (at=true) — The number of elapsed CPU operating cycles used by both hyperthreads. Available only when HyperThreading is on.

CYC_BE_BUBBLE.ANY — The number of CPU cycles lost to replays, flushes or bubbles, including partial replay cycles.

CYC_BE_IBD_STALL.GR_LOAD — This is the number of cycles lost (stall cycles) due to GR load RAW or WAW dependency condition of the instruction.

CYC_BE_EXE_REPLAY.GR_LOAD_RAW — This is the number of cycles lost (stall cycles) in replay due to RAW hazard in an instruction's GR load.

CYC_BE_EXE_REPLAY.GR_LOAD_WAW — This is the number of cycles lost (stall cycles) in replay due to WAW hazard in an instruction's GR load.

CYC_BE_DET_REPLAY.GR_LOAD — This is the number of cycles lost (stall cycles) in replay due to memory loads of single cycle GR load instructions. The loads do not hit the FLD (first level data cache) and have to be obtained from lower level caches or memory leading to extra cycles.

% of Cycles lost due to stalls (lower is better) — The percentage of cycles lost due to all stalls.

Cycles lost due to GR load penalties — The percentage of cycles lost due to GR/load or GR/GR dependency stalls like hazards, DET_REPLAY & IBD_STALL.

Effective instructions retired — The number of effective instructions retired excluding nop and predicated off instructions.

Raw CPI (lower is better) — The cycles per instruction, including nop and predicated off instructions.

Raw IPC (higher is better) — The instructions per cycle (inverse of CPI).

Effective CPI (lower is better) — The cycles per effective instruction, excluding nop and predicated off instructions.

Effective IPC (higher is better) — The effective instructions per cycle (inverse of Effective CPI).

Effective CPI during unstalled execution (lower is better) — The cycles per effective instruction, excluding stall cycles, nop, and predicated off instructions.

Effective IPC during unstalled execution (higher is better) — The effective instructions per cycle, (inverse of Effective CPI during unstalled execution).

Number of times the thread is switched out — (MT_BE_THRSW_ACTUAL_OUT.ANY) The number of events that switched the foreground thread into a background thread.

Thread switch outs due to L2 use — The percentage of thread switches due to the usage of middle level data cache (MLD or L2D ) in the total number of thread switches.

Core cycles due to this thread — The percentage of cycles by this Hyperthread in relation to the total number of cycles of the core when Hyperthreading is on.

fcount Measurement Report Description

Only on HP-UX.

With the fcount measurement, produced by the fcount measurement configuration file, HP Caliper measures and reports exact function call counts. This gives the total number of times each function is called, either directly or indirectly.

208 Descriptions of Measurement Reports