Figure 2 fprof Measurement Report for matmul, with Default Report Output

================================================================================

HP Caliper 4.3.0 Report Summary for Flat Profile 1

================================================================================

Collection Run 1: (Flat Profile) 2

---------------------------------

Processor Information

3

 

 

Machine name:

 

 

 

fitzroy

Number of processors:

4

Processor type:

Itanium2 9M

Processor speed:

1600 MHz

Virtual machine:

no

Run Information

 

 

 

 

 

4

 

 

 

 

Configuration:

/opt/caliper/config/fprof

Date:

May 05, 2007

Version:

HP Caliper - HP-UX Itanium Version 4.3.0 (2007-05-05)

OS:

HP-UX B.11.23 U ia64

Database:

/home/meagher/.hp_caliper_databases/fprof

Measurement scope: per-process

Sampling Specification

 

 

5

 

Sampling event:

 

CPU_CYCLES

Sampling period:

 

500000 events

Sampling period variation: 25000 (5.00% of sampling period)

Sampling counter privilege: user (user-space sampling)

Data

granularity:

16

bytes

Data

sampled:

IP

 

--------------------------------------------------------------------------

Metrics Summed for Entire Run 6

-----------------------------------------------

 

PLM

 

 

Event Name

U..K

TH

Count

-----------------------------------------------

CPU_CYCLES

x___

0

659001879

BACK_END_BUBBLE.ALL

x___

0

99866365

BE_EXE_BUBBLE.GRALL

x___

0

1244270

-----------------------------------------------

PLM: Privilege Level Mask

U/K = user/kernel levels (U: level 3, K: level 0)

The intermediate levels (1, 2) are unused on HP-UX or Linux

x : the metric is measured at the given level (_ : not measured)

TH: event THreshold, determines the event counter behavior, TH == 0 : counter += event_count_in_cycle

TH > 0 : counter += (event_count_in_cycle >= threshold ? 1 : 0)

-----------------------------------------------

%of Cycles lost due to stalls (lower is better):

15.15= 100 * (BACK_END_BUBBLE.ALL / CPU_CYCLES)

%of Cycles stalled due to GR/GR or GR/load dependency (lower is better):

0.19= 100 * (BE_EXE_BUBBLE.GRALL / CPU_CYCLES)

-----------------------------------------------

Process Summary 7

-------------------------------------------

% Total

Cumulat

 

 

IP

% of

IP

 

Samples

Total

Samples

Process

-------------------------------------------

100.00

100.00

1319

matmul (pid: 6991)

-------------------------------------------

[Minimum process entries: 5, percent cutoff: 2.00, cumulative percent cutoff: 100.00]

-------------------------------------------

================================================================================

Flat Profile Report for matmul 8

================================================================================

Target Application

9

 

 

Program:

/home/meagher/matmul

Invocation:

./matmul

Process ID:

6991 (started by Caliper)

Start time:

09:31:37 AM

End time:

09:31:38 AM

Termination Status:

1

Last modified:

May 5, 2007 at 09:09 AM

Memory model:

ILP32

Processor set:

default

Example: Running fprof on a Short Program, with Default Output

21