Target Execution Time                        10
  Real time:    0.428 seconds
  User time:    0.415 seconds
  System time:  0.008 seconds

Sampling Specification                       11
  Number of samples:  1319
  Data sampled:       IP

Metrics Summed for Entire Run                12
-----------------------------------------------
                      PLM
Event Name            U..K  TH        Count
-----------------------------------------------
CPU_CYCLES            x___   0    659001879
BACK_END_BUBBLE.ALL   x___   0     99866365
BE_EXE_BUBBLE.GRALL   x___   0      1244270
-----------------------------------------------

PLM: Privilege Level Mask
     U/K = user/kernel levels (U: level 3, K: level 0)
     The intermediate levels (1, 2) are unused on HP-UX or Linux
     x : the metric is measured at the given level (_ : not measured)
TH:  event THreshold, determines the event counter behavior
     TH == 0 : counter += event_count_in_cycle
     TH >  0 : counter += (event_count_in_cycle >= threshold ? 1 : 0)

-----------------------------------------------
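The TH rule in the legend can be read as a per-cycle update of the counter. The following is a minimal sketch in C of that rule only, not the hardware or HP Caliper implementation; the function name `update_counter` and its parameter names are hypothetical.

```c
#include <stdint.h>

/*
 * Per-cycle counter update as described by the TH legend.
 * update_counter, events_in_cycle, and threshold are illustrative names.
 */
uint64_t update_counter(uint64_t counter, unsigned events_in_cycle,
                        unsigned threshold)
{
    if (threshold == 0) {
        /* TH == 0: add every event seen in this cycle. */
        return counter + events_in_cycle;
    }
    /* TH > 0: add 1 only when the cycle reaches the threshold. */
    return counter + (events_in_cycle >= threshold ? 1u : 0u);
}
```

With TH set to 0, a cycle with three events adds 3 to the counter. With TH set to 2, the same cycle adds 1, and a cycle with a single event adds nothing.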

% of Cycles lost due to stalls (lower is better):
  15.15 = 100 * (BACK_END_BUBBLE.ALL / CPU_CYCLES)
% of Cycles stalled due to GR/GR or GR/load dependency (lower is better):
   0.19 = 100 * (BE_EXE_BUBBLE.GRALL / CPU_CYCLES)

-----------------------------------------------
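The two derived percentages follow directly from the summed event counts. As a minimal sketch, assuming a hypothetical helper named `percent_of_cycles`, the arithmetic can be reproduced in C like this:

```c
#include <stdint.h>

/*
 * Share of all CPU cycles attributed to one event, in percent:
 * 100 * (event_count / cpu_cycles). Returns 0 when cpu_cycles is 0.
 * percent_of_cycles is an illustrative name, not part of any tool.
 */
double percent_of_cycles(uint64_t event_count, uint64_t cpu_cycles)
{
    if (cpu_cycles == 0) {
        return 0.0;
    }
    return 100.0 * (double)event_count / (double)cpu_cycles;
}
```

Calling `percent_of_cycles(99866365, 659001879)` gives a value that rounds to 15.15, and `percent_of_cycles(1244270, 659001879)` gives a value that rounds to 0.19, matching the report.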

Load Module Summary                          13
-----------------------------------------------
% Total  Cumulat
     IP     % of       IP
Samples    Total  Samples  Load Module
-----------------------------------------------
  96.97    96.97     1279  matmul
   2.88    99.85       38  libc.so.1
   0.15   100.00        2  dld.so
-----------------------------------------------
 100.00   100.00     1319  Total

-----------------------------------------------
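The "% Total IP Samples" and "Cumulat % of Total" columns can be derived from the per-module sample counts alone. The sketch below shows one way to compute them in C; the function name `sample_percentages` and its signature are hypothetical, and the report's own rounding to two decimals is left to the caller.

```c
#include <stddef.h>

/*
 * For n entries sorted by descending sample count, store in pct[i] the
 * entry's share of all samples and in cum[i] the running share, both in
 * percent. sample_percentages is an illustrative name.
 */
void sample_percentages(const unsigned *samples, size_t n,
                        double *pct, double *cum)
{
    unsigned long total = 0;
    unsigned long running = 0;

    for (size_t i = 0; i < n; i++) {
        total += samples[i];
    }
    for (size_t i = 0; i < n; i++) {
        running += samples[i];
        pct[i] = total ? 100.0 * (double)samples[i] / (double)total : 0.0;
        cum[i] = total ? 100.0 * (double)running / (double)total : 0.0;
    }
}
```

For the counts 1279, 38, and 2 the shares round to 96.97, 2.88, and 0.15, and the cumulative column rounds to 96.97, 99.85, and 100.00.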

 

Function Summary                                                       14
-------------------------------------------------------------------------
% Total  Cumulat
     IP     % of       IP
Samples    Total  Samples  Function          File
-------------------------------------------------------------------------
  96.97    96.97     1279  matmul::main      matmul.c
   2.88    99.85       38  libc.so.1::rand   rand.c
-------------------------------------------------------------------------

[Minimum function entries: 5, percent cutoff: 0.10, cumulative percent cutoff: 100.00]

Function Details                                 15
---------------------------------------------------
% Total                   Line
     IP         IP        Slot  >Statement
Samples    Samples  Col,Offset  Instruction
---------------------------------------------------
  96.97  [matmul::main, 0x40009a0, matmul.c]
              1279         ~38  Function Totals
         ------------------------------------------
         [/home/meagher/matmul.c]
                34         ~16  >  mata[i][j] = matb[i][j] = (float) rand() ;
                 1         ~28  >  for (k = 0 ; k < INDEX ; k++) {
              1244         ~29  >  matres[i][j] =
---------------------------------------------------
   2.88  [libc.so.1::rand, 0x4119f60, rand.c]
                38         ~57  Function Totals
         ------------------------------------------
         [File not found: /ux/libsobj_i380em/libs/libc/shared_em_32/obj/../../../
         ../../core/libs/libc/shared_em_32/../core/gen/rand.c]          16
         -----------------------------------------------
                 4         ~57  >
                11         ~61  >
                18         ~77  >
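The statement-level samples for matmul::main point at an initialization loop and an inner multiply loop. The report shows only fragments of matmul.c, so the following C sketch is a hypothetical program of the same shape, not the profiled source: the value of `INDEX`, the function names `fill_inputs` and `multiply`, and the completion of the truncated `matres[i][j] =` statement are all assumptions.

```c
#include <stdlib.h>

#define INDEX 4  /* hypothetical size; the profiled program's value is not shown */

float mata[INDEX][INDEX], matb[INDEX][INDEX], matres[INDEX][INDEX];

/* Initialization loop, built around the statement shown at line ~16. */
void fill_inputs(void)
{
    for (int i = 0; i < INDEX; i++) {
        for (int j = 0; j < INDEX; j++) {
            mata[i][j] = matb[i][j] = (float) rand();
        }
    }
}

/* Triple loop; the innermost assignment runs INDEX^3 times. */
void multiply(void)
{
    for (int i = 0; i < INDEX; i++) {
        for (int j = 0; j < INDEX; j++) {
            matres[i][j] = 0.0f;
            for (int k = 0 ; k < INDEX ; k++) {
                /* Assumed completion of the truncated "matres[i][j] =" line. */
                matres[i][j] = matres[i][j] + mata[i][k] * matb[k][j];
            }
        }
    }
}
```

In a program of this shape the innermost assignment executes far more often than any other statement, which is consistent with 1244 of the 1279 samples in matmul::main landing on line ~29.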

Example: Running fprof on a Short Program, with Default Output
