Project

General

Profile

Actions

Bug #544

closed

execution time for enabled tracepoints has increased in 2.2.0-rc2 compared to 2.1.1

Added by Stanislav Vovk almost 11 years ago. Updated about 10 years ago.

Status:
Won't fix
Priority:
Normal
Target version:
-
Start date:
05/27/2013
Due date:
% Done:

0%

Estimated time:

Description

Commit id (2.2.0-rc2) ============
userspace-rcu : 8ff97cc3836c6a2739b62af0a859511c9c284e4f
lttng-ust : 996aeadece97a55e7f48ca1e73958a53de62ed4f
lttng-tools : 7f2a80589c0b04627eaa351a9debff202836f930

Commit id (2.1.1) ============
userspace-rcu : 3ff667f1ed404609bed536a8b926b90385153090
lttng-ust (2.1.2): 3f7a0876566a432ad2cc2b27a2725641e36da92d
lttng-tools : 959f036852e1eef40b2fb201a5b2914285a0abbe

Problem Description: ====================
Basically when comparing two above versions on PowerPC(LSI ACP34xx) it seems that time to execute one enabled tracepoint has increased by a factor of ~2,3. Test was done as following. One tracepoint is executed inside a loop a million times. Time is taken before entering the loop and right after exiting the loop. Then a difference and average time is calculated. Application sleeps 2 seconds at start(before time i taken) to wait that everything is initialized.

Same test were run in the same environment and build with same toolchain and sysroot, the only thing different is LTTng version. For detailed environment specifications see further down.

LTTng was configured as below,
root@du1:~# lttng list s1
Tracing session s1: [active]
Trace path: /root/out

=== Domain: UST global ===

Channels:
-------------
- ch1: [enabled]

Attributes:
overwrite mode: 1
subbufers size: 4096
number of subbufers: 4
switch timer interval: 0
read timer interval: 200
output: mmap()
Events:
sample_component:trace1 (type: tracepoint) [enabled]

Just to get an idea of execution times look at output from test program below.

LTTng 2.1.1:
root@du1:~# ./ltt-test-2.1.1 -l 1000000 -t 3
Benchmark test starts now...
Time it took to loop 1000000 times : 3 s 438450550 ns | [ 3438 ns per tp ]
Time it took to loop 1000000 times : 3 s 548152600 ns | [ 3548 ns per tp ]
Time it took to loop 1000000 times : 3 s 486106110 ns | [ 3486 ns per tp ]

LTTng 2.2.0-rc2:
root@du1:~# ./ltt-test-2.2.0 -l 1000000 -t 3
Benchmark test starts now...
Time it took to loop 1000000 times : 8 s 566419650 ns | [8566 ns per tp]
Time it took to loop 1000000 times : 8 s 594758380 ns | [8594 ns per tp]
Time it took to loop 1000000 times : 8 s 550842610 ns | [8550 ns per tp]

Target Environment ==================
root@du1:~# uname -a
Linux du1 3.4.34-rt40-00001-g7f4955b #1 SMP PREEMPT RT Wed, 22 May 2013 12:58:10 +0200 ppc GNU/Linux

root@du1:~# /lib/libc.so.6
GNU C Library (Wind River Linux Sourcery CodeBench 4.6a-98) stable release version 2.15, by Roland McGrath et al.
Copyright (C) 2012 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.
There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A
PARTICULAR PURPOSE.
Compiled by GNU CC version 4.6.3.
Compiled on a Linux 3.4.11 system on 2013-01-15.
Available extensions:
crypt add-on version 2.1 by Michael Glad and others
GNU Libidn by Simon Josefsson
Native POSIX Threads Library by Ulrich Drepper et al
BIND-8.2.3-T5B
libc ABIs: UNIQUE IFUNC
For bug reporting instructions, please see:
<>.

root@du1:~# cat /proc/cpuinfo
processor : 0
cpu : 476dd2
clock : 1600.000000MHz
revision : 32.192 (pvr 11a5 20c0)
bogomips : 200.00

processor : 1
cpu : 476dd2
clock : 1600.000000MHz
revision : 32.192 (pvr 11a5 20c0)
bogomips : 200.00

processor : 2
cpu : 476dd2
clock : 1600.000000MHz
revision : 32.192 (pvr 11a5 20c0)
bogomips : 200.00

processor : 3
cpu : 476dd2
clock : 1600.000000MHz
revision : 32.192 (pvr 11a5 20c0)
bogomips : 200.00

total bogomips : 800.00
timebase : 100000000
platform : ACP3400DUS41
model : ibm,acpx1-4xx
Memory : 1792 MB

Is problem reproducible? =========================
Always

Perf stats ==========================
See attached.


Files

perf-stats.txt (10.2 KB) perf-stats.txt Stanislav Vovk, 05/27/2013 04:25 AM
sample.tgz (4.64 KB) sample.tgz Stanislav Vovk, 05/29/2013 03:33 AM
perf-annotate-2.1.1.txt (300 KB) perf-annotate-2.1.1.txt Stanislav Vovk, 05/30/2013 07:47 AM
perf-annotate-2.1-stable.txt (684 KB) perf-annotate-2.1-stable.txt Stanislav Vovk, 05/30/2013 07:47 AM
perf-report-2.1.1.txt (25.4 KB) perf-report-2.1.1.txt Stanislav Vovk, 05/30/2013 07:47 AM
perf-report-2.1-stable.txt (49.1 KB) perf-report-2.1-stable.txt Stanislav Vovk, 05/30/2013 07:47 AM
sample2.tgz (3.43 KB) sample2.tgz simple test program Stanislav Vovk, 07/30/2013 09:51 AM
Actions

Also available in: Atom PDF