llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexey Samsonov	d45837155d	[TSan] Update check_analyze.sh expectations to match trunk Clang output. llvm-svn: 227877	2015-02-02 22:17:23 +00:00
Dmitry Vyukov	afdcc96d9f	tsan: optimize memory access functions The optimization is two-fold: First, the algorithm now uses SSE instructions to handle all 4 shadow slots at once. This makes processing faster. Second, if shadow contains the same access, we do not store the event into trace. This increases effective trace size, that is, tsan can remember up to 10x more previous memory accesses. Perofrmance impact: Before: [ OK ] DISABLED_BENCH.Mop8Read (2461 ms) [ OK ] DISABLED_BENCH.Mop8Write (1836 ms) After: [ OK ] DISABLED_BENCH.Mop8Read (1204 ms) [ OK ] DISABLED_BENCH.Mop8Write (976 ms) But this measures only fast-path. On large real applications the speedup is ~20%. Trace size impact: On app1: Memory accesses : 1163265870 Including same : 791312905 (68%) on app2: Memory accesses : 166875345 Including same : 150449689 (90%) 90% of filtered events means that trace size is effectively 10x larger. llvm-svn: 209897	2014-05-30 13:36:29 +00:00
Kostya Serebryany	5b7cb1db61	[tsan] old-dstyle Makefile for tests; two helper scripts that analyze the assembly code of the hot functions llvm-svn: 156547	2012-05-10 15:10:03 +00:00

Author

SHA1

Message

Date

Alexey Samsonov

d45837155d

[TSan] Update check_analyze.sh expectations to match trunk Clang output.

llvm-svn: 227877

2015-02-02 22:17:23 +00:00

Dmitry Vyukov

afdcc96d9f

tsan: optimize memory access functions

The optimization is two-fold:
First, the algorithm now uses SSE instructions to
handle all 4 shadow slots at once. This makes processing
faster.
Second, if shadow contains the same access, we do not
store the event into trace. This increases effective
trace size, that is, tsan can remember up to 10x more
previous memory accesses.

Perofrmance impact:
Before:
[       OK ] DISABLED_BENCH.Mop8Read (2461 ms)
[       OK ] DISABLED_BENCH.Mop8Write (1836 ms)
After:
[       OK ] DISABLED_BENCH.Mop8Read (1204 ms)
[       OK ] DISABLED_BENCH.Mop8Write (976 ms)
But this measures only fast-path.
On large real applications the speedup is ~20%.

Trace size impact:
On app1:
Memory accesses                   :       1163265870
  Including same                  :        791312905 (68%)
on app2:
Memory accesses                   :        166875345
  Including same                  :        150449689 (90%)
90% of filtered events means that trace size is effectively 10x larger.

llvm-svn: 209897

2014-05-30 13:36:29 +00:00

Kostya Serebryany

5b7cb1db61

[tsan] old-dstyle Makefile for tests; two helper scripts that analyze the assembly code of the hot functions

llvm-svn: 156547

2012-05-10 15:10:03 +00:00

3 Commits