llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	9be66737c1	[Hexagon] Add a scheduling DAG mutation - Remove output dependencies on USR_OVF register. - Update chain edge latencies between v60 vector loads/stores. llvm-svn: 275586	2016-07-15 17:48:09 +00:00
Etienne Bergeron	9da82d6aca	[compiler-rt] Fix 64-bits exception handlers in ASAN 64-bits runtime Summary: This is adding the appropriate support for exception handling for 64-bits ASAN on windows. Reviewers: rnk Subscribers: kubabrecka, llvm-commits, wang0109, chrisha Differential Revision: https://reviews.llvm.org/D22395 llvm-svn: 275585	2016-07-15 17:28:10 +00:00
Etienne Bergeron	a566fe341f	[compiler-rt] Fix incorrect handling of indirect load. Summary: Indirect loads are relative offsets from RIP. The current trampoline implementation is incorrectly copying these instructions, which makes some unittests crash. This patch does not fix the unittests, but it fixes the crashes. The functions are no longer hooked. Patches will come soon to fix these unittests. Reviewers: rnk Subscribers: llvm-commits, wang0109, chrisha Differential Revision: https://reviews.llvm.org/D22410 llvm-svn: 275584	2016-07-15 17:26:33 +00:00
Adam Nemet	aad816083e	[OptRemark,LDist] RFC: Add hotness attribute Summary: This is the first set of changes implementing the RFC from http://thread.gmane.org/gmane.comp.compilers.llvm.devel/98334 This is a cross-sectional patch; rather than implementing the hotness attribute for all optimization remarks and all passes in a patch set, it implements it for the 'missed-optimization' remark for Loop Distribution. My goal is to shake out the design issues before scaling it up to other types and passes. Hotness is computed as an integer as the multiplication of the block frequency with the function entry count. It's only printed in opt currently since clang prints the diagnostic fields directly. E.g.: remark: /tmp/t.c:3:3: loop not distributed: use -Rpass-analysis=loop-distribute for more info (hotness: 300) A new API added is similar to emitOptimizationRemarkMissed. The difference is that it additionally takes a code region that the diagnostic corresponds to. From this, hotness is computed using BFI. The new API is exposed via an analysis pass so that it can be made dependent on LazyBFI. (Thanks to Hal for the analysis pass idea.) This feature can all be enabled by setDiagnosticHotnessRequested in the LLVM context. If this is off, LazyBFI is not calculated (D22141) so there should be no overhead. A new command-line option is added to turn this on in opt. My plan is to switch all user of emitOptimizationRemark* to use this module instead. Reviewers: hfinkel Subscribers: rcox2, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D21771 llvm-svn: 275583	2016-07-15 17:23:20 +00:00
Kostya Serebryany	5d9a17742e	[libFuzzer] add ThreadedLeakTest llvm-svn: 275582	2016-07-15 17:19:43 +00:00
David Majnemer	a940f360cb	[AliasAnalysis] Give back AA results for fence instructions Calling getModRefInfo with a fence resulted in crashes because fences don't have a memory location. Add a new predicate to Instruction called isFenceLike which indicates that the instruction mutates memory but not any single memory location in particular. In practice, it is a proxy for the set of instructions which "mayWriteToMemory" but cannot be used with MemoryLocation::get. This fixes PR28570. llvm-svn: 275581	2016-07-15 17:19:24 +00:00
Etienne Bergeron	3d89db445d	TestCase null_deref was failing in Win64: c:\lipo\work\asan\b_llvm>c:\lipo\work\asan\b_llvm\projects\compiler-rt\test\asan\X86_64WindowsConfig\TestCases\Output\null_deref.cc.tmp ================================================================= ==5488==ERROR: AddressSanitizer: access-violation on unknown address 0x000000000028 (pc 0x7ff701f91067 bp 0x000c8cf8fbf0 sp 0x000c8cf8fbb0 T0) ==5488==The signal is caused by a READ memory access. ==5488==Hint: address points to the zero page. #0 0x7ff701f91066 in NullDeref(int *) C:\lipo\work\asan\llvm\projects\compiler-rt\test\asan\TestCases\null_deref.cc:15:10 #1 0x8a0388830a67 (<unknown module>) The reason was that symbols were not initialized. In fact, they were first initialized by a call to stack.Print(), which calls WinSymbolizerTool::SymbolizePC, then InitializeDbgHelpIfNeeded(). Since the StackWalk was performed before the stack.Print(), stack frames were not gathered correctly. There should be a better place to initialize symbols. For now, this patch makes the test happy. Patch by Wei Wang Differential Revision: https://reviews.llvm.org/D22410 llvm-svn: 275580	2016-07-15 17:16:37 +00:00
Tobias Grosser	2025173494	GPGPU: Format statements scheduled on the host ourselves Otherwise ppcg would try to call into pet functionality that is not available, which obviously will cause trouble. As we can easily print these statements ourselves, we just do so. llvm-svn: 275579	2016-07-15 17:12:41 +00:00
Krzysztof Parzyszek	771c34513a	[Hexagon] Update instruction itineraries llvm-svn: 275578	2016-07-15 16:58:34 +00:00
Wei Ding	ea41f356bb	AMDGPU: Add Clang Builtin for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22380 llvm-svn: 275577	2016-07-15 16:43:03 +00:00
Dehao Chen	dcafd5ebfd	[PM] Convert LoopInstSimplify Pass to new PM Summary: Convert LoopInstSimplify to new PM. Unfortunately there is no existing unittest for this pass. Reviewers: davidxl, silvas Subscribers: silvas, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22280 llvm-svn: 275576	2016-07-15 16:42:11 +00:00
Justin Bogner	92a8c6112c	IR: Sort generic intrinsics before target specific ones This splits out the intrinsic table such that generic intrinsics come first and target specific intrinsics are grouped by target. From here we can find out which target an intrinsic is for or differentiate between generic and target intrinsics. The motivation here is to make it easier to move target specific intrinsic handling out of generic code. llvm-svn: 275575	2016-07-15 16:31:37 +00:00
Krzysztof Parzyszek	f24f468e6d	[Hexagon] Fixes/changes to instruction selection - Add patterns for rr/abs addressing modes. - Set addrMode to PostInc where necessary. - Misc fixes. llvm-svn: 275574	2016-07-15 16:29:02 +00:00
Tobias Grosser	2341fe9e76	GPGPU: Use schedule whole components for scheduler This option increases the scalability of the scheduler and allows us to remove the 'gisting' workaround we introduced in r275565 to handle a more complicated test case. Another benefit of using this option is also that the generated code looks a lot more streamlined. Thanks to Sven Verdoolaege for reminding me of this option. llvm-svn: 275573	2016-07-15 16:15:47 +00:00
Jun Bum Lim	a5737d8eac	[DSE]Enhance shortening MemIntrinsic based on OverlapIntervals Summary: This change uses the overlap interval map built from partial overwrite tracking to perform shortening of MemIntrinsics. Add test cases which were missing opportunities before. Reviewers: hfinkel, eeckstein, mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D21909 llvm-svn: 275571	2016-07-15 16:14:34 +00:00
Dean Michael Berris	57ac95b0a6	XRay: Remove duplicate checks for xray instrumentation flags llvm-svn: 275570	2016-07-15 15:46:39 +00:00
Krzysztof Parzyszek	bba0bf7d37	[Hexagon] Improve patterns with stack-based addressing - Treat bitwise OR with a frame index as an ADD wherever possible, fold it into addressing mode. - Extend patterns for memops to allow memops with frame indexes as address operands. llvm-svn: 275569	2016-07-15 15:35:52 +00:00
Nico Weber	f7f2b81602	In dag-optnone.ll, use varargs instead of win64 to fast SDIsel. The test used to rely on targeting win64 to disable fast isel, but I'd like to teach fast isel about win64 rets. Change the test to use varargs to disable fast isel. llvm-svn: 275568	2016-07-15 15:30:18 +00:00
Matthew Simpson	f855346f0b	[LV] Swap A and B in interleaved access analysis (NFC) This patch swaps A and B in the interleaved access analysis and clarifies related comments. The algorithm is more intuitive if we let access A precede access B in program order rather than the reverse. This change was requested in the review of D19984. llvm-svn: 275567	2016-07-15 15:22:43 +00:00
Yaxun Liu	b3d17690eb	[AMDGPU] Add metadata for runtime Added emitting metadata to elf for runtime. Runtime requires certain information (metadata) about kernels to be able to execute and query them. Such information is emitted to an elf section as a key-value pair stream. Differential Revision: https://reviews.llvm.org/D21849 llvm-svn: 275566	2016-07-15 14:58:21 +00:00
Tobias Grosser	e4725437e8	GPGPU: Drop domain constraints from flow dependences This works around a shortcoming of the isl scheduler, which even for some smaller test cases does not terminate in case domain constraints are part of the flow dependences. llvm-svn: 275565	2016-07-15 14:43:04 +00:00
Jacques Pienaar	71c30a14b7	Rename AnalyzeBranch* to analyzeBranch*. Summary: NFC. Rename AnalyzeBranch/AnalyzeBranchPredicate to analyzeBranch/analyzeBranchPredicate to follow LLVM coding style and be consistent with TargetInstrInfo's analyzeCompare and analyzeSelect. Reviewers: tstellarAMD, mcrosier Subscribers: mcrosier, jholewinski, jfb, arsenm, dschuff, jyknight, dsanders, nemanjai Differential Revision: https://reviews.llvm.org/D22409 llvm-svn: 275564	2016-07-15 14:41:04 +00:00
Igor Laevsky	ee40d1e8da	Re-submit r272891 "Prevent dangling pointer problems in BranchProbabilityInfo" Most possibly problem was caused by the same reason as PR28400. This change bypasses it by using CallbackVH instead of AssertingVH. Differential Revision: https://reviews.llvm.org/D20957 llvm-svn: 275563	2016-07-15 14:31:16 +00:00
Daniel Sanders	db5e666304	Revert r275141 - Mips: Avoid implicit iterator conversions, NFC It appears to have caused some failures in our buildbots. llvm-svn: 275562	2016-07-15 13:54:20 +00:00
Sebastian Pop	4177480aad	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 275561	2016-07-15 13:45:20 +00:00
Aaron Ballman	51fb031a36	Removing a few more :option: tags that we do not have corresponding .. option directives for; these are causing the sphinx bot to fail (http://lab.llvm.org:8011/builders/clang-sphinx-docs/builds/15214/steps/docs-clang-html/logs/stdio ). llvm-svn: 275560	2016-07-15 13:13:45 +00:00
Nitesh Jain	7481e40ef8	[LLVM][MIPS] Fix createStubFunction to emit JR encoding based on Arch. Reviewers: vkalintiris, dsanders Subscribers: jaydeep, bhushan, mohit.bhakkad, slthakur, llvm-commits Differential Revision: https://reviews.llvm.org/D21172 llvm-svn: 275559	2016-07-15 12:56:37 +00:00
Aaron Ballman	05efec8b84	Removing a few more :option: tags that we do not have corresponding .. option directives for; these are causing the sphinx bot to fail (http://lab.llvm.org:8011/builders/clang-sphinx-docs/builds/15213/steps/docs-clang-html/logs/stdio ). llvm-svn: 275558	2016-07-15 12:55:47 +00:00
Tobias Grosser	6293ba6973	GPGPU: Add memory reference tag ids to tagged accesses It seems we forgot to actually add the memory access ids to the tagged accesses, but instead just tagged the accesses with empty isl_ids. This issue was found by inspection and without code generation it is difficult to test just by itself. We fix it for now without test case and expect our code generation tests to cover this later on. llvm-svn: 275557	2016-07-15 12:44:27 +00:00
Kirill Bobyrev	ee99fd13ae	[clang-rename] fix testset Make yet unsupported tests marked with FIXME pass so that buildbot doesn't fail. llvm-svn: 275556	2016-07-15 12:22:38 +00:00
Pavel Labath	9377a7b6a8	Fix TestDarwinNSLogOutput for windows pexpect python package does not exist on windows llvm-svn: 275555	2016-07-15 12:19:28 +00:00
Simon Pilgrim	efd841e294	[X86][AVX] Added shuffle tests for UNPCK+PERMUTE lowerVectorShuffleAsPermuteAndUnpack could solve this if it worked with 256-bit vectors llvm-svn: 275554	2016-07-15 11:51:46 +00:00
Tobias Grosser	cfa0361d35	GPGPU: Do not check for hidden declarations We do not have them in Polly and the code to check for them is directly referring to pet data structures which we do not have available. This commit avoids undefined behavior. As such issues are difficult to reproduce, this commit comes without a test case. llvm-svn: 275553	2016-07-15 11:42:53 +00:00
Simon Pilgrim	cf9c31550c	[X86][AVX2] Added a memory version of test_mm256_broadcastsi128_si256 This should lower to vbroadcasti128 llvm-svn: 275552	2016-07-15 11:40:27 +00:00
Tobias Grosser	225dca7838	GPGPU: Test scalar/array types i1/i3/i8/i32/i60/i64/i80/i120/i128/i3000 Arrays with integer base type are similar to arrays with floating point types, with the exception that LLVM's integer types can take some odd values. We add a selection of different values to make sure we correctly round these types when necessary. References to scalar integer types are special, as we currently do not model these types as array accesses as they are considered 'synthesizable' by Polly. As a result, we do not generate explicit data-transfers for them, but instead will need to keep track of all references to 'synthesizable' values separately. At the current stage, this is only visible by missing host-to-device data-transfer calls. In the future, we will also require special code generation strategies. llvm-svn: 275551	2016-07-15 11:33:47 +00:00
Kirill Bobyrev	32db76949c	[clang-rename] apply stylistic fixes llvm-svn: 275550	2016-07-15 11:29:16 +00:00
Eugene Leviant	0e36f42846	[ELF] Overriding reserved symbols in linker script llvm-svn: 275549	2016-07-15 11:20:04 +00:00
Tobias Grosser	8d9dcfc592	GPGPU: Test scalar parameters of type half/float/double/fp128/x86_fp80/ppc_fp128 We currently only test that the code structure we generate for these scalar parameters is correct and we add these types to make sure later code generation additions have sufficient test coverage. In case some of these types cannot be mapped due to missing hardware support on the GPU some of these test cases may need to be updated later on. llvm-svn: 275548	2016-07-15 11:12:29 +00:00
Tobias Grosser	2d010daf85	GPGPU: Make sure scops with more than one array work We use this opportunity to add a test case containing a scalar parameter. llvm-svn: 275547	2016-07-15 10:51:14 +00:00
Tobias Grosser	b307ed4d08	GPGPU: Free options to avoid memory leak ppcg does not free the option structs for us. To avoid a memory leak we do this ourselves. llvm-svn: 275546	2016-07-15 10:32:22 +00:00
Kirill Bobyrev	713bdc01fb	[clang-rename] add few tests This patch introduces a few additional tests, including one case the tool does not handle yet, which should be fixed in the future. Differential Revision: https://reviews.llvm.org/D22102 llvm-svn: 275545	2016-07-15 10:21:33 +00:00
Pavel Labath	4abe5d69ea	[NPL] Simplify process launch code Summary: This removes one level of indirection, which was just packing and repacking launch args into different structures. NFC. Reviewers: tberghammer Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D22357 llvm-svn: 275544	2016-07-15 10:18:15 +00:00
Simon Pilgrim	2683ad54ad	[X86][AVX2] Improve lowerShuffleAsRepeatedMaskAndLanePermute permutation of 64-bit sub-lanes As discussed on PR28136, lowerShuffleAsRepeatedMaskAndLanePermute was attempting to match repeated masks at the 128-bit level and then permute the resultant lanes at the 128-bit (AVX1) or 64-bit (AVX2) sub-lane level. This change allows us to create the repeated masks at the sub-lane level (and then concat them together to create a 128-bit repeated mask) and then select which sub-lane to permute. This has no effect on the AVX1 codegen. Fixes PR28136. llvm-svn: 275543	2016-07-15 09:49:12 +00:00
Haojian Wu	39a718c48b	[include-fixer] Always add as few as possible qualifiers to the unidentified symbol. Reviewers: bkramer Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D22367 llvm-svn: 275542	2016-07-15 08:12:48 +00:00
James Molloy	8f16dffbb1	[ARM] Fix build after r275540 A rebase seemed so innocent before committing. Turns out someone changed a pointer to a reference in the mean time :( llvm-svn: 275541	2016-07-15 08:12:44 +00:00
James Molloy	b3326df56a	[Thumb-1] Select post-increment load and store where possible Thumb-1 doesn't have post-inc or pre-inc load or store instructions. However the LDM/STM instructions with writeback can function as post-inc load/store: ldm r0!, {r1} @ load from r0 into r1 and increment r0 by 4 Obviously, this only works if the post increment is 4. llvm-svn: 275540	2016-07-15 08:03:56 +00:00
Maxim Ostapenko	26b0bc0bd6	[asan] Reduce flakiness of halt_on_error-torture.cc testcase. It seems in some situations we have clashes on very first error so test only prints "nested bug in the same thread, aborting" rather than "use-after-poison", so remove corresponding " RUN: FileCheck %s < 10.txt" line. Also, the two last " RUN: FileCheck --check-prefix=CHECK-COLLISION %s < 1.txt || FileCheck --check-prefix=CHECK-NO-COLLISION %s < 1.txt" look wrong, they should check 10.txt. Fix these lines too. Differential Revision: https://reviews.llvm.org/D22309 llvm-svn: 275539	2016-07-15 07:57:43 +00:00
James Molloy	2af08fa051	[ARM] Followup to r275537 addressing review comments Address Chad's comment in D22216 which I missed due to tunnel vision on the "LGTM" comment. llvm-svn: 275538	2016-07-15 07:57:35 +00:00
James Molloy	a454a11d60	[ARM] Prefer indirect calls in minsize mode ... When we emit several calls to the same function in the same basic block. An indirect call uses a "BLX r0" instruction which has a 16-bit encoding. If many calls are made to the same target, this can enable significant code size reductions. llvm-svn: 275537	2016-07-15 07:55:21 +00:00
Tobias Grosser	a56f8f8e58	GPGPU: Shorten ppcg include paths to avoid conflict with cuda.h Instead of directly linking to ppcg's main source directory, we link to the parent directory. This allows us to access ppcg's include files with 'ppcg/cuda.h' and avoids a conflict with NVIDIA's cuda.h header. Also drop an include directory that is currently not used. llvm-svn: 275536	2016-07-15 07:50:36 +00:00

236581 Commits