llvm-project

Commit Graph

Author	SHA1	Message	Date
Lang Hames	4a59a34597	[Kaleidoscope][BuildingAJIT] Fix a bug in the symbol resolver in Chapter2. Symbol resolution should be done on the top layer of the stack unless there's a good reason to do otherwise. In this case it would have worked because OptimizeLayer::addModuleSet eagerly passes all modules down to the CompileLayer, meaning that searches in CompileLayer will find the definitions. In later chapters where the top layer's addModuleSet isn't a pass-through, this would break. llvm-svn: 270899	2016-05-26 19:44:33 +00:00
Krzysztof Parzyszek	da0b9a959e	[Hexagon] Enable the post-RA scheduler The aggressive anti-dependency breaker can rename the restored callee- saved registers. To prevent this, mark these registers are live on all paths to the return/tail-call instructions, and add implicit use operands for them to these instructions. llvm-svn: 270898	2016-05-26 19:44:28 +00:00
Chad Rosier	14aa2ad1f4	[AArch64] Generate rev16/rev32 from bswap + srl when upper bits are known zero. Canonicalize (srl (bswap i32 x), 16) to (rotr (bswap i32 x), 16), if the high 16-bits of x are zero. Similarly, canonicalize (srl (bswap i64 x), 32) to (rotr (bswap i64 x), 32), if the high 32-bits of x are zero. test_rev_w_srl16: test_rev_w_srl16: and w8, w0, #0xffff and w8, w0, #0xffff rev w8, w8 ---> rev16 w0, w8 lsr w0, w8, #16 test_rev_x_srl32: test_rev_x_srl32: rev x8, x8 ---> rev32 x0, x8 lsr x0, x8, #32 llvm-svn: 270896	2016-05-26 19:41:33 +00:00
Changpeng Fang	71369b3a39	AMDGPU/SI: Enable load-store-opt by default. Summary: Enable load-store-opt by default, and update LIT tests. Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D20694 llvm-svn: 270894	2016-05-26 19:35:29 +00:00
Michael Kuperstein	ae21491819	[BasicAA] Extend inbound GEP negative offset logic to GlobalVariables r270777 improved the precision of alloca vs. inbounbds GEP alias queries: if we have (a) an inbounds GEP and (b) a pointer based on an alloca, and the beginning of the object the GEP points to would have a negative offset with respect to the alloca, then the GEP can not alias pointer (b). This makes the same logic fire when (b) is based on a GlobalVariable instead of an alloca. Differential Revision: http://reviews.llvm.org/D20652 llvm-svn: 270893	2016-05-26 19:30:49 +00:00
David Majnemer	d99068d26d	[MemCpyOpt] Don't perform callslot optimization across may-throw calls An exception could prevent a store from occurring but MemCpyOpt's callslot optimization would fire anyway, causing the store to occur. This fixes PR27849. llvm-svn: 270892	2016-05-26 19:24:24 +00:00
Rafael Espindola	30c080a085	coff: fix the section of weak symbols. llvm-svn: 270889	2016-05-26 18:48:23 +00:00
Michael Kuperstein	9a81b62a01	[BBVectorize] Don't vectorize selects with a scalar condition and vector operands. This fixes PR27879. Differential Revision: http://reviews.llvm.org/D20659 llvm-svn: 270888	2016-05-26 18:43:57 +00:00
Krzysztof Parzyszek	729e7ad31f	Add test/CodeGen/MIR/Hexagon/lit.local.cfg Require that Hexagon is a registered target. llvm-svn: 270887	2016-05-26 18:35:45 +00:00
Krzysztof Parzyszek	143f684a79	Do not rename registers that do not start an independent live range llvm-svn: 270885	2016-05-26 18:22:53 +00:00
Rafael Espindola	6ddf5f4437	coff: fix the value of weak definitions. It looks like this doesn't get a lot of use. llvm-svn: 270883	2016-05-26 18:04:53 +00:00
Pete Cooper	d6e6bf1808	Don't allocate in APInt::slt. NFC. APInt::slt was copying the LHS and RHS in to temporaries then making them unsigned so that it could use an unsigned comparision. It did this even on the paths which were trivial to give results for, such as the sign bit of the LHS being set while RHS was not set. This changes the logic to return out immediately in the trivial cases, and use an unsigned comparison in the remaining cases. But this time, just use the unsigned comparison directly without creating any temporaries. This works because, for example: true = (-2 slt -1) = (0xFE ult 0xFF) Also added some tests explicitly for slt with APInt's larger than 64-bits so that this new code is tested. Using the memory for 'opt -O2 verify-uselistorder.lto.opt.bc -o opt.bc' (see r236629 for details), this reduces the number of allocations from 26.8M to 23.9M. llvm-svn: 270881	2016-05-26 17:40:07 +00:00
David Majnemer	7f32420ed5	[CaptureTracking] Volatile operations capture their memory location The memory location that corresponds to a volatile operation is very special. They are observed by the machine in ways which we cannot reason about. Differential Revision: http://reviews.llvm.org/D20555 llvm-svn: 270879	2016-05-26 17:36:22 +00:00
Artem Belevich	11f69ba0cf	Init member structs in constructor. Fixes build error on windows where MSVC does not support list initialization inside member initializer list. llvm-svn: 270877	2016-05-26 17:29:20 +00:00
Xinliang David Li	b02f3b141c	Revert 270865 -- unexplained bot failure on linux/ppcle llvm-svn: 270876	2016-05-26 17:27:22 +00:00
Lang Hames	105518fe3c	[Orc] Merge some common code for creating CompileCallbackManagers and IndirectStubsManagers. llvm-svn: 270874	2016-05-26 17:20:35 +00:00
Artem Belevich	49e9a81236	[NVPTX] Added NVVMIntrRange pass NVVMIntrRange adds !range metadata to calls of NVVM intrinsics that return values within known limited range. This allows LLVM to generate optimal code for indexing arrays based on tid/ctaid which is a frequently used pattern in CUDA code. Differential Revision: http://reviews.llvm.org/D20644 llvm-svn: 270872	2016-05-26 17:02:56 +00:00
Artem Tamazov	6edc135d0f	[AMDGPU][llvm-mc] s_getreg/setreg* - hwreg - factor out strings/literals etc. Hwreg(...) syntax implementation unified with sendmsg(...). Common strings moved to Utils MathExtras.h functionality utilized. Added missing build dependency in Disassembler. Differential Revision: http://reviews.llvm.org/D20381 llvm-svn: 270871	2016-05-26 17:00:33 +00:00
Chris Bieneman	1bc0f4395c	[CMake] Restrict libxar linkage to just llvm-objdump At some point we're going to need libObject to have this dependency, but as it is now this is causing too many headaches. This commit will reduce the linkage to just llvm-objdump where it is strictly needed, and we'll cross the libObject bridge later when we need it. llvm-svn: 270866	2016-05-26 16:32:40 +00:00
Xinliang David Li	0777a93bee	Use new interface in Triple /NFC llvm-svn: 270865	2016-05-26 16:28:01 +00:00
Adrian Prantl	aa9d6c3630	Undo a suboptimal clang-format decision. NFC llvm-svn: 270861	2016-05-26 16:06:04 +00:00
Ismail Donmez	a1c8c60e20	Since some time clang itself figures out the default for ms-compatibility-version and uses it. Trying to figure it out during build is redundant and also will not work when the environment variable VSINSTALLDIR is not defined (which is not defined if you don't install whole Visual Studio but use Visual C++ Build Tools package). Tested by bootstrapping clang with clang-cl. Differential Revision: http://reviews.llvm.org/D20672 llvm-svn: 270860	2016-05-26 15:52:23 +00:00
Artem Tamazov	b49c3361e5	Fix build warning introduced in r270552 "[AMDGPU][llvm-mc] Disassembler: support for TTMP/TBA/TMA registers." llvm-svn: 270859	2016-05-26 15:52:16 +00:00
Simon Pilgrim	cf340bd9c1	[X86][SSE] When lowering a 256-bit shuffle as PMOVZX, reduce the input vector to the lower 128-bit subvector. Most often as not this is what it started out as, the extraction is zero-cost on AVX and the PMOVZX/PMOVSX folding logic is based around 128-bit loads. llvm-svn: 270858	2016-05-26 15:40:36 +00:00
Krzysztof Parzyszek	de37cfb596	[Hexagon] Select the aggressive anti-dependency breaker llvm-svn: 270857	2016-05-26 15:38:50 +00:00
Diana Picus	81bc3170e8	[AMDGPU] Remove exit-on-error flag from test (PR27762) Similar to r269948, but for argument lowering. Fixes PR27762 Differential Revision: http://reviews.llvm.org/D20430 llvm-svn: 270856	2016-05-26 15:24:55 +00:00
Diana Picus	20a8d8e97e	[BPF] Remove exit-on-error flag in test (PR27767) The exit-on-error flag is needed to avoid an assert where llvm::SelectionDAGISel::LowerArguments doesn't create enough arguments. Fill up with zeroes to reach the right number of args. Fixes PR27767. Differential Revision: http://reviews.llvm.org/D20571 llvm-svn: 270855	2016-05-26 15:23:50 +00:00
Chad Rosier	e5819e2732	[InstCombine] Catch more bswap cases missed due to zext and truncs. Fixes PR27824. Differential Revision: http://reviews.llvm.org/D20591. llvm-svn: 270853	2016-05-26 14:58:51 +00:00
Simon Pilgrim	50c37ceb3b	[X86][SSE] Added load_zext_16i8_to_8i32 test Odd issue with input vector not being folded into pmovzx on AVX2+ targets llvm-svn: 270852	2016-05-26 14:45:30 +00:00
Teresa Johnson	28c03b56ec	[ThinLTO] Resolve LinkOnceAny Summary: Ensure we keep prevailing copy of LinkOnceAny by converting it to WeakAny. Rename odr_resolution test to the now more appropriate weak_resolution (weak in the linker sense includes linkonce). Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D20634 llvm-svn: 270850	2016-05-26 14:16:52 +00:00
Chad Rosier	816a67da49	[AArch64] Generate a BFI/BFXIL from 'or (and X, MaskImm), OrImm'. If and only if the value being inserted sets only known zero bits. This combine transforms things like and w8, w0, #0xfffffff0 movz w9, #5 orr w0, w8, w9 into movz w8, #5 bfxil w0, w8, #0, #4 The combine is tuned to make sure we always reduce the number of instructions. We avoid churning code for what is expected to be performance neutral changes (e.g., converted AND+OR to OR+BFI). Differential Revision: http://reviews.llvm.org/D20387 llvm-svn: 270846	2016-05-26 13:27:56 +00:00
Rafael Espindola	a224de06bc	Use shouldAssumeDSOLocal on AArch64. This reduces code duplication and now AArch64 also handles PIE. llvm-svn: 270844	2016-05-26 12:42:55 +00:00
Igor Breger	8437bb70fd	[AVX512] Fix intrinsic cmp{sd\|ss} lowering. Differential Revision: http://reviews.llvm.org/D20615 llvm-svn: 270843	2016-05-26 12:42:25 +00:00
John Brawn	3546c2f158	Add auto-exporting of symbols from tools so that plugins work on Windows The problem with plugins on Windows is that when building a plugin DLL it needs to explicitly link against something (an exe or DLL) if it uses symbols from that thing, and that thing must explicitly export those symbols. Also there's a limit of 65535 symbols that can be exported. This means that currently plugins only work on Windows when using BUILD_SHARED_LIBS, and that doesn't work with MSVC. This patch adds an LLVM_EXPORT_SYMBOLS_FOR_PLUGINS option, which when enabled automatically exports from all LLVM tools the symbols that a plugin could want to use so that a plugin can link against a tool directly. Plugins can specify what tool they link against by using PLUGIN_TOOL argument to llvm_add_library. The option can also be enabled on Linux, though there all it should do is restrict the set of symbols that are exported as by default all symbols are exported. This option is currently OFF by default, as while I've verified that it works with MSVC, linux gcc, and cygwin gcc, I haven't tried mingw gcc and I have no idea what will happen on OSX. Also unfortunately we can't turn on LLVM_ENABLE_PLUGINS when the option is ON as bugpoint-passes needs to be loaded by both bugpoint.exe and opt.exe which is incompatible with this approach. Also currently clang plugins don't work with this approach, which will be fixed in future patches. Differential Revision: http://reviews.llvm.org/D18826 llvm-svn: 270839	2016-05-26 11:16:43 +00:00
Simon Pilgrim	ab3809193c	[X86][F16C] Added F16C fast-isel tests to match clang/test/CodeGen/f16c-builtins.c llvm-svn: 270837	2016-05-26 10:26:56 +00:00
Simon Pilgrim	0e4fdc0842	[X86][AVX2] Added gather fast-isel tests to match clang/test/CodeGen/avx2-builtins.c llvm-svn: 270835	2016-05-26 10:07:05 +00:00
Chris Dewhurst	9013d069b0	[Sparc] Extend the assembler printing support for Sparc back-end. Allows display of floating-point registers and display of assembler meta-data output. llvm-svn: 270829	2016-05-26 07:28:31 +00:00
David Majnemer	474512576e	[MergedLoadStoreMotion] Don't transform across may-throw calls It is unsafe to hoist a load before a function call which may throw, the throw might prevent a pointer dereference. Likewise, it is unsafe to sink a store after a call which may throw. The caller might be able to observe the difference. This fixes PR27858. llvm-svn: 270828	2016-05-26 07:11:09 +00:00
Adam Nemet	4f7bbf617b	[ConstantFold] NFC cleanup after previous change. Merge two conditions. llvm-svn: 270827	2016-05-26 07:08:09 +00:00
Adam Nemet	c68534bd13	[ConstantFold] Fix incorrect index rewrites for GEPs Summary: If an index for a vector or array type is out-of-range GEP constant folding tries to factor it into preceding dimensions. The code however does not consider addressing of structure field padding which should not qualify as out-of-range index. As demonstrated by the testcase, this can occur if the indexing performed on a vector type and the preceding index is an array type. SROA generates GEPs for example involving padding bytes as it slices an alloca. My fix disables this folding if the element type is a vector type. I believe that this is the only way we can end up with padding. (We have no access to DataLayout so I am not sure if there is actual robust way of actually checking the presence of padding.) Reviewers: majnemer Subscribers: llvm-commits, Gerolf Differential Revision: http://reviews.llvm.org/D20663 llvm-svn: 270826	2016-05-26 07:08:05 +00:00
David Majnemer	8cce333abd	[MergedLoadStoreMotion] Small cleanup No functional change is intended. llvm-svn: 270824	2016-05-26 05:43:12 +00:00
Peter Collingbourne	b9aa1f4a03	MemorySSA: Revert r269678 and r268068; replace with special casing in MemorySSA. It turns out that too many passes are relying on alias analysis results for control dependencies. Until we fix that by introducing a more accurate modelling of control dependencies, special case assume in MemorySSA instead. Also introduce tests to ensure we don't regress the FunctionAttrs or LICM passes. Differential Revision: http://reviews.llvm.org/D20658 llvm-svn: 270823	2016-05-26 04:58:46 +00:00
Steven Wu	ec6f56eb39	Revert "[CMake] LINK_LIBS need to be public for Darwin dylib targets" This reverts r270723. This commit breaks greendragon. llvm-svn: 270820	2016-05-26 04:35:35 +00:00
Craig Topper	a423aa4642	[X86] Add the AVX storeu intrinsics to InstCombine and LoopStrengthReduce in the same places that the SSE/SSE2 storeu intrinsics appear. I don't really know how to test this. Just seemed like we should be consistent. llvm-svn: 270819	2016-05-26 04:28:45 +00:00
Sanjoy Das	ee77a4828e	[IRCE] Use C++11 style initializers; NFC llvm-svn: 270815	2016-05-26 01:50:18 +00:00
Teresa Johnson	683abe79b2	[ThinLTO/gold] Handle bitcode archives Summary: Several changes were required for ThinLTO links involving bitcode archive static libraries. With this patch clang/llvm bootstraps with ThinLTO and gold. The first is that the gold callbacks get_input_file and release_input_file can normally be used to get file information for each constituent bitcode file within an archive. However, these interfaces lock the underlying file and can't be for each archive constituent for ThinLTO backends where we get all the input files up front and don't release any until after the backend threads complete. However, it is sufficient to only get and release once per file, and then each consituent bitcode file can be accessed via get_view. This required saving some information to identify which file handle is the "leader" for each claimed file sharing the same file descriptor, and other information so that get_input_file isn't necessary later when processing the backends. Second, the module paths in the index need to distinguish between different constituent bitcode files within the same archive file, otherwise they will all end up with the same archive file path. Do this by appending the offset within the archive for the start of the bitcode file, returned by get_input_file when we claim each bitcode file, and saving that along with the file handle. Third, rather than have the function importer try to load a file based on the module path identifier (which now contains a suffix to distinguish different bitcode files within an archive), use a custom module loader. This is the same approach taken in libLTO, and I am using the support refactored into the new LTO.h header in r270509. The module loader parses the bitcode files out of the memory buffers returned from gold via the get_view callback and saved in a map. This also means that we call the function importer directly, rather than add it to the pass pipeline (which was in the plan to do already for other reasons). Reviewers: pcc, joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D20559 llvm-svn: 270814	2016-05-26 01:46:41 +00:00
Saleem Abdulrasool	fbf920f9b4	llvm-objdump: support dumping AUX records for weak externals This is a support COFF feature. Ensure that we can display the weak externals auxiliary symbol. It contains useful information (such as the default binding and how to resolve the symbol). This reapplies the previous patch with a modification which hopefully should fix the endianness issues. The variadic call would promote the ulittle32_t to a uint32_t which would lose the byte-swapping behaviour desired. llvm-svn: 270813	2016-05-26 01:45:12 +00:00
Peter Collingbourne	ffecb1441b	MemorySSA: Remove argument to createNewAccess function. There is only one caller of MemorySSA::createNewAccess, and it passes true as the IgnoreNonMemory argument. Remove that argument and fold its behavior into createNewAccess. llvm-svn: 270812	2016-05-26 01:19:17 +00:00
Peter Collingbourne	7c10dd1411	MemorySSA: Fix example in header comment. This fixes the example so that it matches the pass's behavior. I was a little confused by the example until I tried running it and realized that there was a mistake. Differential Revision: http://reviews.llvm.org/D20657 llvm-svn: 270811	2016-05-26 01:19:14 +00:00
Lang Hames	be84d2beee	[Kaleidoscope][BuildingAJIT] Add a stub Chapter 2 doc. llvm-svn: 270809	2016-05-26 00:38:04 +00:00

1 2 3 4 5 ...

132427 Commits