llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	476170384f	[X86] Tidied up shuffle decode function doxygen descriptions As discussed on D18441 - auto brief is used so we don't need /brief, we don't need to include the function name and added some missing descriptions. llvm-svn: 265785	2016-04-08 14:17:07 +00:00
Adhemerval Zanella	15cba9e7e5	[lld] [ELF/AARCH64] Fix dynamic relocation from PIC GOT access This patch fixes dynamic relocation creation from GOT access in dynamic objects on aarch64. Current code creates a plt relative one (R_AARCH64_JUMP_SLOT) instead of a got relative (R_AARCH64_GLOB_DAT). It leads the programs fails with: $ cat t.cc std::string test = "hello...\n"; int main () { printf ("%s\n", test.c_str()); return 0; } $ clang++ t.cc -fpic -o t $ ./t hello... Segmentation fault (core dumped) Due the fact it will try to access the plt instead of the got for __cxa_atexit registration for the std::string destruction. It will lead in a bogus function address in atexit. llvm-svn: 265784	2016-04-08 14:10:41 +00:00
Alexey Bader	954ba21f85	[OpenCL] Complete image types support. I. Current implementation of images is not conformant to spec in the following points: 1. It makes no distinction with respect to access qualifiers and therefore allows to use images with different access type interchangeably. The following code would compile just fine: void write_image(write_only image2d_t img); kernel void foo(read_only image2d_t img) { write_image(img); } // Accepted code which is disallowed according to s6.13.14. 2. It discards access qualifier on generated code, which leads to generated code for the above example: call void @write_image(%opencl.image2d_t* %img); In OpenCL2.0 however we can have different calls into write_image with read_only and wite_only images. Also generally following compiler steps have no easy way to take different path depending on the image access: linking to the right implementation of image types, performing IR opts and backend codegen differently. 3. Image types are language keywords and can't be redeclared s6.1.9, which can happen currently as they are just typedef names. 4. Default access qualifier read_only is to be added if not provided explicitly. II. This patch corrects the above points as follows: 1. All images are encapsulated into a separate .def file that is inserted in different points where image handling is required. This avoid a lot of code repetition as all images are handled the same way in the code with no distinction of their exact type. 2. The Cartesian product of image types and image access qualifiers is added to the builtin types. This simplifies a lot handling of access type mismatch as no operations are allowed by default on distinct Builtin types. Also spec intended access qualifier as special type qualifier that are combined with an image type to form a distinct type (see statement above - images can't be created w/o access qualifiers). 3. Improves testing of images in Clang. Author: Anastasia Stulova Reviewers: bader, mgrang. Subscribers: pxli168, pekka.jaaskelainen, yaxunl. Differential Revision: http://reviews.llvm.org/D17821 llvm-svn: 265783	2016-04-08 13:40:33 +00:00
Aaron Ballman	829b5d42af	Silencing a 32-bit shift implicit conversion warning from MSVC; NFC. llvm-svn: 265782	2016-04-08 12:21:58 +00:00
Chuang-Yu Cheng	98c1894755	CXX_FAST_TLS calling convention: performance improvement for PPC64 This is the same change on PPC64 as r255821 on AArch64. I have even borrowed his commit message. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allcoator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given machine function and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. Author: Tom Jablin (tjablin) Reviewers: hfinkel kbarton cycheng http://reviews.llvm.org/D17533 llvm-svn: 265781	2016-04-08 12:04:32 +00:00
Vasileios Kalintiris	957d849e03	[mips] Use range-based for loops. NFC. llvm-svn: 265780	2016-04-08 10:33:00 +00:00
Johannes Doerfert	41cda15940	[FIX] Allow to lookup domains for non-affine subregion blocks llvm-svn: 265779	2016-04-08 10:32:26 +00:00
Johannes Doerfert	3ef78d6d38	[FIX] Adjust execution context of hoisted loads wrt. error domains If we build the domains for error blocks and later remove them we lose the information that they are not executed. Thus, in the SCoP it looks like the control will always reach the statement S: for (i = 0 ... N) if (valid == 0) doSth(&ptr); S: A[i] = ptr; Consequently, we would have assumed "ptr" to be always accessed and preloaded it unconditionally. However, only if "valid != 0" we would execute the optimized version of the SCoP. Nevertheless, we would have hoisted and accessed "ptr"regardless of "valid". This changes the semantic of the program as the value of "*valid" can cause a change of "ptr" and control if it is executed or not. To fix this problem we adjust the execution context of hoisted loads wrt. error domains. To this end we introduce an ErrorDomainCtxMap that maps each basic block to the error context under which it might be executed. Thus, to the context under which it is executed but an error block would have been executed to. To fill this map one traversal of the blocks in the SCoP suffices. During this traversal we do also "remove" error statements and those that are only reachable via error statements. This was previously done by the removeErrorBlockDomains function which is therefor not needed anymore. This fixes bug PR26683 and thereby several SPEC miscompiles. Differential Revision: http://reviews.llvm.org/D18822 llvm-svn: 265778	2016-04-08 10:30:09 +00:00
Johannes Doerfert	b47cbe1c72	[FIX] Handle multiplications in the SCEVAffinator again If ScalarEvolution cannot look through some expression but we do, it might happen that a multiplication will arrive at the SCEVAffinator::visitMulExpr. While we could always try to improve the extractConstantFactor function we might still miss something, thus we reintroduce the code to generate multiplicative piecewise-affine functions as a fall-back. llvm-svn: 265777	2016-04-08 10:27:40 +00:00
Johannes Doerfert	e85d50defb	Add test cases for the removal of error blocks llvm-svn: 265776	2016-04-08 10:26:39 +00:00
Johannes Doerfert	7b81103589	[FIX] Look through div & srem instructions in SCEVs The findValues() function did not look through div & srem instructions that were part of the argument SCEV. However, in different other places we already look through it. This mismatch caused us to preload values in the wrong order. llvm-svn: 265775	2016-04-08 10:25:58 +00:00
Alexander Kornienko	477e5d8d31	[clang-tidy] cppcoreguidelines-interfaces-global-init Summary: This check flags initializers of globals that access extern objects, and therefore can lead to order-of-initialization problems (this recommandation is part of CPP core guidelines). Note that this only checks half of the guideline for now (it does not enforce using constexpr functions). Reviewers: aaron.ballman, alexfh Subscribers: aaron.ballman, etienneb, Eugene.Zelenko, cfe-commits Patch by Clement Courbet! Differential Revision: http://reviews.llvm.org/D18649 llvm-svn: 265774	2016-04-08 09:51:06 +00:00
Jeroen Ketema	ad659c3400	[llvm-c] Expose LLVMContextGetDiagnostic{Handler,Context} Differential Revision: http://reviews.llvm.org/D18820 llvm-svn: 265773	2016-04-08 09:19:02 +00:00
Zlatko Buljan	53a037f5cc	[mips][microMIPS] Add CodeGen support for ADD, ADDIU, ADDU and DADD* instructions Differential Revision: http://reviews.llvm.org/D16454 llvm-svn: 265772	2016-04-08 07:27:26 +00:00
Craig Topper	00230805f2	Use std::fill to simplify some code. NFC llvm-svn: 265771	2016-04-08 07:10:46 +00:00
Dmitry Polukhin	24c4f28ac9	[IFUNC] Fix ifunc-asm.ll test It seems that llc cannot be called used in assembler tests so test that checks asm for particular target needs to be moved to codegen. llvm-svn: 265770	2016-04-08 06:45:19 +00:00
Valery Pykhtin	f5cf6fba3f	[AMDGPU] Add some VI disassembler tests missing from previous autogeneration due to different filecheck prefix. NFC. llvm-svn: 265769	2016-04-08 05:42:20 +00:00
Duncan P. N. Exon Smith	4ec55f8ab6	Reapply "ValueMapper: Treat LocalAsMetadata more like function-local Values" This reverts commit r265765, reapplying r265759 after changing a call from LocalAsMetadata::get to ValueAsMetadata::get (and adding a unit test). When a local value is mapped to a constant (like "i32 %a" => "i32 7"), the new debug intrinsic operand may no longer be pointing at a local. http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/19020/ The previous coommit message follows: -- This is a partial re-commit -- maybe more of a re-implementation -- of r265631 (reverted in r265637). This makes RF_IgnoreMissingLocals behave (almost) consistently between the Value and the Metadata hierarchy. In particular: - MapValue returns nullptr or "metadata !{}" for missing locals in MetadataAsValue/LocalAsMetadata bridging paris, depending on the RF_IgnoreMissingLocals flag. - MapValue doesn't memoize LocalAsMetadata-related results. - MapMetadata no longer deals with LocalAsMetadata or RF_IgnoreMissingLocals at all. (This wasn't in r265631 at all, but I realized during testing it would make the patch simpler with no loss of generality.) r265631 went too far, making both functions universally ignore RF_IgnoreMissingLocals. This broke building (e.g.) compiler-rt. Reassociate (and possibly other passes) don't currently maintain dominates-use invariants for metadata operands, resulting in IR like this: define void @foo(i32 %arg) { call void @llvm.some.intrinsic(metadata i32 %x) %x = add i32 1, i32 %arg } If the inliner chooses to inline @foo into another function, then RemapInstruction will call `MapValue(metadata i32 %x)` and assert that the return is not nullptr. I've filed PR27273 to add a Verifier check and fix the underlying problem in the optimization passes. As a workaround, return `!{}` instead of nullptr for unmapped LocalAsMetadata when RF_IgnoreMissingLocals is unset. Otherwise, match the behaviour of r265631. Original commit message: ValueMapper: Make LocalAsMetadata match function-local Values Start treating LocalAsMetadata similarly to function-local members of the Value hierarchy in MapValue and MapMetadata. - Don't memoize them. - Return nullptr if they are missing. This also cleans up ConstantAsMetadata to stop listening to the RF_IgnoreMissingLocals flag. llvm-svn: 265768	2016-04-08 03:13:22 +00:00
Sanjoy Das	f60a0d7452	Adapt to LLVM API change Replace mayBeOverridden with isInterposable llvm-svn: 265767	2016-04-08 01:31:02 +00:00
Richard Smith	6a6747d9dd	[modules] Add a comment to explain why -E leaves some #includes in the preprocessed output. llvm-svn: 265766	2016-04-08 01:23:59 +00:00
Duncan P. N. Exon Smith	805873148a	Revert "ValueMapper: Treat LocalAsMetadata more like function-local Values" This reverts commit r265759, since even this limited version breaks some bots: http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/3311 http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/17696 This also reverts r265761 "ValueMapper: Unduplicate RF_NoModuleLevelChanges check, NFC", since I had trouble separating it from r265759. llvm-svn: 265765	2016-04-08 00:56:21 +00:00
Quentin Colombet	e57546de40	[TargetRegisterInfo] Re-apply r265734. Original commit message: [TargetRegisterInfo] Refactor the code to use BitMaskClassIterator. llvm-svn: 265764	2016-04-08 00:51:00 +00:00
Quentin Colombet	2fbe04e93d	[TargetRegisterInfo] Fix BitMaskClassIterator::moveToNextID implementation. Make sure we do not read past the size of the mask. Although we were not using the value read, this is bad and makes ASan complain. llvm-svn: 265763	2016-04-08 00:50:58 +00:00
Sanjoy Das	5ce3272833	Don't IPO over functions that can be de-refined Summary: Fixes PR26774. If you're aware of the issue, feel free to skip the "Motivation" section and jump directly to "This patch". Motivation: I define "refinement" as discarding behaviors from a program that the optimizer has license to discard. So transforming: ``` void f(unsigned x) { unsigned t = 5 / x; (void)t; } ``` to ``` void f(unsigned x) { } ``` is refinement, since the behavior went from "if x == 0 then undefined else nothing" to "nothing" (the optimizer has license to discard undefined behavior). Refinement is a fundamental aspect of many mid-level optimizations done by LLVM. For instance, transforming `x == (x + 1)` to `false` also involves refinement since the expression's value went from "if x is `undef` then { `true` or `false` } else { `false` }" to "`false`" (by definition, the optimizer has license to fold `undef` to any non-`undef` value). Unfortunately, refinement implies that the optimizer cannot assume that the implementation of a function it can see has all of the behavior an unoptimized or a differently optimized version of the same function can have. This is a problem for functions with comdat linkage, where a function can be replaced by an unoptimized or a differently optimized version of the same source level function. For instance, FunctionAttrs cannot assume a comdat function is actually `readnone` even if it does not have any loads or stores in it; since there may have been loads and stores in the "original function" that were refined out in the currently visible variant, and at the link step the linker may in fact choose an implementation with a load or a store. As an example, consider a function that does two atomic loads from the same memory location, and writes to memory only if the two values are not equal. The optimizer is allowed to refine this function by first CSE'ing the two loads, and the folding the comparision to always report that the two values are equal. Such a refined variant will look like it is `readonly`. However, the unoptimized version of the function can still write to memory (since the two loads //can// result in different values), and selecting the unoptimized version at link time will retroactively invalidate transforms we may have done under the assumption that the function does not write to memory. Note: this is not just a problem with atomics or with linking differently optimized object files. See PR26774 for more realistic examples that involved neither. This patch: This change introduces a new set of linkage types, predicated as `GlobalValue::mayBeDerefined` that returns true if the linkage type allows a function to be replaced by a differently optimized variant at link time. It then changes a set of IPO passes to bail out if they see such a function. Reviewers: chandlerc, hfinkel, dexonsmith, joker.eph, rnk Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18634 llvm-svn: 265762	2016-04-08 00:48:30 +00:00
Duncan P. N. Exon Smith	0fa8aca0e6	ValueMapper: Unduplicate RF_NoModuleLevelChanges check, NFC llvm-svn: 265761	2016-04-08 00:41:10 +00:00
Adrian Prantl	3e9c88753b	DwarfDebug: Support floating point constants in location lists. This patch closes a gap in the DWARF backend that caused LLVM to drop debug info for floating point variables that were constant for part of their scope. Floating point constants are emitted as one or more DW_OP_constu joined via DW_OP_piece. This fixes a regression caught by the LLDB testsuite that I introduced in r262247 when we stopped blindly expanding the range of singular DBG_VALUEs to span the entire scope and started to emit location lists with accurate ranges instead. Also deletes a now-impossible testcase (debug-loc-empty-entries). <rdar://problem/25448338> llvm-svn: 265760	2016-04-08 00:38:37 +00:00
Duncan P. N. Exon Smith	267185ec92	ValueMapper: Treat LocalAsMetadata more like function-local Values This is a partial re-commit -- maybe more of a re-implementation -- of r265631 (reverted in r265637). This makes RF_IgnoreMissingLocals behave (almost) consistently between the Value and the Metadata hierarchy. In particular: - MapValue returns nullptr or "metadata !{}" for missing locals in MetadataAsValue/LocalAsMetadata bridging paris, depending on the RF_IgnoreMissingLocals flag. - MapValue doesn't memoize LocalAsMetadata-related results. - MapMetadata no longer deals with LocalAsMetadata or RF_IgnoreMissingLocals at all. (This wasn't in r265631 at all, but I realized during testing it would make the patch simpler with no loss of generality.) r265631 went too far, making both functions universally ignore RF_IgnoreMissingLocals. This broke building (e.g.) compiler-rt. Reassociate (and possibly other passes) don't currently maintain dominates-use invariants for metadata operands, resulting in IR like this: define void @foo(i32 %arg) { call void @llvm.some.intrinsic(metadata i32 %x) %x = add i32 1, i32 %arg } If the inliner chooses to inline @foo into another function, then RemapInstruction will call `MapValue(metadata i32 %x)` and assert that the return is not nullptr. I've filed PR27273 to add a Verifier check and fix the underlying problem in the optimization passes. As a workaround, return `!{}` instead of nullptr for unmapped LocalAsMetadata when RF_IgnoreMissingLocals is unset. Otherwise, match the behaviour of r265631. Original commit message: ValueMapper: Make LocalAsMetadata match function-local Values Start treating LocalAsMetadata similarly to function-local members of the Value hierarchy in MapValue and MapMetadata. - Don't memoize them. - Return nullptr if they are missing. This also cleans up ConstantAsMetadata to stop listening to the RF_IgnoreMissingLocals flag. llvm-svn: 265759	2016-04-08 00:33:44 +00:00
Rui Ueyama	57bbdaf937	Simplify createELFFile. NFC. llvm-svn: 265758	2016-04-08 00:18:25 +00:00
Rui Ueyama	4655ea3ff8	Define a helper function to simplify. NFC. llvm-svn: 265757	2016-04-08 00:14:55 +00:00
Richard Smith	466a15ef7f	[modules] Don't write @import in -E output if the current language mode doesn't support @import; use the form as written instead. llvm-svn: 265756	2016-04-08 00:09:53 +00:00
Quentin Colombet	e0a7ffa6cb	Revert "[TargetRegisterInfo] Refactor the code to use BitMaskClassIterator." This reverts commit r265734. Looks like ASan is not happy about it. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/11741 Looking. llvm-svn: 265755	2016-04-08 00:03:51 +00:00
Quentin Colombet	dcf5cf6a29	[RegisterBankInfo] Make the debug output more compact. Print the mask of the partial mapping as an hexadecimal instead of a binary value. llvm-svn: 265754	2016-04-08 00:03:49 +00:00
Sunil Srivastava	3780b68b8f	Fix formatting and wording of llvm-ranlib error message. NFC. Patch by Douglas Yung! Reviewed by Rafael Espindola llvm-svn: 265753	2016-04-08 00:02:14 +00:00
Davide Italiano	08f8f21b91	[IR/Verifier] Fix (yet another) crash. We need to check that if we reference a retainedType from DICompileUnit we're actually referencing a DICompositeType. llvm-svn: 265752	2016-04-08 00:01:32 +00:00
Justin Lebar	25c36fd61b	[CUDA] Tweak math forward declares so we're compatible with libstdc++4.9. Summary: See comments in patch; we were assuming that some stdlib math functions would be defined in namespace std, when in fact the spec says they should be defined in the global namespace. libstdc++4.9 became more conforming and broke us. This new implementation seems to cover the known knowns. Reviewers: rsmith Subscribers: cfe-commits, tra Differential Revision: http://reviews.llvm.org/D18882 llvm-svn: 265751	2016-04-07 23:55:53 +00:00
Rui Ueyama	6c8b0c1b96	Use EM_NONE instead of 0 to represent an invalid value. NFC. llvm-svn: 265750	2016-04-07 23:54:33 +00:00
Quentin Colombet	e16f561d91	[RegBankSelect] Add a few debug statements. llvm-svn: 265749	2016-04-07 23:53:55 +00:00
Rui Ueyama	d86ec30168	ELF: Add --build-id=sha1 option. llvm-svn: 265748	2016-04-07 23:51:56 +00:00
Quentin Colombet	9a2ae85e67	[RegisterBankInfo] Add print and dump method to the InstructionMapping helper class. llvm-svn: 265747	2016-04-07 23:31:58 +00:00
Quentin Colombet	e087c9fc12	[RegisterBankInfo] Add print and dump method to the ValueMapping helper class. llvm-svn: 265746	2016-04-07 23:25:43 +00:00
Quentin Colombet	03c419628e	[MachineInstr] Teach the print method about RegisterBank. Properly print either the register class or the register bank or a virtual register. Get rid of a few ifdefs in the process. llvm-svn: 265745	2016-04-07 23:18:11 +00:00
Peter Collingbourne	e217660591	ELF: Ignore --detect-odr-violations flag. Differential Revision: http://reviews.llvm.org/D18877 llvm-svn: 265744	2016-04-07 23:15:50 +00:00
Quentin Colombet	88f7f6bc4f	[AArch64] Fix a typo in the register class to register bank mapping. For GPR family we want the GPR register bank, not FPR! llvm-svn: 265743	2016-04-07 23:10:14 +00:00
Quentin Colombet	f4e36faa9c	[RegisterBankInfo] Escap \@ in r265741. [-Wdocumentation] llvm-svn: 265742	2016-04-07 23:08:52 +00:00
Quentin Colombet	c68b020412	[RegisterBankInfo] Change the semantic of recordRegBankForType. Now, recordRegBankForType records only the first register bank that covers a type instead of the last. This behavior can, nevertheless, be override with the additional Force parameter to force the update. llvm-svn: 265741	2016-04-07 23:02:00 +00:00
David Blaikie	97a42caeb0	llvm-dwarfdump: Use deque rather than vector to preserve object reference/pointer identity TUs in each unit refer to the unit they are in, if the unit is moved this reference is invalidated & things break. No test case because UB isn't testable - ASan would likely catch this on a large enough test case (just needs to have enough TUs that a reallocation of the vector would occur) but didn't seem worthwhile. Up for debate/revisiting if anyone feels strongly. llvm-svn: 265740	2016-04-07 22:59:58 +00:00
Quentin Colombet	ac40034e06	[RegisterBankInfo] Strengthen getInstrMappingImpl. Teach the target independent code how to take advantage of type information to get the mapping of an instruction. llvm-svn: 265739	2016-04-07 22:52:49 +00:00
Adrian McCarthy	f9f3609704	Fix TestImport for Windows by ensuring backslashes in the directory paths are properly escaped in Python. The Python import works by ensuring the directory of the module or package is in sys.path, and then it does a Python `import foo`. The original code was not escaping the backslashes in the directory path, so this wasn't working. Differential Revision: http://reviews.llvm.org/D18873 llvm-svn: 265738	2016-04-07 22:52:12 +00:00
Rui Ueyama	3a41be277a	ELF: Implement --build-id=md5. Previously, we supported only one hash function, FNV-1, so BuildIdSection directly handled hash computation. In this patch, I made BuildIdSection an abstract class and defined two subclasses, BuildIdFnv1 and BuildIdMd5. llvm-svn: 265737	2016-04-07 22:49:21 +00:00
Quentin Colombet	e918006a87	[RegisterBankInfo] Add a way to record what register bank covers a specific type. This will be used to find the default mapping of the instruction. Also, this information is recorded, instead of computed, because it is expensive from a type to know which register bank maps it. Indeed, we need to iterate through all the register classes of all the register banks to find the one that maps the given type. llvm-svn: 265736	2016-04-07 22:45:42 +00:00

1 2 3 4 5 ...

227245 Commits All Branches Search

227245 Commits

All Branches