llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	dc01ca1896	Remove unnecessary use of unique_ptr::release() used to construct another unique_ptr. llvm-svn: 213556	2014-07-21 16:23:21 +00:00
Zachary Turner	07b21d7a69	Rename dosep.ty to dosep.py llvm-svn: 213555	2014-07-21 16:16:31 +00:00
David Blaikie	370a67a56c	Remove unused variable. llvm-svn: 213554	2014-07-21 16:13:24 +00:00
Zachary Turner	37373b0c98	Remove spurious debugging message from CMake. llvm-svn: 213553	2014-07-21 16:10:20 +00:00
Tom Stellard	e812f2fdd8	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	b02094e115	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	42639a57de	R600/SI: Specify wavefront size for SI and CI llvm-svn: 213550	2014-07-21 15:44:58 +00:00
Tom Stellard	8e44d948b6	R600/SI: Remove vaddr operand from BUFFER_LOAD_*_OFFSET instructions This operand is never used. llvm-svn: 213549	2014-07-21 15:44:55 +00:00
Daniel Sanders	e22244b733	[mips] Do not emit '.module fp=...' unless we really need to. We now emit this value when we need to contradict the default value. This restores support for binutils 2.24. When a suitable binutils has been released we can resume unconditionally emitting .module directives. This is preferable to omitting the .module directives since the .module directives protect against, for example, accidentally assembling FP32 code with -mfp64 and producing an unusable object. llvm-svn: 213548	2014-07-21 15:25:24 +00:00
Marshall Clow	f915d67c60	make the same change as in 213546 for vector<bool> llvm-svn: 213547	2014-07-21 15:15:15 +00:00
Marshall Clow	0df880209d	In response to bug #20362 , change the order of operations in vector move assignment so that if the allocator move assignment throws, we aren't left with two objects pointing at the same memory. This is not a complete fix; I am unconvinced that a complete fix is possible. With this change in place, we will leak the old contents of the vector. LWG issue #2106 , when adopted, will make this problem illegal. Thanks to Thomas Koeppe for the report and analysis. llvm-svn: 213546	2014-07-21 15:11:13 +00:00
Robert Khasanov	bfa0131365	[SKX] Enabling SKX target and AVX512BW, AVX512DQ, AVX512VL features. Enabling HasAVX512{DQ,BW,VL} predicates. Adding VK2, VK4, VK32, VK64 masked register classes. Adding new types (v64i8, v32i16) to VR512. Extending calling conventions for new types (v64i8, v32i16) Patch by Zinovy Nis <zinovy.y.nis@intel.com> Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213545	2014-07-21 14:54:21 +00:00
Tom Stellard	32411403b2	docs: Update release documents to include the patch number in the RELEASE tags This will make it easier to update the release scripts to support bug-fix releases. llvm-svn: 213544	2014-07-21 14:28:31 +00:00
Dan Liew	12902a0ed8	Export LLVM_ENABLE_ASSERTIONS in LLVMConfig.cmake so clients know if the version of LLVM they are trying to use was built with or without assertions. llvm-svn: 213532	2014-07-21 14:17:15 +00:00
Tom Stellard	067c81567b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves us the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
Tom Stellard	b2114caf62	R600/SI: Add isCFDepth0 Predicate to SALU addc pattern llvm-svn: 213529	2014-07-21 14:01:12 +00:00
Tom Stellard	54a3b65bb9	R600/SI: Use VALU for i1 XOR llvm-svn: 213528	2014-07-21 14:01:10 +00:00
Tom Stellard	01825afad7	R600/SI: Use a custom encoding method for simm16 in SOPP branch instructions This allows us to explicitly define the type of fixup that is needed, so we can distinguish this from future fixup types. llvm-svn: 213527	2014-07-21 14:01:08 +00:00
Tom Stellard	e08fe68bdd	R600/SI: Rename SOPP operands to match the encoding fields llvm-svn: 213526	2014-07-21 14:01:05 +00:00
Alexander Potapenko	96008ea849	[lsan] Allow using ucontext.h in the test on OSX. llvm-svn: 213523	2014-07-21 13:35:09 +00:00
Daniel Sanders	68c3747efb	[mips] Add MipsOptionRecord abstraction and use it to implement .reginfo/.MIPS.options This abstraction allows us to support the various records that can be placed in the .MIPS.options section in the future. We currently use it to record register usage information (the ODK_REGINFO record in our ELF64 spec). Each .MIPS.options record should subclass MipsOptionRecord and provide an implementation of EmitMipsOptionRecord. Patch by Matheus Almeida and Toma Tabacu llvm-svn: 213522	2014-07-21 13:30:55 +00:00
Tom Stellard	edf1570d4e	TableGen: Allow AddedComplexity values to be negative This is useful for cases when stand-alone patterns are preferred to the patterns included in the instruction definitions. Instead of requiring that stand-alone patterns set a larger AddedComplexity value, which can be confusing to new developers, this allows us to reduce the complexity of the included patterns to achieve the same result. llvm-svn: 213521	2014-07-21 13:28:54 +00:00
Simon Atanasyan	6f3382cd44	[Mips] Fix typo in the comment. llvm-svn: 213520	2014-07-21 13:16:53 +00:00
Hal Finkel	b035621720	Move the CapturesBefore tracker from AA into CaptureTracking There were two generally-useful CaptureTracker classes defined in LLVM: the simple tracker defined in CaptureTracking (and made available via the PointerMayBeCaptured utility function), and the CapturesBefore tracker available only inside of AA. This change moves the CapturesBefore tracker into CaptureTracking, generalizes it slightly (by adding a ReturnCaptures parameter), and makes it generally available via a PointerMayBeCapturedBefore utility function. This logic will be needed, for example, to perform noalias function parameter attribute inference. No functionality change intended. llvm-svn: 213519	2014-07-21 13:15:48 +00:00
Alexander Potapenko	e816521c00	[lsan] Define MAP_ANONYMOUS as MAP_ANON for OSX in the test. llvm-svn: 213518	2014-07-21 13:12:44 +00:00
Aaron Ballman	659b96670c	This declaration has no definition, which is causing MSVC to emit several "no suitable definition provided for explicit template instantiation request" C4661 warnings. llvm-svn: 213517	2014-07-21 13:08:08 +00:00
Alexander Potapenko	4789f63bf3	[lsan] Use a more standard-conformant sched_yield() instead of pthread_yield(). There's no pthread_yield() on OSX (only sched_yield() and pthread_yield_np()). llvm-svn: 213516	2014-07-21 13:01:06 +00:00
Aaron Ballman	6c078a5960	Fixing an MSVC conversion warning about implicitly converting the shift results to 64-bits. No functional change intended. llvm-svn: 213515	2014-07-21 12:31:43 +00:00
Hal Finkel	c782aa5a9b	Move isIdentifiedFunctionLocal from BasicAA to AA The ability to identify function locals will exist outside of BasicAA (for example, logic for inferring noalias function arguments will need this), so make this concept generally accessible without code duplication. No functionality change. llvm-svn: 213514	2014-07-21 12:27:23 +00:00
Daniel Sanders	decb7a2b0b	[mips] Try to fix the test/ExecutionEngine tests on a MIPS host. Fix a dangerous default case that caused MipsCodeEmitter to discard pseudo instructions it didn't recognize. It will now call llvm_unreachable() for unrecognized pseudo's and explicitly handles PseudoReturn, PseudoReturn64, PseudoIndirectBranch, PseudoIndirectBranch64, CFI_INSTRUCTION, IMPLICIT_DEF, and KILL. There may be other pseudos that need handling but this was enough for the ExecutionEngine tests to pass on my test system. llvm-svn: 213513	2014-07-21 12:25:34 +00:00
Alexey Bataev	6125da9258	[OPENMP] Initial parsing and sema analysis for 'flush' directive. llvm-svn: 213512	2014-07-21 11:26:11 +00:00
Daniel Sanders	d7c2796045	[mips] Do not emit '.module [no]oddspreg' unless we really need to. We now emit this directive when we need to contradict the default value (e.g. -mno-odd-spreg is given) or an option changed the default value (e.g. -mfpxx is given). This restores support for the currently available head of binutils. However, at this point binutils 2.24 is still not sufficient since it does not support '.module fp=...'. llvm-svn: 213511	2014-07-21 10:45:47 +00:00
Alexander Musman	d9ed09f7a5	[OPENMP] Parsing/Sema of the OpenMP directive 'critical'. llvm-svn: 213510	2014-07-21 09:42:05 +00:00
Benjamin Kramer	ddf36dea13	[clang-tidy] Fix a false positive in the make_pair checker if an argument has an explicit template argument. This required a rather ugly workaround for a problem in ASTMatchers where callee() is only overloaded for Stmt and Decl but not for Expr. llvm-svn: 213509	2014-07-21 09:40:52 +00:00
Chandler Carruth	efd14a62a3	FileCheck-ize a test. llvm-svn: 213508	2014-07-21 09:23:21 +00:00
Tim Northover	f7a02c1762	CodeGen: emit IR-level f16 conversion intrinsics as fptrunc/fpext This makes the first stage DAG for @llvm.convert.to.fp16 an fptrunc, and correspondingly @llvm.convert.from.fp16 an fpext. The legalisation path is now uniform, regardless of the input IR: fptrunc -> FP_TO_FP16 (if f16 illegal) -> libcall fpext -> FP16_TO_FP (if f16 illegal) -> libcall Each target should be able to select the version that best matches its operations and not be required to duplicate patterns for both fptrunc and FP_TO_FP16 (for example). As a result we can remove some redundant AArch64 patterns. llvm-svn: 213507	2014-07-21 09:13:56 +00:00
Chandler Carruth	3c0012beb6	[SDAG,cleanup] Switch the DAG combiner over to use the spelling 'Worklist' consistently rather than a deeply confusing mixture of 'WorkList' and 'Worklist'. Notably, the very 'WorkList' of the DAG combiner was exposed to target specific DAG combines under an interface 'AddToWorklist' which was implemented by in turn calling 'AddToWorkList' in the combiner. This has sent me circling with the wrong case in grep one too many times. I chose to normalize on 'Worklist' because that one won the grep-vote for llvm/lib/... by a hundred hits or so, and it is used in places relatively "canonical" such as InstCombine's Worklist. Let's all just pick this casing, whether "correct", "good", or "bad" and be consistent... llvm-svn: 213506	2014-07-21 08:56:44 +00:00
Chandler Carruth	24ceb0ce66	[SDAG] Rather than using a narrow test against the one dummy node on the stack, filter all handle nodes from the DAG combiner worklist. This will also handle cases where other handle nodes might be (erroneously) added to the worklist and then cause bugs and explosions when deleted. For example, when running the legalizer within the DAG combiner, there are times when other handle nodes are used and can end up here. llvm-svn: 213505	2014-07-21 08:32:31 +00:00
Andrea Di Biagio	0fb2013192	[DAGCombiner] Improve the shuffle-vector folding logic. Canonicalize shuffles according to rules: * shuffle(A, shuffle(A, B)) -> shuffle(shuffle(A,B), A) * shuffle(B, shuffle(A, B)) -> shuffle(shuffle(A,B), B) * shuffle(B, shuffle(A, Undef)) -> shuffle(shuffle(A, Undef), B) This patch helps identifying more shuffle pairs that could be combined reusing the already existing rules in the DAGCombiner. Added new test 'combine-vec-shuffle-5.ll' to verify that the canonicalized shuffles are now folded into a single shuffle node by the DAGCombiner. Added more test cases to 'combine-vec-shuffle-4.ll'. llvm-svn: 213504	2014-07-21 07:30:54 +00:00
Andrea Di Biagio	4d8bd41600	[DAG] Refactor some logic. No functional change. This patch removes function 'CommuteVectorShuffle' from X86ISelLowering.cpp and moves its logic into SelectionDAG.cpp as method 'getCommutedVectorShuffles'. This refactoring is in preparation of an upcoming change to the DAGCombiner. llvm-svn: 213503	2014-07-21 07:28:51 +00:00
James Dennett	1810fc93b7	Trivial doc fixes: add missing whitespace, and s/overriden/overridden/g. llvm-svn: 213502	2014-07-21 06:14:27 +00:00
Daniel Jasper	58ed9c9167	clang-tidy: [misc-use-override] Slightly tweak the wording of warning. 'final' should really be used with care. llvm-svn: 213501	2014-07-21 06:06:38 +00:00
James Dennett	ab4ebb42f9	Add clang::DesignatedInitExpr::designators() for range-based access, with overloads for designators_range and designators_const_range. llvm-svn: 213500	2014-07-21 06:03:12 +00:00
Richard Smith	22fdae9bd5	Add missing initialization found due to a valgrind false positive. This field is never inspected in the object state initialized by this constructor; however, initializing it seems reasonable, since it has a meaningful value. llvm-svn: 213499	2014-07-21 05:27:31 +00:00
Richard Smith	57721ac591	[modules] Fix some of the confusion when computing the override set for a macro introduced by finalization. This is still not entirely correct; more fixes to follow. llvm-svn: 213498	2014-07-21 04:10:40 +00:00
Gerolf Hoflehner	ae1ec299df	Fix for regression: [Bug 20369] wrong code at -O3 on x86_64-linux-gnu in 64-bit mode Prevents hoisting of loads above stores and sinking of stores below loads in MergedLoadStoreMotion.cpp (rdar://15991737) llvm-svn: 213497	2014-07-21 03:02:46 +00:00
Alexey Bataev	4c904adf7c	[OPENMP] Added several test cases for clauses 'ordered' and 'nowait': if there are more than one 'nowait' or 'ordered' clause an error message is expected. llvm-svn: 213496	2014-07-21 02:45:36 +00:00
Ulrich Weigand	601957fa23	[PowerPC] Optimize passing certain aggregates by value In addition to enabling ELFv2 homogeneous aggregate handling, LLVM support to pass array types directly also enables a performance enhancement. We can now pass (non-homogeneous) aggregates that fit fully in registers as direct integer arrays, using an element type to encode the alignment requirement (that would otherwise go to the "byval align" field). This is preferable since "byval" forces the back-end to write the aggregate out to the stack, even if it could be passed fully in registers. This is particularly annoying on ELFv2, if there is no parameter save area available, since we then need to allocate space on the callee's stack just to hold those aggregates. Note that to implement this optimization, this patch does not attempt to fully anticipate register allocation rules as (defined in the ABI and) implemented in the back-end. Instead, the patch is simply passing any aggregate passed by value using the array mechanism if its size is up to 64 bytes. This means that some of those will end up being passed in stack slots anyway, but the generated code shouldn't be any worse either. (Large aggregates remain passed using "byval" to enable optimized copying via memcpy etc.) llvm-svn: 213495	2014-07-21 00:56:36 +00:00
Ulrich Weigand	b712237da6	[PowerPC] Support the ELFv2 ABI This patch implements clang support for the PowerPC ELFv2 ABI. Together with a series of companion patches in LLVM, this makes clang/LLVM fully usable on powerpc64le-linux. Most of the ELFv2 ABI changes are fully implemented on the LLVM side. On the clang side, we only need to implement some changes in how aggregate types are passed by value. Specifically, we need to: - pass (and return) "homogeneous" floating-point or vector aggregates in FPRs and VRs (this is similar to the ARM homogeneous aggregate ABI) - return aggregates of up to 16 bytes in one or two GPRs The second piece is trivial to implement in any case. To implement the first piece, this patch makes use of infrastructure recently enabled in the LLVM PowerPC back-end to support passing array types directly, where the array element type encodes properties needed to handle homogeneous aggregates correctly. Specifically, the array element type encodes: - whether the parameter should be passed in FPRs, VRs, or just GPRs/stack slots (for float / vector / integer element types, respectively) - what the alignment requirements of the parameter are when passed in GPRs/stack slots (8 for float / 16 for vector / the element type size for integer element types) -- this corresponds to the "byval align" field With this support in place, the clang part simply needs to detect whether an aggregate type implements a float / vector homogeneous aggregate as defined by the ELFv2 ABI, and if so, pass/return it as array type using the appropriate float / vector element type. llvm-svn: 213494	2014-07-21 00:48:09 +00:00
Ulrich Weigand	85d5df25de	[PowerPC] ELFv2 aggregate passing support This patch adds infrastructure support for passing array types directly. These can be used by the front-end to pass aggregate types (coerced to an appropriate array type). The details of the array type being used inform the back-end about ABI-relevant properties. Specifically, the array element type encodes: - whether the parameter should be passed in FPRs, VRs, or just GPRs/stack slots (for float / vector / integer element types, respectively) - what the alignment requirements of the parameter are when passed in GPRs/stack slots (8 for float / 16 for vector / the element type size for integer element types) -- this corresponds to the "byval align" field Using the infrastructure provided by this patch, a companion patch to clang will enable two features: - In the ELFv2 ABI, pass (and return) "homogeneous" floating-point or vector aggregates in FPRs and VRs (this is similar to the ARM homogeneous aggregate ABI) - As an optimization for both ELFv1 and ELFv2 ABIs, pass aggregates that fit fully in registers without using the "byval" mechanism The patch uses the functionArgumentNeedsConsecutiveRegisters callback to encode that special treatment is required for all directly-passed array types. The isInConsecutiveRegs / isInConsecutiveRegsLast bits set as a result are then used to implement the required size and alignment rules in CalculateStackSlotSize / CalculateStackSlotAlignment etc. As a related change, the ABI routines have to be modified to support passing floating-point types in GPRs. This is necessary because with homogeneous aggregates of 4-byte float type we can now run out of FPRs before we run out of the 64-byte argument save area that is shadowed by GPRs. Any extra floating-point arguments that no longer fit in FPRs must now be passed in GPRs until we run out of those too. Note that there was already code to pass floating-point arguments in GPRs used with vararg parameters, which was done by writing the argument out to the argument save area first and then reloading into GPRs. The patch re-implements this, however, in favor of code packing float arguments directly via extension/truncation, BITCAST, and BUILD_PAIR operations. This is required to support the ELFv2 ABI, since we cannot unconditionally write to the argument save area (which the caller might not have allocated). The change does, however, affect ELFv1 varargs routines too; but even here the overall effect should be advantageous: Instead of loading the argument into the FPR, then storing the argument to the stack slot, and finally reloading the argument from the stack slot into a GPR, the new code now just loads the argument into the FPR, and subsequently loads the argument into the GPR (via BITCAST). That BITCAST might imply a save/reload from a stack temporary (in which case we're no worse than before); but it might be implemented more efficiently in some cases. The final part of the patch enables up to 8 FPRs and VRs for argument return in PPCCallingConv.td; this is required to support returning ELFv2 homogeneous aggregates. (Note that this doesn't affect other ABIs since LLVM will only look for which register to use if the parameter is marked as "direct" return anyway.) Reviewed by Hal Finkel. llvm-svn: 213493	2014-07-21 00:13:26 +00:00

178910 Commits

All Branches