llvm-project

Commit Graph

Author	SHA1	Message	Date
Colin LeMahieu	a071a8e5b6	[Hexagon] PC-relative offsets are relative to packet start rather than the offset of the relocation. Set relocation addend and check it's correct in the ELF. llvm-svn: 239769	2015-06-15 21:52:13 +00:00
Simon Pilgrim	aa9f712967	[X86][SSE] Added tests for vector i8/i16 to f32/f64 conversions llvm-svn: 239767	2015-06-15 21:49:31 +00:00
Peter Collingbourne	82437bf7a5	Protection against stack-based memory corruption errors using SafeStack This patch adds the safe stack instrumentation pass to LLVM, which separates the program stack into a safe stack, which stores return addresses, register spills, and local variables that are statically verified to be accessed in a safe way, and the unsafe stack, which stores everything else. Such separation makes it much harder for an attacker to corrupt objects on the safe stack, including function pointers stored in spilled registers and return addresses. You can find more information about the safe stack, as well as other parts of our control-flow hijack protection technique in our OSDI paper on code-pointer integrity (http://dslab.epfl.ch/pubs/cpi.pdf) and our project website (http://levee.epfl.ch). The overhead of our implementation of the safe stack is very close to zero (0.01% on the Phoronix benchmarks). This is lower than the overhead of stack cookies, which are supported by LLVM and are commonly used today, yet the security guarantees of the safe stack are strictly stronger than stack cookies. In some cases, the safe stack improves performance due to better cache locality. Our current implementation of the safe stack is stable and robust, we used it to recompile multiple projects on Linux including Chromium, and we also recompiled the entire FreeBSD user-space system and more than 100 packages. We ran unit tests on the FreeBSD system and many of the packages and observed no errors caused by the safe stack. The safe stack is also fully binary compatible with non-instrumented code and can be applied to parts of a program selectively. This patch is our implementation of the safe stack on top of LLVM. The patches make the following changes: - Add the safestack function attribute, similar to the ssp, sspstrong and sspreq attributes. - Add the SafeStack instrumentation pass that applies the safe stack to all functions that have the safestack attribute. This pass moves all unsafe local variables to the unsafe stack with a separate stack pointer, whereas all safe variables remain on the regular stack that is managed by LLVM as usual. - Invoke the pass as the last stage before code generation (at the same time the existing cookie-based stack protector pass is invoked). - Add unit tests for the safe stack. Original patch by Volodymyr Kuznetsov and others at the Dependable Systems Lab at EPFL; updates and upstreaming by myself. Differential Revision: http://reviews.llvm.org/D6094 llvm-svn: 239761	2015-06-15 21:07:11 +00:00
Rafael Espindola	64a27fb801	Don't indent inside a namespace. NFC. llvm-svn: 239760	2015-06-15 21:04:27 +00:00
Rafael Espindola	6ace68554d	Replace @ with the more common \. NFC. llvm-svn: 239759	2015-06-15 21:02:49 +00:00
Rafael Espindola	cbdcb50554	Don't repeat names in comments and start functions with a lower case letter. llvm-svn: 239756	2015-06-15 20:55:37 +00:00
Alex Lorenz	735c47ec3e	MIR Serialization: Connect the machine function analysis pass to the MIR parser. This commit connects the machine function analysis pass (which creates machine functions) to the MIR parser, which will initialize the machine functions with the state from the MIR file and reconstruct the machine IR. This commit introduces a new interface called 'MachineFunctionInitializer', which can be used to provide custom initialization for the machine functions. This commit also introduces a new diagnostic class called 'DiagnosticInfoMIRParser' which is used for MIR parsing errors. This commit modifies the default diagnostic handling in LLVMContext - now the diagnostics are printed directly into llvm::errs() so that the MIR parsing errors can be printed with colours. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9928 llvm-svn: 239753	2015-06-15 20:30:22 +00:00
Eric Christopher	c30eae4567	Remove duplicate conditional in if-stmt. Fixes PR23839. llvm-svn: 239751	2015-06-15 20:16:53 +00:00
Rafael Espindola	4223a1f811	Cleanup the constructor of BitcodeReader. NFC. Use the same argument names as the members. Use default member initializers. Extracted from a patch by Karl Schimpf. llvm-svn: 239749	2015-06-15 20:08:17 +00:00
Sanjoy Das	784582f116	Add "REQUIRES: asserts" to test case that uses -debug-only llvm-svn: 239748	2015-06-15 20:05:38 +00:00
Sanjoy Das	5553bc8e45	Unbreak docs build from r239740. Add FaultMaps.rst to toctree. llvm-svn: 239747	2015-06-15 19:38:15 +00:00
Sanjoy Das	baeb678a91	Unbreak the build from r239740. Do not re-use an enum name as a field name. Some bots don't like this. llvm-svn: 239746	2015-06-15 19:29:44 +00:00
Colin LeMahieu	56efafc056	[Hexagon] Moving pass declarations out of header and in to implementation files. Removing unused function getSubtargetInfo from HexagonMCCodeEmitter.cpp Removing deletion of copy construction and assignment operator since parent already deletes it. llvm-svn: 239744	2015-06-15 19:05:35 +00:00
Sanjoy Das	69fad0799e	[CodeGen] Add a pass to fold null checks into nearby memory operations. Summary: This change adds an "ImplicitNullChecks" target dependent pass. This pass folds null checks into memory operations using the FAULTING_LOAD pseudo-op introduced in previous patches. Depends on D10197 Depends on D10199 Depends on D10200 Reviewers: reames, rnk, pgavlin, JosephTremoulet, atrick Reviewed By: atrick Subscribers: ab, JosephTremoulet, llvm-commits Differential Revision: http://reviews.llvm.org/D10201 llvm-svn: 239743	2015-06-15 18:44:27 +00:00
Sanjoy Das	6b34a46298	[TargetInstrInfo] Add new hook: AnalyzeBranchPredicate. Summary: NFC: no one uses AnalyzeBranchPredicate yet. Add TargetInstrInfo::AnalyzeBranchPredicate and implement for x86. A later change adding support for page-fault based implicit null checks depends on this. Reviewers: reames, ab, atrick Reviewed By: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10200 llvm-svn: 239742	2015-06-15 18:44:21 +00:00
Sanjoy Das	b666ea369c	[TargetInstrInfo] Rename getLdStBaseRegImmOfs and implement for x86. Summary: Rename TargetInstrInfo::getLdStBaseRegImmOfs to TargetInstrInfo::getMemOpBaseRegImmOfs and implement for x86. The implementation only handles a few easy cases now and will be made more sophisticated in the future. This is NFCI: the only user of `getLdStBaseRegImmOfs` (now `getMemOpBaseRegImmOfs`) is `LoadClusterMotion` and `LoadClusterMotion` is disabled for x86. Reviewers: reames, ab, MatzeB, atrick Reviewed By: MatzeB, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10199 llvm-svn: 239741	2015-06-15 18:44:14 +00:00
Sanjoy Das	c63244daa1	[CodeGen] Introduce a FAULTING_LOAD_OP pseudo-op. Summary: This instruction encodes a loading operation that may fault, and a label to branch to if the load page-faults. The locations of potentially faulting loads and their "handler" destinations are recorded in a FaultMap section, meant to be consumed by LLVM's clients. Nothing generates FAULTING_LOAD_OP instructions yet, but they will be used in a future change. The documentation (FaultMaps.rst) needs improvement and I will update this diff with a more expanded version shortly. Depends on D10196 Reviewers: rnk, reames, AndyAyers, ab, atrick, pgavlin Reviewed By: atrick, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10197 llvm-svn: 239740	2015-06-15 18:44:08 +00:00
Sanjoy Das	2d869b230b	[NFC] Extract X86MCInstLower::LowerMachineOperand. Summary: Refactoring-only change that will be used later. Reviewers: reames, atrick Reviewed By: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10196 llvm-svn: 239739	2015-06-15 18:44:01 +00:00
Yaron Keren	43b4d38944	De-duplicate common expression, NFC. llvm-svn: 239736	2015-06-15 17:03:35 +00:00
Yaron Keren	3bf3f1f5b9	Rangify several for loops, NFC. llvm-svn: 239733	2015-06-15 16:20:16 +00:00
Evgeny Astigeevich	ff1f4be4c7	On behalf of Alexandros Lamprineas: LLVM targeting aarch64 doesn't correctly produce aligned accesses for non-aligned data at -O0/fast-isel (-mno-unaligned-access). The root cause seems to be in fast-isel not producing unaligned access correctly for -mno-unaligned-access. The patch just aborts fast-isel for loads and stores when -mno-unaligned-access is present. The regression test is updated to check this new test case (-mno-unaligned-access together with fast-isel). Differential Revision: http://reviews.llvm.org/D10360 llvm-svn: 239732	2015-06-15 15:48:44 +00:00
Benjamin Kramer	f1d570d4c5	[LinkerTest] Use LLVMDisposeMessage to free error string. LLVMDisposeMessage is just a thin wrapper around free at the moment, but it's the proper API to use here. llvm-svn: 239731	2015-06-15 15:42:26 +00:00
Rafael Espindola	063584faef	Avoid a "always true" warning from gcc. llvm-svn: 239729	2015-06-15 14:49:41 +00:00
Rafael Espindola	92200d237a	gold-plugin: save the .o when given -save-temps. The plugin now saves the bitcode before and after optimizations and the .o that is passed to the linker. llvm-svn: 239726	2015-06-15 13:36:27 +00:00
Daniel Sanders	fa555dc7f8	Revert r239721 - Replace string GNU Triples with llvm::Triple in InitMCObjectFileInfo. NFC. It appears to cause sparc-little-endian.s to assert on Windows and Darwin. llvm-svn: 239724	2015-06-15 10:34:38 +00:00
Daniel Sanders	d6d12a1192	Replace string GNU Triples with llvm::Triple in InitMCObjectFileInfo. NFC. Summary: This affects other tools so the previous C++ API has been retained as a deprecated function for the moment. Clang has been updated with a trivial patch (not covered by the pre-commit review) to avoid breaking -Werror builds. Other in-tree tools will be fixed with similar trivial patches. This continues the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. Reviewers: rengolin Reviewed By: rengolin Subscribers: llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10366 llvm-svn: 239721	2015-06-15 09:19:41 +00:00
Arnaud A. de Grandmaison	d8673edc2d	[MachineSink] Improve runtime performance. NFC. This patch fixes a compilation time issue, when MachineSink faces PHIs with a huge number of operands. This can happen for example in goto table based interpreters, where some basic blocks can have several of those PHIs, each one with several hundred operands. MachineSink was spending a significant time re-building and re-sorting the list of successors of the current MachineBasicBlock. The computing and sorting of the current MachineBasicBlock successors is now cached. llvm-svn: 239720	2015-06-15 09:09:06 +00:00
Jingyue Wu	12b0c2835e	[ValueTracking] do not overwrite analysis results already computed Summary: ValueTracking used to overwrite the analysis results computed from assumes and dominating conditions. This patch fixes this issue. Test Plan: test/Analysis/ValueTracking/assume.ll Reviewers: hfinkel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10283 llvm-svn: 239718	2015-06-15 05:46:29 +00:00
Rui Ueyama	55144e2423	[Support][Endian] Define |= and &= for u{big,little}{16,32,64}_t. llvm-svn: 239716	2015-06-15 03:00:15 +00:00
Hao Liu	1c2e89a57a	[AArch64] Delete two empty files, which should be removed by r239713. llvm-svn: 239715	2015-06-15 02:56:40 +00:00
Hao Liu	d0ca8d7edd	[AArch64] Revert r239711 again. We need to discuss how to share code between AArch64 and ARM backend. llvm-svn: 239713	2015-06-15 01:56:40 +00:00
Hao Liu	cb070e3833	[AArch64] Match interleaved memory accesses into ldN/stN instructions. Re-commit after adding "-aarch64-neon-syntax=generic" to fix the failure on OS X. This patch was first committed in r239514, then reverted in r239544 because of a syntax incompatibility failure on OS X. llvm-svn: 239711	2015-06-15 01:35:49 +00:00
NAKAMURA Takumi	ec782b4cf2	[CMake] Try to fix r239612, not to miss resources/windows_version_resource.rc in clang build. - Who defines ${LLVM_SOURCE_DIR} ? - Would windows_version_resource.rc be available in an installed llvm tree? I suggest it may be installed in ${PREFIX}/share. llvm-svn: 239703	2015-06-14 21:47:29 +00:00
Benjamin Kramer	228680ded8	[InstSimplify] fsub nnan x, x -> 0.0 is valid without ninf Both inf - inf and (-inf) - (-inf) are NaN, so it's already covered by nnan. llvm-svn: 239702	2015-06-14 21:01:20 +00:00
Benjamin Kramer	4f0524614e	[InstSimplify] Add self-fdiv identities for -ffinite-math-only. When NaNs and Infs are ignored we can fold: X / X -> 1.0; -X / X -> -1.0; X / -X -> -1.0. llvm-svn: 239701	2015-06-14 18:53:58 +00:00
Igor Breger	5e49697138	AVX-512: Implemented DAG lowering for shuff64x2/shufi64x2 instructions (Shuffle Packed Values at 128-bit Granularity). Tests added, vector-shuffle-512-v8.ll test re-generated. Differential Revision: http://reviews.llvm.org/D10300 llvm-svn: 239697	2015-06-14 13:07:47 +00:00
Michael Kuperstein	e3de07a529	Add support for parsing the XOR operator in Intel syntax inline assembly. Differential Revision: http://reviews.llvm.org/D10385 Patch by marina.yatsina@intel.com llvm-svn: 239695	2015-06-14 12:59:45 +00:00
Igor Breger	abe4a79b75	AVX-512: Implemented cvtsi2ss/d cvtusi2ss/d instructions with round control for KNL. Added intrinsics for cvtsi2ss/d instructions. Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D10430 llvm-svn: 239694	2015-06-14 12:44:55 +00:00
NAKAMURA Takumi	a6a250a211	AsmPrinter.cpp: Avoid crashes for targeting like "arm-mingw32". CurrentFnSym might not be <MCSymbolELF> here. llvm-svn: 239692	2015-06-14 00:23:40 +00:00
NAKAMURA Takumi	bf6ad02906	Reformat. llvm-svn: 239691	2015-06-14 00:23:33 +00:00
Colin LeMahieu	b8575b14be	[Hexagon] Adding some codegen tests and updating some to match spec. llvm-svn: 239690	2015-06-13 21:46:39 +00:00
Benjamin Kramer	258ea0dbdf	[Statepoints] Skip a vector copy when uniquing values. No functionality change intended. llvm-svn: 239688	2015-06-13 19:50:38 +00:00
Benjamin Kramer	bd7b1c89fc	[ExecutionEngine] ArrayRefize argument passing. No functionality change intended. llvm-svn: 239687	2015-06-13 19:50:29 +00:00
Yaron Keren	f3cf9d1f6e	C++11 Rangify loops in AssemblyWriter::printModule, NFC. llvm-svn: 239686	2015-06-13 17:50:47 +00:00
Rafael Espindola	74f293249d	Don't use std::errc. As noted on Errc.h: // * std::errc is just marked with is_error_condition_enum. This means that // common patters like AnErrorCode == errc::no_such_file_or_directory take // 4 virtual calls instead of two comparisons. And on some libstdc++ those virtual functions conclude that ------------------------ int main() { std::error_code foo = std::make_error_code(std::errc::no_such_file_or_directory); return foo == std::errc::no_such_file_or_directory; } ------------------------- should exit with 0. llvm-svn: 239683	2015-06-13 17:23:04 +00:00
Simon Pilgrim	d3f6427446	[DAGCombiner] Added BSWAP(BSWAP(x)) -> x combine pattern. llvm-svn: 239682	2015-06-13 16:25:12 +00:00
Sanjay Patel	5714998484	hoist loop-invariant; NFCI llvm-svn: 239681	2015-06-13 15:33:15 +00:00
Sanjay Patel	41044f8859	remove function names from comments and clean up; NFC llvm-svn: 239680	2015-06-13 15:32:45 +00:00
Simon Pilgrim	2c35e7a264	[SelectionDAG] Added assertions + UNDEF handling for BSWAP node creation. llvm-svn: 239679	2015-06-13 15:23:58 +00:00
Sanjay Patel	85924e5bf3	remove unnecessary casts; NFCI llvm-svn: 239678	2015-06-13 15:06:33 +00:00
Simon Pilgrim	011381d48b	[DAGCombiner] Added BSWAP vector constant folding support. llvm-svn: 239675	2015-06-13 14:08:15 +00:00
Simon Pilgrim	096cccd01a	Stripped trailing whitespace. NFC. llvm-svn: 239674	2015-06-13 12:57:36 +00:00
Benjamin Kramer	a4b87dbd8d	[LinkerTest] Don't leak error string. llvm-svn: 239673	2015-06-13 12:53:21 +00:00
Simon Pilgrim	a6f44a18f8	Stripped trailing whitespace. NFC. llvm-svn: 239672	2015-06-13 12:51:39 +00:00
Rafael Espindola	454adf6454	Bring in a BumpPtrStringSaver from lld and simplify the interface. StringSaver now always saves to a BumpPtrAllocator. The only reason for having the virtual saveImpl is so lld can have a thread safe version. The reason for the distinct BumpPtrStringSaver class is to avoid the virtual destructor. llvm-svn: 239669	2015-06-13 12:49:52 +00:00
Eric Fiselier	8fcf50515b	[LIT] Fix failing LIT tests Summary: I spent some time trying to get the LIT test suite passing. Here are the changes that I needed to make on my machine. I made the following changes for the following reasons. 1. google-test.py: The Google test format now checks for "[ PASSED ] 1 test." to check if a test passes. 2. discovery.py: The output appears in a different order on my machine than it did in the test. 3. unittest-adaptor.py: The output appears in a different order on my machine than it did in the test. 4. The classname is now formed differently in `getJUnitXML(...)`. I'm not sure what is causing the output order to differ in discovery.py and unittest-adaptor.py. Does anybody have any thoughts? Reviewers: ddunbar, danalbert, jroelofs Reviewed By: jroelofs Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9864 llvm-svn: 239663	2015-06-13 06:55:44 +00:00
Tom Stellard	104ad064df	AMDGPU: s/R600/AMDGPU/ in the Makefiles Now the library names in the Makefiles match the library names in LLVMBuild.txt. This should hopefully fix the remaining bot failures. llvm-svn: 239661	2015-06-13 05:11:14 +00:00
Tom Stellard	3e79bb74e2	configure: Remove non-portable fall-through operator: ;& This was added in r239657. llvm-svn: 239660	2015-06-13 03:46:48 +00:00
Matthias Braun	39a2afc941	Rename TargetSubtargetInfo::enablePostMachineScheduler() to enablePostRAScheduler() r213101 changed the behaviour of this method to not only affect the PostMachineScheduler scheduler but also the PostRAScheduler scheduler, renaming should make this fact clear. Also document that the preferred way is to specify this in the scheduling model instead of overriding this method. Differential Revision: http://reviews.llvm.org/D10427 llvm-svn: 239659	2015-06-13 03:42:16 +00:00
Matthias Braun	88e213159a	MachineLICM: Use TargetSchedModel instead of just itineraries This will use itineraries if available, but will also work if just an MCSchedModel is available. Differential Revision: http://reviews.llvm.org/D10428 llvm-svn: 239658	2015-06-13 03:42:11 +00:00
Tom Stellard	45bb48ea19	R600 -> AMDGPU rename llvm-svn: 239657	2015-06-13 03:28:10 +00:00
Matt Wala	bfb5368cc7	Revert 239644. llvm-svn: 239650	2015-06-13 01:08:00 +00:00
Tim Northover	02cfdbb7f1	AArch64: map bare-metal arm64-macho triple to MachO MC layer. Far better than an assertion about expecting ELF. llvm-svn: 239647	2015-06-12 23:37:11 +00:00
Eli Bendersky	ff715e2d5e	Fix returning error message in LLVMLinkModules On error, the temporary output stream wouldn't be flushed and therefore the caller would see an empty error message. Patch by Antoine Pitrou Differential Revision: http://reviews.llvm.org/D10241 llvm-svn: 239646	2015-06-12 23:26:42 +00:00
Lang Hames	37cc9fadd5	[Orc] Tidy up initialization based on review feedback for r239561 from dblaikie. NFC. llvm-svn: 239645	2015-06-12 23:13:06 +00:00
Matt Wala	1f48192d7c	[Scalarizer] Fix potential for stale data in Scattered across invocations Summary: Scalarizer has two data structures that hold information about changes to the function, Gathered and Scattered. These are cleared in finish() at the end of runOnFunction() if finish() detects any changes to the function. However, finish() was checking for changes by only checking if Gathered was non-empty. The function visitStore() only modifies Scattered without touching Gathered. As a result, Scattered could have ended up having stale data if Scalarizer only scalarized store instructions. Since the data in Scattered is used during the execution of the pass, this introduced dangling pointer errors. The fix is to check whether both Scattered and Gathered are empty before deciding what to do in finish(). Reviewers: srhines Reviewed By: srhines Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10422 llvm-svn: 239644	2015-06-12 22:49:11 +00:00
Lang Hames	913f8adc25	[Orc] Tidy up the CompileOnDemand layer based on commit review from dblaikie. NFC. llvm-svn: 239642	2015-06-12 22:22:50 +00:00
Lang Hames	2e36ddf671	[Orc] Fix a bug in the CompileOnDemand layer where stub decls were not cloned into partitions. Also, add an option to clone stub definitions (not just decls) into partitions: these definitions could be inlined in some places to avoid the overhead of calling via the stub. Found by inspection - no test case yet, although I plan to add a unit test for this once the CompileOnDemand layer refactoring settles down. llvm-svn: 239640	2015-06-12 21:31:15 +00:00
Tom Stellard	12a1910e87	R600/SI: Add assembler support for FLAT instructions - Add glc, slc, and tfe operands to flat instructions - Add missing flat instructions - Fix the encoding of flat_load_dwordx3 and flat_store_dwordx3. llvm-svn: 239637	2015-06-12 20:47:06 +00:00
Yaron Keren	4c20debe3c	Rangify several for loops in ValueEnumerator constructor. llvm-svn: 239636	2015-06-12 20:18:20 +00:00
Colin LeMahieu	79ec06525e	[Hexagon] Making intrinsic tests agnostic to register allocation. Narrowing intrinsic parameters to appropriate width. llvm-svn: 239634	2015-06-12 19:57:32 +00:00
Douglas Katzman	8f01f1cfc3	Wrap some long lines in LLVMBuild files. NFC As suggested by jroelofs in a prior review (D9752), it makes sense to generally prefer multi-line format. llvm-svn: 239632	2015-06-12 18:44:57 +00:00
Douglas Katzman	1b5767f72b	Add 'shave' processor name to Triple Based on ArchType, Clang's driver can select a non-Clang compiler. String parsing in Clang would have sufficed if it were only that, however this change anticipates true llvm support. Differential Revision: http://reviews.llvm.org/D10413 llvm-svn: 239631	2015-06-12 18:31:38 +00:00
David Blaikie	473b943ea8	Refix a use of explicit pointer types in GEP constant folding In the glorious future of opaque pointer types, it won't be possible to retrieve the pointee type of a pointer type which is what's being done in this GEP loop - but the first iteration is always a pointer type and the loop doesn't care about that case, except whether or not the index is a constant. So pull that special case out before the loop and start at the second iteration (index 1) instead. Originally committed in r236670 and reverted with a test case in r239015. This change keeps the test case working while also avoiding depending on pointee types. llvm-svn: 239629	2015-06-12 18:22:03 +00:00
Matt Wala	a4afccd8a8	Fix a typo in a comment in MemCpyOpt (test commit) llvm-svn: 239628	2015-06-12 18:16:51 +00:00
Yaron Keren	ef5e7addb3	Rangify two for loops in BitcodeReader.cpp. llvm-svn: 239627	2015-06-12 18:13:20 +00:00
Pete Cooper	83a930c80b	Remove unnecessary MCExpr.h include from MCSymbol.h MCSymbol.h already forwards declares MCExpr and only uses MCExpr* so doesn't need to include the header. llvm-svn: 239626	2015-06-12 18:07:34 +00:00
Pete Cooper	255d117d43	Remove a bunch of inline keywords from User. NFC. This came up in the patch review for http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150608/281362.html. llvm-svn: 239624	2015-06-12 17:48:21 +00:00
Pete Cooper	b676b01b84	Move OperandList to be allocated prior to User for hung off subclasses. For hung off uses, we need a Use* to tell us where the operands are. This was User::OperandList but we want to remove that to save space of all subclasses which aren't making use of 'hung off uses'. Hung off uses now allocate their own 'OperandList' Use* in the User::new which they call. getOperandList() now uses the hung off uses bit to work out where the Use* for the OperandList lives. If a User has hung off uses, then this bit tells them to go back a single Use* from the User* and use that value as the OperandList. If a User has no hung off uses, then we get the first operand by subtracting (NumOperands * sizeof(Use)) from the User this pointer. This saves a pointer from User and all subclasses. Given the average size of a subclass of User is 112 or 128 bytes, this saves around 7% of space. With malloc tending to align to 16-bytes the real saving is typically more like 3.5%. On 'opt -O2 verify-uselistorder.lto.bc', peak memory usage prior to this change is 149MB and after is 143MB so the savings are around 2.5% of peak. Looking at some passes which allocate many Instructions and Values, parseIR drops from 54.25MB to 52.21MB while the Inliner calls to Instruction::clone() drops from 28.20MB to 27.05MB. Reviewed by Duncan Exon Smith. llvm-svn: 239623	2015-06-12 17:48:18 +00:00
Pete Cooper	c91fda3b10	Added a version of User::new for hung off uses. There are now 2 versions of User::new. The first takes a size_t and is the current implementation for subclasses which need 0 or more Use's allocated for their operands. The new version takes no extra arguments to say that this subclass needs 'hung off uses'. The HungOffUses bool is now set in this version of User::new and we can assert in allocHungOffUses that we are allowed to have hung off uses. This ensures we call the correct version of User::new for subclasses which need hung off uses. A future commit will then allocate space for a single Use* which will be used in place of User::OperandList once that field has been removed. Reviewed by Duncan Exon Smith. llvm-svn: 239622	2015-06-12 17:48:14 +00:00
Pete Cooper	b4eede2c07	Rename NumOperands to make it clear it's managed by the User. NFC. This is to try to make it very clear that subclasses shouldn't be changing the value directly. Now that OperandList for normal instructions is computed using the NumOperands, it's critical that the NumOperands is accurate or we could compute the wrong offset to the first operand. I looked over all places which update NumOperands and they are all safe. Hung off use User's don't use NumOperands to compute the OperandList so they are safe to continue to manipulate it. The only other User which changed it was GlobalVariable which has an optional init list but always allocated space for a single Use. It was correctly setting NumOperands to 1 before setting an initializer, and setting it to 0 after clearing the init list, so the order was safe. Added some comments to that code to make sure that this isn't changed in future without being aware of this constraint. Reviewed by Duncan Exon Smith. llvm-svn: 239621	2015-06-12 17:48:10 +00:00
Pete Cooper	74510a409d	Replace all accesses to User::OperandList with getter and setter methods. NFC. We don't want anyone to access OperandList directly as it's going to be removed and computed instead. This uses getters and setters instead in which we can later change the underlying implementation of OperandList. Reviewed by Duncan Exon Smith. llvm-svn: 239620	2015-06-12 17:48:05 +00:00
Rafael Espindola	c74ac023d8	Have the ELF symbol predicates match more directly the spec. The underlying issue is that this code can't really know if an OS specific or processor specific section number should return true or false. One option would be to assert or return an error, but that looks like over engineering since extensions are not that common. It seems better to have these be a direct implementation of the ELF spec so that they are natural for someone familiar with ELF reading the code. Code that does have to handle OS/Architecture specific values can do it at a higher level. llvm-svn: 239618	2015-06-12 17:23:39 +00:00
Pete Cooper	3664253c52	Don't create instructions from ConstantExpr's in CFLAliasAnalysis. The CFLAA code currently calls ConstantExpr::getAsInstruction which creates an instruction from a constant expr. We then pass that instruction to the InstVisitor to analyze it. It's not necessary to create these instructions as we can just cast from Constant to Operator in the visitor. This is how other InstVisitors such as SelectionDAGBuilder handle ConstantExpr. llvm-svn: 239616	2015-06-12 16:13:54 +00:00
Greg Bedwell	95213c31f3	In MSVC builds embed a VERSIONINFO resource in our exe and DLL files. This reinstates my commits r238740/r238741 which I reverted due to a failure in the clang-cl selfhost tests on Windows. I've now fixed the issue in clang-cl that caused the failure so hopefully all should be well now. llvm-svn: 239612	2015-06-12 15:58:29 +00:00
Rafael Espindola	0b9319edb0	Remove a hack that tries to align '*'. The alignment is not required, so we can just remove it for now. The old code is a hack as it depends on the buffer management to find the current column. If the alignment is really desirable, the proper way to do it is to pass in a formatted_raw_stream that knows the current column. llvm-svn: 239603	2015-06-12 12:42:13 +00:00
Rafael Espindola	de28b7375f	Don't depend on the interleaving of stdout and stderr. That can change as we change the buffering. llvm-svn: 239602	2015-06-12 12:20:03 +00:00
Alexander Potapenko	f90556efb8	[ASan] format AddressSanitizer.cpp with `clang-format -style=Google`, NFC llvm-svn: 239601	2015-06-12 11:27:06 +00:00
John Brawn	d9e39d53b6	[ARM] Disabling vfp4 should disable fp16 ARMTargetParser::getFPUFeatures should disable fp16 whenever it disables vfp4, as otherwise something like -mcpu=cortex-a7 -mfpu=none leaves us with fp16 enabled (though the only effect that will have is a wrong build attribute). Differential Revision: http://reviews.llvm.org/D10397 llvm-svn: 239599	2015-06-12 09:38:51 +00:00
Yaron Keren	b5a87b256d	Replace duplicated iplist<T> types with the corresponding typedefs. llvm-svn: 239598	2015-06-12 08:19:32 +00:00
Yaron Keren	26ceb0845b	Rangify for loops, NFC. llvm-svn: 239596	2015-06-12 05:15:27 +00:00
Peter Collingbourne	005354b1f4	LowerBitSets: Give names to aliases of unnamed bitset element objects. It is valid for globals to be unnamed, but aliases must have a name. To avoid creating invalid IR, we need to assign names to any aliases we create that point to unnamed objects that have been moved into combined globals. llvm-svn: 239590	2015-06-12 03:25:05 +00:00
Teresa Johnson	43a65d9529	Revert commit r239480 as it causes https://code.google.com/p/chromium/issues/detail?id=499508#c3 . llvm-svn: 239589	2015-06-12 03:12:00 +00:00
Richard Smith	11e14ec195	Add missing #include, found by modules build. llvm-svn: 239587	2015-06-12 02:13:45 +00:00
Alexey Samsonov	201733b7f0	[SanitizerCoverage] Use llvm::getDISubprogram() to get location of the entry basic block. DebugLoc::getFnDebugLoc() should soon be removed. Also, getDISubprogram() might become more effective soon and wouldn't need to scan debug locations at all, if function-level metadata would be emitted by Clang. llvm-svn: 239586	2015-06-12 01:48:47 +00:00
Alexey Samsonov	9947e48cd1	[GVN] Use a simpler form of IRBuilder constructor. Summary: A side effect of this change is that IRBuilder now automatically creates debug info locations for new instructions, which is the same as debug location of insertion point. This is fine for the functions in question (GetStoreValueForLoad and GetMemInstValueForLoad), as they are used in two situations: * GVN::processLoad, which tries to eliminate a load. In this case new instructions would have the same debug location as the load they eventually replace; * MaterializeAdjustedValue, which adds new instructions to the end of the basic blocks, which could later be used to replace the load definition. In this case we don't yet know the way the load would be eventually replaced (either by assembling the precomputed values via PHI, or by using them directly), so just using the basic block strategy seems to be reasonable. There is also a special case in the code that would adjust the location of the last instruction replacing the load definition to the location of the load. Test Plan: regression test suite Reviewers: echristo, dberlin, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10405 llvm-svn: 239585	2015-06-12 01:39:48 +00:00
Alexey Samsonov	ff449802c2	[GVN] Use IRBuilder more actively instead of creating instructions manually. llvm-svn: 239584	2015-06-12 01:39:45 +00:00
Reid Kleckner	81d1cc00b7	[WinEH] Put finally pointers in the handler scope table field We were putting them in the filter field, which is correct for 64-bit but wrong for 32-bit. Also switch the order of scope table entry emission so outermost entries are emitted first, and fix an obvious state assignment bug. llvm-svn: 239574	2015-06-11 23:37:18 +00:00
NAKAMURA Takumi	c974a9e50d	MC: Prune \return corresponding to r239552. [-Wdocumentation] llvm-svn: 239571	2015-06-11 23:04:56 +00:00
Lang Hames	af30e78357	[Orc] Attempted fix for GCC ICE on Polly builder. Along the same lines as the fix in r228568. llvm-svn: 239570	2015-06-11 22:51:01 +00:00
Juergen Ributzka	03cb0d8b46	[Stackmaps][X86] Remove EFLAGS and IP registers from the live-out mask. Remove the EFLAGS from the stackmap live-out mask. The EFLAGS register is not supposed to be part of that set, because the X86 calling conventions mark the register as NOT preserved. Also remove the IP registers, since spilling and restoring those doesn't really make any sense. Related to rdar://problem/21019635. llvm-svn: 239568	2015-06-11 22:40:04 +00:00
Reid Kleckner	a9d6253572	[WinEH] Create an llvm.x86.seh.exceptioninfo intrinsic This intrinsic is like framerecover plus a load. It recovers the EH registration stack allocation from the parent frame and loads the exception information field out of it, giving back a pointer to an EXCEPTION_POINTERS struct. It's designed for clang to use in SEH filter expressions instead of accessing the EXCEPTION_POINTERS parameter that is available on x64. This required a minor change to MC to allow defining a label variable to another absolute framerecover label variable. llvm-svn: 239567	2015-06-11 22:32:23 +00:00
Reid Kleckner	6bb26dafa4	[Support] Fix a race initializing a static local in MSVC static local initialization isn't thread safe with MSVC and a race was reported in PR23817. We can't use std::atomic because it's not trivially constructible, so instead do some lame volatile global integer manipulation. llvm-svn: 239566	2015-06-11 22:22:45 +00:00
Michael Zolotukhin	c4e4f33e29	Update stale comment before analyzeLoopUnrollCost. NFC. llvm-svn: 239565	2015-06-11 22:17:39 +00:00
Lang Hames	3f9960a969	[Orc] Remove some unnecessary includes and whitespace that slipped in to r239561. NFC. llvm-svn: 239564	2015-06-11 22:12:24 +00:00
Lang Hames	6a14edd914	[Orc] Make partition identification in the CompileOnDemand layer lazy. This also breaks out the logical dylib symbol resolution logic. llvm-svn: 239561	2015-06-11 21:45:19 +00:00
Peter Collingbourne	82e657b509	Object: Prepend __imp_ when mangling a dllimport symbol in IRObjectFile. We cannot prepend __imp_ in the IR mangler because a function reference may be emitted unmangled in a constant initializer. The linker is expected to resolve such references to thunks. This is covered by the new test case. Strictly speaking we ought to emit two undefined symbols, one with __imp_ and one without, as we cannot know which symbol the final object file will refer to. However, this would require rather intrusive changes to IRObjectFile, and lld works fine without it for now. This reimplements r239437, which was reverted in r239502. Differential Revision: http://reviews.llvm.org/D10400 llvm-svn: 239560	2015-06-11 21:42:18 +00:00
Peter Collingbourne	485ad4860e	LTO: expose LTO_SYMBOL_COMDAT flag, which indicates that the definition is part of a comdat group. Reviewers: rafael Subscribers: llvm-commits, ruiu Differential Revision: http://reviews.llvm.org/D10330 llvm-svn: 239559	2015-06-11 21:41:27 +00:00
Douglas Katzman	3a547f15ae	Fix English usage in command line flag help string. llvm-svn: 239556	2015-06-11 20:03:23 +00:00
Davide Italiano	9306198c07	[ELF] Introduce getValue() for ELF Symbols. Differential Revision: http://reviews.llvm.org/D10328 Reviewed by: rafael llvm-svn: 239555	2015-06-11 19:59:04 +00:00
Daniel Sanders	3e5de88dac	Replace string GNU Triples with llvm::Triple in TargetMachine. NFC. Summary: For the moment, TargetMachine::getTargetTriple() still returns a StringRef. This continues the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. Reviewers: rengolin Reviewed By: rengolin Subscribers: ted, llvm-commits, rengolin, jholewinski Differential Revision: http://reviews.llvm.org/D10362 llvm-svn: 239554	2015-06-11 19:41:26 +00:00
Ahmed Bougacha	c88bf54366	[CodeGen] ArrayRef'ize cond/pred in various TII APIs. NFC. llvm-svn: 239553	2015-06-11 19:30:37 +00:00
Rafael Espindola	7c6e6e49cc	Generalize emitAbsoluteSymbolDiff. This makes emitAbsoluteSymbolDiff always succeed and moves logic from the asm printer to it. The object one now also works on ELF. If two symbols are in the same fragment, we will never move them apart. llvm-svn: 239552	2015-06-11 18:58:08 +00:00
Alexey Samsonov	770f65ca6a	Set proper debug location for branch added in BasicBlock::splitBasicBlock(). This improves debug locations in passes that do a lot of basic block transformations. Important case is LoopUnroll pass, the test for correct debug locations accompanies this change. Test Plan: regression test suite Reviewers: dblaikie, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10367 llvm-svn: 239551	2015-06-11 18:25:54 +00:00
Alexey Samsonov	ea20199b48	[LoopUnroll] Use IRBuilder to create branch instructions. Use IRBuilder::Create(Cond)?Br instead of constructing instructions manually with BranchInst::Create(). It's consistent with other uses of IRBuilder in this pass, and has an additional important benefit: Using IRBuilder will ensure that new branch instruction will get the same debug location as original terminator instruction it will eventually replace. For now I'm not adding a testcase, as currently original terminator instruction also lack debug location due to missing debug location propagation in BasicBlock::splitBasicBlock. That is, the testcase will accompany the fix for the latter I'm going to mail soon. llvm-svn: 239550	2015-06-11 18:25:44 +00:00
Benjamin Kramer	2d221406fa	Replace an instance of custom atomics with standard ones. Eventually I want to get rid of them entirely, but Statistic.h is still blocked on MSVC bugs. No functionality change. llvm-svn: 239545	2015-06-11 17:30:34 +00:00
Rafael Espindola	65d37e64a9	This reverts commit r239529 and r239514. Revert "[AArch64] Match interleaved memory accesses into ldN/stN instructions." Revert "Fixing MSVC 2013 build error." The test/CodeGen/AArch64/aarch64-interleaved-accesses.ll test was failing on OS X. llvm-svn: 239544	2015-06-11 17:30:33 +00:00
Reid Kleckner	2691c59e97	Revert "Fix merges of non-zero vector stores" This reverts commit r239539. It was causing SDAG assertions while building freetype. llvm-svn: 239543	2015-06-11 17:25:24 +00:00
Douglas Katzman	3e2e36b83c	Fix comment typos. llvm-svn: 239541	2015-06-11 16:46:27 +00:00
Matt Arsenault	91f90e694f	SLSR: Pass address space to isLegalAddressingMode This only updates one of the uses. The other is used in cases that may never touch memory, so I'm not sure why this is even calling it. Perhaps there should be a new, similar hook for such cases or pass -1 for unknown address space. llvm-svn: 239540	2015-06-11 16:13:39 +00:00
Matt Arsenault	e23a063dc3	Fix merges of non-zero vector stores Now actually stores the non-zero constant instead of 0. I somehow forgot to include this part of r238108. The test change was just an independent instruction order swap, so just add another check line to satisfy CHECK-NEXT. llvm-svn: 239539	2015-06-11 16:03:52 +00:00
Daniel Sanders	ed64d62c70	Replace string GNU Triples with llvm::Triple in computeDataLayout(). NFC. Summary: This continues the patch series to eliminate StringRef forms of GNU triples from the internals of LLVM that began in r239036. Reviewers: rengolin Reviewed By: rengolin Subscribers: llvm-commits, jfb, rengolin Differential Revision: http://reviews.llvm.org/D10361 llvm-svn: 239538	2015-06-11 15:34:59 +00:00
Tom Stellard	076ac95e79	R600/SI: Define latency for flat instructions llvm-svn: 239535	2015-06-11 14:51:50 +00:00
Tom Stellard	731c927839	R600/SI: Move flat instruction defs to CIInstructions.td llvm-svn: 239534	2015-06-11 14:51:49 +00:00
Tom Stellard	53e015f37d	R600/SI: Add -mcpu=bonaire to a test that uses flat address space Flat instructions don't exist on SI, but there is a bug in the backend that allows them to be selected. llvm-svn: 239533	2015-06-11 14:51:46 +00:00
Sanjay Patel	8b2150efdb	remove function names from comments; NFC llvm-svn: 239532	2015-06-11 14:26:49 +00:00
Aaron Ballman	b6b58b3152	Fixing MSVC 2013 build error. llvm-svn: 239529	2015-06-11 13:06:02 +00:00
Toma Tabacu	e1e460dbc5	Recommit "[mips] [IAS] Add support for BNE and BEQ with an immediate operand." (r239396). Apparently, Arcanist didn't include some of my local changes in my previous commit attempt. llvm-svn: 239523	2015-06-11 10:36:10 +00:00
Zoran Jovanovic	cdfcbe41f2	[mips][microMIPS] Implement ERET and ERETNC instructions http://reviews.llvm.org/D10091 llvm-svn: 239522	2015-06-11 10:22:46 +00:00
Zoran Jovanovic	6b0dcd7b8c	[mips] Change existing uimm10 operand to restrict the accepted immediates http://reviews.llvm.org/D10312 llvm-svn: 239520	2015-06-11 09:51:58 +00:00
Zoran Jovanovic	fcecf26092	[mips][microMIPSr6] Change disassembler tests to one line format llvm-svn: 239519	2015-06-11 09:42:10 +00:00
Hao Liu	405f1d1651	[LoopVectorize] Revert the enabling of interleaved memory access in Loop Vectorizer, which was wrongly committed in r239514. llvm-svn: 239515	2015-06-11 09:18:07 +00:00
Hao Liu	4566d18e89	[AArch64] Match interleaved memory accesses into ldN/stN instructions. Add a pass AArch64InterleavedAccess to identify and match interleaved memory accesses. This pass transforms an interleaved load/store into ldN/stN intrinsic. As Loop Vectorizor disables optimization on interleaved accesses by default, this optimization is also disabled by default. To enable it by "-aarch64-interleaved-access-opt=true" E.g. Transform an interleaved load (Factor = 2): %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle %wide.vec, undef, <0, 2, 4, 6> ; Extract even elements %v1 = shuffle %wide.vec, undef, <1, 3, 5, 7> ; Extract odd elements Into: %ld2 = { <4 x i32>, <4 x i32> } call aarch64.neon.ld2(%ptr) %v0 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 0 %v1 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 1 E.g. Transform an interleaved store (Factor = 2): %i.vec = shuffle %v0, %v1, <0, 4, 1, 5, 2, 6, 3, 7> ; Interleaved vec store <8 x i32> %i.vec, <8 x i32>* %ptr Into: %v0 = shuffle %i.vec, undef, <0, 1, 2, 3> %v1 = shuffle %i.vec, undef, <4, 5, 6, 7> call void aarch64.neon.st2(%v0, %v1, %ptr) llvm-svn: 239514	2015-06-11 09:05:02 +00:00
Arnaud A. de Grandmaison	af37ad19a9	[LiveVariables] Improve isLiveOut runtime performance. NFC. On large goto table based interpreters, where phi nodes can have (very) large fan-ins, isLiveOut exhibited poor performance: about 40% of the full codegen time was spent in PHIElim, sorting MachineBasicBlock addresses. This patch improves performance for such cases, and does not show compile time regressions on the LNT, at bootstrap (llvm+clang+lldb) or any other benchmarks we have in-house. llvm-svn: 239510	2015-06-11 07:50:21 +00:00
Simon Pilgrim	5965680d53	[X86][SSE] Vectorized i8 and i16 shift operators This patch ensures that SHL/SRL/SRA shifts for i8 and i16 vectors avoid scalarization. It builds on the existing i8 SHL vectorized implementation of moving the shift bits up to the sign bit position and separating the 4, 2 & 1 bit shifts with several improvements: 1 - SSE41 targets can use (v)pblendvb directly with the sign bit instead of performing a comparison to feed into a VSELECT node. 2 - pre-SSE41 targets were masking + comparing with an 0x80 constant - we avoid this by using the fact that a set sign bit means a negative integer which can be compared against zero to then feed into VSELECT, avoiding the need for a constant mask (zero generation is much cheaper). 3 - SRA i8 needs to be unpacked to the upper byte of a i16 so that the i16 psraw instruction can be correctly used for sign extension - we have to do more work than for SHL/SRL but perf tests indicate that this is still beneficial. The i16 implementation is similar but simpler than for i8 - we have to do 8, 4, 2 & 1 bit shifts but less shift masking is involved. SSE41 use of (v)pblendvb requires that the i16 shift amount is splatted to both bytes however. Tested on SSE2, SSE41 and AVX machines. Differential Revision: http://reviews.llvm.org/D9474 llvm-svn: 239509	2015-06-11 07:46:37 +00:00
Arnaud A. de Grandmaison	2e8ffa3b44	[PHIElim] Use ranges and const-ify, NFC. llvm-svn: 239508	2015-06-11 07:45:05 +00:00
Nemanja Ivanovic	ea1db8a697	LLVM support for vector quad bit permute and gather instructions through builtins This patch corresponds to review: http://reviews.llvm.org/D10096 This is the back end portion of the patch related to D10095. The patch adds the instructions and back end intrinsics for: vbpermq vgbbd llvm-svn: 239505	2015-06-11 06:21:25 +00:00
Reid Kleckner	c35e7f52ba	Revert "Move dllimport name mangling to IR mangler." This reverts commit r239437. This broke clang-cl self-hosts. We'd end up calling the __imp_ symbol directly instead of using it to do an indirect function call. llvm-svn: 239502	2015-06-11 01:31:48 +00:00
Pete Cooper	7cbe58d3c5	Remove MachineModuleInfo::UsedFunctions as it has no users. It hasn't been used since r130964. This also removes MachineModuleInfo::isUsedFunction and MachineModuleInfo::AnalyzeModule, both of which were only there to support UsedFunctions. llvm-svn: 239501	2015-06-11 01:04:56 +00:00
Sanjay Patel	1275a3c913	change assert that will never fire to llvm_unreachable llvm-svn: 239497	2015-06-10 23:27:33 +00:00
Jingyue Wu	f6ca8cfdcc	[NFC] added a missing space llvm-svn: 239495	2015-06-10 22:54:02 +00:00
Pete Cooper	3fc3040860	Stop returning a Use* from allocHungOffUses. This always just set the User::OperandList which is now set in that method instead of being returned. Reviewed by Duncan Exon Smith. llvm-svn: 239493	2015-06-10 22:38:46 +00:00
Pete Cooper	93f9ff5781	Add User::growHungoffUses and use it to grow the hung off uses. NFC. PhiNode, SwitchInst, LandingPad and IndirectBr all had virtually identical logic for growing the hung off uses. Move it to User so that they can all call a single shared implementation. Their destructors were all empty after this change and were deleted. They all have virtual clone_impl methods which can be used as vtable anchors. Reviewed by Duncan Exon Smith. llvm-svn: 239492	2015-06-10 22:38:41 +00:00
Pete Cooper	178dcc2938	Delete User::dropHungOffUses and move it in to ~User which is the only caller. NFC. Now that the subclasses which care about hung off uses let ~User clean it up, there's no need for a separate method. Just inline it to ~User and delete it. Reviewed by Duncan Exon Smith. llvm-svn: 239491	2015-06-10 22:38:38 +00:00
Pete Cooper	c6c0439d2a	Make User track whether a class has 'hung off uses' and delete them in its destructor. Currently all of the logic for deleting hung off uses, which PHI/switch/etc use, is in their classes. This adds a bit to Value which tracks whether that user had hung off uses, then User can be responsible for clearing them instead of the sub classes. Note, the bit used here was taken from NumOperands which was 30-bits. Given the reduction to 29 bits, and the average User being just over 100 bytes, a single User with 29-bits of num operands would need 50GB of RAM for itself so it's reasonable to assume that 29-bits is enough for now. This is a step towards hiding all the hung off uses logic in the User. Reviewed by Duncan Exon Smith. llvm-svn: 239490	2015-06-10 22:38:34 +00:00
Pete Cooper	87b925b064	Move the special Phi logic for hung off uses in to User::allocHungOffUses. NFC. PhiNode's need to allocate space for an array of Use[N] and then BasicBlock[N]. They had their own allocHungOffUses to handle all of this. This moves the logic in to User::allocHungOffUses and PhiNode passes in a bool to say to allocate the BB space too. Reviewed by Duncan Exon Smith. llvm-svn: 239489	2015-06-10 22:38:30 +00:00
Peter Collingbourne	115fe37621	ArgumentPromotion: Drop sret attribute on functions that are only called directly. If the first argument to a function is a 'this' argument and the second has the sret attribute, the ArgumentPromotion pass may promote the 'this' argument to more than one argument, violating the IR constraint that 'sret' may only be applied to the first or second argument. Although this IR constraint is arguably unnecessary, it highlighted the fact that ArgPromotion does not need to preserve this attribute. Dropping the attribute reduces register pressure in the backend by avoiding the register copy required by sret. Because sret implies noalias, we also replace the former with the latter. Differential Revision: http://reviews.llvm.org/D10353 llvm-svn: 239488	2015-06-10 21:14:34 +00:00
Sanjay Patel	08829bac81	[x86] Add a reassociation optimization to increase ILP via the MachineCombiner pass This is a reimplementation of D9780 at the machine instruction level rather than the DAG. Use the MachineCombiner pass to reassociate scalar single-precision AVX additions (just a starting point; see the TODO comments) to increase ILP when it's safe to do so. The code is closely based on the existing MachineCombiner optimization that is implemented for AArch64. This patch should not cause the kind of spilling tragedy that led to the reversion of r236031. Differential Revision: http://reviews.llvm.org/D10321 llvm-svn: 239486	2015-06-10 20:32:21 +00:00
Sanjay Patel	ccb8d5cc57	punctuation policing; NFC llvm-svn: 239484	2015-06-10 19:52:58 +00:00
Reid Kleckner	c87a6faba1	[WinEH] _except_handlerN uses 0 instead of 1 to indicate catch-all Our usage of 1 was a holdover from __C_specific_handler. llvm-svn: 239482	2015-06-10 18:14:07 +00:00

118328 Commits