llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	19f9f374d9	[SimplifyLibCalls] require fast-math-flags for pow(X, -0.5) transforms As discussed in PR44330: https://bugs.llvm.org/show_bug.cgi?id=44330 ...the transform from pow(X, -0.5) libcall/intrinsic to reciprocal square root can result in small deviations from the expected result due to differences in the pow() implementation and/or the extra rounding step from the division. This patch proposes to allow that difference with either the 'approximate functions' or 'reassociate' FMF: http://llvm.org/docs/LangRef.html#fast-math-flags In practice, this likely means that the code is compiled with all of 'fast' (-ffast-math), but I have preserved the existing specializations for -0.0/-INF that enable generating safe code if those special values are allowed simultaneously with allowing approximation/reassociation. The question about whether a similar restriction is needed for the non-reciprocal case -- pow(X, 0.5) -- is deferred. That transform is allowed without FMF currently, and this patch does not change that behavior. Differential Revision: https://reviews.llvm.org/D71706	2019-12-21 10:00:53 -05:00
Sanjay Patel	5889e7823d	[InstCombine] add/adjust tests for pow->sqrt; NFC There's at least 1 bug here as discussed in PR44330.	2019-12-19 09:25:19 -05:00
Sanjay Patel	5a4f7cf2ff	[IR] allow fast-math-flags on select of FP values This is a minimal start to correcting a problem most directly discussed in PR38086: https://bugs.llvm.org/show_bug.cgi?id=38086 We have been hacking around a limitation for FP select patterns by using the fast-math-flags on the condition of the select rather than the select itself. This patch just allows FMF to appear with the 'select' opcode. No changes are needed to "FPMathOperator", which already includes select-of-FP because that definition is based on the (return) value type. Once we have this ability, we can start correcting and adding IR transforms to use the FMF on a 'select' instruction. The instcombine and vectorizer test diffs only show that the IRBuilder change is behaving as expected by applying an FMF guard value to 'select'. For reference: rL241901 allowed FMF with fcmp; rL255555 allowed FMF with FP calls. Differential Revision: https://reviews.llvm.org/D61917 llvm-svn: 361401	2019-05-22 15:50:46 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Evandro Menezes	42422b33cf	[NFC] Fix typo in test cases llvm-svn: 339900	2018-08-16 17:03:22 +00:00
Evandro Menezes	c05c7e11bb	[InstCombine] Expand the simplification of pow(x, 0.5) to sqrt(x) Expand the number of cases when `pow(x, 0.5)` is simplified into `sqrt(x)` by considering the math semantics with more granularity. Differential revision: https://reviews.llvm.org/D50036 llvm-svn: 339887	2018-08-16 15:58:08 +00:00
Sanjay Patel	b1546da0e8	[InstCombine] fix typos in tests; NFC See D50036. llvm-svn: 339713	2018-08-14 19:13:07 +00:00
Sanjay Patel	73b7e9f65e	[InstCombine] add tests for pow->sqrt; NFC D50036 should fix the missed optimizations. llvm-svn: 339711	2018-08-14 19:05:37 +00:00
Evandro Menezes	a7d48286fb	[SLC] Refactor the simplification of pow() (NFC) Use more meaningful variable names. Mostly NFC. llvm-svn: 338266	2018-07-30 16:20:04 +00:00
Sanjay Patel	9771a96f6e	[LibCallSimplifier] allow splat vectors for pow(x, 0.5) -> sqrt() transforms llvm-svn: 318629	2017-11-19 16:42:27 +00:00
Sanjay Patel	fbd3e66b9a	[LibCallSimplifier] partly fix pow(x, 0.5) -> sqrt() transforms As the first test shows, we could transform an llvm intrinsic which never sets errno into a libcall which could set errno (even though it's marked readnone?), so that's not ideal. It's possible that we can also transform a libcall which could set errno to an intrinsic given the fast-math-flags constraint, but that's deferred to determine exactly which set of FMF are needed. Differential Revision: https://reviews.llvm.org/D40150 llvm-svn: 318628	2017-11-19 16:13:14 +00:00
Sanjay Patel	cc318be68d	[InstCombine] add tests for pow(); NFC Also, increase test diversity (and show another bug) by varying the types. llvm-svn: 318430	2017-11-16 17:49:54 +00:00
Sanjay Patel	cebbfacc9e	[InstCombine] add tests for 'afn' FMF; NFC llvm-svn: 318423	2017-11-16 17:06:36 +00:00
Sanjay Patel	dcb9e1b387	[InstCombine] regenerate test checks; NFC llvm-svn: 318420	2017-11-16 17:01:09 +00:00
Matt Arsenault	6a288c1e32	Replace hardcoded intrinsic list with speculatable attribute. No change in which intrinsics should be speculated. llvm-svn: 301995	2017-05-03 02:26:10 +00:00
Davide Italiano	472684eaf5	[SimplifyLibCalls] pow(x, -0.5) -> 1.0 / sqrt(x). Differential Revision: https://reviews.llvm.org/D28479 llvm-svn: 291486	2017-01-09 21:55:23 +00:00
Davide Italiano	873219c406	[SimplifyLibCalls] Restore the old behaviour, emit a libcall. Hal pointed out that the semantics of our intrinsic and the libc call are slightly different. Add a comment while I'm here to explain why we can't emit an intrinsic. Thanks Hal! llvm-svn: 278200	2016-08-10 06:33:32 +00:00
Davide Italiano	e3b916d164	[SimplifyLibCalls] Emit sqrt intrinsic instead of a libcall. llvm-svn: 277972	2016-08-08 03:23:01 +00:00
Sanjay Patel	53ba88dbb0	[LibCallSimplifier] use instruction-level fast-math-flags to transform pow(x, 0.5) calls Also, propagate the FMF to the newly created sqrt() call. llvm-svn: 257503	2016-01-12 19:06:35 +00:00
Davide Italiano	c5cedd195a	[SimplifyLibCalls] New trick: pow(x, 0.5) -> sqrt(x) under -ffast-math. Differential Revision: http://reviews.llvm.org/D14466 llvm-svn: 253521	2015-11-18 23:21:32 +00:00

21 Commits
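
Several of the messages above turn on the special values -0.0 and -INF, where pow(x, 0.5) and sqrt(x) disagree. The sketch below is only an illustration in Python, not LLVM code: the function names `pow_half_naive` and `pow_half_guarded` are hypothetical, and the guarded form is one assumed way to keep the pow() results for those two inputs.

```python
import math


def pow_half_naive(x: float) -> float:
    """Hypothetical helper: replace pow(x, 0.5) with a bare sqrt(x)."""
    return math.sqrt(x)


def pow_half_guarded(x: float) -> float:
    """Hypothetical helper: sqrt-based form that keeps pow() results
    for -0.0 and -inf (an assumed guard, not LLVM's actual code)."""
    if x == -math.inf:
        # pow(-inf, 0.5) is +inf, while sqrt(-inf) is a domain error.
        return math.inf
    # pow(-0.0, 0.5) is +0.0, while sqrt(-0.0) is -0.0; abs() clears the sign.
    return abs(math.sqrt(x))


def sign_of(value: float) -> float:
    """Return 1.0 or -1.0, distinguishing +0.0 from -0.0."""
    return math.copysign(1.0, value)


if __name__ == "__main__":
    print("pow(-0.0, 0.5) sign:", sign_of(math.pow(-0.0, 0.5)))
    print("naive sqrt(-0.0) sign:", sign_of(pow_half_naive(-0.0)))
    print("guarded sign:", sign_of(pow_half_guarded(-0.0)))
    print("pow(-inf, 0.5):", math.pow(-math.inf, 0.5))
    print("guarded(-inf):", pow_half_guarded(-math.inf))
```

Ordinary inputs are unaffected by the guard; only the two special values take a different path. The rounding differences that the pow(X, -0.5) message describes for the reciprocal form are a separate issue and are not modeled here.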