llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	e5b51f6786	[X86][Btver2] FMUL ymm instructions are double pumped on the JFPM functional pipe llvm-svn: 328217	2018-03-22 17:25:38 +00:00
Andrea Di Biagio	12ef5260ea	[llvm-mca] Move the logic that computes the register file usage to the BackendStatistics view. With this patch, the "instruction dispatched" event now provides information related to the number of microarchitectural registers used in each register file. Similarly, the "instruction retired" event is now able to tell how may registers are freed in each register file. Currently, the BackendStatistics view is the only consumer of register usage/pressure information. BackendStatistics uses that info to print out a few general statistics (i.e. max number of mappings used; total mapping created). Before this patch, the BackendStatistics was forced to query the Backend to obtain the register pressure information. This helps removes that dependency. Now views are completely independent from the Backend. As a consequence, it should be easier to address PR36663 and further modularize the pipeline. Added a couple of test cases in the BtVer2 specific directory. llvm-svn: 328129	2018-03-21 18:11:05 +00:00
Simon Pilgrim	203876f104	[X86][Btver2] Fix crc32 schedule costs The default is currently FAdd for some reason llvm-svn: 327807	2018-03-18 19:54:42 +00:00
Simon Pilgrim	13cd3b0961	[X86][Btver2] Add crc32 resource tests llvm-svn: 327805	2018-03-18 18:55:34 +00:00
Simon Pilgrim	c3db8c7cda	[X86][Btver2] FADD/FHADD ymm instructions are double pumped on the JFPA functional pipe llvm-svn: 327804	2018-03-18 18:45:57 +00:00
Simon Pilgrim	036cc82622	[X86][Btver2] Float bitwise ymm instructions are double pumped on the JFPX (JFPA/JFPM) functional pipes llvm-svn: 327803	2018-03-18 17:10:12 +00:00
Simon Pilgrim	87d2f7463f	[X86][Btver2] F16C instructions are performed on the JSTC functional pipe llvm-svn: 327801	2018-03-18 15:59:51 +00:00
Simon Pilgrim	40f6d6ad0b	[X86][Btver2] SSE4A EXTRQ/INSERTQ instructions are performed on the JVALU0/JVALU1 functional pipes llvm-svn: 327794	2018-03-18 13:05:09 +00:00
Simon Pilgrim	e16790b133	[X86][Btver2] Modelled float bitwise instructions as being performed on the float cluster (FPA/FPM) not the integer. llvm-svn: 327793	2018-03-18 12:37:35 +00:00
Simon Pilgrim	e409f84e7e	[X86][Btver2] Correctly distinguish between scheduling pipe and functional unit for JWriteResFpuPair defs Jaguar's FPU has 2 scheduler pipes (JFPU0/JFPU1) which forward to multiple functional sub-units each. We need to model that an micro-op will both consume the scheduler pipe and a functional unit. This patch just handles the ops defined through JWriteResFpuPair, I'll go through the custom cases later. llvm-svn: 327791	2018-03-18 12:09:17 +00:00
Simon Pilgrim	0ba4a0f3a6	[X86][Btver2] Add llvm-mca tests to show pipe resource usage of most vector instructions Hopefully these tests can be easily reused should any other subtarget get in depth llvm-mca coverage (we can either copy the tests or move them into a common dir and run it with multiple prefixes). llvm-svn: 327788	2018-03-18 09:32:38 +00:00
Simon Pilgrim	9c4157bb70	[X86][Btver2] Tweak pipes test to remove register dependencies It gives us a better view of pipe usage in the timeline which is what the test is trying to show. llvm-svn: 327685	2018-03-15 23:15:11 +00:00
Simon Pilgrim	3894809997	[X86][Btver2] Fix ymm div/sqrt to use fmul unit YMM FDiv/FSqrt are dispatched on pipe JFPU1 but should be performed on the JFPM unit - that is where most of the cycles are spent. This matches the pipes for WriteFSqrt/WriteFDiv definitions. llvm-svn: 327682	2018-03-15 23:00:47 +00:00
Simon Pilgrim	49a56faee2	[X86][Btver2] Add test to show timeline of fpu instructions on different pipes/units Try to demonstrate the scheduling from fpu0/fpu1 pipes to the valu0/vimul/fpa or valu1/stc/fpm functional units llvm-svn: 327676	2018-03-15 22:34:24 +00:00
Andrea Di Biagio	7948738673	[llvm-mca] BackendStatistics: early exit from method printSchedulerUsage if the no scheduler resources were consumed. llvm-svn: 327215	2018-03-10 17:40:25 +00:00
Andrea Di Biagio	373c38a2db	[llvm-mca] Fix handling of zero-latency instructions. This patch fixes a problem found when testing zero latency instructions on target AArch64 -mcpu=exynos-m3 / -mcpu=exynos-m1. On Exynos-m3/m1, direct branches are zero-latency instructions that don't consume any processor resources. The DispatchUnit marks zero-latency instructions as "executed", so that no scheduling is required. The event of instruction executed is then notified to all the listeners, and the reorder buffer (managed by the RetireControlUnit) is updated. In particular, the entry associated to the zero-latency instruction in the reorder buffer is marked as executed. Before this patch, the DispatchUnit forgot to assign a retire control unit token (RCUToken) to the zero-latency instruction. As a consequence, the RCUToken was used uninitialized. This was causing a crash in the RetireControlUnit logic. Fixes PR36650. llvm-svn: 327056	2018-03-08 20:21:55 +00:00
Andrea Di Biagio	7bbac07f22	[llvm-mca] Emit the 'Instruction Info' table before the resource pressure view. In future, both the summary information and the 'instruction info' table should be moved into a separate "Summary" view. llvm-svn: 327010	2018-03-08 15:34:38 +00:00
Andrea Di Biagio	3a6b092017	[llvm-mca] LLVM Machine Code Analyzer. llvm-mca is an LLVM based performance analysis tool that can be used to statically measure the performance of code, and to help triage potential problems with target scheduling models. llvm-mca uses information which is already available in LLVM (e.g. scheduling models) to statically measure the performance of machine code in a specific cpu. Performance is measured in terms of throughput as well as processor resource consumption. The tool currently works for processors with an out-of-order backend, for which there is a scheduling model available in LLVM. The main goal of this tool is not just to predict the performance of the code when run on the target, but also help with diagnosing potential performance issues. Given an assembly code sequence, llvm-mca estimates the IPC (instructions per cycle), as well as hardware resources pressure. The analysis and reporting style were mostly inspired by the IACA tool from Intel. This patch is related to the RFC on llvm-dev visible at this link: http://lists.llvm.org/pipermail/llvm-dev/2018-March/121490.html Differential Revision: https://reviews.llvm.org/D43951 llvm-svn: 326998	2018-03-08 13:05:02 +00:00

... 4 5 6 7 8

368 Commits