perf/x86: Fix spurious NMI with PEBS Load Latency event
Spurious NMIs will be observed with the following command: while :; do perf record -bae "cpu/umask=0x01,event=0xcd,ldlat=0x80/pp" -e "cpu/umask=0x03,event=0x0/" -e "cpu/umask=0x02,event=0x0/" -e cycles,branches,cache-misses -e cache-references -- sleep 10 done The bug was introduced by commit:8077eca079
("perf/x86/pebs: Add workaround for broken OVFL status on HSW+") That commit clears the status bits for the counters used for PEBS events, by masking the whole 64 bits pebs_enabled. However, only the low 32 bits of both status and pebs_enabled are reserved for PEBS-able counters. For status bits 32-34 are fixed counter overflow bits. For pebs_enabled bits 32-34 are for PEBS Load Latency. In the test case, the PEBS Load Latency event and fixed counter event could overflow at the same time. The fixed counter overflow bit will be cleared by mistake. Once it is cleared, the fixed counter overflow never be processed, which finally trigger spurious NMI. Correct the PEBS enabled mask by ignoring the non-PEBS bits. Signed-off-by: Kan Liang <kan.liang@intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com> Cc: Arnaldo Carvalho de Melo <acme@redhat.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Stephane Eranian <eranian@google.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Vince Weaver <vincent.weaver@maine.edu> Fixes:8077eca079
("perf/x86/pebs: Add workaround for broken OVFL status on HSW+") Link: http://lkml.kernel.org/r/1491333246-3965-1-git-send-email-kan.liang@intel.com Signed-off-by: Ingo Molnar <mingo@kernel.org>
This commit is contained in:
parent
18c5c7c618
commit
fd583ad156
|
@ -2151,7 +2151,7 @@ again:
|
||||||
* counters from the GLOBAL_STATUS mask and we always process PEBS
|
* counters from the GLOBAL_STATUS mask and we always process PEBS
|
||||||
* events via drain_pebs().
|
* events via drain_pebs().
|
||||||
*/
|
*/
|
||||||
status &= ~cpuc->pebs_enabled;
|
status &= ~(cpuc->pebs_enabled & PEBS_COUNTER_MASK);
|
||||||
|
|
||||||
/*
|
/*
|
||||||
* PEBS overflow sets bit 62 in the global status register
|
* PEBS overflow sets bit 62 in the global status register
|
||||||
|
|
|
@ -1222,7 +1222,7 @@ get_next_pebs_record_by_bit(void *base, void *top, int bit)
|
||||||
|
|
||||||
/* clear non-PEBS bit and re-check */
|
/* clear non-PEBS bit and re-check */
|
||||||
pebs_status = p->status & cpuc->pebs_enabled;
|
pebs_status = p->status & cpuc->pebs_enabled;
|
||||||
pebs_status &= (1ULL << MAX_PEBS_EVENTS) - 1;
|
pebs_status &= PEBS_COUNTER_MASK;
|
||||||
if (pebs_status == (1 << bit))
|
if (pebs_status == (1 << bit))
|
||||||
return at;
|
return at;
|
||||||
}
|
}
|
||||||
|
|
|
@ -79,6 +79,7 @@ struct amd_nb {
|
||||||
|
|
||||||
/* The maximal number of PEBS events: */
|
/* The maximal number of PEBS events: */
|
||||||
#define MAX_PEBS_EVENTS 8
|
#define MAX_PEBS_EVENTS 8
|
||||||
|
#define PEBS_COUNTER_MASK ((1ULL << MAX_PEBS_EVENTS) - 1)
|
||||||
|
|
||||||
/*
|
/*
|
||||||
* Flags PEBS can handle without an PMI.
|
* Flags PEBS can handle without an PMI.
|
||||||
|
|
Loading…
Reference in New Issue