License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 22:07:57 +08:00
|
|
|
// SPDX-License-Identifier: GPL-2.0
|
2012-03-16 03:09:17 +08:00
|
|
|
#include <linux/list.h>
|
2015-06-10 15:25:07 +08:00
|
|
|
#include <linux/compiler.h>
|
2019-06-26 22:42:03 +08:00
|
|
|
#include <linux/string.h>
|
2019-07-04 22:32:27 +08:00
|
|
|
#include <linux/zalloc.h>
|
2021-07-01 14:42:53 +08:00
|
|
|
#include <linux/ctype.h>
|
2019-08-30 23:52:25 +08:00
|
|
|
#include <subcmd/pager.h>
|
2012-03-16 03:09:17 +08:00
|
|
|
#include <sys/types.h>
|
2017-04-18 21:46:11 +08:00
|
|
|
#include <errno.h>
|
2017-09-11 21:50:26 +08:00
|
|
|
#include <fcntl.h>
|
2017-04-20 07:57:47 +08:00
|
|
|
#include <sys/stat.h>
|
2012-03-16 03:09:17 +08:00
|
|
|
#include <unistd.h>
|
|
|
|
#include <stdio.h>
|
2014-07-31 14:00:49 +08:00
|
|
|
#include <stdbool.h>
|
2014-07-31 14:00:50 +08:00
|
|
|
#include <stdarg.h>
|
2012-03-16 03:09:17 +08:00
|
|
|
#include <dirent.h>
|
2013-12-10 00:14:24 +08:00
|
|
|
#include <api/fs/fs.h>
|
2013-11-13 00:58:49 +08:00
|
|
|
#include <locale.h>
|
2017-12-04 22:57:28 +08:00
|
|
|
#include <regex.h>
|
2019-07-21 19:24:30 +08:00
|
|
|
#include <perf/cpumap.h>
|
2021-07-01 14:42:53 +08:00
|
|
|
#include <fnmatch.h>
|
2019-08-22 21:48:31 +08:00
|
|
|
#include "debug.h"
|
2020-04-01 18:16:09 +08:00
|
|
|
#include "evsel.h"
|
2012-03-16 03:09:17 +08:00
|
|
|
#include "pmu.h"
|
|
|
|
#include "parse-events.h"
|
2016-09-16 06:24:40 +08:00
|
|
|
#include "header.h"
|
2017-04-18 03:51:59 +08:00
|
|
|
#include "string2.h"
|
2019-08-30 23:52:25 +08:00
|
|
|
#include "strbuf.h"
|
2019-11-21 08:15:11 +08:00
|
|
|
#include "fncache.h"
|
2021-04-27 15:01:18 +08:00
|
|
|
#include "pmu-hybrid.h"
|
2012-03-16 03:09:17 +08:00
|
|
|
|
2020-06-10 00:23:24 +08:00
|
|
|
struct perf_pmu perf_pmu__fake;
|
|
|
|
|
2013-01-19 04:05:09 +08:00
|
|
|
struct perf_pmu_format {
|
|
|
|
char *name;
|
|
|
|
int value;
|
|
|
|
DECLARE_BITMAP(bits, PERF_PMU_FORMAT_BITS);
|
|
|
|
struct list_head list;
|
|
|
|
};
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
int perf_pmu_parse(struct list_head *list, char *name);
|
|
|
|
extern FILE *perf_pmu_in;
|
|
|
|
|
|
|
|
static LIST_HEAD(pmus);
|
2021-04-27 15:01:19 +08:00
|
|
|
static bool hybrid_scanned;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Parse & process all the sysfs attributes located under
|
|
|
|
* the directory specified in 'dir' parameter.
|
|
|
|
*/
|
2012-11-10 08:46:50 +08:00
|
|
|
int perf_pmu__format_parse(char *dir, struct list_head *head)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
struct dirent *evt_ent;
|
|
|
|
DIR *format_dir;
|
|
|
|
int ret = 0;
|
|
|
|
|
|
|
|
format_dir = opendir(dir);
|
|
|
|
if (!format_dir)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
while (!ret && (evt_ent = readdir(format_dir))) {
|
|
|
|
char path[PATH_MAX];
|
|
|
|
char *name = evt_ent->d_name;
|
|
|
|
FILE *file;
|
|
|
|
|
|
|
|
if (!strcmp(name, ".") || !strcmp(name, ".."))
|
|
|
|
continue;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX, "%s/%s", dir, name);
|
|
|
|
|
|
|
|
ret = -EINVAL;
|
|
|
|
file = fopen(path, "r");
|
|
|
|
if (!file)
|
|
|
|
break;
|
|
|
|
|
|
|
|
perf_pmu_in = file;
|
|
|
|
ret = perf_pmu_parse(head, name);
|
|
|
|
fclose(file);
|
|
|
|
}
|
|
|
|
|
|
|
|
closedir(format_dir);
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Reading/parsing the default pmu format definition, which should be
|
|
|
|
* located at:
|
|
|
|
* /sys/bus/event_source/devices/<dev>/format as sysfs group attributes.
|
|
|
|
*/
|
2013-07-04 21:20:25 +08:00
|
|
|
static int pmu_format(const char *name, struct list_head *format)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
2013-11-06 01:48:50 +08:00
|
|
|
const char *sysfs = sysfs__mountpoint();
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX,
|
2012-08-17 03:10:24 +08:00
|
|
|
"%s" EVENT_SOURCE_DEVICE_PATH "%s/format", sysfs, name);
|
2012-03-16 03:09:17 +08:00
|
|
|
|
2019-11-21 08:15:11 +08:00
|
|
|
if (!file_available(path))
|
|
|
|
return 0;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
2012-11-10 08:46:50 +08:00
|
|
|
if (perf_pmu__format_parse(path, format))
|
2012-03-16 03:09:17 +08:00
|
|
|
return -1;
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2019-08-28 13:59:29 +08:00
|
|
|
int perf_pmu__convert_scale(const char *scale, char **end, double *sval)
|
2013-11-13 00:58:49 +08:00
|
|
|
{
|
2016-03-09 02:42:30 +08:00
|
|
|
char *lc;
|
2017-01-03 23:08:23 +08:00
|
|
|
int ret = 0;
|
2015-05-31 14:06:23 +08:00
|
|
|
|
2013-11-13 00:58:49 +08:00
|
|
|
/*
|
|
|
|
* save current locale
|
|
|
|
*/
|
|
|
|
lc = setlocale(LC_NUMERIC, NULL);
|
|
|
|
|
perf tools: Fix locale handling in pmu parsing
Ingo reported regression on display format of big numbers, which is
missing separators (in default perf stat output).
triton:~/tip> perf stat -a sleep 1
...
127008602 cycles # 0.011 GHz
279538533 stalled-cycles-frontend # 220.09% frontend cycles idle
119213269 instructions # 0.94 insn per cycle
This is caused by recent change:
perf stat: Check existence of frontend/backed stalled cycles
that added call to pmu_have_event, that subsequently calls
perf_pmu__parse_scale, which has a bug in locale handling.
The lc string returned from setlocale, that we use to store old locale
value, may be allocated in static storage. Getting a dynamic copy to
make it survive another setlocale call.
$ perf stat ls
...
2,360,602 cycles # 3.080 GHz
2,703,090 instructions # 1.15 insn per cycle
546,031 branches # 712.511 M/sec
Committer note:
Since the patch introducing the regression didn't made to perf/core,
move it to just before where the regression was introduced, so that we
don't break bisection for this feature.
Reported-by: Ingo Molnar <mingo@redhat.com>
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/20160303095348.GA24511@krava.redhat.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-03-03 17:53:48 +08:00
|
|
|
/*
|
|
|
|
* The lc string may be allocated in static storage,
|
|
|
|
* so get a dynamic copy to make it survive setlocale
|
|
|
|
* call below.
|
|
|
|
*/
|
|
|
|
lc = strdup(lc);
|
|
|
|
if (!lc) {
|
|
|
|
ret = -ENOMEM;
|
2017-01-03 23:08:23 +08:00
|
|
|
goto out;
|
perf tools: Fix locale handling in pmu parsing
Ingo reported regression on display format of big numbers, which is
missing separators (in default perf stat output).
triton:~/tip> perf stat -a sleep 1
...
127008602 cycles # 0.011 GHz
279538533 stalled-cycles-frontend # 220.09% frontend cycles idle
119213269 instructions # 0.94 insn per cycle
This is caused by recent change:
perf stat: Check existence of frontend/backed stalled cycles
that added call to pmu_have_event, that subsequently calls
perf_pmu__parse_scale, which has a bug in locale handling.
The lc string returned from setlocale, that we use to store old locale
value, may be allocated in static storage. Getting a dynamic copy to
make it survive another setlocale call.
$ perf stat ls
...
2,360,602 cycles # 3.080 GHz
2,703,090 instructions # 1.15 insn per cycle
546,031 branches # 712.511 M/sec
Committer note:
Since the patch introducing the regression didn't made to perf/core,
move it to just before where the regression was introduced, so that we
don't break bisection for this feature.
Reported-by: Ingo Molnar <mingo@redhat.com>
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/20160303095348.GA24511@krava.redhat.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-03-03 17:53:48 +08:00
|
|
|
}
|
|
|
|
|
2013-11-13 00:58:49 +08:00
|
|
|
/*
|
|
|
|
* force to C locale to ensure kernel
|
|
|
|
* scale string is converted correctly.
|
|
|
|
* kernel uses default C locale.
|
|
|
|
*/
|
|
|
|
setlocale(LC_NUMERIC, "C");
|
|
|
|
|
2017-01-03 23:08:23 +08:00
|
|
|
*sval = strtod(scale, end);
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2017-01-03 23:08:23 +08:00
|
|
|
out:
|
2013-11-13 00:58:49 +08:00
|
|
|
/* restore locale */
|
|
|
|
setlocale(LC_NUMERIC, lc);
|
2016-03-09 02:42:30 +08:00
|
|
|
free(lc);
|
2017-01-03 23:08:23 +08:00
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int perf_pmu__parse_scale(struct perf_pmu_alias *alias, char *dir, char *name)
|
|
|
|
{
|
|
|
|
struct stat st;
|
|
|
|
ssize_t sret;
|
|
|
|
char scale[128];
|
|
|
|
int fd, ret = -1;
|
|
|
|
char path[PATH_MAX];
|
|
|
|
|
2018-11-12 02:45:24 +08:00
|
|
|
scnprintf(path, PATH_MAX, "%s/%s.scale", dir, name);
|
2017-01-03 23:08:23 +08:00
|
|
|
|
|
|
|
fd = open(path, O_RDONLY);
|
|
|
|
if (fd == -1)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
if (fstat(fd, &st) < 0)
|
|
|
|
goto error;
|
|
|
|
|
|
|
|
sret = read(fd, scale, sizeof(scale)-1);
|
|
|
|
if (sret < 0)
|
|
|
|
goto error;
|
|
|
|
|
|
|
|
if (scale[sret - 1] == '\n')
|
|
|
|
scale[sret - 1] = '\0';
|
|
|
|
else
|
|
|
|
scale[sret] = '\0';
|
perf tools: Fix locale handling in pmu parsing
Ingo reported regression on display format of big numbers, which is
missing separators (in default perf stat output).
triton:~/tip> perf stat -a sleep 1
...
127008602 cycles # 0.011 GHz
279538533 stalled-cycles-frontend # 220.09% frontend cycles idle
119213269 instructions # 0.94 insn per cycle
This is caused by recent change:
perf stat: Check existence of frontend/backed stalled cycles
that added call to pmu_have_event, that subsequently calls
perf_pmu__parse_scale, which has a bug in locale handling.
The lc string returned from setlocale, that we use to store old locale
value, may be allocated in static storage. Getting a dynamic copy to
make it survive another setlocale call.
$ perf stat ls
...
2,360,602 cycles # 3.080 GHz
2,703,090 instructions # 1.15 insn per cycle
546,031 branches # 712.511 M/sec
Committer note:
Since the patch introducing the regression didn't made to perf/core,
move it to just before where the regression was introduced, so that we
don't break bisection for this feature.
Reported-by: Ingo Molnar <mingo@redhat.com>
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/20160303095348.GA24511@krava.redhat.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-03-03 17:53:48 +08:00
|
|
|
|
2019-08-28 13:59:29 +08:00
|
|
|
ret = perf_pmu__convert_scale(scale, NULL, &alias->scale);
|
2013-11-13 00:58:49 +08:00
|
|
|
error:
|
|
|
|
close(fd);
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int perf_pmu__parse_unit(struct perf_pmu_alias *alias, char *dir, char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
ssize_t sret;
|
|
|
|
int fd;
|
|
|
|
|
2018-11-12 02:45:24 +08:00
|
|
|
scnprintf(path, PATH_MAX, "%s/%s.unit", dir, name);
|
2013-11-13 00:58:49 +08:00
|
|
|
|
|
|
|
fd = open(path, O_RDONLY);
|
|
|
|
if (fd == -1)
|
|
|
|
return -1;
|
|
|
|
|
2015-12-14 23:44:40 +08:00
|
|
|
sret = read(fd, alias->unit, UNIT_MAX_LEN);
|
2013-11-13 00:58:49 +08:00
|
|
|
if (sret < 0)
|
|
|
|
goto error;
|
|
|
|
|
|
|
|
close(fd);
|
|
|
|
|
2015-05-31 14:06:23 +08:00
|
|
|
if (alias->unit[sret - 1] == '\n')
|
|
|
|
alias->unit[sret - 1] = '\0';
|
|
|
|
else
|
|
|
|
alias->unit[sret] = '\0';
|
2013-11-13 00:58:49 +08:00
|
|
|
|
|
|
|
return 0;
|
|
|
|
error:
|
|
|
|
close(fd);
|
|
|
|
alias->unit[0] = '\0';
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
2014-11-21 17:31:12 +08:00
|
|
|
static int
|
|
|
|
perf_pmu__parse_per_pkg(struct perf_pmu_alias *alias, char *dir, char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
int fd;
|
|
|
|
|
2018-11-12 02:45:24 +08:00
|
|
|
scnprintf(path, PATH_MAX, "%s/%s.per-pkg", dir, name);
|
2014-11-21 17:31:12 +08:00
|
|
|
|
|
|
|
fd = open(path, O_RDONLY);
|
|
|
|
if (fd == -1)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
close(fd);
|
|
|
|
|
|
|
|
alias->per_pkg = true;
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2014-11-21 17:31:13 +08:00
|
|
|
static int perf_pmu__parse_snapshot(struct perf_pmu_alias *alias,
|
|
|
|
char *dir, char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
int fd;
|
|
|
|
|
2018-11-12 02:45:24 +08:00
|
|
|
scnprintf(path, PATH_MAX, "%s/%s.snapshot", dir, name);
|
2014-11-21 17:31:13 +08:00
|
|
|
|
|
|
|
fd = open(path, O_RDONLY);
|
|
|
|
if (fd == -1)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
alias->snapshot = true;
|
|
|
|
close(fd);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2018-06-15 18:11:05 +08:00
|
|
|
static void perf_pmu_assign_str(char *name, const char *field, char **old_str,
|
|
|
|
char **new_str)
|
|
|
|
{
|
|
|
|
if (!*old_str)
|
|
|
|
goto set_new;
|
|
|
|
|
|
|
|
if (*new_str) { /* Have new string, check with old */
|
|
|
|
if (strcasecmp(*old_str, *new_str))
|
|
|
|
pr_debug("alias %s differs in field '%s'\n",
|
|
|
|
name, field);
|
|
|
|
zfree(old_str);
|
|
|
|
} else /* Nothing new --> keep old string */
|
|
|
|
return;
|
|
|
|
set_new:
|
|
|
|
*old_str = *new_str;
|
|
|
|
*new_str = NULL;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void perf_pmu_update_alias(struct perf_pmu_alias *old,
|
|
|
|
struct perf_pmu_alias *newalias)
|
|
|
|
{
|
|
|
|
perf_pmu_assign_str(old->name, "desc", &old->desc, &newalias->desc);
|
|
|
|
perf_pmu_assign_str(old->name, "long_desc", &old->long_desc,
|
|
|
|
&newalias->long_desc);
|
|
|
|
perf_pmu_assign_str(old->name, "topic", &old->topic, &newalias->topic);
|
|
|
|
perf_pmu_assign_str(old->name, "metric_expr", &old->metric_expr,
|
|
|
|
&newalias->metric_expr);
|
|
|
|
perf_pmu_assign_str(old->name, "metric_name", &old->metric_name,
|
|
|
|
&newalias->metric_name);
|
|
|
|
perf_pmu_assign_str(old->name, "value", &old->str, &newalias->str);
|
|
|
|
old->scale = newalias->scale;
|
|
|
|
old->per_pkg = newalias->per_pkg;
|
|
|
|
old->snapshot = newalias->snapshot;
|
|
|
|
memcpy(old->unit, newalias->unit, sizeof(old->unit));
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Delete an alias entry. */
|
2020-09-15 11:18:18 +08:00
|
|
|
void perf_pmu_free_alias(struct perf_pmu_alias *newalias)
|
2018-06-15 18:11:05 +08:00
|
|
|
{
|
|
|
|
zfree(&newalias->name);
|
|
|
|
zfree(&newalias->desc);
|
|
|
|
zfree(&newalias->long_desc);
|
|
|
|
zfree(&newalias->topic);
|
|
|
|
zfree(&newalias->str);
|
|
|
|
zfree(&newalias->metric_expr);
|
|
|
|
zfree(&newalias->metric_name);
|
2021-04-27 15:01:17 +08:00
|
|
|
zfree(&newalias->pmu_name);
|
2018-06-15 18:11:05 +08:00
|
|
|
parse_events_terms__purge(&newalias->terms);
|
|
|
|
free(newalias);
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Merge an alias, search in alias list. If this name is already
|
|
|
|
* present merge both of them to combine all information.
|
|
|
|
*/
|
|
|
|
static bool perf_pmu_merge_alias(struct perf_pmu_alias *newalias,
|
|
|
|
struct list_head *alist)
|
|
|
|
{
|
|
|
|
struct perf_pmu_alias *a;
|
|
|
|
|
|
|
|
list_for_each_entry(a, alist, list) {
|
|
|
|
if (!strcasecmp(newalias->name, a->name)) {
|
2021-04-27 15:01:17 +08:00
|
|
|
if (newalias->pmu_name && a->pmu_name &&
|
|
|
|
!strcasecmp(newalias->pmu_name, a->pmu_name)) {
|
|
|
|
continue;
|
|
|
|
}
|
2018-06-15 18:11:05 +08:00
|
|
|
perf_pmu_update_alias(a, newalias);
|
|
|
|
perf_pmu_free_alias(newalias);
|
|
|
|
return true;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
return false;
|
|
|
|
}
|
|
|
|
|
2015-06-10 15:25:08 +08:00
|
|
|
static int __perf_pmu__new_alias(struct list_head *list, char *dir, char *name,
|
2021-04-27 15:01:16 +08:00
|
|
|
char *desc, char *val, struct pmu_event *pe)
|
2012-06-15 14:31:41 +08:00
|
|
|
{
|
2018-06-15 18:11:04 +08:00
|
|
|
struct parse_events_term *term;
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_alias *alias;
|
2012-06-15 14:31:41 +08:00
|
|
|
int ret;
|
2017-01-28 10:03:37 +08:00
|
|
|
int num;
|
2018-06-15 18:11:04 +08:00
|
|
|
char newval[256];
|
2021-04-27 15:01:16 +08:00
|
|
|
char *long_desc = NULL, *topic = NULL, *unit = NULL, *perpkg = NULL,
|
2021-04-27 15:01:17 +08:00
|
|
|
*metric_expr = NULL, *metric_name = NULL, *deprecated = NULL,
|
|
|
|
*pmu_name = NULL;
|
2021-04-27 15:01:16 +08:00
|
|
|
|
|
|
|
if (pe) {
|
|
|
|
long_desc = (char *)pe->long_desc;
|
|
|
|
topic = (char *)pe->topic;
|
|
|
|
unit = (char *)pe->unit;
|
|
|
|
perpkg = (char *)pe->perpkg;
|
|
|
|
metric_expr = (char *)pe->metric_expr;
|
|
|
|
metric_name = (char *)pe->metric_name;
|
|
|
|
deprecated = (char *)pe->deprecated;
|
2021-04-27 15:01:17 +08:00
|
|
|
pmu_name = (char *)pe->pmu;
|
2021-04-27 15:01:16 +08:00
|
|
|
}
|
2012-06-15 14:31:41 +08:00
|
|
|
|
|
|
|
alias = malloc(sizeof(*alias));
|
|
|
|
if (!alias)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
INIT_LIST_HEAD(&alias->terms);
|
2013-11-13 00:58:49 +08:00
|
|
|
alias->scale = 1.0;
|
|
|
|
alias->unit[0] = '\0';
|
2014-11-21 17:31:12 +08:00
|
|
|
alias->per_pkg = false;
|
2016-01-07 02:50:01 +08:00
|
|
|
alias->snapshot = false;
|
2019-10-15 10:53:57 +08:00
|
|
|
alias->deprecated = false;
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2015-06-10 15:25:08 +08:00
|
|
|
ret = parse_events_terms(&alias->terms, val);
|
2012-06-15 14:31:41 +08:00
|
|
|
if (ret) {
|
2015-06-10 15:25:08 +08:00
|
|
|
pr_err("Cannot parse alias %s: %d\n", val, ret);
|
2012-06-15 14:31:41 +08:00
|
|
|
free(alias);
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2018-06-15 18:11:04 +08:00
|
|
|
/* Scan event and remove leading zeroes, spaces, newlines, some
|
|
|
|
* platforms have terms specified as
|
|
|
|
* event=0x0091 (read from files ../<PMU>/events/<FILE>
|
|
|
|
* and terms specified as event=0x91 (read from JSON files).
|
|
|
|
*
|
|
|
|
* Rebuild string to make alias->str member comparable.
|
|
|
|
*/
|
|
|
|
memset(newval, 0, sizeof(newval));
|
|
|
|
ret = 0;
|
|
|
|
list_for_each_entry(term, &alias->terms, list) {
|
|
|
|
if (ret)
|
|
|
|
ret += scnprintf(newval + ret, sizeof(newval) - ret,
|
|
|
|
",");
|
|
|
|
if (term->type_val == PARSE_EVENTS__TERM_TYPE_NUM)
|
|
|
|
ret += scnprintf(newval + ret, sizeof(newval) - ret,
|
|
|
|
"%s=%#x", term->config, term->val.num);
|
|
|
|
else if (term->type_val == PARSE_EVENTS__TERM_TYPE_STR)
|
|
|
|
ret += scnprintf(newval + ret, sizeof(newval) - ret,
|
|
|
|
"%s=%s", term->config, term->val.str);
|
|
|
|
}
|
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
alias->name = strdup(name);
|
2015-06-10 15:25:08 +08:00
|
|
|
if (dir) {
|
|
|
|
/*
|
|
|
|
* load unit name and scale if available
|
|
|
|
*/
|
|
|
|
perf_pmu__parse_unit(alias, dir, name);
|
|
|
|
perf_pmu__parse_scale(alias, dir, name);
|
|
|
|
perf_pmu__parse_per_pkg(alias, dir, name);
|
|
|
|
perf_pmu__parse_snapshot(alias, dir, name);
|
|
|
|
}
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2017-03-21 04:17:07 +08:00
|
|
|
alias->metric_expr = metric_expr ? strdup(metric_expr) : NULL;
|
2017-03-21 04:17:10 +08:00
|
|
|
alias->metric_name = metric_name ? strdup(metric_name): NULL;
|
2016-09-16 06:24:43 +08:00
|
|
|
alias->desc = desc ? strdup(desc) : NULL;
|
2016-09-16 06:24:48 +08:00
|
|
|
alias->long_desc = long_desc ? strdup(long_desc) :
|
|
|
|
desc ? strdup(desc) : NULL;
|
2016-09-16 06:24:50 +08:00
|
|
|
alias->topic = topic ? strdup(topic) : NULL;
|
2017-01-28 10:03:37 +08:00
|
|
|
if (unit) {
|
2019-08-28 13:59:29 +08:00
|
|
|
if (perf_pmu__convert_scale(unit, &unit, &alias->scale) < 0)
|
2017-01-28 10:03:37 +08:00
|
|
|
return -1;
|
|
|
|
snprintf(alias->unit, sizeof(alias->unit), "%s", unit);
|
|
|
|
}
|
|
|
|
alias->per_pkg = perpkg && sscanf(perpkg, "%d", &num) == 1 && num == 1;
|
2018-06-15 18:11:04 +08:00
|
|
|
alias->str = strdup(newval);
|
2021-04-27 15:01:17 +08:00
|
|
|
alias->pmu_name = pmu_name ? strdup(pmu_name) : NULL;
|
2017-01-28 10:03:40 +08:00
|
|
|
|
2019-10-15 10:53:57 +08:00
|
|
|
if (deprecated)
|
|
|
|
alias->deprecated = true;
|
|
|
|
|
2018-06-15 18:11:05 +08:00
|
|
|
if (!perf_pmu_merge_alias(alias, list))
|
|
|
|
list_add_tail(&alias->list, list);
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2015-06-10 15:25:08 +08:00
|
|
|
static int perf_pmu__new_alias(struct list_head *list, char *dir, char *name, FILE *file)
|
|
|
|
{
|
|
|
|
char buf[256];
|
|
|
|
int ret;
|
|
|
|
|
|
|
|
ret = fread(buf, 1, sizeof(buf), file);
|
|
|
|
if (ret == 0)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
buf[ret] = 0;
|
|
|
|
|
2018-06-15 18:11:03 +08:00
|
|
|
/* Remove trailing newline from sysfs file */
|
2019-06-26 23:13:13 +08:00
|
|
|
strim(buf);
|
2018-06-15 18:11:03 +08:00
|
|
|
|
2021-04-27 15:01:16 +08:00
|
|
|
return __perf_pmu__new_alias(list, dir, name, NULL, buf, NULL);
|
2015-06-10 15:25:08 +08:00
|
|
|
}
|
|
|
|
|
2014-09-24 22:04:06 +08:00
|
|
|
static inline bool pmu_alias_info_file(char *name)
|
|
|
|
{
|
|
|
|
size_t len;
|
|
|
|
|
|
|
|
len = strlen(name);
|
|
|
|
if (len > 5 && !strcmp(name + len - 5, ".unit"))
|
|
|
|
return true;
|
|
|
|
if (len > 6 && !strcmp(name + len - 6, ".scale"))
|
|
|
|
return true;
|
2014-11-21 17:31:12 +08:00
|
|
|
if (len > 8 && !strcmp(name + len - 8, ".per-pkg"))
|
|
|
|
return true;
|
2014-11-21 17:31:13 +08:00
|
|
|
if (len > 9 && !strcmp(name + len - 9, ".snapshot"))
|
|
|
|
return true;
|
2014-09-24 22:04:06 +08:00
|
|
|
|
|
|
|
return false;
|
|
|
|
}
|
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
/*
|
|
|
|
* Process all the sysfs attributes located under the directory
|
|
|
|
* specified in 'dir' parameter.
|
|
|
|
*/
|
|
|
|
static int pmu_aliases_parse(char *dir, struct list_head *head)
|
|
|
|
{
|
|
|
|
struct dirent *evt_ent;
|
|
|
|
DIR *event_dir;
|
|
|
|
|
|
|
|
event_dir = opendir(dir);
|
|
|
|
if (!event_dir)
|
|
|
|
return -EINVAL;
|
|
|
|
|
2016-02-18 06:44:55 +08:00
|
|
|
while ((evt_ent = readdir(event_dir))) {
|
2012-06-15 14:31:41 +08:00
|
|
|
char path[PATH_MAX];
|
|
|
|
char *name = evt_ent->d_name;
|
|
|
|
FILE *file;
|
|
|
|
|
|
|
|
if (!strcmp(name, ".") || !strcmp(name, ".."))
|
|
|
|
continue;
|
|
|
|
|
2013-11-13 00:58:49 +08:00
|
|
|
/*
|
2014-09-24 22:04:06 +08:00
|
|
|
* skip info files parsed in perf_pmu__new_alias()
|
2013-11-13 00:58:49 +08:00
|
|
|
*/
|
2014-09-24 22:04:06 +08:00
|
|
|
if (pmu_alias_info_file(name))
|
2013-11-13 00:58:49 +08:00
|
|
|
continue;
|
|
|
|
|
2018-03-19 16:29:01 +08:00
|
|
|
scnprintf(path, PATH_MAX, "%s/%s", dir, name);
|
2012-06-15 14:31:41 +08:00
|
|
|
|
|
|
|
file = fopen(path, "r");
|
2016-02-18 06:44:55 +08:00
|
|
|
if (!file) {
|
|
|
|
pr_debug("Cannot open %s\n", path);
|
|
|
|
continue;
|
|
|
|
}
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2016-02-18 06:44:55 +08:00
|
|
|
if (perf_pmu__new_alias(head, dir, name, file) < 0)
|
|
|
|
pr_debug("Cannot set up %s\n", name);
|
2012-06-15 14:31:41 +08:00
|
|
|
fclose(file);
|
|
|
|
}
|
|
|
|
|
|
|
|
closedir(event_dir);
|
2016-02-18 06:44:55 +08:00
|
|
|
return 0;
|
2012-06-15 14:31:41 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Reading the pmu event aliases definition, which should be located at:
|
|
|
|
* /sys/bus/event_source/devices/<dev>/events as sysfs group attributes.
|
|
|
|
*/
|
2013-07-04 21:20:25 +08:00
|
|
|
static int pmu_aliases(const char *name, struct list_head *head)
|
2012-06-15 14:31:41 +08:00
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
2013-11-06 01:48:50 +08:00
|
|
|
const char *sysfs = sysfs__mountpoint();
|
2012-06-15 14:31:41 +08:00
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX,
|
|
|
|
"%s/bus/event_source/devices/%s/events", sysfs, name);
|
|
|
|
|
2019-11-21 08:15:11 +08:00
|
|
|
if (!file_available(path))
|
|
|
|
return 0;
|
2012-06-15 14:31:41 +08:00
|
|
|
|
|
|
|
if (pmu_aliases_parse(path, head))
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2013-01-19 03:54:00 +08:00
|
|
|
static int pmu_alias_terms(struct perf_pmu_alias *alias,
|
2012-06-15 14:31:41 +08:00
|
|
|
struct list_head *terms)
|
|
|
|
{
|
2014-04-17 02:49:02 +08:00
|
|
|
struct parse_events_term *term, *cloned;
|
2012-06-15 14:31:41 +08:00
|
|
|
LIST_HEAD(list);
|
|
|
|
int ret;
|
|
|
|
|
|
|
|
list_for_each_entry(term, &alias->terms, list) {
|
2014-04-17 02:49:02 +08:00
|
|
|
ret = parse_events_term__clone(&cloned, term);
|
2012-06-15 14:31:41 +08:00
|
|
|
if (ret) {
|
2016-02-13 03:48:00 +08:00
|
|
|
parse_events_terms__purge(&list);
|
2012-06-15 14:31:41 +08:00
|
|
|
return ret;
|
|
|
|
}
|
2017-10-21 04:27:55 +08:00
|
|
|
/*
|
|
|
|
* Weak terms don't override command line options,
|
|
|
|
* which we don't want for implicit terms in aliases.
|
|
|
|
*/
|
|
|
|
cloned->weak = true;
|
2014-04-17 02:49:02 +08:00
|
|
|
list_add_tail(&cloned->list, &list);
|
2012-06-15 14:31:41 +08:00
|
|
|
}
|
|
|
|
list_splice(&list, terms);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
/*
|
|
|
|
* Reading/parsing the default pmu type value, which should be
|
|
|
|
* located at:
|
|
|
|
* /sys/bus/event_source/devices/<dev>/type as sysfs attribute.
|
|
|
|
*/
|
2013-07-04 21:20:25 +08:00
|
|
|
static int pmu_type(const char *name, __u32 *type)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
FILE *file;
|
|
|
|
int ret = 0;
|
2013-11-06 01:48:50 +08:00
|
|
|
const char *sysfs = sysfs__mountpoint();
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX,
|
2012-08-17 03:10:24 +08:00
|
|
|
"%s" EVENT_SOURCE_DEVICE_PATH "%s/type", sysfs, name);
|
2012-03-16 03:09:17 +08:00
|
|
|
|
2019-11-21 08:15:11 +08:00
|
|
|
if (access(path, R_OK) < 0)
|
2012-03-16 03:09:17 +08:00
|
|
|
return -1;
|
|
|
|
|
|
|
|
file = fopen(path, "r");
|
|
|
|
if (!file)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
if (1 != fscanf(file, "%u", type))
|
|
|
|
ret = -1;
|
|
|
|
|
|
|
|
fclose(file);
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2012-08-17 03:10:24 +08:00
|
|
|
/* Add all pmus in sysfs to pmu list: */
|
|
|
|
static void pmu_read_sysfs(void)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
DIR *dir;
|
|
|
|
struct dirent *dent;
|
2013-11-06 01:48:50 +08:00
|
|
|
const char *sysfs = sysfs__mountpoint();
|
2012-08-17 03:10:24 +08:00
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX,
|
|
|
|
"%s" EVENT_SOURCE_DEVICE_PATH, sysfs);
|
|
|
|
|
|
|
|
dir = opendir(path);
|
|
|
|
if (!dir)
|
|
|
|
return;
|
|
|
|
|
|
|
|
while ((dent = readdir(dir))) {
|
|
|
|
if (!strcmp(dent->d_name, ".") || !strcmp(dent->d_name, ".."))
|
|
|
|
continue;
|
|
|
|
/* add to static LIST_HEAD(pmus): */
|
|
|
|
perf_pmu__find(dent->d_name);
|
|
|
|
}
|
|
|
|
|
|
|
|
closedir(dir);
|
|
|
|
}
|
|
|
|
|
2019-07-21 19:23:49 +08:00
|
|
|
static struct perf_cpu_map *__pmu_cpumask(const char *path)
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
{
|
|
|
|
FILE *file;
|
2019-07-21 19:23:49 +08:00
|
|
|
struct perf_cpu_map *cpus;
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
|
|
|
|
file = fopen(path, "r");
|
|
|
|
if (!file)
|
|
|
|
return NULL;
|
|
|
|
|
2019-07-21 19:24:30 +08:00
|
|
|
cpus = perf_cpu_map__read(file);
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
fclose(file);
|
|
|
|
return cpus;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Uncore PMUs have a "cpumask" file under sysfs. CPU PMUs (e.g. on arm/arm64)
|
|
|
|
* may have a "cpus" file.
|
|
|
|
*/
|
2020-12-04 19:10:09 +08:00
|
|
|
#define SYS_TEMPLATE_ID "./bus/event_source/devices/%s/identifier"
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
#define CPUS_TEMPLATE_UNCORE "%s/bus/event_source/devices/%s/cpumask"
|
|
|
|
|
2019-07-21 19:23:49 +08:00
|
|
|
static struct perf_cpu_map *pmu_cpumask(const char *name)
|
2012-09-10 15:53:50 +08:00
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
2019-07-21 19:23:49 +08:00
|
|
|
struct perf_cpu_map *cpus;
|
2013-11-06 01:48:50 +08:00
|
|
|
const char *sysfs = sysfs__mountpoint();
|
perf pmu: Support alternative sysfs cpumask
The perf tools can read a cpumask file for a PMU, describing a subset of
CPUs which that PMU covers. So far this has only been used to cater for
uncore PMUs, which in practice happen to only have a single CPU
described in the mask.
Until recently, the perf tools only correctly handled cpumask containing
a single CPU, and only when monitoring in system-wide mode. For example,
prior to commit 00e727bb389359c8 ("perf stat: Balance opening and
reading events"), a mask with more than a single CPU could cause perf
stat to hang. When a CPU PMU covers a subset of CPUs, but lacks a
cpumask, perf record will fail to open events (on the cores the PMU does
not support), and gives up.
For systems with heterogeneous CPUs such as ARM big.LITTLE systems, this
presents a problem. We have a PMU for each microarchitecture (e.g. a big
PMU and a little PMU), and would like to expose a cpumask for each (so
as to allow perf record and other tools to do the right thing). However,
doing so kernel-side will cause old perf binaries to not function (e.g.
hitting the issue solved by 00e727bb389359c8), and thus commits the
cardinal sin of breaking (existing) userspace.
To address this chicken-and-egg problem, this patch adds support got a
new file, cpus, which is largely identical to the existing cpumask file.
A kernel can expose this file, knowing that new perf binaries will
correctly support it, while old perf binaries will not look for it (and
thus will not be broken).
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Will Deacon <will.deacon@arm.com>
Link: http://lkml.kernel.org/r/1473330112-28528-8-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-08 18:21:52 +08:00
|
|
|
const char *templates[] = {
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
CPUS_TEMPLATE_UNCORE,
|
|
|
|
CPUS_TEMPLATE_CPU,
|
|
|
|
NULL
|
perf pmu: Support alternative sysfs cpumask
The perf tools can read a cpumask file for a PMU, describing a subset of
CPUs which that PMU covers. So far this has only been used to cater for
uncore PMUs, which in practice happen to only have a single CPU
described in the mask.
Until recently, the perf tools only correctly handled cpumask containing
a single CPU, and only when monitoring in system-wide mode. For example,
prior to commit 00e727bb389359c8 ("perf stat: Balance opening and
reading events"), a mask with more than a single CPU could cause perf
stat to hang. When a CPU PMU covers a subset of CPUs, but lacks a
cpumask, perf record will fail to open events (on the cores the PMU does
not support), and gives up.
For systems with heterogeneous CPUs such as ARM big.LITTLE systems, this
presents a problem. We have a PMU for each microarchitecture (e.g. a big
PMU and a little PMU), and would like to expose a cpumask for each (so
as to allow perf record and other tools to do the right thing). However,
doing so kernel-side will cause old perf binaries to not function (e.g.
hitting the issue solved by 00e727bb389359c8), and thus commits the
cardinal sin of breaking (existing) userspace.
To address this chicken-and-egg problem, this patch adds support got a
new file, cpus, which is largely identical to the existing cpumask file.
A kernel can expose this file, knowing that new perf binaries will
correctly support it, while old perf binaries will not look for it (and
thus will not be broken).
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Will Deacon <will.deacon@arm.com>
Link: http://lkml.kernel.org/r/1473330112-28528-8-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-08 18:21:52 +08:00
|
|
|
};
|
|
|
|
const char **template;
|
2012-09-10 15:53:50 +08:00
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return NULL;
|
|
|
|
|
perf pmu: Support alternative sysfs cpumask
The perf tools can read a cpumask file for a PMU, describing a subset of
CPUs which that PMU covers. So far this has only been used to cater for
uncore PMUs, which in practice happen to only have a single CPU
described in the mask.
Until recently, the perf tools only correctly handled cpumask containing
a single CPU, and only when monitoring in system-wide mode. For example,
prior to commit 00e727bb389359c8 ("perf stat: Balance opening and
reading events"), a mask with more than a single CPU could cause perf
stat to hang. When a CPU PMU covers a subset of CPUs, but lacks a
cpumask, perf record will fail to open events (on the cores the PMU does
not support), and gives up.
For systems with heterogeneous CPUs such as ARM big.LITTLE systems, this
presents a problem. We have a PMU for each microarchitecture (e.g. a big
PMU and a little PMU), and would like to expose a cpumask for each (so
as to allow perf record and other tools to do the right thing). However,
doing so kernel-side will cause old perf binaries to not function (e.g.
hitting the issue solved by 00e727bb389359c8), and thus commits the
cardinal sin of breaking (existing) userspace.
To address this chicken-and-egg problem, this patch adds support got a
new file, cpus, which is largely identical to the existing cpumask file.
A kernel can expose this file, knowing that new perf binaries will
correctly support it, while old perf binaries will not look for it (and
thus will not be broken).
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Will Deacon <will.deacon@arm.com>
Link: http://lkml.kernel.org/r/1473330112-28528-8-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-08 18:21:52 +08:00
|
|
|
for (template = templates; *template; template++) {
|
|
|
|
snprintf(path, PATH_MAX, *template, sysfs, name);
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
cpus = __pmu_cpumask(path);
|
|
|
|
if (cpus)
|
|
|
|
return cpus;
|
perf pmu: Support alternative sysfs cpumask
The perf tools can read a cpumask file for a PMU, describing a subset of
CPUs which that PMU covers. So far this has only been used to cater for
uncore PMUs, which in practice happen to only have a single CPU
described in the mask.
Until recently, the perf tools only correctly handled cpumask containing
a single CPU, and only when monitoring in system-wide mode. For example,
prior to commit 00e727bb389359c8 ("perf stat: Balance opening and
reading events"), a mask with more than a single CPU could cause perf
stat to hang. When a CPU PMU covers a subset of CPUs, but lacks a
cpumask, perf record will fail to open events (on the cores the PMU does
not support), and gives up.
For systems with heterogeneous CPUs such as ARM big.LITTLE systems, this
presents a problem. We have a PMU for each microarchitecture (e.g. a big
PMU and a little PMU), and would like to expose a cpumask for each (so
as to allow perf record and other tools to do the right thing). However,
doing so kernel-side will cause old perf binaries to not function (e.g.
hitting the issue solved by 00e727bb389359c8), and thus commits the
cardinal sin of breaking (existing) userspace.
To address this chicken-and-egg problem, this patch adds support got a
new file, cpus, which is largely identical to the existing cpumask file.
A kernel can expose this file, knowing that new perf binaries will
correctly support it, while old perf binaries will not look for it (and
thus will not be broken).
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Will Deacon <will.deacon@arm.com>
Link: http://lkml.kernel.org/r/1473330112-28528-8-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-08 18:21:52 +08:00
|
|
|
}
|
2012-09-10 15:53:50 +08:00
|
|
|
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
return NULL;
|
|
|
|
}
|
2012-09-10 15:53:50 +08:00
|
|
|
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
static bool pmu_is_uncore(const char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
2019-11-21 08:15:11 +08:00
|
|
|
const char *sysfs;
|
2012-09-10 15:53:50 +08:00
|
|
|
|
2021-04-27 15:01:18 +08:00
|
|
|
if (perf_pmu__hybrid_mounted(name))
|
|
|
|
return false;
|
|
|
|
|
2019-11-21 08:15:11 +08:00
|
|
|
sysfs = sysfs__mountpoint();
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
snprintf(path, PATH_MAX, CPUS_TEMPLATE_UNCORE, sysfs, name);
|
2019-11-21 08:15:11 +08:00
|
|
|
return file_available(path);
|
2012-09-10 15:53:50 +08:00
|
|
|
}
|
|
|
|
|
2020-12-04 19:10:09 +08:00
|
|
|
static char *pmu_id(const char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX], *str;
|
|
|
|
size_t len;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX, SYS_TEMPLATE_ID, name);
|
|
|
|
|
|
|
|
if (sysfs__read_str(path, &str, &len) < 0)
|
|
|
|
return NULL;
|
|
|
|
|
|
|
|
str[len - 1] = 0; /* remove line feed */
|
|
|
|
|
|
|
|
return str;
|
|
|
|
}
|
|
|
|
|
2017-08-24 19:00:58 +08:00
|
|
|
/*
|
|
|
|
* PMU CORE devices have different name other than cpu in sysfs on some
|
2018-04-25 02:20:10 +08:00
|
|
|
* platforms.
|
|
|
|
* Looking for possible sysfs files to identify the arm core device.
|
2017-08-24 19:00:58 +08:00
|
|
|
*/
|
2018-04-25 02:20:10 +08:00
|
|
|
static int is_arm_pmu_core(const char *name)
|
2017-08-24 19:00:58 +08:00
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
const char *sysfs = sysfs__mountpoint();
|
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
/* Look for cpu sysfs (specific to arm) */
|
|
|
|
scnprintf(path, PATH_MAX, "%s/bus/event_source/devices/%s/cpus",
|
|
|
|
sysfs, name);
|
2019-11-21 08:15:11 +08:00
|
|
|
return file_available(path);
|
2017-08-24 19:00:58 +08:00
|
|
|
}
|
|
|
|
|
2017-10-17 02:32:18 +08:00
|
|
|
static char *perf_pmu__getcpuid(struct perf_pmu *pmu)
|
2016-09-16 06:24:40 +08:00
|
|
|
{
|
|
|
|
char *cpuid;
|
2016-10-14 05:15:24 +08:00
|
|
|
static bool printed;
|
2016-09-16 06:24:40 +08:00
|
|
|
|
2016-09-16 06:24:46 +08:00
|
|
|
cpuid = getenv("PERF_CPUID");
|
|
|
|
if (cpuid)
|
|
|
|
cpuid = strdup(cpuid);
|
|
|
|
if (!cpuid)
|
2017-10-17 02:32:18 +08:00
|
|
|
cpuid = get_cpuid_str(pmu);
|
2016-09-16 06:24:40 +08:00
|
|
|
if (!cpuid)
|
2017-09-01 03:40:30 +08:00
|
|
|
return NULL;
|
2016-09-16 06:24:40 +08:00
|
|
|
|
2016-10-14 05:15:24 +08:00
|
|
|
if (!printed) {
|
|
|
|
pr_debug("Using CPUID %s\n", cpuid);
|
|
|
|
printed = true;
|
|
|
|
}
|
2017-09-01 03:40:30 +08:00
|
|
|
return cpuid;
|
|
|
|
}
|
|
|
|
|
2021-10-16 01:21:13 +08:00
|
|
|
const struct pmu_events_map *perf_pmu__find_map(struct perf_pmu *pmu)
|
2017-09-01 03:40:30 +08:00
|
|
|
{
|
2021-10-16 01:21:13 +08:00
|
|
|
const struct pmu_events_map *map;
|
2017-10-17 02:32:18 +08:00
|
|
|
char *cpuid = perf_pmu__getcpuid(pmu);
|
2017-09-01 03:40:30 +08:00
|
|
|
int i;
|
2016-09-16 06:24:46 +08:00
|
|
|
|
2017-10-17 02:32:22 +08:00
|
|
|
/* on some platforms which uses cpus map, cpuid can be NULL for
|
|
|
|
* PMUs other than CORE PMUs.
|
|
|
|
*/
|
|
|
|
if (!cpuid)
|
|
|
|
return NULL;
|
|
|
|
|
2016-09-16 06:24:40 +08:00
|
|
|
i = 0;
|
2017-09-01 03:40:30 +08:00
|
|
|
for (;;) {
|
2016-09-16 06:24:40 +08:00
|
|
|
map = &pmu_events_map[i++];
|
2017-09-01 03:40:30 +08:00
|
|
|
if (!map->table) {
|
|
|
|
map = NULL;
|
|
|
|
break;
|
|
|
|
}
|
2016-09-16 06:24:40 +08:00
|
|
|
|
2018-02-13 23:14:18 +08:00
|
|
|
if (!strcmp_cpuid_str(map->cpuid, cpuid))
|
2016-09-16 06:24:40 +08:00
|
|
|
break;
|
|
|
|
}
|
2017-09-01 03:40:30 +08:00
|
|
|
free(cpuid);
|
|
|
|
return map;
|
|
|
|
}
|
|
|
|
|
2021-10-16 01:21:13 +08:00
|
|
|
const struct pmu_events_map *__weak pmu_events_map__find(void)
|
2021-04-07 18:32:47 +08:00
|
|
|
{
|
|
|
|
return perf_pmu__find_map(NULL);
|
|
|
|
}
|
|
|
|
|
2021-07-20 23:10:19 +08:00
|
|
|
/*
|
|
|
|
* Suffix must be in form tok_{digits}, or tok{digits}, or same as pmu_name
|
|
|
|
* to be valid.
|
|
|
|
*/
|
|
|
|
static bool perf_pmu__valid_suffix(const char *pmu_name, char *tok)
|
2021-07-01 14:42:53 +08:00
|
|
|
{
|
2021-07-20 23:10:19 +08:00
|
|
|
const char *p;
|
2021-07-01 14:42:53 +08:00
|
|
|
|
|
|
|
if (strncmp(pmu_name, tok, strlen(tok)))
|
|
|
|
return false;
|
|
|
|
|
|
|
|
p = pmu_name + strlen(tok);
|
|
|
|
if (*p == 0)
|
|
|
|
return true;
|
|
|
|
|
2021-07-20 23:10:19 +08:00
|
|
|
if (*p == '_')
|
|
|
|
++p;
|
2021-07-01 14:42:53 +08:00
|
|
|
|
2021-07-20 23:10:19 +08:00
|
|
|
/* Ensure we end in a number */
|
|
|
|
while (1) {
|
|
|
|
if (!isdigit(*p))
|
|
|
|
return false;
|
|
|
|
if (*(++p) == 0)
|
|
|
|
break;
|
|
|
|
}
|
2021-07-01 14:42:53 +08:00
|
|
|
|
|
|
|
return true;
|
|
|
|
}
|
|
|
|
|
2020-03-17 19:02:18 +08:00
|
|
|
bool pmu_uncore_alias_match(const char *pmu_name, const char *name)
|
2019-06-28 22:35:49 +08:00
|
|
|
{
|
|
|
|
char *tmp = NULL, *tok, *str;
|
|
|
|
bool res;
|
|
|
|
|
|
|
|
str = strdup(pmu_name);
|
|
|
|
if (!str)
|
|
|
|
return false;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* uncore alias may be from different PMU with common prefix
|
|
|
|
*/
|
|
|
|
tok = strtok_r(str, ",", &tmp);
|
|
|
|
if (strncmp(pmu_name, tok, strlen(tok))) {
|
|
|
|
res = false;
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Match more complex aliases where the alias name is a comma-delimited
|
|
|
|
* list of tokens, orderly contained in the matching PMU name.
|
|
|
|
*
|
|
|
|
* Example: For alias "socket,pmuname" and PMU "socketX_pmunameY", we
|
|
|
|
* match "socket" in "socketX_pmunameY" and then "pmuname" in
|
|
|
|
* "pmunameY".
|
|
|
|
*/
|
2021-07-20 23:10:19 +08:00
|
|
|
while (1) {
|
|
|
|
char *next_tok = strtok_r(NULL, ",", &tmp);
|
|
|
|
|
2019-06-28 22:35:49 +08:00
|
|
|
name = strstr(name, tok);
|
2021-07-20 23:10:19 +08:00
|
|
|
if (!name ||
|
|
|
|
(!next_tok && !perf_pmu__valid_suffix(name, tok))) {
|
2019-06-28 22:35:49 +08:00
|
|
|
res = false;
|
|
|
|
goto out;
|
|
|
|
}
|
2021-07-20 23:10:19 +08:00
|
|
|
if (!next_tok)
|
|
|
|
break;
|
|
|
|
tok = next_tok;
|
|
|
|
name += strlen(tok);
|
2019-06-28 22:35:49 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
res = true;
|
|
|
|
out:
|
|
|
|
free(str);
|
|
|
|
return res;
|
|
|
|
}
|
|
|
|
|
2017-09-01 03:40:30 +08:00
|
|
|
/*
|
|
|
|
* From the pmu_events_map, find the table of PMU events that corresponds
|
|
|
|
* to the current running CPU. Then, add all PMU events from that table
|
|
|
|
* as aliases.
|
|
|
|
*/
|
2020-03-17 19:02:15 +08:00
|
|
|
void pmu_add_cpu_aliases_map(struct list_head *head, struct perf_pmu *pmu,
|
2021-10-16 01:21:13 +08:00
|
|
|
const struct pmu_events_map *map)
|
2017-09-01 03:40:30 +08:00
|
|
|
{
|
|
|
|
int i;
|
2017-10-17 02:32:18 +08:00
|
|
|
const char *name = pmu->name;
|
2016-09-16 06:24:40 +08:00
|
|
|
/*
|
|
|
|
* Found a matching PMU events table. Create aliases
|
|
|
|
*/
|
|
|
|
i = 0;
|
|
|
|
while (1) {
|
2019-06-14 22:07:59 +08:00
|
|
|
const char *cpu_name = is_arm_pmu_core(name) ? name : "cpu";
|
|
|
|
struct pmu_event *pe = &map->table[i++];
|
|
|
|
const char *pname = pe->pmu ? pe->pmu : cpu_name;
|
2017-01-28 10:03:37 +08:00
|
|
|
|
2017-09-01 03:40:31 +08:00
|
|
|
if (!pe->name) {
|
|
|
|
if (pe->metric_group || pe->metric_name)
|
|
|
|
continue;
|
2016-09-16 06:24:40 +08:00
|
|
|
break;
|
2017-09-01 03:40:31 +08:00
|
|
|
}
|
2016-09-16 06:24:40 +08:00
|
|
|
|
2021-07-29 21:56:21 +08:00
|
|
|
if (pmu->is_uncore && pmu_uncore_alias_match(pname, name))
|
2019-06-14 22:07:59 +08:00
|
|
|
goto new_alias;
|
perf pmu: Fix parser error for uncore event alias
Perf fails to parse uncore event alias, for example:
# perf stat -e unc_m_clockticks -a --no-merge sleep 1
event syntax error: 'unc_m_clockticks'
\___ parser error
Current code assumes that the event alias is from one specific PMU.
To find the PMU, perf strcmps the PMU name of event alias with the real
PMU name on the system.
However, the uncore event alias may be from multiple PMUs with common
prefix. The PMU name of uncore event alias is the common prefix.
For example, UNC_M_CLOCKTICKS is clock event for iMC, which include 6
PMUs with the same prefix "uncore_imc" on a skylake server.
The real PMU names on the system for iMC are uncore_imc_0 ...
uncore_imc_5.
The strncmp is used to only check the common prefix for uncore event
alias.
With the patch:
# perf stat -e unc_m_clockticks -a --no-merge sleep 1
Performance counter stats for 'system wide':
723,594,722 unc_m_clockticks [uncore_imc_5]
724,001,954 unc_m_clockticks [uncore_imc_3]
724,042,655 unc_m_clockticks [uncore_imc_1]
724,161,001 unc_m_clockticks [uncore_imc_4]
724,293,713 unc_m_clockticks [uncore_imc_2]
724,340,901 unc_m_clockticks [uncore_imc_0]
1.002090060 seconds time elapsed
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Thomas Richter <tmricht@linux.ibm.com>
Cc: stable@vger.kernel.org
Fixes: ea1fa48c055f ("perf stat: Handle different PMU names with common prefix")
Link: http://lkml.kernel.org/r/1552672814-156173-1-git-send-email-kan.liang@linux.intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-03-16 02:00:14 +08:00
|
|
|
|
2019-06-14 22:07:59 +08:00
|
|
|
if (strcmp(pname, name))
|
|
|
|
continue;
|
2017-01-28 10:03:37 +08:00
|
|
|
|
perf pmu: Fix parser error for uncore event alias
Perf fails to parse uncore event alias, for example:
# perf stat -e unc_m_clockticks -a --no-merge sleep 1
event syntax error: 'unc_m_clockticks'
\___ parser error
Current code assumes that the event alias is from one specific PMU.
To find the PMU, perf strcmps the PMU name of event alias with the real
PMU name on the system.
However, the uncore event alias may be from multiple PMUs with common
prefix. The PMU name of uncore event alias is the common prefix.
For example, UNC_M_CLOCKTICKS is clock event for iMC, which include 6
PMUs with the same prefix "uncore_imc" on a skylake server.
The real PMU names on the system for iMC are uncore_imc_0 ...
uncore_imc_5.
The strncmp is used to only check the common prefix for uncore event
alias.
With the patch:
# perf stat -e unc_m_clockticks -a --no-merge sleep 1
Performance counter stats for 'system wide':
723,594,722 unc_m_clockticks [uncore_imc_5]
724,001,954 unc_m_clockticks [uncore_imc_3]
724,042,655 unc_m_clockticks [uncore_imc_1]
724,161,001 unc_m_clockticks [uncore_imc_4]
724,293,713 unc_m_clockticks [uncore_imc_2]
724,340,901 unc_m_clockticks [uncore_imc_0]
1.002090060 seconds time elapsed
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Thomas Richter <tmricht@linux.ibm.com>
Cc: stable@vger.kernel.org
Fixes: ea1fa48c055f ("perf stat: Handle different PMU names with common prefix")
Link: http://lkml.kernel.org/r/1552672814-156173-1-git-send-email-kan.liang@linux.intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-03-16 02:00:14 +08:00
|
|
|
new_alias:
|
2016-09-16 06:24:40 +08:00
|
|
|
/* need type casts to override 'const' */
|
|
|
|
__perf_pmu__new_alias(head, NULL, (char *)pe->name,
|
2016-09-16 06:24:48 +08:00
|
|
|
(char *)pe->desc, (char *)pe->event,
|
2021-04-27 15:01:16 +08:00
|
|
|
pe);
|
2016-09-16 06:24:40 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2020-03-17 19:02:15 +08:00
|
|
|
static void pmu_add_cpu_aliases(struct list_head *head, struct perf_pmu *pmu)
|
|
|
|
{
|
2021-10-16 01:21:13 +08:00
|
|
|
const struct pmu_events_map *map;
|
2020-03-17 19:02:15 +08:00
|
|
|
|
|
|
|
map = perf_pmu__find_map(pmu);
|
|
|
|
if (!map)
|
|
|
|
return;
|
|
|
|
|
|
|
|
pmu_add_cpu_aliases_map(head, pmu, map);
|
|
|
|
}
|
|
|
|
|
2020-12-04 19:10:10 +08:00
|
|
|
void pmu_for_each_sys_event(pmu_sys_event_iter_fn fn, void *data)
|
|
|
|
{
|
|
|
|
int i = 0;
|
|
|
|
|
|
|
|
while (1) {
|
2021-10-16 01:21:14 +08:00
|
|
|
const struct pmu_sys_events *event_table;
|
2020-12-04 19:10:10 +08:00
|
|
|
int j = 0;
|
|
|
|
|
|
|
|
event_table = &pmu_sys_event_tables[i++];
|
|
|
|
|
|
|
|
if (!event_table->table)
|
|
|
|
break;
|
|
|
|
|
|
|
|
while (1) {
|
|
|
|
struct pmu_event *pe = &event_table->table[j++];
|
|
|
|
int ret;
|
|
|
|
|
|
|
|
if (!pe->name && !pe->metric_group && !pe->metric_name)
|
|
|
|
break;
|
|
|
|
|
|
|
|
ret = fn(pe, data);
|
|
|
|
if (ret)
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
struct pmu_sys_event_iter_data {
|
|
|
|
struct list_head *head;
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
};
|
|
|
|
|
|
|
|
static int pmu_add_sys_aliases_iter_fn(struct pmu_event *pe, void *data)
|
|
|
|
{
|
|
|
|
struct pmu_sys_event_iter_data *idata = data;
|
|
|
|
struct perf_pmu *pmu = idata->pmu;
|
|
|
|
|
|
|
|
if (!pe->name) {
|
|
|
|
if (pe->metric_group || pe->metric_name)
|
|
|
|
return 0;
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (!pe->compat || !pe->pmu)
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
if (!strcmp(pmu->id, pe->compat) &&
|
|
|
|
pmu_uncore_alias_match(pe->pmu, pmu->name)) {
|
|
|
|
__perf_pmu__new_alias(idata->head, NULL,
|
|
|
|
(char *)pe->name,
|
|
|
|
(char *)pe->desc,
|
|
|
|
(char *)pe->event,
|
2021-04-27 15:01:16 +08:00
|
|
|
pe);
|
2020-12-04 19:10:10 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2021-07-29 21:56:24 +08:00
|
|
|
void pmu_add_sys_aliases(struct list_head *head, struct perf_pmu *pmu)
|
2020-12-04 19:10:10 +08:00
|
|
|
{
|
|
|
|
struct pmu_sys_event_iter_data idata = {
|
|
|
|
.head = head,
|
|
|
|
.pmu = pmu,
|
|
|
|
};
|
|
|
|
|
|
|
|
if (!pmu->id)
|
|
|
|
return;
|
|
|
|
|
|
|
|
pmu_for_each_sys_event(pmu_add_sys_aliases_iter_fn, &idata);
|
|
|
|
}
|
|
|
|
|
2015-06-10 15:25:07 +08:00
|
|
|
struct perf_event_attr * __weak
|
2014-07-31 14:00:49 +08:00
|
|
|
perf_pmu__get_default_config(struct perf_pmu *pmu __maybe_unused)
|
|
|
|
{
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
char * __weak
|
|
|
|
pmu_find_real_name(const char *name)
|
|
|
|
{
|
|
|
|
return (char *)name;
|
|
|
|
}
|
|
|
|
|
|
|
|
char * __weak
|
|
|
|
pmu_find_alias_name(const char *name __maybe_unused)
|
|
|
|
{
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2019-03-05 23:25:32 +08:00
|
|
|
static int pmu_max_precise(const char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
int max_precise = -1;
|
|
|
|
|
|
|
|
scnprintf(path, PATH_MAX,
|
|
|
|
"bus/event_source/devices/%s/caps/max_precise",
|
|
|
|
name);
|
|
|
|
|
|
|
|
sysfs__read_int(path, &max_precise);
|
|
|
|
return max_precise;
|
|
|
|
}
|
|
|
|
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
static struct perf_pmu *pmu_lookup(const char *lookup_name)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
LIST_HEAD(format);
|
2012-06-15 14:31:41 +08:00
|
|
|
LIST_HEAD(aliases);
|
2012-03-16 03:09:17 +08:00
|
|
|
__u32 type;
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
char *name = pmu_find_real_name(lookup_name);
|
2021-07-08 09:36:58 +08:00
|
|
|
bool is_hybrid = perf_pmu__hybrid_mounted(name);
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
char *alias_name;
|
2021-07-08 09:36:58 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Check pmu name for hybrid and the pmu may be invalid in sysfs
|
|
|
|
*/
|
|
|
|
if (!strncmp(name, "cpu_", 4) && !is_hybrid)
|
|
|
|
return NULL;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* The pmu data we store & need consists of the pmu
|
|
|
|
* type value and format definitions. Load both right
|
|
|
|
* now.
|
|
|
|
*/
|
|
|
|
if (pmu_format(name, &format))
|
|
|
|
return NULL;
|
|
|
|
|
2017-01-28 10:03:38 +08:00
|
|
|
/*
|
|
|
|
* Check the type first to avoid unnecessary work.
|
|
|
|
*/
|
|
|
|
if (pmu_type(name, &type))
|
2012-10-10 20:53:16 +08:00
|
|
|
return NULL;
|
|
|
|
|
2017-01-28 10:03:38 +08:00
|
|
|
if (pmu_aliases(name, &aliases))
|
2012-03-16 03:09:17 +08:00
|
|
|
return NULL;
|
|
|
|
|
|
|
|
pmu = zalloc(sizeof(*pmu));
|
|
|
|
if (!pmu)
|
|
|
|
return NULL;
|
|
|
|
|
2012-09-10 15:53:50 +08:00
|
|
|
pmu->cpus = pmu_cpumask(name);
|
2017-10-17 02:32:18 +08:00
|
|
|
pmu->name = strdup(name);
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
if (!pmu->name)
|
|
|
|
goto err;
|
|
|
|
|
|
|
|
alias_name = pmu_find_alias_name(name);
|
|
|
|
if (alias_name) {
|
|
|
|
pmu->alias_name = strdup(alias_name);
|
|
|
|
if (!pmu->alias_name)
|
|
|
|
goto err;
|
|
|
|
}
|
|
|
|
|
2017-10-17 02:32:18 +08:00
|
|
|
pmu->type = type;
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
pmu->is_uncore = pmu_is_uncore(name);
|
2020-12-04 19:10:09 +08:00
|
|
|
if (pmu->is_uncore)
|
|
|
|
pmu->id = pmu_id(name);
|
2021-07-08 09:36:58 +08:00
|
|
|
pmu->is_hybrid = is_hybrid;
|
2019-03-05 23:25:32 +08:00
|
|
|
pmu->max_precise = pmu_max_precise(name);
|
2017-10-17 02:32:18 +08:00
|
|
|
pmu_add_cpu_aliases(&aliases, pmu);
|
2020-12-04 19:10:10 +08:00
|
|
|
pmu_add_sys_aliases(&aliases, pmu);
|
perf pmu: Unbreak perf record for arm/arm64 with events with explicit PMU
Currently, perf record is broken on arm/arm64 systems when the PMU is
specified explicitly as part of the event, e.g.
$ ./perf record -e armv8_cortex_a53/cpu_cycles/u true
In such cases, perf record fails to open events unless
perf_event_paranoid is set to -1, even if the PMU in question supports
mode exclusion. Further, even when perf_event_paranoid is toggled, no
samples are recorded.
This is an unintended side effect of commit:
e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
... which assumes that if a PMU has an associated cpu_map, it is an
uncore PMU, and forces events for such PMUs to be system-wide.
This is not true for arm/arm64 systems, which can have heterogeneous
CPUs. To account for this, multiple CPU PMUs are exposed, each with a
"cpus" field under sysfs, which the perf tool parses into a cpu_map. ARM
PMUs do not have a "cpumask" file, and only have a "cpus" file. For the
gory details as to why, see commit:
7e3fcffe95544010 ("perf pmu: Support alternative sysfs cpumask")
Given all of this, we can instead identify uncore PMUs by explicitly
checking for a "cpumask" file, and restore arm/arm64 PMU support back to
a working state. This patch does so, adding a new perf_pmu::is_uncore
field, and splitting the existing cpumask parsing so that it can be
reused.
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Tested-by Will Deacon <will.deacon@arm.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: 4.12+ <stable@vger.kernel.org>
Fixes: e3ba76deef23064f ("perf tools: Force uncore events to system wide monitoring)
Link: http://lkml.kernel.org/r/1507315102-5942-1-git-send-email-mark.rutland@arm.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-10-07 02:38:22 +08:00
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
INIT_LIST_HEAD(&pmu->format);
|
2012-06-15 14:31:41 +08:00
|
|
|
INIT_LIST_HEAD(&pmu->aliases);
|
2020-03-20 04:25:01 +08:00
|
|
|
INIT_LIST_HEAD(&pmu->caps);
|
2012-03-16 03:09:17 +08:00
|
|
|
list_splice(&format, &pmu->format);
|
2012-06-15 14:31:41 +08:00
|
|
|
list_splice(&aliases, &pmu->aliases);
|
2012-06-15 04:38:37 +08:00
|
|
|
list_add_tail(&pmu->list, &pmus);
|
2014-07-31 14:00:49 +08:00
|
|
|
|
2021-04-27 15:01:18 +08:00
|
|
|
if (pmu->is_hybrid)
|
|
|
|
list_add_tail(&pmu->hybrid_list, &perf_pmu__hybrid_pmus);
|
|
|
|
|
2014-07-31 14:00:49 +08:00
|
|
|
pmu->default_config = perf_pmu__get_default_config(pmu);
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
return pmu;
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
err:
|
|
|
|
if (pmu->name)
|
|
|
|
free(pmu->name);
|
|
|
|
free(pmu);
|
|
|
|
return NULL;
|
2012-03-16 03:09:17 +08:00
|
|
|
}
|
|
|
|
|
2013-07-04 21:20:25 +08:00
|
|
|
static struct perf_pmu *pmu_find(const char *name)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
list_for_each_entry(pmu, &pmus, list) {
|
|
|
|
if (!strcmp(pmu->name, name) ||
|
|
|
|
(pmu->alias_name && !strcmp(pmu->alias_name, name)))
|
2012-03-16 03:09:17 +08:00
|
|
|
return pmu;
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
}
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2020-04-30 07:14:42 +08:00
|
|
|
struct perf_pmu *perf_pmu__find_by_type(unsigned int type)
|
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
|
|
|
|
list_for_each_entry(pmu, &pmus, list)
|
|
|
|
if (pmu->type == type)
|
|
|
|
return pmu;
|
|
|
|
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2012-08-17 03:10:24 +08:00
|
|
|
struct perf_pmu *perf_pmu__scan(struct perf_pmu *pmu)
|
|
|
|
{
|
|
|
|
/*
|
|
|
|
* pmu iterator: If pmu is NULL, we start at the begin,
|
|
|
|
* otherwise return the next pmu. Returns NULL on end.
|
|
|
|
*/
|
|
|
|
if (!pmu) {
|
|
|
|
pmu_read_sysfs();
|
|
|
|
pmu = list_prepare_entry(pmu, &pmus, list);
|
|
|
|
}
|
|
|
|
list_for_each_entry_continue(pmu, &pmus, list)
|
|
|
|
return pmu;
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2020-04-30 02:50:10 +08:00
|
|
|
struct perf_pmu *evsel__find_pmu(struct evsel *evsel)
|
2020-04-01 18:16:09 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu *pmu = NULL;
|
|
|
|
|
|
|
|
while ((pmu = perf_pmu__scan(pmu)) != NULL) {
|
|
|
|
if (pmu->type == evsel->core.attr.type)
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
|
|
|
|
return pmu;
|
|
|
|
}
|
|
|
|
|
2020-04-30 02:51:38 +08:00
|
|
|
bool evsel__is_aux_event(struct evsel *evsel)
|
2020-04-01 18:16:09 +08:00
|
|
|
{
|
2020-04-30 02:50:10 +08:00
|
|
|
struct perf_pmu *pmu = evsel__find_pmu(evsel);
|
2020-04-01 18:16:09 +08:00
|
|
|
|
|
|
|
return pmu && pmu->auxtrace;
|
|
|
|
}
|
|
|
|
|
2013-07-04 21:20:25 +08:00
|
|
|
struct perf_pmu *perf_pmu__find(const char *name)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Once PMU is loaded it stays in the list,
|
|
|
|
* so we keep us from multiple reading/parsing
|
|
|
|
* the pmu format definitions.
|
|
|
|
*/
|
|
|
|
pmu = pmu_find(name);
|
|
|
|
if (pmu)
|
|
|
|
return pmu;
|
|
|
|
|
|
|
|
return pmu_lookup(name);
|
|
|
|
}
|
|
|
|
|
2013-01-19 03:54:00 +08:00
|
|
|
static struct perf_pmu_format *
|
2015-07-18 00:33:49 +08:00
|
|
|
pmu_find_format(struct list_head *formats, const char *name)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_format *format;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
list_for_each_entry(format, formats, list)
|
|
|
|
if (!strcmp(format->name, name))
|
|
|
|
return format;
|
|
|
|
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2015-07-18 00:33:49 +08:00
|
|
|
__u64 perf_pmu__format_bits(struct list_head *formats, const char *name)
|
|
|
|
{
|
|
|
|
struct perf_pmu_format *format = pmu_find_format(formats, name);
|
|
|
|
__u64 bits = 0;
|
|
|
|
int fbit;
|
|
|
|
|
|
|
|
if (!format)
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
for_each_set_bit(fbit, format->bits, PERF_PMU_FORMAT_BITS)
|
|
|
|
bits |= 1ULL << fbit;
|
|
|
|
|
|
|
|
return bits;
|
|
|
|
}
|
|
|
|
|
2019-11-15 20:42:22 +08:00
|
|
|
int perf_pmu__format_type(struct list_head *formats, const char *name)
|
|
|
|
{
|
|
|
|
struct perf_pmu_format *format = pmu_find_format(formats, name);
|
|
|
|
|
|
|
|
if (!format)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
return format->value;
|
|
|
|
}
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
/*
|
2014-07-31 14:00:49 +08:00
|
|
|
* Sets value based on the format definition (format parameter)
|
2021-03-24 00:09:15 +08:00
|
|
|
* and unformatted value (value parameter).
|
2012-03-16 03:09:17 +08:00
|
|
|
*/
|
2014-07-31 14:00:49 +08:00
|
|
|
static void pmu_format_value(unsigned long *format, __u64 value, __u64 *v,
|
|
|
|
bool zero)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
|
|
|
unsigned long fbit, vbit;
|
|
|
|
|
|
|
|
for (fbit = 0, vbit = 0; fbit < PERF_PMU_FORMAT_BITS; fbit++) {
|
|
|
|
|
|
|
|
if (!test_bit(fbit, format))
|
|
|
|
continue;
|
|
|
|
|
2014-07-31 14:00:49 +08:00
|
|
|
if (value & (1llu << vbit++))
|
|
|
|
*v |= (1llu << fbit);
|
|
|
|
else if (zero)
|
|
|
|
*v &= ~(1llu << fbit);
|
2012-03-16 03:09:17 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-07-18 00:33:50 +08:00
|
|
|
static __u64 pmu_format_max_value(const unsigned long *format)
|
|
|
|
{
|
Revert "perf tools: Fix PMU term format max value calculation"
This reverts commit ac0e2cd555373ae6f8f3a3ad3fbbf5b6d1e7aaaa.
Michael reported an issue with oversized terms values assignment
and I noticed there was actually a misunderstanding of the max
value check in the past.
The above commit's changelog says:
If bit 21 is set, there is parsing issues as below.
$ perf stat -a -e uncore_qpi_0/event=0x200002,umask=0x8/
event syntax error: '..pi_0/event=0x200002,umask=0x8/'
\___ value too big for format, maximum is 511
But there's no issue there, because the event value is distributed
along the value defined by the format. Even if the format defines
separated bit, the value is treated as a continual number, which
should follow the format definition.
In above case it's 9-bit value with last bit separated:
$ cat uncore_qpi_0/format/event
config:0-7,21
Hence the value 0x200002 is correctly reported as format violation,
because it exceeds 9 bits. It should have been 0x102 instead, which
sets the 9th bit - the bit 21 of the format.
$ perf stat -vv -a -e uncore_qpi_0/event=0x102,umask=0x8/
Using CPUID GenuineIntel-6-2D
...
------------------------------------------------------------
perf_event_attr:
type 10
size 112
config 0x200802
sample_type IDENTIFIER
...
Reported-by: Michael Petlan <mpetlan@redhat.com>
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Fixes: ac0e2cd55537 ("perf tools: Fix PMU term format max value calculation")
Link: http://lkml.kernel.org/r/20181003072046.29276-1-jolsa@kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2018-10-03 15:20:46 +08:00
|
|
|
int w;
|
2016-03-31 03:16:15 +08:00
|
|
|
|
Revert "perf tools: Fix PMU term format max value calculation"
This reverts commit ac0e2cd555373ae6f8f3a3ad3fbbf5b6d1e7aaaa.
Michael reported an issue with oversized terms values assignment
and I noticed there was actually a misunderstanding of the max
value check in the past.
The above commit's changelog says:
If bit 21 is set, there is parsing issues as below.
$ perf stat -a -e uncore_qpi_0/event=0x200002,umask=0x8/
event syntax error: '..pi_0/event=0x200002,umask=0x8/'
\___ value too big for format, maximum is 511
But there's no issue there, because the event value is distributed
along the value defined by the format. Even if the format defines
separated bit, the value is treated as a continual number, which
should follow the format definition.
In above case it's 9-bit value with last bit separated:
$ cat uncore_qpi_0/format/event
config:0-7,21
Hence the value 0x200002 is correctly reported as format violation,
because it exceeds 9 bits. It should have been 0x102 instead, which
sets the 9th bit - the bit 21 of the format.
$ perf stat -vv -a -e uncore_qpi_0/event=0x102,umask=0x8/
Using CPUID GenuineIntel-6-2D
...
------------------------------------------------------------
perf_event_attr:
type 10
size 112
config 0x200802
sample_type IDENTIFIER
...
Reported-by: Michael Petlan <mpetlan@redhat.com>
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Fixes: ac0e2cd55537 ("perf tools: Fix PMU term format max value calculation")
Link: http://lkml.kernel.org/r/20181003072046.29276-1-jolsa@kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2018-10-03 15:20:46 +08:00
|
|
|
w = bitmap_weight(format, PERF_PMU_FORMAT_BITS);
|
|
|
|
if (!w)
|
|
|
|
return 0;
|
|
|
|
if (w < 64)
|
|
|
|
return (1ULL << w) - 1;
|
|
|
|
return -1;
|
2015-07-18 00:33:50 +08:00
|
|
|
}
|
|
|
|
|
2015-01-08 09:13:50 +08:00
|
|
|
/*
|
|
|
|
* Term is a string term, and might be a param-term. Try to look up it's value
|
|
|
|
* in the remaining terms.
|
|
|
|
* - We have a term like "base-or-format-term=param-term",
|
|
|
|
* - We need to find the value supplied for "param-term" (with param-term named
|
|
|
|
* in a config string) later on in the term list.
|
|
|
|
*/
|
|
|
|
static int pmu_resolve_param_term(struct parse_events_term *term,
|
|
|
|
struct list_head *head_terms,
|
|
|
|
__u64 *value)
|
|
|
|
{
|
|
|
|
struct parse_events_term *t;
|
|
|
|
|
|
|
|
list_for_each_entry(t, head_terms, list) {
|
2020-03-26 00:40:22 +08:00
|
|
|
if (t->type_val == PARSE_EVENTS__TERM_TYPE_NUM &&
|
|
|
|
t->config && !strcmp(t->config, term->config)) {
|
|
|
|
t->used = true;
|
|
|
|
*value = t->val.num;
|
|
|
|
return 0;
|
2015-01-08 09:13:50 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2017-02-17 16:17:38 +08:00
|
|
|
if (verbose > 0)
|
2015-01-08 09:13:50 +08:00
|
|
|
printf("Required parameter '%s' not specified\n", term->config);
|
|
|
|
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
perf tools: Show proper error message for wrong terms of hw/sw events
Show proper error message and show valid terms when wrong config terms
is specified for hw/sw type perf events.
This patch makes the original error format function formats_error_string()
more generic, which only outputs the static config terms for hw/sw perf
events, and prepends pmu formats for pmu events.
Before this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
invalid or unsupported event: 'cpu-clock/freqx=200/'
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
After this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
event syntax error: 'cpu-clock/freqx=200/'
\___ unknown term
valid terms: config,config1,config2,name,period,freq,branch_type,time,call-graph,stack-size
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: He Kuang <hekuang@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1443412336-120050-2-git-send-email-hekuang@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-09-28 11:52:14 +08:00
|
|
|
static char *pmu_formats_string(struct list_head *formats)
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu_format *format;
|
2016-05-10 13:47:44 +08:00
|
|
|
char *str = NULL;
|
|
|
|
struct strbuf buf = STRBUF_INIT;
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
unsigned i = 0;
|
|
|
|
|
perf tools: Show proper error message for wrong terms of hw/sw events
Show proper error message and show valid terms when wrong config terms
is specified for hw/sw type perf events.
This patch makes the original error format function formats_error_string()
more generic, which only outputs the static config terms for hw/sw perf
events, and prepends pmu formats for pmu events.
Before this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
invalid or unsupported event: 'cpu-clock/freqx=200/'
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
After this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
event syntax error: 'cpu-clock/freqx=200/'
\___ unknown term
valid terms: config,config1,config2,name,period,freq,branch_type,time,call-graph,stack-size
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: He Kuang <hekuang@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1443412336-120050-2-git-send-email-hekuang@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-09-28 11:52:14 +08:00
|
|
|
if (!formats)
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
return NULL;
|
|
|
|
|
|
|
|
/* sysfs exported terms */
|
perf tools: Show proper error message for wrong terms of hw/sw events
Show proper error message and show valid terms when wrong config terms
is specified for hw/sw type perf events.
This patch makes the original error format function formats_error_string()
more generic, which only outputs the static config terms for hw/sw perf
events, and prepends pmu formats for pmu events.
Before this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
invalid or unsupported event: 'cpu-clock/freqx=200/'
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
After this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
event syntax error: 'cpu-clock/freqx=200/'
\___ unknown term
valid terms: config,config1,config2,name,period,freq,branch_type,time,call-graph,stack-size
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: He Kuang <hekuang@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1443412336-120050-2-git-send-email-hekuang@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-09-28 11:52:14 +08:00
|
|
|
list_for_each_entry(format, formats, list)
|
2016-05-10 13:47:44 +08:00
|
|
|
if (strbuf_addf(&buf, i++ ? ",%s" : "%s", format->name) < 0)
|
|
|
|
goto error;
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
|
perf tools: Show proper error message for wrong terms of hw/sw events
Show proper error message and show valid terms when wrong config terms
is specified for hw/sw type perf events.
This patch makes the original error format function formats_error_string()
more generic, which only outputs the static config terms for hw/sw perf
events, and prepends pmu formats for pmu events.
Before this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
invalid or unsupported event: 'cpu-clock/freqx=200/'
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
After this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
event syntax error: 'cpu-clock/freqx=200/'
\___ unknown term
valid terms: config,config1,config2,name,period,freq,branch_type,time,call-graph,stack-size
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: He Kuang <hekuang@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1443412336-120050-2-git-send-email-hekuang@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-09-28 11:52:14 +08:00
|
|
|
str = strbuf_detach(&buf, NULL);
|
2016-05-10 13:47:44 +08:00
|
|
|
error:
|
perf tools: Show proper error message for wrong terms of hw/sw events
Show proper error message and show valid terms when wrong config terms
is specified for hw/sw type perf events.
This patch makes the original error format function formats_error_string()
more generic, which only outputs the static config terms for hw/sw perf
events, and prepends pmu formats for pmu events.
Before this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
invalid or unsupported event: 'cpu-clock/freqx=200/'
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
After this patch:
$ perf record -e 'cpu-clock/freqx=200/' -a sleep 1
event syntax error: 'cpu-clock/freqx=200/'
\___ unknown term
valid terms: config,config1,config2,name,period,freq,branch_type,time,call-graph,stack-size
Run 'perf list' for a list of valid events
usage: perf record [<options>] [<command>]
or: perf record [<options>] -- <command> [<options>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: He Kuang <hekuang@huawei.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: pi3orama@163.com
Link: http://lkml.kernel.org/r/1443412336-120050-2-git-send-email-hekuang@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-09-28 11:52:14 +08:00
|
|
|
strbuf_release(&buf);
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
|
|
|
|
return str;
|
|
|
|
}
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
/*
|
|
|
|
* Setup one of config[12] attr members based on the
|
2014-01-09 00:43:51 +08:00
|
|
|
* user input data - term parameter.
|
2012-03-16 03:09:17 +08:00
|
|
|
*/
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
static int pmu_config_term(const char *pmu_name,
|
|
|
|
struct list_head *formats,
|
2012-03-16 03:09:17 +08:00
|
|
|
struct perf_event_attr *attr,
|
2014-07-31 14:00:49 +08:00
|
|
|
struct parse_events_term *term,
|
2015-01-08 09:13:50 +08:00
|
|
|
struct list_head *head_terms,
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
bool zero, struct parse_events_error *err)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_format *format;
|
2012-03-16 03:09:17 +08:00
|
|
|
__u64 *vp;
|
2015-07-18 00:33:50 +08:00
|
|
|
__u64 val, max_val;
|
2015-01-08 09:13:50 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* If this is a parameter we've already used for parameterized-eval,
|
|
|
|
* skip it in normal eval.
|
|
|
|
*/
|
|
|
|
if (term->used)
|
|
|
|
return 0;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Hardcoded terms should be already in, so nothing
|
|
|
|
* to be done for them.
|
|
|
|
*/
|
|
|
|
if (parse_events__is_hardcoded_term(term))
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
format = pmu_find_format(formats, term->config);
|
2015-01-08 09:13:50 +08:00
|
|
|
if (!format) {
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
char *pmu_term = pmu_formats_string(formats);
|
|
|
|
char *unknown_term;
|
|
|
|
char *help_msg;
|
|
|
|
|
|
|
|
if (asprintf(&unknown_term,
|
|
|
|
"unknown term '%s' for pmu '%s'",
|
|
|
|
term->config, pmu_name) < 0)
|
|
|
|
unknown_term = NULL;
|
|
|
|
help_msg = parse_events_formats_error_string(pmu_term);
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
if (err) {
|
perf parse: Add parse events handle error
Parse event error handling may overwrite one error string with another
creating memory leaks. Introduce a helper routine that warns about
multiple error messages as well as avoiding the memory leak.
A reproduction of this problem can be seen with:
perf stat -e c/c/
After this change this produces:
WARNING: multiple event parsing errors
event syntax error: 'c/c/'
\___ unknown term
valid terms: event,filter_rem,filter_opc0,edge,filter_isoc,filter_tid,filter_loc,filter_nc,inv,umask,filter_opc1,tid_en,thresh,filter_all_op,filter_not_nm,filter_state,filter_nm,config,config1,config2,name,period,percore
Run 'perf list' for a list of valid events
Usage: perf stat [<options>] [<command>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: Ian Rogers <irogers@google.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexei Starovoitov <ast@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Martin KaFai Lau <kafai@fb.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Song Liu <songliubraving@fb.com>
Cc: Stephane Eranian <eranian@google.com>
Cc: Yonghong Song <yhs@fb.com>
Cc: bpf@vger.kernel.org
Cc: clang-built-linux@googlegroups.com
Cc: netdev@vger.kernel.org
Link: http://lore.kernel.org/lkml/20191030223448.12930-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-10-31 06:34:39 +08:00
|
|
|
parse_events__handle_error(err, term->err_term,
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
unknown_term,
|
|
|
|
help_msg);
|
|
|
|
} else {
|
|
|
|
pr_debug("%s (%s)\n", unknown_term, help_msg);
|
|
|
|
free(unknown_term);
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
}
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
free(pmu_term);
|
2012-03-16 03:09:17 +08:00
|
|
|
return -EINVAL;
|
2015-01-08 09:13:50 +08:00
|
|
|
}
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
switch (format->value) {
|
|
|
|
case PERF_PMU_FORMAT_VALUE_CONFIG:
|
|
|
|
vp = &attr->config;
|
|
|
|
break;
|
|
|
|
case PERF_PMU_FORMAT_VALUE_CONFIG1:
|
|
|
|
vp = &attr->config1;
|
|
|
|
break;
|
|
|
|
case PERF_PMU_FORMAT_VALUE_CONFIG2:
|
|
|
|
vp = &attr->config2;
|
|
|
|
break;
|
|
|
|
default:
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2012-04-26 00:24:57 +08:00
|
|
|
/*
|
2015-01-08 09:13:50 +08:00
|
|
|
* Either directly use a numeric term, or try to translate string terms
|
|
|
|
* using event parameters.
|
2012-04-26 00:24:57 +08:00
|
|
|
*/
|
2017-02-17 22:00:56 +08:00
|
|
|
if (term->type_val == PARSE_EVENTS__TERM_TYPE_NUM) {
|
|
|
|
if (term->no_value &&
|
|
|
|
bitmap_weight(format->bits, PERF_PMU_FORMAT_BITS) > 1) {
|
|
|
|
if (err) {
|
perf parse: Add parse events handle error
Parse event error handling may overwrite one error string with another
creating memory leaks. Introduce a helper routine that warns about
multiple error messages as well as avoiding the memory leak.
A reproduction of this problem can be seen with:
perf stat -e c/c/
After this change this produces:
WARNING: multiple event parsing errors
event syntax error: 'c/c/'
\___ unknown term
valid terms: event,filter_rem,filter_opc0,edge,filter_isoc,filter_tid,filter_loc,filter_nc,inv,umask,filter_opc1,tid_en,thresh,filter_all_op,filter_not_nm,filter_state,filter_nm,config,config1,config2,name,period,percore
Run 'perf list' for a list of valid events
Usage: perf stat [<options>] [<command>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: Ian Rogers <irogers@google.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexei Starovoitov <ast@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Martin KaFai Lau <kafai@fb.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Song Liu <songliubraving@fb.com>
Cc: Stephane Eranian <eranian@google.com>
Cc: Yonghong Song <yhs@fb.com>
Cc: bpf@vger.kernel.org
Cc: clang-built-linux@googlegroups.com
Cc: netdev@vger.kernel.org
Link: http://lore.kernel.org/lkml/20191030223448.12930-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-10-31 06:34:39 +08:00
|
|
|
parse_events__handle_error(err, term->err_val,
|
|
|
|
strdup("no value assigned for term"),
|
|
|
|
NULL);
|
2017-02-17 22:00:56 +08:00
|
|
|
}
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2015-01-08 09:13:50 +08:00
|
|
|
val = term->val.num;
|
2017-02-17 22:00:56 +08:00
|
|
|
} else if (term->type_val == PARSE_EVENTS__TERM_TYPE_STR) {
|
2015-01-08 09:13:50 +08:00
|
|
|
if (strcmp(term->val.str, "?")) {
|
2017-02-17 16:17:38 +08:00
|
|
|
if (verbose > 0) {
|
2015-01-08 09:13:50 +08:00
|
|
|
pr_info("Invalid sysfs entry %s=%s\n",
|
|
|
|
term->config, term->val.str);
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
}
|
|
|
|
if (err) {
|
perf parse: Add parse events handle error
Parse event error handling may overwrite one error string with another
creating memory leaks. Introduce a helper routine that warns about
multiple error messages as well as avoiding the memory leak.
A reproduction of this problem can be seen with:
perf stat -e c/c/
After this change this produces:
WARNING: multiple event parsing errors
event syntax error: 'c/c/'
\___ unknown term
valid terms: event,filter_rem,filter_opc0,edge,filter_isoc,filter_tid,filter_loc,filter_nc,inv,umask,filter_opc1,tid_en,thresh,filter_all_op,filter_not_nm,filter_state,filter_nm,config,config1,config2,name,period,percore
Run 'perf list' for a list of valid events
Usage: perf stat [<options>] [<command>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: Ian Rogers <irogers@google.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexei Starovoitov <ast@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Martin KaFai Lau <kafai@fb.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Song Liu <songliubraving@fb.com>
Cc: Stephane Eranian <eranian@google.com>
Cc: Yonghong Song <yhs@fb.com>
Cc: bpf@vger.kernel.org
Cc: clang-built-linux@googlegroups.com
Cc: netdev@vger.kernel.org
Link: http://lore.kernel.org/lkml/20191030223448.12930-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-10-31 06:34:39 +08:00
|
|
|
parse_events__handle_error(err, term->err_val,
|
|
|
|
strdup("expected numeric value"),
|
|
|
|
NULL);
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
}
|
2015-01-08 09:13:50 +08:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (pmu_resolve_param_term(term, head_terms, &val))
|
|
|
|
return -EINVAL;
|
|
|
|
} else
|
|
|
|
return -EINVAL;
|
|
|
|
|
2015-07-18 00:33:50 +08:00
|
|
|
max_val = pmu_format_max_value(format->bits);
|
|
|
|
if (val > max_val) {
|
|
|
|
if (err) {
|
perf parse: Add parse events handle error
Parse event error handling may overwrite one error string with another
creating memory leaks. Introduce a helper routine that warns about
multiple error messages as well as avoiding the memory leak.
A reproduction of this problem can be seen with:
perf stat -e c/c/
After this change this produces:
WARNING: multiple event parsing errors
event syntax error: 'c/c/'
\___ unknown term
valid terms: event,filter_rem,filter_opc0,edge,filter_isoc,filter_tid,filter_loc,filter_nc,inv,umask,filter_opc1,tid_en,thresh,filter_all_op,filter_not_nm,filter_state,filter_nm,config,config1,config2,name,period,percore
Run 'perf list' for a list of valid events
Usage: perf stat [<options>] [<command>]
-e, --event <event> event selector. use 'perf list' to list available events
Signed-off-by: Ian Rogers <irogers@google.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexei Starovoitov <ast@kernel.org>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Daniel Borkmann <daniel@iogearbox.net>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Martin KaFai Lau <kafai@fb.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Song Liu <songliubraving@fb.com>
Cc: Stephane Eranian <eranian@google.com>
Cc: Yonghong Song <yhs@fb.com>
Cc: bpf@vger.kernel.org
Cc: clang-built-linux@googlegroups.com
Cc: netdev@vger.kernel.org
Link: http://lore.kernel.org/lkml/20191030223448.12930-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-10-31 06:34:39 +08:00
|
|
|
char *err_str;
|
|
|
|
|
|
|
|
parse_events__handle_error(err, term->err_val,
|
|
|
|
asprintf(&err_str,
|
|
|
|
"value too big for format, maximum is %llu",
|
|
|
|
(unsigned long long)max_val) < 0
|
|
|
|
? strdup("value too big for format")
|
|
|
|
: err_str,
|
|
|
|
NULL);
|
2015-07-18 00:33:50 +08:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
/*
|
|
|
|
* Assume we don't care if !err, in which case the value will be
|
|
|
|
* silently truncated.
|
|
|
|
*/
|
|
|
|
}
|
|
|
|
|
2015-01-08 09:13:50 +08:00
|
|
|
pmu_format_value(format->bits, val, vp, zero);
|
2012-03-16 03:09:17 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
int perf_pmu__config_terms(const char *pmu_name, struct list_head *formats,
|
2012-11-10 08:46:50 +08:00
|
|
|
struct perf_event_attr *attr,
|
2014-07-31 14:00:49 +08:00
|
|
|
struct list_head *head_terms,
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
bool zero, struct parse_events_error *err)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
2013-01-19 03:29:49 +08:00
|
|
|
struct parse_events_term *term;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
2015-01-08 09:13:50 +08:00
|
|
|
list_for_each_entry(term, head_terms, list) {
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
if (pmu_config_term(pmu_name, formats, attr, term, head_terms,
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
zero, err))
|
2012-03-16 03:09:17 +08:00
|
|
|
return -EINVAL;
|
2015-01-08 09:13:50 +08:00
|
|
|
}
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Configures event's 'attr' parameter based on the:
|
|
|
|
* 1) users input - specified in terms parameter
|
|
|
|
* 2) pmu format definitions - specified by pmu parameter
|
|
|
|
*/
|
|
|
|
int perf_pmu__config(struct perf_pmu *pmu, struct perf_event_attr *attr,
|
perf tools: Add term support for parse_events_error
Allowing event's term processing to report back error, like:
$ perf record -e 'cpu/even=0x1/' ls
event syntax error: 'cpu/even=0x1/'
\___ unknown term
valid terms: pc,any,inv,edge,cmask,event,in_tx,ldlat,umask,in_tx_cp,offcore_rsp,config,config1,config2,name,period,branch_type
Signed-off-by: Jiri Olsa <jolsa@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1429729824-13932-7-git-send-email-jolsa@kernel.org
[ Renamed 'error' variables to 'err', not to clash with util.h error() ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-04-23 03:10:21 +08:00
|
|
|
struct list_head *head_terms,
|
|
|
|
struct parse_events_error *err)
|
2012-03-16 03:09:17 +08:00
|
|
|
{
|
2014-07-31 14:00:49 +08:00
|
|
|
bool zero = !!pmu->default_config;
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
attr->type = pmu->type;
|
perf parse-events: Make add PMU verbose output clearer
On a CPU like skylakex an uncore_iio_0 PMU may alias with
uncore_iio_free_running_0. The latter PMU doesn't support fc_mask as a
parameter and so pmu_config_term fails. Typically parse_events_add_pmu
is called in a loop where if one alias succeeds errors are ignored,
however, if multiple errors occur parse_events__handle_error will
currently give a WARN_ONCE.
This change removes the WARN_ONCE in parse_events__handle_error and
makes it a pr_debug. It adds verbose messages to parse_events_add_pmu
warning that non-fatal errors may occur, while giving details on the pmu
and config terms for useful context. pmu_config_term is altered so the
failing term and pmu are present in the case of the 'unknown term' error
which makes spotting the free_running case more straightforward.
Before:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
WARNING: multiple event parsing errors
...
Invalid event/parameter 'fc_mask'
...
After:
$ perf --debug verbose=3 stat -M llc_misses.pcie_read sleep 1
Using CPUID GenuineIntel-6-55-4
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
metric expr unc_iio_data_req_of_cpu.mem_read.part0 + unc_iio_data_req_of_cpu.mem_read.part1 + unc_iio_data_req_of_cpu.mem_read.part2 + unc_iio_data_req_of_cpu.mem_read.part3 for LLC_MISSES.PCIE_READ
found event unc_iio_data_req_of_cpu.mem_read.part0
found event unc_iio_data_req_of_cpu.mem_read.part1
found event unc_iio_data_req_of_cpu.mem_read.part2
found event unc_iio_data_req_of_cpu.mem_read.part3
adding {unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W,{unc_iio_data_req_of_cpu.mem_read.part0,unc_iio_data_req_of_cpu.mem_read.part1,unc_iio_data_req_of_cpu.mem_read.part2,unc_iio_data_req_of_cpu.mem_read.part3}:W
intel_pt default config: tsc,mtc,mtc_period=3,psb_period=3,pt,branch
Attempting to add event pmu 'uncore_iio_free_running_5' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_5' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_3' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_3' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Attempting to add event pmu 'uncore_iio_free_running_1' with 'unc_iio_data_req_of_cpu.mem_read.part0,' that may result in non-fatal errors
After aliases, add event pmu 'uncore_iio_free_running_1' with 'fc_mask,ch_mask,umask,event,' that may result in non-fatal errors
Multiple errors dropping message: unknown term 'fc_mask' for pmu 'uncore_iio_free_running_3' (valid terms: event,umask,config,config1,config2,name,period,percore)
...
So before you see a 'WARNING: multiple event parsing errors' and
'Invalid event/parameter'. After you see 'Attempting... that may result
in non-fatal errors' then 'Multiple errors...' with details that
'fc_mask' wasn't known to a free running counter. While not completely
clean, this makes it clearer that an error hasn't really occurred.
v2. addresses review feedback from Jiri Olsa <jolsa@redhat.com>.
Signed-off-by: Ian Rogers <irogers@google.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: John Garry <john.garry@huawei.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Mathieu Poirier <mathieu.poirier@linaro.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lore.kernel.org/lkml/20200513220635.54700-1-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-05-14 06:06:35 +08:00
|
|
|
return perf_pmu__config_terms(pmu->name, &pmu->format, attr,
|
|
|
|
head_terms, zero, err);
|
2012-03-16 03:09:17 +08:00
|
|
|
}
|
|
|
|
|
2013-01-19 03:54:00 +08:00
|
|
|
static struct perf_pmu_alias *pmu_find_alias(struct perf_pmu *pmu,
|
|
|
|
struct parse_events_term *term)
|
2012-06-15 14:31:41 +08:00
|
|
|
{
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_alias *alias;
|
2012-06-15 14:31:41 +08:00
|
|
|
char *name;
|
|
|
|
|
|
|
|
if (parse_events__is_hardcoded_term(term))
|
|
|
|
return NULL;
|
|
|
|
|
|
|
|
if (term->type_val == PARSE_EVENTS__TERM_TYPE_NUM) {
|
|
|
|
if (term->val.num != 1)
|
|
|
|
return NULL;
|
|
|
|
if (pmu_find_format(&pmu->format, term->config))
|
|
|
|
return NULL;
|
|
|
|
name = term->config;
|
|
|
|
} else if (term->type_val == PARSE_EVENTS__TERM_TYPE_STR) {
|
|
|
|
if (strcasecmp(term->config, "event"))
|
|
|
|
return NULL;
|
|
|
|
name = term->val.str;
|
|
|
|
} else {
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
|
|
|
list_for_each_entry(alias, &pmu->aliases, list) {
|
|
|
|
if (!strcasecmp(alias->name, name))
|
|
|
|
return alias;
|
|
|
|
}
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2014-11-21 17:31:13 +08:00
|
|
|
static int check_info_data(struct perf_pmu_alias *alias,
|
|
|
|
struct perf_pmu_info *info)
|
2013-11-13 00:58:49 +08:00
|
|
|
{
|
|
|
|
/*
|
|
|
|
* Only one term in event definition can
|
2014-11-21 17:31:13 +08:00
|
|
|
* define unit, scale and snapshot, fail
|
|
|
|
* if there's more than one.
|
2013-11-13 00:58:49 +08:00
|
|
|
*/
|
2017-02-15 21:06:20 +08:00
|
|
|
if ((info->unit && alias->unit[0]) ||
|
2014-11-21 17:31:13 +08:00
|
|
|
(info->scale && alias->scale) ||
|
|
|
|
(info->snapshot && alias->snapshot))
|
2013-11-13 00:58:49 +08:00
|
|
|
return -EINVAL;
|
|
|
|
|
2017-02-15 21:06:20 +08:00
|
|
|
if (alias->unit[0])
|
2014-11-21 17:31:13 +08:00
|
|
|
info->unit = alias->unit;
|
2013-11-13 00:58:49 +08:00
|
|
|
|
|
|
|
if (alias->scale)
|
2014-11-21 17:31:13 +08:00
|
|
|
info->scale = alias->scale;
|
|
|
|
|
|
|
|
if (alias->snapshot)
|
|
|
|
info->snapshot = alias->snapshot;
|
2013-11-13 00:58:49 +08:00
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
/*
|
|
|
|
* Find alias in the terms list and replace it with the terms
|
|
|
|
* defined for the alias
|
|
|
|
*/
|
2013-11-13 00:58:49 +08:00
|
|
|
int perf_pmu__check_alias(struct perf_pmu *pmu, struct list_head *head_terms,
|
2014-09-24 22:04:06 +08:00
|
|
|
struct perf_pmu_info *info)
|
2012-06-15 14:31:41 +08:00
|
|
|
{
|
2013-01-19 03:29:49 +08:00
|
|
|
struct parse_events_term *term, *h;
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_alias *alias;
|
2012-06-15 14:31:41 +08:00
|
|
|
int ret;
|
|
|
|
|
2014-11-21 17:31:12 +08:00
|
|
|
info->per_pkg = false;
|
|
|
|
|
2014-01-17 23:34:05 +08:00
|
|
|
/*
|
|
|
|
* Mark unit and scale as not set
|
|
|
|
* (different from default values, see below)
|
|
|
|
*/
|
2014-11-21 17:31:13 +08:00
|
|
|
info->unit = NULL;
|
|
|
|
info->scale = 0.0;
|
|
|
|
info->snapshot = false;
|
perf stat: Output JSON MetricExpr metric
Add generic infrastructure to perf stat to output ratios for
"MetricExpr" entries in the event lists. Many events are more useful as
ratios than in raw form, typically some count in relation to total
ticks.
Transfer the MetricExpr information from the alias to the evsel.
We mark the events that need to be collected for MetricExpr, and also
link the events using them with a pointer. The code is careful to always
prefer the right event in the same group to minimize multiplexing
errors. At the moment only a single relation is supported.
Then add a rblist to the stat shadow code that remembers stats based on
the cpu and context.
Then finally update and retrieve and print these values similarly to the
existing hardcoded perf metrics. We use the simple expression parser
added earlier to evaluate the expression.
Normally we just output the result without further commentary, but for
--metric-only this would lead to empty columns. So for this case use the
original event as description.
There is no attempt to automatically add the MetricExpr event, if it is
missing, however we suggest it to the user, because the user tool
doesn't have enough information to reliably construct a group that is
guaranteed to schedule. So we leave that to the user.
% perf stat -a -I 1000 -e '{unc_p_clockticks,unc_p_freq_max_os_cycles}'
1.000147889 800,085,181 unc_p_clockticks
1.000147889 93,126,241 unc_p_freq_max_os_cycles # 11.6
2.000448381 800,218,217 unc_p_clockticks
2.000448381 142,516,095 unc_p_freq_max_os_cycles # 17.8
3.000639852 800,243,057 unc_p_clockticks
3.000639852 162,292,689 unc_p_freq_max_os_cycles # 20.3
% perf stat -a -I 1000 -e '{unc_p_clockticks,unc_p_freq_max_os_cycles}' --metric-only
# time freq_max_os_cycles %
1.000127077 0.9
2.000301436 0.7
3.000456379 0.0
v2: Change from DivideBy to MetricExpr
v3: Use expr__ prefix. Support more than one other event.
v4: Update description
v5: Only print warning message once for multiple PMUs.
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Link: http://lkml.kernel.org/r/20170320201711.14142-11-andi@firstfloor.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-03-21 04:17:08 +08:00
|
|
|
info->metric_expr = NULL;
|
2017-03-21 04:17:10 +08:00
|
|
|
info->metric_name = NULL;
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
list_for_each_entry_safe(term, h, head_terms, list) {
|
|
|
|
alias = pmu_find_alias(pmu, term);
|
|
|
|
if (!alias)
|
|
|
|
continue;
|
|
|
|
ret = pmu_alias_terms(alias, &term->list);
|
|
|
|
if (ret)
|
|
|
|
return ret;
|
2013-11-13 00:58:49 +08:00
|
|
|
|
2014-11-21 17:31:13 +08:00
|
|
|
ret = check_info_data(alias, info);
|
2013-11-13 00:58:49 +08:00
|
|
|
if (ret)
|
|
|
|
return ret;
|
|
|
|
|
2014-11-21 17:31:12 +08:00
|
|
|
if (alias->per_pkg)
|
|
|
|
info->per_pkg = true;
|
perf stat: Output JSON MetricExpr metric
Add generic infrastructure to perf stat to output ratios for
"MetricExpr" entries in the event lists. Many events are more useful as
ratios than in raw form, typically some count in relation to total
ticks.
Transfer the MetricExpr information from the alias to the evsel.
We mark the events that need to be collected for MetricExpr, and also
link the events using them with a pointer. The code is careful to always
prefer the right event in the same group to minimize multiplexing
errors. At the moment only a single relation is supported.
Then add a rblist to the stat shadow code that remembers stats based on
the cpu and context.
Then finally update and retrieve and print these values similarly to the
existing hardcoded perf metrics. We use the simple expression parser
added earlier to evaluate the expression.
Normally we just output the result without further commentary, but for
--metric-only this would lead to empty columns. So for this case use the
original event as description.
There is no attempt to automatically add the MetricExpr event, if it is
missing, however we suggest it to the user, because the user tool
doesn't have enough information to reliably construct a group that is
guaranteed to schedule. So we leave that to the user.
% perf stat -a -I 1000 -e '{unc_p_clockticks,unc_p_freq_max_os_cycles}'
1.000147889 800,085,181 unc_p_clockticks
1.000147889 93,126,241 unc_p_freq_max_os_cycles # 11.6
2.000448381 800,218,217 unc_p_clockticks
2.000448381 142,516,095 unc_p_freq_max_os_cycles # 17.8
3.000639852 800,243,057 unc_p_clockticks
3.000639852 162,292,689 unc_p_freq_max_os_cycles # 20.3
% perf stat -a -I 1000 -e '{unc_p_clockticks,unc_p_freq_max_os_cycles}' --metric-only
# time freq_max_os_cycles %
1.000127077 0.9
2.000301436 0.7
3.000456379 0.0
v2: Change from DivideBy to MetricExpr
v3: Use expr__ prefix. Support more than one other event.
v4: Update description
v5: Only print warning message once for multiple PMUs.
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Link: http://lkml.kernel.org/r/20170320201711.14142-11-andi@firstfloor.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-03-21 04:17:08 +08:00
|
|
|
info->metric_expr = alias->metric_expr;
|
2017-03-21 04:17:10 +08:00
|
|
|
info->metric_name = alias->metric_name;
|
2014-11-21 17:31:12 +08:00
|
|
|
|
2019-07-04 23:13:46 +08:00
|
|
|
list_del_init(&term->list);
|
2019-10-31 06:34:47 +08:00
|
|
|
parse_events_term__delete(term);
|
2012-06-15 14:31:41 +08:00
|
|
|
}
|
2014-01-17 23:34:05 +08:00
|
|
|
|
|
|
|
/*
|
2021-03-24 00:09:15 +08:00
|
|
|
* if no unit or scale found in aliases, then
|
2014-01-17 23:34:05 +08:00
|
|
|
* set defaults as for evsel
|
|
|
|
* unit cannot left to NULL
|
|
|
|
*/
|
2014-09-24 22:04:06 +08:00
|
|
|
if (info->unit == NULL)
|
|
|
|
info->unit = "";
|
2014-01-17 23:34:05 +08:00
|
|
|
|
2014-09-24 22:04:06 +08:00
|
|
|
if (info->scale == 0.0)
|
|
|
|
info->scale = 1.0;
|
2014-01-17 23:34:05 +08:00
|
|
|
|
2012-06-15 14:31:41 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2012-03-16 03:09:17 +08:00
|
|
|
int perf_pmu__new_format(struct list_head *list, char *name,
|
|
|
|
int config, unsigned long *bits)
|
|
|
|
{
|
2013-01-19 03:54:00 +08:00
|
|
|
struct perf_pmu_format *format;
|
2012-03-16 03:09:17 +08:00
|
|
|
|
|
|
|
format = zalloc(sizeof(*format));
|
|
|
|
if (!format)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
format->name = strdup(name);
|
|
|
|
format->value = config;
|
|
|
|
memcpy(format->bits, bits, sizeof(format->bits));
|
|
|
|
|
|
|
|
list_add_tail(&format->list, list);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
void perf_pmu__set_format(unsigned long *bits, long from, long to)
|
|
|
|
{
|
|
|
|
long b;
|
|
|
|
|
|
|
|
if (!to)
|
|
|
|
to = from;
|
|
|
|
|
2013-01-18 01:11:30 +08:00
|
|
|
memset(bits, 0, BITS_TO_BYTES(PERF_PMU_FORMAT_BITS));
|
2012-03-16 03:09:17 +08:00
|
|
|
for (b = from; b <= to; b++)
|
|
|
|
set_bit(b, bits);
|
|
|
|
}
|
2013-04-21 02:02:29 +08:00
|
|
|
|
2020-09-15 11:18:19 +08:00
|
|
|
void perf_pmu__del_formats(struct list_head *formats)
|
|
|
|
{
|
|
|
|
struct perf_pmu_format *fmt, *tmp;
|
|
|
|
|
|
|
|
list_for_each_entry_safe(fmt, tmp, formats, list) {
|
|
|
|
list_del(&fmt->list);
|
|
|
|
free(fmt->name);
|
|
|
|
free(fmt);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-01-08 09:13:51 +08:00
|
|
|
static int sub_non_neg(int a, int b)
|
|
|
|
{
|
|
|
|
if (b > a)
|
|
|
|
return 0;
|
|
|
|
return a - b;
|
|
|
|
}
|
|
|
|
|
2013-04-21 02:02:29 +08:00
|
|
|
static char *format_alias(char *buf, int len, struct perf_pmu *pmu,
|
|
|
|
struct perf_pmu_alias *alias)
|
|
|
|
{
|
2015-01-08 09:13:51 +08:00
|
|
|
struct parse_events_term *term;
|
|
|
|
int used = snprintf(buf, len, "%s/%s", pmu->name, alias->name);
|
|
|
|
|
|
|
|
list_for_each_entry(term, &alias->terms, list) {
|
|
|
|
if (term->type_val == PARSE_EVENTS__TERM_TYPE_STR)
|
|
|
|
used += snprintf(buf + used, sub_non_neg(len, used),
|
|
|
|
",%s=%s", term->config,
|
|
|
|
term->val.str);
|
|
|
|
}
|
|
|
|
|
|
|
|
if (sub_non_neg(len, used) > 0) {
|
|
|
|
buf[used] = '/';
|
|
|
|
used++;
|
|
|
|
}
|
|
|
|
if (sub_non_neg(len, used) > 0) {
|
|
|
|
buf[used] = '\0';
|
|
|
|
used++;
|
|
|
|
} else
|
|
|
|
buf[len - 1] = '\0';
|
|
|
|
|
2013-04-21 02:02:29 +08:00
|
|
|
return buf;
|
|
|
|
}
|
|
|
|
|
|
|
|
static char *format_alias_or(char *buf, int len, struct perf_pmu *pmu,
|
|
|
|
struct perf_pmu_alias *alias)
|
|
|
|
{
|
|
|
|
snprintf(buf, len, "%s OR %s/%s/", alias->name, pmu->name, alias->name);
|
|
|
|
return buf;
|
|
|
|
}
|
|
|
|
|
2016-09-16 06:24:50 +08:00
|
|
|
struct sevent {
|
2016-09-16 06:24:43 +08:00
|
|
|
char *name;
|
|
|
|
char *desc;
|
2016-09-16 06:24:50 +08:00
|
|
|
char *topic;
|
2017-01-28 10:03:40 +08:00
|
|
|
char *str;
|
|
|
|
char *pmu;
|
2017-03-21 04:17:09 +08:00
|
|
|
char *metric_expr;
|
2017-03-21 04:17:10 +08:00
|
|
|
char *metric_name;
|
2020-06-17 17:01:54 +08:00
|
|
|
int is_cpu;
|
2016-09-16 06:24:43 +08:00
|
|
|
};
|
|
|
|
|
2016-09-16 06:24:50 +08:00
|
|
|
static int cmp_sevent(const void *a, const void *b)
|
2016-09-16 06:24:43 +08:00
|
|
|
{
|
2016-09-16 06:24:50 +08:00
|
|
|
const struct sevent *as = a;
|
|
|
|
const struct sevent *bs = b;
|
2016-09-16 06:24:43 +08:00
|
|
|
|
|
|
|
/* Put extra events last */
|
|
|
|
if (!!as->desc != !!bs->desc)
|
|
|
|
return !!as->desc - !!bs->desc;
|
2016-09-16 06:24:50 +08:00
|
|
|
if (as->topic && bs->topic) {
|
|
|
|
int n = strcmp(as->topic, bs->topic);
|
|
|
|
|
|
|
|
if (n)
|
|
|
|
return n;
|
|
|
|
}
|
2020-06-17 17:01:54 +08:00
|
|
|
|
|
|
|
/* Order CPU core events to be first */
|
|
|
|
if (as->is_cpu != bs->is_cpu)
|
|
|
|
return bs->is_cpu - as->is_cpu;
|
|
|
|
|
2016-09-16 06:24:43 +08:00
|
|
|
return strcmp(as->name, bs->name);
|
|
|
|
}
|
|
|
|
|
|
|
|
static void wordwrap(char *s, int start, int max, int corr)
|
2013-04-21 02:02:29 +08:00
|
|
|
{
|
2016-09-16 06:24:43 +08:00
|
|
|
int column = start;
|
|
|
|
int n;
|
|
|
|
|
|
|
|
while (*s) {
|
|
|
|
int wlen = strcspn(s, " \t");
|
|
|
|
|
|
|
|
if (column + wlen >= max && column > start) {
|
|
|
|
printf("\n%*s", start, "");
|
|
|
|
column = start + corr;
|
|
|
|
}
|
|
|
|
n = printf("%s%.*s", column > start ? " " : "", wlen, s);
|
|
|
|
if (n <= 0)
|
|
|
|
break;
|
|
|
|
s += wlen;
|
|
|
|
column += n;
|
2019-06-26 22:42:03 +08:00
|
|
|
s = skip_spaces(s);
|
2016-09-16 06:24:43 +08:00
|
|
|
}
|
2013-04-21 02:02:29 +08:00
|
|
|
}
|
|
|
|
|
2020-03-17 19:02:17 +08:00
|
|
|
bool is_pmu_core(const char *name)
|
|
|
|
{
|
|
|
|
return !strcmp(name, "cpu") || is_arm_pmu_core(name);
|
|
|
|
}
|
|
|
|
|
2016-09-16 06:24:48 +08:00
|
|
|
void print_pmu_events(const char *event_glob, bool name_only, bool quiet_flag,
|
2019-10-15 10:53:57 +08:00
|
|
|
bool long_desc, bool details_flag, bool deprecated)
|
2013-04-21 02:02:29 +08:00
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
struct perf_pmu_alias *alias;
|
|
|
|
char buf[1024];
|
|
|
|
int printed = 0;
|
|
|
|
int len, j;
|
2016-09-16 06:24:50 +08:00
|
|
|
struct sevent *aliases;
|
2016-09-16 06:24:43 +08:00
|
|
|
int numdesc = 0;
|
2016-09-16 06:24:44 +08:00
|
|
|
int columns = pager_get_columns();
|
2016-09-16 06:24:50 +08:00
|
|
|
char *topic = NULL;
|
2013-04-21 02:02:29 +08:00
|
|
|
|
|
|
|
pmu = NULL;
|
|
|
|
len = 0;
|
2014-10-23 18:45:10 +08:00
|
|
|
while ((pmu = perf_pmu__scan(pmu)) != NULL) {
|
2013-04-21 02:02:29 +08:00
|
|
|
list_for_each_entry(alias, &pmu->aliases, list)
|
|
|
|
len++;
|
2014-10-23 18:45:10 +08:00
|
|
|
if (pmu->selectable)
|
|
|
|
len++;
|
|
|
|
}
|
2016-09-16 06:24:50 +08:00
|
|
|
aliases = zalloc(sizeof(struct sevent) * len);
|
2013-04-21 02:02:29 +08:00
|
|
|
if (!aliases)
|
2014-10-24 21:25:09 +08:00
|
|
|
goto out_enomem;
|
2013-04-21 02:02:29 +08:00
|
|
|
pmu = NULL;
|
|
|
|
j = 0;
|
2014-10-23 18:45:10 +08:00
|
|
|
while ((pmu = perf_pmu__scan(pmu)) != NULL) {
|
2013-04-21 02:02:29 +08:00
|
|
|
list_for_each_entry(alias, &pmu->aliases, list) {
|
2016-09-16 06:24:43 +08:00
|
|
|
char *name = alias->desc ? alias->name :
|
|
|
|
format_alias(buf, sizeof(buf), pmu, alias);
|
2020-06-17 17:01:53 +08:00
|
|
|
bool is_cpu = is_pmu_core(pmu->name);
|
2013-04-21 02:02:29 +08:00
|
|
|
|
2019-10-15 10:53:57 +08:00
|
|
|
if (alias->deprecated && !deprecated)
|
|
|
|
continue;
|
|
|
|
|
2013-04-21 02:02:29 +08:00
|
|
|
if (event_glob != NULL &&
|
2016-10-20 01:50:01 +08:00
|
|
|
!(strglobmatch_nocase(name, event_glob) ||
|
|
|
|
(!is_cpu && strglobmatch_nocase(alias->name,
|
2016-10-20 02:45:23 +08:00
|
|
|
event_glob)) ||
|
|
|
|
(alias->topic &&
|
|
|
|
strglobmatch_nocase(alias->topic, event_glob))))
|
2013-04-21 02:02:29 +08:00
|
|
|
continue;
|
2014-10-24 21:25:09 +08:00
|
|
|
|
2016-09-16 06:24:43 +08:00
|
|
|
if (is_cpu && !name_only && !alias->desc)
|
2014-10-24 21:25:09 +08:00
|
|
|
name = format_alias_or(buf, sizeof(buf), pmu, alias);
|
|
|
|
|
2016-09-16 06:24:43 +08:00
|
|
|
aliases[j].name = name;
|
|
|
|
if (is_cpu && !name_only && !alias->desc)
|
|
|
|
aliases[j].name = format_alias_or(buf,
|
|
|
|
sizeof(buf),
|
|
|
|
pmu, alias);
|
|
|
|
aliases[j].name = strdup(aliases[j].name);
|
|
|
|
if (!aliases[j].name)
|
2014-10-24 21:25:09 +08:00
|
|
|
goto out_enomem;
|
2016-09-16 06:24:43 +08:00
|
|
|
|
2016-09-16 06:24:48 +08:00
|
|
|
aliases[j].desc = long_desc ? alias->long_desc :
|
|
|
|
alias->desc;
|
2016-09-16 06:24:50 +08:00
|
|
|
aliases[j].topic = alias->topic;
|
2017-01-28 10:03:40 +08:00
|
|
|
aliases[j].str = alias->str;
|
|
|
|
aliases[j].pmu = pmu->name;
|
2017-03-21 04:17:09 +08:00
|
|
|
aliases[j].metric_expr = alias->metric_expr;
|
2017-03-21 04:17:10 +08:00
|
|
|
aliases[j].metric_name = alias->metric_name;
|
2020-06-17 17:01:54 +08:00
|
|
|
aliases[j].is_cpu = is_cpu;
|
2013-04-21 02:02:29 +08:00
|
|
|
j++;
|
|
|
|
}
|
2015-10-03 02:28:16 +08:00
|
|
|
if (pmu->selectable &&
|
|
|
|
(event_glob == NULL || strglobmatch(pmu->name, event_glob))) {
|
2014-10-24 21:25:09 +08:00
|
|
|
char *s;
|
|
|
|
if (asprintf(&s, "%s//", pmu->name) < 0)
|
|
|
|
goto out_enomem;
|
2016-09-16 06:24:43 +08:00
|
|
|
aliases[j].name = s;
|
2014-10-23 18:45:10 +08:00
|
|
|
j++;
|
|
|
|
}
|
|
|
|
}
|
2013-04-21 02:02:29 +08:00
|
|
|
len = j;
|
2016-09-16 06:24:50 +08:00
|
|
|
qsort(aliases, len, sizeof(struct sevent), cmp_sevent);
|
2013-04-21 02:02:29 +08:00
|
|
|
for (j = 0; j < len; j++) {
|
2017-01-28 10:03:38 +08:00
|
|
|
/* Skip duplicates */
|
|
|
|
if (j > 0 && !strcmp(aliases[j].name, aliases[j - 1].name))
|
|
|
|
continue;
|
2013-04-21 02:02:29 +08:00
|
|
|
if (name_only) {
|
2016-09-16 06:24:43 +08:00
|
|
|
printf("%s ", aliases[j].name);
|
2013-04-21 02:02:29 +08:00
|
|
|
continue;
|
|
|
|
}
|
2016-09-16 06:24:45 +08:00
|
|
|
if (aliases[j].desc && !quiet_flag) {
|
2016-09-16 06:24:43 +08:00
|
|
|
if (numdesc++ == 0)
|
|
|
|
printf("\n");
|
2016-09-16 06:24:50 +08:00
|
|
|
if (aliases[j].topic && (!topic ||
|
|
|
|
strcmp(topic, aliases[j].topic))) {
|
|
|
|
printf("%s%s:\n", topic ? "\n" : "",
|
|
|
|
aliases[j].topic);
|
|
|
|
topic = aliases[j].topic;
|
|
|
|
}
|
2016-09-16 06:24:43 +08:00
|
|
|
printf(" %-50s\n", aliases[j].name);
|
|
|
|
printf("%*s", 8, "[");
|
|
|
|
wordwrap(aliases[j].desc, 8, columns, 0);
|
|
|
|
printf("]\n");
|
2017-03-21 04:17:11 +08:00
|
|
|
if (details_flag) {
|
2017-03-21 04:17:09 +08:00
|
|
|
printf("%*s%s/%s/ ", 8, "", aliases[j].pmu, aliases[j].str);
|
2017-03-21 04:17:10 +08:00
|
|
|
if (aliases[j].metric_name)
|
|
|
|
printf(" MetricName: %s", aliases[j].metric_name);
|
2017-03-21 04:17:09 +08:00
|
|
|
if (aliases[j].metric_expr)
|
|
|
|
printf(" MetricExpr: %s", aliases[j].metric_expr);
|
|
|
|
putchar('\n');
|
|
|
|
}
|
2016-09-16 06:24:43 +08:00
|
|
|
} else
|
|
|
|
printf(" %-50s [Kernel PMU event]\n", aliases[j].name);
|
2013-04-21 02:02:29 +08:00
|
|
|
printed++;
|
|
|
|
}
|
2015-10-01 04:13:26 +08:00
|
|
|
if (printed && pager_in_use())
|
2013-04-21 02:02:29 +08:00
|
|
|
printf("\n");
|
2014-10-24 21:25:09 +08:00
|
|
|
out_free:
|
|
|
|
for (j = 0; j < len; j++)
|
2016-09-16 06:24:43 +08:00
|
|
|
zfree(&aliases[j].name);
|
2014-10-24 21:25:09 +08:00
|
|
|
zfree(&aliases);
|
|
|
|
return;
|
|
|
|
|
|
|
|
out_enomem:
|
|
|
|
printf("FATAL: not enough memory to print PMU events\n");
|
|
|
|
if (aliases)
|
|
|
|
goto out_free;
|
2013-04-21 02:02:29 +08:00
|
|
|
}
|
2013-08-22 07:47:26 +08:00
|
|
|
|
|
|
|
bool pmu_have_event(const char *pname, const char *name)
|
|
|
|
{
|
|
|
|
struct perf_pmu *pmu;
|
|
|
|
struct perf_pmu_alias *alias;
|
|
|
|
|
|
|
|
pmu = NULL;
|
|
|
|
while ((pmu = perf_pmu__scan(pmu)) != NULL) {
|
|
|
|
if (strcmp(pname, pmu->name))
|
|
|
|
continue;
|
|
|
|
list_for_each_entry(alias, &pmu->aliases, list)
|
|
|
|
if (!strcmp(alias->name, name))
|
|
|
|
return true;
|
|
|
|
}
|
|
|
|
return false;
|
|
|
|
}
|
2014-07-31 14:00:50 +08:00
|
|
|
|
|
|
|
static FILE *perf_pmu__open_file(struct perf_pmu *pmu, const char *name)
|
|
|
|
{
|
|
|
|
char path[PATH_MAX];
|
|
|
|
const char *sysfs;
|
|
|
|
|
|
|
|
sysfs = sysfs__mountpoint();
|
|
|
|
if (!sysfs)
|
|
|
|
return NULL;
|
|
|
|
|
|
|
|
snprintf(path, PATH_MAX,
|
|
|
|
"%s" EVENT_SOURCE_DEVICE_PATH "%s/%s", sysfs, pmu->name, name);
|
2019-11-21 08:15:11 +08:00
|
|
|
if (!file_available(path))
|
2014-07-31 14:00:50 +08:00
|
|
|
return NULL;
|
|
|
|
return fopen(path, "r");
|
|
|
|
}
|
|
|
|
|
|
|
|
int perf_pmu__scan_file(struct perf_pmu *pmu, const char *name, const char *fmt,
|
|
|
|
...)
|
|
|
|
{
|
|
|
|
va_list args;
|
|
|
|
FILE *file;
|
|
|
|
int ret = EOF;
|
|
|
|
|
|
|
|
va_start(args, fmt);
|
|
|
|
file = perf_pmu__open_file(pmu, name);
|
|
|
|
if (file) {
|
|
|
|
ret = vfscanf(file, fmt, args);
|
|
|
|
fclose(file);
|
|
|
|
}
|
|
|
|
va_end(args);
|
|
|
|
return ret;
|
|
|
|
}
|
2020-03-20 04:25:01 +08:00
|
|
|
|
|
|
|
static int perf_pmu__new_caps(struct list_head *list, char *name, char *value)
|
|
|
|
{
|
|
|
|
struct perf_pmu_caps *caps = zalloc(sizeof(*caps));
|
|
|
|
|
|
|
|
if (!caps)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
caps->name = strdup(name);
|
|
|
|
if (!caps->name)
|
|
|
|
goto free_caps;
|
|
|
|
caps->value = strndup(value, strlen(value) - 1);
|
|
|
|
if (!caps->value)
|
|
|
|
goto free_name;
|
|
|
|
list_add_tail(&caps->list, list);
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
free_name:
|
|
|
|
zfree(caps->name);
|
|
|
|
free_caps:
|
|
|
|
free(caps);
|
|
|
|
|
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Reading/parsing the given pmu capabilities, which should be located at:
|
|
|
|
* /sys/bus/event_source/devices/<dev>/caps as sysfs group attributes.
|
|
|
|
* Return the number of capabilities
|
|
|
|
*/
|
|
|
|
int perf_pmu__caps_parse(struct perf_pmu *pmu)
|
|
|
|
{
|
|
|
|
struct stat st;
|
|
|
|
char caps_path[PATH_MAX];
|
|
|
|
const char *sysfs = sysfs__mountpoint();
|
|
|
|
DIR *caps_dir;
|
|
|
|
struct dirent *evt_ent;
|
|
|
|
int nr_caps = 0;
|
|
|
|
|
|
|
|
if (!sysfs)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
snprintf(caps_path, PATH_MAX,
|
|
|
|
"%s" EVENT_SOURCE_DEVICE_PATH "%s/caps", sysfs, pmu->name);
|
|
|
|
|
|
|
|
if (stat(caps_path, &st) < 0)
|
|
|
|
return 0; /* no error if caps does not exist */
|
|
|
|
|
|
|
|
caps_dir = opendir(caps_path);
|
|
|
|
if (!caps_dir)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
while ((evt_ent = readdir(caps_dir)) != NULL) {
|
|
|
|
char path[PATH_MAX + NAME_MAX + 1];
|
|
|
|
char *name = evt_ent->d_name;
|
|
|
|
char value[128];
|
|
|
|
FILE *file;
|
|
|
|
|
|
|
|
if (!strcmp(name, ".") || !strcmp(name, ".."))
|
|
|
|
continue;
|
|
|
|
|
|
|
|
snprintf(path, sizeof(path), "%s/%s", caps_path, name);
|
|
|
|
|
|
|
|
file = fopen(path, "r");
|
|
|
|
if (!file)
|
|
|
|
continue;
|
|
|
|
|
|
|
|
if (!fgets(value, sizeof(value), file) ||
|
|
|
|
(perf_pmu__new_caps(&pmu->caps, name, value) < 0)) {
|
|
|
|
fclose(file);
|
|
|
|
continue;
|
|
|
|
}
|
|
|
|
|
|
|
|
nr_caps++;
|
|
|
|
fclose(file);
|
|
|
|
}
|
|
|
|
|
|
|
|
closedir(caps_dir);
|
|
|
|
|
|
|
|
return nr_caps;
|
|
|
|
}
|
2021-03-10 13:11:38 +08:00
|
|
|
|
|
|
|
void perf_pmu__warn_invalid_config(struct perf_pmu *pmu, __u64 config,
|
|
|
|
char *name)
|
|
|
|
{
|
|
|
|
struct perf_pmu_format *format;
|
|
|
|
__u64 masks = 0, bits;
|
|
|
|
char buf[100];
|
|
|
|
unsigned int i;
|
|
|
|
|
|
|
|
list_for_each_entry(format, &pmu->format, list) {
|
|
|
|
if (format->value != PERF_PMU_FORMAT_VALUE_CONFIG)
|
|
|
|
continue;
|
|
|
|
|
|
|
|
for_each_set_bit(i, format->bits, PERF_PMU_FORMAT_BITS)
|
|
|
|
masks |= 1ULL << i;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Kernel doesn't export any valid format bits.
|
|
|
|
*/
|
|
|
|
if (masks == 0)
|
|
|
|
return;
|
|
|
|
|
|
|
|
bits = config & ~masks;
|
|
|
|
if (bits == 0)
|
|
|
|
return;
|
|
|
|
|
|
|
|
bitmap_scnprintf((unsigned long *)&bits, sizeof(bits) * 8, buf, sizeof(buf));
|
|
|
|
|
|
|
|
pr_warning("WARNING: event '%s' not valid (bits %s of config "
|
|
|
|
"'%llx' not supported by kernel)!\n",
|
|
|
|
name ?: "N/A", buf, config);
|
|
|
|
}
|
2021-04-27 15:01:19 +08:00
|
|
|
|
|
|
|
bool perf_pmu__has_hybrid(void)
|
|
|
|
{
|
|
|
|
if (!hybrid_scanned) {
|
|
|
|
hybrid_scanned = true;
|
|
|
|
perf_pmu__scan(NULL);
|
|
|
|
}
|
|
|
|
|
|
|
|
return !list_empty(&perf_pmu__hybrid_pmus);
|
|
|
|
}
|
2021-07-01 14:42:53 +08:00
|
|
|
|
|
|
|
int perf_pmu__match(char *pattern, char *name, char *tok)
|
|
|
|
{
|
perf pmu: Add PMU alias support
A perf uncore PMU may have two PMU names, a real name and an alias. The
alias is exported at /sys/bus/event_source/devices/uncore_*/alias.
The perf tool should support the alias as well.
Add alias_name in the struct perf_pmu to store the alias. For the PMU
which doesn't have an alias. It's NULL.
Introduce two X86 specific functions to retrieve the real name and the
alias separately.
Only go through the sysfs to retrieve the mapping between the real name
and the alias once. The result is cached in a list, uncore_pmu_list.
Nothing changed for the other ARCHs.
With the patch, the perf tool can monitor the PMU with either the real
name or the alias.
Use the real name,
$ perf stat -e uncore_cha_2/event=1/ -x,
4044879584,,uncore_cha_2/event=1/,2528059205,100.00,,
Use the alias,
$ perf stat -e uncore_type_0_2/event=1/ -x,
3659675336,,uncore_type_0_2/event=1/,2287306455,100.00,,
Committer notes:
Rename 'struct perf_pmu_alias_name' to 'pmu_alias', the 'perf_' prefix
should be used for libperf, things inside just tools/perf/ are being
moved away from that prefix.
Also 'pmu_alias' is shorter and reflects the abstraction.
Also don't use 'pmu' as the name for variables for that type, we should
use that for the 'struct perf_pmu' variables, avoiding confusion. Use
'pmu_alias' for 'struct pmu_alias' variables.
Co-developed-by: Jin Yao <yao.jin@linux.intel.com>
Co-developed-by: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.garry@huawei.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Link: http://lore.kernel.org/lkml/20210902065955.1299-2-yao.jin@linux.intel.com
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-09-02 14:59:54 +08:00
|
|
|
if (!name)
|
|
|
|
return -1;
|
|
|
|
|
2021-07-01 14:42:53 +08:00
|
|
|
if (fnmatch(pattern, name, 0))
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
if (tok && !perf_pmu__valid_suffix(name, tok))
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
perf tools: Enable on a list of CPUs for hybrid
The 'perf record' and 'perf stat' commands have supported the option
'-C/--cpus' to count or collect only on the list of CPUs provided. This
option needs to be supported for hybrid as well.
For hybrid support, it needs to check that the cpu list are available
on hybrid PMU. One example for AlderLake, cpu0-7 is 'cpu_core', cpu8-11
is 'cpu_atom'.
Before:
# perf stat -e cpu_core/cycles/ -C11 -- sleep 1
Performance counter stats for 'CPU(s) 11':
<not supported> cpu_core/cycles/
1.006179431 seconds time elapsed
The 'perf stat' command silently returned "<not supported>" without any
helpful information. It should error out pointing out that that cpu11
was not 'cpu_core'.
After:
# perf stat -e cpu_core/cycles/ -C11 -- sleep 1
WARNING: 11 isn't a 'cpu_core', please use a CPU list in the 'cpu_core' range (0-7)
failed to use cpu list 11
We also need to support the events without pmu prefix specified.
# perf stat -e cycles -C11 -- sleep 1
WARNING: 11 isn't a 'cpu_core', please use a CPU list in the 'cpu_core' range (0-7)
Performance counter stats for 'CPU(s) 11':
1,067,373 cpu_atom/cycles/
1.005544738 seconds time elapsed
The perf tool creates two cycles events automatically, cpu_core/cycles/ and
cpu_atom/cycles/. It checks that cpu11 is not 'cpu_core', then shows a warning
for cpu_core/cycles/ and only count the cpu_atom/cycles/.
If part of cpus are 'cpu_core' and part of cpus are 'cpu_atom', for example,
# perf stat -e cycles -C0,11 -- sleep 1
WARNING: use 0 in 'cpu_core' for 'cycles', skip other cpus in list.
WARNING: use 11 in 'cpu_atom' for 'cycles', skip other cpus in list.
Performance counter stats for 'CPU(s) 0,11':
1,914,704 cpu_core/cycles/
2,036,983 cpu_atom/cycles/
1.005815641 seconds time elapsed
It now automatically selects cpu0 for cpu_core/cycles/, selects cpu11 for
cpu_atom/cycles/, and output with some warnings.
Some more complex examples,
# perf stat -e cycles,instructions -C0,11 -- sleep 1
WARNING: use 0 in 'cpu_core' for 'cycles', skip other cpus in list.
WARNING: use 11 in 'cpu_atom' for 'cycles', skip other cpus in list.
WARNING: use 0 in 'cpu_core' for 'instructions', skip other cpus in list.
WARNING: use 11 in 'cpu_atom' for 'instructions', skip other cpus in list.
Performance counter stats for 'CPU(s) 0,11':
2,780,387 cpu_core/cycles/
1,583,432 cpu_atom/cycles/
3,957,277 cpu_core/instructions/
1,167,089 cpu_atom/instructions/
1.006005124 seconds time elapsed
# perf stat -e cycles,cpu_atom/instructions/ -C0,11 -- sleep 1
WARNING: use 0 in 'cpu_core' for 'cycles', skip other cpus in list.
WARNING: use 11 in 'cpu_atom' for 'cycles', skip other cpus in list.
WARNING: use 11 in 'cpu_atom' for 'cpu_atom/instructions/', skip other cpus in list.
Performance counter stats for 'CPU(s) 0,11':
3,290,301 cpu_core/cycles/
1,953,073 cpu_atom/cycles/
1,407,869 cpu_atom/instructions/
1.006260912 seconds time elapsed
Signed-off-by: Jin Yao <yao.jin@linux.intel.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jin Yao <yao.jin@intel.com>
Cc: Kan Liang <kan.liang@intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https //lore.kernel.org/r/20210723063433.7318-4-yao.jin@linux.intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-07-23 14:34:33 +08:00
|
|
|
|
|
|
|
int perf_pmu__cpus_match(struct perf_pmu *pmu, struct perf_cpu_map *cpus,
|
|
|
|
struct perf_cpu_map **mcpus_ptr,
|
|
|
|
struct perf_cpu_map **ucpus_ptr)
|
|
|
|
{
|
|
|
|
struct perf_cpu_map *pmu_cpus = pmu->cpus;
|
|
|
|
struct perf_cpu_map *matched_cpus, *unmatched_cpus;
|
|
|
|
int matched_nr = 0, unmatched_nr = 0;
|
|
|
|
|
|
|
|
matched_cpus = perf_cpu_map__default_new();
|
|
|
|
if (!matched_cpus)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
unmatched_cpus = perf_cpu_map__default_new();
|
|
|
|
if (!unmatched_cpus) {
|
|
|
|
perf_cpu_map__put(matched_cpus);
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
for (int i = 0; i < cpus->nr; i++) {
|
|
|
|
int cpu;
|
|
|
|
|
|
|
|
cpu = perf_cpu_map__idx(pmu_cpus, cpus->map[i]);
|
|
|
|
if (cpu == -1)
|
|
|
|
unmatched_cpus->map[unmatched_nr++] = cpus->map[i];
|
|
|
|
else
|
|
|
|
matched_cpus->map[matched_nr++] = cpus->map[i];
|
|
|
|
}
|
|
|
|
|
|
|
|
unmatched_cpus->nr = unmatched_nr;
|
|
|
|
matched_cpus->nr = matched_nr;
|
|
|
|
*mcpus_ptr = matched_cpus;
|
|
|
|
*ucpus_ptr = unmatched_cpus;
|
|
|
|
return 0;
|
|
|
|
}
|