/*
 *  (c) 2005-2015 Advanced Micro Devices, Inc.
 *  Your use of this code is subject to the terms and conditions of the
 *  GNU general public license version 2. See "COPYING" or
 *  http://www.gnu.org/licenses/gpl.html
 *
 *  Written by Jacob Shin - AMD, Inc.
 *  Maintained by: Borislav Petkov <bp@alien8.de>
 *
 *  All MC4_MISCi registers are shared between cores on a node.
 */
#include <linux/interrupt.h>
#include <linux/notifier.h>
#include <linux/kobject.h>
#include <linux/percpu.h>
#include <linux/errno.h>
#include <linux/sched.h>
#include <linux/sysfs.h>
#include <linux/slab.h>
#include <linux/init.h>
#include <linux/cpu.h>
#include <linux/smp.h>

#include <asm/amd_nb.h>
#include <asm/apic.h>
#include <asm/idle.h>
#include <asm/mce.h>
#include <asm/msr.h>
#include <asm/trace/irq_vectors.h>

#define NR_BLOCKS 9
#define THRESHOLD_MAX 0xFFF
#define INT_TYPE_APIC 0x00020000
#define MASK_VALID_HI 0x80000000
#define MASK_CNTP_HI 0x40000000
#define MASK_LOCKED_HI 0x20000000
#define MASK_LVTOFF_HI 0x00F00000
#define MASK_COUNT_EN_HI 0x00080000
#define MASK_INT_TYPE_HI 0x00060000
#define MASK_OVERFLOW_HI 0x00010000
#define MASK_ERR_COUNT_HI 0x00000FFF
#define MASK_BLKPTR_LO 0xFF000000
#define MCG_XBLK_ADDR 0xC0000400

/* Deferred error settings */
#define MSR_CU_DEF_ERR 0xC0000410
#define MASK_DEF_LVTOFF 0x000000F0
#define MASK_DEF_INT_TYPE 0x00000006
#define DEF_LVT_OFF 0x2
#define DEF_INT_TYPE_APIC 0x2

static const char * const th_names[] = {
	"load_store",
	"insn_fetch",
	"combined_unit",
	"",
	"northbridge",
	"execution_unit",
};

static DEFINE_PER_CPU(struct threshold_bank **, threshold_banks);
static DEFINE_PER_CPU(unsigned char, bank_map);	/* see which banks are on */

static void amd_threshold_interrupt(void);
static void amd_deferred_error_interrupt(void);

static void default_deferred_error_interrupt(void)
{
	pr_err("Unexpected deferred interrupt at vector %x\n", DEFERRED_ERROR_VECTOR);
}
void (*deferred_error_int_vector)(void) = default_deferred_error_interrupt;

/*
 * CPU Initialization
 */

struct thresh_restart {
	struct threshold_block *b;
	int reset;
	int set_lvt_off;
	int lvt_off;
	u16 old_limit;
};

static inline bool is_shared_bank(int bank)
{
	/* Bank 4 is for northbridge reporting and is thus shared */
	return (bank == 4);
}

static const char *bank4_names(const struct threshold_block *b)
{
	switch (b->address) {
	/* MSR4_MISC0 */
	case 0x00000413:
		return "dram";

	case 0xc0000408:
		return "ht_links";

	case 0xc0000409:
		return "l3_cache";

	default:
		WARN(1, "Funny MSR: 0x%08x\n", b->address);
		return "";
	}
}

static bool lvt_interrupt_supported(unsigned int bank, u32 msr_high_bits)
{
	/*
	 * bank 4 supports APIC LVT interrupts implicitly since forever.
	 */
	if (bank == 4)
		return true;

	/*
	 * IntP: interrupt present; if this bit is set, the thresholding
	 * bank can generate APIC LVT interrupts
	 */
	return msr_high_bits & BIT(28);
}

static int lvt_off_valid(struct threshold_block *b, int apic, u32 lo, u32 hi)
{
	int msr = (hi & MASK_LVTOFF_HI) >> 20;

	if (apic < 0) {
		pr_err(FW_BUG "cpu %d, failed to setup threshold interrupt "
		       "for bank %d, block %d (MSR%08X=0x%x%08x)\n", b->cpu,
		       b->bank, b->block, b->address, hi, lo);
		return 0;
	}

	if (apic != msr) {
		pr_err(FW_BUG "cpu %d, invalid threshold interrupt offset %d "
		       "for bank %d, block %d (MSR%08X=0x%x%08x)\n",
		       b->cpu, apic, b->bank, b->block, b->address, hi, lo);
		return 0;
	}

	return 1;
}

/*
 * Called via smp_call_function_single(), must be called with correct
 * cpu affinity.
 */
static void threshold_restart_bank(void *_tr)
{
	struct thresh_restart *tr = _tr;
	u32 hi, lo;

	rdmsr(tr->b->address, lo, hi);

	if (tr->b->threshold_limit < (hi & THRESHOLD_MAX))
		tr->reset = 1;	/* limit cannot be lower than err count */

	if (tr->reset) {		/* reset err count and overflow bit */
		hi =
		    (hi & ~(MASK_ERR_COUNT_HI | MASK_OVERFLOW_HI)) |
		    (THRESHOLD_MAX - tr->b->threshold_limit);
	} else if (tr->old_limit) {	/* change limit w/o reset */
		int new_count = (hi & THRESHOLD_MAX) +
		    (tr->old_limit - tr->b->threshold_limit);

		hi = (hi & ~MASK_ERR_COUNT_HI) |
		    (new_count & THRESHOLD_MAX);
	}

	/* clear IntType */
	hi &= ~MASK_INT_TYPE_HI;

	if (!tr->b->interrupt_capable)
		goto done;

	if (tr->set_lvt_off) {
		if (lvt_off_valid(tr->b, tr->lvt_off, lo, hi)) {
			/* set new lvt offset */
			hi &= ~MASK_LVTOFF_HI;
			hi |= tr->lvt_off << 20;
		}
	}

	if (tr->b->interrupt_enable)
		hi |= INT_TYPE_APIC;

 done:

	hi |= MASK_COUNT_EN_HI;
	wrmsr(tr->b->address, lo, hi);
}

static void mce_threshold_block_init(struct threshold_block *b, int offset)
{
	struct thresh_restart tr = {
		.b = b,
		.set_lvt_off = 1,
		.lvt_off = offset,
	};

	b->threshold_limit = THRESHOLD_MAX;
	threshold_restart_bank(&tr);
}

static int setup_APIC_mce_threshold(int reserved, int new)
{
	if (reserved < 0 && !setup_APIC_eilvt(new, THRESHOLD_APIC_VECTOR,
					      APIC_EILVT_MSG_FIX, 0))
		return new;

	return reserved;
}

static int setup_APIC_deferred_error(int reserved, int new)
{
	if (reserved < 0 && !setup_APIC_eilvt(new, DEFERRED_ERROR_VECTOR,
					      APIC_EILVT_MSG_FIX, 0))
		return new;

	return reserved;
}

static void deferred_error_interrupt_enable(struct cpuinfo_x86 *c)
{
	u32 low = 0, high = 0;
	int def_offset = -1, def_new;

	if (rdmsr_safe(MSR_CU_DEF_ERR, &low, &high))
		return;

	def_new = (low & MASK_DEF_LVTOFF) >> 4;
	if (!(low & MASK_DEF_LVTOFF)) {
		pr_err(FW_BUG "Your BIOS is not setting up LVT offset 0x2 for deferred error IRQs correctly.\n");
		def_new = DEF_LVT_OFF;
		low = (low & ~MASK_DEF_LVTOFF) | (DEF_LVT_OFF << 4);
	}

	def_offset = setup_APIC_deferred_error(def_offset, def_new);
	if ((def_offset == def_new) &&
	    (deferred_error_int_vector != amd_deferred_error_interrupt))
		deferred_error_int_vector = amd_deferred_error_interrupt;

	low = (low & ~MASK_DEF_INT_TYPE) | DEF_INT_TYPE_APIC;
	wrmsr(MSR_CU_DEF_ERR, low, high);
}

/* cpu init entry point, called from mce.c with preempt off */
void mce_amd_feature_init(struct cpuinfo_x86 *c)
{
	struct threshold_block b;
	unsigned int cpu = smp_processor_id();
	u32 low = 0, high = 0, address = 0;
	unsigned int bank, block;
	int offset = -1, new;

	for (bank = 0; bank < mca_cfg.banks; ++bank) {
		for (block = 0; block < NR_BLOCKS; ++block) {
			if (block == 0)
				address = MSR_IA32_MCx_MISC(bank);
			else if (block == 1) {
				address = (low & MASK_BLKPTR_LO) >> 21;
				if (!address)
					break;

				address += MCG_XBLK_ADDR;
			} else
				++address;

			if (rdmsr_safe(address, &low, &high))
				break;

			if (!(high & MASK_VALID_HI))
				continue;

			if (!(high & MASK_CNTP_HI) ||
			    (high & MASK_LOCKED_HI))
				continue;

			if (!block)
				per_cpu(bank_map, cpu) |= (1 << bank);

			memset(&b, 0, sizeof(b));
			b.cpu = cpu;
			b.bank = bank;
			b.block = block;
			b.address = address;
			b.interrupt_capable = lvt_interrupt_supported(bank, high);

			if (!b.interrupt_capable)
				goto init;

			b.interrupt_enable = 1;
			new = (high & MASK_LVTOFF_HI) >> 20;
			offset = setup_APIC_mce_threshold(offset, new);

			if ((offset == new) &&
			    (mce_threshold_vector != amd_threshold_interrupt))
				mce_threshold_vector = amd_threshold_interrupt;

init:
			mce_threshold_block_init(&b, offset);
		}
	}

	if (mce_flags.succor)
		deferred_error_interrupt_enable(c);
}

static void __log_error(unsigned int bank, bool threshold_err, u64 misc)
{
	struct mce m;
	u64 status;

	rdmsrl(MSR_IA32_MCx_STATUS(bank), status);
	if (!(status & MCI_STATUS_VAL))
		return;

	mce_setup(&m);

	m.status = status;
	m.bank = bank;

	if (threshold_err)
		m.misc = misc;

	if (m.status & MCI_STATUS_ADDRV)
		rdmsrl(MSR_IA32_MCx_ADDR(bank), m.addr);

	mce_log(&m);
	wrmsrl(MSR_IA32_MCx_STATUS(bank), 0);
}

static inline void __smp_deferred_error_interrupt(void)
{
	inc_irq_stat(irq_deferred_error_count);
	deferred_error_int_vector();
}

asmlinkage __visible void smp_deferred_error_interrupt(void)
{
	entering_irq();
	__smp_deferred_error_interrupt();
	exiting_ack_irq();
}

asmlinkage __visible void smp_trace_deferred_error_interrupt(void)
{
	entering_irq();
	trace_deferred_error_apic_entry(DEFERRED_ERROR_VECTOR);
	__smp_deferred_error_interrupt();
	trace_deferred_error_apic_exit(DEFERRED_ERROR_VECTOR);
	exiting_ack_irq();
}

/* APIC interrupt handler for deferred errors */
static void amd_deferred_error_interrupt(void)
{
	u64 status;
	unsigned int bank;

	for (bank = 0; bank < mca_cfg.banks; ++bank) {
		rdmsrl(MSR_IA32_MCx_STATUS(bank), status);

		if (!(status & MCI_STATUS_VAL) ||
		    !(status & MCI_STATUS_DEFERRED))
			continue;

		__log_error(bank, false, 0);
		break;
	}
}

/*
 * APIC Interrupt Handler
 */

/*
 * The threshold interrupt handler services THRESHOLD_APIC_VECTOR.
 * The interrupt fires when error_count reaches threshold_limit.
 * The handler simply logs the event to mcelog with a software-defined
 * bank number.
 */

static void amd_threshold_interrupt(void)
{
	u32 low = 0, high = 0, address = 0;
	int cpu = smp_processor_id();
	unsigned int bank, block;

	/* assume first bank caused it */
	for (bank = 0; bank < mca_cfg.banks; ++bank) {
		if (!(per_cpu(bank_map, cpu) & (1 << bank)))
			continue;
		for (block = 0; block < NR_BLOCKS; ++block) {
			if (block == 0) {
				address = MSR_IA32_MCx_MISC(bank);
			} else if (block == 1) {
				address = (low & MASK_BLKPTR_LO) >> 21;
				if (!address)
					break;
				address += MCG_XBLK_ADDR;
			} else {
				++address;
			}

			if (rdmsr_safe(address, &low, &high))
				break;

			if (!(high & MASK_VALID_HI)) {
				if (block)
					continue;
				else
					break;
			}

			if (!(high & MASK_CNTP_HI) ||
			    (high & MASK_LOCKED_HI))
				continue;

			/*
			 * Log the machine check that caused the threshold
			 * event.
			 */
			if (high & MASK_OVERFLOW_HI)
				goto log;
		}
	}
	return;

log:
	__log_error(bank, true, ((u64)high << 32) | low);
}

/*
 * Sysfs Interface
 */

struct threshold_attr {
	struct attribute attr;
	ssize_t (*show) (struct threshold_block *, char *);
	ssize_t (*store) (struct threshold_block *, const char *, size_t count);
};

#define SHOW_FIELDS(name)						\
static ssize_t show_ ## name(struct threshold_block *b, char *buf)	\
{									\
	return sprintf(buf, "%lu\n", (unsigned long) b->name);		\
}
SHOW_FIELDS(interrupt_enable)
SHOW_FIELDS(threshold_limit)

static ssize_t
store_interrupt_enable(struct threshold_block *b, const char *buf, size_t size)
{
	struct thresh_restart tr;
	unsigned long new;

	if (!b->interrupt_capable)
		return -EINVAL;

	if (kstrtoul(buf, 0, &new) < 0)
		return -EINVAL;

	b->interrupt_enable = !!new;

	memset(&tr, 0, sizeof(tr));
	tr.b = b;

	smp_call_function_single(b->cpu, threshold_restart_bank, &tr, 1);

	return size;
}

static ssize_t
store_threshold_limit(struct threshold_block *b, const char *buf, size_t size)
{
	struct thresh_restart tr;
	unsigned long new;

	if (kstrtoul(buf, 0, &new) < 0)
		return -EINVAL;

	if (new > THRESHOLD_MAX)
		new = THRESHOLD_MAX;
	if (new < 1)
		new = 1;

	memset(&tr, 0, sizeof(tr));
	tr.old_limit = b->threshold_limit;
	b->threshold_limit = new;
	tr.b = b;

	smp_call_function_single(b->cpu, threshold_restart_bank, &tr, 1);

	return size;
}

static ssize_t show_error_count(struct threshold_block *b, char *buf)
{
	u32 lo, hi;

	rdmsr_on_cpu(b->cpu, b->address, &lo, &hi);

	return sprintf(buf, "%u\n", ((hi & THRESHOLD_MAX) -
				     (THRESHOLD_MAX - b->threshold_limit)));
}

static struct threshold_attr error_count = {
	.attr = {.name = __stringify(error_count), .mode = 0444 },
	.show = show_error_count,
};

#define RW_ATTR(val)							\
static struct threshold_attr val = {					\
	.attr = {.name = __stringify(val), .mode = 0644 },		\
	.show = show_## val,						\
	.store = store_## val,						\
};

RW_ATTR(interrupt_enable);
RW_ATTR(threshold_limit);

static struct attribute *default_attrs[] = {
	&threshold_limit.attr,
	&error_count.attr,
	NULL,	/* possibly interrupt_enable if supported, see below */
	NULL,
};

#define to_block(k) container_of(k, struct threshold_block, kobj)
#define to_attr(a) container_of(a, struct threshold_attr, attr)

static ssize_t show(struct kobject *kobj, struct attribute *attr, char *buf)
{
	struct threshold_block *b = to_block(kobj);
	struct threshold_attr *a = to_attr(attr);
	ssize_t ret;

	ret = a->show ? a->show(b, buf) : -EIO;

	return ret;
}

static ssize_t store(struct kobject *kobj, struct attribute *attr,
		     const char *buf, size_t count)
{
	struct threshold_block *b = to_block(kobj);
	struct threshold_attr *a = to_attr(attr);
	ssize_t ret;

	ret = a->store ? a->store(b, buf, count) : -EIO;

	return ret;
}

static const struct sysfs_ops threshold_ops = {
	.show = show,
	.store = store,
};

static struct kobj_type threshold_ktype = {
	.sysfs_ops = &threshold_ops,
	.default_attrs = default_attrs,
};

static int allocate_threshold_blocks(unsigned int cpu, unsigned int bank,
				     unsigned int block, u32 address)
{
	struct threshold_block *b = NULL;
	u32 low, high;
	int err;

	if ((bank >= mca_cfg.banks) || (block >= NR_BLOCKS))
		return 0;

	if (rdmsr_safe_on_cpu(cpu, address, &low, &high))
		return 0;

	if (!(high & MASK_VALID_HI)) {
		if (block)
			goto recurse;
		else
			return 0;
	}

	if (!(high & MASK_CNTP_HI) ||
	    (high & MASK_LOCKED_HI))
		goto recurse;

	b = kzalloc(sizeof(struct threshold_block), GFP_KERNEL);
	if (!b)
		return -ENOMEM;

	b->block = block;
	b->bank = bank;
	b->cpu = cpu;
	b->address = address;
	b->interrupt_enable = 0;
	b->interrupt_capable = lvt_interrupt_supported(bank, high);
	b->threshold_limit = THRESHOLD_MAX;

	if (b->interrupt_capable) {
		threshold_ktype.default_attrs[2] = &interrupt_enable.attr;
		b->interrupt_enable = 1;
	} else {
		threshold_ktype.default_attrs[2] = NULL;
	}

	INIT_LIST_HEAD(&b->miscj);

	if (per_cpu(threshold_banks, cpu)[bank]->blocks) {
		list_add(&b->miscj,
			 &per_cpu(threshold_banks, cpu)[bank]->blocks->miscj);
	} else {
		per_cpu(threshold_banks, cpu)[bank]->blocks = b;
	}

	err = kobject_init_and_add(&b->kobj, &threshold_ktype,
				   per_cpu(threshold_banks, cpu)[bank]->kobj,
				   (bank == 4 ? bank4_names(b) : th_names[bank]));
	if (err)
		goto out_free;
recurse:
	if (!block) {
		address = (low & MASK_BLKPTR_LO) >> 21;
		if (!address)
			return 0;
		address += MCG_XBLK_ADDR;
	} else {
		++address;
	}

	err = allocate_threshold_blocks(cpu, bank, ++block, address);
	if (err)
		goto out_free;

	if (b)
		kobject_uevent(&b->kobj, KOBJ_ADD);

	return err;

out_free:
	if (b) {
		kobject_put(&b->kobj);
		list_del(&b->miscj);
		kfree(b);
	}
	return err;
}
|
|
|
|
|
x86: delete __cpuinit usage from all x86 files
The __cpuinit type of throwaway sections might have made sense
some time ago when RAM was more constrained, but now the savings
do not offset the cost and complications. For example, the fix in
commit 5e427ec2d0 ("x86: Fix bit corruption at CPU resume time")
is a good example of the nasty type of bugs that can be created
with improper use of the various __init prefixes.
After a discussion on LKML[1] it was decided that cpuinit should go
the way of devinit and be phased out. Once all the users are gone,
we can then finally remove the macros themselves from linux/init.h.
Note that some harmless section mismatch warnings may result, since
notify_cpu_starting() and cpu_up() are arch independent (kernel/cpu.c)
are flagged as __cpuinit -- so if we remove the __cpuinit from
arch specific callers, we will also get section mismatch warnings.
As an intermediate step, we intend to turn the linux/init.h cpuinit
content into no-ops as early as possible, since that will get rid
of these warnings. In any case, they are temporary and harmless.
This removes all the arch/x86 uses of the __cpuinit macros from
all C files. x86 only had the one __CPUINIT used in assembly files,
and it wasn't paired off with a .previous or a __FINIT, so we can
delete it directly w/o any corresponding additional change there.
[1] https://lkml.org/lkml/2013/5/20/589
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: x86@kernel.org
Acked-by: Ingo Molnar <mingo@kernel.org>
Acked-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: H. Peter Anvin <hpa@linux.intel.com>
Signed-off-by: Paul Gortmaker <paul.gortmaker@windriver.com>
static int __threshold_add_blocks(struct threshold_bank *b)
{
	struct list_head *head = &b->blocks->miscj;
	struct threshold_block *pos = NULL;
	struct threshold_block *tmp = NULL;
	int err = 0;

	err = kobject_add(&b->blocks->kobj, b->kobj, b->blocks->kobj.name);
	if (err)
		return err;

	list_for_each_entry_safe(pos, tmp, head, miscj) {

		err = kobject_add(&pos->kobj, b->kobj, pos->kobj.name);
		if (err) {
			list_for_each_entry_safe_reverse(pos, tmp, head, miscj)
				kobject_del(&pos->kobj);

			return err;
		}
	}
	return err;
}

static int threshold_create_bank(unsigned int cpu, unsigned int bank)
{
	struct device *dev = per_cpu(mce_device, cpu);
	struct amd_northbridge *nb = NULL;
	struct threshold_bank *b = NULL;
	const char *name = th_names[bank];
	int err = 0;

	if (is_shared_bank(bank)) {
		nb = node_to_amd_nb(amd_get_nb_id(cpu));

		/* threshold descriptor already initialized on this node? */
		if (nb && nb->bank4) {
			/* yes, use it */
			b = nb->bank4;
			err = kobject_add(b->kobj, &dev->kobj, name);
			if (err)
				goto out;

			per_cpu(threshold_banks, cpu)[bank] = b;
			atomic_inc(&b->cpus);

			err = __threshold_add_blocks(b);

			goto out;
		}
	}

	b = kzalloc(sizeof(struct threshold_bank), GFP_KERNEL);
	if (!b) {
		err = -ENOMEM;
		goto out;
	}

	b->kobj = kobject_create_and_add(name, &dev->kobj);
	if (!b->kobj) {
		err = -EINVAL;
		goto out_free;
	}

	per_cpu(threshold_banks, cpu)[bank] = b;

	if (is_shared_bank(bank)) {
		atomic_set(&b->cpus, 1);

		/* nb is already initialized, see above */
		if (nb) {
			WARN_ON(nb->bank4);
			nb->bank4 = b;
		}
	}

	err = allocate_threshold_blocks(cpu, bank, 0, MSR_IA32_MCx_MISC(bank));
	if (!err)
		goto out;

out_free:
	kfree(b);

out:
	return err;
}

/* create dir/files for all valid threshold banks */
static int threshold_create_device(unsigned int cpu)
{
	unsigned int bank;
	struct threshold_bank **bp;
	int err = 0;

	bp = kzalloc(sizeof(struct threshold_bank *) * mca_cfg.banks,
		     GFP_KERNEL);
	if (!bp)
		return -ENOMEM;

	per_cpu(threshold_banks, cpu) = bp;

	for (bank = 0; bank < mca_cfg.banks; ++bank) {
		if (!(per_cpu(bank_map, cpu) & (1 << bank)))
			continue;
		err = threshold_create_bank(cpu, bank);
		if (err)
			return err;
	}

	return err;
}

static void deallocate_threshold_block(unsigned int cpu,
				       unsigned int bank)
{
	struct threshold_block *pos = NULL;
	struct threshold_block *tmp = NULL;
	struct threshold_bank *head = per_cpu(threshold_banks, cpu)[bank];

	if (!head)
		return;

	list_for_each_entry_safe(pos, tmp, &head->blocks->miscj, miscj) {
		kobject_put(&pos->kobj);
		list_del(&pos->miscj);
		kfree(pos);
	}

	kfree(per_cpu(threshold_banks, cpu)[bank]->blocks);
	per_cpu(threshold_banks, cpu)[bank]->blocks = NULL;
}

static void __threshold_remove_blocks(struct threshold_bank *b)
{
	struct threshold_block *pos = NULL;
	struct threshold_block *tmp = NULL;

	kobject_del(b->kobj);

	list_for_each_entry_safe(pos, tmp, &b->blocks->miscj, miscj)
		kobject_del(&pos->kobj);
}

static void threshold_remove_bank(unsigned int cpu, int bank)
{
	struct amd_northbridge *nb;
	struct threshold_bank *b;

	b = per_cpu(threshold_banks, cpu)[bank];
	if (!b)
		return;

	if (!b->blocks)
		goto free_out;

	if (is_shared_bank(bank)) {
		if (!atomic_dec_and_test(&b->cpus)) {
			__threshold_remove_blocks(b);
			per_cpu(threshold_banks, cpu)[bank] = NULL;
			return;
		} else {
			/*
			 * the last CPU on this node using the shared bank is
			 * going away, remove that bank now.
			 */
			nb = node_to_amd_nb(amd_get_nb_id(cpu));
			nb->bank4 = NULL;
		}
	}

	deallocate_threshold_block(cpu, bank);

free_out:
x86 MCE: Fix CPU hotplug problem with multiple multicore AMD CPUs
During CPU hot-remove the sysfs directory created by
threshold_create_bank(), defined in
arch/x86/kernel/cpu/mcheck/mce_amd_64.c, has to be removed before
its parent directory, created by mce_create_device(), defined in
arch/x86/kernel/cpu/mcheck/mce_64.c . Moreover, when the CPU in
question is hotplugged again, obviously the latter has to be created
before the former. At present, the right ordering is not enforced,
because all of these operations are carried out by CPU hotplug
notifiers which are not appropriately ordered with respect to each
other. This leads to serious problems on systems with two or more
multicore AMD CPUs, among other things during suspend and hibernation.
Fix the problem by placing threshold bank CPU hotplug callbacks in
mce_cpu_callback(), so that they are invoked at the right places,
if defined. Additionally, use kobject_del() to remove the sysfs
directory associated with the kobject created by
kobject_create_and_add() in threshold_create_bank(), to prevent the
kernel from crashing during CPU hotplug operations on systems with
two or more multicore AMD CPUs.
This patch fixes bug #11337.
Signed-off-by: Rafael J. Wysocki <rjw@sisk.pl>
Acked-by: Andi Kleen <andi@firstfloor.org>
Tested-by: Mark Langsdorf <mark.langsdorf@amd.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
	kobject_del(b->kobj);
	kobject_put(b->kobj);
	kfree(b);
	per_cpu(threshold_banks, cpu)[bank] = NULL;
}

static void threshold_remove_device(unsigned int cpu)
{
	unsigned int bank;

	for (bank = 0; bank < mca_cfg.banks; ++bank) {
		if (!(per_cpu(bank_map, cpu) & (1 << bank)))
			continue;
		threshold_remove_bank(cpu, bank);
	}
	kfree(per_cpu(threshold_banks, cpu));
}

/* get notified when a cpu comes on/off */
static void
amd_64_threshold_cpu_callback(unsigned long action, unsigned int cpu)
{
	switch (action) {
	case CPU_ONLINE:
	case CPU_ONLINE_FROZEN:
		threshold_create_device(cpu);
		break;
	case CPU_DEAD:
	case CPU_DEAD_FROZEN:
		threshold_remove_device(cpu);
		break;
	default:
		break;
	}
}

static __init int threshold_init_device(void)
{
	unsigned lcpu = 0;

	/* to hit CPUs online before the notifier is up */
	for_each_online_cpu(lcpu) {
		int err = threshold_create_device(lcpu);

		if (err)
			return err;
	}
	threshold_cpu_callback = amd_64_threshold_cpu_callback;

	return 0;
}
/*
 * There are three functions which need to be initcalled in a logical sequence:
 * 1. xen_late_init_mcelog
 * 2. mcheck_init_device
 * 3. threshold_init_device
 *
 * xen_late_init_mcelog must register xen_mce_chrdev_device before the
 * native mce_chrdev_device registration when running under Xen;
 *
 * mcheck_init_device must run before threshold_init_device so that
 * mce_device is initialized; otherwise a NULL pointer dereference will
 * cause a panic.
 *
 * So we use the following initcalls:
 * 1. device_initcall(xen_late_init_mcelog);
 * 2. device_initcall_sync(mcheck_init_device);
 * 3. late_initcall(threshold_init_device);
 *
 * When running under Xen, the initcall order is 1, 2, 3;
 * on bare metal, we skip 1 and do only 2 and 3.
 */
late_initcall(threshold_init_device);