2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* Intel IO-APIC support for multi-Pentium hosts.
|
|
|
|
*
|
2009-01-31 09:03:42 +08:00
|
|
|
* Copyright (C) 1997, 1998, 1999, 2000, 2009 Ingo Molnar, Hajnalka Szabo
|
2005-04-17 06:20:36 +08:00
|
|
|
*
|
|
|
|
* Many thanks to Stig Venaas for trying out countless experimental
|
|
|
|
* patches and reporting/debugging problems patiently!
|
|
|
|
*
|
|
|
|
* (c) 1999, Multiple IO-APIC support, developed by
|
|
|
|
* Ken-ichi Yaku <yaku@css1.kbnes.nec.co.jp> and
|
|
|
|
* Hidemi Kishimoto <kisimoto@css1.kbnes.nec.co.jp>,
|
|
|
|
* further tested and cleaned up by Zach Brown <zab@redhat.com>
|
|
|
|
* and Ingo Molnar <mingo@redhat.com>
|
|
|
|
*
|
|
|
|
* Fixes
|
|
|
|
* Maciej W. Rozycki : Bits for genuine 82489DX APICs;
|
|
|
|
* thanks to Eric Gilmore
|
|
|
|
* and Rolf G. Tews
|
|
|
|
* for testing these extensively
|
|
|
|
* Paul Diefenbaugh : Added full ACPI support
|
2015-04-14 10:29:58 +08:00
|
|
|
*
|
|
|
|
* Historical information which is worth to be preserved:
|
|
|
|
*
|
|
|
|
* - SiS APIC rmw bug:
|
|
|
|
*
|
|
|
|
* We used to have a workaround for a bug in SiS chips which
|
|
|
|
* required to rewrite the index register for a read-modify-write
|
|
|
|
* operation as the chip lost the index information which was
|
|
|
|
* setup for the read already. We cache the data now, so that
|
|
|
|
* workaround has been removed.
|
2005-04-17 06:20:36 +08:00
|
|
|
*/
|
|
|
|
|
|
|
|
#include <linux/mm.h>
|
|
|
|
#include <linux/interrupt.h>
|
|
|
|
#include <linux/init.h>
|
|
|
|
#include <linux/delay.h>
|
|
|
|
#include <linux/sched.h>
|
2008-08-20 11:50:38 +08:00
|
|
|
#include <linux/pci.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
#include <linux/mc146818rtc.h>
|
|
|
|
#include <linux/compiler.h>
|
|
|
|
#include <linux/acpi.h>
|
2016-07-14 08:18:56 +08:00
|
|
|
#include <linux/export.h>
|
2011-03-24 05:15:54 +08:00
|
|
|
#include <linux/syscore_ops.h>
|
2006-12-07 12:34:23 +08:00
|
|
|
#include <linux/freezer.h>
|
2007-05-03 01:27:19 +08:00
|
|
|
#include <linux/kthread.h>
|
2008-08-20 15:07:45 +08:00
|
|
|
#include <linux/jiffies.h> /* time_after() */
|
include cleanup: Update gfp.h and slab.h includes to prepare for breaking implicit slab.h inclusion from percpu.h
percpu.h is included by sched.h and module.h and thus ends up being
included when building most .c files. percpu.h includes slab.h which
in turn includes gfp.h making everything defined by the two files
universally available and complicating inclusion dependencies.
percpu.h -> slab.h dependency is about to be removed. Prepare for
this change by updating users of gfp and slab facilities include those
headers directly instead of assuming availability. As this conversion
needs to touch large number of source files, the following script is
used as the basis of conversion.
http://userweb.kernel.org/~tj/misc/slabh-sweep.py
The script does the followings.
* Scan files for gfp and slab usages and update includes such that
only the necessary includes are there. ie. if only gfp is used,
gfp.h, if slab is used, slab.h.
* When the script inserts a new include, it looks at the include
blocks and try to put the new include such that its order conforms
to its surrounding. It's put in the include block which contains
core kernel includes, in the same order that the rest are ordered -
alphabetical, Christmas tree, rev-Xmas-tree or at the end if there
doesn't seem to be any matching order.
* If the script can't find a place to put a new include (mostly
because the file doesn't have fitting include block), it prints out
an error message indicating which .h file needs to be added to the
file.
The conversion was done in the following steps.
1. The initial automatic conversion of all .c files updated slightly
over 4000 files, deleting around 700 includes and adding ~480 gfp.h
and ~3000 slab.h inclusions. The script emitted errors for ~400
files.
2. Each error was manually checked. Some didn't need the inclusion,
some needed manual addition while adding it to implementation .h or
embedding .c file was more appropriate for others. This step added
inclusions to around 150 files.
3. The script was run again and the output was compared to the edits
from #2 to make sure no file was left behind.
4. Several build tests were done and a couple of problems were fixed.
e.g. lib/decompress_*.c used malloc/free() wrappers around slab
APIs requiring slab.h to be added manually.
5. The script was run on all .h files but without automatically
editing them as sprinkling gfp.h and slab.h inclusions around .h
files could easily lead to inclusion dependency hell. Most gfp.h
inclusion directives were ignored as stuff from gfp.h was usually
wildly available and often used in preprocessor macros. Each
slab.h inclusion directive was examined and added manually as
necessary.
6. percpu.h was updated not to include slab.h.
7. Build test were done on the following configurations and failures
were fixed. CONFIG_GCOV_KERNEL was turned off for all tests (as my
distributed build env didn't work with gcov compiles) and a few
more options had to be turned off depending on archs to make things
build (like ipr on powerpc/64 which failed due to missing writeq).
* x86 and x86_64 UP and SMP allmodconfig and a custom test config.
* powerpc and powerpc64 SMP allmodconfig
* sparc and sparc64 SMP allmodconfig
* ia64 SMP allmodconfig
* s390 SMP allmodconfig
* alpha SMP allmodconfig
* um on x86_64 SMP allmodconfig
8. percpu.h modifications were reverted so that it could be applied as
a separate patch and serve as bisection point.
Given the fact that I had only a couple of failures from tests on step
6, I'm fairly confident about the coverage of this conversion patch.
If there is a breakage, it's likely to be something in one of the arch
headers which should be easily discoverable easily on most builds of
the specific arch.
Signed-off-by: Tejun Heo <tj@kernel.org>
Guess-its-ok-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
2010-03-24 16:04:11 +08:00
|
|
|
#include <linux/slab.h>
|
2008-08-20 11:50:38 +08:00
|
|
|
#include <linux/bootmem.h>
|
[PATCH] x86/x86_64: deferred handling of writes to /proc/irqxx/smp_affinity
When handling writes to /proc/irq, current code is re-programming rte
entries directly. This is not recommended and could potentially cause
chipset's to lockup, or cause missing interrupts.
CONFIG_IRQ_BALANCE does this correctly, where it re-programs only when the
interrupt is pending. The same needs to be done for /proc/irq handling as well.
Otherwise user space irq balancers are really not doing the right thing.
- Changed pending_irq_balance_cpumask to pending_irq_migrate_cpumask for
lack of a generic name.
- added move_irq out of IRQ_BALANCE, and added this same to X86_64
- Added new proc handler for write, so we can do deferred write at irq
handling time.
- Display of /proc/irq/XX/smp_affinity used to display CPU_MASKALL, instead
it now shows only active cpu masks, or exactly what was set.
- Provided a common move_irq implementation, instead of duplicating
when using generic irq framework.
Tested on i386/x86_64 and ia64 with CONFIG_PCI_MSI turned on and off.
Tested UP builds as well.
MSI testing: tbd: I have cards, need to look for a x-over cable, although I
did test an earlier version of this patch. Will test in a couple days.
Signed-off-by: Ashok Raj <ashok.raj@intel.com>
Acked-by: Zwane Mwaikambo <zwane@holomorphy.com>
Grudgingly-acked-by: Andi Kleen <ak@muc.de>
Signed-off-by: Coywolf Qi Hunt <coywolf@lovecn.org>
Signed-off-by: Ashok Raj <ashok.raj@intel.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2005-09-07 06:16:15 +08:00
|
|
|
|
2015-04-14 10:30:08 +08:00
|
|
|
#include <asm/irqdomain.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
#include <asm/io.h>
|
|
|
|
#include <asm/smp.h>
|
2009-01-08 00:08:59 +08:00
|
|
|
#include <asm/cpu.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
#include <asm/desc.h>
|
2008-08-20 11:50:38 +08:00
|
|
|
#include <asm/proto.h>
|
|
|
|
#include <asm/acpi.h>
|
|
|
|
#include <asm/dma.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
#include <asm/timer.h>
|
2005-06-30 17:58:55 +08:00
|
|
|
#include <asm/i8259.h>
|
2008-07-25 17:14:28 +08:00
|
|
|
#include <asm/setup.h>
|
2012-03-31 02:47:08 +08:00
|
|
|
#include <asm/irq_remapping.h>
|
2009-04-11 02:33:10 +08:00
|
|
|
#include <asm/hw_irq.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2009-02-17 20:58:15 +08:00
|
|
|
#include <asm/apic.h>
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
#define for_each_ioapic(idx) \
|
|
|
|
for ((idx) = 0; (idx) < nr_ioapics; (idx)++)
|
|
|
|
#define for_each_ioapic_reverse(idx) \
|
|
|
|
for ((idx) = nr_ioapics - 1; (idx) >= 0; (idx)--)
|
|
|
|
#define for_each_pin(idx, pin) \
|
|
|
|
for ((pin) = 0; (pin) < ioapics[(idx)].nr_registers; (pin)++)
|
|
|
|
#define for_each_ioapic_pin(idx, pin) \
|
|
|
|
for_each_ioapic((idx)) \
|
|
|
|
for_each_pin((idx), (pin))
|
2009-08-01 15:47:59 +08:00
|
|
|
#define for_each_irq_pin(entry, head) \
|
2014-10-27 16:11:55 +08:00
|
|
|
list_for_each_entry(entry, &head, list)
|
2008-07-21 07:52:49 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
static DEFINE_RAW_SPINLOCK(ioapic_lock);
|
2014-06-09 16:19:52 +08:00
|
|
|
static DEFINE_MUTEX(ioapic_mutex);
|
2014-06-09 16:19:53 +08:00
|
|
|
static unsigned int ioapic_dynirq_base;
|
2014-06-09 16:20:11 +08:00
|
|
|
static int ioapic_initialized;
|
2008-08-20 11:50:36 +08:00
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
struct irq_pin_list {
|
|
|
|
struct list_head list;
|
|
|
|
int apic, pin;
|
|
|
|
};
|
|
|
|
|
2015-04-13 14:11:55 +08:00
|
|
|
struct mp_chip_data {
|
2015-04-14 10:29:53 +08:00
|
|
|
struct list_head irq_2_pin;
|
2015-04-13 14:11:55 +08:00
|
|
|
struct IO_APIC_route_entry entry;
|
|
|
|
int trigger;
|
|
|
|
int polarity;
|
2015-04-13 14:11:58 +08:00
|
|
|
u32 count;
|
2015-04-13 14:11:55 +08:00
|
|
|
bool isa_irq;
|
|
|
|
};
|
|
|
|
|
2015-04-14 10:29:56 +08:00
|
|
|
struct mp_ioapic_gsi {
|
|
|
|
u32 gsi_base;
|
|
|
|
u32 gsi_end;
|
|
|
|
};
|
|
|
|
|
2011-05-19 07:31:35 +08:00
|
|
|
static struct ioapic {
|
|
|
|
/*
|
|
|
|
* # of IRQ routing registers
|
|
|
|
*/
|
|
|
|
int nr_registers;
|
2011-05-19 07:31:36 +08:00
|
|
|
/*
|
|
|
|
* Saved state during suspend/resume, or while enabling intr-remap.
|
|
|
|
*/
|
|
|
|
struct IO_APIC_route_entry *saved_registers;
|
2011-05-19 07:31:37 +08:00
|
|
|
/* I/O APIC config */
|
|
|
|
struct mpc_ioapic mp_config;
|
2011-05-19 07:31:38 +08:00
|
|
|
/* IO APIC gsi routing info */
|
|
|
|
struct mp_ioapic_gsi gsi_config;
|
2014-06-09 16:19:52 +08:00
|
|
|
struct ioapic_domain_cfg irqdomain_cfg;
|
|
|
|
struct irq_domain *irqdomain;
|
2014-10-27 13:21:46 +08:00
|
|
|
struct resource *iomem_res;
|
2011-05-19 07:31:35 +08:00
|
|
|
} ioapics[MAX_IO_APICS];
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
#define mpc_ioapic_ver(ioapic_idx) ioapics[ioapic_idx].mp_config.apicver
|
2011-05-19 07:31:37 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
int mpc_ioapic_id(int ioapic_idx)
|
2011-05-19 07:31:37 +08:00
|
|
|
{
|
2011-10-12 15:33:48 +08:00
|
|
|
return ioapics[ioapic_idx].mp_config.apicid;
|
2011-05-19 07:31:37 +08:00
|
|
|
}
|
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
unsigned int mpc_ioapic_addr(int ioapic_idx)
|
2011-05-19 07:31:37 +08:00
|
|
|
{
|
2011-10-12 15:33:48 +08:00
|
|
|
return ioapics[ioapic_idx].mp_config.apicaddr;
|
2011-05-19 07:31:37 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:56 +08:00
|
|
|
static inline struct mp_ioapic_gsi *mp_ioapic_gsi_routing(int ioapic_idx)
|
2011-05-19 07:31:38 +08:00
|
|
|
{
|
2011-10-12 15:33:48 +08:00
|
|
|
return &ioapics[ioapic_idx].gsi_config;
|
2011-05-19 07:31:38 +08:00
|
|
|
}
|
2008-04-05 03:41:13 +08:00
|
|
|
|
2014-06-09 16:19:45 +08:00
|
|
|
static inline int mp_ioapic_pin_count(int ioapic)
|
|
|
|
{
|
|
|
|
struct mp_ioapic_gsi *gsi_cfg = mp_ioapic_gsi_routing(ioapic);
|
|
|
|
|
|
|
|
return gsi_cfg->gsi_end - gsi_cfg->gsi_base + 1;
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:56 +08:00
|
|
|
static inline u32 mp_pin_to_gsi(int ioapic, int pin)
|
2014-06-09 16:19:45 +08:00
|
|
|
{
|
|
|
|
return mp_ioapic_gsi_routing(ioapic)->gsi_base + pin;
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
static inline bool mp_is_legacy_irq(int irq)
|
|
|
|
{
|
|
|
|
return irq >= 0 && irq < nr_legacy_irqs();
|
|
|
|
}
|
|
|
|
|
2014-06-09 16:19:48 +08:00
|
|
|
/*
|
|
|
|
* Initialize all legacy IRQs and all pins on the first IOAPIC
|
|
|
|
* if we have legacy interrupt controller. Kernel boot option "pirq="
|
|
|
|
* may rely on non-legacy pins on the first IOAPIC.
|
|
|
|
*/
|
2014-06-09 16:19:45 +08:00
|
|
|
static inline int mp_init_irq_at_boot(int ioapic, int irq)
|
|
|
|
{
|
2014-06-09 16:19:48 +08:00
|
|
|
if (!nr_legacy_irqs())
|
|
|
|
return 0;
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
return ioapic == 0 || mp_is_legacy_irq(irq);
|
2014-06-09 16:19:45 +08:00
|
|
|
}
|
|
|
|
|
2014-06-09 16:19:52 +08:00
|
|
|
static inline struct irq_domain *mp_ioapic_irqdomain(int ioapic)
|
|
|
|
{
|
|
|
|
return ioapics[ioapic].irqdomain;
|
|
|
|
}
|
|
|
|
|
2011-05-19 07:31:38 +08:00
|
|
|
int nr_ioapics;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2010-06-09 02:44:32 +08:00
|
|
|
/* The one past the highest gsi number used */
|
|
|
|
u32 gsi_top;
|
2010-03-30 16:07:10 +08:00
|
|
|
|
2008-04-05 03:41:32 +08:00
|
|
|
/* MP IRQ source entries */
|
2009-01-12 20:17:22 +08:00
|
|
|
struct mpc_intsrc mp_irqs[MAX_IRQ_SOURCES];
|
2008-04-05 03:41:32 +08:00
|
|
|
|
|
|
|
/* # of MP IRQ source entries */
|
|
|
|
int mp_irq_entries;
|
|
|
|
|
2012-05-18 07:06:13 +08:00
|
|
|
#ifdef CONFIG_EISA
|
2008-05-19 23:47:16 +08:00
|
|
|
int mp_bus_id_to_type[MAX_MP_BUSSES];
|
|
|
|
#endif
|
|
|
|
|
|
|
|
DECLARE_BITMAP(mp_bus_not_pci, MAX_MP_BUSSES);
|
|
|
|
|
2008-08-20 11:50:36 +08:00
|
|
|
int skip_ioapic_setup;
|
|
|
|
|
2011-02-22 22:38:05 +08:00
|
|
|
/**
|
|
|
|
* disable_ioapic_support() - disables ioapic support at runtime
|
|
|
|
*/
|
|
|
|
void disable_ioapic_support(void)
|
2009-01-31 10:36:17 +08:00
|
|
|
{
|
|
|
|
#ifdef CONFIG_PCI
|
|
|
|
noioapicquirk = 1;
|
|
|
|
noioapicreroute = -1;
|
|
|
|
#endif
|
|
|
|
skip_ioapic_setup = 1;
|
|
|
|
}
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
static int __init parse_noapic(char *str)
|
2008-08-20 11:50:36 +08:00
|
|
|
{
|
|
|
|
/* disable IO-APIC */
|
2011-02-22 22:38:05 +08:00
|
|
|
disable_ioapic_support();
|
2008-08-20 11:50:36 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
early_param("noapic", parse_noapic);
|
2005-09-13 00:49:25 +08:00
|
|
|
|
2010-11-19 11:33:35 +08:00
|
|
|
/* Will be called in mpparse/acpi/sfi codes for saving IRQ info */
|
|
|
|
void mp_save_irq(struct mpc_intsrc *m)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
apic_printk(APIC_VERBOSE, "Int: type %d, pol %d, trig %d, bus %02x,"
|
|
|
|
" IRQ %02x, APIC ID %x, APIC INT %02x\n",
|
|
|
|
m->irqtype, m->irqflag & 3, (m->irqflag >> 2) & 3, m->srcbus,
|
|
|
|
m->srcbusirq, m->dstapic, m->dstirq);
|
|
|
|
|
|
|
|
for (i = 0; i < mp_irq_entries; i++) {
|
2010-12-08 15:18:57 +08:00
|
|
|
if (!memcmp(&mp_irqs[i], m, sizeof(*m)))
|
2010-11-19 11:33:35 +08:00
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
2010-12-08 15:18:57 +08:00
|
|
|
memcpy(&mp_irqs[mp_irq_entries], m, sizeof(*m));
|
2010-11-19 11:33:35 +08:00
|
|
|
if (++mp_irq_entries == MAX_IRQ_SOURCES)
|
|
|
|
panic("Max # of irq sources exceeded!!\n");
|
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:39 +08:00
|
|
|
static void alloc_ioapic_saved_registers(int idx)
|
|
|
|
{
|
|
|
|
size_t size;
|
|
|
|
|
|
|
|
if (ioapics[idx].saved_registers)
|
|
|
|
return;
|
|
|
|
|
|
|
|
size = sizeof(struct IO_APIC_route_entry) * ioapics[idx].nr_registers;
|
|
|
|
ioapics[idx].saved_registers = kzalloc(size, GFP_KERNEL);
|
|
|
|
if (!ioapics[idx].saved_registers)
|
|
|
|
pr_err("IOAPIC %d: suspend/resume impossible!\n", idx);
|
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:46 +08:00
|
|
|
static void free_ioapic_saved_registers(int idx)
|
|
|
|
{
|
|
|
|
kfree(ioapics[idx].saved_registers);
|
|
|
|
ioapics[idx].saved_registers = NULL;
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:12:05 +08:00
|
|
|
int __init arch_early_ioapic_init(void)
|
2008-08-20 11:50:51 +08:00
|
|
|
{
|
2015-04-13 14:11:56 +08:00
|
|
|
int i;
|
2008-10-15 21:27:23 +08:00
|
|
|
|
2014-06-09 16:19:48 +08:00
|
|
|
if (!nr_legacy_irqs())
|
2010-02-05 20:06:56 +08:00
|
|
|
io_apic_irqs = ~0UL;
|
|
|
|
|
2014-10-27 13:21:39 +08:00
|
|
|
for_each_ioapic(i)
|
|
|
|
alloc_ioapic_saved_registers(i);
|
2011-05-19 07:31:32 +08:00
|
|
|
|
2008-12-26 18:05:47 +08:00
|
|
|
return 0;
|
2008-12-06 10:58:31 +08:00
|
|
|
}
|
2008-08-20 11:50:51 +08:00
|
|
|
|
2006-11-02 01:11:00 +08:00
|
|
|
struct io_apic {
|
|
|
|
unsigned int index;
|
|
|
|
unsigned int unused[3];
|
|
|
|
unsigned int data;
|
x86, x2apic: cleanup the IO-APIC level migration with interrupt-remapping
Impact: simplification
In the current code, for level triggered migration, we need to modify the
io-apic RTE with the update vector information, along with modifying interrupt
remapping table entry(IRTE) with vector and destination. This is to ensure that
remote IRR bit inthe IOAPIC RTE gets cleared when the cpu does EOI.
With this patch, for level triggered, we eliminate the io-apic RTE modification
(with the updated vector information), by using a virtual vector (io-apic pin
number). Real vector that is used for interrupting cpu will be coming from
the interrupt-remapping table entry. Trigger mode in the IRTE will always be
edge, and the actual level or edge trigger will be setup in the IO-APIC RTE.
So a level triggered interrupt will appear as an edge to the local apic
cpu but still as level to the IO-APIC.
With this change, level irq migration can be done by simply modifying
the interrupt-remapping table entry with out changing the io-apic RTE.
And as the interrupt appears as edge at the cpu, in addition to do the
local apic EOI, we need to do IO-APIC directed EOI to clear the remote
IRR bit in the IO-APIC RTE.
This simplies the irq migration in the presence of interrupt-remapping.
Idea-by: Rajesh Sankaran <rajesh.sankaran@intel.com>
Signed-off-by: Suresh Siddha <suresh.b.siddha@intel.com>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>
2009-03-17 08:05:01 +08:00
|
|
|
unsigned int unused2[11];
|
|
|
|
unsigned int eoi;
|
2006-11-02 01:11:00 +08:00
|
|
|
};
|
|
|
|
|
|
|
|
static __attribute_const__ struct io_apic __iomem *io_apic_base(int idx)
|
|
|
|
{
|
|
|
|
return (void __iomem *) __fix_to_virt(FIX_IO_APIC_BASE_0 + idx)
|
2011-05-19 07:31:37 +08:00
|
|
|
+ (mpc_ioapic_addr(idx) & ~PAGE_MASK);
|
2006-11-02 01:11:00 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:43 +08:00
|
|
|
static inline void io_apic_eoi(unsigned int apic, unsigned int vector)
|
x86, x2apic: cleanup the IO-APIC level migration with interrupt-remapping
Impact: simplification
In the current code, for level triggered migration, we need to modify the
io-apic RTE with the update vector information, along with modifying interrupt
remapping table entry(IRTE) with vector and destination. This is to ensure that
remote IRR bit inthe IOAPIC RTE gets cleared when the cpu does EOI.
With this patch, for level triggered, we eliminate the io-apic RTE modification
(with the updated vector information), by using a virtual vector (io-apic pin
number). Real vector that is used for interrupting cpu will be coming from
the interrupt-remapping table entry. Trigger mode in the IRTE will always be
edge, and the actual level or edge trigger will be setup in the IO-APIC RTE.
So a level triggered interrupt will appear as an edge to the local apic
cpu but still as level to the IO-APIC.
With this change, level irq migration can be done by simply modifying
the interrupt-remapping table entry with out changing the io-apic RTE.
And as the interrupt appears as edge at the cpu, in addition to do the
local apic EOI, we need to do IO-APIC directed EOI to clear the remote
IRR bit in the IO-APIC RTE.
This simplies the irq migration in the presence of interrupt-remapping.
Idea-by: Rajesh Sankaran <rajesh.sankaran@intel.com>
Signed-off-by: Suresh Siddha <suresh.b.siddha@intel.com>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>
2009-03-17 08:05:01 +08:00
|
|
|
{
|
|
|
|
struct io_apic __iomem *io_apic = io_apic_base(apic);
|
|
|
|
writel(vector, &io_apic->eoi);
|
|
|
|
}
|
|
|
|
|
2012-03-29 00:37:36 +08:00
|
|
|
unsigned int native_io_apic_read(unsigned int apic, unsigned int reg)
|
2006-11-02 01:11:00 +08:00
|
|
|
{
|
|
|
|
struct io_apic __iomem *io_apic = io_apic_base(apic);
|
|
|
|
writel(reg, &io_apic->index);
|
|
|
|
return readl(&io_apic->data);
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:55 +08:00
|
|
|
static void io_apic_write(unsigned int apic, unsigned int reg,
|
|
|
|
unsigned int value)
|
2006-11-02 01:11:00 +08:00
|
|
|
{
|
|
|
|
struct io_apic __iomem *io_apic = io_apic_base(apic);
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2006-11-02 01:11:00 +08:00
|
|
|
writel(reg, &io_apic->index);
|
|
|
|
writel(value, &io_apic->data);
|
|
|
|
}
|
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
union entry_union {
|
|
|
|
struct { u32 w1, w2; };
|
|
|
|
struct IO_APIC_route_entry entry;
|
|
|
|
};
|
|
|
|
|
2011-08-26 03:01:12 +08:00
|
|
|
static struct IO_APIC_route_entry __ioapic_read_entry(int apic, int pin)
|
|
|
|
{
|
|
|
|
union entry_union eu;
|
|
|
|
|
|
|
|
eu.w1 = io_apic_read(apic, 0x10 + 2 * pin);
|
|
|
|
eu.w2 = io_apic_read(apic, 0x11 + 2 * pin);
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2011-08-26 03:01:12 +08:00
|
|
|
return eu.entry;
|
|
|
|
}
|
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
static struct IO_APIC_route_entry ioapic_read_entry(int apic, int pin)
|
|
|
|
{
|
|
|
|
union entry_union eu;
|
|
|
|
unsigned long flags;
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-08-26 03:01:12 +08:00
|
|
|
eu.entry = __ioapic_read_entry(apic, pin);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
return eu.entry;
|
|
|
|
}
|
|
|
|
|
2006-11-02 02:05:35 +08:00
|
|
|
/*
|
|
|
|
* When we write a new IO APIC routing entry, we need to write the high
|
|
|
|
* word first! If the mask bit in the low word is clear, we will enable
|
|
|
|
* the interrupt, and we need to make sure the entry is fully populated
|
|
|
|
* before that happens.
|
|
|
|
*/
|
2012-03-22 10:58:08 +08:00
|
|
|
static void __ioapic_write_entry(int apic, int pin, struct IO_APIC_route_entry e)
|
2006-09-26 16:52:30 +08:00
|
|
|
{
|
2009-06-17 22:25:20 +08:00
|
|
|
union entry_union eu = {{0, 0}};
|
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
eu.entry = e;
|
2006-11-02 02:05:35 +08:00
|
|
|
io_apic_write(apic, 0x11 + 2*pin, eu.w2);
|
|
|
|
io_apic_write(apic, 0x10 + 2*pin, eu.w1);
|
2006-12-07 09:14:07 +08:00
|
|
|
}
|
|
|
|
|
2010-10-05 03:08:56 +08:00
|
|
|
static void ioapic_write_entry(int apic, int pin, struct IO_APIC_route_entry e)
|
2006-12-07 09:14:07 +08:00
|
|
|
{
|
|
|
|
unsigned long flags;
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2006-12-07 09:14:07 +08:00
|
|
|
__ioapic_write_entry(apic, pin, e);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2006-11-02 02:05:35 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* When we mask an IO APIC routing entry, we need to write the low
|
|
|
|
* word first, in order to set the mask bit before we change the
|
|
|
|
* high bits!
|
|
|
|
*/
|
|
|
|
static void ioapic_mask_entry(int apic, int pin)
|
|
|
|
{
|
|
|
|
unsigned long flags;
|
2015-04-14 10:30:06 +08:00
|
|
|
union entry_union eu = { .entry.mask = IOAPIC_MASKED };
|
2006-11-02 02:05:35 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2006-09-26 16:52:30 +08:00
|
|
|
io_apic_write(apic, 0x10 + 2*pin, eu.w1);
|
|
|
|
io_apic_write(apic, 0x11 + 2*pin, eu.w2);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2006-09-26 16:52:30 +08:00
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* The common case is 1:1 IRQ<->pin mappings. Sometimes there are
|
|
|
|
* shared ISA-space IRQs, so we have to support them. We are super
|
|
|
|
* fast in the common case, and fast for shared ISA-space IRQs.
|
|
|
|
*/
|
2015-04-14 10:29:53 +08:00
|
|
|
static int __add_pin_to_irq_node(struct mp_chip_data *data,
|
|
|
|
int node, int apic, int pin)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2014-10-27 16:11:55 +08:00
|
|
|
struct irq_pin_list *entry;
|
2008-08-20 11:50:26 +08:00
|
|
|
|
2009-08-01 15:47:59 +08:00
|
|
|
/* don't allow duplicates */
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin)
|
2008-08-20 11:50:26 +08:00
|
|
|
if (entry->apic == apic && entry->pin == pin)
|
2009-08-06 04:09:31 +08:00
|
|
|
return 0;
|
2008-08-20 11:50:26 +08:00
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
entry = kzalloc_node(sizeof(struct irq_pin_list), GFP_ATOMIC, node);
|
2009-08-01 15:48:00 +08:00
|
|
|
if (!entry) {
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_err("can not alloc irq_pin_list (%d,%d,%d)\n",
|
|
|
|
node, apic, pin);
|
2009-08-06 04:09:31 +08:00
|
|
|
return -ENOMEM;
|
2009-08-01 15:48:00 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
entry->apic = apic;
|
|
|
|
entry->pin = pin;
|
2015-04-14 10:29:53 +08:00
|
|
|
list_add_tail(&entry->list, &data->irq_2_pin);
|
2009-06-08 18:24:11 +08:00
|
|
|
|
2009-08-06 04:09:31 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void __remove_pin_from_irq(struct mp_chip_data *data, int apic, int pin)
|
2014-06-09 16:20:06 +08:00
|
|
|
{
|
2014-10-27 16:11:55 +08:00
|
|
|
struct irq_pin_list *tmp, *entry;
|
2014-06-09 16:20:06 +08:00
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
list_for_each_entry_safe(entry, tmp, &data->irq_2_pin, list)
|
2014-06-09 16:20:06 +08:00
|
|
|
if (entry->apic == apic && entry->pin == pin) {
|
2014-10-27 16:11:55 +08:00
|
|
|
list_del(&entry->list);
|
2014-06-09 16:20:06 +08:00
|
|
|
kfree(entry);
|
|
|
|
return;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void add_pin_to_irq_node(struct mp_chip_data *data,
|
|
|
|
int node, int apic, int pin)
|
2009-08-06 04:09:31 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
if (__add_pin_to_irq_node(data, node, apic, pin))
|
2009-08-06 04:09:31 +08:00
|
|
|
panic("IO-APIC: failed to add irq-pin. Can not proceed\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Reroute an IRQ to a different pin.
|
|
|
|
*/
|
2015-04-14 10:29:53 +08:00
|
|
|
static void __init replace_pin_at_irq_node(struct mp_chip_data *data, int node,
|
2009-06-08 18:32:15 +08:00
|
|
|
int oldapic, int oldpin,
|
|
|
|
int newapic, int newpin)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2009-06-08 18:29:26 +08:00
|
|
|
struct irq_pin_list *entry;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin) {
|
2005-04-17 06:20:36 +08:00
|
|
|
if (entry->apic == oldapic && entry->pin == oldpin) {
|
|
|
|
entry->apic = newapic;
|
|
|
|
entry->pin = newpin;
|
2008-08-20 11:50:26 +08:00
|
|
|
/* every one is different, right? */
|
2009-06-08 18:32:15 +08:00
|
|
|
return;
|
2008-08-20 11:50:26 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
2008-08-20 11:50:26 +08:00
|
|
|
|
2009-06-08 18:32:15 +08:00
|
|
|
/* old apic/pin didn't exist, so just add new ones */
|
2015-04-14 10:29:53 +08:00
|
|
|
add_pin_to_irq_node(data, node, newapic, newpin);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void io_apic_modify_irq(struct mp_chip_data *data,
|
2009-06-08 17:55:22 +08:00
|
|
|
int mask_and, int mask_or,
|
|
|
|
void (*final)(struct irq_pin_list *entry))
|
2008-09-11 02:19:50 +08:00
|
|
|
{
|
2015-04-14 10:29:57 +08:00
|
|
|
union entry_union eu;
|
2008-09-11 02:19:50 +08:00
|
|
|
struct irq_pin_list *entry;
|
2008-08-20 11:50:41 +08:00
|
|
|
|
2015-04-14 10:29:57 +08:00
|
|
|
eu.entry = data->entry;
|
|
|
|
eu.w1 &= mask_and;
|
|
|
|
eu.w1 |= mask_or;
|
|
|
|
data->entry = eu.entry;
|
|
|
|
|
|
|
|
for_each_irq_pin(entry, data->irq_2_pin) {
|
|
|
|
io_apic_write(entry->apic, 0x10 + 2 * entry->pin, eu.w1);
|
|
|
|
if (final)
|
|
|
|
final(entry);
|
|
|
|
}
|
2009-12-02 07:31:16 +08:00
|
|
|
}
|
|
|
|
|
2008-12-29 23:04:35 +08:00
|
|
|
static void io_apic_sync(struct irq_pin_list *entry)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2008-09-11 02:19:50 +08:00
|
|
|
/*
|
|
|
|
* Synchronize the IO-APIC and the CPU by doing
|
|
|
|
* a dummy read from the IO-APIC
|
|
|
|
*/
|
|
|
|
struct io_apic __iomem *io_apic;
|
2012-03-22 10:58:08 +08:00
|
|
|
|
2008-09-11 02:19:50 +08:00
|
|
|
io_apic = io_apic_base(entry->apic);
|
2008-08-20 11:50:47 +08:00
|
|
|
readl(&io_apic->data);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void mask_ioapic_irq(struct irq_data *irq_data)
|
2008-09-11 02:19:50 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
2010-09-28 21:18:35 +08:00
|
|
|
unsigned long flags;
|
|
|
|
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:53 +08:00
|
|
|
io_apic_modify_irq(data, ~0, IO_APIC_REDIR_MASKED, &io_apic_sync);
|
2010-09-28 21:18:35 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2008-09-11 02:19:50 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void __unmask_ioapic(struct mp_chip_data *data)
|
2010-09-28 21:18:35 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
io_apic_modify_irq(data, ~IO_APIC_REDIR_MASKED, 0, NULL);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void unmask_ioapic_irq(struct irq_data *irq_data)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
2005-04-17 06:20:36 +08:00
|
|
|
unsigned long flags;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:53 +08:00
|
|
|
__unmask_ioapic(data);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2011-08-26 03:01:13 +08:00
|
|
|
/*
|
|
|
|
* IO-APIC versions below 0x20 don't support EOI register.
|
|
|
|
* For the record, here is the information about various versions:
|
|
|
|
* 0Xh 82489DX
|
|
|
|
* 1Xh I/OAPIC or I/O(x)APIC which are not PCI 2.2 Compliant
|
|
|
|
* 2Xh I/O(x)APIC which is PCI 2.2 Compliant
|
|
|
|
* 30h-FFh Reserved
|
|
|
|
*
|
|
|
|
* Some of the Intel ICH Specs (ICH2 to ICH5) documents the io-apic
|
|
|
|
* version as 0x2. This is an error with documentation and these ICH chips
|
|
|
|
* use io-apic's of version 0x20.
|
|
|
|
*
|
|
|
|
* For IO-APIC's with EOI register, we use that to do an explicit EOI.
|
|
|
|
* Otherwise, we simulate the EOI message manually by changing the trigger
|
|
|
|
* mode to edge and then back to level, with RTE being masked during this.
|
|
|
|
*/
|
2015-04-14 10:29:43 +08:00
|
|
|
static void __eoi_ioapic_pin(int apic, int pin, int vector)
|
2011-08-26 03:01:13 +08:00
|
|
|
{
|
|
|
|
if (mpc_ioapic_ver(apic) >= 0x20) {
|
2012-09-26 18:44:50 +08:00
|
|
|
io_apic_eoi(apic, vector);
|
2011-08-26 03:01:13 +08:00
|
|
|
} else {
|
|
|
|
struct IO_APIC_route_entry entry, entry1;
|
|
|
|
|
|
|
|
entry = entry1 = __ioapic_read_entry(apic, pin);
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Mask the entry and change the trigger mode to edge.
|
|
|
|
*/
|
2015-04-14 10:30:06 +08:00
|
|
|
entry1.mask = IOAPIC_MASKED;
|
2011-08-26 03:01:13 +08:00
|
|
|
entry1.trigger = IOAPIC_EDGE;
|
|
|
|
|
|
|
|
__ioapic_write_entry(apic, pin, entry1);
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Restore the previous level triggered entry.
|
|
|
|
*/
|
|
|
|
__ioapic_write_entry(apic, pin, entry);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-10-09 22:24:45 +08:00
|
|
|
static void eoi_ioapic_pin(int vector, struct mp_chip_data *data)
|
2015-04-13 14:11:59 +08:00
|
|
|
{
|
|
|
|
unsigned long flags;
|
|
|
|
struct irq_pin_list *entry;
|
|
|
|
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin)
|
2015-04-14 10:29:43 +08:00
|
|
|
__eoi_ioapic_pin(entry->apic, entry->pin, vector);
|
2011-08-26 03:01:13 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
static void clear_IO_APIC_pin(unsigned int apic, unsigned int pin)
|
|
|
|
{
|
|
|
|
struct IO_APIC_route_entry entry;
|
2008-06-08 19:07:18 +08:00
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/* Check delivery_mode to be sure we're not clearing an SMI pin */
|
2006-09-26 16:52:30 +08:00
|
|
|
entry = ioapic_read_entry(apic, pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
if (entry.delivery_mode == dest_SMI)
|
|
|
|
return;
|
2011-08-26 03:01:11 +08:00
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
2011-08-26 03:01:11 +08:00
|
|
|
* Make sure the entry is masked and re-read the contents to check
|
|
|
|
* if it is a level triggered pin and if the remote-IRR is set.
|
|
|
|
*/
|
2015-04-14 10:30:06 +08:00
|
|
|
if (entry.mask == IOAPIC_UNMASKED) {
|
|
|
|
entry.mask = IOAPIC_MASKED;
|
2011-08-26 03:01:11 +08:00
|
|
|
ioapic_write_entry(apic, pin, entry);
|
|
|
|
entry = ioapic_read_entry(apic, pin);
|
|
|
|
}
|
|
|
|
|
|
|
|
if (entry.irr) {
|
2011-08-26 03:01:13 +08:00
|
|
|
unsigned long flags;
|
|
|
|
|
2011-08-26 03:01:11 +08:00
|
|
|
/*
|
|
|
|
* Make sure the trigger mode is set to level. Explicit EOI
|
|
|
|
* doesn't clear the remote-IRR if the trigger mode is not
|
|
|
|
* set to level.
|
|
|
|
*/
|
2015-04-14 10:30:06 +08:00
|
|
|
if (entry.trigger == IOAPIC_EDGE) {
|
2011-08-26 03:01:11 +08:00
|
|
|
entry.trigger = IOAPIC_LEVEL;
|
|
|
|
ioapic_write_entry(apic, pin, entry);
|
|
|
|
}
|
2011-08-26 03:01:13 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:43 +08:00
|
|
|
__eoi_ioapic_pin(apic, pin, entry.vector);
|
2011-08-26 03:01:13 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2011-08-26 03:01:11 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Clear the rest of the bits in the IO-APIC RTE except for the mask
|
|
|
|
* bit.
|
2005-04-17 06:20:36 +08:00
|
|
|
*/
|
2006-11-02 02:05:35 +08:00
|
|
|
ioapic_mask_entry(apic, pin);
|
2011-08-26 03:01:11 +08:00
|
|
|
entry = ioapic_read_entry(apic, pin);
|
|
|
|
if (entry.irr)
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_err("Unable to reset IRR for apic: %d, pin :%d\n",
|
2011-08-26 03:01:11 +08:00
|
|
|
mpc_ioapic_id(apic), pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
static void clear_IO_APIC (void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
int apic, pin;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic_pin(apic, pin)
|
|
|
|
clear_IO_APIC_pin(apic, pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* support for broken MP BIOSs, enables hand-redirection of PIRQ0-7 to
|
|
|
|
* specific CPU-side IRQs.
|
|
|
|
*/
|
|
|
|
|
|
|
|
#define MAX_PIRQS 8
|
2009-02-15 18:54:03 +08:00
|
|
|
static int pirq_entries[MAX_PIRQS] = {
|
|
|
|
[0 ... MAX_PIRQS - 1] = -1
|
|
|
|
};
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
static int __init ioapic_pirq_setup(char *str)
|
|
|
|
{
|
|
|
|
int i, max;
|
|
|
|
int ints[MAX_PIRQS+1];
|
|
|
|
|
|
|
|
get_options(str, ARRAY_SIZE(ints), ints);
|
|
|
|
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_INFO
|
|
|
|
"PIRQ redirection, working around broken MP-BIOS.\n");
|
|
|
|
max = MAX_PIRQS;
|
|
|
|
if (ints[0] < MAX_PIRQS)
|
|
|
|
max = ints[0];
|
|
|
|
|
|
|
|
for (i = 0; i < max; i++) {
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_DEBUG
|
|
|
|
"... PIRQ%d -> IRQ %d\n", i, ints[i+1]);
|
|
|
|
/*
|
|
|
|
* PIRQs are mapped upside down, usually.
|
|
|
|
*/
|
|
|
|
pirq_entries[MAX_PIRQS-i-1] = ints[i+1];
|
|
|
|
}
|
|
|
|
return 1;
|
|
|
|
}
|
|
|
|
|
|
|
|
__setup("pirq=", ioapic_pirq_setup);
|
2008-08-20 15:07:45 +08:00
|
|
|
#endif /* CONFIG_X86_32 */
|
|
|
|
|
|
|
|
/*
|
2009-03-17 08:05:03 +08:00
|
|
|
* Saves all the IO-APIC RTE's
|
2008-08-20 15:07:45 +08:00
|
|
|
*/
|
2011-05-19 07:31:33 +08:00
|
|
|
int save_ioapic_entries(void)
|
2008-08-20 15:07:45 +08:00
|
|
|
{
|
|
|
|
int apic, pin;
|
2011-05-19 07:31:33 +08:00
|
|
|
int err = 0;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(apic) {
|
2011-05-19 07:31:36 +08:00
|
|
|
if (!ioapics[apic].saved_registers) {
|
2011-05-19 07:31:33 +08:00
|
|
|
err = -ENOMEM;
|
|
|
|
continue;
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_pin(apic, pin)
|
2011-05-19 07:31:36 +08:00
|
|
|
ioapics[apic].saved_registers[pin] =
|
2008-08-20 15:07:45 +08:00
|
|
|
ioapic_read_entry(apic, pin);
|
2009-03-28 05:22:44 +08:00
|
|
|
}
|
2008-09-19 03:37:57 +08:00
|
|
|
|
2011-05-19 07:31:33 +08:00
|
|
|
return err;
|
2008-08-20 15:07:45 +08:00
|
|
|
}
|
|
|
|
|
2009-03-28 05:22:44 +08:00
|
|
|
/*
|
|
|
|
* Mask all IO APIC entries.
|
|
|
|
*/
|
2011-05-19 07:31:33 +08:00
|
|
|
void mask_ioapic_entries(void)
|
2009-03-17 08:05:03 +08:00
|
|
|
{
|
|
|
|
int apic, pin;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(apic) {
|
2011-05-25 01:45:31 +08:00
|
|
|
if (!ioapics[apic].saved_registers)
|
2011-05-19 07:31:33 +08:00
|
|
|
continue;
|
2009-03-28 05:22:44 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_pin(apic, pin) {
|
2009-03-17 08:05:03 +08:00
|
|
|
struct IO_APIC_route_entry entry;
|
|
|
|
|
2011-05-19 07:31:36 +08:00
|
|
|
entry = ioapics[apic].saved_registers[pin];
|
2015-04-14 10:30:06 +08:00
|
|
|
if (entry.mask == IOAPIC_UNMASKED) {
|
|
|
|
entry.mask = IOAPIC_MASKED;
|
2009-03-17 08:05:03 +08:00
|
|
|
ioapic_write_entry(apic, pin, entry);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2009-03-28 05:22:44 +08:00
|
|
|
/*
|
2011-05-19 07:31:36 +08:00
|
|
|
* Restore IO APIC entries which was saved in the ioapic structure.
|
2009-03-28 05:22:44 +08:00
|
|
|
*/
|
2011-05-19 07:31:33 +08:00
|
|
|
int restore_ioapic_entries(void)
|
2008-08-20 15:07:45 +08:00
|
|
|
{
|
|
|
|
int apic, pin;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(apic) {
|
2011-05-25 01:45:31 +08:00
|
|
|
if (!ioapics[apic].saved_registers)
|
2011-05-19 07:31:33 +08:00
|
|
|
continue;
|
2009-03-28 05:22:44 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_pin(apic, pin)
|
2008-08-20 15:07:45 +08:00
|
|
|
ioapic_write_entry(apic, pin,
|
2011-05-19 07:31:36 +08:00
|
|
|
ioapics[apic].saved_registers[pin]);
|
2008-09-19 03:37:57 +08:00
|
|
|
}
|
2009-03-28 05:22:44 +08:00
|
|
|
return 0;
|
2008-08-20 15:07:45 +08:00
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* Find the IRQ entry number of a certain pin.
|
|
|
|
*/
|
2011-10-12 15:33:48 +08:00
|
|
|
static int find_irq_entry(int ioapic_idx, int pin, int type)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
for (i = 0; i < mp_irq_entries; i++)
|
2009-01-12 20:17:22 +08:00
|
|
|
if (mp_irqs[i].irqtype == type &&
|
2011-10-12 15:33:48 +08:00
|
|
|
(mp_irqs[i].dstapic == mpc_ioapic_id(ioapic_idx) ||
|
2009-01-12 20:17:22 +08:00
|
|
|
mp_irqs[i].dstapic == MP_APIC_ALL) &&
|
|
|
|
mp_irqs[i].dstirq == pin)
|
2005-04-17 06:20:36 +08:00
|
|
|
return i;
|
|
|
|
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Find the pin to which IRQ[irq] (ISA) is connected
|
|
|
|
*/
|
2005-10-31 06:59:39 +08:00
|
|
|
static int __init find_isa_irq_pin(int irq, int type)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
for (i = 0; i < mp_irq_entries; i++) {
|
2009-01-12 20:17:22 +08:00
|
|
|
int lbus = mp_irqs[i].srcbus;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-03-20 19:54:18 +08:00
|
|
|
if (test_bit(lbus, mp_bus_not_pci) &&
|
2009-01-12 20:17:22 +08:00
|
|
|
(mp_irqs[i].irqtype == type) &&
|
|
|
|
(mp_irqs[i].srcbusirq == irq))
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2009-01-12 20:17:22 +08:00
|
|
|
return mp_irqs[i].dstirq;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
2005-10-31 06:59:39 +08:00
|
|
|
static int __init find_isa_irq_apic(int irq, int type)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
for (i = 0; i < mp_irq_entries; i++) {
|
2009-01-12 20:17:22 +08:00
|
|
|
int lbus = mp_irqs[i].srcbus;
|
2005-10-31 06:59:39 +08:00
|
|
|
|
2008-03-20 19:54:24 +08:00
|
|
|
if (test_bit(lbus, mp_bus_not_pci) &&
|
2009-01-12 20:17:22 +08:00
|
|
|
(mp_irqs[i].irqtype == type) &&
|
|
|
|
(mp_irqs[i].srcbusirq == irq))
|
2005-10-31 06:59:39 +08:00
|
|
|
break;
|
|
|
|
}
|
2011-10-12 15:33:48 +08:00
|
|
|
|
2005-10-31 06:59:39 +08:00
|
|
|
if (i < mp_irq_entries) {
|
2011-10-12 15:33:48 +08:00
|
|
|
int ioapic_idx;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(ioapic_idx)
|
2011-10-12 15:33:48 +08:00
|
|
|
if (mpc_ioapic_id(ioapic_idx) == mp_irqs[i].dstapic)
|
|
|
|
return ioapic_idx;
|
2005-10-31 06:59:39 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
2012-05-18 07:06:13 +08:00
|
|
|
#ifdef CONFIG_EISA
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* EISA Edge/Level control register, ELCR
|
|
|
|
*/
|
|
|
|
static int EISA_ELCR(unsigned int irq)
|
|
|
|
{
|
2014-06-09 16:19:48 +08:00
|
|
|
if (irq < nr_legacy_irqs()) {
|
2005-04-17 06:20:36 +08:00
|
|
|
unsigned int port = 0x4d0 + (irq >> 3);
|
|
|
|
return (inb(port) >> (irq & 7)) & 1;
|
|
|
|
}
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_INFO
|
|
|
|
"Broken MPtable reports ISA irq %d\n", irq);
|
|
|
|
return 0;
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2008-03-20 19:55:02 +08:00
|
|
|
#endif
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2015-04-14 10:30:06 +08:00
|
|
|
/* ISA interrupts are always active high edge triggered,
|
2008-03-20 19:54:36 +08:00
|
|
|
* when listed as conforming in the MP table. */
|
|
|
|
|
2015-04-14 10:30:06 +08:00
|
|
|
#define default_ISA_trigger(idx) (IOAPIC_EDGE)
|
|
|
|
#define default_ISA_polarity(idx) (IOAPIC_POL_HIGH)
|
2008-03-20 19:54:36 +08:00
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/* EISA interrupts are always polarity zero and can be edge or level
|
|
|
|
* trigger depending on the ELCR value. If an interrupt is listed as
|
|
|
|
* EISA conforming in the MP table, that means its trigger type must
|
|
|
|
* be read in from the ELCR */
|
|
|
|
|
2009-01-12 20:17:22 +08:00
|
|
|
#define default_EISA_trigger(idx) (EISA_ELCR(mp_irqs[idx].srcbusirq))
|
2008-03-20 19:54:36 +08:00
|
|
|
#define default_EISA_polarity(idx) default_ISA_polarity(idx)
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2015-04-14 10:30:06 +08:00
|
|
|
/* PCI interrupts are always active low level triggered,
|
2005-04-17 06:20:36 +08:00
|
|
|
* when listed as conforming in the MP table. */
|
|
|
|
|
2015-04-14 10:30:06 +08:00
|
|
|
#define default_PCI_trigger(idx) (IOAPIC_LEVEL)
|
|
|
|
#define default_PCI_polarity(idx) (IOAPIC_POL_LOW)
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-02-24 00:33:53 +08:00
|
|
|
static int irq_polarity(int idx)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2009-01-12 20:17:22 +08:00
|
|
|
int bus = mp_irqs[idx].srcbus;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Determine IRQ line polarity (high active or low active):
|
|
|
|
*/
|
2015-04-14 10:30:07 +08:00
|
|
|
switch (mp_irqs[idx].irqflag & 0x03) {
|
|
|
|
case 0:
|
|
|
|
/* conforms to spec, ie. bus-type dependent polarity */
|
|
|
|
if (test_bit(bus, mp_bus_not_pci))
|
|
|
|
return default_ISA_polarity(idx);
|
|
|
|
else
|
|
|
|
return default_PCI_polarity(idx);
|
|
|
|
case 1:
|
|
|
|
return IOAPIC_POL_HIGH;
|
|
|
|
case 2:
|
|
|
|
pr_warn("IOAPIC: Invalid polarity: 2, defaulting to low\n");
|
|
|
|
case 3:
|
|
|
|
default: /* Pointless default required due to do gcc stupidity */
|
|
|
|
return IOAPIC_POL_LOW;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:30:07 +08:00
|
|
|
#ifdef CONFIG_EISA
|
|
|
|
static int eisa_irq_trigger(int idx, int bus, int trigger)
|
|
|
|
{
|
|
|
|
switch (mp_bus_id_to_type[bus]) {
|
|
|
|
case MP_BUS_PCI:
|
|
|
|
case MP_BUS_ISA:
|
|
|
|
return trigger;
|
|
|
|
case MP_BUS_EISA:
|
|
|
|
return default_EISA_trigger(idx);
|
|
|
|
}
|
|
|
|
pr_warn("IOAPIC: Invalid srcbus: %d defaulting to level\n", bus);
|
|
|
|
return IOAPIC_LEVEL;
|
|
|
|
}
|
|
|
|
#else
|
|
|
|
static inline int eisa_irq_trigger(int idx, int bus, int trigger)
|
|
|
|
{
|
|
|
|
return trigger;
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2011-02-24 00:33:53 +08:00
|
|
|
static int irq_trigger(int idx)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2009-01-12 20:17:22 +08:00
|
|
|
int bus = mp_irqs[idx].srcbus;
|
2005-04-17 06:20:36 +08:00
|
|
|
int trigger;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Determine IRQ trigger mode (edge or level sensitive):
|
|
|
|
*/
|
2015-04-14 10:30:07 +08:00
|
|
|
switch ((mp_irqs[idx].irqflag >> 2) & 0x03) {
|
|
|
|
case 0:
|
|
|
|
/* conforms to spec, ie. bus-type dependent trigger mode */
|
|
|
|
if (test_bit(bus, mp_bus_not_pci))
|
|
|
|
trigger = default_ISA_trigger(idx);
|
|
|
|
else
|
|
|
|
trigger = default_PCI_trigger(idx);
|
|
|
|
/* Take EISA into account */
|
|
|
|
return eisa_irq_trigger(idx, bus, trigger);
|
|
|
|
case 1:
|
|
|
|
return IOAPIC_EDGE;
|
|
|
|
case 2:
|
|
|
|
pr_warn("IOAPIC: Invalid trigger mode 2 defaulting to level\n");
|
|
|
|
case 3:
|
|
|
|
default: /* Pointless default required due to do gcc stupidity */
|
|
|
|
return IOAPIC_LEVEL;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:54 +08:00
|
|
|
void ioapic_set_alloc_attr(struct irq_alloc_info *info, int node,
|
|
|
|
int trigger, int polarity)
|
|
|
|
{
|
|
|
|
init_irq_alloc_info(info, NULL);
|
|
|
|
info->type = X86_IRQ_ALLOC_TYPE_IOAPIC;
|
|
|
|
info->ioapic_node = node;
|
|
|
|
info->ioapic_trigger = trigger;
|
|
|
|
info->ioapic_polarity = polarity;
|
|
|
|
info->ioapic_valid = 1;
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:58 +08:00
|
|
|
#ifndef CONFIG_ACPI
|
|
|
|
int acpi_get_override_irq(u32 gsi, int *trigger, int *polarity);
|
|
|
|
#endif
|
|
|
|
|
|
|
|
static void ioapic_copy_alloc_attr(struct irq_alloc_info *dst,
|
|
|
|
struct irq_alloc_info *src,
|
|
|
|
u32 gsi, int ioapic_idx, int pin)
|
|
|
|
{
|
|
|
|
int trigger, polarity;
|
|
|
|
|
|
|
|
copy_irq_alloc_info(dst, src);
|
|
|
|
dst->type = X86_IRQ_ALLOC_TYPE_IOAPIC;
|
|
|
|
dst->ioapic_id = mpc_ioapic_id(ioapic_idx);
|
|
|
|
dst->ioapic_pin = pin;
|
|
|
|
dst->ioapic_valid = 1;
|
|
|
|
if (src && src->ioapic_valid) {
|
|
|
|
dst->ioapic_node = src->ioapic_node;
|
|
|
|
dst->ioapic_trigger = src->ioapic_trigger;
|
|
|
|
dst->ioapic_polarity = src->ioapic_polarity;
|
|
|
|
} else {
|
|
|
|
dst->ioapic_node = NUMA_NO_NODE;
|
|
|
|
if (acpi_get_override_irq(gsi, &trigger, &polarity) >= 0) {
|
|
|
|
dst->ioapic_trigger = trigger;
|
|
|
|
dst->ioapic_polarity = polarity;
|
|
|
|
} else {
|
|
|
|
/*
|
2015-04-14 10:30:06 +08:00
|
|
|
* PCI interrupts are always active low level
|
2015-04-13 14:11:58 +08:00
|
|
|
* triggered.
|
|
|
|
*/
|
2015-04-14 10:30:06 +08:00
|
|
|
dst->ioapic_trigger = IOAPIC_LEVEL;
|
|
|
|
dst->ioapic_polarity = IOAPIC_POL_LOW;
|
2015-04-13 14:11:58 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static int ioapic_alloc_attr_node(struct irq_alloc_info *info)
|
|
|
|
{
|
|
|
|
return (info && info->ioapic_valid) ? info->ioapic_node : NUMA_NO_NODE;
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:55 +08:00
|
|
|
static void mp_register_handler(unsigned int irq, unsigned long trigger)
|
|
|
|
{
|
|
|
|
irq_flow_handler_t hdl;
|
|
|
|
bool fasteoi;
|
|
|
|
|
|
|
|
if (trigger) {
|
|
|
|
irq_set_status_flags(irq, IRQ_LEVEL);
|
|
|
|
fasteoi = true;
|
|
|
|
} else {
|
|
|
|
irq_clear_status_flags(irq, IRQ_LEVEL);
|
|
|
|
fasteoi = false;
|
|
|
|
}
|
|
|
|
|
|
|
|
hdl = fasteoi ? handle_fasteoi_irq : handle_edge_irq;
|
|
|
|
__irq_set_handler(irq, hdl, 0, fasteoi ? "fasteoi" : "edge");
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:58 +08:00
|
|
|
static bool mp_check_pin_attr(int irq, struct irq_alloc_info *info)
|
|
|
|
{
|
|
|
|
struct mp_chip_data *data = irq_get_chip_data(irq);
|
|
|
|
|
|
|
|
/*
|
|
|
|
* setup_IO_APIC_irqs() programs all legacy IRQs with default trigger
|
|
|
|
* and polarity attirbutes. So allow the first user to reprogram the
|
|
|
|
* pin with real trigger and polarity attributes.
|
|
|
|
*/
|
|
|
|
if (irq < nr_legacy_irqs() && data->count == 1) {
|
|
|
|
if (info->ioapic_trigger != data->trigger)
|
2015-07-30 15:51:32 +08:00
|
|
|
mp_register_handler(irq, info->ioapic_trigger);
|
2015-04-13 14:11:58 +08:00
|
|
|
data->entry.trigger = data->trigger = info->ioapic_trigger;
|
|
|
|
data->entry.polarity = data->polarity = info->ioapic_polarity;
|
|
|
|
}
|
|
|
|
|
|
|
|
return data->trigger == info->ioapic_trigger &&
|
|
|
|
data->polarity == info->ioapic_polarity;
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
static int alloc_irq_from_domain(struct irq_domain *domain, int ioapic, u32 gsi,
|
2015-04-13 14:11:54 +08:00
|
|
|
struct irq_alloc_info *info)
|
2014-06-10 14:13:25 +08:00
|
|
|
{
|
2015-04-13 14:11:59 +08:00
|
|
|
bool legacy = false;
|
2014-06-09 16:19:52 +08:00
|
|
|
int irq = -1;
|
|
|
|
int type = ioapics[ioapic].irqdomain_cfg.type;
|
|
|
|
|
|
|
|
switch (type) {
|
|
|
|
case IOAPIC_DOMAIN_LEGACY:
|
|
|
|
/*
|
2015-04-13 14:11:59 +08:00
|
|
|
* Dynamically allocate IRQ number for non-ISA IRQs in the first
|
|
|
|
* 16 GSIs on some weird platforms.
|
2014-06-09 16:19:52 +08:00
|
|
|
*/
|
2015-04-13 14:11:59 +08:00
|
|
|
if (!ioapic_initialized || gsi >= nr_legacy_irqs())
|
2014-06-09 16:19:52 +08:00
|
|
|
irq = gsi;
|
2015-04-13 14:11:59 +08:00
|
|
|
legacy = mp_is_legacy_irq(irq);
|
2014-06-09 16:19:52 +08:00
|
|
|
break;
|
|
|
|
case IOAPIC_DOMAIN_STRICT:
|
2015-04-13 14:11:59 +08:00
|
|
|
irq = gsi;
|
2014-06-09 16:19:52 +08:00
|
|
|
break;
|
|
|
|
case IOAPIC_DOMAIN_DYNAMIC:
|
|
|
|
break;
|
|
|
|
default:
|
|
|
|
WARN(1, "ioapic: unknown irqdomain type %d\n", type);
|
2015-04-13 14:11:59 +08:00
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
return __irq_domain_alloc_irqs(domain, irq, 1,
|
|
|
|
ioapic_alloc_attr_node(info),
|
2016-07-04 16:39:24 +08:00
|
|
|
info, legacy, NULL);
|
2015-04-13 14:11:59 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Need special handling for ISA IRQs because there may be multiple IOAPIC pins
|
|
|
|
* sharing the same ISA IRQ number and irqdomain only supports 1:1 mapping
|
|
|
|
* between IOAPIC pin and IRQ number. A typical IOAPIC has 24 pins, pin 0-15 are
|
|
|
|
* used for legacy IRQs and pin 16-23 are used for PCI IRQs (PIRQ A-H).
|
|
|
|
* When ACPI is disabled, only legacy IRQ numbers (IRQ0-15) are available, and
|
|
|
|
* some BIOSes may use MP Interrupt Source records to override IRQ numbers for
|
|
|
|
* PIRQs instead of reprogramming the interrupt routing logic. Thus there may be
|
|
|
|
* multiple pins sharing the same legacy IRQ number when ACPI is disabled.
|
|
|
|
*/
|
|
|
|
static int alloc_isa_irq_from_domain(struct irq_domain *domain,
|
|
|
|
int irq, int ioapic, int pin,
|
|
|
|
struct irq_alloc_info *info)
|
|
|
|
{
|
|
|
|
struct mp_chip_data *data;
|
|
|
|
struct irq_data *irq_data = irq_get_irq_data(irq);
|
|
|
|
int node = ioapic_alloc_attr_node(info);
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Legacy ISA IRQ has already been allocated, just add pin to
|
|
|
|
* the pin list assoicated with this IRQ and program the IOAPIC
|
|
|
|
* entry. The IOAPIC entry
|
|
|
|
*/
|
|
|
|
if (irq_data && irq_data->parent_data) {
|
|
|
|
if (!mp_check_pin_attr(irq, info))
|
|
|
|
return -EBUSY;
|
2015-04-14 10:29:53 +08:00
|
|
|
if (__add_pin_to_irq_node(irq_data->chip_data, node, ioapic,
|
|
|
|
info->ioapic_pin))
|
2015-04-13 14:11:59 +08:00
|
|
|
return -ENOMEM;
|
|
|
|
} else {
|
2016-07-04 16:39:24 +08:00
|
|
|
irq = __irq_domain_alloc_irqs(domain, irq, 1, node, info, true,
|
|
|
|
NULL);
|
2015-04-13 14:11:59 +08:00
|
|
|
if (irq >= 0) {
|
|
|
|
irq_data = irq_domain_get_irq_data(domain, irq);
|
|
|
|
data = irq_data->chip_data;
|
|
|
|
data->isa_irq = true;
|
|
|
|
}
|
2014-06-09 16:19:52 +08:00
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
return irq;
|
2014-06-09 16:19:52 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
static int mp_map_pin_to_irq(u32 gsi, int idx, int ioapic, int pin,
|
2015-04-13 14:11:54 +08:00
|
|
|
unsigned int flags, struct irq_alloc_info *info)
|
2014-06-09 16:19:52 +08:00
|
|
|
{
|
|
|
|
int irq;
|
2015-04-13 14:11:59 +08:00
|
|
|
bool legacy = false;
|
|
|
|
struct irq_alloc_info tmp;
|
|
|
|
struct mp_chip_data *data;
|
2014-06-09 16:19:52 +08:00
|
|
|
struct irq_domain *domain = mp_ioapic_irqdomain(ioapic);
|
|
|
|
|
2014-06-09 16:20:11 +08:00
|
|
|
if (!domain)
|
2015-04-13 14:11:59 +08:00
|
|
|
return -ENOSYS;
|
2014-06-09 16:20:04 +08:00
|
|
|
|
|
|
|
if (idx >= 0 && test_bit(mp_irqs[idx].srcbus, mp_bus_not_pci)) {
|
|
|
|
irq = mp_irqs[idx].srcbusirq;
|
2015-04-13 14:11:59 +08:00
|
|
|
legacy = mp_is_legacy_irq(irq);
|
|
|
|
}
|
2014-06-09 16:20:04 +08:00
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
mutex_lock(&ioapic_mutex);
|
|
|
|
if (!(flags & IOAPIC_MAP_ALLOC)) {
|
|
|
|
if (!legacy) {
|
|
|
|
irq = irq_find_mapping(domain, pin);
|
2014-06-09 16:20:04 +08:00
|
|
|
if (irq == 0)
|
2015-04-13 14:11:59 +08:00
|
|
|
irq = -ENOENT;
|
2014-06-09 16:20:04 +08:00
|
|
|
}
|
|
|
|
} else {
|
2015-04-13 14:11:59 +08:00
|
|
|
ioapic_copy_alloc_attr(&tmp, info, gsi, ioapic, pin);
|
|
|
|
if (legacy)
|
|
|
|
irq = alloc_isa_irq_from_domain(domain, irq,
|
|
|
|
ioapic, pin, &tmp);
|
|
|
|
else if ((irq = irq_find_mapping(domain, pin)) == 0)
|
|
|
|
irq = alloc_irq_from_domain(domain, ioapic, gsi, &tmp);
|
|
|
|
else if (!mp_check_pin_attr(irq, &tmp))
|
|
|
|
irq = -EBUSY;
|
|
|
|
if (irq >= 0) {
|
|
|
|
data = irq_get_chip_data(irq);
|
|
|
|
data->count++;
|
|
|
|
}
|
2014-06-09 16:19:58 +08:00
|
|
|
}
|
2014-06-09 16:19:52 +08:00
|
|
|
mutex_unlock(&ioapic_mutex);
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
return irq;
|
2014-06-10 14:13:25 +08:00
|
|
|
}
|
|
|
|
|
2014-06-09 16:19:52 +08:00
|
|
|
static int pin_2_irq(int idx, int ioapic, int pin, unsigned int flags)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2014-06-09 16:19:52 +08:00
|
|
|
u32 gsi = mp_pin_to_gsi(ioapic, pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Debugging check, we are in big trouble if this message pops up!
|
|
|
|
*/
|
2009-01-12 20:17:22 +08:00
|
|
|
if (mp_irqs[idx].dstirq != pin)
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_err("broken BIOS or MPTABLE parser, ayiee!!\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* PCI IRQ command line redirection. Yes, limits are hardcoded.
|
|
|
|
*/
|
|
|
|
if ((pin >= 16) && (pin <= 23)) {
|
|
|
|
if (pirq_entries[pin-16] != -1) {
|
|
|
|
if (!pirq_entries[pin-16]) {
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_DEBUG
|
|
|
|
"disabling PIRQ%d\n", pin-16);
|
|
|
|
} else {
|
2014-06-09 16:19:52 +08:00
|
|
|
int irq = pirq_entries[pin-16];
|
2005-04-17 06:20:36 +08:00
|
|
|
apic_printk(APIC_VERBOSE, KERN_DEBUG
|
|
|
|
"using PIRQ%d -> IRQ %d\n",
|
|
|
|
pin-16, irq);
|
2014-06-10 14:13:25 +08:00
|
|
|
return irq;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
#endif
|
|
|
|
|
2015-04-13 14:11:54 +08:00
|
|
|
return mp_map_pin_to_irq(gsi, idx, ioapic, pin, flags, NULL);
|
2014-06-09 16:19:52 +08:00
|
|
|
}
|
2014-06-10 14:13:25 +08:00
|
|
|
|
2015-04-14 10:29:56 +08:00
|
|
|
int mp_map_gsi_to_irq(u32 gsi, unsigned int flags, struct irq_alloc_info *info)
|
2014-06-09 16:19:52 +08:00
|
|
|
{
|
|
|
|
int ioapic, pin, idx;
|
|
|
|
|
|
|
|
ioapic = mp_find_ioapic(gsi);
|
|
|
|
if (ioapic < 0)
|
2017-01-20 03:24:22 +08:00
|
|
|
return -ENODEV;
|
2014-06-09 16:19:52 +08:00
|
|
|
|
|
|
|
pin = mp_find_ioapic_pin(ioapic, gsi);
|
|
|
|
idx = find_irq_entry(ioapic, pin, mp_INT);
|
|
|
|
if ((flags & IOAPIC_MAP_CHECK) && idx < 0)
|
2017-01-20 03:24:22 +08:00
|
|
|
return -ENODEV;
|
2014-06-09 16:19:52 +08:00
|
|
|
|
2015-04-13 14:11:54 +08:00
|
|
|
return mp_map_pin_to_irq(gsi, idx, ioapic, pin, flags, info);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2014-06-09 16:20:06 +08:00
|
|
|
void mp_unmap_irq(int irq)
|
|
|
|
{
|
2015-04-13 14:11:59 +08:00
|
|
|
struct irq_data *irq_data = irq_get_irq_data(irq);
|
|
|
|
struct mp_chip_data *data;
|
2014-06-09 16:20:06 +08:00
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
if (!irq_data || !irq_data->domain)
|
2014-06-09 16:20:06 +08:00
|
|
|
return;
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
data = irq_data->chip_data;
|
|
|
|
if (!data || data->isa_irq)
|
|
|
|
return;
|
2014-06-09 16:20:06 +08:00
|
|
|
|
|
|
|
mutex_lock(&ioapic_mutex);
|
2015-04-13 14:11:59 +08:00
|
|
|
if (--data->count == 0)
|
|
|
|
irq_domain_free_irqs(irq, 1);
|
2014-06-09 16:20:06 +08:00
|
|
|
mutex_unlock(&ioapic_mutex);
|
|
|
|
}
|
|
|
|
|
2009-05-07 01:08:22 +08:00
|
|
|
/*
|
|
|
|
* Find a specific PCI IRQ entry.
|
|
|
|
* Not an __init, possibly needed by modules
|
|
|
|
*/
|
2014-10-27 16:11:54 +08:00
|
|
|
int IO_APIC_get_PCI_irq_vector(int bus, int slot, int pin)
|
2009-05-07 01:08:22 +08:00
|
|
|
{
|
2014-06-09 16:19:52 +08:00
|
|
|
int irq, i, best_ioapic = -1, best_idx = -1;
|
2009-05-07 01:08:22 +08:00
|
|
|
|
|
|
|
apic_printk(APIC_DEBUG,
|
|
|
|
"querying PCI -> IRQ mapping bus:%d, slot:%d, pin:%d.\n",
|
|
|
|
bus, slot, pin);
|
|
|
|
if (test_bit(bus, mp_bus_not_pci)) {
|
|
|
|
apic_printk(APIC_VERBOSE,
|
|
|
|
"PCI BIOS passed nonexistent PCI bus %d!\n", bus);
|
|
|
|
return -1;
|
|
|
|
}
|
2014-06-09 16:19:44 +08:00
|
|
|
|
2009-05-07 01:08:22 +08:00
|
|
|
for (i = 0; i < mp_irq_entries; i++) {
|
|
|
|
int lbus = mp_irqs[i].srcbus;
|
2014-06-09 16:19:44 +08:00
|
|
|
int ioapic_idx, found = 0;
|
|
|
|
|
|
|
|
if (bus != lbus || mp_irqs[i].irqtype != mp_INT ||
|
|
|
|
slot != ((mp_irqs[i].srcbusirq >> 2) & 0x1f))
|
|
|
|
continue;
|
2009-05-07 01:08:22 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(ioapic_idx)
|
2011-10-12 15:33:48 +08:00
|
|
|
if (mpc_ioapic_id(ioapic_idx) == mp_irqs[i].dstapic ||
|
2014-06-09 16:19:44 +08:00
|
|
|
mp_irqs[i].dstapic == MP_APIC_ALL) {
|
|
|
|
found = 1;
|
2009-05-07 01:08:22 +08:00
|
|
|
break;
|
|
|
|
}
|
2014-06-09 16:19:44 +08:00
|
|
|
if (!found)
|
|
|
|
continue;
|
|
|
|
|
|
|
|
/* Skip ISA IRQs */
|
2014-06-09 16:19:52 +08:00
|
|
|
irq = pin_2_irq(i, ioapic_idx, mp_irqs[i].dstirq, 0);
|
|
|
|
if (irq > 0 && !IO_APIC_IRQ(irq))
|
2014-06-09 16:19:44 +08:00
|
|
|
continue;
|
|
|
|
|
|
|
|
if (pin == (mp_irqs[i].srcbusirq & 3)) {
|
2014-06-09 16:19:52 +08:00
|
|
|
best_idx = i;
|
|
|
|
best_ioapic = ioapic_idx;
|
|
|
|
goto out;
|
2014-06-09 16:19:44 +08:00
|
|
|
}
|
2014-06-09 16:19:52 +08:00
|
|
|
|
2014-06-09 16:19:44 +08:00
|
|
|
/*
|
|
|
|
* Use the first all-but-pin matching entry as a
|
|
|
|
* best-guess fuzzy result for broken mptables.
|
|
|
|
*/
|
2014-06-09 16:19:52 +08:00
|
|
|
if (best_idx < 0) {
|
|
|
|
best_idx = i;
|
|
|
|
best_ioapic = ioapic_idx;
|
2009-05-07 01:08:22 +08:00
|
|
|
}
|
|
|
|
}
|
2014-06-09 16:19:52 +08:00
|
|
|
if (best_idx < 0)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
out:
|
2014-10-27 16:11:54 +08:00
|
|
|
return pin_2_irq(best_idx, best_ioapic, mp_irqs[best_idx].dstirq,
|
|
|
|
IOAPIC_MAP_ALLOC);
|
2009-05-07 01:08:22 +08:00
|
|
|
}
|
|
|
|
EXPORT_SYMBOL(IO_APIC_get_PCI_irq_vector);
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
static struct irq_chip ioapic_chip, ioapic_ir_chip;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-02-23 21:31:36 +08:00
|
|
|
static void __init setup_IO_APIC_irqs(void)
|
|
|
|
{
|
2014-06-09 16:20:04 +08:00
|
|
|
unsigned int ioapic, pin;
|
|
|
|
int idx;
|
2011-02-23 21:31:36 +08:00
|
|
|
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_DEBUG "init IO_APIC IRQs\n");
|
|
|
|
|
2014-06-09 16:20:04 +08:00
|
|
|
for_each_ioapic_pin(ioapic, pin) {
|
|
|
|
idx = find_irq_entry(ioapic, pin, mp_INT);
|
|
|
|
if (idx < 0)
|
|
|
|
apic_printk(APIC_VERBOSE,
|
|
|
|
KERN_DEBUG " apic %d pin %d not connected\n",
|
|
|
|
mpc_ioapic_id(ioapic), pin);
|
|
|
|
else
|
|
|
|
pin_2_irq(idx, ioapic, pin,
|
|
|
|
ioapic ? 0 : IOAPIC_MAP_ALLOC);
|
|
|
|
}
|
2011-02-23 21:31:36 +08:00
|
|
|
}
|
|
|
|
|
2013-08-20 15:01:07 +08:00
|
|
|
void ioapic_zap_locks(void)
|
|
|
|
{
|
|
|
|
raw_spin_lock_init(&ioapic_lock);
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:57 +08:00
|
|
|
static void io_apic_print_entries(unsigned int apic, unsigned int nr_entries)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
char buf[256];
|
|
|
|
struct IO_APIC_route_entry entry;
|
|
|
|
struct IR_IO_APIC_route_entry *ir_entry = (void *)&entry;
|
|
|
|
|
|
|
|
printk(KERN_DEBUG "IOAPIC %d:\n", apic);
|
|
|
|
for (i = 0; i <= nr_entries; i++) {
|
|
|
|
entry = ioapic_read_entry(apic, i);
|
|
|
|
snprintf(buf, sizeof(buf),
|
|
|
|
" pin%02x, %s, %s, %s, V(%02X), IRR(%1d), S(%1d)",
|
2015-04-14 10:30:06 +08:00
|
|
|
i,
|
|
|
|
entry.mask == IOAPIC_MASKED ? "disabled" : "enabled ",
|
|
|
|
entry.trigger == IOAPIC_LEVEL ? "level" : "edge ",
|
|
|
|
entry.polarity == IOAPIC_POL_LOW ? "low " : "high",
|
2015-04-13 14:11:57 +08:00
|
|
|
entry.vector, entry.irr, entry.delivery_status);
|
|
|
|
if (ir_entry->format)
|
|
|
|
printk(KERN_DEBUG "%s, remapped, I(%04X), Z(%X)\n",
|
2017-08-22 23:44:47 +08:00
|
|
|
buf, (ir_entry->index2 << 15) | ir_entry->index,
|
2015-04-13 14:11:57 +08:00
|
|
|
ir_entry->zero);
|
|
|
|
else
|
|
|
|
printk(KERN_DEBUG "%s, %s, D(%02X), M(%1d)\n",
|
2015-04-14 10:30:06 +08:00
|
|
|
buf,
|
|
|
|
entry.dest_mode == IOAPIC_DEST_MODE_LOGICAL ?
|
|
|
|
"logical " : "physical",
|
2015-04-13 14:11:57 +08:00
|
|
|
entry.dest, entry.delivery_mode);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:12:00 +08:00
|
|
|
static void __init print_IO_APIC(int ioapic_idx)
|
2012-09-26 18:44:36 +08:00
|
|
|
{
|
2005-04-17 06:20:36 +08:00
|
|
|
union IO_APIC_reg_00 reg_00;
|
|
|
|
union IO_APIC_reg_01 reg_01;
|
|
|
|
union IO_APIC_reg_02 reg_02;
|
|
|
|
union IO_APIC_reg_03 reg_03;
|
|
|
|
unsigned long flags;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_00.raw = io_apic_read(ioapic_idx, 0);
|
|
|
|
reg_01.raw = io_apic_read(ioapic_idx, 1);
|
2005-04-17 06:20:36 +08:00
|
|
|
if (reg_01.bits.version >= 0x10)
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_02.raw = io_apic_read(ioapic_idx, 2);
|
2008-10-15 21:27:23 +08:00
|
|
|
if (reg_01.bits.version >= 0x20)
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_03.raw = io_apic_read(ioapic_idx, 3);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
printk(KERN_DEBUG "IO APIC #%d......\n", mpc_ioapic_id(ioapic_idx));
|
2005-04-17 06:20:36 +08:00
|
|
|
printk(KERN_DEBUG ".... register #00: %08X\n", reg_00.raw);
|
|
|
|
printk(KERN_DEBUG "....... : physical APIC id: %02X\n", reg_00.bits.ID);
|
|
|
|
printk(KERN_DEBUG "....... : Delivery Type: %X\n", reg_00.bits.delivery_type);
|
|
|
|
printk(KERN_DEBUG "....... : LTS : %X\n", reg_00.bits.LTS);
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
printk(KERN_DEBUG ".... register #01: %08X\n", *(int *)®_01);
|
2011-07-09 02:46:36 +08:00
|
|
|
printk(KERN_DEBUG "....... : max redirection entries: %02X\n",
|
|
|
|
reg_01.bits.entries);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
printk(KERN_DEBUG "....... : PRQ implemented: %X\n", reg_01.bits.PRQ);
|
2011-07-09 02:46:36 +08:00
|
|
|
printk(KERN_DEBUG "....... : IO APIC version: %02X\n",
|
|
|
|
reg_01.bits.version);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Some Intel chipsets with IO APIC VERSION of 0x1? don't have reg_02,
|
|
|
|
* but the value of reg_02 is read as the previous read register
|
|
|
|
* value, so ignore it if reg_02 == reg_01.
|
|
|
|
*/
|
|
|
|
if (reg_01.bits.version >= 0x10 && reg_02.raw != reg_01.raw) {
|
|
|
|
printk(KERN_DEBUG ".... register #02: %08X\n", reg_02.raw);
|
|
|
|
printk(KERN_DEBUG "....... : arbitration: %02X\n", reg_02.bits.arbitration);
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Some Intel chipsets with IO APIC VERSION of 0x2? don't have reg_02
|
|
|
|
* or reg_03, but the value of reg_0[23] is read as the previous read
|
|
|
|
* register value, so ignore it if reg_03 == reg_0[12].
|
|
|
|
*/
|
|
|
|
if (reg_01.bits.version >= 0x20 && reg_03.raw != reg_02.raw &&
|
|
|
|
reg_03.raw != reg_01.raw) {
|
|
|
|
printk(KERN_DEBUG ".... register #03: %08X\n", reg_03.raw);
|
|
|
|
printk(KERN_DEBUG "....... : Boot DT : %X\n", reg_03.bits.boot_DT);
|
|
|
|
}
|
|
|
|
|
|
|
|
printk(KERN_DEBUG ".... IRQ redirection table:\n");
|
2015-04-13 14:11:57 +08:00
|
|
|
io_apic_print_entries(ioapic_idx, reg_01.bits.entries);
|
2011-10-12 15:33:39 +08:00
|
|
|
}
|
|
|
|
|
2014-10-27 16:12:00 +08:00
|
|
|
void __init print_IO_APICs(void)
|
2011-10-12 15:33:39 +08:00
|
|
|
{
|
2011-10-12 15:33:48 +08:00
|
|
|
int ioapic_idx;
|
2011-10-12 15:33:39 +08:00
|
|
|
unsigned int irq;
|
|
|
|
|
|
|
|
printk(KERN_DEBUG "number of MP IRQ sources: %d.\n", mp_irq_entries);
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(ioapic_idx)
|
2011-10-12 15:33:39 +08:00
|
|
|
printk(KERN_DEBUG "number of IO-APIC #%d registers: %d.\n",
|
2011-10-12 15:33:48 +08:00
|
|
|
mpc_ioapic_id(ioapic_idx),
|
|
|
|
ioapics[ioapic_idx].nr_registers);
|
2011-10-12 15:33:39 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* We are a bit conservative about what we expect. We have to
|
|
|
|
* know about every hardware change ASAP.
|
|
|
|
*/
|
|
|
|
printk(KERN_INFO "testing the IO APIC.......................\n");
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(ioapic_idx)
|
2011-10-12 15:33:48 +08:00
|
|
|
print_IO_APIC(ioapic_idx);
|
2011-07-13 05:17:35 +08:00
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
printk(KERN_DEBUG "IRQ to pin mappings:\n");
|
2010-09-30 17:26:43 +08:00
|
|
|
for_each_active_irq(irq) {
|
2008-12-06 10:58:31 +08:00
|
|
|
struct irq_pin_list *entry;
|
2015-04-14 10:29:53 +08:00
|
|
|
struct irq_chip *chip;
|
|
|
|
struct mp_chip_data *data;
|
2008-12-06 10:58:31 +08:00
|
|
|
|
2011-11-10 21:45:24 +08:00
|
|
|
chip = irq_get_chip(irq);
|
2015-04-13 14:11:59 +08:00
|
|
|
if (chip != &ioapic_chip && chip != &ioapic_ir_chip)
|
2011-11-10 21:45:24 +08:00
|
|
|
continue;
|
2015-04-14 10:29:53 +08:00
|
|
|
data = irq_get_chip_data(irq);
|
|
|
|
if (!data)
|
2010-08-20 06:46:16 +08:00
|
|
|
continue;
|
2015-04-14 10:29:53 +08:00
|
|
|
if (list_empty(&data->irq_2_pin))
|
2005-04-17 06:20:36 +08:00
|
|
|
continue;
|
2015-04-14 10:29:53 +08:00
|
|
|
|
2008-08-20 11:50:51 +08:00
|
|
|
printk(KERN_DEBUG "IRQ%d ", irq);
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin)
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_cont("-> %d:%d", entry->apic, entry->pin);
|
|
|
|
pr_cont("\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
printk(KERN_INFO ".................................... done.\n");
|
|
|
|
}
|
|
|
|
|
2008-08-20 11:50:36 +08:00
|
|
|
/* Where if anywhere is the i8259 connect in external int mode */
|
|
|
|
static struct { int pin, apic; } ioapic_i8259 = { -1, -1 };
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
void __init enable_IO_APIC(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2005-10-31 06:59:39 +08:00
|
|
|
int i8259_apic, i8259_pin;
|
2014-06-09 16:19:42 +08:00
|
|
|
int apic, pin;
|
2009-08-30 00:09:57 +08:00
|
|
|
|
2015-01-16 05:22:32 +08:00
|
|
|
if (skip_ioapic_setup)
|
|
|
|
nr_ioapics = 0;
|
|
|
|
|
|
|
|
if (!nr_legacy_irqs() || !nr_ioapics)
|
2009-08-30 00:09:57 +08:00
|
|
|
return;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic_pin(apic, pin) {
|
2005-10-31 06:59:39 +08:00
|
|
|
/* See if any of the pins is in ExtINT mode */
|
2014-06-09 16:19:42 +08:00
|
|
|
struct IO_APIC_route_entry entry = ioapic_read_entry(apic, pin);
|
2005-10-31 06:59:39 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
/* If the interrupt line is enabled and in ExtInt mode
|
|
|
|
* I have found the pin where the i8259 is connected.
|
|
|
|
*/
|
|
|
|
if ((entry.mask == 0) && (entry.delivery_mode == dest_ExtINT)) {
|
|
|
|
ioapic_i8259.apic = apic;
|
|
|
|
ioapic_i8259.pin = pin;
|
|
|
|
goto found_i8259;
|
2005-10-31 06:59:39 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
found_i8259:
|
|
|
|
/* Look to see what if the MP table has reported the ExtINT */
|
|
|
|
/* If we could not find the appropriate pin by looking at the ioapic
|
|
|
|
* the i8259 probably is not connected the ioapic but give the
|
|
|
|
* mptable a chance anyway.
|
|
|
|
*/
|
|
|
|
i8259_pin = find_isa_irq_pin(0, mp_ExtINT);
|
|
|
|
i8259_apic = find_isa_irq_apic(0, mp_ExtINT);
|
|
|
|
/* Trust the MP table if nothing is setup in the hardware */
|
|
|
|
if ((ioapic_i8259.pin == -1) && (i8259_pin >= 0)) {
|
|
|
|
printk(KERN_WARNING "ExtINT not setup in hardware but reported by MP table\n");
|
|
|
|
ioapic_i8259.pin = i8259_pin;
|
|
|
|
ioapic_i8259.apic = i8259_apic;
|
|
|
|
}
|
|
|
|
/* Complain if the MP table and the hardware disagree */
|
|
|
|
if (((ioapic_i8259.apic != i8259_apic) || (ioapic_i8259.pin != i8259_pin)) &&
|
|
|
|
(i8259_pin >= 0) && (ioapic_i8259.pin >= 0))
|
|
|
|
{
|
|
|
|
printk(KERN_WARNING "ExtINT in hardware and MP table differ\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Do not trust the IO-APIC being empty at bootup
|
|
|
|
*/
|
|
|
|
clear_IO_APIC();
|
|
|
|
}
|
|
|
|
|
2012-09-26 18:44:35 +08:00
|
|
|
void native_disable_io_apic(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2005-06-26 05:57:44 +08:00
|
|
|
/*
|
2005-09-09 18:59:04 +08:00
|
|
|
* If the i8259 is routed through an IOAPIC
|
2005-06-26 05:57:44 +08:00
|
|
|
* Put that IOAPIC in virtual wire mode
|
2005-09-09 18:59:04 +08:00
|
|
|
* so legacy interrupts can be delivered.
|
2005-06-26 05:57:44 +08:00
|
|
|
*/
|
2012-09-26 18:44:35 +08:00
|
|
|
if (ioapic_i8259.pin != -1) {
|
2005-06-26 05:57:44 +08:00
|
|
|
struct IO_APIC_route_entry entry;
|
|
|
|
|
|
|
|
memset(&entry, 0, sizeof(entry));
|
2015-04-14 10:30:06 +08:00
|
|
|
entry.mask = IOAPIC_UNMASKED;
|
|
|
|
entry.trigger = IOAPIC_EDGE;
|
|
|
|
entry.polarity = IOAPIC_POL_HIGH;
|
|
|
|
entry.dest_mode = IOAPIC_DEST_MODE_PHYSICAL;
|
|
|
|
entry.delivery_mode = dest_ExtINT;
|
|
|
|
entry.dest = read_apic_id();
|
2005-06-26 05:57:44 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Add it to the IO-APIC irq-routing table:
|
|
|
|
*/
|
2006-09-26 16:52:30 +08:00
|
|
|
ioapic_write_entry(ioapic_i8259.apic, ioapic_i8259.pin, entry);
|
2005-06-26 05:57:44 +08:00
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2016-04-05 04:25:00 +08:00
|
|
|
if (boot_cpu_has(X86_FEATURE_APIC) || apic_from_smp_config())
|
2012-09-26 18:44:35 +08:00
|
|
|
disconnect_bsp_APIC(ioapic_i8259.pin != -1);
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Not an __init, needed by the reboot code
|
|
|
|
*/
|
|
|
|
void disable_IO_APIC(void)
|
|
|
|
{
|
2009-03-17 08:04:59 +08:00
|
|
|
/*
|
2012-09-26 18:44:35 +08:00
|
|
|
* Clear the IO-APIC before rebooting:
|
2009-03-17 08:04:59 +08:00
|
|
|
*/
|
2012-09-26 18:44:35 +08:00
|
|
|
clear_IO_APIC();
|
|
|
|
|
2014-06-09 16:19:48 +08:00
|
|
|
if (!nr_legacy_irqs())
|
2012-09-26 18:44:35 +08:00
|
|
|
return;
|
|
|
|
|
|
|
|
x86_io_apic_ops.disable();
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* function to set the IO-APIC physical IDs based on the
|
|
|
|
* values stored in the MPC table.
|
|
|
|
*
|
|
|
|
* by Matt Domsch <Matt_Domsch@dell.com> Tue Dec 21 12:25:05 CST 1999
|
|
|
|
*/
|
2010-11-27 00:50:20 +08:00
|
|
|
void __init setup_ioapic_ids_from_mpc_nocheck(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
union IO_APIC_reg_00 reg_00;
|
|
|
|
physid_mask_t phys_id_present_map;
|
2011-10-12 15:33:48 +08:00
|
|
|
int ioapic_idx;
|
2005-04-17 06:20:36 +08:00
|
|
|
int i;
|
|
|
|
unsigned char old_id;
|
|
|
|
unsigned long flags;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* This is broken; anything with a real cpu count has to
|
|
|
|
* circumvent this idiocy regardless.
|
|
|
|
*/
|
2009-11-10 06:06:59 +08:00
|
|
|
apic->ioapic_phys_id_map(&phys_cpu_present_map, &phys_id_present_map);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Set the IOAPIC ID to the value stored in the MPC table.
|
|
|
|
*/
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(ioapic_idx) {
|
2005-04-17 06:20:36 +08:00
|
|
|
/* Read the register 0 value */
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_00.raw = io_apic_read(ioapic_idx, 0);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2008-06-08 19:07:18 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
old_id = mpc_ioapic_id(ioapic_idx);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
if (mpc_ioapic_id(ioapic_idx) >= get_physical_broadcast()) {
|
2005-04-17 06:20:36 +08:00
|
|
|
printk(KERN_ERR "BIOS bug, IO-APIC#%d ID is %d in the MPC table!...\n",
|
2011-10-12 15:33:48 +08:00
|
|
|
ioapic_idx, mpc_ioapic_id(ioapic_idx));
|
2005-04-17 06:20:36 +08:00
|
|
|
printk(KERN_ERR "... fixing up to %d. (tell your hw vendor)\n",
|
|
|
|
reg_00.bits.ID);
|
2011-10-12 15:33:48 +08:00
|
|
|
ioapics[ioapic_idx].mp_config.apicid = reg_00.bits.ID;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Sanity check, is the ID really free? Every APIC in a
|
|
|
|
* system must have a unique ID or we get lots of nice
|
|
|
|
* 'stuck on smp_invalidate_needed IPI wait' messages.
|
|
|
|
*/
|
2009-11-10 06:06:59 +08:00
|
|
|
if (apic->check_apicid_used(&phys_id_present_map,
|
2011-10-12 15:33:48 +08:00
|
|
|
mpc_ioapic_id(ioapic_idx))) {
|
2005-04-17 06:20:36 +08:00
|
|
|
printk(KERN_ERR "BIOS bug, IO-APIC#%d ID %d is already used!...\n",
|
2011-10-12 15:33:48 +08:00
|
|
|
ioapic_idx, mpc_ioapic_id(ioapic_idx));
|
2005-04-17 06:20:36 +08:00
|
|
|
for (i = 0; i < get_physical_broadcast(); i++)
|
|
|
|
if (!physid_isset(i, phys_id_present_map))
|
|
|
|
break;
|
|
|
|
if (i >= get_physical_broadcast())
|
|
|
|
panic("Max APIC ID exceeded!\n");
|
|
|
|
printk(KERN_ERR "... fixing up to %d. (tell your hw vendor)\n",
|
|
|
|
i);
|
|
|
|
physid_set(i, phys_id_present_map);
|
2011-10-12 15:33:48 +08:00
|
|
|
ioapics[ioapic_idx].mp_config.apicid = i;
|
2005-04-17 06:20:36 +08:00
|
|
|
} else {
|
|
|
|
physid_mask_t tmp;
|
2011-10-12 15:33:48 +08:00
|
|
|
apic->apicid_to_cpu_present(mpc_ioapic_id(ioapic_idx),
|
2011-05-19 07:31:37 +08:00
|
|
|
&tmp);
|
2005-04-17 06:20:36 +08:00
|
|
|
apic_printk(APIC_VERBOSE, "Setting %d in the "
|
|
|
|
"phys_id_present_map\n",
|
2011-10-12 15:33:48 +08:00
|
|
|
mpc_ioapic_id(ioapic_idx));
|
2005-04-17 06:20:36 +08:00
|
|
|
physids_or(phys_id_present_map, phys_id_present_map, tmp);
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* We need to adjust the IRQ routing table
|
|
|
|
* if the ID changed.
|
|
|
|
*/
|
2011-10-12 15:33:48 +08:00
|
|
|
if (old_id != mpc_ioapic_id(ioapic_idx))
|
2005-04-17 06:20:36 +08:00
|
|
|
for (i = 0; i < mp_irq_entries; i++)
|
2009-01-12 20:17:22 +08:00
|
|
|
if (mp_irqs[i].dstapic == old_id)
|
|
|
|
mp_irqs[i].dstapic
|
2011-10-12 15:33:48 +08:00
|
|
|
= mpc_ioapic_id(ioapic_idx);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
2010-12-07 16:59:49 +08:00
|
|
|
* Update the ID register according to the right value
|
|
|
|
* from the MPC table if they are different.
|
2008-06-08 19:07:18 +08:00
|
|
|
*/
|
2011-10-12 15:33:48 +08:00
|
|
|
if (mpc_ioapic_id(ioapic_idx) == reg_00.bits.ID)
|
2010-12-07 16:59:49 +08:00
|
|
|
continue;
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
apic_printk(APIC_VERBOSE, KERN_INFO
|
|
|
|
"...changing IO-APIC physical APIC ID to %d ...",
|
2011-10-12 15:33:48 +08:00
|
|
|
mpc_ioapic_id(ioapic_idx));
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_00.bits.ID = mpc_ioapic_id(ioapic_idx);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
io_apic_write(ioapic_idx, 0, reg_00.raw);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Sanity check
|
|
|
|
*/
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_00.raw = io_apic_read(ioapic_idx, 0);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
if (reg_00.bits.ID != mpc_ioapic_id(ioapic_idx))
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_cont("could not set ID!\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
else
|
|
|
|
apic_printk(APIC_VERBOSE, " ok.\n");
|
|
|
|
}
|
|
|
|
}
|
2010-11-27 00:50:20 +08:00
|
|
|
|
|
|
|
void __init setup_ioapic_ids_from_mpc(void)
|
|
|
|
{
|
|
|
|
|
|
|
|
if (acpi_ioapic)
|
|
|
|
return;
|
|
|
|
/*
|
|
|
|
* Don't check I/O APIC IDs for xAPIC systems. They have
|
|
|
|
* no meaning without the serial APIC bus.
|
|
|
|
*/
|
|
|
|
if (!(boot_cpu_data.x86_vendor == X86_VENDOR_INTEL)
|
2016-09-14 02:12:32 +08:00
|
|
|
|| APIC_XAPIC(boot_cpu_apic_version))
|
2010-11-27 00:50:20 +08:00
|
|
|
return;
|
|
|
|
setup_ioapic_ids_from_mpc_nocheck();
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
#endif
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2007-02-13 20:26:21 +08:00
|
|
|
int no_timer_check __initdata;
|
2006-12-07 09:14:09 +08:00
|
|
|
|
|
|
|
static int __init notimercheck(char *s)
|
|
|
|
{
|
|
|
|
no_timer_check = 1;
|
|
|
|
return 1;
|
|
|
|
}
|
|
|
|
__setup("no_timer_check", notimercheck);
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* There is a nasty bug in some older SMP boards, their mptable lies
|
|
|
|
* about the timer IRQ. We do the following to work around the situation:
|
|
|
|
*
|
|
|
|
* - timer IRQ defaults to IO-APIC IRQ
|
|
|
|
* - if this function detects that timer IRQs are defunct, then we fall
|
|
|
|
* back to ISA timer IRQs
|
|
|
|
*/
|
2007-07-21 23:10:29 +08:00
|
|
|
static int __init timer_irq_works(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
unsigned long t1 = jiffies;
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
unsigned long flags;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2006-12-07 09:14:09 +08:00
|
|
|
if (no_timer_check)
|
|
|
|
return 1;
|
|
|
|
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
local_save_flags(flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
local_irq_enable();
|
|
|
|
/* Let ten ticks pass... */
|
|
|
|
mdelay((10 * 1000) / HZ);
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
local_irq_restore(flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Expect a few ticks at least, to be sure some possible
|
|
|
|
* glue logic does not lock up after one or two first
|
|
|
|
* ticks in a non-ExtINT mode. Also the local APIC
|
|
|
|
* might have cached one ExtINT interrupt. Finally, at
|
|
|
|
* least one tick may be lost due to delays.
|
|
|
|
*/
|
2008-08-20 15:07:45 +08:00
|
|
|
|
|
|
|
/* jiffies wrap? */
|
2008-01-30 20:32:19 +08:00
|
|
|
if (time_after(jiffies, t1 + 4))
|
2005-04-17 06:20:36 +08:00
|
|
|
return 1;
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* In the SMP+IOAPIC case it might happen that there are an unspecified
|
|
|
|
* number of pending IRQ events unhandled. These cases are very rare,
|
|
|
|
* so we 'resend' these IRQs via IPIs, to the same CPU. It's much
|
|
|
|
* better to do it this way as thus we do not have to be aware of
|
|
|
|
* 'pending' interrupts in the IRQ path, except at this point.
|
|
|
|
*/
|
|
|
|
/*
|
|
|
|
* Edge triggered needs to resend any interrupt
|
|
|
|
* that was delayed but this is now handled in the device
|
|
|
|
* independent code.
|
|
|
|
*/
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Starting up a edge-triggered IO-APIC interrupt is
|
|
|
|
* nasty - we need to make sure that we get the edge.
|
|
|
|
* If it is already asserted for some reason, we need
|
|
|
|
* return 1 to indicate that is was pending.
|
|
|
|
*
|
|
|
|
* This is not complete - we should be able to fake
|
|
|
|
* an edge even if it isn't on the 8259A...
|
|
|
|
*/
|
2010-09-28 22:00:34 +08:00
|
|
|
static unsigned int startup_ioapic_irq(struct irq_data *data)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2010-09-28 22:00:34 +08:00
|
|
|
int was_pending = 0, irq = data->irq;
|
2005-04-17 06:20:36 +08:00
|
|
|
unsigned long flags;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2014-06-09 16:19:48 +08:00
|
|
|
if (irq < nr_legacy_irqs()) {
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->mask(irq);
|
2009-11-10 03:27:04 +08:00
|
|
|
if (legacy_pic->irq_pending(irq))
|
2005-04-17 06:20:36 +08:00
|
|
|
was_pending = 1;
|
|
|
|
}
|
2015-04-14 10:29:53 +08:00
|
|
|
__unmask_ioapic(data->chip_data);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
return was_pending;
|
|
|
|
}
|
|
|
|
|
2008-08-20 11:50:48 +08:00
|
|
|
atomic_t irq_mis_count;
|
|
|
|
|
2008-08-20 11:50:41 +08:00
|
|
|
#ifdef CONFIG_GENERIC_PENDING_IRQ
|
2015-04-14 10:29:53 +08:00
|
|
|
static bool io_apic_level_ack_pending(struct mp_chip_data *data)
|
2012-05-08 15:24:20 +08:00
|
|
|
{
|
|
|
|
struct irq_pin_list *entry;
|
|
|
|
unsigned long flags;
|
|
|
|
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin) {
|
2012-05-08 15:24:20 +08:00
|
|
|
unsigned int reg;
|
|
|
|
int pin;
|
|
|
|
|
|
|
|
pin = entry->pin;
|
|
|
|
reg = io_apic_read(entry->apic, 0x10 + pin*2);
|
|
|
|
/* Is the remote IRR bit set? */
|
|
|
|
if (reg & IO_APIC_REDIR_REMOTE_IRR) {
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
return true;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
|
|
|
|
return false;
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static inline bool ioapic_irqd_mask(struct irq_data *data)
|
2012-03-20 22:19:36 +08:00
|
|
|
{
|
2008-08-20 15:07:45 +08:00
|
|
|
/* If we are moving the irq we need to mask it */
|
2011-02-05 22:35:51 +08:00
|
|
|
if (unlikely(irqd_is_setaffinity_pending(data))) {
|
2015-04-14 10:29:53 +08:00
|
|
|
mask_ioapic_irq(data);
|
2012-03-20 22:19:36 +08:00
|
|
|
return true;
|
2008-08-20 15:07:45 +08:00
|
|
|
}
|
2012-03-20 22:19:36 +08:00
|
|
|
return false;
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static inline void ioapic_irqd_unmask(struct irq_data *data, bool masked)
|
2012-03-20 22:19:36 +08:00
|
|
|
{
|
|
|
|
if (unlikely(masked)) {
|
|
|
|
/* Only migrate the irq if the ack has been received.
|
|
|
|
*
|
|
|
|
* On rare occasions the broadcast level triggered ack gets
|
|
|
|
* delayed going to ioapics, and if we reprogram the
|
|
|
|
* vector while Remote IRR is still set the irq will never
|
|
|
|
* fire again.
|
|
|
|
*
|
|
|
|
* To prevent this scenario we read the Remote IRR bit
|
|
|
|
* of the ioapic. This has two effects.
|
|
|
|
* - On any sane system the read of the ioapic will
|
|
|
|
* flush writes (and acks) going to the ioapic from
|
|
|
|
* this cpu.
|
|
|
|
* - We get to see if the ACK has actually been delivered.
|
|
|
|
*
|
|
|
|
* Based on failed experiments of reprogramming the
|
|
|
|
* ioapic entry from outside of irq context starting
|
|
|
|
* with masking the ioapic entry and then polling until
|
|
|
|
* Remote IRR was clear before reprogramming the
|
|
|
|
* ioapic I don't trust the Remote IRR bit to be
|
|
|
|
* completey accurate.
|
|
|
|
*
|
|
|
|
* However there appears to be no other way to plug
|
|
|
|
* this race, so if the Remote IRR bit is not
|
|
|
|
* accurate and is causing problems then it is a hardware bug
|
|
|
|
* and you can go talk to the chipset vendor about it.
|
|
|
|
*/
|
2015-04-14 10:29:53 +08:00
|
|
|
if (!io_apic_level_ack_pending(data->chip_data))
|
2012-03-20 22:19:36 +08:00
|
|
|
irq_move_masked_irq(data);
|
2015-04-14 10:29:53 +08:00
|
|
|
unmask_ioapic_irq(data);
|
2012-03-20 22:19:36 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
#else
|
2015-04-14 10:29:53 +08:00
|
|
|
static inline bool ioapic_irqd_mask(struct irq_data *data)
|
2012-03-20 22:19:36 +08:00
|
|
|
{
|
|
|
|
return false;
|
|
|
|
}
|
2015-04-14 10:29:53 +08:00
|
|
|
static inline void ioapic_irqd_unmask(struct irq_data *data, bool masked)
|
2012-03-20 22:19:36 +08:00
|
|
|
{
|
|
|
|
}
|
2008-08-20 11:50:41 +08:00
|
|
|
#endif
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
static void ioapic_ack_level(struct irq_data *irq_data)
|
2012-03-20 22:19:36 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
struct irq_cfg *cfg = irqd_cfg(irq_data);
|
2012-03-20 22:19:36 +08:00
|
|
|
unsigned long v;
|
|
|
|
bool masked;
|
2015-04-13 14:11:59 +08:00
|
|
|
int i;
|
2012-03-20 22:19:36 +08:00
|
|
|
|
|
|
|
irq_complete_move(cfg);
|
2015-04-14 10:29:53 +08:00
|
|
|
masked = ioapic_irqd_mask(irq_data);
|
2012-03-20 22:19:36 +08:00
|
|
|
|
2008-08-20 11:50:48 +08:00
|
|
|
/*
|
2009-06-08 18:00:22 +08:00
|
|
|
* It appears there is an erratum which affects at least version 0x11
|
|
|
|
* of I/O APIC (that's the 82093AA and cores integrated into various
|
|
|
|
* chipsets). Under certain conditions a level-triggered interrupt is
|
|
|
|
* erroneously delivered as edge-triggered one but the respective IRR
|
|
|
|
* bit gets set nevertheless. As a result the I/O unit expects an EOI
|
|
|
|
* message but it will never arrive and further interrupts are blocked
|
|
|
|
* from the source. The exact reason is so far unknown, but the
|
|
|
|
* phenomenon was observed when two consecutive interrupt requests
|
|
|
|
* from a given source get delivered to the same CPU and the source is
|
|
|
|
* temporarily disabled in between.
|
|
|
|
*
|
|
|
|
* A workaround is to simulate an EOI message manually. We achieve it
|
|
|
|
* by setting the trigger mode to edge and then to level when the edge
|
|
|
|
* trigger mode gets detected in the TMR of a local APIC for a
|
|
|
|
* level-triggered interrupt. We mask the source for the time of the
|
|
|
|
* operation to prevent an edge-triggered interrupt escaping meanwhile.
|
|
|
|
* The idea is from Manfred Spraul. --macro
|
2009-12-02 07:31:17 +08:00
|
|
|
*
|
|
|
|
* Also in the case when cpu goes offline, fixup_irqs() will forward
|
|
|
|
* any unhandled interrupt on the offlined cpu to the new cpu
|
|
|
|
* destination that is handling the corresponding interrupt. This
|
|
|
|
* interrupt forwarding is done via IPI's. Hence, in this case also
|
|
|
|
* level-triggered io-apic interrupt will be seen as an edge
|
|
|
|
* interrupt in the IRR. And we can't rely on the cpu's EOI
|
|
|
|
* to be broadcasted to the IO-APIC's which will clear the remoteIRR
|
|
|
|
* corresponding to the level-triggered interrupt. Hence on IO-APIC's
|
|
|
|
* supporting EOI register, we do an explicit EOI to clear the
|
|
|
|
* remote IRR and on IO-APIC's which don't have an EOI register,
|
|
|
|
* we use the above logic (mask+edge followed by unmask+level) from
|
|
|
|
* Manfred Spraul to clear the remote IRR.
|
2009-06-08 18:00:22 +08:00
|
|
|
*/
|
2008-12-06 10:58:34 +08:00
|
|
|
i = cfg->vector;
|
2008-08-20 11:50:48 +08:00
|
|
|
v = apic_read(APIC_TMR + ((i & ~0x1f) >> 1));
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
/*
|
|
|
|
* We must acknowledge the irq before we move it or the acknowledge will
|
|
|
|
* not propagate properly.
|
|
|
|
*/
|
|
|
|
ack_APIC_irq();
|
|
|
|
|
2009-12-02 07:31:17 +08:00
|
|
|
/*
|
|
|
|
* Tail end of clearing remote IRR bit (either by delivering the EOI
|
|
|
|
* message via io-apic EOI register write or simulating it using
|
|
|
|
* mask+edge followed by unnask+level logic) manually when the
|
|
|
|
* level triggered interrupt is seen as the edge triggered interrupt
|
|
|
|
* at the cpu.
|
|
|
|
*/
|
2009-12-02 07:31:15 +08:00
|
|
|
if (!(v & (1 << (i & 0x1f)))) {
|
|
|
|
atomic_inc(&irq_mis_count);
|
2015-04-14 10:29:53 +08:00
|
|
|
eoi_ioapic_pin(cfg->vector, irq_data->chip_data);
|
2009-12-02 07:31:15 +08:00
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
ioapic_irqd_unmask(irq_data, masked);
|
2008-08-20 11:50:48 +08:00
|
|
|
}
|
2008-08-20 11:50:34 +08:00
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
static void ioapic_ir_ack_level(struct irq_data *irq_data)
|
|
|
|
{
|
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Intr-remapping uses pin number as the virtual vector
|
|
|
|
* in the RTE. Actual vector is programmed in
|
|
|
|
* intr-remapping table entry. Hence for the io-apic
|
|
|
|
* EOI we use the pin number.
|
|
|
|
*/
|
|
|
|
ack_APIC_irq();
|
2015-04-14 10:29:53 +08:00
|
|
|
eoi_ioapic_pin(data->entry.vector, data);
|
2015-04-13 14:11:59 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
static int ioapic_set_affinity(struct irq_data *irq_data,
|
|
|
|
const struct cpumask *mask, bool force)
|
|
|
|
{
|
|
|
|
struct irq_data *parent = irq_data->parent_data;
|
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
2015-04-14 10:29:57 +08:00
|
|
|
struct irq_pin_list *entry;
|
2015-04-13 14:11:59 +08:00
|
|
|
struct irq_cfg *cfg;
|
|
|
|
unsigned long flags;
|
|
|
|
int ret;
|
|
|
|
|
|
|
|
ret = parent->chip->irq_set_affinity(parent, mask, force);
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
|
|
|
if (ret >= 0 && ret != IRQ_SET_MASK_OK_DONE) {
|
|
|
|
cfg = irqd_cfg(irq_data);
|
|
|
|
data->entry.dest = cfg->dest_apicid;
|
|
|
|
data->entry.vector = cfg->vector;
|
2015-04-14 10:29:57 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin)
|
|
|
|
__ioapic_write_entry(entry->apic, entry->pin,
|
|
|
|
data->entry);
|
2015-04-13 14:11:59 +08:00
|
|
|
}
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2006-10-04 17:16:26 +08:00
|
|
|
static struct irq_chip ioapic_chip __read_mostly = {
|
2010-10-09 03:40:23 +08:00
|
|
|
.name = "IO-APIC",
|
|
|
|
.irq_startup = startup_ioapic_irq,
|
|
|
|
.irq_mask = mask_ioapic_irq,
|
|
|
|
.irq_unmask = unmask_ioapic_irq,
|
2015-04-13 14:11:59 +08:00
|
|
|
.irq_ack = irq_chip_ack_parent,
|
|
|
|
.irq_eoi = ioapic_ack_level,
|
|
|
|
.irq_set_affinity = ioapic_set_affinity,
|
2017-01-17 22:13:52 +08:00
|
|
|
.irq_retrigger = irq_chip_retrigger_hierarchy,
|
2015-04-13 14:11:59 +08:00
|
|
|
.flags = IRQCHIP_SKIP_SET_WAKE,
|
|
|
|
};
|
|
|
|
|
|
|
|
static struct irq_chip ioapic_ir_chip __read_mostly = {
|
|
|
|
.name = "IR-IO-APIC",
|
|
|
|
.irq_startup = startup_ioapic_irq,
|
|
|
|
.irq_mask = mask_ioapic_irq,
|
|
|
|
.irq_unmask = unmask_ioapic_irq,
|
|
|
|
.irq_ack = irq_chip_ack_parent,
|
|
|
|
.irq_eoi = ioapic_ir_ack_level,
|
|
|
|
.irq_set_affinity = ioapic_set_affinity,
|
2017-01-17 22:13:52 +08:00
|
|
|
.irq_retrigger = irq_chip_retrigger_hierarchy,
|
2014-09-01 19:49:07 +08:00
|
|
|
.flags = IRQCHIP_SKIP_SET_WAKE,
|
2005-04-17 06:20:36 +08:00
|
|
|
};
|
|
|
|
|
|
|
|
static inline void init_IO_APIC_traps(void)
|
|
|
|
{
|
2008-08-20 11:50:25 +08:00
|
|
|
struct irq_cfg *cfg;
|
2010-09-30 17:26:43 +08:00
|
|
|
unsigned int irq;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2010-09-30 17:26:43 +08:00
|
|
|
for_each_active_irq(irq) {
|
2014-06-09 16:19:43 +08:00
|
|
|
cfg = irq_cfg(irq);
|
2008-12-06 10:58:31 +08:00
|
|
|
if (IO_APIC_IRQ(irq) && cfg && !cfg->vector) {
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* Hmm.. We don't have an entry for this,
|
|
|
|
* so default to an old-fashioned 8259
|
|
|
|
* interrupt if we can..
|
|
|
|
*/
|
2014-06-09 16:19:48 +08:00
|
|
|
if (irq < nr_legacy_irqs())
|
2009-11-10 03:27:04 +08:00
|
|
|
legacy_pic->make_irq(irq);
|
2008-12-06 10:58:31 +08:00
|
|
|
else
|
2005-04-17 06:20:36 +08:00
|
|
|
/* Strange. Oh, well.. */
|
2011-03-12 19:20:43 +08:00
|
|
|
irq_set_chip(irq, &no_irq_chip);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2006-10-04 17:16:26 +08:00
|
|
|
/*
|
|
|
|
* The local APIC irq-chip implementation:
|
|
|
|
*/
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2010-09-28 22:03:54 +08:00
|
|
|
static void mask_lapic_irq(struct irq_data *data)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
unsigned long v;
|
|
|
|
|
|
|
|
v = apic_read(APIC_LVT0);
|
x86: APIC: remove apic_write_around(); use alternatives
Use alternatives to select the workaround for the 11AP Pentium erratum
for the affected steppings on the fly rather than build time. Remove the
X86_GOOD_APIC configuration option and replace all the calls to
apic_write_around() with plain apic_write(), protecting accesses to the
ESR as appropriate due to the 3AP Pentium erratum. Remove
apic_read_around() and all its invocations altogether as not needed.
Remove apic_write_atomic() and all its implementing backends. The use of
ASM_OUTPUT2() is not strictly needed for input constraints, but I have
used it for readability's sake.
I had the feeling no one else was brave enough to do it, so I went ahead
and here it is. Verified by checking the generated assembly and tested
with both a 32-bit and a 64-bit configuration, also with the 11AP
"feature" forced on and verified with gdb on /proc/kcore to work as
expected (as an 11AP machines are quite hard to get hands on these days).
Some script complained about the use of "volatile", but apic_write() needs
it for the same reason and is effectively a replacement for writel(), so I
have disregarded it.
I am not sure what the policy wrt defconfig files is, they are generated
and there is risk of a conflict resulting from an unrelated change, so I
have left changes to them out. The option will get removed from them at
the next run.
Some testing with machines other than mine will be needed to avoid some
stupid mistake, but despite its volume, the change is not really that
intrusive, so I am fairly confident that because it works for me, it will
everywhere.
Signed-off-by: Maciej W. Rozycki <macro@linux-mips.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2008-07-17 02:15:30 +08:00
|
|
|
apic_write(APIC_LVT0, v | APIC_LVT_MASKED);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2010-09-28 22:03:54 +08:00
|
|
|
static void unmask_lapic_irq(struct irq_data *data)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2006-10-04 17:16:26 +08:00
|
|
|
unsigned long v;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2006-10-04 17:16:26 +08:00
|
|
|
v = apic_read(APIC_LVT0);
|
x86: APIC: remove apic_write_around(); use alternatives
Use alternatives to select the workaround for the 11AP Pentium erratum
for the affected steppings on the fly rather than build time. Remove the
X86_GOOD_APIC configuration option and replace all the calls to
apic_write_around() with plain apic_write(), protecting accesses to the
ESR as appropriate due to the 3AP Pentium erratum. Remove
apic_read_around() and all its invocations altogether as not needed.
Remove apic_write_atomic() and all its implementing backends. The use of
ASM_OUTPUT2() is not strictly needed for input constraints, but I have
used it for readability's sake.
I had the feeling no one else was brave enough to do it, so I went ahead
and here it is. Verified by checking the generated assembly and tested
with both a 32-bit and a 64-bit configuration, also with the 11AP
"feature" forced on and verified with gdb on /proc/kcore to work as
expected (as an 11AP machines are quite hard to get hands on these days).
Some script complained about the use of "volatile", but apic_write() needs
it for the same reason and is effectively a replacement for writel(), so I
have disregarded it.
I am not sure what the policy wrt defconfig files is, they are generated
and there is risk of a conflict resulting from an unrelated change, so I
have left changes to them out. The option will get removed from them at
the next run.
Some testing with machines other than mine will be needed to avoid some
stupid mistake, but despite its volume, the change is not really that
intrusive, so I am fairly confident that because it works for me, it will
everywhere.
Signed-off-by: Maciej W. Rozycki <macro@linux-mips.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2008-07-17 02:15:30 +08:00
|
|
|
apic_write(APIC_LVT0, v & ~APIC_LVT_MASKED);
|
2006-10-04 17:16:26 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2010-09-28 22:03:54 +08:00
|
|
|
static void ack_lapic_irq(struct irq_data *data)
|
2008-08-20 11:50:34 +08:00
|
|
|
{
|
|
|
|
ack_APIC_irq();
|
|
|
|
}
|
|
|
|
|
2006-10-04 17:16:26 +08:00
|
|
|
static struct irq_chip lapic_chip __read_mostly = {
|
2008-05-28 04:19:09 +08:00
|
|
|
.name = "local-APIC",
|
2010-09-28 22:03:54 +08:00
|
|
|
.irq_mask = mask_lapic_irq,
|
|
|
|
.irq_unmask = unmask_lapic_irq,
|
|
|
|
.irq_ack = ack_lapic_irq,
|
2005-04-17 06:20:36 +08:00
|
|
|
};
|
|
|
|
|
2010-09-28 23:28:38 +08:00
|
|
|
static void lapic_register_intr(int irq)
|
2008-07-12 02:35:17 +08:00
|
|
|
{
|
2010-09-28 23:28:38 +08:00
|
|
|
irq_clear_status_flags(irq, IRQ_LEVEL);
|
2011-03-12 19:20:43 +08:00
|
|
|
irq_set_chip_and_handler_name(irq, &lapic_chip, handle_edge_irq,
|
2008-07-12 02:35:17 +08:00
|
|
|
"edge");
|
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* This looks a bit hackish but it's about the only one way of sending
|
|
|
|
* a few INTA cycles to 8259As and any associated glue logic. ICR does
|
|
|
|
* not support the ExtINT mode, unfortunately. We need to send these
|
|
|
|
* cycles as some i82489DX-based boards have glue logic that keeps the
|
|
|
|
* 8259A interrupt line asserted until INTA. --macro
|
|
|
|
*/
|
2008-04-12 23:41:12 +08:00
|
|
|
static inline void __init unlock_ExtINT_logic(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2005-10-31 06:59:39 +08:00
|
|
|
int apic, pin, i;
|
2005-04-17 06:20:36 +08:00
|
|
|
struct IO_APIC_route_entry entry0, entry1;
|
|
|
|
unsigned char save_control, save_freq_select;
|
|
|
|
|
2005-10-31 06:59:39 +08:00
|
|
|
pin = find_isa_irq_pin(8, mp_INT);
|
2006-12-07 09:14:11 +08:00
|
|
|
if (pin == -1) {
|
|
|
|
WARN_ON_ONCE(1);
|
|
|
|
return;
|
|
|
|
}
|
2005-10-31 06:59:39 +08:00
|
|
|
apic = find_isa_irq_apic(8, mp_INT);
|
2006-12-07 09:14:11 +08:00
|
|
|
if (apic == -1) {
|
|
|
|
WARN_ON_ONCE(1);
|
2005-04-17 06:20:36 +08:00
|
|
|
return;
|
2006-12-07 09:14:11 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
entry0 = ioapic_read_entry(apic, pin);
|
2005-10-31 06:59:39 +08:00
|
|
|
clear_IO_APIC_pin(apic, pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
memset(&entry1, 0, sizeof(entry1));
|
|
|
|
|
2015-04-14 10:30:06 +08:00
|
|
|
entry1.dest_mode = IOAPIC_DEST_MODE_PHYSICAL;
|
|
|
|
entry1.mask = IOAPIC_UNMASKED;
|
2008-08-20 11:50:33 +08:00
|
|
|
entry1.dest = hard_smp_processor_id();
|
2005-04-17 06:20:36 +08:00
|
|
|
entry1.delivery_mode = dest_ExtINT;
|
|
|
|
entry1.polarity = entry0.polarity;
|
2015-04-14 10:30:06 +08:00
|
|
|
entry1.trigger = IOAPIC_EDGE;
|
2005-04-17 06:20:36 +08:00
|
|
|
entry1.vector = 0;
|
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
ioapic_write_entry(apic, pin, entry1);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
save_control = CMOS_READ(RTC_CONTROL);
|
|
|
|
save_freq_select = CMOS_READ(RTC_FREQ_SELECT);
|
|
|
|
CMOS_WRITE((save_freq_select & ~RTC_RATE_SELECT) | 0x6,
|
|
|
|
RTC_FREQ_SELECT);
|
|
|
|
CMOS_WRITE(save_control | RTC_PIE, RTC_CONTROL);
|
|
|
|
|
|
|
|
i = 100;
|
|
|
|
while (i-- > 0) {
|
|
|
|
mdelay(10);
|
|
|
|
if ((CMOS_READ(RTC_INTR_FLAGS) & RTC_PF) == RTC_PF)
|
|
|
|
i -= 10;
|
|
|
|
}
|
|
|
|
|
|
|
|
CMOS_WRITE(save_control, RTC_CONTROL);
|
|
|
|
CMOS_WRITE(save_freq_select, RTC_FREQ_SELECT);
|
2005-10-31 06:59:39 +08:00
|
|
|
clear_IO_APIC_pin(apic, pin);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2006-09-26 16:52:30 +08:00
|
|
|
ioapic_write_entry(apic, pin, entry0);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2008-08-20 11:50:36 +08:00
|
|
|
static int disable_timer_pin_1 __initdata;
|
2008-08-20 11:50:41 +08:00
|
|
|
/* Actually the next is obsolete, but keep it for paranoid reasons -AK */
|
2008-08-20 15:07:45 +08:00
|
|
|
static int __init disable_timer_pin_setup(char *arg)
|
2008-08-20 11:50:36 +08:00
|
|
|
{
|
|
|
|
disable_timer_pin_1 = 1;
|
|
|
|
return 0;
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
early_param("disable_timer_pin_1", disable_timer_pin_setup);
|
2008-08-20 11:50:36 +08:00
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
static int mp_alloc_timer_irq(int ioapic, int pin)
|
|
|
|
{
|
|
|
|
int irq = -1;
|
|
|
|
struct irq_domain *domain = mp_ioapic_irqdomain(ioapic);
|
|
|
|
|
|
|
|
if (domain) {
|
2015-04-14 10:29:53 +08:00
|
|
|
struct irq_alloc_info info;
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
ioapic_set_alloc_attr(&info, NUMA_NO_NODE, 0, 0);
|
|
|
|
info.ioapic_id = mpc_ioapic_id(ioapic);
|
|
|
|
info.ioapic_pin = pin;
|
|
|
|
mutex_lock(&ioapic_mutex);
|
|
|
|
irq = alloc_isa_irq_from_domain(domain, 0, ioapic, pin, &info);
|
|
|
|
mutex_unlock(&ioapic_mutex);
|
|
|
|
}
|
|
|
|
|
|
|
|
return irq;
|
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* This code may look a bit paranoid, but it's supposed to cooperate with
|
|
|
|
* a wide range of boards and BIOS bugs. Fortunately only the timer IRQ
|
|
|
|
* is so screwy. Thanks to Brian Perkins for testing/hacking this beast
|
|
|
|
* fanatically on his truly buggy board.
|
2008-08-20 15:07:45 +08:00
|
|
|
*
|
|
|
|
* FIXME: really need to revamp this for all platforms.
|
2005-04-17 06:20:36 +08:00
|
|
|
*/
|
2006-12-07 09:14:09 +08:00
|
|
|
static inline void __init check_timer(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2015-04-14 10:29:53 +08:00
|
|
|
struct irq_data *irq_data = irq_get_irq_data(0);
|
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
|
|
|
struct irq_cfg *cfg = irqd_cfg(irq_data);
|
2010-07-22 01:03:58 +08:00
|
|
|
int node = cpu_to_node(0);
|
2005-10-31 06:59:39 +08:00
|
|
|
int apic1, pin1, apic2, pin2;
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
unsigned long flags;
|
2008-08-20 11:50:41 +08:00
|
|
|
int no_pin1 = 0;
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
|
|
|
|
local_irq_save(flags);
|
2007-11-27 03:42:19 +08:00
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* get/set the timer IRQ vector:
|
|
|
|
*/
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->mask(0);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/*
|
2008-05-22 05:09:11 +08:00
|
|
|
* As IRQ0 is to be enabled in the 8259A, the virtual
|
|
|
|
* wire has to be disabled in the local APIC. Also
|
|
|
|
* timer interrupts need to be acknowledged manually in
|
|
|
|
* the 8259A for the i82489DX when using the NMI
|
|
|
|
* watchdog as that APIC treats NMIs as level-triggered.
|
|
|
|
* The AEOI mode will finish them in the 8259A
|
|
|
|
* automatically.
|
2005-04-17 06:20:36 +08:00
|
|
|
*/
|
x86: APIC: remove apic_write_around(); use alternatives
Use alternatives to select the workaround for the 11AP Pentium erratum
for the affected steppings on the fly rather than build time. Remove the
X86_GOOD_APIC configuration option and replace all the calls to
apic_write_around() with plain apic_write(), protecting accesses to the
ESR as appropriate due to the 3AP Pentium erratum. Remove
apic_read_around() and all its invocations altogether as not needed.
Remove apic_write_atomic() and all its implementing backends. The use of
ASM_OUTPUT2() is not strictly needed for input constraints, but I have
used it for readability's sake.
I had the feeling no one else was brave enough to do it, so I went ahead
and here it is. Verified by checking the generated assembly and tested
with both a 32-bit and a 64-bit configuration, also with the 11AP
"feature" forced on and verified with gdb on /proc/kcore to work as
expected (as an 11AP machines are quite hard to get hands on these days).
Some script complained about the use of "volatile", but apic_write() needs
it for the same reason and is effectively a replacement for writel(), so I
have disregarded it.
I am not sure what the policy wrt defconfig files is, they are generated
and there is risk of a conflict resulting from an unrelated change, so I
have left changes to them out. The option will get removed from them at
the next run.
Some testing with machines other than mine will be needed to avoid some
stupid mistake, but despite its volume, the change is not really that
intrusive, so I am fairly confident that because it works for me, it will
everywhere.
Signed-off-by: Maciej W. Rozycki <macro@linux-mips.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2008-07-17 02:15:30 +08:00
|
|
|
apic_write(APIC_LVT0, APIC_LVT_MASKED | APIC_DM_EXTINT);
|
2009-11-10 03:27:04 +08:00
|
|
|
legacy_pic->init(1);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2005-10-31 06:59:39 +08:00
|
|
|
pin1 = find_isa_irq_pin(0, mp_INT);
|
|
|
|
apic1 = find_isa_irq_apic(0, mp_INT);
|
|
|
|
pin2 = ioapic_i8259.pin;
|
|
|
|
apic2 = ioapic_i8259.apic;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "..TIMER: vector=0x%02X "
|
|
|
|
"apic1=%d pin1=%d apic2=%d pin2=%d\n",
|
2008-08-20 11:50:28 +08:00
|
|
|
cfg->vector, apic1, pin1, apic2, pin2);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-05-28 04:19:51 +08:00
|
|
|
/*
|
|
|
|
* Some BIOS writers are clueless and report the ExtINTA
|
|
|
|
* I/O APIC input from the cascaded 8259A as the timer
|
|
|
|
* interrupt input. So just in case, if only one pin
|
|
|
|
* was found above, try it both directly and through the
|
|
|
|
* 8259A.
|
|
|
|
*/
|
|
|
|
if (pin1 == -1) {
|
2012-09-26 18:44:41 +08:00
|
|
|
panic_if_irq_remap("BIOS bug: timer not connected to IO-APIC");
|
2008-05-28 04:19:51 +08:00
|
|
|
pin1 = pin2;
|
|
|
|
apic1 = apic2;
|
|
|
|
no_pin1 = 1;
|
|
|
|
} else if (pin2 == -1) {
|
|
|
|
pin2 = pin1;
|
|
|
|
apic2 = apic1;
|
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
if (pin1 != -1) {
|
2015-04-13 14:11:59 +08:00
|
|
|
/* Ok, does IRQ0 through the IOAPIC work? */
|
2008-05-28 04:19:51 +08:00
|
|
|
if (no_pin1) {
|
2015-04-13 14:11:59 +08:00
|
|
|
mp_alloc_timer_irq(apic1, pin1);
|
2009-02-09 08:18:03 +08:00
|
|
|
} else {
|
2015-04-13 14:11:59 +08:00
|
|
|
/*
|
|
|
|
* for edge trigger, it's already unmasked,
|
2009-02-09 08:18:03 +08:00
|
|
|
* so only need to unmask if it is level-trigger
|
|
|
|
* do we really have level trigger timer?
|
|
|
|
*/
|
|
|
|
int idx;
|
|
|
|
idx = find_irq_entry(apic1, pin1, mp_INT);
|
|
|
|
if (idx != -1 && irq_trigger(idx))
|
2017-07-18 17:20:44 +08:00
|
|
|
unmask_ioapic_irq(irq_get_irq_data(0));
|
2008-05-28 04:19:51 +08:00
|
|
|
}
|
2017-02-01 02:03:21 +08:00
|
|
|
irq_domain_deactivate_irq(irq_data);
|
2015-04-14 10:29:53 +08:00
|
|
|
irq_domain_activate_irq(irq_data);
|
2005-04-17 06:20:36 +08:00
|
|
|
if (timer_irq_works()) {
|
2005-09-13 00:49:25 +08:00
|
|
|
if (disable_timer_pin_1 > 0)
|
|
|
|
clear_IO_APIC_pin(0, pin1);
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
goto out;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
2012-09-26 18:44:41 +08:00
|
|
|
panic_if_irq_remap("timer doesn't work through Interrupt-remapped IO-APIC");
|
2009-02-09 08:18:03 +08:00
|
|
|
local_irq_disable();
|
2005-10-31 06:59:39 +08:00
|
|
|
clear_IO_APIC_pin(apic1, pin1);
|
2008-05-28 04:19:51 +08:00
|
|
|
if (!no_pin1)
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_ERR "..MP-BIOS bug: "
|
|
|
|
"8254 timer not connected to IO-APIC\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "...trying to set up timer "
|
|
|
|
"(IRQ0) through the 8259A ...\n");
|
|
|
|
apic_printk(APIC_QUIET, KERN_INFO
|
|
|
|
"..... (found apic %d pin %d) ...\n", apic2, pin2);
|
2005-04-17 06:20:36 +08:00
|
|
|
/*
|
|
|
|
* legacy devices should be connected to IO APIC #0
|
|
|
|
*/
|
2015-04-14 10:29:53 +08:00
|
|
|
replace_pin_at_irq_node(data, node, apic1, pin1, apic2, pin2);
|
2017-02-01 02:03:21 +08:00
|
|
|
irq_domain_deactivate_irq(irq_data);
|
2015-04-14 10:29:53 +08:00
|
|
|
irq_domain_activate_irq(irq_data);
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->unmask(0);
|
2005-04-17 06:20:36 +08:00
|
|
|
if (timer_irq_works()) {
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "....... works.\n");
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
goto out;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
/*
|
|
|
|
* Cleanup, just in case ...
|
|
|
|
*/
|
2009-02-09 08:18:03 +08:00
|
|
|
local_irq_disable();
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->mask(0);
|
2005-10-31 06:59:39 +08:00
|
|
|
clear_IO_APIC_pin(apic2, pin2);
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "....... failed.\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO
|
|
|
|
"...trying to set up timer as Virtual Wire IRQ...\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2010-09-28 23:28:38 +08:00
|
|
|
lapic_register_intr(0);
|
2008-08-20 11:50:28 +08:00
|
|
|
apic_write(APIC_LVT0, APIC_DM_FIXED | cfg->vector); /* Fixed mode */
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->unmask(0);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
if (timer_irq_works()) {
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "..... works.\n");
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
goto out;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
2009-02-09 08:18:03 +08:00
|
|
|
local_irq_disable();
|
2010-09-28 21:01:33 +08:00
|
|
|
legacy_pic->mask(0);
|
2008-08-20 11:50:28 +08:00
|
|
|
apic_write(APIC_LVT0, APIC_LVT_MASKED | APIC_DM_FIXED | cfg->vector);
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "..... failed.\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO
|
|
|
|
"...trying to set up timer as ExtINT IRQ...\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2009-11-10 03:27:04 +08:00
|
|
|
legacy_pic->init(0);
|
|
|
|
legacy_pic->make_irq(0);
|
x86: APIC: remove apic_write_around(); use alternatives
Use alternatives to select the workaround for the 11AP Pentium erratum
for the affected steppings on the fly rather than build time. Remove the
X86_GOOD_APIC configuration option and replace all the calls to
apic_write_around() with plain apic_write(), protecting accesses to the
ESR as appropriate due to the 3AP Pentium erratum. Remove
apic_read_around() and all its invocations altogether as not needed.
Remove apic_write_atomic() and all its implementing backends. The use of
ASM_OUTPUT2() is not strictly needed for input constraints, but I have
used it for readability's sake.
I had the feeling no one else was brave enough to do it, so I went ahead
and here it is. Verified by checking the generated assembly and tested
with both a 32-bit and a 64-bit configuration, also with the 11AP
"feature" forced on and verified with gdb on /proc/kcore to work as
expected (as an 11AP machines are quite hard to get hands on these days).
Some script complained about the use of "volatile", but apic_write() needs
it for the same reason and is effectively a replacement for writel(), so I
have disregarded it.
I am not sure what the policy wrt defconfig files is, they are generated
and there is risk of a conflict resulting from an unrelated change, so I
have left changes to them out. The option will get removed from them at
the next run.
Some testing with machines other than mine will be needed to avoid some
stupid mistake, but despite its volume, the change is not really that
intrusive, so I am fairly confident that because it works for me, it will
everywhere.
Signed-off-by: Maciej W. Rozycki <macro@linux-mips.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2008-07-17 02:15:30 +08:00
|
|
|
apic_write(APIC_LVT0, APIC_DM_EXTINT);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
unlock_ExtINT_logic();
|
|
|
|
|
|
|
|
if (timer_irq_works()) {
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "..... works.\n");
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
goto out;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
2009-02-09 08:18:03 +08:00
|
|
|
local_irq_disable();
|
2008-07-15 02:08:13 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO "..... failed :(.\n");
|
2015-01-16 05:22:14 +08:00
|
|
|
if (apic_is_x2apic_enabled())
|
2011-12-22 09:45:17 +08:00
|
|
|
apic_printk(APIC_QUIET, KERN_INFO
|
|
|
|
"Perhaps problem with the pre-enabled x2apic mode\n"
|
|
|
|
"Try booting with x2apic and interrupt-remapping disabled in the bios.\n");
|
2005-04-17 06:20:36 +08:00
|
|
|
panic("IO-APIC + timer doesn't work! Boot with apic=debug and send a "
|
2008-07-15 02:08:13 +08:00
|
|
|
"report. Then try booting with the 'noapic' option.\n");
|
x86: fix "Kernel panic - not syncing: IO-APIC + timer doesn't work!"
this is the tale of a full day spent debugging an ancient but elusive bug.
after booting up thousands of random .config kernels, i finally happened
to generate a .config that produced the following rare bootup failure
on 32-bit x86:
| ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
| ..MP-BIOS bug: 8254 timer not connected to IO-APIC
| ...trying to set up timer (IRQ0) through the 8259A ... failed.
| ...trying to set up timer as Virtual Wire IRQ... failed.
| ...trying to set up timer as ExtINT IRQ... failed :(.
| Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug
| and send a report. Then try booting with the 'noapic' option
this bug has been reported many times during the years, but it was never
reproduced nor fixed.
the bug that i hit was extremely sensitive to .config details.
First i did a .config-bisection - suspecting some .config detail.
That led to CONFIG_X86_MCE: enabling X86_MCE magically made the bug disappear
and the system would boot up just fine.
Debugging my way through the MCE code ended up identifying two unlikely
candidates: the thing that made a real difference to the hang was that
X86_MCE did two printks:
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
Adding the same printks to a !CONFIG_X86_MCE kernel made the bug go away!
this left timing as the main suspect: i experimented with adding various
udelay()s to the arch/x86/kernel/io_apic_32.c:check_timer() function, and
the race window turned out to be narrower than 30 microseconds (!).
That made debugging especially funny, debugging without having printk
ability before the bug hits is ... interesting ;-)
eventually i started suspecting IRQ activities - those are pretty much the
only thing that happen this early during bootup and have the timescale of
a few dozen microseconds. Also, check_timer() changes the IRQ hardware
in various creative ways, so the main candidate became IRQ0 interaction.
i've added a counter to track timer irqs (on which core they arrived, at
what exact time, etc.) and found that no timer IRQ would arrive after the
bug condition hits - even if we re-enable IRQ0 and re-initialize the i8259A,
but that we'd get a small number of timer irqs right around the time when we
call the check_timer() function.
Eventually i got the following backtrace triggered from debug code in the
timer interrupt:
...trying to set up timer as Virtual Wire IRQ... failed.
...trying to set up timer as ExtINT IRQ...
Pid: 1, comm: swapper Not tainted (2.6.24-rc5 #57)
EIP: 0060:[<c044d57e>] EFLAGS: 00000246 CPU: 0
EIP is at _spin_unlock_irqrestore+0x5/0x1c
EAX: c0634178 EBX: 00000000 ECX: c4947d63 EDX: 00000246
ESI: 00000002 EDI: 00010031 EBP: c04e0f2e ESP: f7c41df4
DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: ffe04000 CR3: 00630000 CR4: 000006d0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
[<c05f5784>] setup_IO_APIC+0x9c3/0xc5c
the spin_unlock() was called from init_8259A(). Wait ... we have an IRQ0
entry while we are in the middle of setting up the local APIC, the i8259A
and the PIT??
That is certainly not how it's supposed to work! check_timer() was supposed
to be called with irqs turned off - but this eroded away sometime in the
past. This code would still work most of the time because this code runs
very quickly, but just the right timing conditions are present and IRQ0
hits in this small, ~30 usecs window, timer irqs stop and the system does
not boot up. Also, given how early this is during bootup, the hang is
very deterministic - but it would only occur on certain machines (and
certain configs).
The fix was quite simple: disable/restore interrupts properly in this
function. With that in place the test-system now boots up just fine.
(64-bit x86 io_apic_64.c had the same bug.)
Phew! One down, only 1500 other kernel bugs are left ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
2007-12-19 01:05:58 +08:00
|
|
|
out:
|
|
|
|
local_irq_restore(flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
x86: I/O APIC: Never configure IRQ2
There is no such entity as ISA IRQ2. The ACPI spec does not make it
explicitly clear, but does not preclude it either -- all it says is ISA
legacy interrupts are identity mapped by default (subject to overrides),
but it does not state whether IRQ2 exists or not. As a result if there is
no IRQ0 override, then IRQ2 is normally initialised as an ISA interrupt,
which implies an edge-triggered line, which is unmasked by default as this
is what we do for edge-triggered I/O APIC interrupts so as not to miss an
edge.
To the best of my knowledge it is useless, as IRQ2 has not been in use
since the PC/AT as back then it was taken by the 8259A cascade interrupt
to the slave, with the line position in the slot rerouted to newly-created
IRQ9. No device could thus make use of this line with the pair of 8259A
chips. Now in theory INTIN2 of the I/O APIC may be usable, but the
interrupt of the device wired to it would not be available in the PIC mode
at all, so I seriously doubt if anybody decided to reuse it for a regular
device.
However there are two common uses of INTIN2. One is for IRQ0, with an
ACPI interrupt override (or its equivalent in the MP table). But in this
case IRQ2 is gone entirely with INTIN0 left vacant. The other one is for
an 8959A ExtINTA cascade. In this case IRQ0 goes to INTIN0 and if ACPI is
used INTIN2 is assumed to be IRQ2 (there is no override and ACPI has no
way to report ExtINTA interrupts). This is where a problem happens.
The problem is INTIN2 is configured as a native APIC interrupt, with a
vector assigned and the mask cleared. And the line may indeed get active
and inject interrupts if the master 8959A has its timer interrupt enabled
(it might happen for other interrupts too, but they are normally masked in
the process of rerouting them to the I/O APIC). There are two cases where
it will happen:
* When the I/O APIC NMI watchdog is enabled. This is actually a misnomer
as the watchdog pulses are delivered through the 8259A to the LINT0
inputs of all the local APICs in the system. The implication is the
output of the master 8259A goes high and low repeatedly, signalling
interrupts to INTIN2 which is enabled too!
[The origin of the name is I think for a brief period during the
development we had a capability in our code to configure the watchdog to
use an I/O APIC input; that would be INTIN2 in this scenario.]
* When the native route of IRQ0 via INTIN0 fails for whatever reason -- as
it happens with the system considered here. In this scenario the timer
pulse is delivered through the 8259A to LINT0 input of the local APIC of
the bootstrap processor, quite similarly to how is done for the watchdog
described above. The result is, again, INTIN2 receives these pulses
too. Rafael's system used to escape this scenario, because an incorrect
IRQ0 override would occupy INTIN2 and prevent it from being unmasked.
My conclusion is IRQ2 should be excluded from configuration in all the
cases and the current exception for ACPI systems should be lifted. The
reason being the exception not only being useless, but harmful as well.
Signed-off-by: Maciej W. Rozycki <macro@linux-mips.org>
Cc: "Rafael J. Wysocki" <rjw@sisk.pl>
Cc: Matthew Garrett <mjg59@srcf.ucam.org>
Cc: Andreas Herrmann <andreas.herrmann3@amd.com>
Cc: Stephen Rothwell <sfr@canb.auug.org.au>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2008-07-12 02:35:23 +08:00
|
|
|
* Traditionally ISA IRQ2 is the cascade IRQ, and is not available
|
|
|
|
* to devices. However there may be an I/O APIC pin available for
|
|
|
|
* this interrupt regardless. The pin may be left unconnected, but
|
|
|
|
* typically it will be reused as an ExtINT cascade interrupt for
|
|
|
|
* the master 8259A. In the MPS case such a pin will normally be
|
|
|
|
* reported as an ExtINT interrupt in the MP table. With ACPI
|
|
|
|
* there is no provision for ExtINT interrupts, and in the absence
|
|
|
|
* of an override it would be treated as an ordinary ISA I/O APIC
|
|
|
|
* interrupt, that is edge-triggered and unmasked by default. We
|
|
|
|
* used to do this, but it caused problems on some systems because
|
|
|
|
* of the NMI watchdog and sometimes IRQ0 of the 8254 timer using
|
|
|
|
* the same ExtINT cascade interrupt to drive the local APIC of the
|
|
|
|
* bootstrap processor. Therefore we refrain from routing IRQ2 to
|
|
|
|
* the I/O APIC in all cases now. No actual device should request
|
|
|
|
* it anyway. --macro
|
2005-04-17 06:20:36 +08:00
|
|
|
*/
|
2009-08-30 00:09:57 +08:00
|
|
|
#define PIC_IRQS (1UL << PIC_CASCADE_IR)
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2014-06-09 16:19:53 +08:00
|
|
|
static int mp_irqdomain_create(int ioapic)
|
|
|
|
{
|
2015-04-13 14:11:59 +08:00
|
|
|
struct irq_alloc_info info;
|
|
|
|
struct irq_domain *parent;
|
2014-06-09 16:19:53 +08:00
|
|
|
int hwirqs = mp_ioapic_pin_count(ioapic);
|
|
|
|
struct ioapic *ip = &ioapics[ioapic];
|
|
|
|
struct ioapic_domain_cfg *cfg = &ip->irqdomain_cfg;
|
|
|
|
struct mp_ioapic_gsi *gsi_cfg = mp_ioapic_gsi_routing(ioapic);
|
2017-06-20 07:37:07 +08:00
|
|
|
struct fwnode_handle *fn;
|
|
|
|
char *name = "IO-APIC";
|
2014-06-09 16:19:53 +08:00
|
|
|
|
|
|
|
if (cfg->type == IOAPIC_DOMAIN_INVALID)
|
|
|
|
return 0;
|
|
|
|
|
2015-04-13 14:11:59 +08:00
|
|
|
init_irq_alloc_info(&info, NULL);
|
|
|
|
info.type = X86_IRQ_ALLOC_TYPE_IOAPIC;
|
|
|
|
info.ioapic_id = mpc_ioapic_id(ioapic);
|
|
|
|
parent = irq_remapping_get_ir_irq_domain(&info);
|
|
|
|
if (!parent)
|
|
|
|
parent = x86_vector_domain;
|
2017-06-20 07:37:07 +08:00
|
|
|
else
|
|
|
|
name = "IO-APIC-IR";
|
|
|
|
|
|
|
|
/* Handle device tree enumerated APICs proper */
|
|
|
|
if (cfg->dev) {
|
|
|
|
fn = of_node_to_fwnode(cfg->dev);
|
|
|
|
} else {
|
|
|
|
fn = irq_domain_alloc_named_id_fwnode(name, ioapic);
|
|
|
|
if (!fn)
|
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
|
|
|
|
ip->irqdomain = irq_domain_create_linear(fn, hwirqs, cfg->ops,
|
|
|
|
(void *)(long)ioapic);
|
|
|
|
|
|
|
|
/* Release fw handle if it was allocated above */
|
|
|
|
if (!cfg->dev)
|
|
|
|
irq_domain_free_fwnode(fn);
|
2015-04-13 14:11:59 +08:00
|
|
|
|
2015-04-14 10:29:39 +08:00
|
|
|
if (!ip->irqdomain)
|
2014-06-09 16:19:53 +08:00
|
|
|
return -ENOMEM;
|
2015-04-14 10:29:39 +08:00
|
|
|
|
|
|
|
ip->irqdomain->parent = parent;
|
2014-06-09 16:19:53 +08:00
|
|
|
|
|
|
|
if (cfg->type == IOAPIC_DOMAIN_LEGACY ||
|
|
|
|
cfg->type == IOAPIC_DOMAIN_STRICT)
|
|
|
|
ioapic_dynirq_base = max(ioapic_dynirq_base,
|
|
|
|
gsi_cfg->gsi_end + 1);
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:46 +08:00
|
|
|
static void ioapic_destroy_irqdomain(int idx)
|
|
|
|
{
|
|
|
|
if (ioapics[idx].irqdomain) {
|
|
|
|
irq_domain_remove(ioapics[idx].irqdomain);
|
|
|
|
ioapics[idx].irqdomain = NULL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
void __init setup_IO_APIC(void)
|
|
|
|
{
|
2014-06-09 16:19:53 +08:00
|
|
|
int ioapic;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2015-01-16 05:22:32 +08:00
|
|
|
if (skip_ioapic_setup || !nr_ioapics)
|
|
|
|
return;
|
|
|
|
|
2014-06-09 16:19:48 +08:00
|
|
|
io_apic_irqs = nr_legacy_irqs() ? ~PIC_IRQS : ~0UL;
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
apic_printk(APIC_VERBOSE, "ENABLING IO-APIC IRQs\n");
|
2014-06-09 16:19:53 +08:00
|
|
|
for_each_ioapic(ioapic)
|
|
|
|
BUG_ON(mp_irqdomain_create(ioapic));
|
|
|
|
|
2008-10-15 21:27:23 +08:00
|
|
|
/*
|
2008-08-20 15:07:45 +08:00
|
|
|
* Set up IO-APIC IRQ routing.
|
|
|
|
*/
|
2009-08-20 15:27:29 +08:00
|
|
|
x86_init.mpparse.setup_ioapic_ids();
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
sync_Arb_IDs();
|
|
|
|
setup_IO_APIC_irqs();
|
|
|
|
init_IO_APIC_traps();
|
2014-06-09 16:19:48 +08:00
|
|
|
if (nr_legacy_irqs())
|
2009-08-30 00:09:57 +08:00
|
|
|
check_timer();
|
2014-06-09 16:20:11 +08:00
|
|
|
|
|
|
|
ioapic_initialized = 1;
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2011-10-12 15:33:48 +08:00
|
|
|
static void resume_ioapic_id(int ioapic_idx)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
unsigned long flags;
|
|
|
|
union IO_APIC_reg_00 reg_00;
|
2008-06-08 19:07:18 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2011-10-12 15:33:48 +08:00
|
|
|
reg_00.raw = io_apic_read(ioapic_idx, 0);
|
|
|
|
if (reg_00.bits.ID != mpc_ioapic_id(ioapic_idx)) {
|
|
|
|
reg_00.bits.ID = mpc_ioapic_id(ioapic_idx);
|
|
|
|
io_apic_write(ioapic_idx, 0, reg_00.raw);
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2011-03-24 05:15:54 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2011-03-24 05:15:54 +08:00
|
|
|
static void ioapic_resume(void)
|
|
|
|
{
|
2011-10-12 15:33:48 +08:00
|
|
|
int ioapic_idx;
|
2011-03-24 05:15:54 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic_reverse(ioapic_idx)
|
2011-10-12 15:33:48 +08:00
|
|
|
resume_ioapic_id(ioapic_idx);
|
2011-05-19 07:31:34 +08:00
|
|
|
|
|
|
|
restore_ioapic_entries();
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
2011-03-24 05:15:54 +08:00
|
|
|
static struct syscore_ops ioapic_syscore_ops = {
|
2011-05-19 07:31:34 +08:00
|
|
|
.suspend = save_ioapic_entries,
|
2005-04-17 06:20:36 +08:00
|
|
|
.resume = ioapic_resume,
|
|
|
|
};
|
|
|
|
|
2011-03-24 05:15:54 +08:00
|
|
|
static int __init ioapic_init_ops(void)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
2011-03-24 05:15:54 +08:00
|
|
|
register_syscore_ops(&ioapic_syscore_ops);
|
|
|
|
|
2005-04-17 06:20:36 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2011-03-24 05:15:54 +08:00
|
|
|
device_initcall(ioapic_init_ops);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2014-10-27 13:21:41 +08:00
|
|
|
static int io_apic_get_redir_entries(int ioapic)
|
2008-08-20 11:50:52 +08:00
|
|
|
{
|
|
|
|
union IO_APIC_reg_01 reg_01;
|
|
|
|
unsigned long flags;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2008-08-20 11:50:52 +08:00
|
|
|
reg_01.raw = io_apic_read(ioapic, 1);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2008-08-20 11:50:52 +08:00
|
|
|
|
2010-03-30 16:07:08 +08:00
|
|
|
/* The register returns the maximum index redir index
|
|
|
|
* supported, which is one less than the total number of redir
|
|
|
|
* entries.
|
|
|
|
*/
|
|
|
|
return reg_01.bits.entries + 1;
|
2008-08-20 11:50:52 +08:00
|
|
|
}
|
|
|
|
|
2014-04-24 15:50:53 +08:00
|
|
|
unsigned int arch_dynirq_lower_bound(unsigned int from)
|
|
|
|
{
|
2014-06-09 16:20:11 +08:00
|
|
|
/*
|
|
|
|
* dmar_alloc_hwirq() may be called before setup_IO_APIC(), so use
|
|
|
|
* gsi_top if ioapic_dynirq_base hasn't been initialized yet.
|
|
|
|
*/
|
|
|
|
return ioapic_initialized ? ioapic_dynirq_base : gsi_top;
|
2014-04-24 15:50:53 +08:00
|
|
|
}
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2014-10-27 13:21:41 +08:00
|
|
|
static int io_apic_get_unique_id(int ioapic, int apic_id)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
union IO_APIC_reg_00 reg_00;
|
|
|
|
static physid_mask_t apic_id_map = PHYSID_MASK_NONE;
|
|
|
|
physid_mask_t tmp;
|
|
|
|
unsigned long flags;
|
|
|
|
int i = 0;
|
|
|
|
|
|
|
|
/*
|
2008-06-08 19:07:18 +08:00
|
|
|
* The P4 platform supports up to 256 APIC IDs on two separate APIC
|
|
|
|
* buses (one for LAPICs, one for IOAPICs), where predecessors only
|
2005-04-17 06:20:36 +08:00
|
|
|
* supports up to 16 on one shared APIC bus.
|
2008-06-08 19:07:18 +08:00
|
|
|
*
|
2005-04-17 06:20:36 +08:00
|
|
|
* TBD: Expand LAPIC/IOAPIC support on P4-class systems to take full
|
|
|
|
* advantage of new APIC bus architecture.
|
|
|
|
*/
|
|
|
|
|
|
|
|
if (physids_empty(apic_id_map))
|
2009-11-10 06:06:59 +08:00
|
|
|
apic->ioapic_phys_id_map(&phys_cpu_present_map, &apic_id_map);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
reg_00.raw = io_apic_read(ioapic, 0);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
if (apic_id >= get_physical_broadcast()) {
|
|
|
|
printk(KERN_WARNING "IOAPIC[%d]: Invalid apic_id %d, trying "
|
|
|
|
"%d\n", ioapic, apic_id, reg_00.bits.ID);
|
|
|
|
apic_id = reg_00.bits.ID;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
2008-06-08 19:07:18 +08:00
|
|
|
* Every APIC in a system must have a unique ID or we get lots of nice
|
2005-04-17 06:20:36 +08:00
|
|
|
* 'stuck on smp_invalidate_needed IPI wait' messages.
|
|
|
|
*/
|
2009-11-10 06:06:59 +08:00
|
|
|
if (apic->check_apicid_used(&apic_id_map, apic_id)) {
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
for (i = 0; i < get_physical_broadcast(); i++) {
|
2009-11-10 06:06:59 +08:00
|
|
|
if (!apic->check_apicid_used(&apic_id_map, i))
|
2005-04-17 06:20:36 +08:00
|
|
|
break;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (i == get_physical_broadcast())
|
|
|
|
panic("Max apic_id exceeded!\n");
|
|
|
|
|
|
|
|
printk(KERN_WARNING "IOAPIC[%d]: apic_id %d already used, "
|
|
|
|
"trying %d\n", ioapic, apic_id, i);
|
|
|
|
|
|
|
|
apic_id = i;
|
2008-06-08 19:07:18 +08:00
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2009-11-10 06:06:59 +08:00
|
|
|
apic->apicid_to_cpu_present(apic_id, &tmp);
|
2005-04-17 06:20:36 +08:00
|
|
|
physids_or(apic_id_map, apic_id_map, tmp);
|
|
|
|
|
|
|
|
if (reg_00.bits.ID != apic_id) {
|
|
|
|
reg_00.bits.ID = apic_id;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
io_apic_write(ioapic, 0, reg_00.raw);
|
|
|
|
reg_00.raw = io_apic_read(ioapic, 0);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
/* Sanity check */
|
2006-02-26 11:18:34 +08:00
|
|
|
if (reg_00.bits.ID != apic_id) {
|
2012-05-22 10:50:07 +08:00
|
|
|
pr_err("IOAPIC[%d]: Unable to change apic_id!\n",
|
|
|
|
ioapic);
|
2006-02-26 11:18:34 +08:00
|
|
|
return -1;
|
|
|
|
}
|
2005-04-17 06:20:36 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_INFO
|
|
|
|
"IOAPIC[%d]: Assigned apic_id %d\n", ioapic, apic_id);
|
|
|
|
|
|
|
|
return apic_id;
|
|
|
|
}
|
2011-02-23 23:08:03 +08:00
|
|
|
|
2014-10-27 13:21:41 +08:00
|
|
|
static u8 io_apic_unique_id(int idx, u8 id)
|
2011-02-23 23:08:03 +08:00
|
|
|
{
|
|
|
|
if ((boot_cpu_data.x86_vendor == X86_VENDOR_INTEL) &&
|
2016-09-14 02:12:32 +08:00
|
|
|
!APIC_XAPIC(boot_cpu_apic_version))
|
2014-10-27 13:21:40 +08:00
|
|
|
return io_apic_get_unique_id(idx, id);
|
2011-02-23 23:08:03 +08:00
|
|
|
else
|
|
|
|
return id;
|
|
|
|
}
|
|
|
|
#else
|
2014-10-27 13:21:41 +08:00
|
|
|
static u8 io_apic_unique_id(int idx, u8 id)
|
2011-02-23 23:08:03 +08:00
|
|
|
{
|
2014-10-27 13:21:40 +08:00
|
|
|
union IO_APIC_reg_00 reg_00;
|
2011-02-23 23:08:03 +08:00
|
|
|
DECLARE_BITMAP(used, 256);
|
2014-10-27 13:21:40 +08:00
|
|
|
unsigned long flags;
|
|
|
|
u8 new_id;
|
|
|
|
int i;
|
2011-02-23 23:08:03 +08:00
|
|
|
|
|
|
|
bitmap_zero(used, 256);
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(i)
|
2011-05-19 07:31:37 +08:00
|
|
|
__set_bit(mpc_ioapic_id(i), used);
|
2014-10-27 13:21:40 +08:00
|
|
|
|
|
|
|
/* Hand out the requested id if available */
|
2011-02-23 23:08:03 +08:00
|
|
|
if (!test_bit(id, used))
|
|
|
|
return id;
|
2014-10-27 13:21:40 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Read the current id from the ioapic and keep it if
|
|
|
|
* available.
|
|
|
|
*/
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
|
|
|
reg_00.raw = io_apic_read(idx, 0);
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
new_id = reg_00.bits.ID;
|
|
|
|
if (!test_bit(new_id, used)) {
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_INFO
|
|
|
|
"IOAPIC[%d]: Using reg apic_id %d instead of %d\n",
|
|
|
|
idx, new_id, id);
|
|
|
|
return new_id;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Get the next free id and write it to the ioapic.
|
|
|
|
*/
|
|
|
|
new_id = find_first_zero_bit(used, 256);
|
|
|
|
reg_00.bits.ID = new_id;
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
|
|
|
io_apic_write(idx, 0, reg_00.raw);
|
|
|
|
reg_00.raw = io_apic_read(idx, 0);
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
/* Sanity check */
|
|
|
|
BUG_ON(reg_00.bits.ID != new_id);
|
|
|
|
|
|
|
|
return new_id;
|
2011-02-23 23:08:03 +08:00
|
|
|
}
|
x86: Print real IOAPIC version for x86-64
Fix the fact that the IOAPIC version number in the x86_64 code path always
gets assigned to 0, instead of the correct value.
Before the patch: (from "dmesg" output):
ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
IOAPIC[0]: apic_id 8, version 0, address 0xfec00000, GSI 0-23 <---
After the patch:
ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23 <---
History:
io_apic_get_version() was compiled out of the x86_64 code path in the commit
f2c2cca3acef8b253a36381d9b469ad4fb08563a:
Author: Andi Kleen <ak@suse.de>
Date: Tue Sep 26 10:52:37 2006 +0200
[PATCH] Remove APIC version/cpu capability mpparse checking/printing
ACPI went to great trouble to get the APIC version and CPU capabilities
of different CPUs before passing them to the mpparser. But all
that data was used was to print it out. Actually it even faked some data
based on the boot cpu, not on the actual CPU being booted.
Remove all this code because it's not needed.
Cc: len.brown@intel.com
At the time, the IOAPIC version number was deliberately not printed
in the x86_64 code path. However, after the x86 and x86_64 files were
merged, the net result is that the IOAPIC version is printed incorrectly
in the x86_64 code path.
The patch below provides a fix. I have tested it with acpi, and with
acpi=off, and did not see any problems.
Signed-off-by: Naga Chumbalkar <nagananda.chumbalkar@hp.com>
Acked-by: Yinghai Lu <yhlu.kernel@gmail.com>
LKML-Reference: <20090416014230.4885.94926.sendpatchset@localhost.localdomain>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
*************************
2009-05-27 05:48:07 +08:00
|
|
|
#endif
|
2005-04-17 06:20:36 +08:00
|
|
|
|
2014-10-27 13:21:41 +08:00
|
|
|
static int io_apic_get_version(int ioapic)
|
2005-04-17 06:20:36 +08:00
|
|
|
{
|
|
|
|
union IO_APIC_reg_01 reg_01;
|
|
|
|
unsigned long flags;
|
|
|
|
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
reg_01.raw = io_apic_read(ioapic, 1);
|
2009-07-26 00:39:36 +08:00
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
2005-04-17 06:20:36 +08:00
|
|
|
|
|
|
|
return reg_01.bits.version;
|
|
|
|
}
|
|
|
|
|
2010-03-30 16:07:03 +08:00
|
|
|
int acpi_get_override_irq(u32 gsi, int *trigger, int *polarity)
|
2007-11-17 14:05:28 +08:00
|
|
|
{
|
2010-03-30 16:07:03 +08:00
|
|
|
int ioapic, pin, idx;
|
2007-11-17 14:05:28 +08:00
|
|
|
|
|
|
|
if (skip_ioapic_setup)
|
|
|
|
return -1;
|
|
|
|
|
2010-03-30 16:07:03 +08:00
|
|
|
ioapic = mp_find_ioapic(gsi);
|
|
|
|
if (ioapic < 0)
|
2007-11-17 14:05:28 +08:00
|
|
|
return -1;
|
|
|
|
|
2010-03-30 16:07:03 +08:00
|
|
|
pin = mp_find_ioapic_pin(ioapic, gsi);
|
|
|
|
if (pin < 0)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
idx = find_irq_entry(ioapic, pin, mp_INT);
|
|
|
|
if (idx < 0)
|
2007-11-17 14:05:28 +08:00
|
|
|
return -1;
|
|
|
|
|
2010-03-30 16:07:03 +08:00
|
|
|
*trigger = irq_trigger(idx);
|
|
|
|
*polarity = irq_polarity(idx);
|
2007-11-17 14:05:28 +08:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2008-08-20 11:50:28 +08:00
|
|
|
/*
|
|
|
|
* This function currently is only a helper for the i386 smp boot process where
|
|
|
|
* we need to reprogram the ioredtbls to cater for the cpus which have come online
|
2009-01-28 11:32:51 +08:00
|
|
|
* so mask in all cases should simply be apic->target_cpus()
|
2008-08-20 11:50:28 +08:00
|
|
|
*/
|
|
|
|
#ifdef CONFIG_SMP
|
|
|
|
void __init setup_ioapic_dest(void)
|
|
|
|
{
|
x86: Fix out of order of gsi
Iranna D Ankad reported that IBM x3950 systems have boot
problems after this commit:
|
| commit b9c61b70075c87a8612624736faf4a2de5b1ed30
|
| x86/pci: update pirq_enable_irq() to setup io apic routing
|
The problem is that with the patch, the machine freezes when
console=ttyS0,... kernel serial parameter is passed.
It seem to freeze at DVD initialization and the whole problem
seem to be DVD/pata related, but somehow exposed through the
serial parameter.
Such apic problems can expose really weird behavior:
ACPI: IOAPIC (id[0x10] address[0xfecff000] gsi_base[0])
IOAPIC[0]: apic_id 16, version 0, address 0xfecff000, GSI 0-2
ACPI: IOAPIC (id[0x0f] address[0xfec00000] gsi_base[3])
IOAPIC[1]: apic_id 15, version 0, address 0xfec00000, GSI 3-38
ACPI: IOAPIC (id[0x0e] address[0xfec01000] gsi_base[39])
IOAPIC[2]: apic_id 14, version 0, address 0xfec01000, GSI 39-74
ACPI: INT_SRC_OVR (bus 0 bus_irq 1 global_irq 4 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 5 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 3 global_irq 6 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 4 global_irq 7 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 6 global_irq 9 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 7 global_irq 10 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 8 global_irq 11 low edge)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 12 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 12 global_irq 15 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 13 global_irq 16 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 14 global_irq 17 low edge)
ACPI: INT_SRC_OVR (bus 0 bus_irq 15 global_irq 18 dfl dfl)
It turns out that the system has three io apic controllers, but
boot ioapic routing is in the second one, and that gsi_base is
not 0 - it is using a bunch of INT_SRC_OVR...
So these recent changes:
1. one set routing for first io apic controller
2. assume irq = gsi
... will break that system.
So try to remap those gsis, need to seperate boot_ioapic_idx
detection out of enable_IO_APIC() and call them early.
So introduce boot_ioapic_idx, and remap_ioapic_gsi()...
-v2: shift gsi with delta instead of gsi_base of boot_ioapic_idx
-v3: double check with find_isa_irq_apic(0, mp_INT) to get right
boot_ioapic_idx
-v4: nr_legacy_irqs
-v5: add print out for boot_ioapic_idx, and also make it could be
applied for current kernel and previous kernel
-v6: add bus_irq, in acpi_sci_ioapic_setup, so can get overwride
for sci right mapping...
-v7: looks like pnpacpi get irq instead of gsi, so need to revert
them back...
-v8: split into two patches
-v9: according to Eric, use fixed 16 for shifting instead of remap
-v10: still need to touch rsparser.c
-v11: just revert back to way Eric suggest...
anyway the ioapic in first ioapic is blocked by second...
-v12: two patches, this one will add more loop but check apic_id and irq > 16
Reported-by: Iranna D Ankad <iranna.ankad@in.ibm.com>
Bisected-by: Iranna D Ankad <iranna.ankad@in.ibm.com>
Tested-by: Gary Hade <garyhade@us.ibm.com>
Signed-off-by: Yinghai Lu <yinghai@kernel.org>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Cc: Thomas Renninger <trenn@suse.de>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Cc: Suresh Siddha <suresh.b.siddha@intel.com>
Cc: len.brown@intel.com
LKML-Reference: <4B8A321A.1000008@kernel.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-28 17:06:34 +08:00
|
|
|
int pin, ioapic, irq, irq_entry;
|
2008-12-17 09:33:56 +08:00
|
|
|
const struct cpumask *mask;
|
2016-01-14 15:43:38 +08:00
|
|
|
struct irq_desc *desc;
|
2011-02-05 22:35:51 +08:00
|
|
|
struct irq_data *idata;
|
2015-09-14 18:00:55 +08:00
|
|
|
struct irq_chip *chip;
|
2008-08-20 11:50:28 +08:00
|
|
|
|
|
|
|
if (skip_ioapic_setup == 1)
|
|
|
|
return;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic_pin(ioapic, pin) {
|
2009-05-07 01:10:06 +08:00
|
|
|
irq_entry = find_irq_entry(ioapic, pin, mp_INT);
|
|
|
|
if (irq_entry == -1)
|
|
|
|
continue;
|
2008-11-07 19:33:49 +08:00
|
|
|
|
2014-06-09 16:19:52 +08:00
|
|
|
irq = pin_2_irq(irq_entry, ioapic, pin, 0);
|
|
|
|
if (irq < 0 || !mp_init_irq_at_boot(ioapic, irq))
|
x86: Fix out of order of gsi
Iranna D Ankad reported that IBM x3950 systems have boot
problems after this commit:
|
| commit b9c61b70075c87a8612624736faf4a2de5b1ed30
|
| x86/pci: update pirq_enable_irq() to setup io apic routing
|
The problem is that with the patch, the machine freezes when
console=ttyS0,... kernel serial parameter is passed.
It seem to freeze at DVD initialization and the whole problem
seem to be DVD/pata related, but somehow exposed through the
serial parameter.
Such apic problems can expose really weird behavior:
ACPI: IOAPIC (id[0x10] address[0xfecff000] gsi_base[0])
IOAPIC[0]: apic_id 16, version 0, address 0xfecff000, GSI 0-2
ACPI: IOAPIC (id[0x0f] address[0xfec00000] gsi_base[3])
IOAPIC[1]: apic_id 15, version 0, address 0xfec00000, GSI 3-38
ACPI: IOAPIC (id[0x0e] address[0xfec01000] gsi_base[39])
IOAPIC[2]: apic_id 14, version 0, address 0xfec01000, GSI 39-74
ACPI: INT_SRC_OVR (bus 0 bus_irq 1 global_irq 4 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 5 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 3 global_irq 6 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 4 global_irq 7 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 6 global_irq 9 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 7 global_irq 10 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 8 global_irq 11 low edge)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 12 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 12 global_irq 15 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 13 global_irq 16 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 14 global_irq 17 low edge)
ACPI: INT_SRC_OVR (bus 0 bus_irq 15 global_irq 18 dfl dfl)
It turns out that the system has three io apic controllers, but
boot ioapic routing is in the second one, and that gsi_base is
not 0 - it is using a bunch of INT_SRC_OVR...
So these recent changes:
1. one set routing for first io apic controller
2. assume irq = gsi
... will break that system.
So try to remap those gsis, need to seperate boot_ioapic_idx
detection out of enable_IO_APIC() and call them early.
So introduce boot_ioapic_idx, and remap_ioapic_gsi()...
-v2: shift gsi with delta instead of gsi_base of boot_ioapic_idx
-v3: double check with find_isa_irq_apic(0, mp_INT) to get right
boot_ioapic_idx
-v4: nr_legacy_irqs
-v5: add print out for boot_ioapic_idx, and also make it could be
applied for current kernel and previous kernel
-v6: add bus_irq, in acpi_sci_ioapic_setup, so can get overwride
for sci right mapping...
-v7: looks like pnpacpi get irq instead of gsi, so need to revert
them back...
-v8: split into two patches
-v9: according to Eric, use fixed 16 for shifting instead of remap
-v10: still need to touch rsparser.c
-v11: just revert back to way Eric suggest...
anyway the ioapic in first ioapic is blocked by second...
-v12: two patches, this one will add more loop but check apic_id and irq > 16
Reported-by: Iranna D Ankad <iranna.ankad@in.ibm.com>
Bisected-by: Iranna D Ankad <iranna.ankad@in.ibm.com>
Tested-by: Gary Hade <garyhade@us.ibm.com>
Signed-off-by: Yinghai Lu <yinghai@kernel.org>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Cc: Thomas Renninger <trenn@suse.de>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Cc: Suresh Siddha <suresh.b.siddha@intel.com>
Cc: len.brown@intel.com
LKML-Reference: <4B8A321A.1000008@kernel.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-28 17:06:34 +08:00
|
|
|
continue;
|
|
|
|
|
2016-01-14 15:43:38 +08:00
|
|
|
desc = irq_to_desc(irq);
|
|
|
|
raw_spin_lock_irq(&desc->lock);
|
|
|
|
idata = irq_desc_get_irq_data(desc);
|
2008-11-07 19:33:49 +08:00
|
|
|
|
2009-05-07 01:10:06 +08:00
|
|
|
/*
|
|
|
|
* Honour affinities which have been set in early boot
|
|
|
|
*/
|
2011-02-05 22:35:51 +08:00
|
|
|
if (!irqd_can_balance(idata) || irqd_affinity_was_set(idata))
|
2015-06-03 11:46:22 +08:00
|
|
|
mask = irq_data_get_affinity_mask(idata);
|
2009-05-07 01:10:06 +08:00
|
|
|
else
|
|
|
|
mask = apic->target_cpus();
|
2008-08-20 11:50:28 +08:00
|
|
|
|
2015-09-14 18:00:55 +08:00
|
|
|
chip = irq_data_get_irq_chip(idata);
|
2015-10-27 08:08:04 +08:00
|
|
|
/* Might be lapic_chip for irq 0 */
|
|
|
|
if (chip->irq_set_affinity)
|
|
|
|
chip->irq_set_affinity(idata, mask, false);
|
2016-01-14 15:43:38 +08:00
|
|
|
raw_spin_unlock_irq(&desc->lock);
|
2008-08-20 11:50:28 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2008-08-20 15:07:45 +08:00
|
|
|
#define IOAPIC_RESOURCE_NAME_SIZE 11
|
|
|
|
|
|
|
|
static struct resource *ioapic_resources;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
static struct resource * __init ioapic_setup_resources(void)
|
2008-08-20 15:07:45 +08:00
|
|
|
{
|
|
|
|
unsigned long n;
|
|
|
|
struct resource *res;
|
|
|
|
char *mem;
|
2016-06-08 14:59:53 +08:00
|
|
|
int i;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2016-06-08 14:59:53 +08:00
|
|
|
if (nr_ioapics == 0)
|
2008-08-20 15:07:45 +08:00
|
|
|
return NULL;
|
|
|
|
|
|
|
|
n = IOAPIC_RESOURCE_NAME_SIZE + sizeof(struct resource);
|
2016-06-08 14:59:53 +08:00
|
|
|
n *= nr_ioapics;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
|
|
|
mem = alloc_bootmem(n);
|
|
|
|
res = (void *)mem;
|
|
|
|
|
2016-06-08 14:59:53 +08:00
|
|
|
mem += sizeof(struct resource) * nr_ioapics;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(i) {
|
2016-06-08 14:59:53 +08:00
|
|
|
res[i].name = mem;
|
|
|
|
res[i].flags = IORESOURCE_MEM | IORESOURCE_BUSY;
|
2009-11-08 23:54:31 +08:00
|
|
|
snprintf(mem, IOAPIC_RESOURCE_NAME_SIZE, "IOAPIC %u", i);
|
2009-08-25 01:53:39 +08:00
|
|
|
mem += IOAPIC_RESOURCE_NAME_SIZE;
|
2016-06-08 14:59:53 +08:00
|
|
|
ioapics[i].iomem_res = &res[i];
|
2008-08-20 15:07:45 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
ioapic_resources = res;
|
|
|
|
|
|
|
|
return res;
|
|
|
|
}
|
|
|
|
|
2015-04-24 19:57:48 +08:00
|
|
|
void __init io_apic_init_mappings(void)
|
2008-06-27 16:41:56 +08:00
|
|
|
{
|
|
|
|
unsigned long ioapic_phys, idx = FIX_IO_APIC_BASE_0;
|
2008-08-20 15:07:45 +08:00
|
|
|
struct resource *ioapic_res;
|
2008-10-15 21:27:23 +08:00
|
|
|
int i;
|
2008-06-27 16:41:56 +08:00
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
ioapic_res = ioapic_setup_resources();
|
|
|
|
for_each_ioapic(i) {
|
2008-06-27 16:41:56 +08:00
|
|
|
if (smp_found_config) {
|
2011-05-19 07:31:37 +08:00
|
|
|
ioapic_phys = mpc_ioapic_addr(i);
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2008-10-15 21:27:23 +08:00
|
|
|
if (!ioapic_phys) {
|
|
|
|
printk(KERN_ERR
|
|
|
|
"WARNING: bogus zero IO-APIC "
|
|
|
|
"address found in MPTABLE, "
|
|
|
|
"disabling IO/APIC support!\n");
|
|
|
|
smp_found_config = 0;
|
|
|
|
skip_ioapic_setup = 1;
|
|
|
|
goto fake_ioapic_page;
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
#endif
|
2008-06-27 16:41:56 +08:00
|
|
|
} else {
|
2008-08-20 15:07:45 +08:00
|
|
|
#ifdef CONFIG_X86_32
|
2008-06-27 16:41:56 +08:00
|
|
|
fake_ioapic_page:
|
2008-08-20 15:07:45 +08:00
|
|
|
#endif
|
x86: io-apic: IO-APIC MMIO should not fail on resource insertion
If IO-APIC base address is 1K aligned we should not fail
on resourse insertion procedure. For this sake we define
IO_APIC_SLOT_SIZE constant which should cover all IO-APIC
direct accessible registers.
An example of a such configuration is there
http://marc.info/?l=linux-kernel&m=118114792006520
|
| Quoting the message
|
| IOAPIC[0]: apic_id 2, version 32, address 0xfec00000, GSI 0-23
| IOAPIC[1]: apic_id 3, version 32, address 0xfec80000, GSI 24-47
| IOAPIC[2]: apic_id 4, version 32, address 0xfec80400, GSI 48-71
| IOAPIC[3]: apic_id 5, version 32, address 0xfec84000, GSI 72-95
| IOAPIC[4]: apic_id 8, version 32, address 0xfec84400, GSI 96-119
|
Reported-by: "Maciej W. Rozycki" <macro@linux-mips.org>
Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>
Acked-by: Yinghai Lu <yinghai@kernel.org>
LKML-Reference: <20091116151426.GC5653@lenovo>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-11-16 23:14:26 +08:00
|
|
|
ioapic_phys = (unsigned long)alloc_bootmem_pages(PAGE_SIZE);
|
2008-06-27 16:41:56 +08:00
|
|
|
ioapic_phys = __pa(ioapic_phys);
|
|
|
|
}
|
|
|
|
set_fixmap_nocache(idx, ioapic_phys);
|
x86: io-apic: IO-APIC MMIO should not fail on resource insertion
If IO-APIC base address is 1K aligned we should not fail
on resourse insertion procedure. For this sake we define
IO_APIC_SLOT_SIZE constant which should cover all IO-APIC
direct accessible registers.
An example of a such configuration is there
http://marc.info/?l=linux-kernel&m=118114792006520
|
| Quoting the message
|
| IOAPIC[0]: apic_id 2, version 32, address 0xfec00000, GSI 0-23
| IOAPIC[1]: apic_id 3, version 32, address 0xfec80000, GSI 24-47
| IOAPIC[2]: apic_id 4, version 32, address 0xfec80400, GSI 48-71
| IOAPIC[3]: apic_id 5, version 32, address 0xfec84000, GSI 72-95
| IOAPIC[4]: apic_id 8, version 32, address 0xfec84400, GSI 96-119
|
Reported-by: "Maciej W. Rozycki" <macro@linux-mips.org>
Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>
Acked-by: Yinghai Lu <yinghai@kernel.org>
LKML-Reference: <20091116151426.GC5653@lenovo>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-11-16 23:14:26 +08:00
|
|
|
apic_printk(APIC_VERBOSE, "mapped IOAPIC to %08lx (%08lx)\n",
|
|
|
|
__fix_to_virt(idx) + (ioapic_phys & ~PAGE_MASK),
|
|
|
|
ioapic_phys);
|
2008-06-27 16:41:56 +08:00
|
|
|
idx++;
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2009-08-25 01:53:39 +08:00
|
|
|
ioapic_res->start = ioapic_phys;
|
x86: io-apic: IO-APIC MMIO should not fail on resource insertion
If IO-APIC base address is 1K aligned we should not fail
on resourse insertion procedure. For this sake we define
IO_APIC_SLOT_SIZE constant which should cover all IO-APIC
direct accessible registers.
An example of a such configuration is there
http://marc.info/?l=linux-kernel&m=118114792006520
|
| Quoting the message
|
| IOAPIC[0]: apic_id 2, version 32, address 0xfec00000, GSI 0-23
| IOAPIC[1]: apic_id 3, version 32, address 0xfec80000, GSI 24-47
| IOAPIC[2]: apic_id 4, version 32, address 0xfec80400, GSI 48-71
| IOAPIC[3]: apic_id 5, version 32, address 0xfec84000, GSI 72-95
| IOAPIC[4]: apic_id 8, version 32, address 0xfec84400, GSI 96-119
|
Reported-by: "Maciej W. Rozycki" <macro@linux-mips.org>
Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>
Acked-by: Yinghai Lu <yinghai@kernel.org>
LKML-Reference: <20091116151426.GC5653@lenovo>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-11-16 23:14:26 +08:00
|
|
|
ioapic_res->end = ioapic_phys + IO_APIC_SLOT_SIZE - 1;
|
2009-08-25 01:53:39 +08:00
|
|
|
ioapic_res++;
|
2008-06-27 16:41:56 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2009-07-11 00:36:20 +08:00
|
|
|
void __init ioapic_insert_resources(void)
|
2008-08-20 15:07:45 +08:00
|
|
|
{
|
|
|
|
int i;
|
|
|
|
struct resource *r = ioapic_resources;
|
|
|
|
|
|
|
|
if (!r) {
|
2009-07-11 00:36:20 +08:00
|
|
|
if (nr_ioapics > 0)
|
2009-03-21 04:02:55 +08:00
|
|
|
printk(KERN_ERR
|
|
|
|
"IO APIC resources couldn't be allocated.\n");
|
2009-07-11 00:36:20 +08:00
|
|
|
return;
|
2008-08-20 15:07:45 +08:00
|
|
|
}
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(i) {
|
2008-08-20 15:07:45 +08:00
|
|
|
insert_resource(&iomem_resource, r);
|
|
|
|
r++;
|
|
|
|
}
|
|
|
|
}
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2010-03-30 16:07:09 +08:00
|
|
|
int mp_find_ioapic(u32 gsi)
|
2009-07-08 11:01:15 +08:00
|
|
|
{
|
2014-06-09 16:19:42 +08:00
|
|
|
int i;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2011-02-15 05:52:38 +08:00
|
|
|
if (nr_ioapics == 0)
|
|
|
|
return -1;
|
|
|
|
|
2009-07-08 11:01:15 +08:00
|
|
|
/* Find the IOAPIC that manages this GSI. */
|
2014-06-09 16:19:42 +08:00
|
|
|
for_each_ioapic(i) {
|
2011-05-19 07:31:38 +08:00
|
|
|
struct mp_ioapic_gsi *gsi_cfg = mp_ioapic_gsi_routing(i);
|
2014-06-09 16:19:42 +08:00
|
|
|
if (gsi >= gsi_cfg->gsi_base && gsi <= gsi_cfg->gsi_end)
|
2009-07-08 11:01:15 +08:00
|
|
|
return i;
|
|
|
|
}
|
2008-08-20 15:07:45 +08:00
|
|
|
|
2009-07-08 11:01:15 +08:00
|
|
|
printk(KERN_ERR "ERROR: Unable to locate IOAPIC for GSI %d\n", gsi);
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
2010-03-30 16:07:09 +08:00
|
|
|
int mp_find_ioapic_pin(int ioapic, u32 gsi)
|
2009-07-08 11:01:15 +08:00
|
|
|
{
|
2011-05-19 07:31:38 +08:00
|
|
|
struct mp_ioapic_gsi *gsi_cfg;
|
|
|
|
|
2014-06-09 16:19:42 +08:00
|
|
|
if (WARN_ON(ioapic < 0))
|
2009-07-08 11:01:15 +08:00
|
|
|
return -1;
|
2011-05-19 07:31:38 +08:00
|
|
|
|
|
|
|
gsi_cfg = mp_ioapic_gsi_routing(ioapic);
|
|
|
|
if (WARN_ON(gsi > gsi_cfg->gsi_end))
|
2009-07-08 11:01:15 +08:00
|
|
|
return -1;
|
|
|
|
|
2011-05-19 07:31:38 +08:00
|
|
|
return gsi - gsi_cfg->gsi_base;
|
2009-07-08 11:01:15 +08:00
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:41 +08:00
|
|
|
static int bad_ioapic_register(int idx)
|
2012-03-13 02:36:33 +08:00
|
|
|
{
|
|
|
|
union IO_APIC_reg_00 reg_00;
|
|
|
|
union IO_APIC_reg_01 reg_01;
|
|
|
|
union IO_APIC_reg_02 reg_02;
|
|
|
|
|
|
|
|
reg_00.raw = io_apic_read(idx, 0);
|
|
|
|
reg_01.raw = io_apic_read(idx, 1);
|
|
|
|
reg_02.raw = io_apic_read(idx, 2);
|
|
|
|
|
|
|
|
if (reg_00.raw == -1 && reg_01.raw == -1 && reg_02.raw == -1) {
|
|
|
|
pr_warn("I/O APIC 0x%x registers return all ones, skipping!\n",
|
|
|
|
mpc_ioapic_addr(idx));
|
|
|
|
return 1;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:43 +08:00
|
|
|
static int find_free_ioapic_entry(void)
|
|
|
|
{
|
2014-10-27 13:21:45 +08:00
|
|
|
int idx;
|
|
|
|
|
|
|
|
for (idx = 0; idx < MAX_IO_APICS; idx++)
|
|
|
|
if (ioapics[idx].nr_registers == 0)
|
|
|
|
return idx;
|
|
|
|
|
|
|
|
return MAX_IO_APICS;
|
2014-10-27 13:21:43 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
/**
|
|
|
|
* mp_register_ioapic - Register an IOAPIC device
|
|
|
|
* @id: hardware IOAPIC ID
|
|
|
|
* @address: physical address of IOAPIC register area
|
|
|
|
* @gsi_base: base of GSI associated with the IOAPIC
|
|
|
|
* @cfg: configuration information for the IOAPIC
|
|
|
|
*/
|
|
|
|
int mp_register_ioapic(int id, u32 address, u32 gsi_base,
|
|
|
|
struct ioapic_domain_cfg *cfg)
|
2009-07-08 11:01:15 +08:00
|
|
|
{
|
2014-10-27 13:21:45 +08:00
|
|
|
bool hotplug = !!ioapic_initialized;
|
2011-05-19 07:31:38 +08:00
|
|
|
struct mp_ioapic_gsi *gsi_cfg;
|
2014-10-27 13:21:43 +08:00
|
|
|
int idx, ioapic, entries;
|
|
|
|
u32 gsi_end;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2014-10-27 13:21:43 +08:00
|
|
|
if (!address) {
|
|
|
|
pr_warn("Bogus (zero) I/O APIC address found, skipping!\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
for_each_ioapic(ioapic)
|
|
|
|
if (ioapics[ioapic].mp_config.apicaddr == address) {
|
|
|
|
pr_warn("address 0x%x conflicts with IOAPIC%d\n",
|
|
|
|
address, ioapic);
|
|
|
|
return -EEXIST;
|
|
|
|
}
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2014-10-27 13:21:43 +08:00
|
|
|
idx = find_free_ioapic_entry();
|
|
|
|
if (idx >= MAX_IO_APICS) {
|
|
|
|
pr_warn("Max # of I/O APICs (%d) exceeded (found %d), skipping\n",
|
|
|
|
MAX_IO_APICS, idx);
|
|
|
|
return -ENOSPC;
|
|
|
|
}
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2011-05-19 07:31:37 +08:00
|
|
|
ioapics[idx].mp_config.type = MP_IOAPIC;
|
|
|
|
ioapics[idx].mp_config.flags = MPC_APIC_USABLE;
|
|
|
|
ioapics[idx].mp_config.apicaddr = address;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
|
|
|
set_fixmap_nocache(FIX_IO_APIC_BASE_0 + idx, address);
|
2012-03-13 02:36:33 +08:00
|
|
|
if (bad_ioapic_register(idx)) {
|
|
|
|
clear_fixmap(FIX_IO_APIC_BASE_0 + idx);
|
2014-10-27 13:21:43 +08:00
|
|
|
return -ENODEV;
|
2012-03-13 02:36:33 +08:00
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:40 +08:00
|
|
|
ioapics[idx].mp_config.apicid = io_apic_unique_id(idx, id);
|
2011-05-19 07:31:37 +08:00
|
|
|
ioapics[idx].mp_config.apicver = io_apic_get_version(idx);
|
2009-07-08 11:01:15 +08:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Build basic GSI lookup table to facilitate gsi->io_apic lookups
|
|
|
|
* and to prevent reprogramming of IOAPIC pins (PCI GSIs).
|
|
|
|
*/
|
2010-03-30 16:07:12 +08:00
|
|
|
entries = io_apic_get_redir_entries(idx);
|
2014-10-27 13:21:43 +08:00
|
|
|
gsi_end = gsi_base + entries - 1;
|
|
|
|
for_each_ioapic(ioapic) {
|
|
|
|
gsi_cfg = mp_ioapic_gsi_routing(ioapic);
|
|
|
|
if ((gsi_base >= gsi_cfg->gsi_base &&
|
|
|
|
gsi_base <= gsi_cfg->gsi_end) ||
|
|
|
|
(gsi_end >= gsi_cfg->gsi_base &&
|
|
|
|
gsi_end <= gsi_cfg->gsi_end)) {
|
|
|
|
pr_warn("GSI range [%u-%u] for new IOAPIC conflicts with GSI[%u-%u]\n",
|
|
|
|
gsi_base, gsi_end,
|
|
|
|
gsi_cfg->gsi_base, gsi_cfg->gsi_end);
|
|
|
|
clear_fixmap(FIX_IO_APIC_BASE_0 + idx);
|
|
|
|
return -ENOSPC;
|
|
|
|
}
|
|
|
|
}
|
2011-05-19 07:31:38 +08:00
|
|
|
gsi_cfg = mp_ioapic_gsi_routing(idx);
|
|
|
|
gsi_cfg->gsi_base = gsi_base;
|
2014-10-27 13:21:43 +08:00
|
|
|
gsi_cfg->gsi_end = gsi_end;
|
2010-03-30 16:07:12 +08:00
|
|
|
|
2014-10-27 13:21:43 +08:00
|
|
|
ioapics[idx].irqdomain = NULL;
|
|
|
|
ioapics[idx].irqdomain_cfg = *cfg;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2014-10-27 13:21:45 +08:00
|
|
|
/*
|
|
|
|
* If mp_register_ioapic() is called during early boot stage when
|
|
|
|
* walking ACPI/SFI/DT tables, it's too early to create irqdomain,
|
|
|
|
* we are still using bootmem allocator. So delay it to setup_IO_APIC().
|
|
|
|
*/
|
|
|
|
if (hotplug) {
|
|
|
|
if (mp_irqdomain_create(idx)) {
|
|
|
|
clear_fixmap(FIX_IO_APIC_BASE_0 + idx);
|
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
alloc_ioapic_saved_registers(idx);
|
|
|
|
}
|
|
|
|
|
2011-05-19 07:31:38 +08:00
|
|
|
if (gsi_cfg->gsi_end >= gsi_top)
|
|
|
|
gsi_top = gsi_cfg->gsi_end + 1;
|
2014-10-27 13:21:43 +08:00
|
|
|
if (nr_ioapics <= idx)
|
|
|
|
nr_ioapics = idx + 1;
|
|
|
|
|
|
|
|
/* Set nr_registers to mark entry present */
|
|
|
|
ioapics[idx].nr_registers = entries;
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2012-03-13 02:36:33 +08:00
|
|
|
pr_info("IOAPIC[%d]: apic_id %d, version %d, address 0x%x, GSI %d-%d\n",
|
|
|
|
idx, mpc_ioapic_id(idx),
|
|
|
|
mpc_ioapic_ver(idx), mpc_ioapic_addr(idx),
|
|
|
|
gsi_cfg->gsi_base, gsi_cfg->gsi_end);
|
2009-07-08 11:01:15 +08:00
|
|
|
|
2014-10-27 13:21:43 +08:00
|
|
|
return 0;
|
2009-07-08 11:01:15 +08:00
|
|
|
}
|
2009-09-23 22:20:23 +08:00
|
|
|
|
2014-10-27 13:21:46 +08:00
|
|
|
int mp_unregister_ioapic(u32 gsi_base)
|
|
|
|
{
|
|
|
|
int ioapic, pin;
|
|
|
|
int found = 0;
|
|
|
|
|
|
|
|
for_each_ioapic(ioapic)
|
|
|
|
if (ioapics[ioapic].gsi_config.gsi_base == gsi_base) {
|
|
|
|
found = 1;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
if (!found) {
|
|
|
|
pr_warn("can't find IOAPIC for GSI %d\n", gsi_base);
|
|
|
|
return -ENODEV;
|
|
|
|
}
|
|
|
|
|
|
|
|
for_each_pin(ioapic, pin) {
|
2015-04-13 14:11:59 +08:00
|
|
|
u32 gsi = mp_pin_to_gsi(ioapic, pin);
|
|
|
|
int irq = mp_map_gsi_to_irq(gsi, 0, NULL);
|
|
|
|
struct mp_chip_data *data;
|
|
|
|
|
|
|
|
if (irq >= 0) {
|
|
|
|
data = irq_get_chip_data(irq);
|
|
|
|
if (data && data->count) {
|
|
|
|
pr_warn("pin%d on IOAPIC%d is still in use.\n",
|
|
|
|
pin, ioapic);
|
|
|
|
return -EBUSY;
|
|
|
|
}
|
2014-10-27 13:21:46 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Mark entry not present */
|
|
|
|
ioapics[ioapic].nr_registers = 0;
|
|
|
|
ioapic_destroy_irqdomain(ioapic);
|
|
|
|
free_ioapic_saved_registers(ioapic);
|
|
|
|
if (ioapics[ioapic].iomem_res)
|
|
|
|
release_resource(ioapics[ioapic].iomem_res);
|
|
|
|
clear_fixmap(FIX_IO_APIC_BASE_0 + ioapic);
|
|
|
|
memset(&ioapics[ioapic], 0, sizeof(ioapics[ioapic]));
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2014-10-27 13:21:47 +08:00
|
|
|
int mp_ioapic_registered(u32 gsi_base)
|
|
|
|
{
|
|
|
|
int ioapic;
|
|
|
|
|
|
|
|
for_each_ioapic(ioapic)
|
|
|
|
if (ioapics[ioapic].gsi_config.gsi_base == gsi_base)
|
|
|
|
return 1;
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2015-04-13 14:11:55 +08:00
|
|
|
static void mp_irqdomain_get_attr(u32 gsi, struct mp_chip_data *data,
|
2015-04-14 10:29:38 +08:00
|
|
|
struct irq_alloc_info *info)
|
2015-04-13 14:11:55 +08:00
|
|
|
{
|
|
|
|
if (info && info->ioapic_valid) {
|
|
|
|
data->trigger = info->ioapic_trigger;
|
|
|
|
data->polarity = info->ioapic_polarity;
|
|
|
|
} else if (acpi_get_override_irq(gsi, &data->trigger,
|
|
|
|
&data->polarity) < 0) {
|
2015-04-14 10:30:06 +08:00
|
|
|
/* PCI interrupts are always active low level triggered. */
|
|
|
|
data->trigger = IOAPIC_LEVEL;
|
|
|
|
data->polarity = IOAPIC_POL_LOW;
|
2015-04-13 14:11:55 +08:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mp_setup_entry(struct irq_cfg *cfg, struct mp_chip_data *data,
|
|
|
|
struct IO_APIC_route_entry *entry)
|
|
|
|
{
|
|
|
|
memset(entry, 0, sizeof(*entry));
|
|
|
|
entry->delivery_mode = apic->irq_delivery_mode;
|
|
|
|
entry->dest_mode = apic->irq_dest_mode;
|
|
|
|
entry->dest = cfg->dest_apicid;
|
|
|
|
entry->vector = cfg->vector;
|
|
|
|
entry->trigger = data->trigger;
|
|
|
|
entry->polarity = data->polarity;
|
|
|
|
/*
|
2015-04-14 10:30:06 +08:00
|
|
|
* Mask level triggered irqs. Edge triggered irqs are masked
|
|
|
|
* by the irq core code in case they fire.
|
2015-04-13 14:11:55 +08:00
|
|
|
*/
|
2015-04-14 10:30:06 +08:00
|
|
|
if (data->trigger == IOAPIC_LEVEL)
|
|
|
|
entry->mask = IOAPIC_MASKED;
|
|
|
|
else
|
|
|
|
entry->mask = IOAPIC_UNMASKED;
|
2015-04-13 14:11:55 +08:00
|
|
|
}
|
|
|
|
|
|
|
|
int mp_irqdomain_alloc(struct irq_domain *domain, unsigned int virq,
|
|
|
|
unsigned int nr_irqs, void *arg)
|
|
|
|
{
|
|
|
|
int ret, ioapic, pin;
|
|
|
|
struct irq_cfg *cfg;
|
|
|
|
struct irq_data *irq_data;
|
|
|
|
struct mp_chip_data *data;
|
|
|
|
struct irq_alloc_info *info = arg;
|
2015-10-16 01:42:23 +08:00
|
|
|
unsigned long flags;
|
2015-04-13 14:11:55 +08:00
|
|
|
|
|
|
|
if (!info || nr_irqs > 1)
|
|
|
|
return -EINVAL;
|
|
|
|
irq_data = irq_domain_get_irq_data(domain, virq);
|
|
|
|
if (!irq_data)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
ioapic = mp_irqdomain_ioapic_idx(domain);
|
|
|
|
pin = info->ioapic_pin;
|
|
|
|
if (irq_find_mapping(domain, (irq_hw_number_t)pin) > 0)
|
|
|
|
return -EEXIST;
|
|
|
|
|
|
|
|
data = kzalloc(sizeof(*data), GFP_KERNEL);
|
|
|
|
if (!data)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
info->ioapic_entry = &data->entry;
|
|
|
|
ret = irq_domain_alloc_irqs_parent(domain, virq, nr_irqs, info);
|
|
|
|
if (ret < 0) {
|
|
|
|
kfree(data);
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2015-04-14 10:29:53 +08:00
|
|
|
INIT_LIST_HEAD(&data->irq_2_pin);
|
2015-04-13 14:11:55 +08:00
|
|
|
irq_data->hwirq = info->ioapic_pin;
|
2015-04-13 14:11:59 +08:00
|
|
|
irq_data->chip = (domain->parent == x86_vector_domain) ?
|
|
|
|
&ioapic_chip : &ioapic_ir_chip;
|
2015-04-13 14:11:55 +08:00
|
|
|
irq_data->chip_data = data;
|
|
|
|
mp_irqdomain_get_attr(mp_pin_to_gsi(ioapic, pin), data, info);
|
|
|
|
|
|
|
|
cfg = irqd_cfg(irq_data);
|
2015-04-14 10:29:53 +08:00
|
|
|
add_pin_to_irq_node(data, ioapic_alloc_attr_node(info), ioapic, pin);
|
2015-10-16 01:42:23 +08:00
|
|
|
|
|
|
|
local_irq_save(flags);
|
2015-04-13 14:11:55 +08:00
|
|
|
if (info->ioapic_entry)
|
|
|
|
mp_setup_entry(cfg, data, info->ioapic_entry);
|
|
|
|
mp_register_handler(virq, data->trigger);
|
|
|
|
if (virq < nr_legacy_irqs())
|
|
|
|
legacy_pic->mask(virq);
|
2015-10-16 01:42:23 +08:00
|
|
|
local_irq_restore(flags);
|
2015-04-13 14:11:55 +08:00
|
|
|
|
|
|
|
apic_printk(APIC_VERBOSE, KERN_DEBUG
|
|
|
|
"IOAPIC[%d]: Set routing entry (%d-%d -> 0x%x -> IRQ %d Mode:%i Active:%i Dest:%d)\n",
|
|
|
|
ioapic, mpc_ioapic_id(ioapic), pin, cfg->vector,
|
|
|
|
virq, data->trigger, data->polarity, cfg->dest_apicid);
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
void mp_irqdomain_free(struct irq_domain *domain, unsigned int virq,
|
|
|
|
unsigned int nr_irqs)
|
|
|
|
{
|
|
|
|
struct irq_data *irq_data;
|
2015-04-14 10:29:53 +08:00
|
|
|
struct mp_chip_data *data;
|
2015-04-13 14:11:55 +08:00
|
|
|
|
|
|
|
BUG_ON(nr_irqs != 1);
|
|
|
|
irq_data = irq_domain_get_irq_data(domain, virq);
|
|
|
|
if (irq_data && irq_data->chip_data) {
|
2015-04-14 10:29:53 +08:00
|
|
|
data = irq_data->chip_data;
|
|
|
|
__remove_pin_from_irq(data, mp_irqdomain_ioapic_idx(domain),
|
2015-04-13 14:11:55 +08:00
|
|
|
(int)irq_data->hwirq);
|
2015-04-14 10:29:53 +08:00
|
|
|
WARN_ON(!list_empty(&data->irq_2_pin));
|
2015-04-13 14:11:55 +08:00
|
|
|
kfree(irq_data->chip_data);
|
|
|
|
}
|
|
|
|
irq_domain_free_irqs_top(domain, virq, nr_irqs);
|
|
|
|
}
|
|
|
|
|
|
|
|
void mp_irqdomain_activate(struct irq_domain *domain,
|
|
|
|
struct irq_data *irq_data)
|
|
|
|
{
|
|
|
|
unsigned long flags;
|
|
|
|
struct irq_pin_list *entry;
|
|
|
|
struct mp_chip_data *data = irq_data->chip_data;
|
|
|
|
|
|
|
|
raw_spin_lock_irqsave(&ioapic_lock, flags);
|
2015-04-14 10:29:53 +08:00
|
|
|
for_each_irq_pin(entry, data->irq_2_pin)
|
2015-04-13 14:11:55 +08:00
|
|
|
__ioapic_write_entry(entry->apic, entry->pin, data->entry);
|
|
|
|
raw_spin_unlock_irqrestore(&ioapic_lock, flags);
|
|
|
|
}
|
|
|
|
|
|
|
|
void mp_irqdomain_deactivate(struct irq_domain *domain,
|
|
|
|
struct irq_data *irq_data)
|
|
|
|
{
|
|
|
|
/* It won't be called for IRQ with multiple IOAPIC pins associated */
|
|
|
|
ioapic_mask_entry(mp_irqdomain_ioapic_idx(domain),
|
|
|
|
(int)irq_data->hwirq);
|
|
|
|
}
|
|
|
|
|
|
|
|
int mp_irqdomain_ioapic_idx(struct irq_domain *domain)
|
|
|
|
{
|
|
|
|
return (int)(long)domain->host_data;
|
|
|
|
}
|
2015-04-14 10:30:08 +08:00
|
|
|
|
|
|
|
const struct irq_domain_ops mp_ioapic_irqdomain_ops = {
|
|
|
|
.alloc = mp_irqdomain_alloc,
|
|
|
|
.free = mp_irqdomain_free,
|
|
|
|
.activate = mp_irqdomain_activate,
|
|
|
|
.deactivate = mp_irqdomain_deactivate,
|
|
|
|
};
|