From owner-freebsd-current@FreeBSD.ORG Thu Jun 30 06:42:07 2005 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 26DF416A41C for ; Thu, 30 Jun 2005 06:42:07 +0000 (GMT) (envelope-from dan.cojocar@gmail.com) Received: from wproxy.gmail.com (wproxy.gmail.com [64.233.184.203]) by mx1.FreeBSD.org (Postfix) with ESMTP id AD3FC43D1F for ; Thu, 30 Jun 2005 06:42:06 +0000 (GMT) (envelope-from dan.cojocar@gmail.com) Received: by wproxy.gmail.com with SMTP id i25so45739wra for ; Wed, 29 Jun 2005 23:42:06 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=bwc6MGdIGdwOZwN+OhpoaYA7tqYipLyg7guw0h/Rcv/AuAmABo2bWI+H0fQ5gPL3wgaCK3fl7hK93ZYTISQeNW4a18CoD6+R3bxyCTynLOrofhzY7i6c2hLTquZJUkLTIMj/sZdmbKJh6bjbGgxtTdTXLQ5RbMT0+GmJA5fIOQA= Received: by 10.54.40.58 with SMTP id n58mr225523wrn; Wed, 29 Jun 2005 23:42:05 -0700 (PDT) Received: by 10.54.26.12 with HTTP; Wed, 29 Jun 2005 23:42:05 -0700 (PDT) Message-ID: Date: Thu, 30 Jun 2005 09:42:05 +0300 From: Dan Cojocar To: freebsd-current@freebsd.org In-Reply-To: <200506291437.50698.jhb@FreeBSD.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline References: <200504051423.29530.jhb@FreeBSD.org> <200506291437.50698.jhb@FreeBSD.org> Cc: Subject: Re: Interrupt storm X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Dan Cojocar List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 30 Jun 2005 06:42:07 -0000 On 6/29/05, John Baldwin wrote: > On Tuesday 05 April 2005 03:46 pm, Dan Cojocar wrote: > > On Apr 5, 2005 9:23 PM, John Baldwin wrote: > > > On Tuesday 05 April 2005 12:44 pm, Dan Cojocar wrote: > > > > On Apr 5, 2005 5:30 PM, John Baldwin wrote: > > > > > On Tuesday 29 March 2005 01:35 pm, Jung-uk Kim wrote: > > > > > > On Tuesday 29 March 2005 01:22 pm, Dan Cojocar wrote: > > > > > > > Hello all, > > > > > > > > > > > > > > I noticed the following interrupt storm in my dmesg: > > > > > > > > > > > > > > Interrupt storm detected on "irq11: rl1"; throttling interrup= t > > > > > > > source Interrupt storm detected on "irq5: rl0"; throttling > > > > > > > interrupt source > > > > > > > > > > > > > > Here is my vmstat -i: > > > > > > > interrupt total rate > > > > > > > irq0: clk 128600 199 > > > > > > > irq1: atkbd0 2499 3 > > > > > > > irq4: sio0 2 0 > > > > > > > irq5: rl0 128901 200 > > > > > > > irq8: rtc 82294 127 > > > > > > > irq11: rl1 128902 200 > > > > > > > irq12: psm0 16152 25 > > > > > > > irq14: ata0 12600 19 > > > > > > > irq15: ata1 71 0 > > > > > > > Total 500021 776 > > > > > > > > > > > > > > and here is my dmesg: > > > > > > > http://cs.ubbcluj.ro/~dan/dmesg.txt > > > > > > > > > > > > > > I have options DEVICE_POLLING and options HZ=3D200, i have te= sted > > > > > > > with greater HZ values like 1000, and without DEVICE_POLLING = but > > > > > > > i have the same storm. > > > > > > > > > > > > Try this: > > > > > > > > > > > > http://docs.freebsd.org/cgi/mid.cgi?200503161154.04555.jkim > > > > > > > > > > He's not using an APIC so that won't make any difference. > > > > > > > > If I use APIC, I will get "watchdog timeout" as i posted in an = old > > > > > > email: > > > > >rl0: watchdog timeout > > > > >rl1: watchdog timeout > > > > >rl0: watchdog timeout > > > > >rl1: watchdog timeout > > > > >rl1: watchdog timeout > > > > >rl1: watchdog timeout > > > > >rl1: watchdog timeout > > > > > > > > Thanks, > > > > Dan > > > > > > That's indicative of other interrupt routing problems. This is with = APIC > > > + ACPI, yes? If you disable ACPI, does APIC work or is it not found? > > > Also, if you disable ACPI, does non-APIC work ok? > > > > Yes i get "watchdog timeout" if i use ACPI+APIC. > > Here is my dmesg for current+acpi+apic. > > http://cs.ubbcluj.ro/~dan/current+acpi+apic.txt > > If i disable acpi, the system is not able to boot to the login > > prompt. I did a break > > into debugger after waiting for 15 minutes and i noticed that the > > interrupts for rl0 are increasing very quickly, so i think that is why > > the system is so slow. > > Here is a dmesg for current without ACPI and with APIC, but only in > > single mode > > without if_rl module and many other modules. > > http://cs.ubbcluj.ro/~dan/current+noacpi+apic+singlemode.txt > > And if I disable ACPI i don't see the interrupts storms but i don't > > have ACPI :( > > http://cs.ubbcluj.ro/~dan/current+noacpi+noapic.txt > > And here is current with ACPI and without APIC: > > http://cs.ubbcluj.ro/~dan/current+acpi+noapic.txt > > Thanks, > > Dan >=20 > I haven't forgotten about this, I've just been busy. So, I'm still tryin= g to > understand. Correct me if I'm wrong: APIC always gets interrupt storms; = it > doesn't matter if ACPI is enabled or disabled. Without APIC, ACPI gives = you > interrupt storms, but no ACPI works fine. Is that correct? Do you get > interrupt storms if ACPI is enabled but APIC is not on both current and > RELENG_5? Hello John, With apic and acpi i was getting watchdog timeout on rl0 and rl1. If i disabled apic, and use only acpi i was getting interrupt storms, the same i get if i use only apic, and the system was ok without acpi and apic, but in this version i didn't have acpi features :(. The links with my dmesg from previous message are valid. I was getting this behaviour only if i used a second network card, tested only with realteck. Unfortunately now i don't have the second nic :( in my system, but if you need more info i will get a new one. Now with recent -current and only one realteck nic and with acpi enabled and apic disabled the system is not having interrupt storms and no watchdog timeouts, but i noticed that my downloads rate is dropping to half of my bandwidth, no mather if i use device_polling or not. I was thinking that this is related to my realteck nic but now just tested my modem on usb with cdce driver and i get the same behaviour. Last time I tested this only on current, I don't know if in recent RELENG_5 is the same. Thank you John for your time, Dan