From owner-freebsd-current@FreeBSD.ORG Sun Mar 21 22:30:15 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 07B3416A4CF for ; Sun, 21 Mar 2004 22:30:15 -0800 (PST) Received: from sax.sax.de (sax.sax.de [193.175.26.33]) by mx1.FreeBSD.org (Postfix) with ESMTP id 1455B43D2D for ; Sun, 21 Mar 2004 22:30:13 -0800 (PST) (envelope-from mw@theatre.sax.de) Received: (from uucp@localhost) by sax.sax.de (8.9.3/8.9.3) with UUCP id HAA17911 for freebsd-current@freebsd.org; Mon, 22 Mar 2004 07:30:11 +0100 (CET) Received: from theatre.sax.de (mw@localhost [127.0.0.1]) by theatre.sax.de (8.12.10/8.12.10) with ESMTP id i2M6BvLl015852 for ; Mon, 22 Mar 2004 07:11:57 +0100 (CET) (envelope-from mw@theatre.sax.de) Received: (from mw@localhost) by theatre.sax.de (8.12.10/8.12.9/Submit) id i2M6Bv6p015851 for freebsd-current@freebsd.org; Mon, 22 Mar 2004 07:11:57 +0100 (CET) (envelope-from mw) Date: Mon, 22 Mar 2004 07:11:57 +0100 From: Martin Welk To: freebsd-current@freebsd.org Message-ID: <20040322061157.GA15607@theatre.sax.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-PGP-Fingerprint: D3 C0 49 7D 8C B6 FC 97 F7 F4 1B DF 09 0E 15 2A User-Agent: Mutt/1.5.6i Subject: Still troubles with indefinite wait buffer errors X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: Martin Welk List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Mar 2004 06:30:15 -0000 Good morning, I'm still having some troubles with "indefinite wait buffer errors" as they still come up from time to time since upgrading to 5.2.1-RELEASE(-p1) and I'm still trying to find the problem. I'm very sure that I can exclude hardware errors, but please read my "full story"... I've sent that to -questions a while ago, without any reply. >From time to time, I get the following error messages under heavy load, for example, when copying a larger amount of data or while having some load on the machine while compiling something from the ports: Mar 4 03:29:00 theatre kernel: swap_pager: indefinite wait buffer: device: vinum/scratch, blkno: 190, size: 4096 Mar 4 03:29:00 theatre kernel: swap_pager: indefinite wait buffer: device: vinum/var, blkno: 536, size: 4096 This happens sometimes during the daily jobs running at night, the block numbers differ every time, so I think that's no bad block thing. The machine is based on an Asus P2B-S board with a P-II CPU (350 MHz), 256 MBytes of memory and three ATA hard disks. I have replaced the ATA controller meanwhile, so I'm no longer using the on-board chipset but a cheap Silicon Image SiI0680 based one which is recognized correctly, as it looks like. This machine is running like this for about 1 year now, was running rock-solid before (I think I've started with 5.1-REL on it). I have organized all my disks with Vinum and I'm quite happy with it: there are two 120 GByte disks, bootable, all volumes mirrored through Vinum including root but except swap space. I know that it's probably not the best idea to run swapping through a logical volume manager at all, but as this is a small home serve which shouldn't swap to much at all, I don't care for it but it helps keeping disk organizing more simple - simply having all as Vinum devices :) I would appreciate to be able to keep swapping to Vinum devices in the future again (...latest GEOM changes...), but that's a another issue - as swap space is usually almost unused (a few hundred KBytes), I'm currently swapping to a md device (*cough*) hoping that this will be fixed soon. The machine is running as a small home server and DSL gateway, so it's running ppp, natd, ipfw-based firewall, Squid, sendmail, SpamAssassin, natd, Samba, NFS server, DHCP server, ntpd and a few other small things. I've done an upgrade to 5.2.1-RELEASE a few days ago, with an update to -p1 a day later, and the other change during that time was to enable fxp1, which hasn't been used before and that's now using the same IRQ (9) as the on-board Adaptec SCSI adapter, but when the machine shows the described symptoms, fxp1 wasn't used heavily and the SCSI adapter is completely unused - well, there's a CD-ROM and a CD-R connected to it but never used. I have to screen shots of kernel backtraces from the point of time when the error happens - "real" screen shots, please see them at http://www.sax.de/~mw/KIF_1374.JPG and http://www.sax.de/~mw/KIF_1377.JPG Again, at all no ata disk errors, and if I force the machine to read out the full disks (dd if=/dev/ad[n]s1c of=/dev/null) I get no ata errors at all and it runs through smoothly with throughput between 25 and 41 MBytes/sec. Thanks in advance for any help. Regards, Martin (...) FreeBSD 5.2.1-RELEASE-p1 #3: Thu Mar 4 12:26:22 CET 2004 CPU: Pentium II/Pentium II Xeon/Celeron (350.80-MHz 686-class CPU) (...) real memory = 268423168 (255 MB) avail memory = 251105280 (239 MB) (...) Pentium Pro MTRR support enabled pcibios: BIOS version 2.10 Using $PIR table, 8 entries at 0xc00f0d10 pcib0: at pcibus 0 on motherboard pci0: on pcib0 pci_cfgintr: 0:6 INTA BIOS irq 9 pci_cfgintr: 0:7 INTA BIOS irq 5 pci_cfgintr: 0:9 INTA BIOS irq 9 pci_cfgintr: 0:10 INTA BIOS irq 5 pci_cfgintr: 0:11 INTA BIOS irq 12 agp0: mem 0xe4000000-0xe7ffffff at device 0.0 on pci0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci_cfgintr: 0:1 INTA routed to irq 4 pcib1: slot 0 INTA is routed to irq 4 pci1: at device 0.0 (no driver attached) isab0: at device 4.0 on pci0 isa0: on isab0 atapci0: port 0xd800-0xd80f at device 4.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata0: [MPSAFE] ata1: at 0x170 irq 15 on atapci0 ata1: [MPSAFE] uhci0: port 0xd400-0xd41f at device 4.2 on pci0 pci_cfgintr: 0:4 INTD routed to irq 9 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered pci0: at device 4.3 (no driver attached) ahc0: port 0xd000-0xd0ff mem 0xdf800000-0xdf800fff irq 9 at device 6.0 on pci0 aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs fxp0: port 0xb800-0xb81f mem 0xdf000000-0xdf0fffff,0xe2000000-0xe2000fff irq 5 at device 7.0 on pci0 miibus0: on fxp0 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp1: port 0xb400-0xb43f mem 0xde000000-0xde0fffff,0xde800000-0xde800fff irq 9 at device 9.0 on pci0 miibus1: on fxp1 inphy1: on miibus1 inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pcm0: port 0xb000-0xb03f irq 5 at device 10.0 on pci0 de0: port 0xa800-0xa87f mem 0xdd800000-0xdd80007f irq 12 at device 11.0 on pci0 de0: 21041 [10Mb/s] pass 2.1 orm0: