From: Frank Leonhardt
Date: Wed, 31 Jul 2013 14:28:33 +0100
To: freebsd-questions@freebsd.org
Subject: Re: System hangs for several minutes (disk IO related)
Message-ID: <51F91101.4080301@fjl.co.uk>
In-Reply-To: <20130730171938.GA3602@aurora.oekb.co.at>

I don't know what kind of answer you're expecting unless it's for moral
support or the obvious.

I was thinking of buying one of these as they're very cheap at the
moment, but decided against it because of the compatibility problems
reported. IIRC something in it was supported up to FreeBSD 7.2 - the
NIC, I think. If you get it working I'd be interested myself! I think
they were commonly used for VMware but won't run version 4.0 onwards,
and are therefore as desirable to that fraternity as a dead camel in
reception.

However, I did once get the same symptoms you're reporting, and it
turned out to be an HD that was on the way out, even though it pretended
it was fine on every test. I think it was just very slow to respond on a
write. If the RAID is struggling to do a write I assume you'd see the
same thing.

If I were in your place I'd try to attach a SATA drive directly - does
it have a SATA optical drive connection you could pinch?

Regards, Frank.

On 30/07/2013 18:19, Ewald Jenisch wrote:
> Hi,
>
> I'm seeing rather strange behavior on an HP DL585 G5 wrt. disk IO:
>
> When there's any disk io the machine completely freezes, i.e. no
> console input possible, no screen output - complete hang. After some
> minutes the box comes back to normal again - but sure enough with the
> next disk io it freezes again.
>
> To give you a typical example: While a "portsnap fetch extract" was
> running I did a "sync". Normally this should complete in a matter of
> milliseconds to seconds in the worst case - but dig this:
>
> # date;time sync;date
> Tue Jul 30 09:57:38 CEST 2013
> 0.000u 0.311s 9:54.69 0.0% 4+161k 0+1287io 0pf+0w
> Tue Jul 30 10:07:38 CEST 2013
> #
>
> No, this is not a typo - it really took nearly ten minutes (!) for the
> sync to complete. In the meantime - every windows, all activity
> (console, screen-output etc.) is completely blocked. ('portsnap fetch
> extract' was only given as an example here - the lockup occurs
> whenever there is disk io like for example tar, etc).
>
> We're speaking about a machine with decent hardware here, here's an
> excerpt from "dmesg":
>
> ------------------------------ < Cut here > ------------------------------
>
> FreeBSD 9.2-BETA2 #0 r253750: Mon Jul 29 11:07:04 CEST 2013
> root@sniff-rz2:/usr/obj/usr/src/sys/GENERIC amd64
> gcc version 4.2.1 20070831 patched [FreeBSD]
> CPU: Quad-Core AMD Opteron(tm) Processor 8358 SE (2411.16-MHz K8-class CPU)
> Origin = "AuthenticAMD" Id = 0x100f23 Family = 0x10 Model = 0x2 Stepping = 3
> Features=0x178bfbff
> Features2=0x802009
> AMD Features=0xee400800
> AMD Features2=0x7ff
> TSC: P-state invariant
> real memory = 137438953472 (131072 MB)
> avail memory = 132973432832 (126813 MB)
> Event timer "LAPIC" quality 400
> ACPI APIC Table:
> FreeBSD/SMP: Multiprocessor System Detected: 16 CPUs
> ...
> ciss0: port 0x3000-0x30ff mem 0xd9e00000-0xd9efffff,0xd9df0000-0xd9df0fff irq 16 at device 0.0 on pci8
> ciss0: PERFORMANT Transport
> ...
> da0 at ciss0 bus 0 scbus2 target 0 lun 0
> da0: Fixed Direct Access SCSI-5 device
> da0: 135.168MB/s transfers
> da0: Command Queueing enabled
> da0: 139979MB (286677120 512 byte sectors: 255H 32S/T 35132C)
> da0: quirks=0x1
>
> ------------------------------ < Cut here > ------------------------------
>
> Kernel: Latest kernel as of yesterday (9.2Beta)
>
> BIOS: is at the latest level (Support pack as of Spring 2013)
> installed which updated BIOS, iLO etc. Aside from that I reset BIOS to
> default values just to be sure.
>
> SmartArray P400 - Firmware 7.24 (latest)
>
> Harddisks: Two 146GB HDs running in Raid1-mode. Already tried
> hot-swapping the disks - didn't change anything.
>
> Needless to say - no error message etc. in neither dmesg nor
> /var/log/messages :-(
>
> To me it looks like this is some sort of timing problem - but where
> should I start looking?
>
> Thanks much in advance for any help,
> -ewald
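
One way to check the slow-write theory is to time a small write followed
by a sync, first on the RAID volume and then on a directly attached
drive, and compare the two. What follows is only a rough sketch under
stated assumptions, not a tested diagnostic: the function name
time_write_sync, the 4 MB test size and the temporary file name are all
made up for illustration. It uses nothing beyond dd, sync and date, so
the resolution is whole seconds.

```shell
#!/bin/sh
# Hypothetical helper (not from the original thread): write a small file
# into the given directory, force it to disk, and print the elapsed time
# in whole seconds.
time_write_sync() {
    dir=${1:-/tmp}
    f="$dir/synctest.$$"            # illustrative temp file name
    start=$(date +%s)
    # Write 4 MB of zeros (4096 blocks of 1024 bytes).
    if ! dd if=/dev/zero of="$f" bs=1024 count=4096 2>/dev/null; then
        rm -f "$f"
        return 1
    fi
    sync                            # flush dirty buffers to disk
    end=$(date +%s)
    rm -f "$f"
    echo $((end - start))
}

# Test the directory given as the first argument, or /tmp by default.
target=${1:-/tmp}
echo "write+sync on $target took $(time_write_sync "$target") s"
```

Run it once against a directory on the array and once against a
directory on the directly attached drive. A healthy disk should report 0
or 1; anything in the region of minutes on one device but not the other
points at that device or its controller.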