From: "Reed A. Cartwright"
To: freebsd-stable@freebsd.org
Date: Mon, 26 Nov 2012 14:01:57 -0700
Subject: Write Failed message with 9.1-RC3

I'm new to this list...
I'm running a bioinformatics server using 9.1-RC3 (64 cores, 512 GB RAM). I have a ZFS raid-z2 array attached to an LSI controller, with an SSD cache drive.

Since upgrading to 9.1-RC2/3 (for AVX support), I have been experiencing hard drive lockups with the message "write failed" printed to the console. After this, reads from the hard drives no longer work, but the machine itself is not locked up. If the appropriate files are in the cache, I can log in and execute programs.

I know the LSI driver has been updated in 9.1, and I have updated my cards' BIOS to match. That doesn't seem to have made a difference.

Once I was able to run top, and saw that many processes were stuck in the 'tx->tx' state.

So far, no corruption appears to have occurred on the drives. I'm about to downgrade to 9.0, but I wanted to know if anyone has any idea what the issue is.

-- 
Reed A. Cartwright, PhD
Assistant Professor of Genomics, Evolution, and Bioinformatics
School of Life Sciences
Center for Evolutionary Medicine and Informatics
The Biodesign Institute
Arizona State University