From owner-freebsd-scsi@FreeBSD.ORG Sat Oct 27 04:20:58 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 0C84C3A5 for ; Sat, 27 Oct 2012 04:20:58 +0000 (UTC) (envelope-from stephane.lapie@darkbsd.org) Received: from quasar.darkbsd.org (shinigami.darkbsd.org [82.227.96.182]) by mx1.freebsd.org (Postfix) with ESMTP id 62E808FC08 for ; Sat, 27 Oct 2012 04:20:55 +0000 (UTC) Received: from quasar.darkbsd.org (localhost [127.0.0.1]) by quasar.darkbsd.org (Postfix) with ESMTP id 058FAA451 for ; Sat, 27 Oct 2012 06:10:49 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=darkbsd.org; h=message-id :date:from:mime-version:to:subject:content-type; s=selector1; bh=wG2ILwl4++1etX3wmnsnq1SgGSA=; b=xUOTEVdd4RaF2Ssg3lgvMbyt1LuO sZpGttTnG3xQy3dj0H8L14w49wZmCy9aARjb6Xx9UPq+aMsriP7jhgFVEZsgwjSw 6gcyFCkDRQ0/JLlXzt8NBPxfSVnnJwHN7uP59KSMZIwBzElJBtheszEJfbwnLBmg JKv3IOqjtttVFHA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=darkbsd.org; h=message-id :date:from:mime-version:to:subject:content-type; q=dns; s= selector1; b=lvGuLsXE8QLDrbhrazkfsRt8xaMxxmHm9T5BQW/piQciL1YKYr/ E9wvdBgp4nmWkhk6a/n9YoX6DNrG0K2V1caLY9d78NpOeFp/IhwCbMZFN3U1TrKf rMKXtyvl9lOYPJCZSEoWd+RfcCFpSL1Iv2roHDMK1/yFyUDCfNMS2mas= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=darkbsd.org; h= content-type:content-type:subject:subject:mime-version :user-agent:from:from:date:date:message-id:received:received; s= selector1; t=1351311046; bh=tS0bdzioH9RYlCCuJMu7aGkXAiWpkMPAwi9R WBUFezE=; b=kg+2VKMCuyjA0MbLEObEVnOdjfDfbZh6Kpx7jyyWf8pghYk+GGmY 7gZbJEXVC1ibNf7nFQkWe/Vyt5VOY+yfUWMr1WGrwLJYGfJpUtikQqNa7rxZ5+YG NNPd1qPxXa/Xpp8l6sgq0dwzcJZJvavjnZIvvHzwe/Gu4wCVMwxLszU= Received: from quasar.darkbsd.org ([127.0.0.1]) by quasar.darkbsd.org (quasar.darkbsd.org [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id b07j6uIzJuGo for ; Sat, 27 Oct 2012 06:10:46 +0200 (CEST) Received: from [192.168.3.42] (archer.yomi.darkbsd.org [192.168.3.42]) (Authenticated sender: darksoul) by quasar.darkbsd.org (Postfix) with ESMTPSA id 02AA5A448 for ; Sat, 27 Oct 2012 06:10:45 +0200 (CEST) Message-ID: <508B5EBC.8070509@darkbsd.org> Date: Sat, 27 Oct 2012 13:10:36 +0900 From: Stephane LAPIE User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:16.0) Gecko/20121011 Thunderbird/16.0.1 MIME-Version: 1.0 To: freebsd-scsi@freebsd.org Subject: LSI mpt(4) driver problem : can't SMART poll, controller freezes X-Enigmail-Version: 1.4.5 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enig6D23D7F6B37ED6706408533D" X-Content-Filtered-By: Mailman/MimeDel 2.1.14 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 27 Oct 2012 04:20:58 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enig6D23D7F6B37ED6706408533D Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hello list, I have two controller cards of the following make (PCI-X controllers) : Oct 24 09:26:00 eirei-no-za kernel: mpt0: port 0x2000-0x20ff mem 0xdfa20000-0xdfa23fff,0xdfa00000-0xdfa0ffff irq 24 at device 1.0 on pci6 Oct 24 09:26:00 eirei-no-za kernel: mpt0: MPI Version=3D1.5.12.0 Oct 24 09:26:00 eirei-no-za kernel: mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 ) Oct 24 09:26:00 eirei-no-za kernel: mpt0: 0 Active Volumes (2 Max) Oct 24 09:26:00 eirei-no-za kernel: mpt0: 0 Hidden Drive Members (10 Max)= Oct 24 09:26:00 eirei-no-za kernel: mpt1: port 0x2400-0x24ff mem 0xdfa24000-0xdfa27fff,0xdfa10000-0xdfa1ffff irq 28 at device 7.0 on pci6 Oct 24 09:26:00 eirei-no-za kernel: mpt1: MPI Version=3D1.5.12.0 Oct 24 09:26:00 eirei-no-za kernel: mpt1: Capabilities: ( RAID-0 RAID-1E RAID-1 ) Oct 24 09:26:00 eirei-no-za kernel: mpt1: 0 Active Volumes (2 Max) Oct 24 09:26:00 eirei-no-za kernel: mpt1: 0 Hidden Drive Members (10 Max)= Each of them having 8 ports used in the following fashion : at scbus0 target 0 lun 0 (pass0,da0) at scbus0 target 1 lun 0 (pass1,da1) at scbus0 target 3 lun 0 (pass2,da2) at scbus0 target 4 lun 0 (pass3,da3) at scbus0 target 5 lun 0 (pass4,da4) at scbus0 target 6 lun 0 (pass5,da5) at scbus0 target 7 lun 0 (pass6,da6) at scbus2 target 0 lun 0 (pass7,da7) at scbus2 target 1 lun 0 (pass8,da8) at scbus2 target 2 lun 0 (pass9,da9) at scbus2 target 3 lun 0 (pass10,da10)= at scbus2 target 4 lun 0 (pass11,da11)= at scbus2 target 5 lun 0 (pass12,da12)= at scbus2 target 6 lun 0 (pass13,da13)= at scbus2 target 7 lun 0 (da14,pass14)= It should also be noted that I have to override the default SCSI timeout delay, in order to ensure proper detection of all devices at boot by putting the following in /boot/loader.conf : kern.cam.scsi_delay=3D15000 I wanted to know if anyone had experienced the following problems, and found a way around them : 1) I can't run any detailed and meaningful SMART polls on disks belonging to these controllers. (execution logs as separate files) As can be seen I am running the latest available version of smartctl from the ports : http://www.yomi.darkbsd.org/~darksoul/eirei-no-za-broken-disk-smart-log.t= xt (Using the pass devices gives the same result) Only the "-d scsi" polling returns somewhat meaningful info whatsoever (disk serial number etc), but even that is error-inducing, as the disk was actually nearing death. Here is the full SMART log recovered from running the disk from a USB->SATA device : http://www.yomi.darkbsd.org/~darksoul/eirei-no-za-broken-disk-smart-log2.= txt I actually have scripts to monitor that, but it obviously relies on smartctl being able to do its job, which it's not... (Also, this worked perfectly fine under 8-STABLE with "-d sat"...) 2) Also, less annoying but still a show-stopper sort of for any serious work requiring high availability : Any disk I/O freeze ends up locking the whole controller (and the whole ZFS pool...) until either the server crashes or the disk bails out, whichever comes first, really. (kernel log as separate file) http://www.yomi.darkbsd.org/~darksoul/eirei-no-za-mpt-timeout.txt Thanks for your time. --=20 Stephane LAPIE, EPITA SRS, Promo 2005 "Even when they have digital readouts, I can't understand them." --MegaTokyo --------------enig6D23D7F6B37ED6706408533D Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://www.enigmail.net/ iEYEARECAAYFAlCLXsIACgkQ24Ql8u6TF2PCGQCg4ohBfi7CAtQY1++GZt4PtvdV ZngAn0wEWovubo+PRhLKdcMi45fJsB0S =tWfh -----END PGP SIGNATURE----- --------------enig6D23D7F6B37ED6706408533D--