Date: Thu, 6 Mar 2014 11:12:06 GMT
From: Borja Marcos <borjam@sarenet.es>
To: freebsd-gnats-submit@FreeBSD.org
Subject: kern/187312: LSI Logic Invader card fails at boot
Message-ID: <201403061112.s26BC61N020678@cgiserv.freebsd.org>
Resent-Message-ID: <201403061120.s26BK1tT043394@freefall.freebsd.org>

>Number:         187312
>Category:       kern
>Synopsis:       LSI Logic Invader card fails at boot
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    freebsd-bugs
>State:          open
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Thu Mar 06 11:20:01 UTC 2014
>Closed-Date:
>Last-Modified:
>Originator:     Borja Marcos
>Release:        FreeBSD 10-STABLE
>Organization:
>Environment:
FreeBSD elibm 10.0-STABLE FreeBSD 10.0-STABLE #1: Tue Mar 4 09:36:48 UTC 2014 root@:/usr/obj/usr/src/sys/GENERIC amd64
>Description:
There is some trouble with this LSI Logic RAID card:

mfi0 Adapter:
    Product Name: ServeRAID M5210e
   Serial Number: 3CJ0SG
        Firmware: 24.0.2-0013
     RAID Levels: JBOD, RAID0, RAID1, RAID10
  Battery Backup: not present
           NVRAM: 32K
  Onboard Memory: 0M
  Minimum Stripe: 64K
  Maximum Stripe: 64K

It's an Invader series card.

When booting from install media (for example, a memstick), FreeBSD detects the RAID volume (/dev/mfid0) without issues and the installation completes. There's just a warning message:

However, once the system is installed, FreeBSD is unable to boot. The loader works and the kernel starts, but it fails when trying to mount the root file system from /dev/mfid0p2. It fails with both 10-RELEASE and 10-STABLE.

The errors are:

mfi0: Adapter RESET condition is detected
mfi0: First step of FW reset initialized…
mfi0: ADP_RESET_TBOLT: Hostdiag=0xa0
mfi0: First stage of reset complete, second stage initialized…
mfi0: APJ ****requeue command 0xfffffe000104dfc0 index=120
mfi0: second stage of reset complete, FW is ready now.
mfi0: 11032 (boot + 3s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840
…

I haven't copied all the errors; the last time I tried, it ended up in an endless loop.

Interestingly, IT WORKS with 11-CURRENT. I can successfully install 11-CURRENT and boot normally.
>How-To-Repeat:
Create a RAID 1 volume (it's the only option I have tried) and install 10-RELEASE or 10-STABLE on /dev/mfid0, standard installation, no frills.
>Fix:
It works with 11-CURRENT. Comparing the mfi driver between CURRENT and STABLE, I noticed that three files have changes:

sys/dev/mfi/mfi_pci.c
sys/dev/mfi/mfi_tbolt.c
sys/dev/mfi/mfivar.h

http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfi_pci.c?revision=261535&view=markup
http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfi_tbolt.c?revision=261535&view=markup
http://svnweb.freebsd.org/base/head/sys/dev/mfi/mfivar.h?revision=261535&view=markup

I copied the three files into a -STABLE source tree checked out yesterday, built a kernel and tried it.

The -STABLE kernel with mfi compiled from these new files works. It can boot from /dev/mfid0 and mount /dev/mfid0p2 as the root file system.

mfi0: <Invader> port 0x4f00-0x4fff mem 0x913f0000-0x913fffff,0x91400000-0x914fffff irq 34 at device 0.0 on pci22
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23
mfi0: FW MaxCmds = 240, limiting to 128
mfi0: MaxCmd = 240, Drv MaxCmd = 128, MaxSgl = 70, state = 0xb73c00f0
mfi0: 11019 (447416787s/0x0020/info) - Shutdown command received from host
mfi0: 11020 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11021 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11022 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11023 (boot + 12s/0x0020/info) - Board Revision 00AL055
mfi0: 11024 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11025 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11026 (boot + 33s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11027 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11028 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11029 (boot + 33s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11030 (447416847s/0x0020/info) - Time established as 03/06/14 10:27:27; (33 seconds since power on)
mfi0: 11031 (447416924s/0x0020/info) - Time established as 03/06/14 10:28:44; (111 seconds since power on)
mfi0: 11032 (boot + 3s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered a fatal error and was reset
mfi0: 11035 (447417032s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11036 (447417032s/0x0020/info) - Board Revision 00AL055
mfi0: 11037 (447417052s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11038 (447417052s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11039 (447417052s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11040 (447417052s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11041 (447417052s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11042 (447417052s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11043 (447417052s/0x0020/info) - Controller Hot Plug detected
mfi0: 11044 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11045 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11046 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11047 (boot + 12s/0x0020/info) - Board Revision 00AL055
mfi0: 11048 (boot + 32s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11049 (boot + 32s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11050 (boot + 32s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11051 (boot + 32s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11052 (boot + 32s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11053 (boot + 32s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11054 (447417194s/0x0020/info) - Time established as 03/06/14 10:33:14; (32 seconds since power on)
mfi0: 11055 (447417277s/0x0020/info) - Time established as 03/06/14 10:34:37; (115 seconds since power on)
mfid0 on mfi0
mfid0: 121116MB (248045568 sectors) RAID volume (no label) is optimal
mfi0: Failed to get command
mfi0: <Invader> port 0x4f00-0x4fff mem 0x913f0000-0x913fffff,0x91400000-0x914fffff irq 34 at device 0.0 on pci22
mfi0: Using MSI
mfi0: Megaraid SAS driver Ver 4.23
mfi0: FW MaxCmds = 240, limiting to 128
mfi0: MaxCmd = 240, Drv MaxCmd = 128, MaxSgl = 70, state = 0xb73c00f0
mfi0: 11056 (447417497s/0x0020/info) - Shutdown command received from host
mfi0: 11057 (boot + 10s/0x0020/info) - Firmware initialization started (PCI ID 005d/1000/045b/1014)
mfi0: 11058 (boot + 10s/0x0020/info) - Firmware version 4.200.21-2840
mfi0: 11059 (boot + 12s/0x0020/info) - Package version 24.0.2-0013
mfi0: 11060 (boot + 12s/0x0020/info) - Board Revision 00AL055
mfi0: 11061 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25)
mfi0: 11062 (boot + 33s/0x0002/info) - Inserted: PD 02(e0x3e/s25) Info: enclPd=3e, scsiType=0, portMap=00, sasAddr=4433221102000000,0000000000000000
mfi0: 11063 (boot + 33s/0x0002/info) - PD 02(e0x3e/s25) FRU is 90Y8649
mfi0: 11064 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24)
mfi0: 11065 (boot + 33s/0x0002/info) - Inserted: PD 03(e0x3e/s24) Info: enclPd=3e, scsiType=0, portMap=01, sasAddr=4433221103000000,0000000000000000
mfi0: 11066 (boot + 33s/0x0002/info) - PD 03(e0x3e/s24) FRU is 90Y8649
mfi0: 11067 (447417556s/0x0020/info) - Time established as 03/06/14 10:39:16; (33 seconds since power on)
mfi0: 11068 (447417632s/0x0020/info) - Time established as 03/06/14 10:40:32; (110 seconds since power on)
mfid0 on mfi0
mfid0: 121116MB (248045568 sectors) RAID volume (no label) is optimal
Trying to mount root from ufs:/dev/mfid0p2 [rw]...

There's still some problem with the driver (see the "fatal error" message), but at least it works. I'm going to do some checks and run some heavy I/O to see whether it is reliable.

If this solves the issue, could the changes mentioned above be backported to 10-STABLE?
>Release-Note:
>Audit-Trail:
>Unformatted:
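
For anyone sifting through a longer capture of these controller events, here is a minimal Python sketch that splits an mfi event line into its fields and flags non-info severities such as the CRIT entry above. The names parse_event and MFI_EPOCH are made up for this illustration and are not part of the driver or of mfiutil. The epoch of 2000-01-01 is an inference from the log itself (447416847s is reported as 03/06/14 10:27:27), not something taken from the driver source.

```python
import re
from datetime import datetime, timedelta

# Assumption inferred from the log above: absolute event timestamps count
# seconds from 2000-01-01 00:00:00 (447416847s <-> 03/06/14 10:27:27).
MFI_EPOCH = datetime(2000, 1, 1)

# Matches lines such as:
#   mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered a fatal error ...
#   mfi0: 11032 (boot + 3s/0x0020/info) - Firmware initialization started ...
EVENT_RE = re.compile(
    r"^mfi\d+: (?P<seq>\d+) "
    r"\((?P<when>boot \+ \d+s|\d+s)/(?P<cls>0x[0-9a-fA-F]+)/(?P<sev>\w+)\) - "
    r"(?P<msg>.*)$"
)


def parse_event(line):
    """Return a dict for an mfi event line, or None for any other line."""
    m = EVENT_RE.match(line.strip())
    if m is None:
        return None
    when = m.group("when")
    if when.startswith("boot"):
        # Relative stamp: the firmware has no wall-clock time yet.
        timestamp = None
        boot_offset = int(when[len("boot + "):-1])
    else:
        timestamp = MFI_EPOCH + timedelta(seconds=int(when[:-1]))
        boot_offset = None
    return {
        "seq": int(m.group("seq")),
        "timestamp": timestamp,
        "boot_offset": boot_offset,
        "class": int(m.group("cls"), 16),
        "severity": m.group("sev"),
        "message": m.group("msg"),
    }


if __name__ == "__main__":
    sample = [
        "mfi0: Using MSI",
        "mfi0: 11033 (boot + 3s/0x0020/info) - Firmware version 4.200.21-2840",
        "mfi0: 11034 (447417030s/0x0020/CRIT) - Controller encountered a fatal error and was reset",
    ]
    for line in sample:
        event = parse_event(line)
        if event is not None and event["severity"] != "info":
            print(event["seq"], event["timestamp"], event["message"])
```

Feeding the saved boot messages through parse_event line by line leaves only the entries worth a closer look; driver lines that are not firmware events (for example "mfi0: Using MSI") return None and are skipped.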