Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 2 Jun 2017 11:13:31 -0600
From:      Stephen Mcconnell <stephen.mcconnell@broadcom.com>
To:        Harry Schmalzbauer <freebsd@omnilan.de>, "Kenneth D. Merry" <ken@freebsd.org>
Cc:        freebsd-scsi@freebsd.org, Scott Long <scottl@freebsd.org>
Subject:   RE: mps(4) blocks panic-reboot
Message-ID:  <86a38661813a20d3b349920c2de8962e@mail.gmail.com>
In-Reply-To: <59319917.1050301@omnilan.de>
References:  <592FDE8C.1090609@omnilan.de> <ff9342e2e1eb541f347d9f683cfc8214@mail.gmail.com> <59303484.1040609@omnilan.de> <e6fe7cc17fb1302caf2122eaa11d10ba@mail.gmail.com> <59306503.4010007@omnilan.de> <59315A74.9050506@omnilan.de> <20170602153705.GA56018@mithlond.kdm.org> <593198C3.2080902@omnilan.de> <59319917.1050301@omnilan.de>

next in thread | previous in thread | raw e-mail | index | archive | help
Thanks Harry. I'll need to do some testing here to see if I can figure it
out.

Steve

> -----Original Message-----
> From: Harry Schmalzbauer [mailto:freebsd@omnilan.de]
> Sent: Friday, June 02, 2017 10:58 AM
> To: Kenneth D. Merry
> Cc: Stephen Mcconnell; freebsd-scsi@FreeBSD.ORG; Scott Long
> Subject: Re: mps(4) blocks panic-reboot
>
>  Bez=C3=BCglich Harry Schmalzbauer's Nachricht vom 02.06.2017 18:56
(localtime):
> > Bez=C3=BCglich Kenneth D. Merry's Nachricht vom 02.06.2017 17:37
(localtime):
> >> On Fri, Jun 02, 2017 at 14:30:44 +0200, Harry Schmalzbauer wrote:
> > =E2=80=A6
> >>> KDB: stack backtrace:
> >>> #0 0xffffffff805df4f7 at kdb_backtrace+0x67
> >>> #1 0xffffffff8059df96 at vpanic+0x186
> >>> #2 0xffffffff8059de03 at panic+0x43
> >>> #3 0xffffffff808a1892 at trap_fatal+0x322
> >>> #4 0xffffffff808a18e9 at trap_pfault+0x49
> >>> #5 0xffffffff808a1126 at trap+0x286
> >>> #6 0xffffffff80887401 at calltrap+0x8
> >>> #7 0xffffffff805800f2 at __mtx_unlock_sleep+0x72
> >>> #8 0xffffffff8029a7dc at xpt_polled_action+0x31c
> >>> #9 0xffffffff80416c2b at mpssas_ir_shutdown+0x51b
> >>> #10 0xffffffff8059db9a at kern_reboot+0x49a
> >>> #11 0xffffffff8059d6f8 at sys_reboot+0x458
> >>> #12 0xffffffff808a23f4 at amd64_syscall+0x6c4
> >>> #13 0xffffffff808876eb at Xfast_syscall+0xfb
> >>>
> >>> (kgdb) list *0xffffffff805f43ec
> >>> 0xffffffff805f43ec is in turnstile_broadcast
> >>> (/usr/local/share/deploy-
> tools/RELENG_11/src/sys/kern/subr_turnstile.c:837).
> >>> 832
> >>> 833             /*
> >>> 834              * Transfer the blocked list to the pending list.
> >>> 835              */
> >>> 836             mtx_lock_spin(&td_contested_lock);
> >>> 837             TAILQ_CONCAT(&ts->ts_pending,
&ts->ts_blocked[queue],
> >>> td_lockq);
> >>> 838             mtx_unlock_spin(&td_contested_lock);
> >>> 839
> >>> 840             /*
> >>> 841              * Give a turnstile to each thread.  The last thread
gets
> >>>
> >>> I haven't looked at the code at all and only very briefly lokked at
> >>> the diff, just out of curiosity, like pigs staring at clockworks ;-)
> >>>
> >>> But at least I hope this report does help.
> >> Thanks for testing it!
> >>
> >> My guess is that the problem is that the problem is
> >> xpt_polled_action() releases the device mutex, but
> >> mpssas_SSU_to_SATA_devices() isn't acquiring the mutex.
> >>
> >> You could try putting the following around the call to
xpt_polled_action():
> >>
> >> 	mtx_lock(xpt_path_mtx(ccb->ccb_h.path));
> >> 	xpt_polled_action(ccb);
> >> 	mtx_unlock(xpt_path_mtx(ccb->ccb_h.path));
> >>
> >> See if that fixes things.  One other thing to put in there -- after
> >> the if (target->stop_at_shutdown) { } statement, but still inside the
> >> for loop, add these two lines:
> >>
> >> 	xpt_free_path(ccb->ccb_h.path);
> >> 	xpt_free_ccb(ccb);
> >
> > Jope I didn't mess up with text editing, pleas see the attached hunk
> > if it corresponds to the (additional) chages to Stephen's diff.
>
> Sorry, now really with attachment...



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?86a38661813a20d3b349920c2de8962e>