Date: Tue, 20 Jul 2010 14:16:11 +0200 From: "Svein Skogen (Listmail account)" <svein-listmail@stillbilde.net> To: freebsd-current@freebsd.org Subject: Re: current + mpt = panic: Bad link elm 0xffffff80002d6480 next->prev != elm Message-ID: <4C45938B.8000604@stillbilde.net> In-Reply-To: <20100720115528.GA88965@putsch.kolbu.ws> References: <20100715123423.GC52222@putsch.kolbu.ws> <20100715160048.GA61891@alchemy.franken.de> <20100715175225.GA52693@putsch.kolbu.ws> <20100716103125.GA73878@putsch.kolbu.ws> <20100718122022.GW4706@alchemy.franken.de> <20100719170654.GA19889@putsch.kolbu.ws> <20100720101736.GD4706@alchemy.franken.de> <20100720115528.GA88965@putsch.kolbu.ws>
next in thread | previous in thread | raw e-mail | index | archive | help
This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enig1C9D8BE22A5CD5A921D0B1B7
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
On 20.07.2010 13:55, St=E5le Kristoffersen wrote:
> On 2010-07-20 at 12:17, Marius Strobl wrote:
>> On Mon, Jul 19, 2010 at 07:06:54PM +0200, Stle Kristoffersen wrote:
>>> On 2010-07-18 at 14:20, Marius Strobl wrote:
>>>>>> Downgrading now...
>>>>>
>>>>> And it crashed again, with current from r209598...
>>>>>
>>>>
>>>> Ok, this at least means that your problem isn't caused by the recent=
>>>> changes to mpt(4) as the pre-r209599 version only differed from the
>>>> 8-STABLE one in a cosmetic change at that time.
>>>
>>> I have another data-point, I cvsup'ed to the latest current again, an=
d
>>> rebuilt without INVARIANT and WITNESS, and now it seems to survive th=
e
>>> timeouts.
>>
>> That's more or less expected as the sanity check issuing the panic
>> just isn't compiled in then. However, my understanding was that with
>> STABLE you don't get the timeouts in the first place, or do you see
>> them there also?
>=20
> I got the timeouts with STABLE as well, that was the reason for me to
> try out CURRENT. I'm sorry I didn't mention that earlier.
>=20
> My main concern is to get rid of the timeouts, but a panic on one can't=
 be
> right. How can I debug this further? I can get timeout fairly consisten=
t by
> putting a bit of load on the drives. If it would help I can also provid=
e
> remote access.
>=20
> I'm trying to update the firmware on some of the drives now to see if t=
hat
> helps with the timeouts.
Sorry for the late response here, but what you're describing matches
fairly well what I saw with RELENG_8 (just after 8.0 was released), but
luckily I didn't have any disks on my MPT, just my tape autoloader.
Random timeouts, and then bus resets (that made tape IO unreliable).
The bad news, is that I had the exact same trouble with OpenSolaris
(134), and something-similar with Linux (can't remember versions), at
the time.
I never did find a solution, and ended up throwing windows on the box,
just to get reliable backups.
My MPT is a 3801 LSI1068e based card running the latest bios.
//Svein
--=20
--------+-------------------+-------------------------------
  /"\   |Svein Skogen       | svein@d80.iso100.no
  \ /   |Solberg =D8stli 9    | PGP Key:  0xE5E76831
   X    |2020 Skedsmokorset | svein@jernhuset.no
  / \   |Norway             | PGP Key:  0xCE96CE13
        |                   | svein@stillbilde.net
 ascii  |                   | PGP Key:  0x58CD33B6
 ribbon |System Admin       | svein-listmail@stillbilde.net
Campaign|stillbilde.net     | PGP Key:  0x22D494A4
        +-------------------+-------------------------------
        |msn messenger:     | Mobile Phone: +47 907 03 575
        |svein@jernhuset.no | RIPE handle:    SS16503-RIPE
--------+-------------------+-------------------------------
         If you really are in a hurry, mail me at
               svein-mobile@stillbilde.net
 This mailbox goes directly to my cellphone and is checked
        even when I'm not in front of my computer.
------------------------------------------------------------
                     Picture Gallery:
          https://gallery.stillbilde.net/v/svein/
------------------------------------------------------------
--------------enig1C9D8BE22A5CD5A921D0B1B7
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.12 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAkxFk48ACgkQODUnwSLUlKSExQCeONKY0PZSJCL+6RKURaZax2JU
NeIAoIaFZN91ghfF1QF97Gozbo8kyZaZ
=R8hO
-----END PGP SIGNATURE-----
--------------enig1C9D8BE22A5CD5A921D0B1B7--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4C45938B.8000604>
