Date:      Wed, 9 Nov 2016 11:58:45 +0100
From:      "O. Hartmann" <o.hartmann@walstatt.org>
To:        Mark Johnston <markj@FreeBSD.org>
Cc:        "O. Hartmann" <ohartmann@walstatt.org>, FreeBSD CURRENT <freebsd-current@freebsd.org>, glebius@FreeBSD.org, ae@FreeBSD.org
Subject:   Re: was: CURRENT [r308087] still crashing: Backtrace provided
Message-ID:  <20161109115845.6c8811ca@thor.walstatt.dynvpn.de>
In-Reply-To: <20161108182829.GA62725@wkstn-mjohnston.west.isilon.com>
References:  <alpine.GSO.1.10.1610231515170.5272@multics.mit.edu> <20161029163336.46bb24c4.ohartman@zedat.fu-berlin.de> <20161030013345.GC67644@raichu> <20161030082525.6fb6d8a4.ohartman@zedat.fu-berlin.de> <20161030163934.GA49633@raichu> <20161030185500.64e57233.ohartman@zedat.fu-berlin.de> <20161030182509.GA1491@charmander> <20161105184509.28d162f1@thor.walstatt.dynvpn.de> <20161105203748.GD63972@wkstn-mjohnston.west.isilon.com> <20161106111356.39850d7e@thor.walstatt.dynvpn.de> <20161108182829.GA62725@wkstn-mjohnston.west.isilon.com>


On Tue, 8 Nov 2016 10:28:51 -0800
Mark Johnston <markj@FreeBSD.org> wrote:

> On Sun, Nov 06, 2016 at 11:13:56AM +0100, O. Hartmann wrote:
> > Yesterday, I ran r307233 the whole day (> 9 hours) without problems and without the
> > reported crash.
> >
> > This morning I got brave and tried r307234 - and had a crash within an hour.
>
> Thanks for confirming - I cc'ed glebius@ and ae@, who can provide more
> insight than me. I was just trying to narrow down the problem to a
> specific commit.
>

Thank you for helping.

Kind regards,

Oliver
> >
> > >
> > > >
> > > > Attached, you'll find the backtrace report as last time. I had to type in "dump"
> > > > blindly on the system as a dark screen or a stuck X11 screen blocked the console
> > > > (I use vt() and nVidia BLOB with my nVidia GPUs - and this is still broken on
> > > > FBSD).
> > > >
> > > > Please let me know how I can assist further. I saved both the core AND this time
> > > > the culprit kernel.
> > >
> > > Great, thank you. I would first like to confirm that r307234 is indeed
> > > causing the crash - since it appears to be easy to trigger, that should
> > > be faster. If not, the core will help track down the real problem.
> >
> > Although I was under the impression that the in-kernel-config option
> >
> > makeoptions    DEBUG=-g
> >
> > would make debugging symbols available, I have been proven wrong.
> >
> > I also tried on
> >
> > FreeBSD 12.0-CURRENT #15 r308329: Sat Nov  5 08:52:24 CET 2016
> >
> > and crashed, from which I picked up the kernel and vmcore as well as
> > the text of the backtrace as provided in an earlier mail (see below at [core.txt.0]),
> > and if I perform this suggested command sequence:
> >
> > ohartmann@thor [kernel_debug]: kgdb ./kernel vmcore.0
> > GNU gdb 6.1.1 [FreeBSD]
> > Copyright 2004 Free Software Foundation, Inc.
> > GDB is free software, covered by the GNU General Public License, and you are
> > welcome to change it and/or distribute copies of it under certain conditions.
> > Type "show copying" to see the conditions.
> > There is absolutely no warranty for GDB.  Type "show warranty" for details.
> > This GDB was configured as "amd64-marcel-freebsd"...(no debugging symbols found)...
> > Attempt to extract a component of a value that is not a structure pointer.
> > Attempt to extract a component of a value that is not a structure pointer.
> > #0  0xffffffff807b8d83 in doadump ()
> > (kgdb) frame 12
> > #12 0xffffffff80923a74 in ip_output ()
> > (kgdb) p *ifp
> > No symbol table is loaded.  Use the "file" command.
> > (kgdb) p *ro
> > No symbol table is loaded.  Use the "file" command.
> > (kgdb)
> >
> > Again, this is the very first time I'm doing this kind of debugging, and I am missing
> > something here; apologies for that.
>
> Hm, I'm not sure what the problem is. When a kernel is installed and
> WITHOUT_KERNEL_SYMBOLS is not set in src.conf, debug symbols should
> automatically be installed to /usr/lib/debug/boot/kernel.
>
> >
> > Sorry about the redundancy.
> >
> > The curious thing to me is that this bug is triggered on systems with Intel CPU
> > architectures older than or equal to IvyBridge. The very same /etc/make.conf
> > and /etc/src.conf as well as the very same kernel config, apart from some local
> > hardware-dependent modifications, are spread around my servers and workstations.
> > In particular, my office box is a Haswell XEON with almost the exact same config,
> > running on CURRENT (recent as of Thursday) without problems, while the box I'm
> > reporting this error from is crashing (i3-3220); the server, also crashing here, is an
> > E3-1245 V2. Another crashing system is a 2009 C2D XEON 5XXX, two-socket server,
> > crashing the same way, but with a different kernel config.
> > I also tried GENERIC on the crashing systems, with the same results.
> >
> > I'm using IPFW as the firewall on all systems.
> >
> > Please tell me if you revert some commits; I'll then check out the sources up to recent
> > CURRENT and try again.
> >
> > This is just for addition and completeness.
> >
> >
> > Kind regards and thanks in advance,
> >
> > Oliver
> >
> > [...]
> > [core.txt.0]
> > ...
> > Fatal trap 9: general protection fault while in kernel mode
> > cpuid = 0; apic id = 00
> > instruction pointer     = 0x20:0xffffffff807b44fb
> > stack pointer           = 0x28:0xfffffe0238f7c290
> > frame pointer           = 0x28:0xfffffe0238f7c310
> > code segment            = base 0x0, limit 0xfffff, type 0x1b
> >                         = DPL 0, pres 1, long 1, def32 0, gran 1
> > processor eflags        = interrupt enabled, resume, IOPL = 0
> > current process         = 521 (nslcd)
> >
> > Reading symbols from /boot/modules/nvidia-modeset.ko...done.
> > Loaded symbols for /boot/modules/nvidia-modeset.ko
> > Reading symbols from /boot/modules/nvidia.ko...done.
> > Loaded symbols for /boot/modules/nvidia.ko
> > #0  doadump (textdump=0) at pcpu.h:222
> > 222     pcpu.h: No such file or directory.
> >         in pcpu.h
> > (kgdb) #0  doadump (textdump=0) at pcpu.h:222
> > #1  0xffffffff8049e1eb in db_dump (dummy=<value optimized out>, dummy2=false,
> >     dummy3=0, dummy4=0x0) at /usr/src/sys/ddb/db_command.c:546
> > #2  0xffffffff8049dfe9 in db_command (cmd_table=<value optimized out>)
> >     at /usr/src/sys/ddb/db_command.c:453
> > #3  0xffffffff8049dd44 in db_command_loop ()
> >     at /usr/src/sys/ddb/db_command.c:506
> > #4  0xffffffff804a11af in db_trap (type=<value optimized out>,
> >     code=<value optimized out>) at /usr/src/sys/ddb/db_main.c:248
> > #5  0xffffffff807fd3e3 in kdb_trap (type=<value optimized out>,
> >     code=<value optimized out>, tf=<value optimized out>)
> >     at /usr/src/sys/kern/subr_kdb.c:654
> > #6  0xffffffff80afeaf1 in trap_fatal (frame=0xfffffe0238f7c1d0, eva=0)
> >     at /usr/src/sys/amd64/amd64/trap.c:796
> > #7  0xffffffff80afe7df in trap (frame=0xfffffe0238f7c1d0)
> >     at /usr/src/sys/amd64/amd64/trap.c:198
> > #8  0xffffffff80adf4a1 in calltrap ()
> >     at /usr/src/sys/amd64/amd64/exception.S:236
> > #9  0xffffffff807b44fb in __rw_wlock_hard (c=<value optimized out>,
> >     tid=<value optimized out>, file=<value optimized out>,
> >     line=<value optimized out>) at /usr/src/sys/kern/kern_rwlock.c:830
> > #10 0xffffffff807b437c in _rw_wlock_cookie (c=0xfffff80070538310,
> >     file=0xffffffff80ca31b2 "/usr/src/sys/net/if_ethersubr.c", line=304)
> >     at /usr/src/sys/kern/kern_rwlock.c:296
> > #11 0xffffffff808d1e07 in ether_output (ifp=0xfffff800036e7800,
> >     m=<value optimized out>, dst=0xfffff8003d980e60, ro=0xfffff8003d980e40)
> >     at /usr/src/sys/net/if_ethersubr.c:304
> > #12 0xffffffff80923a74 in ip_output (m=0xfffff8000a24a500,
> >     opt=<value optimized out>, ro=<value optimized out>, flags=0, imo=0x0,
> >     inp=<value optimized out>) at /usr/src/sys/netinet/ip_output.c:664
> > #13 0xffffffff8099a7ee in tcp_output (tp=<value optimized out>)
> >     at /usr/src/sys/netinet/tcp_output.c:1432
> > #14 0xffffffff809a7c88 in tcp_usr_send (so=<value optimized out>,
> >     flags=<value optimized out>, m=0xfffff8003d837800, nam=0x0,
> >     control=<value optimized out>, td=0xfffff8000a24a500)
> >     at /usr/src/sys/netinet/tcp_usrreq.c:956
> > #15 0xffffffff808567b4 in sosend_generic (so=<value optimized out>,
> >     addr=<value optimized out>, uio=<value optimized out>,
> >     top=0xfffff8003d837800, control=<value optimized out>,
> >     flags=<value optimized out>, td=<value optimized out>)
> >     at /usr/src/sys/kern/uipc_socket.c:1359
> > #16 0xffffffff8082d672 in soo_write (fp=<value optimized out>,
> >     uio=0xfffffe0238f7c900, active_cred=<value optimized out>,
> >     flags=<value optimized out>, td=<value optimized out>)
> >     at /usr/src/sys/kern/sys_socket.c:146
> > #17 0xffffffff80823d84 in dofilewrite (td=0xfffff8000a24a500, fd=7,
> >     fp=0xfffff8000a0421e0, auio=0xfffffe0238f7c900,
> >     offset=<value optimized out>, flags=0) at file.h:311
> > #18 0xffffffff80823ac8 in kern_writev (td=0xfffff8000a24a500, fd=7,
> >     auio=0xfffffe0238f7c900) at /usr/src/sys/kern/sys_generic.c:508
> > #19 0xffffffff80823a54 in sys_write (td=0xfffff800705382f8,
> >     uap=<value optimized out>) at /usr/src/sys/kern/sys_generic.c:421
> > #20 0xffffffff80aff33f in amd64_syscall (td=0xfffff8000a24a500,
> >     traced=<value optimized out>) at subr_syscall.c:135
> > #21 0xffffffff80adf78b in Xfast_syscall ()
> >     at /usr/src/sys/amd64/amd64/exception.S:396
> > #22 0x0000000801261f5a in ?? ()
> > Previous frame inner to this frame (corrupt stack?)
> > Current language:  auto; currently minimal
> > (kgdb)
> > [...]
> > --
> > O. Hartmann
> >
> > I object to the use or transmission of my data for
> > advertising purposes or for market or opinion research (§ 28 para. 4 BDSG).
> _______________________________________________
> freebsd-current@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-current
> To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"



--
O. Hartmann

I object to the use or transmission of my data for
advertising purposes or for market or opinion research (§ 28 para. 4 BDSG).



