Date: Sat, 7 Mar 2009 20:25:21 +0200
From: Kostik Belousov <kostikbel@gmail.com>
To: Boris Kochergin <spawk@acm.poly.edu>
Cc: freebsd-stable@freebsd.org
Subject: Re: "Fatal trap 12: page fault while in kernel mode" on 7.1/amd64, but not 7.0
Message-ID: <20090307182521.GH41617@deviant.kiev.zoral.com.ua>
In-Reply-To: <49B19D28.1060803@acm.poly.edu>
References: <49B07482.4080208@acm.poly.edu> <1236334857.88789.4.camel@buffy.york.ac.uk> <49B19D28.1060803@acm.poly.edu>

On Fri, Mar 06, 2009 at 05:01:12PM -0500, Boris Kochergin wrote:
> Gavin Atkinson wrote:
> >On Thu, 2009-03-05 at 19:55 -0500, Boris Kochergin wrote:
> >
> >>Ahoy. I recently upgraded an amd64 machine to 7.1-RELEASE, and started
> >>getting a bunch of these at a pretty high frequency (a few hours to a
> >>day apart):
> >>
> >>http://acm.poly.edu/~spawk/IMG00033.jpg
> >>
> >>The "current process" is always httpd. They're particularly annoying
> >>because the machine doesn't actually ever reboot, requiring manual
> >>intervention. Reverting the kernel back to 7.0 makes the panic go away,
> >>and the machine had been happily running 7.0 for about a year
> >>beforehand. I realize that the photo hardly contains any useful
> >>debugging information, but I was hoping it might look familiar to
> >>someone. If not, I guess I'll come back with a backtrace.
> >>
> >
> >A backtrace will almost certainly be necessary to figure out what this
> >issue is, although there is a possibility that the output of
> >"addr2line -e /boot/kernel/kernel.symbols 0x8:0xffffffff802d7010"
> >might help, assuming you've not recompiled your kernel yet. (That
> >number should be the same as the "instruction pointer" shown by the
> >panic, but as the photo is quite blurred there's a chance I've got it
> >wrong, if you have a better picture of it or wrote it down then use
> >that)
> >
> >Gavin
> >
> Here it is, with some additional information afterward:
>
> Unread portion of the kernel message buffer:
> kernel trap 12 with interrupts disabled
>
>
> Fatal trap 12: page fault while in kernel mode
> cpuid = 1; apic id = 01
> fault virtual address   = 0x30
> fault code              = supervisor read data, page not present
> instruction pointer     = 0x8:0xffffffff80293faf
> stack pointer           = 0x10:0xffffffff9cbaea70
> frame pointer           = 0x10:0xffffff000fc14000
> code segment            = base 0x0, limit 0xfffff, type 0x1b
>                         = DPL 0, pres 1, long 1, def32 0, gran 1
> processor eflags        = resume, IOPL = 0
> current process         = 881 (httpd)
> trap number             = 12
> panic: page fault
> cpuid = 1
> Uptime: 1m51s
> Physical memory: 8185 MB
> Dumping 328 MB: 313 297 281 265 249 233 217 201 185 169 153 137 121 105 89 73 57 41 25 9
>
> #0  doadump () at pcpu.h:195
> 195     pcpu.h: No such file or directory.
>         in pcpu.h
> (kgdb) where
> #0  doadump () at pcpu.h:195
> #1  0xffffff000fc14000 in ?? ()
> #2  0xffffffff8025eba9 in boot (howto=260) at /usr/src-7.1/sys/kern/kern_shutdown.c:418
> #3  0xffffffff8025efb2 in panic (fmt=0x104 <Address 0x104 out of bounds>) at /usr/src-7.1/sys/kern/kern_shutdown.c:574
> #4  0xffffffff803df5c3 in trap_fatal (frame=0xffffff000fc14000, eva=Variable "eva" is not available.
> ) at /usr/src-7.1/sys/amd64/amd64/trap.c:764
> #5  0xffffffff803e018f in trap (frame=0xffffffff9cbae9c0) at /usr/src-7.1/sys/amd64/amd64/trap.c:290
> #6  0xffffffff803c5c4e in calltrap () at /usr/src-7.1/sys/amd64/amd64/exception.S:209
> #7  0xffffffff80293faf in turnstile_broadcast (ts=0x0, queue=0) at /usr/src-7.1/sys/kern/subr_turnstile.c:836
> #8  0xffffffff8025256a in _mtx_unlock_sleep (m=0xffffffff80593538, opts=Variable "opts" is not available.
> ) at /usr/src-7.1/sys/kern/kern_mutex.c:619
> #9  0xffffffff80275ed3 in __umtx_op_cv_wait (td=0x1ee, uap=Variable "uap" is not available.
> ) at /usr/src-7.1/sys/kern/kern_umtx.c:312
> #10 0xffffffff803dfb78 in syscall (frame=0xffffffff9cbaec80) at /usr/src-7.1/sys/amd64/amd64/trap.c:907
> #11 0xffffffff803c5e5b in Xfast_syscall () at /usr/src-7.1/sys/amd64/amd64/exception.S:330
> #12 0x0000000800f5354c in ?? ()
> Previous frame inner to this frame (corrupt stack?)
> (kgdb)
>
> The dump was difficult to acquire--the system would often lock up after
> dumping only a portion of the memory it wanted to save. I can also now
> trigger the panic pretty reliably using this bit of script:
>
> #!/usr/local/bin/bash
>
> for i in {1..900}
> do
> wget --quiet -O /dev/null http://acm.poly.edu/wiki/Hosting &
> done
>
> ...where the URL is a MediaWiki installation on the afflicted machine.

Can you please recompile the kernel with debugging options and provoke
the panic on it? We need at least options INVARIANTS, INVARIANT_SUPPORT
and WITNESS.
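
For reference, a minimal sketch of a kernel configuration carrying the three
requested options. The file name DEBUG71 and its location
(sys/amd64/conf/ under the source tree) are assumptions chosen for
illustration; they do not come from this thread. The sketch builds on the
stock GENERIC configuration.

```
# Hypothetical file: sys/amd64/conf/DEBUG71 (name is an assumption)
include         GENERIC
ident           DEBUG71

# Debugging options requested above
options         INVARIANTS          # enable run-time consistency checks
options         INVARIANT_SUPPORT   # support code required by INVARIANTS
options         WITNESS             # lock order verification
```

The kernel would then typically be built and installed from the top of the
source tree with "make buildkernel KERNCONF=DEBUG71" followed by
"make installkernel KERNCONF=DEBUG71". The backtrace above shows sources
under /usr/src-7.1, so the paths may need adjusting. Expect the machine to
run noticeably slower with WITNESS enabled.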