Date: Thu, 20 Dec 2007 15:32:40 +0200
From: Nikolay Pavlov <qpadla@gmail.com>
To: freebsd-current@freebsd.org
Cc: arch@freebsd.org, Robert Watson <rwatson@freebsd.org>, current@freebsd.org
Subject: Re: DDB scripting, output capture, and textdumps
Message-ID: <200712201532.41123.qpadla@gmail.com>
In-Reply-To: <20071218120359.E15521@fledge.watson.org>
References: <20071218120359.E15521@fledge.watson.org>
On Tuesday 18 December 2007 14:10:46 Robert Watson wrote:
> Dear all:
>
> I've been hacking on-and-off for a while on a side project to improve
> our kernel debugging facilities. Primarily, my concern has been to
> address three problems:
>
> - The complications of employing kernel core dumps for debugging,
> including the large size of dumps making them unwieldy to distribute
> or store for any extended period (even with minidumps), the requirement
> to have relatively synchronized kernel source in order to use the dumps,
> the need to have a kernel with debugging symbols, and the problems with
> fsck causing sufficient swap use to invalidate dumps before they can be
> extracted.
>
> - The decreasing likelihood that notebooks will ship with serial ports
> that can be used for interactive debugging using DDB. Making
> end-users type in stack traces is cruel, photos are a pain, and X11
> rules out both.
>
> - The fact that a great many problems are most easily diagnosed using
> utility routines present in DDB, but not as easily using kgdb for
> offline analysis. I find that for many bugs I analyze, simply
> looking at the DDB output is sufficient to identify the source of the
> problem.
>
> An idea I punted around a bit at BSDCan earlier this year (or perhaps it
> was at EuroBSDCon the previous year) was that of a "textdump" -- that
> is, a new type of kernel dump based on capturing automatically extracted
> debugging information generated by DDB. The result would be an ASCII
> text file that could be filed as a bug report, perhaps even
> automatically.
>
> To this end, I have implemented three new facilities for use with DDB:
>
> (1) DDB output capture. The output of DDB is stored in a memory buffer,
> and can be extracted using a sysctl or textdumps (see below). This
> can be turned on and off, both for use manually ("I'll want this
> later, but not that") and as part of scripts (see below).
>
> (2) DDB scripting. A limited number of named scripts can be defined to
> run a series of DDB commands. No loops, etc, just simple command
> lists. These can be caused to run automatically on entering DDB
> for various scenarios, including WITNESS violations and kernel panics.
> They can also be run by hand in order to save a bit of typing if you use
> DDB in a repetitive way (as I do).
>
> (3) Textdumps. A new dump type that stores a series of data files
> containing various pieces of information, including the DDB capture
> buffer, kernel message buffer, kernel configuration (if compiled
> into the kernel), panic message, and kernel version string. These are
> stored in the ustar format inside the dump partition (aligned to the
> end) so they can be easily extended, and savecore(8) requires almost no new
> logic to deal with them (it just drops numbered tar files in
> /var/crash). This makes it straightforward to extend the textdump
> format to include new types of information and avoids the issue of how
> to safely simultaneously represent information in many different formats
> in the same file.
>
> These are pretty flexible tools, and you can imagine doing the following
> sorts of things:
>
> - Setting the kdb.enter.panic script to automatically turn on output
> capture, do full backtraces of all threads, show open file
> information, dump UMA stats, and save it all to a textdump and then
> reboot.
>
> - Setting the kdb.enter.witness script to show lock information,
> generate a coredump, and reboot. Or, just to automatically do "show
> allocks" and drop to the DDB prompt.
>
> - Adding a flag to rc.conf to automatically submit textdumps via e-mail
> to a specific address, perhaps including GNATS or an automated bug
> system. These could be unpacked and automatically analyzed, and due to
> the compact size, kept for long-term trend analysis or to identify when
> a problem started occurring.
>
> I've produced an initial snapshot of the above, which can be found here:
>
> http://www.watson.org/~robert/freebsd/20071218-ddb.tgz
>
> This adds three files to DDB, patches quite a few kernel files (to pass
> more information into KDB about why it's being entered, in order to
> trigger the right script), enhances savecore(8) so it knows how to
> extract textdumps, adds a ddb(8) command line tool so that userspace can
> manage DDB scripts from outside the debugger, extends the ddb(4)
> man page, and adds a new textdump(4) man page.
>
> There are a number of known limitations; I've tried to document them at
> the top of the pertinent files where I am aware of them. I also regret
> to say that to date I've been able to test only on i386, and not other
> platforms. I'd welcome any feedback -- I'd like to get these changes
> into CVS in the next week or two.
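
For reference, item (3) of the quoted message describes a textdump as a set of data files in ustar format that savecore(8) drops into /var/crash as a numbered tar file. A minimal sketch of reading such an archive, assuming Python's standard tarfile module; the helper names, the member names and the sample contents below are illustrative assumptions, not taken from the patch:

```python
import io
import tarfile


def read_textdump(fileobj):
    """Return {member name: decoded text} for each regular file in a tar archive."""
    parts = {}
    # "r:" opens an uncompressed tar stream, which is what a ustar dump would be.
    with tarfile.open(fileobj=fileobj, mode="r:") as tar:
        for member in tar:
            if not member.isfile():
                continue
            data = tar.extractfile(member).read()
            parts[member.name] = data.decode("ascii", errors="replace")
    return parts


def make_sample_textdump():
    """Build a tiny in-memory ustar archive standing in for a real textdump.

    The member names are hypothetical placeholders.
    """
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
        for name, text in (("panic.txt", "example panic message\n"),
                           ("version.txt", "example version string\n")):
            payload = text.encode("ascii")
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    buf.seek(0)
    return buf


if __name__ == "__main__":
    for name, text in read_textdump(make_sample_textdump()).items():
        print(name, "->", text.strip())
```

On a real system one would presumably pass an open file from /var/crash (opened in binary mode) instead of the sample buffer; the exact file name savecore(8) chooses is not stated in the message.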
It looks like some files are not included in the patch.
I get this error:
make -V CFILES -V SYSTEM_CFILES -V GEN_CFILES | MKDEP_CPP="cc -E" CC="cc" xargs
mkdep -a -f .newdep -O -pipe -std=c99 -g -Wall -Wredundant-decls
-Wnested-externs -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith
-Winline -Wcast-qual -Wundef -Wno-pointer-sign -fformat-extensions -nostdinc
-I. -I/usr/src/sys -I/usr/src/sys/contrib/altq -I/usr/src/sys/contrib/ipfilter
-I/usr/src/sys/contrib/pf -I/usr/src/sys/dev/ath -I/usr/src/sys/contrib/ngatm
-I/usr/src/sys/dev/twa -I/usr/src/sys/gnu/fs/xfs/FreeBSD
-I/usr/src/sys/gnu/fs/xfs/FreeBSD/support -I/usr/src/sys/gnu/fs/xfs -D_KERNEL
-DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common
-finline-limit=8000 --param inline-unit-growth=100
--param large-function-growth=1000 -mno-align-long-strings
-mpreferred-stack-boundary=2 -mno-mmx -mno-3dnow -mno-sse -mno-sse2 -mno-sse3
-ffreestanding
cc: /usr/src/sys/ddb/db_capture.c: No such file or directory
cc: /usr/src/sys/ddb/db_script.c: No such file or directory
cc: /usr/src/sys/ddb/db_textdump.c: No such file or directory
mkdep: compile failed
*** Error code 1
Stop in /usr/obj/usr/src/sys/GENERIC.
*** Error code 1
Stop in /usr/src.
*** Error code 1
Stop in /usr/src.
root@orion-vm:/usr/src# ls -la /usr/src/sys/ddb/
total 424
drwxr-xr-x 2 root wheel 1024 Dec 19 16:49 ./
drwxr-xr-x 53 root wheel 1024 Oct 29 08:24 ../
-rw-r--r-- 1 root wheel 2591 Dec 4 2005 db_access.c
-rw-r--r-- 1 root wheel 1431 Jan 6 2005 db_access.h
-rw-r--r-- 1 root wheel 7737 Jan 6 2005 db_break.c
-rw-r--r-- 1 root wheel 2098 Jan 6 2005 db_break.h
-rw-r--r-- 1 root wheel 16579 Dec 19 16:49 db_command.c
-rw-r--r-- 1 root wheel 15705 Jan 17 2007 db_command.c.orig
-rw-r--r-- 1 root wheel 1633 Dec 19 16:49 db_command.h
-rw-r--r-- 1 root wheel 1588 Jan 6 2005 db_command.h.orig
-rw-r--r-- 1 root wheel 7270 Oct 27 20:19 db_examine.c
-rw-r--r-- 1 root wheel 4811 Jan 6 2005 db_expr.c
-rw-r--r-- 1 root wheel 7981 Dec 19 16:49 db_input.c
-rw-r--r-- 1 root wheel 7931 Jan 6 2005 db_input.c.orig
-rw-r--r-- 1 root wheel 5978 Dec 19 16:49 db_lex.c
-rw-r--r-- 1 root wheel 5304 Jan 6 2005 db_lex.c.orig
-rw-r--r-- 1 root wheel 1951 Dec 19 16:49 db_lex.h
-rw-r--r-- 1 root wheel 1861 Jan 6 2005 db_lex.h.orig
-rw-r--r-- 1 root wheel 5976 Dec 19 16:49 db_main.c
-rw-r--r-- 1 root wheel 5787 Nov 6 2006 db_main.c.orig
-rw-r--r-- 1 root wheel 6889 Dec 19 16:49 db_output.c
-rw-r--r-- 1 root wheel 6639 Oct 10 2006 db_output.c.orig
-rw-r--r-- 1 root wheel 1444 Oct 8 2006 db_output.h
-rw-r--r-- 1 root wheel 2013 Apr 14 2005 db_print.c
-rw-r--r-- 1 root wheel 10956 Nov 13 13:43 db_ps.c
-rw-r--r-- 1 root wheel 8810 Apr 14 2005 db_run.c
-rw-r--r-- 1 root wheel 7815 Jun 16 2006 db_sym.c
-rw-r--r-- 1 root wheel 3591 Jan 6 2005 db_sym.h
-rw-r--r-- 1 root wheel 5143 Jan 17 2007 db_thread.c
-rw-r--r-- 1 root wheel 3402 Jan 6 2005 db_variables.c
-rw-r--r-- 1 root wheel 1854 Jan 6 2005 db_variables.h
-rw-r--r-- 1 root wheel 7218 Nov 17 2006 db_watch.c
-rw-r--r-- 1 root wheel 1509 Jan 6 2005 db_watch.h
-rw-r--r-- 1 root wheel 2264 Jan 6 2005 db_write_cmd.c
-rw-r--r-- 1 root wheel 7467 Dec 19 16:49 ddb.h
-rw-r--r-- 1 root wheel 5776 Jul 12 2006 ddb.h.orig
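
The same check can be done mechanically instead of by reading the listing. A minimal sketch, assuming the three file names reported by cc above are the only new files the build expects in the ddb directory; the function name missing_files is a hypothetical helper, not part of any tool:

```python
import os

# File names taken from the "No such file or directory" errors above.
EXPECTED_NEW_FILES = ("db_capture.c", "db_script.c", "db_textdump.c")


def missing_files(directory, names=EXPECTED_NEW_FILES):
    """Return the names from `names` that are not regular files in `directory`."""
    return [name for name in names
            if not os.path.isfile(os.path.join(directory, name))]


if __name__ == "__main__":
    # Path as used in the build output above.
    for name in missing_files("/usr/src/sys/ddb"):
        print("missing:", name)
```

An empty result would mean the three sources are in place and the mkdep failure has some other cause.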
-- 
======================================================================
- Best regards, Nikolay Pavlov. <<<-----------------------------------
======================================================================
