Date: Wed, 19 Mar 2014 11:31:19 -0500 From: Alan Cox <alc@rice.edu> To: John-Mark Gurney <jmg@funkthat.com>, Adrian Chadd <adrian@freebsd.org> Cc: "svn-src-head@freebsd.org" <svn-src-head@freebsd.org>, "svn-src-all@freebsd.org" <svn-src-all@freebsd.org>, "src-committers@freebsd.org" <src-committers@freebsd.org> Subject: Re: svn commit: r263214 - in head/sys: compat/freebsd32 kern sys Message-ID: <5329C657.3000705@rice.edu> In-Reply-To: <20140316012609.GY32089@funkthat.com> References: <201403160053.s2G0rfmA073668@svn.freebsd.org> <CAJ-Vmon9%2BNmJghpjwi1NB2k1ETc=bPJJSjQzL-TFVKLZHi8iiA@mail.gmail.com> <20140316012609.GY32089@funkthat.com>
next in thread | previous in thread | raw e-mail | index | archive | help
On 03/15/2014 20:26, John-Mark Gurney wrote: > Adrian Chadd wrote this message on Sat, Mar 15, 2014 at 18:17 -0700: >> How far along does it get? > It rarely gets to multiuser, and even if it does, it panics very > shortly afterward: > panic: vm_page_alloc: page 0xc0805db0 is wired > > I did finally get around to dumping the vm_page struct for it (the > CTF crazyness) and I did send it to alc and kib, but neither one has > replied... > > here is a dump in case someone else has some vm_page clue: > {'act_count': '\x00', > 'aflags': '\x00', > 'busy_lock': 1, > 'dirty': '\xff', > 'flags': 0, > 'hold_count': 0, > 'listq': {'tqe_next': 0xc0805e00, 'tqe_prev': 0xc06d18a0}, > 'md': {'pv_kva': 3235856384, > 'pv_list': {'tqh_first': 0x0, 'tqh_last': 0xc0805de0}, > 'pv_memattr': '\x00', > 'pvh_attrs': 0}, > 'object': 0xc06d1878, > 'oflags': '\x04', > 'order': '\t', > 'phys_addr': 17776640, > 'pindex': 3572, > 'plinks': {'memguard': {'p': 0, 'v': 3228376932}, > 'q': {'tqe_next': 0x0, 'tqe_prev': 0xc06d1f64}, > 's': {'pv': 0xc06d1f64, 'ss': {'sle_next': 0x0}}}, > 'pool': '\x00', > 'queue': '\xff', > 'segind': '\x01', > 'valid': '\xff', > 'wire_count': 1} > > and as you can see, wire_count is not 0... but looks resonable... There are several things wrong with this page. Two lines later, this assertion would also fail: KASSERT(m->dirty == 0, ("vm_page_alloc: page %p is dirty", m)); because a page in the cache/free lists should never be dirty. The page's flags field doesn't include PG_CACHED. So, the object field should be NULL and the valid field should be 0, but they are not. All of these fields are (explicitly) cleared when a page is freed (cf. vm_page_free_toq()). Can you determine if the value of the object field matches the address of either kernel_object or kmem_object? The oflags field containing VPO_UNMANAGED makes that likely. In a nutshell, this looks like the same page is simultaneously in use by one part of the kernel and free in another part. I doubt that it's a simple case of use after free. In that case, the dirty, object, and valid fields would be zero, and the crash would be in a different part of the kernel. > So, I'm blocked until someone w/ clue tells me what more I need to do > to debug this... >
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?5329C657.3000705>