Date: Thu, 8 Jul 2004 10:54:03 -0400 (EDT) From: Andrew Gallatin <gallatin@cs.duke.edu> To: Ken Smith <kensmith@cse.Buffalo.EDU> Cc: freebsd-alpha@freebsd.org Subject: Re: Alpha problems (though maybe not just Alpha...) Message-ID: <16621.24587.229940.823746@grasshopper.cs.duke.edu> In-Reply-To: <20040708135441.GD15096@electra.cse.Buffalo.EDU> References: <20040708135441.GD15096@electra.cse.Buffalo.EDU>
next in thread | previous in thread | raw e-mail | index | archive | help
Ken Smith writes: > > FYI, the Alpha reference machine in the cluster has had problems the > past few days when anything significant (e.g. 'make buildworld'...) > gets run on it. It usually locks up solid right after printing > this on the console: > > panic() at panic+0x200 > _mtx_assert() at _mtx_assert+0xb4 > vrele() at vr 16 32 48 64 80 A mutex assert failed in vrele(). The first one that comes to mind is the GIANT_REQUIRED; right at the top. Its really too bad that we don't have the rest of the stack. The remainder of the output is quite strange.. It almost looks like it has stopped printing the stack trace and started taking a dump, and then panic'ed taking the dump (ahc intr might support that): > fatal kernel trap: > > trap entry = 0x2 (memory management fault) > cpuid = 0 > faulting va = 0x0 > type = access violation > cause = instruction fetch > pc = 0x0 > ra = 0x0 > sp = 0xfffffe00317bfc70 > curthread = 0xfffffc007d772000 > pid = 23, comm = intr: ahc1 > > spin lock sched lock held by 0xfffffc007d772000 for > 5 seconds A va and ra of 0 means that it followed a null function pointer, or the stack got corrupted and it tried to return to the wrong place... Can you try disabling dumps, and disabling kern.sync_on_panic and see if you can get a decent stack trace? Thanks, Drew
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?16621.24587.229940.823746>