Date: Tue, 26 Jul 2011 21:44:05 -1000 From: Tim Newsham <tim.newsham+fbsd@gmail.com> To: freebsd-amd64@freebsd.org Subject: Re: page fault in kernel Message-ID: <CAGSRWbh_O1iNavv3e=PKo3s4tP9gPz4k3V80-OncUE2WoFCtEw@mail.gmail.com> In-Reply-To: <CAGSRWbibOcLKUOrLFx-K5ftnfO1pkTyNew5kGMdgrJ8suyU8DQ@mail.gmail.com> References: <CAGSRWbibOcLKUOrLFx-K5ftnfO1pkTyNew5kGMdgrJ8suyU8DQ@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
finally got to the cpuburn. Normal cpu burn for K7 didnt turn anything up, but the burnMMX test which does a heavy memory test actually did.. It wouldnt report errors for memory sizes <=3D 256kb, but >=3D 512kb it would return code 254 after a few seconds (when I ran it twice, once for each core). So.. looks like memory after all (fits the pattern). Using the test I took the same ram and seated it in a different slot and it looks like the problem went away. When I tried adding in the second ram stick I have (each 2G, for a total of 4G) the problem came back, but I didnt diagnose it further to see if it was the ram stick or the memory slot that was to blame (save that for another day). So anyway, going to try a big overnight build to see if that has completely resolved the issue or not. Here's my repro for my original crash: - boot single user - mount -o ro -a - (burnMMX P || echo $?) & - (burnMMX P || echo $?) & - ... wait ... (useful program that burncpu package!). will update tomorow if it fixed the original issue or not... Tim On Tue, Jul 26, 2011 at 8:39 PM, Tim Newsham <tim.newsham+fbsd@gmail.com> w= rote: > I have an amd64 box that is occasionally panic'ing with a kernel > page fault while the system is under heavy load. =A0I tried swapping > out ram and running ram tests and that hasn't seem to turn up > anything. =A0I originally saw this with 8.1 but I recently updated to > 8.2 to be sure and I'm still seeing the problem. =A0I'm up to date > with "freebsd-update" and I verified that the core files all match > the md5 of a clean 8.2 release (after it is also updated). =A0I also > tried to disable various frills I had installed (like ip6 tunnel). > > The normal way I repro this issue is to start a "portupgrade -caP" > going and I usually get a crash within a few hours. =A0Sometimes > I see some unusual behavior instead of or right before a crash > (ie. program segfaults that shouldn't normally). > > Are there any known kernel issues that sound similar to this > problem? =A0I've been working under the assumption that this is > probably hardware related, but I haven't yet ruled out software. > > I'll probably be trying some tests with cpuburn. =A0Any other advice > for tracking down this issue would be appreciated. > > Tim >
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAGSRWbh_O1iNavv3e=PKo3s4tP9gPz4k3V80-OncUE2WoFCtEw>