Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 09 Nov 2002 19:57:10 -0800
From:      Kent Stewart <kstewart@owt.com>
To:        neil@mpfreescene.com
Cc:        "'Matthew Seaman'" <m.seaman@infracaninophile.co.uk>, kris@obsecurity.org, freebsd-questions@FreeBSD.ORG
Subject:   Re: Maybe isolated the signal 12s im getting to hi loads
Message-ID:  <3DCDD916.5080906@owt.com>
References:  <006001c2885f$369329c0$0200a8c0@b1>

next in thread | previous in thread | raw e-mail | index | archive | help


Neil Doody wrote:
> Firstly guys thanks all for your feedback.
> 
> Just to answer the questions on your minds, im not neglecting the
> problem, my main problem is that it's a remote server, but my host has
> changed all hardware multiple times, except for the hard drive, though
> that was replaced last night [with another Maxtor I may add] and its
> still doing it.
> 
> The other question, well I generally have been having reboots with this
> messages left in the logs :-
> 
> Sep 16 21:15:07 admin /kernel: Fatal trap 12: page fault while in kernel
> mode
> Sep 16 21:15:07 admin /kernel: fault virtual address    = 0x10
> Sep 16 21:15:07 admin /kernel: fault code               = supervisor
> read, page not present
> 
> However I did recently get one of these after the motherdboard cpu and
> memory had been changed again [I actually had an upgrade to a faster
> cpu] :-
> 
> Nov  5 03:12:04 admin /kernel: vop_panic[vop_open]
> Nov  5 03:12:04 admin /kernel: panic: Filesystem goof
> Nov  5 03:12:04 admin /kernel:
> 
> Now after having the hard drive replaced, I have done a fresh install of
> FreeBSD[4.6.2] because I inadvertently deleted everything off the old
> disk, I tried to do an cvsup and a make world to FreeBSD 4.7.
> 
> I havnt been able to do this successfully after numerous attempts,
> sometimes the server reboots on the trap 12, but mostly I get these :-
> 
> Nov  9 15:24:50 admin /kernel: pid 98573 (cc1), uid 0: exited on signal
> 11 (core dumped)
> 
> So, you was wondering why I wanted to change the make world script, well
> to check there wasn't a bug fix in the latest stable tree, I wanted to
> complete just the one make world, and as I couldn't do it, I was trying
> things to force it on, i.e. starting where it left off, rather than all
> over again.
> 
> Anyway, I found that the signal 11's would come very close after the
> last one, but it took a long time for the first one to occur, so I
> figured it down to load averages.
> 
> Well I used Ctrl-Z to suspend the process as soon as the 30 min average
> counter go near 0.90.
> 
> Doing this has allowed me to complete a buildworld successfully.
> 
> Now, that brings me to your other theories, something that didn't even
> occur to me was over heating, my host is going over to the NOC to check
> this out for me, do you know of any heat monitoring tools for FreeBSD ?
> Maybe I can do some graphs or something ?
> 
> I am quite convinced that it is down to heat, as this is an AMD cpu were
> talking about after all [XP2000] and it would coincide with the hi load
> averages.
> 

I use mbmon on 2000+ XP but it doesn't get one of the temperatures 
right. I haven't tried anything else. It runs along at 49oC. The fan 
on it is larger than AMD provides with their kits.

Kent

> 
> Anyway, thanks all very much for your help, ill keep you posted to my
> problem here, its been going on for months now, but I think im getting
> closer to the problem ;)
> 
> 
> 
> To Unsubscribe: send mail to majordomo@FreeBSD.org
> with "unsubscribe freebsd-questions" in the body of the message
> 
> .
> 


-- 
Kent Stewart
Richland, WA

http://users.owt.com/kstewart/index.html


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-questions" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3DCDD916.5080906>