Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 18 Aug 2004 07:31:01 +0200
From:      =?ISO-8859-1?Q?Eirik_=D8verby?= <ltning@anduin.net>
To:        Julian Elischer <julian@elischer.org>
Cc:        stable@freebsd.org
Subject:   Re: How to find the cause of a hang
Message-ID:  <CA6BB5FA-F0D7-11D8-AE9C-000D9335BCEC@anduin.net>
In-Reply-To: <411E7B28.7050505@elischer.org>
References:  <26ECE35D-EDDB-11D8-945B-000D9335BCEC@anduin.net> <411E7B28.7050505@elischer.org>

next in thread | previous in thread | raw e-mail | index | archive | help


On 14. Aug 2004, at 22:50, Julian Elischer wrote:

> Eirik Øverby wrote:
>> Hi all,
>> I'm currently experiencing frequent (about once per week) hangs of a 
>> server that is about 1500 kilometers away from me. I have a serial 
>> cable on the box, and using minicom on the neighbor box I am now in 
>> the kernel debugger - but I'm at a complete loss as to what to do to 
>> figure out what is, in fact, wrong.
>> Calling panic or boot doesn't work - it just stops at "syncing 
>> disks..." and never actually reboots. I suspect something fishy going 
>> on with disk I/O, but I can't be certain of that.
>> The box responds to ping - until I call panic or boot - but no other 
>> services are working.
>
> try capture a stack trace "tr"

Looks like the box is "idle"

> if you have KTR enabled do "show ktr"

This is 4.x... As my follow-up msg on current@ indicated (yes i posted 
to the wrong list initially ;)

> do "ps"

An insane number of cron processes.. Is it trying to run scheduled jobs 
and fails because of the hang?

> do show pcpu
> show witness
> show locks

No workie on 4.x...

> if you have a dump device define..
>   call doadump

No dump device here (if disk is the problem, it would be of no use I 
guess), and i get undefined symbol anyway.

> then to reboot..
>    "call cpu_reset"

Now that one is handy. ;)

> The dump will appear after the next boot in /var/crash
> if it's not big enough for a complete ram dump, symlink it to 
> somewhere where there is enough room.

(See above)


> when you have all that.. let us know :-)

I have what I have. See
http://anduin.net/~ltning/debug.cap
(it is too long to include in a mail...

Got any clues for me?

Thanks,
/Eirik


>
>
>> What can I do? I'm now at the db> prompt ... Help :)
>> /Eirik
>> _______________________________________________
>> freebsd-current@freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-current
>> To unsubscribe, send any mail to 
>> "freebsd-current-unsubscribe@freebsd.org"
>
>
>
>




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CA6BB5FA-F0D7-11D8-AE9C-000D9335BCEC>