Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 21 Aug 1995 12:00:48 +0200
From:      j@bonnie.heep.sax.de (J Wunsch)
To:        current@freebsd.org, fmayhar@locus.com
Subject:   Re: Show-stopper crashes on 2.1-STABLE and 2.1.0-072695-SNAP.
Message-ID:  <199508211000.MAA26738@bonnie.tcd-dresden.de>
In-Reply-To: <410bgk$3vij@janus.la.locus.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Since Frank has apparently email problems, and i'm not sure if this
report appeared in the lists, is it a known problem?

In article <410bgk$3vij@janus.la.locus.com> you write:
>I sent this to hackers, but since my home uucp provider has decided to
>not accept any email from freefall.cdrom.com, it's doing me little good.
>At the risk of injecting some hard-core technical discussion, I'm posting
>this here to see if I can finally get this problem resolved.
>
>The (slightly modified) message as sent to hackers follows:
>
>My less immediately critical problem is with the XFree86 3.1.1 server; I
>run the S3 server against an ISA Actix Graphics Engine Ultra +, and after
>a short time, it appears to hang.  A ps (from an async terminal) shows it
>racking up CPU; sometimes there are visual flashes or short-lived bits of
>garbage on the screen shortly before it crashes.  Is there a known hardware
>incompatibility between the ASUS motherboard and the Actix card?  Note
>that I've ordered a Number Nine GXE64Pro, so this should shortly become
>a non-issue, but I thought I would mention it anyway.
>
>Now the critical problem:
>
>I just upgraded my hardware to a P100/PCI motherboard (configuration is
>below), and needed to upgrade to FreeBSD 2.0.5 or better since 1.1.5.1
>(which I had been running) doesn't support my new hardware.  I've had
>nothing but headaches.  The real problem is that I can't run News; email
>works okay, but INN pounds the system hard enough that it crashes almost
>immediately (if I don't run News, it crashes eventually, but it takes a
>lot longer -- although I've had it crash immediately after boot).
>
>My configuration:
>
>Pentium 100, ASUS P55TP4XE motherboard, pipeline burst SRAM, 32 MB 60ns
>memory, Adaptec 2940W, Maxtor LXT340sy + 2 Toshiba MK538FBs (340 MB, and
>two 1.2 GB), Archive Viper 2150S tape, Actix GraphicsEngine Ultra +.
>
>Crash:
>
>Running 2.1.0-072695-SNAP, and INN 1.4 compiled with MMAP.  I get a
>compressed uucp newsfeed; while unpacking news, the system quickly
>crashes with:
>
>Fatal trap 12: page fault while in kernel mode
>fault virtual address		= 0xf81ef050
>fault code			= supervisor write, page not present
>instruction pointer		= 0x8:0x4018679c
>code segment			= base 0x0, limit 0xfffff, type 0x1b
>				= DPL 0, pres 1, def 321, gran 1
>processor eflags		= interrupt enabled, resume, IOPL=0
>current process			= 11503 (gzip)
>interrupt mask			= net tty bio
>panic: page fault
>
>I finally got a trace courtesy of DDB (not corresponding to the above
>message, but the same fault code and under the same circumstances):
>
>_trap_fatal(efbffb4c,c,f0c10500,efbffb4c,f0b43e00) at _trap_fatal+0x277
>_trap_pfault(efbffb4c,0,f024fb10,f01cbd60,f26db6f8) at _trap_pfault+0x158
>_trap(10,10,f26eb6f8,f01cbd60,efbffb94) at _trap+0x27b
>calltrap(f01cbd60,2bfe0000,0,f26db6f8,0) at calltrap+0x15
>_vm_hold_load_pages(f26eb6f8,f2bbc000,f2bbe000,f26db6f8) at _vm_hold_load_pages+0x4c
>_allocbuf(f26eb6f8,2000,efbffc98,efbffd10,ffffffff) at _allocbuf+0x8a
>_getblk(f0c7cd00,b,2000,0,0) at _getblk+0x23a
>_bread(f0c7cd00,b,2000,ffffffff,efbffc98) at _bread+0x21
>_ffs_blkatoff(efbffd10,f0c7cd00,efbfff0c,efbffef8,f0bf2700) at _ffs_blkatoff+0xc3
>_ufs_lookup(efbffd74,0,efbfff0c,efbffee8,1) at _ufs_lookup+0x44a
>_lookup(efbffee8,0,efbfff94,602,f0277f2c) at _lookup+0x256
>_namei(efbffee8,0,efbfff94,f0c10500,f0c10500) at _namei+0x122
>_vn_open(efbffee8,602,1b4,efbfff94,f0c10500) at _vn_open+0x5a
>_open(f0c10500,efbfff94,efbfff8c,842006,0) at _open+0x97
>_syscall(27,27,287d4,0,efbfd2fc) at _syscall+0x161
>
>That was with 2.1.0-072695-SNAP.  This is with 2.1-STABLE:
>
>I tried INN built with MMAP.  No joy, it looked like it crashed in a page
>fault due to an mmap (nothing on the stack below the _trap_pfault()).  I
>rebuilt INN with READ instead of MMAP.  It got further this time, but
>finally crashed in _cache_lookup():
>
>Fatal trap 12: page fault while in kernel mode
>fault virtual address	= 0xfccb074c
>fault code		= supervisor write, page not present
>instruction pointer	= 0x8:0xf01265f5
>code segment		= base 0x0, limit 0xfffff, type 0x1b
>			= DPL 0, pres 1, def32 1, gran 1
>processor eflags	= interrupt enabled, resume, IOPL = 0
>current process		= 4580 (innd)
>interrupt mask		= 
>panic: from debugger
>
>Stack trace:
>_cache_lookup(f0b90e00,efbffef8,efbfff0c,f0b90e00,efbfff0c) at _cache_lookup+0x195
>_ufs_lookup(efbffd74,0,efbfff0c,efbffee8,1) at _ufs_lookup+0xdb
>_lookup(efbffee8,0,efbfff94,602,f02535c4) at _lookup+0x256
>_namei(efbffee8,0,efbfff94,f0cd6d00,f0cd6d00) at _namei+0x122
>_vn_open(efbffee8,602,1b4,efbfff94,f0cdbd00) at _vn_open+0x5a
>_open(f0cd6d00,efbfff94,efbfff8c,878006,0) at _open+0x97
>_syscall(26,26,286d4,0,efbfd2f4) at 0x161
>
>I've poked around in cache_lookup() and ufs_lookup() looking for any obvious
>errors, but found nothing I could point to as the culprit; someone more
>knowledgable of the code needs to take a look at it, and maybe give me some
>advice as to what to look at.
>
>I can reproduce these pretty much at will (and they quite often happen all
>by themselves, on an only slightly loaded system), so if there's anything
>else that anyone needs me to look at, let me know.  If it's already solved,
>so much the better (but I doubt that it is).  I'm available pretty much
>any time for diagnostic work, so if someone wants to get on the phone with
>me and work on this thing, send me email (at home at 'frank@exit.com' or
>at work at 'fmayhar@locus.com', one will get forwarded to the other
>automatically) and I'll give you my number.
>
>Thanks in advance for any help anyone can give me.  I quite understand that
>everyone is very busy (join the crowd), but this is being a great aggravation,
>particularly since my only other choice is to leave my $2000 worth of nice
>new hardware sitting idle while I go back to running 1.1.5.1 on a crufty old
>486/33 box.
>
>Here's my config file, and a copy of the 'dmesg' output, just for grins:
>#
># GENERIC -- Generic machine with WD/AHx/NCR/BTx family disks
>#
>#	$Id: GENERIC,v 1.46 1995/06/11 19:31:11 rgrimes Exp $
>#
>
>machine		"i386"
>cpu		"I586_CPU"
>ident		TINKER
>maxusers	64
>
>#options		MATH_EMULATE		#Support for x87 emulation
>options		INET			#InterNETworking
>options		FFS			#Berkeley Fast Filesystem
>options		NFS			#Network Filesystem
>options		MFS			#Memory File System
>options		MSDOSFS			#MSDOS Filesystem
>#options		"CD9660"		#ISO 9660 Filesystem
>options		PROCFS			#Process filesystem
>options		"COMPAT_43"		#Compatible with BSD 4.3
>options		"SCSI_DELAY=15"		#Be pessimistic about Joe SCSI device
>#options		BOUNCE_BUFFERS		#include support for DMA bounce buffers
>options		UCONSOLE		#Allow users to grab the console
>options		"NSWAPDEV=4"
>options		SYSVSHM
>options		SYSVSEM
>options		SYSVMSG
>options		KTRACE			#kernel tracing
>
>options		DDB			#kernel debugger
>
>config		kernel	root on sd0 
>
>controller	isa0
>controller	pci0
>
>controller	fdc0	at isa? port "IO_FD1" bio irq 6 drq 2 vector fdintr
>disk		fd0	at fdc0 drive 0
>disk		fd1	at fdc0 drive 1
>#tape		ft0	at fdc0 drive 2
>
>#controller	wdc0	at isa? port "IO_WD1" bio irq 14 vector wdintr
>#disk		wd0	at wdc0 drive 0
>#disk		wd1	at wdc0 drive 1
>
>#controller	wdc1	at isa? port "IO_WD2" bio irq 15 vector wdintr
>#disk		wd2	at wdc1 drive 0
>#disk		wd3	at wdc1 drive 1
>
>#controller	ncr0
>#controller	ahc0
>controller	ahc0	at pci? bio irq ? vector ahcintr
>
>#controller	bt0	at isa? port "IO_BT0" bio irq ? vector btintr
>#controller	uha0	at isa? port "IO_UHA0" bio irq ? drq 5 vector uhaintr
>#controller	ahc1	at isa? bio irq ? vector ahcintr
>#controller	ahb0	at isa? bio irq ? vector ahbintr
>#controller	aha0	at isa? port "IO_AHA0" bio irq ? drq 5 vector ahaintr
>#controller	aic0    at isa? port 0x340 bio irq 11 vector aicintr
>#controller	nca0	at isa? port 0x1f88 bio irq 10 vector ncaintr
>#controller	nca1	at isa? port 0x350 bio irq 5 vector ncaintr
>#controller	sea0	at isa? bio irq 5 iomem 0xc8000 iosiz 0x2000 vector seaintr
>
>controller	scbus0
>
>device		sd0
>
>device		st0
>
>device		cd0	#Only need one of these, the code dynamically grows
>
>device		wt0	at isa? port 0x300 bio irq 5 drq 1 vector wtintr
>#device		mcd0	at isa? port 0x300 bio irq 10 vector mcdintr
>#device		mcd1	at isa? port 0x340 bio irq 11 vector mcdintr
>
>#controller	matcd0	at isa? port ? bio
>
>#device		scd0	at isa? port 0x230 bio
>
># syscons is the default console driver, resembling an SCO console
>device		sc0	at isa? port "IO_KBD" tty irq 1 vector scintr
># Enable this and PCVT_FREEBSD for pcvt vt220 compatible console driver
>#device		vt0	at isa? port "IO_KBD" tty irq 1 vector pcrint
>#options		"PCVT_FREEBSD=210"	# pcvt running on FreeBSD 2.1
>options		XSERVER			# include code for XFree86
>
>device		npx0	at isa? port "IO_NPX" irq 13 vector npxintr
>
>device		sio0	at isa? port "IO_COM1" tty irq 4 vector siointr
>device		sio1	at isa? port "IO_COM2" tty irq 3 vector siointr
>device		sio2	at isa? port 0x338 tty irq 12 vector siointr
>#device		sio2	at isa? port "IO_COM3" tty irq 5 vector siointr
>#device		sio3	at isa? port "IO_COM4" tty irq 9 vector siointr
>
>device		lpt0	at isa? port? tty irq 7 vector lptintr
>#device		lpt1	at isa? port? tty
>#device		lpt2	at isa? port? tty
>
># Order is important here due to intrusive probes, do *not* alphabetize
># this list of network interfaces until the probes have been fixed.
># Right now it appears that the ie0 must be probed before ep0. See
># revision 1.20 of this file.
>#device de0
>#device ed0 at isa? port 0x280 net irq  5 iomem 0xd8000 vector edintr
>#device ed1 at isa? port 0x300 net irq  5 iomem 0xd8000 vector edintr
>#device ie0 at isa? port 0x360 net irq  7 iomem 0xd0000 vector ieintr
>#device ep0 at isa? port 0x300 net irq 10 vector epintr
>#device ix0 at isa? port 0x300 net irq 10 iomem 0xd0000 iosiz 32768 vector ixintr
>#device le0 at isa? port 0x300 net irq 5 iomem 0xd0000 vector le_intr
>#device lnc0 at isa? port 0x280 net irq 10 drq 0 vector lncintr
>#device lnc1 at isa? port 0x300 net irq 10 drq 0 vector lncintr
>#device ze0 at isa? port 0x300 net irq 5 iomem 0xd8000 vector zeintr
>#device zp0 at isa? port 0x300 net irq 10 iomem 0xd8000 vector zpintr
>
>controller	snd0
>device sb0      at isa? port 0x220 irq 7 conflicts drq 1 vector sbintr
>device pas0     at isa? port 0x388 irq 10 drq 6 vector pasintr
>device opl0     at isa? port 0x388
>
>pseudo-device	loop
>pseudo-device	ether
>pseudo-device	log
>#pseudo-device	sl	1
># ijppp uses tun instead of ppp device
>#pseudo-device	ppp	1
>pseudo-device	tun	2
>pseudo-device	pty	16
>pseudo-device	gzip		# Exec gzipped a.out's
>pseudo-device	bpfilter	4	#Berkeley packet filter
>
>
>FreeBSD 2.1-STABLE #0: Wed Aug 16 08:34:38  1995
>    root@exit.com:/usr/src/sys/compile/TINKER
>CPU: 99-MHz Pentium 735\\90 or 815\\100 (Pentium-class CPU)
>  Origin = "GenuineIntel"  Id = 0x525  Stepping=5
>  Features=0x1bf<FPU,VME,PSE,MCE,CX8,APIC>
>real memory  = 33161216 (8096 pages)
>avail memory = 31178752 (7612 pages)
>Probing for devices on the ISA bus:
>sc0 at 0x60-0x6f irq 1 on motherboard
>sc0: VGA color <16 virtual consoles, flags=0x0>
>sio0 at 0x3f8-0x3ff irq 4 on isa
>sio0: type 16550A
>sio1 at 0x2f8-0x2ff irq 3 on isa
>sio1: type 16550A
>sio2 at 0x338-0x33f irq 12 on isa
>sio2: type 16450
>lpt0 at 0x378-0x37f irq 7 on isa
>lpt0: Interrupt-driven port
>lp0: TCP/IP capable interface
>fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
>fdc0: NEC 72065B
>fd0: 1.44MB 3.5in
>fd1: 1.2MB 5.25in
>wt0 not found at 0x300
>npx0 on motherboard
>npx0: INT 16 interface
>sb0 at 0x220 irq 7 drq 1 on isa
>sb0: <SoundBlaster 2.0>
>pas0 at 0x388 irq 10 drq 6 on isa
>pas0: <CDPC rev 0>
>opl0 not probed due to I/O address conflict with pas0 at 0x388
>Probing for devices on the pci0 bus:
>	configuration mode 1 allows 32 devices.
>chip0 <CPU-PCI bridge> rev 2 on pci0:0
>chip1 <PCI-ISA bridge> rev 2 on pci0:7
>ahc0 <Adaptec 2940 SCSI host adapter> rev 0 int a irq 11 on pci0:12
>ahc0: reading board settings
>ahc0: Reading SEEPROM...done.
>ahc0: 2940 Wide Channel, SCSI Id=7, aic7870, 16 SCBs
>ahc0: Downloading Sequencer Program...Done
>ahc0: Probing channel A
>ahc0 waiting for scsi devices to settle
>ahc0: target 0 synchronous at 4.4MB/s, offset = 0xf
>(ahc0:0:0): "MAXTOR LXT-340S 6.74" type 0 fixed SCSI 1
>sd0(ahc0:0:0): Direct-Access 324MB (665154 512 byte sectors)
>ahc0: target 1 synchronous at 6.67MB/s, offset = 0xf
>(ahc0:1:0): "TOSHIBA MK538FB 6061" type 0 fixed SCSI 2
>sd1(ahc0:1:0): Direct-Access 1170MB (2396970 512 byte sectors)
>ahc0: target 2 synchronous at 6.67MB/s, offset = 0xf
>(ahc0:2:0): "TOSHIBA MK538FB 6030" type 0 fixed SCSI 2
>sd2(ahc0:2:0): Direct-Access 1172MB (2400302 512 byte sectors)
>pci0: uses 4096 bytes of memory from fbff7000 upto fbff7fff.
>pci0: uses 256 bytes of I/O space from e400 upto e4ff.
>bpf: lo0 attached
>bpf: tun0 attached
>bpf: tun1 attached
>-- 
>Frank Mayhar frank@exit.com
>


-- 
cheers, J"org                      private:   joerg_wunsch@uriah.heep.sax.de
                                   http://www.sax.de/~joerg/

Never trust an operating system you don't have sources for. ;-)



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199508211000.MAA26738>