From owner-freebsd-current Thu Aug 6 09:38:06 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id JAA03288 for freebsd-current-outgoing; Thu, 6 Aug 1998 09:38:06 -0700 (PDT) (envelope-from owner-freebsd-current@FreeBSD.ORG) Received: from zone.syracuse.net (zone.syracuse.net [205.232.47.17]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id JAA03240 for ; Thu, 6 Aug 1998 09:37:52 -0700 (PDT) (envelope-from green@zone.syracuse.net) Received: from localhost (green@localhost) by zone.syracuse.net (8.8.8/8.8.7) with SMTP id MAA21946; Thu, 6 Aug 1998 12:37:11 -0400 (EDT) Date: Thu, 6 Aug 1998 12:37:11 -0400 (EDT) From: Brian Feldman To: Vallo Kallaste cc: freebsd-current@FreeBSD.ORG Subject: Re: Crash: Fatal trap 12 In-Reply-To: <19980806172033.46098@matti.ee> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG This seems quite a bit like my current problems, I've noticed only very recently, and with no real regularity. My uptime just won't exceed 1 or 2 days, usually, beacuse I'll get a lockup. It happenes doing different things. This morning, it locked up after switching from vty0 to X, solid, no panic or trap... It crashed ~2 days ago when grepping stuff in /tmp. I have no idea what it was, but I've completely gone over all speed settings for RAM, CPU, to make sure everything's right. Another weird thing is swap was at 122 mb being used, after the 80 mb of system RAM, when there was nothing to be taking up anywhere near as much swap as that, and starting new stuff dived more into swap, while there was still nothing actually USING the swap at all... it just didn't seem to be "going free". Oh, and Sendmail seemed to be SIGSEGVing quite a bit, but a new sendmail.cf and a reboot fixed that. But here's any possibly pertinent info: 100 mb MFS /tmp (I actaully thing that was what was causing the leak of memory, since a program had filled it up to the top and then my swap was really high, removing stuff didn't help, even when I brought it down to 1mb in size SoftUpdates on all FSen CURRENT as of 8/3/98, world of 8/5/98 80 mb ram, K6 200, no special drivers, all EIDE I'd start working on the mfs code, looking for leaks and improper memory freeing, except I have a BSD book to buy and read :) Cheers, Brian Feldman On Thu, 6 Aug 1998, Vallo Kallaste wrote: > Hello ! > > Today I got two crashes sequentially, after that machine is very unstable > > Background: > > I'm working under X3.3.2, some rxvt's and Netscape running, make > buildworld also. At some moment my machine hangs completely, nothing > responds, I can't get out of X or access machine remotely, it even doesn't > respond to ping. I compiled kernel yesterday with new XL driver in memory > mapped mode (commented out one #define in if_xl.c ). My machine hasn't > crashed for a month until now. My sources are dated 1 August. So, I > rebooted machine and it goes up normally, fsck's disks and corrects > something, but just before initial prompt it crashes: > > Fatal trap 12: page fault while in kernel mode > fault virtual address: = 0xeffd6004 > fault code = supervisor read, page not present > instruction pointer = 0x8:0xf01e2207 > stack pointer = 0x10:0xf577bc24 > frame pointer = 0x10:0xf577bc44 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = resume, IOPL = 0 > current process = Idle > interrupt mask = net tty bio cam > kernel: type 12 trap, code=0 > Stopped at _pmap_enter+0x97: movl 0(%ecx),%ecx > db> trace > _pmap_enter(f023f188,f5801000,780e00,7,0,f577bcac,0) at _pmap_enter+0x97 > _vm_fault(f023514c,f5801000,1,0,0) at _vm_fault+0x837 > _trap_pfault(f577abd78,0,7b2d000,6,2000) at _trap_pfault+0x10b > _trap(10,10,2000,6,f577bde4) at _trap+0x35f > calltrap() at calltrap+0x15 > --- trap 0xc, eip = 0xf01e42a0, esp = 0xf577bdb4, ebp = 0xf577bde4 --- > swtch_com(80000000,f577bf84,f576a9c0,f3270110,100) at swtch_com+0x25 > _tsleep(f0245a10,118,f01239bb,1f5,f576a9c0) at _tsleep+0x121 > _poll(f576a9c0,f577bf84) at _poll+0x24c > _syscall(27,27,1388,efbfaa68,efbfabf4) at _syscall+0x10f > _Xsyscall() at _Xsyscall+0x35 > --- syscall 0xd1, eip = 0x2005d6b1, esp = 0xefbfa958, ebp = 0xefbfabf4 --- > db> show registers > cs 0x8 > ds 0x10 > es 0x10 > ss 0x10 > eax 0xeffd6004 > ecx 0xeffd6004 > edx 0xefc00000 _PTmap > ebx 0xf5801000 > esp 0xf577bc24 > ebp 0xf577bc44 > esi 0x780e000 > edi 0xf023f188 _kernel_pmap_store > eip 0xf01e2207 _pmap_enter+0x97 > efl 0x10082 > _pmap_enter+0x97: movl 0(%ecx),%ecx > db> > > This is a first crash traceback, copied by hand. Please remember, I > never haven't done that and I don't have any background knowledge about > debugging. > Now, I rebooted machine and all repeats exactly, only crash is different. > My machine crashes again just before initial prompt: > > mode = 0100644, inum = 266, fs = / > panic: ffs_valloc: deep alloc > Debugger("panic") > stopped at _Debugger+0x35: movb $0,_in_Debugger.98 > db> trace > _Debugger(f011a7d4) at _Debugger+0x35 > _panic(f01b056b,f01b054a,81a4,10a,f094e0d4) at _panic+0x5a > _ffs_valloc(f57648e0,8180,f0910b00,f577fcbc,f577fe20) at _ffs_valloc+0xf6 > _ufs_makeinode(8180,f57648e0,f577ff04,f577ff18) at _ufs_makeinode+0x57 > _ufs_create(f577fe20,f577fe94,f0142b76,f577fe20,0) at _ufs_create+0x26 > _ufs_vnoperate(f577fe20) at _ufs_vnoperate+0x15 > _vn_open(f577fef4,a03,180,f576a880,f021c348) at _vn_open+0x116 > _open(f576a880,f577ff84) at _open+0xad > _syscall(27,27,e052,efbfde9b,efbfde18) at _syscall+0x10f > _Xsyscall() at _Xsyscall+0x35 > --- syscall 0x5, eip = 0x3a35, esp = 0xefbfdaf8, ebp = 0xefbfde18 > db> show registers > cs 0x8 > ds 0x10 > es 0x10 > ss 0x10 > eax 0x12 > ecx 0xf01d9ad3 _db_write_bytes+0x147 > edx 0 > ebx 0x100 > esp 0xf577fc40 > ebp 0xf577fc48 > esi 0xf01b056b _ffs_reallocblks+0x2b > edi 0xf577fcbc > eip 0xf01d9b09 _Debugger+0x35 > efl 0x246 > _Debugger+0x35: movb $0,_in_Debugger.98 > db> > > This is a second crash and traceback. I used command "panic" at db> > prompt, but unfortunately I don't have savecore enabled, so I don't have > crashdump. > After second crash my machine reboots itself and I used -s switch to get > to single user. I fsck'ed filesystems by hand and lost one (not > important because I have backup ). This is not a system partition, for > remark. So far so good. After reboot my machine goes up normally, but are > unstable, console hangs sometimes and Alt+F2 etc. don't work. > > Dmesg: > > Copyright (c) 1992-1998 FreeBSD Inc. > Copyright (c) 1982, 1986, 1989, 1991, 1993 > The Regents of the University of California. All rights reserved. > FreeBSD 3.0-CURRENT #0: Wed Aug 5 18:23:57 EEST 1998 > root@myhakas.matti.ee:/opt/src/sys/compile/Myhakas > Timecounter "i8254" frequency 1193233 Hz cost 2638 ns > Timecounter "TSC" frequency 199440070 Hz cost 137 ns > CPU: Pentium/P54C (199.44-MHz 586-class CPU) > Origin = "GenuineIntel" Id = 0x52c Stepping=12 > Features=0x1bf > real memory = 134217728 (131072K bytes) > avail memory = 127967232 (124968K bytes) > Probing for devices on PCI bus 0: > chip0: rev 0x01 on pci0.0.0 > chip1: rev 0x01 on pci0.7.0 > ide_pci0: rev 0x01 on pci0.7.1 > chip2: rev 0x01 int d irq 10 on pci0.7.2 > chip3: rev 0x01 on pci0.7.3 > ahc0: rev 0x00 int a irq 9 on pci0.9.0 > ahc0: aic7880 Wide Channel, SCSI Id=7, 16/255 SCBs > ahc0: waiting for scsi devices to settle > scbus0 at ahc0 bus 0 > ahc0:A:0: refuses WIDE negotiation. Using 8bit transfers > cd0 at scbus0 target 0 lun 0 > cd0: type 5 removable SCSI 2 > cd0: CD-ROM can't get the size > st0 at scbus0 target 1 lun 0 > st0: type 1 removable SCSI 2 > st0: Sequential-Access density code 0x45, drive empty > xl0: <3Com 3c905B Fast Etherlink XL 10/100BaseTX> rev 0x24 int a irq 5 on > pci0.10.0 > xl0: Ethernet address: 00:a0:24:4e:3e:57 > vga0: rev 0x9a on pci0.11.0 > Probing for PnP devices: > CSN 1 Vendor ID: GRV0001 [0x0100561e] Serial 0x00000484 > mss_attach 1 at 0x328 irq 11 dma 7:5 flags 0x15 > pcm1 (GusPnP sn 0x00000484) at 0x328-0x32f irq 11 drq 7 flags 0x15 on > isa > Probing for devices on the ISA bus: > sc0 at 0x60-0x6f irq 1 on motherboard > sc0: VGA color <16 virtual consoles, flags=0x0> > psm0 at 0x60-0x64 irq 12 on motherboard > psm0: model Generic PS/2 mouse, device ID 0 > sio0 at 0x3f8-0x3ff irq 4 on isa > sio0: type 16550A > sio1 at 0x2f8-0x2ff irq 3 on isa > sio1: type 16550A > pcm0 not found > wdc0 at 0x1f0-0x1f7 irq 14 flags 0xb008b008 on isa > wdc0: unit 0 (wd0): , LBA, 32-bit, multi-block-8 > wd0: 814MB (1667232 sectors), 827 cyls, 32 heads, 63 S/T, 512 B/S > wdc0: unit 1 (wd1): , LBA, DMA, 32-bit, > multi-block-8 > wd1: 1151MB (2358720 sectors), 585 cyls, 64 heads, 63 S/T, 512 B/S > wdc1 at 0x170-0x177 irq 15 flags 0xb008b008 on isa > wdc1: unit 0 (wd2): , LBA, DMA, 32-bit, multi-block-8 > wd2: 2014MB (4124736 sectors), 1023 cyls, 64 heads, 63 S/T, 512 B/S > wdc1: unit 1 (wd3): , LBA, DMA, 32-bit, multi-block-8 > wd3: 2014MB (4124736 sectors), 1023 cyls, 64 heads, 63 S/T, 512 B/S > fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa > fdc0: FIFO enabled, 8 bytes threshold > fd0: 1.44MB 3.5in > npx0 on motherboard > npx0: INT 16 interface > ppc0 at 0x378 irq 7 on isa > ppc0: Generic chipset in EPP mode (EPP 1.9) > Intel Pentium F00F detected, installing workaround > changing root device to wd0s1a > WARNING: / was not properly dismounted. > xl0: autoneg complete, link status good (half-duplex, 10Mbps) > ffs_mountfs: superblock updated > ffs_mountfs: superblock updated > ffs_mountfs: superblock updated > ffs_mountfs: superblock updated > ffs_mountfs: superblock updated > ffs_mountfs: superblock updated > > Kernel config: > > # Myhakas > # > > machine "i386" > ident Myhakas > maxusers 12 > > config kernel root on wd0 > > options PQ_LARGECACHE # color for 512k/16k cache > > cpu "I586_CPU" # aka Pentium(tm) > #cpu "I686_CPU" # aka Pentium Pro(tm) > > options "CPU_FASTER_5X86_FPU" > #options "NO_F00F_HACK" > > options "COMPAT_43" > options SYSVSHM > options SYSVSEM > options SYSVMSG > options "MD5" > options "VM86" > options DDB > #options DDB_UNATTENDED > #options KTRACE #kernel tracing > options PERFMON > > options INET #Internet communications protocols > > pseudo-device ether #Generic Ethernet > pseudo-device loop #Network loopback device > pseudo-device bpfilter 4 #Berkeley packet filter > pseudo-device tun 1 #Tunnel driver (user process ppp(8)) > #pseudo-device sl 1 #Serial Line IP > #pseudo-device ppp 1 #Point-to-point protocol > #options PPP_BSDCOMP #PPP BSD-compress support > #options PPP_DEFLATE #PPP zlib/deflate/gzip support > #options PPP_FILTER #enable bpf filtering (needs bpfilter) > > options FFS #Fast filesystem > options NFS #Network File System > # options NFS_NOSERVER #Disable the NFS-server code. > options "CD9660" #ISO 9660 filesystem > #options KERNFS #Kernel filesystem > options MFS #Memory File System > options MSDOSFS #MS DOS File System > options PROCFS #Process filesystem > options FFS_ROOT #FFS usable as root device > > options SOFTUPDATES > > controller scbus0 #base SCSI code > device sd0 #SCSI disks > device st0 #SCSI tapes > device cd0 #SCSI CD-ROMs > > options "P1003_1B" > options "_KPOSIX_PRIORITY_SCHEDULING" > options "_KPOSIX_VERSION=199309L" > > pseudo-device pty 16 #Pseudo ttys - can go as high as 256 > pseudo-device gzip #Exec gzipped a.out's > pseudo-device vn #Vnode driver (turns a file into a device) > pseudo-device snp 3 #Snoop device - to look at pty/vty/etc.. > #pseudo-device ccd 4 #Concatenated disk driver > #pseudo-device su #scsi user > #pseudo-device ssc #super scsi > > controller isa0 > > #options "AUTO_EOI_1" > #options "AUTO_EOI_2" > #options BOUNCE_BUFFERS > > controller pnp0 > > device sc0 at isa? port "IO_KBD" tty irq 1 vector scintr > options MAXCONS=16 # number of virtual consoles > #options SLOW_VGA # do byte-wide i/o's to TS and > G > DC regs > #options "STD8X16FONT" # Compile font in > #makeoptions "STD8X16FONT"="cp850" > options SC_HISTORY_SIZE=600 # number of history buffer lines > options SC_DISABLE_REBOOT # disable reboot key sequence > > device npx0 at isa? port "IO_NPX" iosiz 0x0 flags 0x0 irq 13 > vector > npxintr > > controller wdc0 at isa? port "IO_WD1" bio irq 14 flags 0xb008b008 > vector > wdintr > disk wd0 at wdc0 drive 0 > disk wd1 at wdc0 drive 1 > controller wdc1 at isa? port "IO_WD2" bio irq 15 flags 0xb008b008 > vector > wdintr > disk wd2 at wdc1 drive 0 > disk wd3 at wdc1 drive 1 > > options ATAPI #Enable ATAPI support for IDE bus > #options ATAPI_STATIC #Don't do it as an LKM > device wcd0 > > controller fdc0 at isa? port "IO_FD1" bio irq 6 drq 2 vector fdintr > disk fd0 at fdc0 drive 0 > #disk fd1 at fdc0 drive 1 > > #device lpt0 at isa? port? tty irq 7 vector lptintr > #device lpt1 at isa? port "IO_LPT3" tty irq 5 vector lptintr > device psm0 at isa? port "IO_KBD" conflicts tty irq 12 vector > psmint > r > device sio0 at isa? port "IO_COM1" tty irq 4 vector siointr > device sio1 at isa? port "IO_COM2" tty irq 3 vector siointr > > device pcm0 at isa? port ? tty irq ? drq ? vector pcmintr > > controller ahc0 > options AHC_TAGENABLE > options AHC_SCBPAGING_ENABLE > options AHC_ALLOW_MEMIO > > controller pci0 > device de0 > device xl0 > > controller ppbus0 > controller vpo0 at ppbus? > controller ppc0 at isa? port ? irq 7 vector ppcintr > > options CLK_CALIBRATION_LOOP > options "CLK_USE_I8254_CALIBRATION" > options CLK_USE_TSC_CALIBRATION > options COMPAT_LINUX > options PANIC_REBOOT_WAIT_TIME=30 > options SCSI_DELAY=5 # Be pessimistic about Joe SCSI device > options SHMMAXPGS=1025 > > > I have one comment, I worked one whole day with new XL driver enabled without > a hitch, but this was on the 10Mbit half duplex network. Half hour before first > crash I switched over to 100Mbit full-duplex segment on our 3Com SuperStack II > 1100 switch. Currently my network connection is 10Mbit half-duplex, I switched > back because seems 100Mbit connection causes my machine unstability. > > Thanks > > > Vallo Kallaste > vallo@matti.ee > > To Unsubscribe: send mail to majordomo@FreeBSD.org > with "unsubscribe freebsd-current" in the body of the message > To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message