Date: Thu, 12 Oct 2000 05:30:02 -0700 (PDT)
From: Josef Karthauser <joe@pavilion.net>
To: freebsd-bugs@FreeBSD.org
Subject: Re: kern/21915: Machine dies sig 12 in ahc driver (Freebsd4.1.1)
Message-ID: <200010121230.FAA81781@freefall.freebsd.org>
The following reply was made to PR kern/21915; it has been noted by GNATS.

From: Josef Karthauser <joe@pavilion.net>
To: systems@pavilion.net
Cc: FreeBSD-gnats-submit@freebsd.org, gibbs@freebsd.org
Subject: Re: kern/21915: Machine dies sig 12 in ahc driver (Freebsd4.1.1)
Date: Thu, 12 Oct 2000 13:27:35 +0100

Had another go today, incorporated Justin's MFC of last night. The
problem still exists though.

I had originally reported the problems as happening as soon as rsync's
filelist build had completed, but upon reflection we get 20-30 seconds
of file transfer happening first before the bus error.

I've also set up the same thing with an rsync from an internal disk to
the vinum partition:

    # rsync -vaW /usr /data/usr-copy

This appears to work, i.e. it ran for several minutes before I stopped
it with a ^c.

My suspicion is that there's a race condition somewhere which is
causing the bus to crash - somewhere between the ahc and the fxp
drivers. I've not investigated deeper though, not having any
familiarity with the device driver code.

Joe

On Wed, Oct 11, 2000 at 05:39:31PM +0100, systems@pavilion.net wrote:
>
> >Number: 21915
> >Category: kern
> >Synopsis: Machine dies sig 12 in ahc driver (Freebsd4.1.1)
> >Confidential: no
> >Severity: critical
> >Priority: high
> >Responsible: freebsd-bugs
> >State: open
> >Quarter:
> >Keywords:
> >Date-Required:
> >Class: sw-bug
> >Submitter-Id: current-users
> >Arrival-Date: Wed Oct 11 09:40:00 PDT 2000
> >Closed-Date:
> >Last-Modified:
> >Originator: Joe Karthauser
> >Release: FreeBSD-4.1.1
> >Organization:
> Pavilion Internet plc
> >Environment:
>
> Copyright (c) 1992-2000 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
>     The Regents of the University of California. All rights reserved.
> FreeBSD 4.1.1-STABLE #0: Wed Oct 11 15:19:45 BST 2000
>     root@:/usr/obj/usr/src/sys/BLASTER
> Timecounter "i8254" frequency 1193182 Hz
> Timecounter "TSC" frequency 801823410 Hz
> CPU: Pentium III/Pentium III Xeon/Celeron (801.82-MHz 686-class CPU)
>   Origin = "GenuineIntel" Id = 0x683 Stepping = 3
>   Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
> real memory = 805306368 (786432K bytes)
> avail memory = 780689408 (762392K bytes)
> Preloaded elf kernel "kernel" at 0xc02e4000.
> Pentium Pro MTRR support enabled
> md0: Malloc disk
> npx0: <math processor> on motherboard
> npx0: INT 16 interface
> pcib0: <Host to PCI bridge> on motherboard
> pci0: <PCI bus> on pcib0
> pcib2: <VIA 82C598MVP (Apollo MVP3) PCI-PCI (AGP) bridge> at device 1.0 on pci0
> pci1: <PCI bus> on pcib2
> isab0: <VIA 82C596B PCI-ISA bridge> at device 7.0 on pci0
> isa0: <ISA bus> on isab0
> atapci0: <VIA 82C596 ATA66 controller> port 0xe000-0xe00f at device 7.1 on pci0
> ata0: at 0x1f0 irq 14 on atapci0
> ata1: at 0x170 irq 15 on atapci0
> pci0: <VIA 83C572 USB controller> at 7.2 irq 11
> fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xe800-0xe83f mem 0xea000000-0xea0fffff,0xea101000-0xea101fff irq 10 at device 9.0 on pci0
> fxp0: Ethernet address 00:02:b3:03:75:62
> ahc0: <Adaptec 29160N Ultra160 SCSI adapter> port 0xec00-0xecff mem 0xea100000-0xea100fff irq 5 at device 10.0 on pci0
> aic7892: Wide Channel A, SCSI Id=7, 32/255 SCBs
> pci0: <Matrox MGA Millennium II 2164W graphics accelerator> at 11.0 irq 9
> pcib1: <Host to PCI bridge> on motherboard
> pci2: <PCI bus> on pcib1
> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
> fdc0: FIFO enabled, 8 bytes threshold
> fd0: <1440-KB 3.5" drive> on fdc0 drive 0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
> kbd0 at atkbd0
> psm0: <PS/2 Mouse> irq 12 on atkbdc0
> psm0: model Generic PS/2 mouse, device ID 0
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> sc0: <System console> at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x300>
> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
> sio0: type 16550A
> sio1 at port 0x2f8-0x2ff irq 3 on isa0
> sio1: type 16550A
> ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
> ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
> lpt0: <Printer> on ppbus0
> lpt0: Interrupt-driven port
> plip0: <PLIP network interface> on ppbus0
> ppi0: <Parallel I/O> on ppbus0
> acd0: CDROM <ATAPI 52X CDROM> at ata0-master using PIO4
> Waiting 3 seconds for SCSI devices to settle
> Mounting root from ufs:/dev/da0s1a
> da0 at ahc0 bus 0 target 0 lun 0
> da0: <COMPAQ AB009322B4 A019> Fixed Direct Access SCSI-2 device
> da0: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled
> da0: 8678MB (17773500 512 byte sectors: 255H 63S/T 1106C)
> da3 at ahc0 bus 0 target 3 lun 0
> da3: <FUJITSU MAJ3364MC 0114> Fixed Direct Access SCSI-4 device
> da3: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
> da3: 34858MB (71390320 512 byte sectors: 255H 63S/T 4443C)
> da2 at ahc0 bus 0 target 2 lun 0
> da2: <FUJITSU MAJ3364MC 0114> Fixed Direct Access SCSI-4 device
> da2: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
> da2: 34858MB (71390320 512 byte sectors: 255H 63S/T 4443C)
> da1 at ahc0 bus 0 target 1 lun 0
> da1: <FUJITSU MAJ3364MC 0114> Fixed Direct Access SCSI-4 device
> da1: 160.000MB/s transfers (80.000MHz, offset 127, 16bit), Tagged Queueing Enabled
> da1: 34858MB (71390320 512 byte sectors: 255H 63S/T 4443C)
> WARNING: / was not properly dismounted
> vinum: loaded
> vinum: reading configuration from /dev/da1s1d
> vinum: updating configuration from /dev/da2s1d
> vinum: updating configuration from /dev/da3s1d
>
>
>
> # cat /etc/fstab
>
> # Device              Mountpoint  FStype  Options    Dump  Pass#
> /dev/da0s1b           none        swap    sw         0     0
> /dev/da0s1a           /           ufs     rw         1     1
> /dev/da0s1h           /home       ufs     rw         2     2
> /dev/da0s1g           /tmp        ufs     rw         2     2
> /dev/da0s1f           /usr        ufs     rw         2     2
> /dev/da0s1e           /var        ufs     rw         2     2
> /dev/acd0c            /cdrom      cd9660  ro,noauto  0     0
> /dev/vinum/dataraid5  /data       ufs     rw,noauto  2     2
> proc                  /proc       procfs  rw         0     0
>
>
>
> # vinum dumpconfig
>
> Drive raid5drive0: Device /dev/da1d
> Created on temp126.staff.pavilion.net at Tue Oct 10 16:25:38 2000
> Config last updated Wed Oct 11 18:29:58 2000
> Size: 36544886784 bytes (34851 MB)
> volume dataraid5 state up
> plex name dataraid5.p0 state up org raid5 512s vol dataraid5
> sd name dataraid5.p0.s0 drive raid5drive0 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 0s
> sd name dataraid5.p0.s1 drive raid5drive1 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 512s
> sd name dataraid5.p0.s2 drive raid5drive2 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 1024s
>
> Drive /dev/da1d: 34 GB (36544886784 bytes)
> Drive raid5drive1: Device /dev/da2d
> Created on temp126.staff.pavilion.net at Tue Oct 10 16:25:59 2000
> Config last updated Wed Oct 11 18:29:58 2000
> Size: 36544886784 bytes (34851 MB)
> volume dataraid5 state up
> plex name dataraid5.p0 state up org raid5 512s vol dataraid5
> sd name dataraid5.p0.s0 drive raid5drive0 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 0s
> sd name dataraid5.p0.s1 drive raid5drive1 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 512s
> sd name dataraid5.p0.s2 drive raid5drive2 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 1024s
>
> Drive /dev/da2d: 34 GB (36544886784 bytes)
> Drive raid5drive2: Device /dev/da3d
> Created on temp126.staff.pavilion.net at Tue Oct 10 16:26:00 2000
> Config last updated Wed Oct 11 18:29:58 2000
> Size: 36544886784 bytes (34851 MB)
> volume dataraid5 state up
> plex name dataraid5.p0 state up org raid5 512s vol dataraid5
> sd name dataraid5.p0.s0 drive raid5drive0 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 0s
> sd name dataraid5.p0.s1 drive raid5drive1 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 512s
> sd name dataraid5.p0.s2 drive raid5drive2 plex dataraid5.p0 state up len 71376384s driveoffset 265s plexoffset 1024s
>
> Drive /dev/da3d: 34 GB (36544886784 bytes)
>
>
>
>
>
>
> >Description:
>
> Rsyncing a remote machine's data onto this machine's
> vinum drive causes the machine to bus error.
>
> This is repeatable, and occurs on two exactly similar machines.
>
>
> Fatal trace 12: page fault while in kernel mode
> fault virtual address   = 0xbfc00000
> fault code              = supervisor read, page not present
> intruction pointer      = 0x8:0xc0210222
> stack pointer           = 0x10:0xc025e55c
> frame pointer           = 0x10:0xc025e708
> code segment            = base 0x0, limit 0xfffff, type 0x1b
>                         = DPL 0, pres 1, def32 1, gran 1
> processor eflags        = interrupt enabled, resume, IOPL = 0
> current process         = Idle
> interrupt mask          =
> kernel: type 12 trap, code=0
>
> stopped at: bus_dmamap_load+0x1ca: movl PTmap(,%eax,4),%edx
>
> db>trace
> bus_dmamap_load(c1e5cdc0,c0290700,0,0,c013af88,c1e5e9cc,0) at bus_dmamap_load+0x1ca
> ahc_setup_data(c1e4d200,c1e5cc40,c21c9000,c1e5e9cc,c1e62244) at ahc_setup_data+0x17e
> ahc_action(c1e5cc40,c21c9000,1,c1f36a90,6c0000) at ahc_action+0x2ec
> xpt_run_dev_sendq(c1e5cc00) at xpt_run_dev_sendq+0x1cb
> xpt_action(c21c9000,680000) at xpt_action+0x23f
> dastart(c1f2b500,c21c9000,c21c9000,c1f36a90,1,680000,c1e62230,1) at dastart+0x1cc
> xpt_run_dev_allocq(c1e5cc00,c21c9000,c1f3e000,c21c9000,680000) at xpt_run_dev_allocq+0x80
> xpt_release_ccb(c21c9000,c21c9000,c1e62200,680000,680000) at xpt_release_ccb+0xeb
> dadone(c1f2b500,c21c9000,0,0,ffffffff) at dadone+0x459
> camisr(c028a630,0,c021309f,0,400010) at camisr+0x1eb
> swi_cambio(0,400010,c0250010,c0120010,ffffffff) at swi_cambio+0xd
> doreti_swi() at doreti_swi+0xf
>
>
>
> >How-To-Repeat:
>
> # cd /data
> # rsync -vaW -e ssh remote.machine:/ localcopy/
> [wait for file list to be compiled - then _bang_]
>
> >Fix:
>
> None.
>
>
> >Release-Note:
> >Audit-Trail:
> >Unformatted:
>
>
> To Unsubscribe: send mail to majordomo@FreeBSD.org
> with "unsubscribe freebsd-bugs" in the body of the message

--
Josef Karthauser        FreeBSD: How many times have you booted today?
Technical Manager       Viagra for your server (http://www.uk.freebsd.org)
Pavilion Internet plc.  [joe@pavilion.net, joe@uk.freebsd.org, joe@tao.org.uk]


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-bugs" in the body of the message