Date: Sat, 31 Aug 2002 18:11:00 +0900 (JST)
From: BugsGrief@bugsgrief.net
To: freebsd-stable@freebsd.org
Subject: ata problem(s)
Message-ID: <200208310930.g7V9UcK57531@ogyo.bugsgrief.net>
I have been using the following IBM ATA disk since late 4.4-stable.

1. 4.4-stable

% dmesg|grep ata
atapci0: <Intel PIIX3 ATA controller> port 0xffa0-0xffaf at device 13.1 on pci0
atapci0: Busmastering DMA not supported
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
ad0: 58644MB <IC35L060AVER07-0> [119150/16/63] at ata0-master BIOSPIO

(full dmesg attached)

The disk occasionally caused read errors (roughly once every one or two
days), sometimes making the victim process segfault. But the system itself
never froze or crashed once the swap space on the disk was removed. The tag
value had no relevance to the error.

2. 4.6.2-release with tag=0

After moving to 4.6.2, read errors became very serious. The kernel prints a
reset message and the system freezes.

login: ad0: READ command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done

On the other hand, as far as I tested, the system survives write errors,
although the recovery took at least 2 x 3 retries. (I am not sure whether a
series of retry attempts relates to a single write event, but the errors
happened when I checked the progress of a copy onto the disk with du.)

login: ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done

3. 4.6.2-release with tag=1

This is more stable than tag=0. But freezes do occur. The messages always
claim tag=0.

login: ad0: READ command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done

4. Non-technical observations.

o READ timeout never recovers.
o Conservatively speaking, tag=1 is slightly better than 4.4 wrt. stability.
o When tag=1, the READ command timeout message says tag=0. But as far as
  can be observed, tag=1 is in effect.
o Besides heavy load, the system strongly dislikes "abruptness". For
  example, a freeze occurred when I typed man atacontrol on the machine
  after it had been quiescent for a while; it printed
      Formatting page, please wait...Done.
  Similar freezes were experienced with ls, sysctl, grep, top and reboot
  (some of these may be wrong, since initially I was careless about the
  relationship between freezes and command input).
o 40/80 pin cables make no difference.
o 'di apm0' causes an almost immediate hang at the first login, while
  'en apm0' is much better, but giving neither is the best.

horio shoichi
http://http.bugsgrief.net/

---D-M-E-S-G----------------------------------------------------
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 4.6.2-RELEASE #1: Thu Aug 20 21:32:03 JST 2002
    horio@ghost.near.this:/usr/obj/usr/src/sys/GHOST
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium Pro (199.43-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x617 Stepping = 7
  Features=0xfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV>
real memory = 67108864 (65536K bytes)
config> q
avail memory = 61571072 (60128K bytes)
Changing APIC ID for IO APIC #0 from 0 to 2 on chip
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc03b8000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc03b809c.
Pentium Pro MTRR support enabled
apm0: <APM BIOS> on motherboard
apm: found APM BIOS v1.2, connected at v1.2
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
IOAPIC #0 intpin 16 -> irq 2
IOAPIC #0 intpin 17 -> irq 16
pci0: <PCI bus> on pcib0
isab0: <Intel 82371SB PCI to ISA bridge> at device 13.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX3 ATA controller> port 0xffa0-0xffaf at device 13.1 on pci0
atapci0: Busmastering DMA not supported
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pcib1: <DEC 21052 PCI-PCI bridge> at device 14.0 on pci0
IOAPIC #0 intpin 18 -> irq 17
pci1: <PCI bus> on pcib1
rl0: <RealTek 8139 10/100BaseTX> port 0xec00-0xecff mem 0xfcfffc00-0xfcfffcff irq 17 at device 10.0 on pci1
rl0: Ethernet address: 00:40:95:20:19:04
miibus0: <MII bus> on rl0
rlphy0: <RealTek internal media interface> on miibus0
rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <Number Nine Imagine 128 II graphics accelerator> at 16.0 irq 2
ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0xd800-0xd8ff mem 0xfe810000-0xfe810fff irq 16 at device 17.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcc7ff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
psm0: failed to get data.
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: <Generic ISA VGA> at port 0x3b0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <4 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x30 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
IPsec: Initialized Security Association Processing.
IP Filter: v3.4.27 initialized. Default = pass all, Logging = enabled
SMP: AP CPU #1 Launched!
ad0: 58644MB <IC35L060AVER07-0> [119150/16/63] at ata0-master BIOSPIO
Waiting 3 seconds for SCSI devices to settle
Mounting root from ufs:/dev/ad0s1a
da1 at ahc0 bus 0 target 1 lun 0
da1: <QUANTUM QM318000TD-S N1B0> Fixed Direct Access SCSI-2 device
da1: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da1: 17366MB (35566499 512 byte sectors: 255H 63S/T 2213C)
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE ST32430N 0510> Fixed Direct Access SCSI-2 device
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da0: 2049MB (4197405 512 byte sectors: 255H 63S/T 261C)
cd0 at ahc0 bus 0 target 5 lun 0
cd0: <NEC CD-ROM DRIVE:462 1.14> Removable CD-ROM SCSI-2 device
cd0: 10.000MB/s transfers (10.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message