Date:      Sat, 31 Aug 2002 18:11:00 +0900 (JST)
From:      BugsGrief@bugsgrief.net
To:        freebsd-stable@freebsd.org
Subject:   ata problem(s)
Message-ID:  <200208310930.g7V9UcK57531@ogyo.bugsgrief.net>

I have been using the following IBM ATA disk since late 4.4-stable.

1. 4.4-stable

% dmesg|grep ata
atapci0: <Intel PIIX3 ATA controller> port 0xffa0-0xffaf at device 13.1 on pci0
atapci0: Busmastering DMA not supported
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
ad0: 58644MB <IC35L060AVER07-0> [119150/16/63] at ata0-master BIOSPIO
(full dmesg attached)

The disk occasionally caused read errors (about once every one or two days),
sometimes making the victim process segfault. But the system itself has
never frozen or crashed since swap space on the disk was removed.

The tag setting made no difference to the errors.

2. 4.6.2-release with tag=0

After moving to 4.6.2, read errors have become very serious. The kernel
prints a reset message and the system freezes.

login: ad0: READ command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done

On the other hand, the system survives write errors, although recovery
took at least 2 x 3 retries, as far as I tested. (I am not sure whether a
series of retry attempts relates to a single write event, but the errors
happened when I checked the progress of a copy onto the disk with du.)

login: ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
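
For anyone comparing similar logs, here is a minimal sketch of how the retry pattern above could be tallied. It is not part of the original report: the function name summarize and the sample text are hypothetical, and the two regular expressions simply mirror the console messages quoted above. It counts READ/WRITE command timeouts and records how many "timeout waiting for DRQ" retries follow each one.

```python
import re
from collections import Counter

# Patterns mirror the console messages quoted in this report.
CMD_TIMEOUT = re.compile(
    r"(ad\d+): (READ|WRITE) command timeout tag=(\d+) serv=(\d+) - resetting"
)
DRQ_TIMEOUT = re.compile(r"(ad\d+): timeout waiting for DRQ - resetting")


def summarize(log_text):
    """Return (counts, drq_runs).

    counts   -- Counter keyed by "READ", "WRITE" and "DRQ"
    drq_runs -- for each command timeout, in order, the number of
                DRQ timeouts seen before the next command timeout
    """
    counts = Counter()
    drq_runs = []
    for line in log_text.splitlines():
        match = CMD_TIMEOUT.search(line)
        if match:
            counts[match.group(2)] += 1
            drq_runs.append(0)
            continue
        if DRQ_TIMEOUT.search(line):
            counts["DRQ"] += 1
            if drq_runs:  # ignore DRQ lines before any command timeout
                drq_runs[-1] += 1
    return counts, drq_runs


# Hypothetical sample: the first part of the WRITE excerpt above.
SAMPLE_LOG = """\
login: ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: timeout waiting for DRQ - resetting
ata0: resetting devices .. done
ad0: WRITE command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done
"""

if __name__ == "__main__":
    counts, drq_runs = summarize(SAMPLE_LOG)
    print(dict(counts), drq_runs)
```

Run over the full WRITE excerpt above, this gives three WRITE timeouts with DRQ runs of 3, 3 and 0, which is the "2 x 3 retries" pattern. Whether each run belongs to a single write event is still an open question; the script only groups lines by their order in the log.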

3. 4.6.2-release with tag=1

This is more stable than tag=0, but freezes still occur. The messages always
claim tag=0.

login: ad0: READ command timeout tag=0 serv=0 - resetting
ata0: resetting devices .. done

4. Non-technical observations.

o A READ timeout never recovers.

o Conservatively speaking, tag=1 is slightly better than 4.4 wrt. stability.

o With tag=1, the READ command timeout message says tag=0. But as far as I
  can observe, tag=1 is in effect.

o Besides heavy load, the disk very much dislikes "abruptness". For example,
  a freeze occurred when man atacontrol was typed after the system had been
  quiescent for a while. The last output was

  Formatting page, please wait...Done.

  Similar freezes have occurred with ls, sysctl, grep, top and
  reboot (some of these might be wrong, since initially I did not pay
  attention to the relationship between freezes and command input).

o 40/80 pin cables make no difference.

o 'di apm0' causes an almost immediate hang at the first login, while
  'en apm0' is much better, but giving neither is best.


horio shoichi http://http.bugsgrief.net/



---D-M-E-S-G----------------------------------------------------
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 4.6.2-RELEASE #1: Thu Aug 20 21:32:03 JST 2002
    horio@ghost.near.this:/usr/obj/usr/src/sys/GHOST
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium Pro (199.43-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x617  Stepping = 7
  Features=0xfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV>
real memory  = 67108864 (65536K bytes)
config> q
avail memory = 61571072 (60128K bytes)
Changing APIC ID for IO APIC #0 from 0 to 2 on chip
Programming 24 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  0, version: 0x00040011, at 0xfee00000
 cpu1 (AP):  apic id:  1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc03b8000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc03b809c.
Pentium Pro MTRR support enabled
apm0: <APM BIOS> on motherboard
apm: found APM BIOS v1.2, connected at v1.2
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
IOAPIC #0 intpin 16 -> irq 2
IOAPIC #0 intpin 17 -> irq 16
pci0: <PCI bus> on pcib0
isab0: <Intel 82371SB PCI to ISA bridge> at device 13.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX3 ATA controller> port 0xffa0-0xffaf at device 13.1 on pci0
atapci0: Busmastering DMA not supported
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pcib1: <DEC 21052 PCI-PCI bridge> at device 14.0 on pci0
IOAPIC #0 intpin 18 -> irq 17
pci1: <PCI bus> on pcib1
rl0: <RealTek 8139 10/100BaseTX> port 0xec00-0xecff mem 0xfcfffc00-0xfcfffcff irq 17 at device 10.0 on pci1
rl0: Ethernet address: 00:40:95:20:19:04
miibus0: <MII bus> on rl0
rlphy0: <RealTek internal media interface> on miibus0
rlphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: <Number Nine Imagine 128 II graphics accelerator> at 16.0 irq 2
ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0xd800-0xd8ff mem 0xfe810000-0xfe810fff irq 16 at device 17.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcc7ff on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
psm0: failed to get data.
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: <Generic ISA VGA> at port 0x3b0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <4 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x30 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
IPsec: Initialized Security Association Processing.
IP Filter: v3.4.27 initialized.  Default = pass all, Logging = enabled
SMP: AP CPU #1 Launched!
ad0: 58644MB <IC35L060AVER07-0> [119150/16/63] at ata0-master BIOSPIO
Waiting 3 seconds for SCSI devices to settle
Mounting root from ufs:/dev/ad0s1a
da1 at ahc0 bus 0 target 1 lun 0
da1: <QUANTUM QM318000TD-S N1B0> Fixed Direct Access SCSI-2 device 
da1: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da1: 17366MB (35566499 512 byte sectors: 255H 63S/T 2213C)
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE ST32430N 0510> Fixed Direct Access SCSI-2 device 
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da0: 2049MB (4197405 512 byte sectors: 255H 63S/T 261C)
cd0 at ahc0 bus 0 target 5 lun 0
cd0: <NEC CD-ROM DRIVE:462 1.14> Removable CD-ROM SCSI-2 device 
cd0: 10.000MB/s transfers (10.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present

