Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 27 Jun 2007 21:16:49 +0200
From:      Patrick Proniewski <patpro@patpro.net>
To:        freebsd-hardware@freebsd.org
Subject:   hardware/driver incompatibility ?
Message-ID:  <7D3EA7E7-CF83-4385-A8FC-611B886CC7E3@patpro.net>

next in thread | raw e-mail | index | archive | help
Hello,

Since the end of 2006, I'm running a FreeBSD 6.x on this box:

- TYAN Tiger i7520SD motherboard
- 2 Intel Xeon LV Sossaman 1,66 GHz
- 2 Western Digital WD2500YS hard drives
- 1GB RAM

Two weeks ago, I've added a DLink DWL-G520 Wifi PCI card (ath  
chipset), and turned this FreeBSD box into a wifi access point. Then,  
the PC started to crash from time to time, always when I'm not at home.
Today, I've had the 2nd crash.

Symptom: the PC crashes and shows an "invalid partition" message when  
I try to reboot it. The first time, after playing around with the  
restore tools of the install CD, I've erased the HD, reinstalled,  
then try to boot... only to read on the screen the infamous "invalid  
partition" message. I've taken a look into the BIOS, changing boot  
sequence, saving, re-changing, re-saving. The next reboot was an  
happy ending. Then I've spent 1 day to restore my various files/ 
settings/softwares.
Today, I was a little bit more lucky, when I came back, the PC was  
stuck, and the screen read:

ad4: FAILURE - device detached
subdisk4: detached
g_vfs_done():ad4s1f[WRITE(offset=84513456128, length=16384)]error = 6
ad4: det_avcfhse_dd
g_vfs_done():ad4s1f...
g_vfs_done():ad4s1a...
...

ad4 is my boot HD (the other one, ad6, is a backup drive, non-bootable)
I've rebooted, discovering ad4 was nowhere to be found (boot sequence  
was slow, and the BIOS eventually disabled the first HD).
I've rebooted again, to enter the bios: first disk still missing
shutdown, unplug the 1st HD, reboot, shutdown, plug the 1st HD,  
reboot -> victory. FreeBSD booted fine, and fsck did its job.

Now the box is up & running. No problem so far. I've removed the Wifi  
PCI card, and removed wifi related settings in rc.conf and  
loader.conf, just in case.

So my question is: is there any known incompatibility between ath  
related kernel modules and any of my existing hardware, or has my HD  
suddenly decided to die ?

Is there any way I could test my HD to detect/prevent failure ?

Any lead/idea is welcome,
thanks

patpro
-- 


atheros related dmesg lines:

ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413,  
RF5413)
ath0: <Atheros 5212> mem 0xd8120000-0xd812ffff irq 21 at device 5.0  
on pci6
ath0: Ethernet address: 00:19:5b:ca:31:3a
ath0: mac 7.9 phy 4.5 radio 5.6

current /boot/loader.conf: (wifi commented out after today's fix)

cpufreq_load = "YES"
#if_ath_load="YES"
#wlan_wep_load="YES"
#wlan_tkip_load="YES"
#wlan_ccmp_load="YES"
#wlan_xauth_load="YES"
#wlan_acl_load="YES"

current kldstat:

Id Refs Address    Size     Name
1    8 0xc0400000 3ae6d4   kernel
2    1 0xc07af000 a9d4     cpufreq.ko
3    1 0xc07ba000 59e80    acpi.ko
4    1 0xc5155000 2000     pflog.ko
5    1 0xc515f000 2a000    pf.ko
6    1 0xc52d3000 2000     green_saver.ko

current dmesg:

Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
         The Regents of the University of California. All rights  
reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-RELEASE-p5 #0: Sat Jun 16 15:51:22 CEST 2007
     root@xxxxxxxxxxxxx:/usr/obj/usr/src/sys/PATPRO-20061208
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU            000  @ 1.66GHz (1666.79-MHz 686- 
class CPU)
   Origin = "GenuineIntel"  Id = 0x6e8  Stepping = 8
    
Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE 
,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
   Features2=0xc1a9<SSE3,MON,VMX,EST,TM2,<b14>,<b15>>
   AMD Features=0x100000<NX>
   Cores per package: 2
real memory  = 1072824320 (1023 MB)
avail memory = 1040695296 (992 MB)
ACPI APIC Table: <PTLTD          APIC  >
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
cpu0 (BSP): APIC ID:  0
cpu1 (AP): APIC ID:  1
cpu2 (AP): APIC ID:  6
cpu3 (AP): APIC ID:  7
ioapic0 <Version 2.0> irqs 0-23 on motherboard
ioapic1 <Version 2.0> irqs 24-47 on motherboard
kbd1 at kbdmux0
acpi0: <PTLTD   RSDT> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
cpu0: <ACPI CPU> on acpi0
est0: <Enhanced SpeedStep Frequency Control> on cpu0
p4tcc0: <CPU Frequency Thermal Control> on cpu0
cpu1: <ACPI CPU> on acpi0
est1: <Enhanced SpeedStep Frequency Control> on cpu1
p4tcc1: <CPU Frequency Thermal Control> on cpu1
cpu2: <ACPI CPU> on acpi0
est2: <Enhanced SpeedStep Frequency Control> on cpu2
p4tcc2: <CPU Frequency Thermal Control> on cpu2
cpu3: <ACPI CPU> on acpi0
est3: <Enhanced SpeedStep Frequency Control> on cpu3
p4tcc3: <CPU Frequency Thermal Control> on cpu3
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pci0: <base peripheral> at device 1.0 (no driver attached)
pcib1: <ACPI PCI-PCI bridge> irq 16 at device 2.0 on pci0
pci1: <ACPI PCI bus> on pcib1
pcib2: <ACPI PCI-PCI bridge> irq 16 at device 4.0 on pci0
pci2: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> irq 16 at device 5.0 on pci0
pci3: <ACPI PCI bus> on pcib3
em0: <Intel(R) PRO/1000 Network Connection Version - 6.2.9> port  
0x2000-0x201f mem 0xd8020000-0xd803ffff,0xd8000000-0xd801ffff irq 16  
at device 0.0 on pc
i3
em0: Ethernet address: 00:e0:81:42:e3:aa
em1: <Intel(R) PRO/1000 Network Connection Version - 6.2.9> port  
0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 17  
at device 0.1 on pc
i3
em1: Ethernet address: 00:e0:81:42:e3:ab
pcib4: <ACPI PCI-PCI bridge> irq 16 at device 6.0 on pci0
pci4: <ACPI PCI bus> on pcib4
pcib5: <ACPI PCI-PCI bridge> at device 28.0 on pci0
pci5: <ACPI PCI bus> on pcib5
uhci0: <UHCI (generic) USB controller> port 0x1400-0x141f irq 16 at  
device 29.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: <UHCI (generic) USB controller> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <UHCI (generic) USB controller> port 0x1420-0x143f irq 19 at  
device 29.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: <UHCI (generic) USB controller> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
pci0: <base peripheral> at device 29.4 (no driver attached)
pci0: <base peripheral, interrupt controller> at device 29.5 (no  
driver attached)
ehci0: <Intel 6300ESB USB 2.0 controller> mem 0xd8402400-0xd84027ff  
irq 23 at device 29.7 on pci0
ehci0: [GIANT-LOCKED]
usb2: EHCI version 1.0
usb2: companion controllers, 2 ports each: usb0 usb1
usb2: <Intel 6300ESB USB 2.0 controller> on ehci0
usb2: USB revision 2.0
uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: 4 ports with 4 removable, self powered
pcib6: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci6: <ACPI PCI bus> on pcib6
fxp0: <Intel 82551 Pro/100 Ethernet> port 0x3400-0x343f mem  
0xd8130000-0xd8130fff,0xd8100000-0xd811ffff irq 16 at device 0.0 on pci6
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:81:42:e3:69
pci6: <display, VGA> at device 12.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel 6300ESB UDMA100 controller> port  
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x1460-0x146f at device 31.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
atapci1: <Intel 6300ESB SATA150 controller> port 0xf4a8-0xf4af, 
0xf49c-0xf49f,0xf4a0-0xf4a7,0xf498-0xf49b,0xf470-0xf47f irq 18 at  
device 31.2 on pci0
ata2: <ATA channel 0> on atapci1
ata3: <ATA channel 1> on atapci1
ichsmb0: <Intel 6300ESB (ICH) SMBus controller> port 0x1440-0x145f  
irq 17 at device 31.3 on pci0
ichsmb0: [GIANT-LOCKED]
smbus0: <System Management Bus> on ichsmb0
smb0: <SMBus generic I/O> on smbus0
acpi_button0: <Power Button> on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10  
on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77f irq 7  
drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/9 bytes threshold
ppbus0: <Parallel port bus> on ppc0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
pmtimer0 on isa0
orm0: <ISA Option ROM> at iomem 0xc0000-0xc8fff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on  
isa0
Timecounters tick every 1.000 msec
acd0: CDROM <SAMSUNG CD-ROM SCR-3231/S102> at ata1-master PIO4
ad4: 239372MB <WDC WD2500YS-01SHB0 20.06C03> at ata2-master SATA150
ad6: 239372MB <WDC WD2500YS-01SHB0 20.06C03> at ata3-master SATA150
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
Trying to mount root from ufs:/dev/ad4s1a






Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?7D3EA7E7-CF83-4385-A8FC-611B886CC7E3>