Date: Fri, 20 Apr 2001 12:08:01 -0500 From: ryanb <ryanb@goddamnbastard.org> To: freebsd-stable@freebsd.org Subject: AMI MegaRAID (428 series; Enterprise 1200?) + 4-STABLE (2001.12.14) -> hard lock w/o ability to dump Message-ID: <20010420120801.B9227@bjorn.goddamnbastard.org>
index | next in thread | raw e-mail
My, that Subject: line is quite long. Please excuse it. :)
Anyway, I was handed down a Dell PowerEdge (gak) with an AMI MegaRAID
Enterprise 1200 and 4 9G disks. The setup within that machine left all
disks (80pin) stuck to the Dell backplane which then ran to said RAID
controller. With no free 3.5"/5.25" bays to mount a 68 pin disk in, I
was stuck with creating 2 RAID volumes under the controller: a 1 disk
RAID 0 and a 3 disk RAID 5. (Get that smirk off your faces. ;)
Getting to the point, this machine likes to just totally lock on its
own. Still pingable, it seems everything is happy that doesn't rely on
the filesystems/disks. ie: I received a page stating that its SMTP services
were timing out, but my systat over a serial console was still responsive.
After exiting systat, it just sat and did nothing, never returning to a
shell prompt. (Possibly from the shell trying to stat() after exiting
systat and getting no RAID response? I dunno ...)
Rather than playing the power cycling game like we had before the serial
consoles were wired, I decided to panic the machine to see what I could get
out of it, if anything. This eventually failed with the following error:
dumping to dev #amrd/0x20001, offset 1048592
dump failed, reason: device doesn't support a dump routine
Automatic reboot in 15 seconds - press a key on the console to abort
This is making me think the RAID controller is completely acting up if
we can't get a core dump (well, with _that_ error above). (Remember what
I said about only having disks on the RAID controller? (Stop chuckling
already.))
... sooo, I'll go ahead and post some diagnostic info below. I'm
basically looking for any hints or anything else to check out to confirm
a cause to the locks.
( See below -- what's this "no devsw" business? Seems there was a post
with similar diagnostic info as mine in freebsd-users-jp, 'cept it was
in Japanese. Duh.)
Thanks in advance!
- ryan
dmesg output:
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights
reserved.
FreeBSD 4.2-STABLE #0: Thu Apr 19 18:41:42 CDT 2001
root@backup.enteract.com:/usr/obj/usr/src/sys/SMTP2
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (447.69-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x672 Stepping = 2
Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 536862720 (524280K bytes)
avail memory = 519028736 (506864K bytes)
hanging APIC ID for IO APIC #0 from 0 to 2 on chip
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
FreeBSD/SMP: Multiprocessor motherboard
cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000
cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000
io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc02d2000.
Pentium Pro MTRR support enabled
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Intel 82443GX host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
pcib2: <Intel 82443GX (440 GX) PCI-PCI (AGP) bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib2
pci1: <ATI Mach64-GD graphics accelerator> at 0.0
pcib3: <DEC 21152 PCI-PCI bridge> at device 2.0 on pci0
pci2: <PCI bus> on pcib3
ahc0: <Adaptec aic7890/91 Ultra2 SCSI adapter> port 0xec00-0xecff mem 0xf9fff000-0xf9ffffff irq 16 at device 4.0 on pci2
aic7890/91: Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: <Adaptec aic7860 SCSI adapter> port 0xe800-0xe8ff mem 0xf9ffe000-0xf9ffefff irq 16 at device 6.0 on pci2
aic7860: Single Channel A, SCSI Id=7, 3/255 SCBs
amr0: <AMI MegaRAID> port 0xe480-0xe4ff irq 18 at device 10.0 on pci2
amr0: <Series 428> Firmware Uc77, BIOS 1.47, 128MB RAM
isab0: <Intel 82371AB PCI to ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
pci0: <Intel PIIX4 ATA controller> at 7.1
pci0: <Intel 82371AB/EB (PIIX4) USB controller> at 7.2 irq 0 Timecounter "PIIX" frequency 3579545 Hz
chip1: <Intel 82371AB Power management controller> port 0x850-0x85f at device 7. 3 on pci0
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xdce0-0xdcff mem 0xfe000000-0xfe0fffff,0xf7000000-0xf7000fff irq 20 at device 8.0 on pci0
fxp0: Ethernet address 00:90:27:78:a7:1d
dc0: <82c169 PNIC 10/100BaseTX> port 0xd800-0xd8ff mem 0xfe100000-0xfe1000ff irq 21 at device 10.0 on pci0
dc0: Ethernet address: 00:a0:cc:3b:43:ba
miibus0: <MII bus> on dc0
ukphy0: <Generic IEEE 802.3u media interface> on miibus0
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcib1: <Intel 82443GX host to AGP bridge> on motherboard
pci3: <PCI bus> on pcib1
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> on isa0
sc0: VGA <4 virtual consoles, flags=0x0>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via IOAPIC #0 intpin 2
Waiting 2 seconds for SCSI devices to settle
amrd0: <MegaRAID logical drive> on amr0
amrd0: 8568MB (17547264 sectors) RAID 0 (optimal)
amrd1: <MegaRAID logical drive> on amr0
amrd1: 17136MB (35094528 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
pass0 at ahc1 bus 0 target 5 lun 0
pass0: <NEC CD-ROM DRIVE:465 1.25> Removable CD-ROM SCSI-2 device
pass0: 20.000MB/s transfers (20.000MHz, offset 15)
no devsw (majdev=0 bootdev=0xa0200000)
Mounting root from ufs:/dev/amrd0s1a
WARNING: / was not properly dismounted
serial console output:
telnet> send break
Stopped at siointr1+0xb1: jmp siointr1+0x1a0
db> panic
panic: from debugger
mp_lock = 00000001; cpuid = 0; lapic.id = 01000000
boot() called on cpu#0
syncing disks...
Fatal trap 12: page fault while in kernel mode
mp_lock = 00000002; cpuid = 0; lapic.id = 01000000
fault virtual address = 0x30
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc01c9569
stack pointer = 0x10:0xff806d18
frame pointer = 0x10:0xff806d1c
code segment = base 0x0, limit 0xfffff, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = Idle
interrupt mask = bio <- SMP: XXX
kernel: type 12 trap, code=0
Stopped at siointr1+0xb1: jmp siointr1+0x1a0
db> trace
siointr1(c4068400,c02852a8,0,0,c01fe090) at siointr1+0xb1
siointr(c4068400) at siointr+0x17
Xfastintr4() at Xfastintr4+0x20
db> show all procs
pid proc addr uid ppid pgrp flag stat wmesg wchan cmd
43098 ddc58b60 ddcf0000 0 134 134 000004 3 inode c40d0800 inetd
43097 de807ba0 de86e000 0 134 134 000004 3 inode c40d0800 inetd
43096 de103260 de142000 0 134 134 000004 3 inode c40d0800 inetd
43095 de529520 de59e000 0 139 139 000004 3 inode c40d0800 sshd
43094 de40db60 de4c5000 0 139 139 000004 3 inode c40d0800 sshd
43093 de5284e0 de5c1000 0 139 139 000004 3 inode c41a1900 sshd
43090 de411100 de431000 0 43088 43090 004004 3 biord ce93a468 sh
43088 de40dd00 de4c2000 0 136 136 000084 3 piperd ddbceb80 cron
43067 de9498a0 de99a000 0 358 358 004004 3 inode c4220500 smtpd
43066 ddf8f560 ddfec000 333 358 358 004104 3 inode c4376200 cleanup
43065 ddf8eee0 ddffa000 333 358 358 004184 3 select c027fc30 smtpd
43064 dde25860 dde9c000 333 358 358 004104 3 getblk ce8c83dc cleanup
43063 de94b2a0 de966000 333 358 358 004104 3 inode c4220500 smtpd
43062 ddf8e1e0 de01a000 333 358 358 004184 3 select c027fc30 smtpd
43063 de94b2a0 de966000 333 358 358 004104 3 inode c4220500 smtpd
43060 de94be00 de94d000 333 358 358 004184 3 select c027fc30 smtpd
43059 ddf8db60 de029000 333 358 358 004104 3 wdrain c026c48c cleanup
43058 de0453c0 de0ab000 333 358 358 004184 3 select c027fc30 smtpd
43057 de8aad80 de8ed000 333 358 358 004104 3 wdrain c026c48c cleanup
43056 dde28920 dde34000 333 358 358 004184 3 select c027fc30 smtpd
43055 de5d0a80 de5f9000 333 358 358 004104 3 wdrain c026c48c cleanup
43054 de94a260 de986000 333 358 358 004184 3 select c027fc30 smtpd
43053 de80a5e0 de81c000 333 358 358 004104 3 wdrain c026c48c cleanup
43052 de73a4e0 de2ff000 333 358 358 004104 3 getblk ce878140 cleanup
43051 ddc58680 ddcfa000 333 358 358 004184 3 select c027fc30 smtpd
43050 de8aaa40 de8f4000 333 358 358 004184 3 select c027fc30 smtpd
43049 de8a8b60 de92f000 333 358 358 004104 3 wdrain c026c48c cleanup
43048 de1bfa00 de23b000 333 358 358 004184 3 select c027fc30 smtpd
43047 de682080 de6f8000 333 358 358 004104 3 getblk ce90a768 cleanup
43046 dde263c0 dde83000 333 358 358 004184 3 select c027fc30 smtpd
43045 de5cd680 de671000 333 358 358 004104 3 wdrain c026c48c cleanup
43044 de279340 de332000 333 358 358 004104 3 getblk ce937b10 cleanup
43043 de8acac0 de8b5000 333 358 358 004104 3 wdrain c026c48c cleanup
43042 de52abe0 de570000 333 358 358 004184 3 select c027fc30 smtpd
43041 de52c440 de541000 333 358 358 004104 3 wdrain c026c48c cleanup
43040 ddf905a0 ddfc9000 333 358 358 004184 3 select c027fc30 smtpd
43039 ddedcc20 ddf05000 333 358 358 004184 3 select c027fc30 smtpd
43038 de40e040 de4bc000 333 358 358 004184 3 select c027fc30 smtpd
43037 de044520 de0cc000 333 358 358 004104 3 wdrain c026c48c cleanup
43036 de947680 de9e1000 333 358 358 004104 3 wdrain c026c48c cleanup
43035 de100680 de1ae000 333 358 358 004184 3 select c027fc30 smtpd
43034 db48ff60 ddbe6000 333 358 358 004184 3 select c027fc30 smtpd
43031 de33bd00 de3e9000 333 358 358 004104 3 wdrain c026c48c cleanup
43030 ddd0eac0 ddd1c000 333 358 358 004184 3 select c027fc30 smtpd
43029 ddd0df60 ddd32000 333 358 358 004104 3 ffsfsn de509704 cleanup
43028 de4105a0 de44e000 333 358 358 004184 3 select c027fc30 smtpd
43027 de33ec20 de36f000 333 358 358 004184 3 select c027fc30 smtpd
43026 ddc5b260 ddc9c000 333 358 358 004104 3 getblk ce88b5e8 cleanup
43025 db48e700 ddc1e000 333 358 358 004184 3 select c027fc30 smtpd
43024 de947820 de9de000 333 358 358 004104 3 getblk ce894490 cleanup
43023 de8a9a00 de913000 333 358 358 004104 3 ffsfsn db48b304 cleanup
43022 de52aa40 de573000 333 358 358 004184 3 select c027fc30 smtpd
43021 de27b560 de2d9000 333 358 358 004184 3 select c027fc30 smtpd
43020 de27ca80 de2aa000 333 358 358 004104 3 ffsfsn de356404 cleanup
43019 de8a81a0 de942000 333 358 358 004104 3 getblk ce8d24ac cleanup
43018 db4905e0 ddbc8000 333 358 358 004184 3 select c027fc30 smtpd
43017 ddf912a0 ddfae000 333 358 358 004104 3 ffsfsn ddd15404 cleanup
43016 de045d80 de092000 333 358 358 004184 3 select c027fc30 smtpd
43015 de52ba80 de554000 333 358 358 004104 3 ffsfsn dea28984 cleanup
43014 dde244e0 ddecc000 333 358 358 004184 3 select c027fc30 smtpd
43013 de94ac20 de973000 333 358 358 004104 3 ffsfsn de4e0684 cleanup
43012 de806ea0 de887000 333 358 358 004104 3 ffsfsn ddebb504 cleanup
43011 ddedbf20 ddf21000 333 358 358 004184 3 select c027fc30 smtpd
43010 de683dc0 de6bd000 333 358 358 004104 3 inode c4220500 cleanup
43009 de1beea0 de254000 333 358 358 004184 3 select c027fc30 smtpd
43008 db48ed80 ddc0a000 333 358 358 004104 3 ffsfsn de4e7544 cleanup
43007 db48d6c0 ddc44000 333 358 358 004184 3 select c027fc30 smtpd
43006 de73cbe0 de794000 333 358 358 004104 3 ffsfsn de4fe284 cleanup
43005 de27d2a0 de29a000 333 358 358 004104 3 getblk ce865460 cleanup
42992 de5cfbe0 de618000 333 358 358 004104 3 biord ce8ff594 cleanup
42991 de40e520 de4b1000 333 358 358 004184 3 select c027fc30 smtpd
42990 de043820 de0e8000 333 358 358 004184 3 select c027fc30 smtpd
42988 de8a8820 de936000 333 358 358 004104 3 ffsfsn ddd8f004 cleanup
42987 de8088a0 de855000 333 358 358 004184 3 select c027fc30 smtpd
42986 ddf8dd00 de025000 333 358 358 004104 3 ffsfsn de4f8904 cleanup
42984 de1bf860 de23e000 333 358 358 004184 3 select c027fc30 smtpd
42975 de6809c0 de725000 333 358 358 004104 3 ffsfsn de43de84 cleanup
42974 de947340 de9e7000 333 358 358 004184 3 select c027fc30 smtpd
42973 de73c700 de79d000 333 358 358 004184 3 select c027fc30 smtpd
42972 de949be0 de992000 333 358 358 004104 3 ffsfsn ddd12944 cleanup
42967 dde241a0 dded3000 333 358 358 004104 3 biord ce928718 flush
42938 de73e920 de74b000 333 358 358 004104 3 getbuf ce8b1a06 bounce
42922 de680d00 de71e000 333 358 358 004104 3 ffsfsn dea11104 bounce
42921 dde28440 dde3f000 333 358 358 004104 3 getblk ce840798 bounce
42919 de103740 de138000 333 358 358 004104 3 ffsfsn de513e84 bounce
42905 ddd0e2a0 ddd2b000 333 358 358 004184 3 select c027fc30 smtp
42885 dde24680 ddec9000 333 358 358 004184 3 select c027fc30 smtp
42854 de8061a0 de8a3000 333 358 358 004184 3 select c027fc30 smtp
42834 de046400 de081000 333 358 358 004184 3 select c027fc30 smtp
42820 de103400 de13e000 333 358 358 004184 3 select c027fc30 smtpd
42688 de27d920 de28c000 333 358 358 004104 3 inode c4220500 smtp
42662 de6804e0 de72f000 333 358 358 004184 3 select c027fc30 smtp
42658 de8071e0 de881000 333 358 358 004104 3 getblk ce8fd020 smtp
42535 de40d1a0 de522000 333 358 358 004104 3 ffsfsn de77d1c4 bounce
42487 de33edc0 de36c000 333 358 358 004104 3 inode c4220500 smtp
42479 ddd0ab60 dde0b000 333 358 358 004184 3 select c027fc30 smtp
42459 de047100 de065000 333 358 358 004184 3 select c027fc30 smtp
42369 de5ce6c0 de652000 333 358 358 004104 3 ffsfsn ddd9dd04 bounce
42325 de33e400 de389000 333 358 358 004104 3 ffsfsn de383704 bounce
42298 dde27740 dde5a000 333 358 358 004104 3 wdrain c026c48c cleanup
42296 de684c60 de689000 333 358 358 004104 3 ffsfsn dea20e44 cleanup
42295 de045080 de0b3000 333 358 358 004104 3 getblk ce87bf80 cleanup
42291 de807380 de87e000 333 358 358 004104 3 ffsfsn ddd8b684 cleanup
42229 de948ba0 de9b5000 333 358 358 004104 3 ffsfsn de4d1a04 cleanup
41967 ddd0ec60 ddd18000 333 358 358 004184 3 select c027fc30 smtp
41812 dde24340 ddecf000 333 358 358 004184 3 select c027fc30 smtp
41737 de27a6c0 de303000 333 358 358 004184 3 select c027fc30 smtp
41609 de1bf520 de246000 333 358 358 004184 3 select c027fc30 smtpd
41608 de045560 de0a8000 333 358 358 004104 3 inode c4220500 smtpd
41458 de5d00c0 de60f000 333 358 358 004184 3 select c027fc30 smtpd
41309 de411440 de429000 333 358 358 004104 3 wdrain c026c48c cleanup
41306 de808a40 de852000 333 358 358 004104 3 wdrain c026c48c cleanup
41305 de809a80 de832000 333 358 358 004104 3 wdrain c026c48c cleanup
41239 ddc59ba0 ddccd000 333 358 358 004184 3 select c027fc30 smtpd
41237 de1bf1e0 de24d000 333 358 358 004184 3 select c027fc30 smtpd
41230 ddd0bd40 ddd7e000 333 358 358 004184 3 select c027fc30 smtpd
41224 de8a9d40 de90d000 333 358 358 004184 3 select c027fc30 smtpd
41217 de8ab260 de8e4000 333 358 358 004184 3 select c027fc30 smtpd
41215 de40e380 de4b5000 333 358 358 004184 3 select c027fc30 smtpd
24519 ddd0ca40 ddd63000 0 1 24519 004006 3 inode c40d0800 sh
24009 ddf915e0 ddfa6000 22787 20877 24009 004086 3 ttyin c4611c30 bash
24006 de33b000 de40b000 0 20883 24006 004086 3 ttyin c44fc030 systat
21033 de683740 de6cb000 333 358 358 004184 3 select c027fc30 trivial-rewr
ite
21016 db48f0c0 ddc03000 333 358 358 004104 3 inode c4801900 qmgr
20883 ddf8e520 de00e000 0 20880 20883 004086 3 wait ddf8e520 bash
20880 de33f780 de350000 0 20878 20880 004086 3 wait de33f780 sh
20878 de5d1440 de5e6000 22787 20877 20878 004086 3 wait de5d1440 bash
20877 de33bb60 de3ec000 22787 1 20877 000084 3 select c027fc30 screen-3.9.8
386 ddd0e440 ddd28000 0 1 386 004086 3 ttyin c43d7110 getty
385 ddd0c080 ddd78000 0 1 385 004086 3 ttyin c43d7f10 getty
384 ddd0c700 ddd6a000 0 1 384 004086 3 ttyin c43d4110 getty
383 ddd0ddc0 ddd36000 0 1 383 004086 3 ttyin c4063210 getty
358 db48ef20 ddc06000 0 1 358 004104 3 inode c4220500 master
139 db48f260 ddc00000 0 1 139 000004 3 inode c40d0800 sshd
136 db48f400 ddbfc000 0 1 136 000004 3 inode c40d0800 cron
134 db48f5a0 ddbf8000 0 1 134 000084 3 select c027fc30 inetd
117 db48f740 ddbf5000 1 1 117 000104 3 inode c40d0800 rwhod
111 db48f8e0 ddbf2000 0 1 106 000084 3 nfsidl c0281dac nfsiod
110 db48fa80 ddbef000 0 1 106 000084 3 nfsidl c0281da8 nfsiod
109 db48fc20 ddbec000 0 1 106 000084 3 nfsidl c0281da4 nfsiod
108 db48fdc0 ddbe9000 0 1 106 000084 3 nfsidl c0281da0 nfsiod
102 db490100 ddbd5000 0 1 102 000084 3 select c027fc30 ntpd
95 db4902a0 ddbd2000 0 1 95 000084 3 select c027fc30 syslogd
5 db490780 db49d000 0 0 0 000204 3 wdrain c026c48c syncer
4 db490920 db49b000 0 0 0 100204 3 psleep c026c454 bufdaemon
3 db490ac0 db499000 0 0 0 000204 3 psleep c0276fa0 vmdaemon
2 db490c60 db497000 0 0 0 100204 3 psleep c025e5f8 pagedaemon
1 db490e00 db495000 0 0 1 004284 3 wait db490e00 init
0 c027efa0 c02f2000 0 0 0 000204 3 sched c027efa0 swapper
39739 de5d0400 de609000 0 358 358 006104 5 pickup
db> step
panic: rslock: cpu: 0, addr: 0xc02852a8, lock: 0x00000001
mp_lock = 00000002; cpuid = 0; lapic.id = 01000000
boot() called on cpu#0
Uptime: 15h14m39s
amr0: flushing cache...failed
dumping to dev #amrd/0x20001, offset 1048592
dump failed, reason: device doesn't support a dump routine
Automatic reboot in 15 seconds - press a key on the console to abort
Rebooting...
cpu_reset called on cpu#0
cpu_reset: Stopping other CPUs
To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message
help
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20010420120801.B9227>
