Date: Mon, 23 Jan 2006 06:43:35 +0100 From: =?ISO-8859-1?Q?Johan_Str=F6m?= <johan@stromnet.org> To: freebsd-stable@freebsd.org Subject: Re: Page fault, GEOM problem?? Message-ID: <44B2CAEF-A9E7-454B-A232-292B58083952@stromnet.org> In-Reply-To: <1137975447.40786.83.camel@yak.mseubanks.net> References: <991F35AA-151B-4AEA-82BD-5F4AEDF28424@stromnet.org> <a78074950511180117r6d64db25o4ae37c0c5998e002@mail.gmail.com> <74994962-5050-47BD-897B-DE3880B9EBD5@stromnet.org> <a78074950511180943r57fd9d03r64efcc705001bc35@mail.gmail.com> <A6F22EE2-B1E6-44B5-B4C2-E77E1A24FEBB@stromnet.org> <1132353600.903.19.camel@genius1.i.cz> <20051118231351.GA46946@holestein.holy.cow> <1132356649.903.32.camel@genius1.i.cz> <8A4DAD5D-44CF-42DD-A113-340226284533@stromnet.org> <268C3DEB-7569-4C18-BC35-1C5F36EF8EC4@stromnet.org> <1137967081.40786.36.camel@yak.mseubanks.net> <1DA0C9DF-BB42-415B-8851-FFB91CD0F1AC@stromnet.org> <1137975447.40786.83.camel@yak.mseubanks.net>
next in thread | previous in thread | raw e-mail | index | archive | help
--Apple-Mail-9-774025555 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed On 23 jan 2006, at 01.17, Michael S. Eubanks wrote: > On Sun, 2006-01-22 at 23:51 +0100, Johan Str=F6m wrote: > > ...snip... > > >> On 22 jan 2006, at 22.58, Michael S. Eubanks wrote: >> This card does afaik dont have raid functionalitys (I've never read >> anything about it either on the web, the cards box or anywhere =20 >> else..). >> I'm running GENERIC, which does include ataraid.. >> What does your dmesg identify your card as? >> >> atapci0: <Promise PDC40518 SATA150 controller> port 0xb800-0xb87f, >> 0xb400-0xb4ff mem 0xfb800000-0xfb800fff,0xfb000000-0xfb01ffff irq 19 >> at device 12.0 on pci0 >> >> Is it the same PDC chipset? >> >> -- >> Johan >> >> > > No, I have a different controller. My mistake. I think what is > happening is the DMA read command is failing, therefore causing the > device to be disconnected, and the kernel can't write to the disk from > that point on (this is somewhat obvious given the output below). > > >>> Nov 29 20:36:54 elfi kernel: subdisk10: detached >>> Nov 29 20:36:54 elfi kernel: ad10: detached >>> Nov 29 20:36:54 elfi kernel: unknown: TIMEOUT - READ_DMA48 retrying >>> (1 retry left) LBA=3D426562704 >>> Nov 29 20:36:54 elfi kernel: GEOM_MIRROR: Device gm0s1: provider >>> ad10s1 disconnected. >>> > > The message seen from the last line above is generated in any of the > following scenarios (from g_mirror.c): > 1. Device wasn't running yet, but disk disappear. > 2. Disk was active and disapppear. > 3. Disk disappear during synchronization process. > > >>> Nov 29 20:36:54 elfi kernel: GEOM_MIRROR: Request failed (error=3D6). >>> ad10s1[WRITE(offset=3D134356992, length=3D16384)] >>> > > As far as recovering the disk, I remember seeing something about =20 > booting > to single user mode and using fsck after a core dump in a previous =20 > post. > I'm assuming the disks worked initially and that you were able to =20 > label > them etc? Is there any possibility that the disk state may be altered > by a power saving feature or setting in the BIOS and FreeBSD just > doesn't know when it happens until the next time it tries to access =20= > the > disk? > For recovering, i've always done a direct reboot, the gmirror =20 rebuilds the mirror and fsck is run. No problems reading labels etc, and never has been, only problem has =20 been these sporadic crashes.. And the read/write performance (see =20 earlier in thread)... This is a server, so all bios setting for powersaving is (should be) =20 shut of. Bios should thus never make the disk go to sleep. > > > -Michael > > Thanks for trying to help! -- Johan= --Apple-Mail-9-774025555--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?44B2CAEF-A9E7-454B-A232-292B58083952>