Date:      Sun, 27 Jul 2003 22:03:56 +0200
From:      Grzegorz Czaplinski <G.Czaplinski@prioris.mini.pw.edu.pl>
To:        "Marc G. Fournier" <scrappy@hub.org>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: Dump Card State Begins ...
Message-ID:  <20030727200355.GM82199@prioris.mini.pw.edu.pl>
In-Reply-To: <20030726115857.M37284@hub.org>
References:  <20030726115857.M37284@hub.org>

On Sat, Jul 26, 2003 at 12:03:48PM -0300, Marc G. Fournier wrote:
>
> Hi ...
>
>   Can someone tell me whether or not this is indicative of a hardware, or
> software, problem?  It happened a few times today, on two different
> drives, and it seem to "self-recover", since the server is still purring
> along without any noticeable problems:
>
> neptune# grep "timed out" /var/log/messages
> Jul 25 03:52:51 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x40 - timed out
> Jul 25 03:57:22 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x18 - timed out
> Jul 25 03:58:53 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x1e - timed out
> Jul 26 10:55:46 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x39 - timed out
>
>   The drives are all U320 Seagate Cheetah 70G ... no RAID involved, its
> just straight drives using the motherboard's onboard SCSI controller ...
> the motherboard is the Intel SE7501, in the SR2300 chassis ...
>
>   It did it back on the 19th as well:
>
> Jul 19 19:37:16 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x46 - timed out
> Jul 19 19:38:46 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x2d - timed out
>

Timeouts like these can be caused by bad cabling or termination.
Check both...
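
A quick way to see whether the timeouts cluster on one drive or are
spread across everything on the same controller (which would point more
towards a shared cable or terminator) is to tally them. The following is
only a rough sketch: the log lines are the ones quoted above, while the
regular expression and the count_timeouts() helper are my own
illustrative names, not part of any FreeBSD tool.

```python
import re
from collections import Counter

# Timeout lines copied from the quoted /var/log/messages output.
LOG = """\
Jul 19 19:37:16 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x46 - timed out
Jul 19 19:38:46 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x2d - timed out
Jul 25 03:52:51 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x40 - timed out
Jul 25 03:57:22 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x18 - timed out
Jul 25 03:58:53 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x1e - timed out
Jul 26 10:55:46 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x39 - timed out
"""

# Matches the "(device:controller:bus:target:lun)" prefix of a timeout line.
TIMEOUT_RE = re.compile(
    r"\((\w+):(\w+):\d+:\d+:\d+\): SCB 0x[0-9a-f]+ - timed out"
)


def count_timeouts(text):
    """Return (per-device, per-controller) Counters of timed-out SCBs."""
    per_device, per_controller = Counter(), Counter()
    for line in text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            per_device[match.group(1)] += 1
            per_controller[match.group(2)] += 1
    return per_device, per_controller


per_device, per_controller = count_timeouts(LOG)
print(dict(per_device))      # timeouts per disk
print(dict(per_controller))  # timeouts per controller
```

On the quoted lines this counts four timeouts on da2 and two on da1, all
on ahd1. To run it against a real log, read /var/log/messages into a
string and pass that to count_timeouts() instead of LOG.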

>   But again, appears to have recovered with no ill effects ...
>
>
> Jul 25 03:52:51 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x40 - timed out
> Jul 25 03:53:06 neptune /kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
> Jul 25 03:53:06 neptune /kernel: ahd1: Dumping Card State at program address 0x15 Mode 0x22
> Jul 25 03:53:06 neptune /kernel: Card was paused
> Jul 25 03:53:06 neptune /kernel: HS_MAILBOX[0x0] INTCTL[0xc0] SEQINTSTAT[0x0] SAVED_MODE[0x11]
> Jul 25 03:53:06 neptune /kernel: DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
> Jul 25 03:53:06 neptune /kernel: LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
> Jul 25 03:53:06 neptune /kernel: SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
> Jul 25 03:53:06 neptune /kernel: SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8]
> Jul 25 03:53:06 neptune /kernel: SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
> Jul 25 03:53:06 neptune /kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x1]
> Jul 25 03:53:06 neptune /kernel:
> Jul 25 03:53:06 neptune /kernel: SCB Count = 96 CMDS_PENDING = 29 LASTSCB 0x22 CURRSCB 0x22 NEXTSCB 0xff00
> Jul 25 03:53:06 neptune /kernel: qinstart = 65252 qinfifonext = 65252
> Jul 25 03:53:06 neptune /kernel: QINFIFO:
> Jul 25 03:53:06 neptune /kernel: WAITING_TID_QUEUES:
> Jul 25 03:53:06 neptune /kernel: Pending list:
> Jul 25 03:53:06 neptune /kernel: 21 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 29 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 63 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 65 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 24 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 10 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 15 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 47 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 59 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 26 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 77 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 54 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 42 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 57 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 55 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 92 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 78 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 12 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 27 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 28 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 32 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 95 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 53 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 25 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 52 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 1 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 38 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 93 FIFO_USE[0x0] SCB_CONTROL[0x62] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 64 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: Total 29
> Jul 25 03:53:07 neptune /kernel: Kernel Free SCB list: 7 56 34 19 4 37 20 46 40 61 11 39 31 45 58 23 73 30 5 62 8 41 18 16 13 66 51 14 44 49 36 50 70 35 9 76 74 2 48 43 3 33 79 71 75 60 67 69 91 94 6 72 0 68 17 22 90 89 88 87 86 85 84 83 82 81 80
> Jul 25 03:53:07 neptune /kernel: Sequencer Complete DMA-inprog list:
> Jul 25 03:53:07 neptune /kernel: Sequencer Complete list:
> Jul 25 03:53:07 neptune /kernel: Sequencer DMA-Up and Complete list:
> Jul 25 03:53:07 neptune /kernel:
> Jul 25 03:53:07 neptune /kernel: ahd1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x22
> Jul 25 03:53:07 neptune /kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
> Jul 25 03:53:07 neptune /kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
> Jul 25 03:53:07 neptune /kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
> Jul 25 03:53:07 neptune /kernel: ahd1: FIFO1 Free, LONGJMP == 0x8277, SCB 0x7
> Jul 25 03:53:07 neptune /kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89]
> Jul 25 03:53:07 neptune /kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
> Jul 25 03:53:07 neptune /kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
> Jul 25 03:53:07 neptune /kernel: LQIN: 0x55 0x0 0x0 0x7 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: ahd1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
> Jul 25 03:53:07 neptune /kernel: ahd1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
> Jul 25 03:53:07 neptune /kernel: SIMODE0[0xc]
> Jul 25 03:53:07 neptune /kernel: CCSCBCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: ahd1: REG0 == 0x22, SINDEX = 0x122, DINDEX = 0x102
> Jul 25 03:53:07 neptune /kernel: ahd1: SCBPTR == 0x7, SCB_NEXT == 0x49, SCB_NEXT2 == 0xfff1
> Jul 25 03:53:07 neptune /kernel: CDB 2a 0 7 80 a0 ca
> Jul 25 03:53:07 neptune /kernel: STACK: 0x125 0x125 0x125 0x257 0x257 0x257 0x29 0x15
> Jul 25 03:53:07 neptune /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0

It looks like your drive da2 is dying.
I had the same sort of errors a few weeks ago.
You may not be able to unmount the drives properly now. Try to boot into
single user mode and work on that drive from there. If you are lucky,
you will have a chance to get the data back.
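
The three "sense data" lines at the end of the dump can be decoded by
hand, which may help when judging what the drives themselves reported.
Below is a minimal sketch, assuming the 18 bytes are fixed-format SCSI
sense data (response code 0x70, sense key in the low nibble of byte 2,
ASC and ASCQ in bytes 12 and 13). The two lookup tables are small
excerpts I typed in from the SCSI Primary Commands tables, not complete
lists, and decode_fixed_sense() is just an illustrative helper name.

```python
# Small excerpts of the SPC sense-key and ASC/ASCQ tables; NOT complete.
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
    0xB: "ABORTED COMMAND",
}
ASC_ASCQ = {
    (0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED",
    (0x29, 0x02): "SCSI BUS RESET OCCURRED",
}

# Sense bytes copied from the "Copied 18 bytes of sense data" log lines.
SENSE_LINE = ("0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 "
              "0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0")


def decode_fixed_sense(text):
    """Decode sense key and ASC/ASCQ from fixed-format sense bytes."""
    data = [int(token, 16) for token in text.split()]
    # Fixed-format sense uses response code 0x70 (current) or 0x71 (deferred).
    if len(data) < 14 or (data[0] & 0x7F) not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key = data[2] & 0x0F
    asc, ascq = data[12], data[13]
    return {
        "sense_key": SENSE_KEYS.get(key, "unknown (0x%x)" % key),
        "asc": asc,
        "ascq": ascq,
        "meaning": ASC_ASCQ.get((asc, ascq), "not in this excerpt"),
    }


print(decode_fixed_sense(SENSE_LINE))
```

Under those assumptions the bytes decode to UNIT ATTENTION with ASC/ASCQ
29h/02h, i.e. the drive noting that a SCSI bus reset occurred. That is
what one would expect to see after the driver recovers from a timeout,
so on its own it neither confirms nor rules out a failing disk.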

Good luck,
	gregory
--
Grzegorz Czaplinski <gregory at prioris.mini.pw.edu.pl>
"The Power to Serve, Right for the Power Users!" - http://www.FreeBSD.org/
 Fingerprint: EB77 E19D CFA2 5736 810F  847C A70F A275 2489 469F
