Date: Sat, 14 Jul 2007 00:11:23 +0200 (CEST) From: "Gelsema, P \(Patrick\)" <gelsemap@superhero.nl> To: "Scott Long" <scottl@samsco.org> Cc: "Gelsema, P \(Patrick\)" <gelsemap@superhero.nl> Subject: Re: Difference between 6.2 and 7.0 Adaptec 39320D - 7.0 performing less Message-ID: <1084.10.202.77.103.1184364683.squirrel@webmail.superhero.nl> In-Reply-To: <462CDDA5.9080304@samsco.org> References: <200704162247.29909.gelsemap@superhero.nl> <16295.195.50.100.20.1176825375.squirrel@www.superhero.nl> <4624F4C3.30006@samsco.org> <200704182222.32069.gelsemap@superhero.nl> <462684E0.20105@samsco.org> <9987.195.50.100.20.1176979371.squirrel@www.superhero.nl> <46277B98.7080007@samsco.org> <2712.195.50.100.20.1176996182.squirrel@www.superhero.nl> <462A7CCB.6020302@samsco.org> <1193.10.202.77.103.1177345281.squirrel@webmail.superhero.nl> <462CDDA5.9080304@samsco.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, April 23, 2007 18:24, Scott Long wrote: > Gelsema, P (Patrick) wrote: >> On Sat, April 21, 2007 23:06, Scott Long wrote: >>> Gelsema, P (Patrick) wrote: >>>> On Thu, April 19, 2007 16:24, Scott Long wrote: >>>>> Gelsema, P (Patrick) wrote: >>>>>> On Wed, April 18, 2007 22:51, Scott Long wrote: >>>>>>> Gelsema, P (Patrick) wrote: >>>>>>>> On Tuesday 17 April 2007 18:24, Scott Long wrote: >>>>>>>>> Gelsema, P (Patrick) - FreeBSD wrote: >>>>>>>>>> On Tue, April 17, 2007 16:45, Scott Long wrote: >>>>>>>>>>> Gelsema, P (Patrick) - FreeBSD wrote: >>>>>>>>>> <SNAP></SNAP> >>>>>>>>>> >>>>>>>>>>> The 39320D is a finicky card. I don't recall putting in the >>>>>>>>>>> code >>>>>>>>>>> that >>>>>>>>>>> would downshift the speed like this, but it wouldn't surprise >>>>>>>>>>> me >>>>>>>>>>> if >>>>>>>>>>> it >>>>>>>>>>> is a side effect of the system going slower. Anyways, it >>>>>>>>>>> sounds >>>>>>>>>>> like >>>>>>>>>>> you're a good candidate/victim for the MPSAFE locking changes >>>>>>>>>>> that >>>>>>>>>>> I >>>>>>>>>>> just made to the SCSI layer and the ahc/ahd drivers. Would you >>>>>>>>>>> mind >>>>>>>>>>> testing it out (just update to the latest 7-CURRENT sources) >>>>>>>>>>> and >>>>>>>>>>> let >>>>>>>>>>> me >>>>>>>>>>> know how it works for you? >>>>>>>> <SNAP></SNAP> >>>>>>>> >>>>>>>>>> Is building world/kernel sufficient as test or do you want me to >>>>>>>>>> do >>>>>>>>>> more >>>>>>>>>> tests? >>>>>>>>> Any amount of testing that you can do is appreciated. Even >>>>>>>>> verifying >>>>>>>>> that it boots is helpful =-) >>>>>>>> Cvsupped this evening at about 6.15 UTC time (20:15 CET zone) >>>>>>>> FreeBSD hulk.superhero.nl 7.0-CURRENT FreeBSD 7.0-CURRENT #0: Wed >>>>>>>> Apr >>>>>>>> 18 >>>>>>>> 21:56:58 CEST 2007 >>>>>>>> root@hulk.superhero.nl:/usr/obj/usr/src/sys/GENERIC >>>>>>>> amd64 >>>>>>>> >>>>>>>> After buildworld and the whole lot the computer boots fine, >>>>>>>> however >>>>>>>> the >>>>>>>> disk >>>>>>>> is still detected as only 160.00MB/s. >>>>>>>> >>>>>>>> I do get the following crash. It seems to be related to pressing >>>>>>>> scroll >>>>>>>> lock >>>>>>>> on the console and hitting the page up/down buttons. When I just >>>>>>>> log >>>>>>>> on >>>>>>>> locally or remotely it seems to be ok. When I hit the scroll lock >>>>>>>> key >>>>>>>> before >>>>>>>> or after logging on I get the below crash. >>>>>>>> >>>>>>>> Apr 18 22:08:22 hulk kernel: lock order reversal: (Giant after >>>>>>>> non-sleepable) >>>>>>>> Apr 18 22:08:22 hulk kernel: 1st 0xffffff007b413358 ahd_lock >>>>>>>> (ahd_lock) >>>>>>>> @ /usr/src/sys/cam/cam_periph.c:559 >>>>>>>> Apr 18 22:08:22 hulk kernel: 2nd 0xffffffff80977f20 Giant (Giant) >>>>>>>> @ /usr/src/sys/vm/vm_contig.c:590 >>>>>>>> Apr 18 22:08:22 hulk kernel: KDB: stack backtrace: >>>>>>>> Apr 18 22:08:22 hulk kernel: db_trace_self_wrapper() at >>>>>>>> db_trace_self_wrapper+0x3a >>>>>>>> Apr 18 22:08:22 hulk kernel: witness_checkorder() at >>>>>>>> witness_checkorder+0x4f9 >>>>>>>> Apr 18 22:08:22 hulk kernel: _mtx_lock_flags() at >>>>>>>> _mtx_lock_flags+0x75 >>>>>>>> Apr 18 22:08:22 hulk kernel: contigmalloc() at contigmalloc+0x63 >>>>>>>> Apr 18 22:08:22 hulk kernel: bus_dmamem_alloc() at >>>>>>>> bus_dmamem_alloc+0x8d >>>>>>>> Apr 18 22:08:22 hulk kernel: ahd_alloc_scbs() at >>>>>>>> ahd_alloc_scbs+0x32a >>>>>>>> Apr 18 22:08:22 hulk kernel: ahd_get_scb() at ahd_get_scb+0x69 >>>>>>>> Apr 18 22:08:22 hulk kernel: ahd_action() at ahd_action+0x47c >>>>>>>> Apr 18 22:08:22 hulk kernel: xpt_run_dev_sendq() at >>>>>>>> xpt_run_dev_sendq+0x1ae >>>>>>>> Apr 18 22:08:22 hulk kernel: xpt_action() at xpt_action+0x4d3 >>>>>>>> Apr 18 22:08:22 hulk kernel: dastart() at dastart+0x211 >>>>>>>> Apr 18 22:08:22 hulk kernel: xpt_run_dev_allocq() at >>>>>>>> xpt_run_dev_allocq+0xf4 >>>>>>>> Apr 18 22:08:22 hulk kernel: dastrategy() at dastrategy+0x78 >>>>>>>> Apr 18 22:08:22 hulk kernel: g_disk_start() at g_disk_start+0xe6 >>>>>>>> Apr 18 22:08:22 hulk kernel: g_io_schedule_down() at >>>>>>>> g_io_schedule_down+0x189 >>>>>>>> Apr 18 22:08:22 hulk kernel: g_down_procbody() at >>>>>>>> g_down_procbody+0x7a >>>>>>>> Apr 18 22:08:22 hulk kernel: fork_exit() at fork_exit+0xaa >>>>>>>> Apr 18 22:08:22 hulk kernel: fork_trampoline() at >>>>>>>> fork_trampoline+0xe >>>>>>>> Apr 18 22:08:22 hulk kernel: --- trap 0, rip = 0, rsp = >>>>>>>> 0xffffffffac102d30, >>>>>>>> rbp = 0 --- >>>>>>>> >>>>>>>> Is this information sufficient? If not please let me know what >>>>>>>> more >>>>>>>> is >>>>>>>> required. >>>>>>>> >>>>>>>> Rgds, >>>>>>>> >>>>>>>> Patrick >>>>>>>> >>>>>>> Thanks for the info. Fixing this problem is going to be a royal >>>>>>> pain. >>>>>>> You can probably get around it by disabling WITNESS and INVARIANTS. >>>>>>> >>>>>>> Scott >>>>>> The computer seems to remain working even with the crash. Disabling >>>>>> WINTNESS and INVARIANTS only disables the checking but not the >>>>>> actual >>>>>> problem, is that correct? >>>>>> >>>>>> If you want I can provide you full SSH access to the box to make >>>>>> working >>>>>> on the fix of this problem easier? I am not using this box for >>>>>> anything >>>>>> else than just toying, getting a better understanding. Just let me >>>>>> know. >>>>>> HTH. >>>>>> >>>>> Thanks for the offer. I have tons of hardware, I just didn't think >>>>> to >>>>> check the adaptec drivers on amd64 specifically. On i386 they don't >>>>> trigger the warning (though they do still have the same problem) so I >>>>> didn't notice it. >>>> In case you require it later on, just let me know. It is the least I >>>> can >>>> do after bombarding you with mails ;-) >>>> >>>>>> Also the disk is still detected as only 160.00MB/s, any thought >>>>>> about >>>>>> this? >>>>>> >>>>> I'll look into this as well. Actually, it might be a result of the >>>>> simple domain validation code that was added to 7-current a while >>>>> back. >>>>> DV is both very tricky to implement and very tricky to predict in >>>>> operation, so what you're seeing might be a bug or it might be a >>>>> legitimate problem with your disk or cables. >>>> Ok. The drive is brand new, the cable a bit less. But this is >>>> something >>>> I >>>> can easily test. I got a Maxtor disk spare, I will install 7.0 on that >>>> one. Cable is going to be a bit more cumbersome as I don't have any >>>> spare. >>>> That might have to wait till late next week. >>>> >>>> Rgds, >>>> >>>> Patrick >>>> >>> Regarding the speed problem, can you try the following change? >>> >>> ==== >>> //depot/projects/scottl-camlock/src/sys/dev/aic7xxx/aic79xx_osm.c#18 - >>> /home/scottl/p4/camlock/src/sys/dev/aic7xxx/aic79xx_osm.c ==== >>> @@ -579,7 +579,7 @@ >>> } else { >>> cpi->target_sprt = 0; >>> } >>> - cpi->hba_misc = 0; >>> + cpi->hba_misc = PIM_SEQSCAN; >>> cpi->hba_eng_cnt = 0; >>> cpi->max_target = (ahd->features & AHD_WIDE) ? 15 : 7; >>> cpi->max_lun = AHD_NUM_LUNS_NONPKT - 1; >>> >>> Scott >>> >> Tested, no success. Disk still recognised at 160MB/s and half the Mhz >> (80). >> I have tested 7-Current now with 3 different Disks (all seagate brand) >> and all of them recognised with only half the speed. When I insert the >> 6.2Release they get recognised at 320MB. >> >> Rgds, >> >> Patrick >> > > Ok, I'll work on putting in a knob to turn off DV. I plan to completely > re-write the DV code anyways. Thanks for testing and helping out. I'll > let you know when I have something more to test. > > Scott > Hi Scott, Finally I was able to buy a proper SCSI cable. QUite hard to get one from the normal computershops. Anyways, I still have the same problem. Disks still recognised as 160MB instead of 320. I am also having problems now installing Current on AMD64 with 4GB of memory. Seems to be something with memoryhole. Anyways.. I did a verbose boot and I've put the results on my webserver. http://www.superhero.nl/messages Hope this helps. Also, any suggestions why I get mangled entry panics or after rebuilding world/kernel I am unable to mount the root system? I believe it has something to do with the memory hole, but am not sure.. Rgds, Patrick
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1084.10.202.77.103.1184364683.squirrel>