From owner-soc-status@FreeBSD.ORG Sun Jul 4 16:38:47 2010 Return-Path: Delivered-To: soc-status@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C8165106564A; Sun, 4 Jul 2010 16:38:47 +0000 (UTC) (envelope-from gabor@FreeBSD.org) Received: from server.mypc.hu (server.mypc.hu [87.229.73.95]) by mx1.freebsd.org (Postfix) with ESMTP id 519D88FC1D; Sun, 4 Jul 2010 16:38:47 +0000 (UTC) Received: from server.mypc.hu (localhost [127.0.0.1]) by server.mypc.hu (Postfix) with ESMTP id 0275114DBEE2; Sun, 4 Jul 2010 18:38:46 +0200 (CEST) X-Virus-Scanned: amavisd-new at server.mypc.hu Received: from server.mypc.hu ([127.0.0.1]) by server.mypc.hu (server.mypc.hu [127.0.0.1]) (amavisd-new, port 10024) with LMTP id rNNHXHLPddZw; Sun, 4 Jul 2010 18:38:43 +0200 (CEST) Received: from [192.168.1.105] (catv-80-99-92-167.catv.broadband.hu [80.99.92.167]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by server.mypc.hu (Postfix) with ESMTPSA id 0DAA014DBDDE; Sun, 4 Jul 2010 18:38:43 +0200 (CEST) Message-ID: <4C30B90F.9040703@FreeBSD.org> Date: Sun, 04 Jul 2010 18:38:39 +0200 From: Gabor Kovesdan User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; pt-PT; rv:1.9.1.10) Gecko/20100512 Thunderbird/3.0.5 MIME-Version: 1.0 To: soc-status@freebsd.org References: <4C1BCB96.4040608@FreeBSD.org> <4C21CAF0.2040607@FreeBSD.org> <4C230A0B.3080700@FreeBSD.org> <4C2761B8.9030800@FreeBSD.org> In-Reply-To: <4C2761B8.9030800@FreeBSD.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Cc: =?UTF-8?B?RWR3YXJkIFRvbWFzeiBOYXBpZXJhxYJh?= Subject: Collective resource limits status report #5 X-BeenThere: soc-status@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Summer of Code Status Reports and Discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 04 Jul 2010 16:38:47 -0000 Since last report, I've resolved the problems that caused kernel panic. The linked list handling now seems to be ok, I've checked it many times but somehow still there's a bug somewhere. I store an entry there but later the job isn't found. I've also added some Robert Watson locking (see rwlock(9), last line :) to check if that helps but it caused another kernel panics, so this is commented out at the moment. Now, I'm working on resolving the locking problem and making the lists actually work. I might try debugging with adding some temporary syscalls to access internal kernel structures or printing out something with a sysctl (this was pgj's idea) so that I can figure out what's going wrong. If you have a better idea or you find something strange in the patch, please let me know. Another issue, I've tried to merge my work into p4 but I got strange problems. I wrote to soc-admins@ and perforce-admin@ regarding this, I hope I can also figure it out soon. For the meantime, I provide a full patch, I think it's still easy to read because the main part of the code resides in sys/kern/kern_jobs.c. http://kovesdan.org/patches/jobs_current.diff Gabor