From owner-freebsd-questions@FreeBSD.ORG Tue Jul 19 09:06:30 2011 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2859C106566B for ; Tue, 19 Jul 2011 09:06:30 +0000 (UTC) (envelope-from cpghost@cordula.ws) Received: from mail-vx0-f182.google.com (mail-vx0-f182.google.com [209.85.220.182]) by mx1.freebsd.org (Postfix) with ESMTP id DFC2B8FC14 for ; Tue, 19 Jul 2011 09:06:29 +0000 (UTC) Received: by vxg33 with SMTP id 33so3847231vxg.13 for ; Tue, 19 Jul 2011 02:06:29 -0700 (PDT) MIME-Version: 1.0 Received: by 10.52.96.193 with SMTP id du1mr6017241vdb.206.1311066389034; Tue, 19 Jul 2011 02:06:29 -0700 (PDT) Received: by 10.220.100.73 with HTTP; Tue, 19 Jul 2011 02:06:28 -0700 (PDT) X-Originating-IP: [93.221.179.252] In-Reply-To: <89EB5E14-AA8E-4265-9C5D-22641ECC1C37@my.gd> References: <201107190549.p6J5n6sP028960@mail.r-bonomi.com> <4E252119.3030208@esiee.fr> <89EB5E14-AA8E-4265-9C5D-22641ECC1C37@my.gd> Date: Tue, 19 Jul 2011 11:06:28 +0200 Message-ID: From: "C. P. Ghost" To: Damien Fleuriot Content-Type: text/plain; charset=ISO-8859-1 Cc: Frank Bonnet , "freebsd-questions@freebsd.org" Subject: Re: Tools to find "unlegal" files ( videos , music etc ) X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 19 Jul 2011 09:06:30 -0000 On Tue, Jul 19, 2011 at 8:55 AM, Damien Fleuriot wrote: > On 19 Jul 2011, at 08:15, Frank Bonnet wrote: >> In France it's illegal and I have my boss's instruction : >> >> - find and delete the files that's all. > > Bon courage then... > > A file can not be illegal per se, so you won't be able to detect > these by looking up names or contents. > Even then, if a file is labeled as personal, privacy protection > applies and it is *unlawful* for you to process it. > (That is in the same way that your employer is strictly forbidden > from peeking inside your email messages clearly labeled as personal, > even if they were received on your work mailbox.) Exactly! Speaking with my university sysadmin hat on: you're NOT allowed to peek inside personal files of your users, UNLESS the user has waived his/her rights to privacy by explicitly agreeing to the TOS and there's legal language in the TOS that allows staff to inspect files (and then staff needs to abide by those rules in a very strict and cautious manner). So unless the TOS are very explicit, a sysadmin or an IT head can get in deep trouble w.r.t. privacy laws. > You may want to look for files that are unusually large. > They could possibly be ISOs, dvdrips, HD movie dumps... Not to forget encrypted RAR files (which btw. could contain anything, including legitimate content, so be careful here). > We have the same problem here with users sharing movies on the file > servers, and what makes it worse is some of their movie files are > legit because they're, for example, official trailers that are > reworked and redistributed to our customers. > > You won't win this, tell your boss it can not be done. What can technically be done is that the copyright owner provides a list of hashes for his files, and requests that you traverse your filesystems, looking for files that match those hashes. AND, even then, all you can do is flag the files, and you'll have to check with the user that he/she doesn't own a license permitting him/her to own that file! However, even that isn't foolproof: nothing prevents a user from flipping a bit or two, rescaling, resampling, splitting the files into multiple files in a non-obvious manner, adding random bytes at the end etc...: the result would still be infringing, but can't be detected automatically (at least not in a reasonable amount of time). Better talk with your users and resolve the problem using non-technical means. Inventive users WILL always outsmart any technical solution that you implement: this is a race you absolutely can't win. -cpghost. -- Cordula's Web. http://www.cordula.ws/