From owner-freebsd-questions Sun May 31 12:23:12 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id MAA03235 for freebsd-questions-outgoing; Sun, 31 May 1998 12:23:12 -0700 (PDT) (envelope-from owner-freebsd-questions@FreeBSD.ORG) Received: from caladan.tdx.co.uk (caladan.tdx.co.uk [195.188.177.4]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id MAA03218 for ; Sun, 31 May 1998 12:22:56 -0700 (PDT) (envelope-from kpielorz@tdx.co.uk) Received: from tdx.co.uk (lorca-tx.tdx.co.uk [195.188.177.242]) by caladan.tdx.co.uk (8.8.8/8.8.8) with ESMTP id UAA01953 for ; Sun, 31 May 1998 20:22:54 +0100 (BST) (envelope-from kpielorz@tdx.co.uk) Message-ID: <3571AE0E.8F9E5AED@tdx.co.uk> Date: Sun, 31 May 1998 20:22:54 +0100 From: Karl Pielorz Organization: TDX X-Mailer: Mozilla 4.05 [en] (WinNT; I) MIME-Version: 1.0 To: questions@FreeBSD.ORG Subject: Removing duplicate files? Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG Hi All, Is there any way (or some hints on a shell script or something) to remove duplicate files in a massive directory tree? Some software I have creates 'message files' with non-descript names, but theres loads (read hundreds) of duplicate messages, all with different filenames... What I need to be able to do is remove all the duplicate files (even though they may not have duplicate file names)... The only thing I could think of was to MD5 the whole lot, sort the list of MD5 checksums and then go nuking... ;-) Unless theres some other utlitiy or something I've not heard of? Regards, Karl Pielorz To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message