Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 04 Jul 2022 17:20:16 +0000
From:      bugzilla-noreply@freebsd.org
To:        ports-bugs@FreeBSD.org
Subject:   [Bug 264950] New port: misc/dedup
Message-ID:  <bug-264950-7788-9FLZFMdbrE@https.bugs.freebsd.org/bugzilla/>
In-Reply-To: <bug-264950-7788@https.bugs.freebsd.org/bugzilla/>
References:  <bug-264950-7788@https.bugs.freebsd.org/bugzilla/>

next in thread | previous in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D264950

--- Comment #5 from Oleg Borodin <borodin@unix7.org> ---

(In reply to Daniel Engberg from comment #4)

sysutils/duff - uses SHA1 to SHA256
sysutils/dupd - uses xxHash
sysutils/fdupes - uses MD5=20
sysutils/filedupe - uses MD5
sysutils/jdupes - uses xxHash
sysutils/rmlint - one utility which can use the HighwayHash algorithm, but =
is
overloaded with functionality and dependencies.

I used one of the fastest HighwayHash which is comparable to xxHash in spee=
d,
but as far as I understand from tests it has less chance of collisions even
with unchanged initial vector (I could be wrong here).

All you need to do to find files with duplicate content is to hash sum the
content and compare against a database in the file tree, and print the resu=
lt
(of course before doing so check that the file sizes match, because that's =
easy
to do).

But if you think this port duplicates the functionality of the others and
doesn't make sense to add it, so be it.

Perhaps it makes sense to close this ticket.

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-264950-7788-9FLZFMdbrE>