Re: Bug#656142: ITP: duff -- Duplicate file finder

To: debian-devel@lists.debian.org
Subject: Re: Bug#656142: ITP: duff -- Duplicate file finder
From: Lars Wirzenius <liw@liw.fi>
Date: Tue, 17 Jan 2012 13:17:43 +0000
Message-id: <20120117131743.GA32187@havelock.liw.fi>
In-reply-to: <20120117130510.GO4320@type.bordeaux.inria.fr>
References: <20120116205813.24274.12515.reportbug@localhost6.localdomain6> <20120117091258.GA20971@havelock.liw.fi> <20120117093020.GA4320@type.bordeaux.inria.fr> <20120117104520.GA29095@havelock.liw.fi> <20120117110341.GL4320@type.bordeaux.inria.fr> <87k44qxzej.fsf@mirexpress.internal.placard.fr.eu.org> <20120117130510.GO4320@type.bordeaux.inria.fr>

On Tue, Jan 17, 2012 at 02:05:10PM +0100, Samuel Thibault wrote:
> Roland Mas, on Tue 17 Jan 2012 13:41:23 +0100, wrote:
> > Samuel Thibault, 2012-01-17 12:03:41 +0100 :
> > 
> > [...]
> > 
> > > I'm not sure I understand what you mean exactly. If you have even
> > > just a hundred files of the same size, you will need ten thousand file
> > > comparisons!
> > 
> >   I'm sure that can be optimised.  Read all 100 files in parallel,
> > comparing blocks of similar offset.  You need to perform 99 comparisons
> > on each block for as long as blocks are identical;
> 
> Ah, right. So you'll start writing yet another tool? ;)

I've implemented pretty much that (http://liw.fi/dupfiles), but my
duplicate file finder is not enough better than the existing ones in
Debian that I would inflict it on Debian. The algorithm works nicely,
though, and because it compares file contents directly rather than
checksums, it works even for people who research hash collisions.
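
For illustration, the block-by-block idea quoted above could be
sketched roughly as follows. This is not the dupfiles code, just a
minimal sketch under some assumptions: the names find_duplicates and
split_by_content and the 64 KiB block size are made up for the
example, and every file in a same-size group is kept open at once,
which may hit the file descriptor limit for very large groups. Blocks
are grouped by their actual bytes (the dict compares keys for
equality), so no checksum of the data is ever trusted.

```python
import os
from collections import defaultdict

BLOCK_SIZE = 64 * 1024  # assumed block size, chosen for the example


def split_by_content(paths, block_size=BLOCK_SIZE):
    """Partition files into groups whose contents are byte-identical.

    All files are read in step, one block at a time. A group is split
    whenever its members disagree on a block, and files left alone in
    a group are dropped without being read any further.
    """
    handles = {p: open(p, "rb") for p in paths}
    try:
        identical = []
        pending = [list(paths)]
        while pending:
            group = pending.pop()
            # Bucket the group's members by the next block of each file.
            buckets = defaultdict(list)
            for path in group:
                buckets[handles[path].read(block_size)].append(path)
            for block, members in buckets.items():
                if len(members) < 2:
                    continue  # unique from this offset on, stop reading it
                if block == b"":
                    identical.append(sorted(members))  # reached EOF together
                else:
                    pending.append(members)  # still equal, keep reading
        return identical
    finally:
        for handle in handles.values():
            handle.close()


def find_duplicates(root):
    """Return sorted groups of duplicate regular files found under root."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Symlinks are skipped to keep the sketch simple.
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)
    groups = []
    for paths in by_size.values():
        if len(paths) > 1:  # only same-sized files can be duplicates
            groups.extend(split_by_content(paths))
    return sorted(groups)
```

Calling find_duplicates("/some/dir") would return a list of lists of
paths, one list per set of identical files. Files that differ early
are abandoned after the first differing block, so most of the data in
non-duplicates is never read.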

-- 
Freedom-based blog/wiki/web hosting: http://www.branchable.com/

Follow-Ups:
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Johan Henriksson <mahogny@areta.org>

References:
- Bug#656142: ITP: duff -- Duplicate file finder
  - From: Kamal Mostafa <kamal@whence.com>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Lars Wirzenius <liw@liw.fi>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Samuel Thibault <sthibault@debian.org>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Lars Wirzenius <liw@liw.fi>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Samuel Thibault <sthibault@debian.org>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Roland Mas <lolando@debian.org>
- Re: Bug#656142: ITP: duff -- Duplicate file finder
  - From: Samuel Thibault <sthibault@debian.org>
