[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: finding similar files



On Wed, 25 Feb 2009 18:58:48 +0000, Hendrik Boom wrote:

> There wouldn't happen to be any handy tools for searching a directory
> tree with a few hundred ASCII files and telling me which ones have
> similar content?

Yes, such tools exist. They are called AI tools, Artificial Intelligent.  
Debian has many native AI packages but I doubt that you'll ever want to 
touch them, because they all need extensive training.

> What I'm looking for is something that will give me a first cut on
> finding those pairs of files.

As said before, no easy solution for none-easy task, unless you name them 
wisely, i.e., give similar files similar names. If so, check out

http://cpansearch.perl.org/src/SUNTONG/File-FindSimilars-2.06/README.html

It's super fast to give you the first cut.

-- 
Tong (remove underscore(s) to reply)
  http://xpt.sourceforge.net/techdocs/
  http://xpt.sourceforge.net/tools/


Reply to: