
Re: MD5 collisions found - alternative?



On Thu, Aug 26, 2004 at 01:04:21AM +0200, Almut Behrens wrote:
> ...and I think somewhere in between lie hashing functions like crc32,
> as used for detecting transmission errors, for example.  Those are
> not cryptographic, but possess a sufficiently large output space, so we
> can expect few random collisions for most practical purposes.

I wouldn't call a CRC a hash code, although I suppose you can use it
that way. It is really an error-detecting (and, in some variants,
error-correcting) code that does have the ability, in a sense, to go
backwards. It stands for Cyclic Redundancy Check. Such codes are
redundant data included with a transmission, not a hash of it. Some of
them allow correction of multiple bit errors; typically you can detect
one more bit of error than you can correct.
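To make the "redundant data included with the transmission" point
concrete, here is a rough sketch in Python using the standard zlib
module; the send/receive helpers are just made-up names for
illustration, not any real protocol:

import zlib

def send(payload: bytes) -> bytes:
    # Append the 4-byte CRC-32 of the payload; the checksum travels
    # alongside the data rather than replacing it.
    return payload + zlib.crc32(payload).to_bytes(4, "big")

def receive(frame: bytes) -> bytes:
    payload, crc = frame[:-4], int.from_bytes(frame[-4:], "big")
    if zlib.crc32(payload) != crc:
        raise ValueError("CRC mismatch: transmission error detected")
    return payload

frame = send(b"hello, world")
corrupted = bytes([frame[0] ^ 0x01]) + frame[1:]   # flip one bit "in transit"
receive(frame)        # fine
# receive(corrupted)  # would raise ValueError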
 
> Right, but I believe that a uniform mapping _also_ is a desirable
> property (besides speed, of course) of hashing functions as used to
> compute table lookup indices -- as this property assures that the data
> storage locations will be spread as evenly as possible across the
> available buckets, which in turn minimizes the time spent on resolving
> collisions (on average).  And, as practical considerations in this case
> always enforce a rather small output space (i.e. number of buckets)
> we're certainly expecting collisions here. [1]

Well, in theory, yes. In practice you usually aren't much fussed about
some variance in bucket utilization unless you're working on something
with a real need for speed. For example, I would bet there are some
really, really good uniform hash functions used inside of gcc.
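For what it's worth, here is a quick way to eyeball bucket utilization
in Python. The djb2-style string hash below is just an illustrative
toy; I'm not claiming it is what gcc actually uses:

from collections import Counter

def djb2(s: str) -> int:
    # Classic multiply-by-33 string hash, masked to 32 bits.
    h = 5381
    for ch in s:
        h = (h * 33 + ord(ch)) & 0xFFFFFFFF
    return h

NBUCKETS = 64
names = ["symbol_%d" % i for i in range(10000)]
counts = Counter(djb2(name) % NBUCKETS for name in names)

print("average per bucket:", len(names) / NBUCKETS)
print("fullest bucket:    ", max(counts.values()))
print("emptiest bucket:   ", min(counts.values()))

Some buckets end up fuller than others, and for most table lookups
that is perfectly tolerable.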
 
> > 	* randomness. Input strings which differ by 1 bit in any
> > 	  position generate hash keys a random distance apart
> 
> I'd add:
>   * huge size of the output space (with its upper limit corresponding
>     to the number of bits of the hash value).  The probability of
>     accidentally finding a collision is of course directly related to
>     the size of the output space (assuming a uniform mapping).

Yes, can't argue there. That is where the basic difference between a
typical hash function and a cryptographic hash lies. You want small
keyspaces and very simple functions to generate lookup keys, whereas
you don't much care about the function overhead for a cryptographic
hash, as you tend not to compute it nearly as often.
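As a quick illustration of the randomness point quoted above, the
following Python snippet (standard hashlib module) flips one input bit
and counts how many output bits change; for a decent cryptographic
hash it should be roughly half of them:

import hashlib

def bit_diff(a: bytes, b: bytes) -> int:
    # Number of bit positions in which the two digests differ.
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))

msg1 = b"The quick brown fox"
msg2 = bytes([msg1[0] ^ 0x01]) + msg1[1:]   # differs from msg1 by one bit

for name in ("md5", "sha256"):
    h1 = hashlib.new(name, msg1).digest()
    h2 = hashlib.new(name, msg2).digest()
    print("%s: %d of %d output bits differ"
          % (name, bit_diff(h1, h2), len(h1) * 8))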
 
> If anyone knows of any other requirements, please feel free to chime
> in... Well, OTOH, this would probably be getting a little off-topic for
> debian-security (especially the debian aspect).

-- 
------------------------------------------------------
   Dale Amon     amon@islandone.org    +44-7802-188325
       International linux systems consultancy
     Hardware & software system design, security
    and networking, systems programming and Admin
	      "Have Laptop, Will Travel"
------------------------------------------------------
