On a 1.3Ghz Celeron it takes around 20 seconds to generate a MD5 hash for 226MB of files. Assuming a worse case of a full 120GB player than it will take at around 3 hours to compute the hashes on that. It's definately going to be an optional feature so people that don't have the time or processing power to turn off.

Maybe just doing a quick CRC on the first few k of the file and combining that with the file size would be sufficient?

On my player I've only got about 20 duplicates for files with the same size out of about 3000 tracks.

It's more risky doing this since the chance of a collision are significantly greater than doing a MD5 hash on the file.

- Trevor