| Back | Top | Next |
Used perl -d:DProf and dprofpp to gather stats on runs
with small files and
large files (100MB to 2GB)
Most time taken in dealing with system calls to the hashing .EXE
Added functionality to deal with archive files based on Magic file info
Overall speed went into the toilet using File::MMagic
dprofpp using magic
One MS MSDN DVD took 30 hours to hash without magic
Same DVD took a week using magic - over 650,000 files
Still using Perl 5.6 at this point, heading to 5.8 to resolve Unicode issues
April 2003 attended a talk by M.J.Dominus
Bottom line: when it's slow, we know why and can offer the user options to trade time for completeness
We always choose completeness - NSRL motto: "You can never have too much overkill."