I've done some basic performance tests against:
- Scala Actor-based parallel Widefinder
- Scala serial Widefinder using a BufferedReader
- Tim Bray's Ruby Widefinder
The test platform was my 2.2 GHz MacBook with 4 GB of RAM, and the input was a 6-million-line file. The times were as follows:
Scala Parallel:
- real 0m14.588s
- user 0m24.541s
- sys 0m1.383s
Scala Serial:
- real 0m20.095s
- user 0m18.821s
- sys 0m1.441s
Ruby:
- real 0m14.301s
- user 0m12.485s
- sys 0m1.813s
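One quick way to read these numbers: `time` reports wall-clock time as `real` and CPU time as `user` plus `sys`, so dividing CPU time by wall-clock time gives a rough count of how many cores were kept busy. Here is a minimal sketch of that arithmetic for the Scala Parallel run. The names `Timing` and `cpuUtilization` are made up for illustration and are not part of the Widefinder code.

```scala
// Hypothetical helper for illustration only; not part of the Widefinder source.
// All values are in seconds, as reported by `time`.
final case class Timing(real: Double, user: Double, sys: Double) {
  // Total CPU time divided by wall-clock time: the average number of cores busy.
  def cpuUtilization: Double = (user + sys) / real
}

object TimingCheck {
  def main(args: Array[String]): Unit = {
    // Figures copied from the Scala Parallel run above.
    val scalaParallel = Timing(real = 14.588, user = 24.541, sys = 1.383)
    println(f"Scala Parallel: ${scalaParallel.cpuUtilization}%.2f cores busy on average")
  }
}
```

For the parallel run this works out to roughly 1.8, which is what you would expect when more than one core is doing real work. A value at or below 1 would suggest the run was effectively single-threaded.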
For those who are curious, the source code for the parallel Widefinder is available here:
- the parallel IO code
- the actual Widefinder code