Speed War :)

So I did some optimizations, and yours is about 6.5% faster on a compute capability 1.1 card (probably the same for all 1.x), 24% faster on a compute capability 2.1 card, and probably somewhere in between for compute capability 2.0.
md5_loweralpha-numeric#1-7
9800 GTX+ (128 cores, 1836 MHz, compute capability 1.1)
  330 MLinks/sec  1.065x  CryptoHaze (generation)
  310 MLinks/sec  1.00x   mine (generation)
  240 MLinks/sec  0.77x   rcrack's GPU version (pre-work 100k)
   82 MLinks/sec  0.26x   rcracki_mt 0.7b (pre-work 100k)

GTS 450 (192 cores, 1566 MHz, compute capability 2.1)
  360 MLinks/sec  1.24x   CryptoHaze (generation)
  290 MLinks/sec  1.00x   mine (generation)
  200 MLinks/sec  0.69x   rcrack's GPU version (pre-work 100k)
   91 MLinks/sec  0.31x   rcracki_mt 0.7b (pre-work 100k)
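
In case anyone wants to check the multipliers or add their own card, here is a rough sketch of how the relative numbers above come out: each rate is divided by my generation rate on the same card. The dictionary layout and the names `RESULTS` and `relative_speeds` are made up for this example; only the MLinks/sec figures come from the runs above.

```python
# Throughput in MLinks/sec, copied from the benchmark figures above.
# The structure and names here are illustrative, not part of any tool.
RESULTS = {
    "9800 GTX+": {
        "CryptoHaze": 330,
        "mine": 310,
        "rcrack GPU": 240,
        "rcracki_mt 0.7b": 82,
    },
    "GTS 450": {
        "CryptoHaze": 360,
        "mine": 290,
        "rcrack GPU": 200,
        "rcracki_mt 0.7b": 91,
    },
}


def relative_speeds(rates, baseline="mine"):
    """Return each implementation's rate divided by the baseline's rate."""
    base = rates[baseline]
    return {name: rate / base for name, rate in rates.items()}


if __name__ == "__main__":
    for card, rates in RESULTS.items():
        print(card)
        for name, ratio in relative_speeds(rates).items():
            print(f"  {rates[name]:>4} MLinks/sec  {ratio:.2f}x  {name}")
```

Swapping the `baseline` argument (for example to `"CryptoHaze"`) gives the same comparison from the other side.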