Attached.
Please test it out.
NOTE: The defaults SUCK right now for thread/block count.
If you’re on ATI, you’ll want to use:
--threads 256 --blocks 256 -m 100 --amd
Seriously. Otherwise performance blows.
Apart from that, usage is the same as the other gen tools.
Right now, it is MD5/NTLM *ONLY* – no SHA1 (yet).
Needs an OpenCL runtime and libnuma (apparently a requirement for the Intel OpenCL runtime…)
It should work on CPUs as well – just don’t use "--amd" and pass something like "--threads 8 --blocks 1" or so. I think it works with both the AMD and Intel OpenCL runtimes.
I expect plenty of bugs/glitches… but I think it’s correct.
Also, it should generate files that are byte-for-byte identical to the CUDA version's output. If it does not (with the same --seed), it IS A BUG. Please report it!
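
If you want a quick way to check the byte-for-byte claim, something like the Python sketch below should do. It is not part of the tool, just a rough helper: the function name and the file names in the usage comment are made up for illustration, and it assumes you have already generated one table with the OpenCL build and one with the CUDA build using the same seed.

```python
def first_difference(path_a, path_b, chunk_size=1 << 20):
    """Return the offset of the first differing byte, or None if the files match."""
    offset = 0
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        while True:
            a = fa.read(chunk_size)
            b = fb.read(chunk_size)
            if a != b:
                # Locate the first mismatching byte inside this chunk.
                for i, (x, y) in enumerate(zip(a, b)):
                    if x != y:
                        return offset + i
                # No mismatch in the shared part: one file is simply shorter.
                return offset + min(len(a), len(b))
            if not a:
                # Both files hit EOF at the same point with no differences.
                return None
            offset += len(a)


# Usage (hypothetical file names, substitute your own output files):
# diff_at = first_difference("opencl_table.out", "cuda_table.out")
# print("identical" if diff_at is None else f"first difference at byte {diff_at}")
```

If it reports a difference, the offset is probably worth including in the bug report.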
