for optimal results on the GF100 i believe you can use -t 1024 and the usual -b shaders/8 , seems to be working wonders for me

I'm gaining about 120 M more compared to -t 512
These are the rough numbers using -t 1024 -b 56 for my GTX470
NT: 1.1 Billion
MD5: 850 M
Ntb i guess bitweasil.
Oh i remember spotting a bug with the -u function can't remember what it was but kept on causing it to crash, it was the way the chars were arranged.
So i guess for other readers to achieve optimal results copy and paste the following
GTX480 users-t 1024 -b 60 -m 1000 -l
GTX470 users-t 1024 -b 56 -m 1000 -l