[Beowulf] Opinions of Hyper-threading?
Mark Hahn
hahn at mcmaster.ca
Thu Feb 28 12:45:52 PST 2008
> How do your Rate numbers correlate to the max bandwidth of 32GB/s
> (http://en.wikipedia.org/wiki/GeForce_8_Series)?
good point. I had assumed the quoted numbers were merely in-cache,
but the run claims an array size of 2e6 elements (8e6 bytes per array
in single precision), which seems a bit large for in-cache (though
very small for a STREAM run).
>> http://forums.nvidia.com/index.php?showtopic=52686
this quotes a plausible 64-65 GB/s on a C870 (76.8 peak theoretical).
>> Running this on my 8600 card I get:
>>
>> STREAM Benchmark implementation in CUDA
>> Array size (single precision)=2000000
>> using 128 threads per block, 15625 blocks
>> Function Rate (MB/s) Avg time Min time Max time
>> Copy: 291777.6696 0.0001 0.0001 0.0001
>> Scale: 291777.6696 0.0001 0.0001 0.0001
>> Add: 437666.5043 0.0001 0.0001 0.0001
>> Triad: 437666.5043 0.0001 0.0001 0.0001
this is implausible: the reported rates are roughly 9-14x the 32GB/s
peak mentioned above. my guess is the timing code is broken.
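
a quick sanity check on the quoted table, as a sketch only. it assumes the usual STREAM accounting (Copy and Scale touch two arrays, Add and Triad touch three), 4-byte floats, and that "MB/s" means 1e6 bytes per second; none of that is confirmed for this particular CUDA port, and the names below are just illustrative. under those assumptions all four kernels work out to the same elapsed time of about 55 microseconds, which is what you would expect if the timer were measuring something other than the kernel itself.

```python
# Back out the elapsed time implied by each reported STREAM rate and
# compare it with the fastest time the quoted 32 GB/s peak would allow.
# Assumptions (not confirmed by the post): standard STREAM byte counting,
# 4-byte floats, MB = 1e6 bytes.

N = 2_000_000                 # array size from the quoted output
BYTES_PER_FLOAT = 4           # single precision
PEAK_BYTES_PER_S = 32e9       # peak bandwidth figure cited in the thread

# Number of arrays read plus written by each kernel.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Rates copied from the quoted benchmark output, in MB/s.
REPORTED_MBPS = {
    "Copy": 291777.6696,
    "Scale": 291777.6696,
    "Add": 437666.5043,
    "Triad": 437666.5043,
}


def bytes_moved(kernel):
    """Total bytes the kernel must read and write for one pass."""
    return ARRAYS_TOUCHED[kernel] * N * BYTES_PER_FLOAT


def implied_seconds(kernel):
    """Elapsed time that would produce the reported rate."""
    return bytes_moved(kernel) / (REPORTED_MBPS[kernel] * 1e6)


def min_seconds_at_peak(kernel):
    """Fastest possible time if memory ran at the quoted peak."""
    return bytes_moved(kernel) / PEAK_BYTES_PER_S


if __name__ == "__main__":
    for kernel in ARRAYS_TOUCHED:
        print(
            f"{kernel:6s} implied {implied_seconds(kernel) * 1e6:8.2f} us, "
            f"floor at peak {min_seconds_at_peak(kernel) * 1e6:8.2f} us"
        )
```

identical implied times across kernels that move different amounts of data suggest the measurement is dominated by something fixed, for example timer granularity or timing only the kernel launch rather than its completion. that is a guess from the arithmetic, not something the quoted output proves.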