https://bugs.freedesktop.org/show_bug.cgi?id=106631
--- Comment #6 from Ricardo Ribalda ricardo.ribalda@gmail.com --- Eventhough it is not comparable, for reference: this is the result with fgrlx.
root@qt5022:~# time clpeak
Platform: AMD Accelerated Parallel Processing Device: AMD G-T56N Processor Driver version : 1800.8 (sse2) (Linux x64) Compute units : 2 Clock frequency : 530 MHz
Global memory bandwidth (GBPS) float : 0.80 float2 : 1.12 float4 : 1.09 float8 : 1.31 float16 : 1.34
Single-precision compute (GFLOPS) float : 0.59 float2 : 1.16 float4 : 2.32 float8 : 4.42 float16 : 0.85
No half precision support! Skipped
Double-precision compute (GFLOPS) double : 0.43 double2 : 0.84 double4 : 1.46 double8 : 1.41 double16 : 0.28
Integer compute (GIOPS) int : 0.73 int2 : 0.30 int4 : 0.35 int8 : 0.40 int16 : 0.32
Transfer bandwidth (GBPS) enqueueWriteBuffer : 1.29 enqueueReadBuffer : 1.08 enqueueMapBuffer(for read) : 3591.11 memcpy from mapped ptr : 0.98 enqueueUnmap(after write) : 15339.17 memcpy to mapped ptr : 0.99
Kernel launch latency : 64.41 us
real 8m12.337s user 12m51.022s sys 0m29.840s