Logs for PR #1050 (2026-02-28T01:59:33.711034+00:00):

=== STATUS: Programs executed successfully: main_mandelbrot, main_sum ===
=== main_mandelbrot stdout (exit code: -11 (segfault after execution)) ===
Found 1 GPUs in 0.300292 sec (CUDA: 0.123019 sec, OpenCL: 0.0382075 sec, Vulkan: 0.139006 sec)
Available devices:
Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using OpenCL API...
______________________________________________________
Evaluating algorithm #1/3: CPU
algorithm times (in seconds) - 1 values (min=3.21191 10%=3.21191 median=3.21191 90%=3.21191 max=3.21191)
Mandelbrot effective algorithm GFlops: 3.11342 GFlops
saving image to 'mandelbrot CPU.bmp'...
CPU vs CPU average results difference: 0%
______________________________________________________
Evaluating algorithm #2/3: CPU with OpenMP
OpenMP threads: x4 threads
algorithm times (in seconds) - 10 values (min=0.998009 10%=1.00105 median=1.02332 90%=1.05365 max=1.05365)
Mandelbrot effective algorithm GFlops: 9.7721 GFlops
saving image to 'mandelbrot CPU with OpenMP.bmp'...
CPU with OpenMP vs CPU average results difference: 0%
______________________________________________________
Evaluating algorithm #3/3: GPU
Kernels compilation done in 0.0533816 seconds
algorithm times (in seconds) - 10 values (min=0.00427339 10%=0.00427684 median=0.00428078 90%=0.0577146 max=0.0577146)
Mandelbrot effective algorithm GFlops: 2336.02 GFlops
saving image to 'mandelbrot GPU.bmp'...
GPU vs CPU average results difference: 0.942446%
=== main_sum stdout (exit code: -11 (segfault after execution)) ===
Found 1 GPUs in 0.296002 sec (CUDA: 0.1289 sec, OpenCL: 0.0374102 sec, Vulkan: 0.129638 sec)
Available devices:
Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using OpenCL API...
PCI-E upload times - 3 values (min=0.0412848 10%=0.0412848 median=0.0414234 90%=0.0417029 max=0.0417029) s
PCI-E upload median bandwidth: 8.9932 GB/s
______________________________________________________
Evaluating algorithm #1/6: CPU
algorithm times (in seconds) - 10 values (min=0.034578 10%=0.0346804 median=0.0350704 90%=0.0356046 max=0.0356046)
sum median effective algorithm bandwidth: 10.6223 GB/s
______________________________________________________
Evaluating algorithm #2/6: CPU with OpenMP
algorithm times (in seconds) - 10 values (min=0.0156988 10%=0.0157769 median=0.0162231 90%=0.0165231 max=0.0165231)
sum median effective algorithm bandwidth: 22.9628 GB/s
______________________________________________________
Evaluating algorithm #3/6: 01 atomicAdd from each workItem
Kernels compilation done in 0.0552275 seconds
algorithm times (in seconds) - 10 values (min=0.00275223 10%=0.00275238 median=0.00275404 90%=0.0580801 max=0.0580801)
sum median effective algorithm bandwidth: 135.266 GB/s
______________________________________________________
Evaluating algorithm #4/6: 02 atomicAdd but each workItem loads K values
Kernels compilation done in 0.0409813 seconds
algorithm times (in seconds) - 10 values (min=0.00146241 10%=0.0014625 median=0.001463 90%=0.0425418 max=0.0425418)
sum median effective algorithm bandwidth: 254.634 GB/s
______________________________________________________
Evaluating algorithm #5/6: 03 local memory and atomicAdd from master thread
Kernels compilation done in 0.0447164 seconds
algorithm times (in seconds) - 10 values (min=0.00682616 10%=0.00703404 median=0.00708198 90%=0.0516351 max=0.0516351)
sum median effective algorithm bandwidth: 52.6024 GB/s
______________________________________________________
Evaluating algorithm #6/6: 04 local reduction
Kernels compilation done in 0.0538434 seconds
algorithm times (in seconds) - 10 values (min=0.0142349 10%=0.0142363 median=0.0296506 90%=0.0835936 max=0.0835936)
sum median effective algorithm bandwidth: 12.564 GB/s