Media Summary: Host to device transfer speeds, local memory. Optimizing the reduction kernel for data access (coalescing).
Opencl Runtime Architecture 6 - Detailed Analysis & Overview
Host to device transfer speeds, local memory. Optimizing the reduction kernel for data access (coalescing).