Sha256: a4aa932711d7b55a604f5c70c6a386481fdaf9eef7eb65cb1a9ce0ea21fe9409

Contents?: true

Size: 450 Bytes

Versions: 13

Compression:

Stored size: 450 Bytes

Contents

% c_dtype = dtype_to_c_type(dtype)

__kernel void square_<%= dtype %>(const int M, const int N, __global const <%= c_dtype %> *A, __global <%= c_dtype %> *C) {
    // Get the index of the current element to be processed
    const int globalRow = get_global_id(0); // Row ID of C (0..M)
    const int globalCol = get_global_id(1); // Col ID of C (0..N)

    C[globalRow * N + globalCol] = A[globalRow * N + globalCol] * A[globalRow * N + globalCol];
}

Version data entries

13 entries across 13 versions & 2 rubygems

Version Path
tensor_stream-opencl-0.1.3 lib/tensor_stream/opencl/kernels/square.cl
tensor_stream-opencl-0.1.2 lib/tensor_stream/opencl/kernels/square.cl
tensor_stream-opencl-0.1.1 lib/tensor_stream/opencl/kernels/square.cl
tensor_stream-opencl-0.1.0 lib/tensor_stream/opencl/kernels/square.cl
tensor_stream-0.8.1 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.8.0 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.7.0 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.6.1 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.6.0 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.5.1 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.5.0 lib/tensor_stream/evaluator/opencl/kernels/square.cl
tensor_stream-0.4.1 lib/tensor_stream/evaluator/kernels/square.cl
tensor_stream-0.4.0 lib/tensor_stream/evaluator/kernels/square.cl