| void syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl | ( | std::size_t | sz, |
| const float * | inp, | ||
| int * | is_nonzero, | ||
| void * | str | ||
| ) |
Launcher for the CUDA isZero kernel, float version.
| sz | number of floats in array |
| inp | input array, device memory |
| is_nonzero | a boolean output parameter, true if the array is non-zero, device memory |
| str | CUDA stream in which the computation will take place |