Cuda isZero kernel implementation header. More...
#include <cstdint>
#include <complex>
Namespaces | |
namespace | syten |
Syten namespace. | |
namespace | syten::CudaDenseTensorImpl |
Implementation namespace for CUDA dense tensors. | |
Functions | |
void | syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const double *inp, int *is_nonzero, void *str) |
Launcher for the CUDA isZero kernel, double version. More... | |
void | syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const float *inp, int *is_nonzero, void *str) |
Launcher for the CUDA isZero kernel, float version. More... | |
void | syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const std::complex< double > *inp, int *is_nonzero, void *str) |
Launcher for the CUDA isZero kernel, std::complex<double> version. More... | |
void | syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const std::complex< float > *inp, int *is_nonzero, void *str) |
Launcher for the CUDA isZero kernel, std::complex<float> version. More... | |
Variables | |
constexpr std::size_t | cukrn_iszero_threads_per_block = 256 |
Number of threads per block to launch for the iszero kernels. More... | |
Cuda isZero kernel implementation header.