SyTen
cuda_iszero_cukrn.h File Reference

Cuda isZero kernel implementation header. More...

#include <cstdint>
#include <complex>
+ Include dependency graph for cuda_iszero_cukrn.h:
+ This graph shows which files directly or indirectly include this file:

Namespaces

namespace  syten
 Syten namespace.
 
namespace  syten::CudaDenseTensorImpl
 Implementation namespace for CUDA dense tensors.
 

Functions

void syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const double *inp, int *is_nonzero, void *str)
 Launcher for the CUDA isZero kernel, double version. More...
 
void syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const float *inp, int *is_nonzero, void *str)
 Launcher for the CUDA isZero kernel, float version. More...
 
void syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const std::complex< double > *inp, int *is_nonzero, void *str)
 Launcher for the CUDA isZero kernel, std::complex<double> version. More...
 
void syten::CudaDenseTensorImpl::cuda_iszero_kernel_impl (std::size_t sz, const std::complex< float > *inp, int *is_nonzero, void *str)
 Launcher for the CUDA isZero kernel, std::complex<float> version. More...
 

Variables

constexpr std::size_t cukrn_iszero_threads_per_block = 256
 Number of threads per block to launch for the iszero kernels. More...
 

Detailed Description

Cuda isZero kernel implementation header.