|
inline |
Reordering scalar product of a CUDA dense tensor and a standard dense tensor.
If either tensor is larger than SYTEN_CUDA_THRESHOLD, the calculation is done on the GPU; otherwise, it is done on the host.
References syten::EnvVars::cuda_size_to_gpu and syten::CudaDenseTensorImpl::CudaDenseTensor< rank, Scalar >::size().
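The size-based dispatch described above can be illustrated with a minimal sketch. This is not the library's implementation: `Backend`, `choose_backend`, and its parameters are hypothetical names introduced here, and the threshold is passed in explicitly rather than read from `syten::EnvVars::cuda_size_to_gpu`. It only assumes that "larger than" means a strict comparison of element counts.

```cpp
#include <cstddef>

// Hypothetical tag for the two execution paths (not part of the library).
enum class Backend { Host, Gpu };

// Hypothetical helper: select the GPU path if either operand has more
// elements than the threshold, otherwise select the host path.
// `cuda_size` stands in for CudaDenseTensor::size(), `dense_size` for the
// element count of the standard dense tensor, and `threshold` for the
// value of SYTEN_CUDA_THRESHOLD.
inline Backend choose_backend(std::size_t cuda_size,
                              std::size_t dense_size,
                              std::size_t threshold) {
    const bool use_gpu = (cuda_size > threshold) || (dense_size > threshold);
    return use_gpu ? Backend::Gpu : Backend::Host;
}
```

Under this reading, two tensors whose sizes both equal the threshold exactly would still be handled on the host, since neither is strictly larger than it.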