SyTen

◆ cukrn_dot_threads

constexpr std::size_t cukrn_dot_threads = 16

Number of threads per thread block for the dot kernel. In a real-world test, 16 seems to be the optimum for the Tesla P100.
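
A per-block thread count like this is typically used to derive how many thread blocks a launch needs. The sketch below illustrates that arithmetic only. It assumes one thread per element, which may not match how the dot kernel actually distributes work, and `dot_block_count` is a hypothetical helper, not part of SyTen.

```cpp
#include <cstddef>

// Value as documented above.
constexpr std::size_t cukrn_dot_threads = 16;

// Hypothetical helper (not part of SyTen): number of thread blocks needed to
// cover n elements with cukrn_dot_threads threads per block, assuming one
// thread per element. Rounds up so that a partial last block is included.
constexpr std::size_t dot_block_count(std::size_t n) {
    return n / cukrn_dot_threads + (n % cukrn_dot_threads != 0 ? 1 : 0);
}
```

Under that assumption, 1000 elements would need `dot_block_count(1000)` = 63 blocks, with the last block only partially occupied.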