Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Optimize Jacobi eigensolver (WIP) (#847)
* use Amat, Jmat * change order of Amat, Jmat * snapshot before changing args to offd_rotate * use Jsh shared memory * development snapshot * debug snapshot * option to use Jsh in offd_rotate * use lmemsizeOR * change launch config for offd_rotate * snapshot use Jsh for offd_kernel * generalized offd_kernel * update diag_rotate * minor update to comments * update gridOR, threadsOR for offd_rotate * add option to use original offd_rotate() * use original algorithm for small problems * update idx2D for int32_t, add commonly used ceil function * use std::min, std::max, remove lambda idx2D * remove host idx2D * update with improvement to syevj * remove lambda for ceil() * add __device__ for ceil()
- Loading branch information