Implement CUDA PhysDeriv sum-factorization kernels

CUDA kernels for the PhysDeriv operation have been implemented for all element types (seg, quad, tri, hex, tet, prism, and pyr). All implementations are based on the existing SIMD matrix-free version.

To avoid repeated copies from CPU to GPU, the derivative data are copied to the GPU once, in the constructor.

Edited by Jacques Xing