Tidy-up CUDA implementation of IProductWRTDerivBase and add CUDA kernels with additional parallelism (!72) · Merge requests · Nektar / redesign-prototype

Merged Jacques Xing requested to merge CFD-Xing/redesign-prototypes:tidy-up-iproductwrtderivbase into master 1 year ago

This MR tidies up the previous CUDA implementation of the IProductWRTDerivBase operator and introduces new CUDA kernels with additional parallelism across quadrature points.

Edited 1 year ago by Jacques Xing
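
As a rough illustration of what "additional parallelism across quadrature points" can mean, here is a minimal, simplified sketch. It is not the code from this MR: the kernel names, the 1D setting, the single per-element derivative factor `df`, and the data layout are all assumptions made for the example. The first kernel assigns one thread per element and loops serially over modes and quadrature points; the second assigns one block per element and lets the threads of the block share the work over quadrature points before reducing onto the modes.

```cuda
#include <cmath>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical baseline: one thread per element, serial loops over modes (m)
// and quadrature points (q). out[e,m] = sum_q dbasis[m,q] * w[q] * df[e] * in[e,q]
__global__ void perElement(int nElmt, int nm, int nq, const double *dbasis,
                           const double *w, const double *df,
                           const double *in, double *out)
{
    int e = blockIdx.x * blockDim.x + threadIdx.x;
    if (e >= nElmt) return;
    for (int m = 0; m < nm; ++m)
    {
        double sum = 0.0;
        for (int q = 0; q < nq; ++q)
            sum += dbasis[m * nq + q] * w[q] * df[e] * in[e * nq + q];
        out[e * nm + m] = sum;
    }
}

// Hypothetical variant: one block per element. Threads first stride over the
// quadrature points to build the weighted integrand in shared memory, then
// stride over the modes to contract it with the derivative basis.
__global__ void perQuadPoint(int nElmt, int nm, int nq, const double *dbasis,
                             const double *w, const double *df,
                             const double *in, double *out)
{
    extern __shared__ double wsp[]; // nq doubles per block
    int e = blockIdx.x;
    if (e >= nElmt) return; // uniform across the block, so no divergent sync
    for (int q = threadIdx.x; q < nq; q += blockDim.x)
        wsp[q] = w[q] * df[e] * in[e * nq + q];
    __syncthreads();
    for (int m = threadIdx.x; m < nm; m += blockDim.x)
    {
        double sum = 0.0;
        for (int q = 0; q < nq; ++q)
            sum += dbasis[m * nq + q] * wsp[q];
        out[e * nm + m] = sum;
    }
}

int main()
{
    const int nElmt = 64, nm = 4, nq = 5; // arbitrary sizes for the sketch
    double *dbasis, *w, *df, *in, *outA, *outB;
    cudaMallocManaged(&dbasis, nm * nq * sizeof(double));
    cudaMallocManaged(&w, nq * sizeof(double));
    cudaMallocManaged(&df, nElmt * sizeof(double));
    cudaMallocManaged(&in, nElmt * nq * sizeof(double));
    cudaMallocManaged(&outA, nElmt * nm * sizeof(double));
    cudaMallocManaged(&outB, nElmt * nm * sizeof(double));

    // Synthetic data only; none of these values come from the MR.
    for (int i = 0; i < nm * nq; ++i) dbasis[i] = std::cos(0.3 * i);
    for (int q = 0; q < nq; ++q) w[q] = 1.0 / nq;
    for (int e = 0; e < nElmt; ++e) df[e] = 1.0 + 0.01 * e;
    for (int i = 0; i < nElmt * nq; ++i) in[i] = std::sin(0.1 * i);

    perElement<<<(nElmt + 127) / 128, 128>>>(nElmt, nm, nq, dbasis, w, df, in, outA);
    perQuadPoint<<<nElmt, 32, nq * sizeof(double)>>>(nElmt, nm, nq, dbasis, w, df, in, outB);
    if (cudaDeviceSynchronize() != cudaSuccess)
    {
        std::printf("CUDA error\n");
        return 1;
    }

    // The two kernels multiply in a different order, so compare with a tolerance.
    double maxDiff = 0.0;
    for (int i = 0; i < nElmt * nm; ++i)
        maxDiff = std::fmax(maxDiff, std::fabs(outA[i] - outB[i]));
    std::printf("max difference between kernels: %g\n", maxDiff);

    cudaFree(dbasis); cudaFree(w); cudaFree(df);
    cudaFree(in); cudaFree(outA); cudaFree(outB);
    return maxDiff < 1e-12 ? 0 : 1;
}
```

In this sketch the second kernel exposes more threads per element, which may help when the number of elements alone is too small to fill the GPU. Whether that pays off in practice depends on the element sizes and the hardware.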
