Commits on Source (2)
-
d6be5f9f
-
Chris Cantwell authored
Implement CUDA IProductWRTBase sum-factorization kernels See merge request !12
554abc7b
Showing
- CMakeLists.txt 1 addition, 1 deletionCMakeLists.txt
- Operators/IProductWRTBase/IProductWRTBaseCUDA.cu 10 additions, 0 deletionsOperators/IProductWRTBase/IProductWRTBaseCUDA.cu
- Operators/IProductWRTBase/IProductWRTBaseCUDA.hpp 306 additions, 0 deletionsOperators/IProductWRTBase/IProductWRTBaseCUDA.hpp
- Operators/IProductWRTBase/IProductWRTBaseCUDAKernels.cuh 622 additions, 0 deletionsOperators/IProductWRTBase/IProductWRTBaseCUDAKernels.cuh
- Operators/IProductWRTBase/IProductWRTBaseStdMat.hpp 4 additions, 43 deletionsOperators/IProductWRTBase/IProductWRTBaseStdMat.hpp
- Operators/Operator.hpp 54 additions, 0 deletionsOperators/Operator.hpp
- Operators/OperatorHelper.cuh 61 additions, 0 deletionsOperators/OperatorHelper.cuh
- main.cpp 54 additions, 0 deletionsmain.cpp
- tests/CMakeLists.txt 26 additions, 5 deletionstests/CMakeLists.txt
- tests/test_bwdtranscuda.cpp 2 additions, 2 deletionstests/test_bwdtranscuda.cpp
- tests/test_ipwrtbasecuda.cpp 67 additions, 0 deletionstests/test_ipwrtbasecuda.cpp
This diff is collapsed.
tests/test_ipwrtbasecuda.cpp
0 → 100644