Commits on Source (2)
-
4650e118
-
Chris Cantwell authored
Implement CUDA PhysDeriv sum-factorization kernels See merge request !13
d05e90c5
Showing
- CMakeLists.txt 2 additions, 2 deletionsCMakeLists.txt
- Field.hpp 38 additions, 31 deletionsField.hpp
- Operators/BwdTrans/BwdTransCUDA.hpp 1 addition, 7 deletionsOperators/BwdTrans/BwdTransCUDA.hpp
- Operators/Operator.hpp 44 additions, 0 deletionsOperators/Operator.hpp
- Operators/OperatorHelper.cuh 102 additions, 10 deletionsOperators/OperatorHelper.cuh
- Operators/OperatorPhysDeriv.cpp 9 additions, 0 deletionsOperators/OperatorPhysDeriv.cpp
- Operators/OperatorPhysDeriv.hpp 57 additions, 0 deletionsOperators/OperatorPhysDeriv.hpp
- Operators/PhysDeriv/PhysDerivCUDA.cu 10 additions, 0 deletionsOperators/PhysDeriv/PhysDerivCUDA.cu
- Operators/PhysDeriv/PhysDerivCUDA.hpp 295 additions, 0 deletionsOperators/PhysDeriv/PhysDerivCUDA.hpp
- Operators/PhysDeriv/PhysDerivCUDAKernels.cuh 579 additions, 0 deletionsOperators/PhysDeriv/PhysDerivCUDAKernels.cuh
- Operators/PhysDeriv/PhysDerivImpl.cpp 13 additions, 0 deletionsOperators/PhysDeriv/PhysDerivImpl.cpp
- Operators/PhysDeriv/PhysDerivStdMat.hpp 170 additions, 0 deletionsOperators/PhysDeriv/PhysDerivStdMat.hpp
- main.cpp 176 additions, 1 deletionmain.cpp
- tests/CMakeLists.txt 34 additions, 0 deletionstests/CMakeLists.txt
- tests/init_fields.hpp 25 additions, 10 deletionstests/init_fields.hpp
- tests/test_bwdtrans.cpp 4 additions, 2 deletionstests/test_bwdtrans.cpp
- tests/test_bwdtranscuda.cpp 4 additions, 2 deletionstests/test_bwdtranscuda.cpp
- tests/test_ipwrtbase.cpp 4 additions, 2 deletionstests/test_ipwrtbase.cpp
- tests/test_ipwrtbasecuda.cpp 4 additions, 2 deletionstests/test_ipwrtbasecuda.cpp
- tests/test_physderiv.cpp 73 additions, 0 deletionstests/test_physderiv.cpp
Operators/OperatorPhysDeriv.cpp
0 → 100644
Operators/OperatorPhysDeriv.hpp
0 → 100644
Operators/PhysDeriv/PhysDerivCUDA.cu
0 → 100644
Operators/PhysDeriv/PhysDerivCUDA.hpp
0 → 100644
Operators/PhysDeriv/PhysDerivCUDAKernels.cuh
0 → 100644
Operators/PhysDeriv/PhysDerivImpl.cpp
0 → 100644
Operators/PhysDeriv/PhysDerivStdMat.hpp
0 → 100644
tests/test_physderiv.cpp
0 → 100644