This MR add basic vector math operator on CUDA. Unit test compare CUDA results with Nektar++'s Vmath.