Modified cmake to use CUDA_TOOLKIT_INCLUDE folder for headers

3 jobs for master in 1 minute and 10 seconds (queued for 1 second)
latest