(I see that someone else in this forum have experienced the same problem with a Pascal GPU.) I am using the CUDA 7.5 binary file. When I run CUDA simulations for small enough grid sizes it works fine, but when I try to run it for a 400x400x400 grid, I get a "Error : Failed to execute an cuFFT on the GPU for Execute_FFT_3D_R2C."
Regarding memory requirements, the CUDA card should have about 12 GB of memory, and the script prints "Current host memory in use: 4514MB. Current device memory in use: 4957MB".
I use the following CUDA version: "NVIDIA (R) Cuda compiler driver, Cuda compilation tools, release 7.5, V7.5.17". The CUDA card is a Tesla K40c which has 11439MiB of memory.
Do you know what might be the problem here?
Thanks in advance,