<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>k-Wave User Forum &#187; Topic: Run CUDA simulation on GPU with Pascal architecture</title>
		<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture</link>
		<description>Support for the k-Wave MATLAB toolbox</description>
		<language>en-US</language>
		<pubDate>Tue, 12 May 2026 23:01:48 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.2</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.k-wave.org/forum/search.php</link>
		</textInput>
		<atom:link href="http://www.k-wave.org/forum/rss/topic/run-cuda-simulation-on-gpu-with-pascal-architecture" rel="self" type="application/rss+xml" />

		<item>
			<title>Jiri Jaros on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-6823</link>
			<pubDate>Thu, 04 Apr 2019 13:40:12 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">6823@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi both,&#60;br /&#62;
I've just answered this question here:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://www.k-wave.org/forum/topic/running-on-aws-tesla-instances#post-6815&#34; rel=&#34;nofollow&#34;&#62;http://www.k-wave.org/forum/topic/running-on-aws-tesla-instances#post-6815&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Simply put, you have to recompile the code with the latest CUDA version.&#60;/p&#62;
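&#60;p&#62;To illustrate why a rebuild is needed, here is a minimal Python sketch. The table and the helper name can_build_native are assumptions made for this example and are not part of k-Wave. It records the first CUDA toolkit release that can generate native code for each GPU architecture and checks a toolkit version against it, using the CUDA 7.5 build mentioned earlier in this thread.&#60;/p&#62;

```python
# First CUDA toolkit release able to generate native code for each architecture.
# Illustrative table only; extend it for other architectures as needed.
MIN_TOOLKIT = {
    "Maxwell": (6, 0),  # compute capability 5.x
    "Pascal": (8, 0),   # compute capability 6.x
    "Volta": (9, 0),    # compute capability 7.0
}


def can_build_native(toolkit, arch):
    """Return True if the (major, minor) toolkit version can target arch natively."""
    return toolkit >= MIN_TOOLKIT[arch]


# A binary built with CUDA 7.5, checked against each architecture.
for arch in MIN_TOOLKIT:
    print(arch, can_build_native((7, 5), arch))
```

&#60;p&#62;With a 7.5 toolkit this reports Maxwell as supported and Pascal and Volta as unsupported, which is consistent with the error appearing only on the newer cards.&#60;/p&#62;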
&#60;p&#62;Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>qjk on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-6818</link>
			<pubDate>Tue, 02 Apr 2019 03:09:35 +0000</pubDate>
			<dc:creator>qjk</dc:creator>
			<guid isPermaLink="false">6818@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;DLam, I received exactly the same error message when trying to run on a Tesla V100 on AWS. The exact same code and data file ran perfectly on an AWS Tesla K80 instance.&#60;/p&#62;
&#60;p&#62;I was using Ubuntu 18 for this.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>DLam on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-6817</link>
			<pubDate>Mon, 01 Apr 2019 17:00:57 +0000</pubDate>
			<dc:creator>DLam</dc:creator>
			<guid isPermaLink="false">6817@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Brad, Jiri,&#60;/p&#62;
&#60;p&#62;Sorry for necro'ing, but this thread seems the most relevant.&#60;br /&#62;
I'm getting the same error: &#34;Error : All CUDA-capable devices are busy or unavailable&#34;.&#60;/p&#62;
&#60;p&#62;My cluster's GPU is a Tesla V100 (Volta architecture) running Linux (CentOS 7). I'm wondering if it's a build problem on my side as well?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Jiri Jaros on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5818</link>
			<pubDate>Fri, 20 Jan 2017 09:54:13 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">5818@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Julien,&#60;br /&#62;
I ran a couple of tests and it appears the error only occurs on Pascal GPUs. It is likely caused by the fact that the code was compiled with CUDA 7.5, whereas Pascal is natively supported only from CUDA 8.0.&#60;/p&#62;
&#60;p&#62;I will have to take a closer look at the problem next week. &#60;/p&#62;
&#60;p&#62;Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Julien S. on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5817</link>
			<pubDate>Thu, 19 Jan 2017 12:10:25 +0000</pubDate>
			<dc:creator>Julien S.</dc:creator>
			<guid isPermaLink="false">5817@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Jiri,&#60;/p&#62;
&#60;p&#62;My Titan X GPU theoretically has 12288MB of total memory and 12189MB of total dedicated memory (according to nvidia-settings). I'm running nonlinear simulations in a homogeneous medium. I ran some tests to find the largest grid size I can reach and the corresponding computational resources displayed by k-Wave when the CUDA code initializes.&#60;/p&#62;
&#60;p&#62;For a grid of total size (PML included) 960 x 240 x 240 the code runs and I obtain:&#60;br /&#62;
- Current host memory in use : 4029MB&#60;br /&#62;
- Current device memory in use : 4352MB&#60;/p&#62;
&#60;p&#62;For a grid of size 1160 x 240 x 240 the code runs and I obtain:&#60;br /&#62;
- Current host memory in use : 4800MB&#60;br /&#62;
- Current device memory in use : 5212MB&#60;/p&#62;
&#60;p&#62;For a grid of size 1360 x 240 x 240 the error message appears and I obtain:&#60;br /&#62;
- Current host memory in use : 5569MB&#60;br /&#62;
- Current device memory in use : 6074MB&#60;/p&#62;
&#60;p&#62;For bigger gridsizes the code does not run.&#60;/p&#62;
&#60;p&#62;If the FFT takes another 1GB to run the simulation, my 12000MB GPU should still have enough memory to run a bigger simulation, shouldn't it? It seems that the code no longer runs once I exceed 50% of the dedicated memory (i.e. 6000MB).&#60;/p&#62;
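&#60;p&#62;As a rough cross-check of the numbers above, here is a minimal Python sketch. It assumes that the reported MB are MiB and that memory use scales linearly with the number of grid points. The helper name bytes_per_point is made up for this example.&#60;/p&#62;

```python
# Figures copied from the runs above: (Nx, Ny, Nz) and reported device memory.
# Assumptions: "MB" means MiB, and usage scales linearly with grid points.
TOTAL_MB = 12189  # dedicated memory reported by nvidia-settings

runs = [
    ((960, 240, 240), 4352),
    ((1160, 240, 240), 5212),
    ((1360, 240, 240), 6074),
]


def bytes_per_point(grid, device_mb):
    """Average device bytes per grid point for one run."""
    nx, ny, nz = grid
    return device_mb * 1024 * 1024 / (nx * ny * nz)


per_point = [bytes_per_point(grid, mb) for grid, mb in runs]
used_fraction = [mb / TOTAL_MB for _, mb in runs]

for (grid, mb), bpp, frac in zip(runs, per_point, used_fraction):
    print(grid, f"{bpp:.1f} bytes/point", f"{frac:.0%} of device memory")
```

&#60;p&#62;Each run comes out at a little over 80 bytes per grid point, and the failing run sits at about half of the dedicated memory.&#60;/p&#62;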
&#60;p&#62;Thank you again,&#60;/p&#62;
&#60;p&#62;Julien
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Jiri Jaros on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5816</link>
			<pubDate>Wed, 18 Jan 2017 18:41:35 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">5816@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Julien,&#60;br /&#62;
This is going to be an out-of-memory problem. cuFFT (CUDA 3D FFT) needs some scratch space to compute FFTs quickly, and I think there's not enough space for it.&#60;br /&#62;
This domain size in a fully heterogeneous absorbing case consumes about 6.7 GB of RAM for raw data. FFT can take another 1GB, so if you have less than 8GB, you won't be able to run such a big simulation.&#60;/p&#62;
&#60;p&#62;Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Julien S. on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5814</link>
			<pubDate>Tue, 17 Jan 2017 16:43:38 +0000</pubDate>
			<dc:creator>Julien S.</dc:creator>
			<guid isPermaLink="false">5814@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Brad,&#60;/p&#62;
&#60;p&#62;First of all, thank you for your reply and for the new version of your GPU code. Yes, I'm using Linux, and I've tried your new version of the code for Pascal GPUs. I do have one problem when running some simulations. When I increase the number of grid points I get this error:&#60;/p&#62;
&#60;p&#62;Error : Failed to execute an cuFFT on the GPU for Execute_FFT_3D_R2C.&#60;/p&#62;
&#60;p&#62;For example, with a grid of size 800 x 256 x 256 the code runs properly, but with a grid of size 900 x 256 x 256 the code does not start and returns this error message.&#60;/p&#62;
&#60;p&#62;Do you have any idea what the problem could be?&#60;/p&#62;
&#60;p&#62;Julien
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Bradley Treeby on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5798</link>
			<pubDate>Wed, 04 Jan 2017 14:03:31 +0000</pubDate>
			<dc:creator>Bradley Treeby</dc:creator>
			<guid isPermaLink="false">5798@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Julien,&#60;/p&#62;
&#60;p&#62;Are you using Linux? We've just uploaded a GPU binary for Pascal GPUs. Let me know if it doesn't work.&#60;/p&#62;
&#60;p&#62;Brad.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Julien S. on "Run CUDA simulation on GPU with Pascal architecture"</title>
			<link>http://www.k-wave.org/forum/topic/run-cuda-simulation-on-gpu-with-pascal-architecture#post-5794</link>
			<pubDate>Fri, 23 Dec 2016 16:11:09 +0000</pubDate>
			<dc:creator>Julien S.</dc:creator>
			<guid isPermaLink="false">5794@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi,&#60;/p&#62;
&#60;p&#62;I am trying to use the kspaceFirstOrder3D-CUDA code to simulate nonlinear acoustics on a Titan X GPU. With my old graphics card, a GeForce GT 730, the code worked even though the GPU was not powerful enough to speed up the execution. But now with my new Titan X card, I systematically get this error when I try to launch the code:&#60;br /&#62;
&#34;Error : All CUDA-capable devices are busy or unavailable&#34;.&#60;/p&#62;
&#60;p&#62;I read in the k-Wave documentation that the CUDA code is compiled for Maxwell graphics cards. My Titan X has a Pascal architecture. Is that why the code returns this error? If so, is it possible to make the CUDA code work with a Pascal card, and how?&#60;/p&#62;
&#60;p&#62;Thanks in advance for your reply.
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
