<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>k-Wave User Forum &#187; Topic: CUDA replacement libraries for FFTW and BLAS</title>
		<link>http://www.k-wave.org/forum/topic/cuda-replacement-libraries-for-fftw-and-blas</link>
		<description>Support for the k-Wave MATLAB toolbox</description>
		<language>en-US</language>
		<pubDate>Tue, 12 May 2026 23:28:20 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.2</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.k-wave.org/forum/search.php</link>
		</textInput>
		<atom:link href="http://www.k-wave.org/forum/rss/topic/cuda-replacement-libraries-for-fftw-and-blas" rel="self" type="application/rss+xml" />

		<item>
			<title>marcelr on "CUDA replacement libraries for FFTW and BLAS"</title>
			<link>http://www.k-wave.org/forum/topic/cuda-replacement-libraries-for-fftw-and-blas#post-4581</link>
			<pubDate>Wed, 18 Jun 2014 21:08:29 +0000</pubDate>
			<dc:creator>marcelr</dc:creator>
			<guid isPermaLink="false">4581@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi Jiri,&#60;/p&#62;
&#60;p&#62;Thanks for the update. I have the exact same configuration CPU/GPU you tested on. No big speed-ups to be reckoned with, then. I would love to get the patch file anyway, might learn a thing or two.&#60;/p&#62;
&#60;p&#62;Good to hear that a GPU-only version of k-Wave is in the making. Looking forward to it. &#60;/p&#62;
&#60;p&#62;Marcel
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Jiri Jaros on "CUDA replacement libraries for FFTW and BLAS"</title>
			<link>http://www.k-wave.org/forum/topic/cuda-replacement-libraries-for-fftw-and-blas#post-4580</link>
			<pubDate>Tue, 17 Jun 2014 20:26:42 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">4580@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi marcelr,&#60;/p&#62;
&#60;p&#62;I've performed two simple benchmarks and the results are not surprising :-)&#60;br /&#62;
&#60;pre&#62;&#60;code&#62;size     iCore7 920 (4c/8T)     GTX 580        2xIntel E5-2670  (16T)
128^3   0.09918s/step        0.0967s/step        0.025s/step
256^3   1.10s/step           0.7347s/step        0.42s/step&#60;/code&#62;&#60;/pre&#62;
&#60;p&#62;Offloading the FFT on the GPU is not going to speed up k-Wave substantially because the data has to be moved back and forth 14 times per time step. This virtually kills all the benefit gained from the GPU. The only way to get a reasonable speed-up is to run k-Wave entirely on the GPU (this version is being tested).&#60;/p&#62;
&#60;p&#62;The maximum size of the simulation domain I was able to run with a GTX580/1.5GB VRAM was 256^3 (cufft is incredibly memory voracious).&#60;/p&#62;
&#60;p&#62;However, if you have a slow CPU and a powerful GPU, the speed-up is measurable. If interested, I can send you a patch file to be applied before compiling.&#60;/p&#62;
&#60;p&#62;Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>marcelr on "CUDA replacement libraries for FFTW and BLAS"</title>
			<link>http://www.k-wave.org/forum/topic/cuda-replacement-libraries-for-fftw-and-blas#post-4576</link>
			<pubDate>Mon, 16 Jun 2014 08:46:49 +0000</pubDate>
			<dc:creator>marcelr</dc:creator>
			<guid isPermaLink="false">4576@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;In their recent cuda tookits, NVidia have published plugin replacement libs for FFTW and BLAS. I want to have a look at these libs, see if they are any good in speeding up things related to k-Wave. I wonder if forum members have experience with those, in particular in combination with k-Wave?&#60;/p&#62;
&#60;p&#62;best,&#60;/p&#62;
&#60;p&#62;Marcel
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
