<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>k-Wave User Forum &#187; Topic: Linux C++ binary sees less nodes than job submission system assigns</title>
		<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns</link>
		<description>Support for the k-Wave MATLAB toolbox</description>
		<language>en-US</language>
		<pubDate>Tue, 12 May 2026 23:29:03 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.2</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.k-wave.org/forum/search.php</link>
		</textInput>
		<atom:link href="http://www.k-wave.org/forum/rss/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns" rel="self" type="application/rss+xml" />

		<item>
			<title>ahmedsallam on "Linux C++ binary sees less nodes than job submission system assigns"</title>
			<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns#post-8533</link>
			<pubDate>Fri, 13 May 2022 15:12:23 +0000</pubDate>
			<dc:creator>ahmedsallam</dc:creator>
			<guid isPermaLink="false">8533@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hello Jiri,&#60;/p&#62;
&#60;p&#62;Can you also please send me the OpenMPI code? I have sent you an email. &#60;/p&#62;
&#60;p&#62;Thanks,
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Jiri Jaros on "Linux C++ binary sees less nodes than job submission system assigns"</title>
			<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns#post-6793</link>
			<pubDate>Wed, 20 Mar 2019 08:46:12 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">6793@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi DLam,&#60;br /&#62;
yes, the released codes are targeted at multi-socket multi-core systems, but with shared memory (NUMA nodes). &#60;/p&#62;
&#60;p&#62;There is a distributed version, that can run across the whole cluster using OpenMPI. It is a release candidate 1 version. I plan to release it as soon as I've finished the current k-Wave release with 2D and axisymmetric codes for NUMA nodes. (I had to put it aside for two months as this releases has the priority). &#60;/p&#62;
&#60;p&#62;If you want to test the prototype, send me an email (jarosjir[ at ]fit.vutbr.cz) and I could become a beta tester.&#60;/p&#62;
&#60;p&#62;Best&#60;br /&#62;
Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>DLam on "Linux C++ binary sees less nodes than job submission system assigns"</title>
			<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns#post-6790</link>
			<pubDate>Mon, 18 Mar 2019 16:57:07 +0000</pubDate>
			<dc:creator>DLam</dc:creator>
			<guid isPermaLink="false">6790@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Dear Jiri,&#60;/p&#62;
&#60;p&#62;Thanks for the quick reply!&#60;br /&#62;
Yes, we want to run the code across multiple host servers (each server with multiple sockets for CPUs, each CPU with multiple cores).  If I read you correctly, the current C++ binary is incompatible with doing this message-passing between different servers?&#60;/p&#62;
&#60;p&#62;Best regards,&#60;/p&#62;
&#60;p&#62;Daniel
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Jiri Jaros on "Linux C++ binary sees less nodes than job submission system assigns"</title>
			<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns#post-6771</link>
			<pubDate>Fri, 15 Mar 2019 14:09:01 +0000</pubDate>
			<dc:creator>Jiri Jaros</dc:creator>
			<guid isPermaLink="false">6771@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Hi DLam,&#60;/p&#62;
&#60;p&#62;the default behavior is to use as many cores as the system has. If it helps, I call routine omp_get_num_procs() to find out this number.&#60;/p&#62;
&#60;p&#62;I guess the cluster schedules sets the variable OMP_NUM_THREADS to inform OpenMP codes about how many threads to use. This is not implemented directly because it does not reliably work on Windows. &#60;/p&#62;
&#60;p&#62;You can to use -t parameter to set up the number of threads manually, e.g.&#60;br /&#62;
kspaceFirstOrder3D -t ${OMP_NUM_THREADS}&#60;/p&#62;
&#60;p&#62;If you want to check, the cluster scheduler sets up the OpenMP environment correctly try:&#60;br /&#62;
OMP_DISPLAY_ENV=true kspaceFirstOrder3DC -t XXX&#60;/p&#62;
&#60;p&#62;Reading the post again, you may ask about something else. Do you want to run the code across multiple nodes? If so, this is not possible. It requires a completely different implementation using message passing. If this is the case, send me an email.&#60;/p&#62;
&#60;p&#62;Best wishes&#60;br /&#62;
Jiri
&#60;/p&#62;</description>
		</item>
		<item>
			<title>DLam on "Linux C++ binary sees less nodes than job submission system assigns"</title>
			<link>http://www.k-wave.org/forum/topic/linux-c-binary-sees-less-nodes-than-job-submission-system-assigns#post-6768</link>
			<pubDate>Wed, 13 Mar 2019 16:27:21 +0000</pubDate>
			<dc:creator>DLam</dc:creator>
			<guid isPermaLink="false">6768@http://www.k-wave.org/forum/</guid>
			<description>&#60;p&#62;Dear K-wave team,&#60;/p&#62;
&#60;p&#62;Our team has been running the K-wave C++ binary on a Linux cluster, submitting jobs within a bash script.  It works great if we ensure that all workers are on a single node -- when the job is assigned to workers across multiple nodes, I notice in the console output, the 'Number of CPU Threads' heading shows a smaller number than the ones assigned when submitting the job.&#60;/p&#62;
&#60;p&#62;We were wondering if this is simply a display bug, or if something else is going on?  We've been looking at increasing our worker assignments but the 'Time to Go' estimation doesn't change much when we double core count.&#60;/p&#62;
&#60;p&#62;Cluster:&#60;br /&#62;
OS:  Linux CentOS 7&#60;br /&#62;
Cluster architecture: Multiple hosts each with multiple workers, running openMPI&#60;br /&#62;
Job submission system: LSF
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
