<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Error Log &#187; mpi_send</title>
	<atom:link href="http://www.errorlog.dk/tag/mpi_send/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.errorlog.dk</link>
	<description>Problems I&#039;ve found solutions for.</description>
	<lastBuildDate>Thu, 20 Nov 2008 11:43:11 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>[0,1,0][/home/henrik/src/openmpi-1.2.6/ompi/mca/btl/openib/btl_openib_component.c:1334: btl_openib_component_progress] from hidden.hidden.dk to: hidden.hidden.dk error polling HP CQ with status LOCAL PROTOCOL ERROR status number 4 for wr_id 7510264 opcode 42</title>
		<link>http://www.errorlog.dk/2008/09/03/010homehenriksrcopenmpi-126ompimcabtlopenibbtl_openib_componentc1334btl_openib_component_progre-ss-from-hiddenhiddendk-to-hiddenhiddendk-error-polling-hp-cq-with-status-local/</link>
		<comments>http://www.errorlog.dk/2008/09/03/010homehenriksrcopenmpi-126ompimcabtlopenibbtl_openib_componentc1334btl_openib_component_progre-ss-from-hiddenhiddendk-to-hiddenhiddendk-error-polling-hp-cq-with-status-local/#comments</comments>
		<pubDate>Wed, 03 Sep 2008 12:40:47 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Errors]]></category>
		<category><![CDATA[mpi]]></category>
		<category><![CDATA[mpi_recv]]></category>
		<category><![CDATA[mpi_send]]></category>
		<category><![CDATA[ofed]]></category>
		<category><![CDATA[openmpi]]></category>
		<category><![CDATA[system]]></category>

		<guid isPermaLink="false">http://www.errorlog.dk/?p=44</guid>
		<description><![CDATA[Problem
A multiprocessor program using OpenMPI 1.2.6, with OFED 1.2.5 has some nodes fail with the following message:
[0,1,0][/home/henrik/src/openmpi-1.2.6/ompi/mca/btl/openib/btl_openib_component.c:1334:btl_openib_component_progress] from hidden.hidden.dk to: hidden.hidden.dk error polling HP CQ with status LOCAL PROTOCOL ERROR status number 4 for wr_id 7510264 opcode 42
[hidden.hidden:29673] [0,1,7]-[0,1,0] mca_oob_tcp_msg_recv: readv failed: Connection reset by peer (104)
The error was caused by calling system(3) in the [...]]]></description>
			<content:encoded><![CDATA[<h2>Problem</h2>
<p>A multiprocessor program using OpenMPI 1.2.6, with OFED 1.2.5 has some nodes fail with the following message:</p>
<blockquote><p>[0,1,0][/home/henrik/src/openmpi-1.2.6/ompi/mca/btl/openib/btl_openib_component.c:1334:btl_openib_component_progress] from hidden.hidden.dk to: hidden.hidden.dk error polling HP CQ with status LOCAL PROTOCOL ERROR status number 4 for wr_id 7510264 opcode 42<br />
[hidden.hidden:29673] [0,1,7]-[0,1,0] mca_oob_tcp_msg_recv: readv failed: Connection reset by peer (104)</p></blockquote>
<p>The error was caused by calling system(3) in the short time between calling <a href="http://www.mpi-forum.org/docs/mpi-11-html/node31.html#Node31">MPI_Send</a> and the corresponding <a href="http://www.mpi-forum.org/docs/mpi-11-html/node34.html#Node34">MPI_Recv</a> which resulted in the send buffer being corrupted and the MPI_Recv call failing.</p>
<h4><span id="more-44"></span></h4>
<h2>Solution</h2>
<p>The solution was to either use MPI_Barrier to ensure all pending communication had finished or to replace the call of system(3) with another way to achieve the same result.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.errorlog.dk/2008/09/03/010homehenriksrcopenmpi-126ompimcabtlopenibbtl_openib_componentc1334btl_openib_component_progre-ss-from-hiddenhiddendk-to-hiddenhiddendk-error-polling-hp-cq-with-status-local/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
