Archive for September, 2008

/usr/lib/python2.4/site-packages/_sqlitecache.so: undefined symbol: g_assert_warning

Saturday, September 20th, 2008

Problem

While upgrading a CentOS 4.5 system to CentOS 5.*, yum starts to fail with:

/usr/lib/python2.4/site-packages/_sqlitecache.so: undefined symbol: g_assert_warning

Please install a package which provides this module, or
verify that the module is installed correctly.

It’s possible that the above module doesn’t match the
current version of Python, which is:
2.4.3 (#1, May 24 2008, 13:47:28)
[GCC 4.1.2 20070626 (Red Hat 4.1.2-14)]

(more…)

_sqlite.OperationalError: database is locked when using a lustre mounted yum cache

Monday, September 15th, 2008

Problem

I was trying to keep the yum cache on a Lustre mount so that several machines could share it and save external bandwidth, but yum failed with the message:
_sqlite.OperationalError: database is locked

It also happens with an NFS-mounted directory.
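
The excerpt does not state the cause, but one thing worth checking is file locking: SQLite on Unix relies on POSIX advisory locks, and network filesystems do not always support them. Below is a minimal Python sketch of such a probe. The function name can_lock and the use of a temporary file are illustrative assumptions, not part of yum or SQLite.

```python
import fcntl
import os
import tempfile


def can_lock(directory):
    """Return True if a POSIX advisory lock can be taken on a file in `directory`."""
    # Create a scratch file inside the directory under test.
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        try:
            # Non-blocking exclusive lock; raises OSError if locking fails.
            fcntl.lockf(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except OSError:
            return False
        fcntl.lockf(fd, fcntl.LOCK_UN)
        return True
    finally:
        # Always clean up the scratch file.
        os.close(fd)
        os.unlink(path)
```

Running can_lock on the shared cache directory from each machine gives a quick first indication. A True result only shows that the lock call succeeds locally; it does not prove that locks are coherent across nodes.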

(more…)

Compiling a program with icc so the path of icc dynamic libraries is known

Thursday, September 11th, 2008

Problem

Programs compiled with icc and ifort needed a library from the compiler’s directory at run time, and I had grown tired of having to set LD_LIBRARY_PATH.
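
One common way to avoid setting a library search path in the environment is to embed the library directory in the executable as an rpath at link time. The sketch below only builds such a command line and does not run it. The helper name, file names, and library directory are hypothetical, and the excerpt does not say whether this is the approach the full post takes.

```python
def link_command(compiler, source, output, lib_dir):
    """Build a compile-and-link command that records lib_dir as an rpath."""
    # -Wl,<option> is forwarded by the compiler driver to the linker;
    # -rpath,<dir> asks the linker to store <dir> in the executable.
    return [compiler, source, "-o", output, "-Wl,-rpath," + lib_dir]


# Hypothetical paths, for illustration only.
cmd = link_command("icc", "hello.c", "hello", "/opt/intel/example/lib")
print(" ".join(cmd))
```

Keeping the command as a list makes it easy to pass to subprocess.run later without any shell quoting concerns.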

(more…)

How do I remove a previously discovered system from HP SIM?

Friday, September 5th, 2008

Problem

I have a system that HP’s Systems Insight Manager (HP SIM) previously discovered, but I no longer want to see it.

(more…)

[0,1,0][/home/henrik/src/openmpi-1.2.6/ompi/mca/btl/openib/btl_openib_component.c:1334: btl_openib_component_progress] from hidden.hidden.dk to: hidden.hidden.dk error polling HP CQ with status LOCAL PROTOCOL ERROR status number 4 for wr_id 7510264 opcode 42

Wednesday, September 3rd, 2008

Problem

A multiprocessor program using OpenMPI 1.2.6 with OFED 1.2.5 fails on some nodes with the following message:

[0,1,0][/home/henrik/src/openmpi-1.2.6/ompi/mca/btl/openib/btl_openib_component.c:1334:btl_openib_component_progress] from hidden.hidden.dk to: hidden.hidden.dk error polling HP CQ with status LOCAL PROTOCOL ERROR status number 4 for wr_id 7510264 opcode 42
[hidden.hidden:29673] [0,1,7]-[0,1,0] mca_oob_tcp_msg_recv: readv failed: Connection reset by peer (104)

The error was caused by calling system(3) in the short window between MPI_Send and the corresponding MPI_Recv, which corrupted the send buffer and made the MPI_Recv call fail.

(more…)