[cctbxbb] CentOS7 bugs (cont.)

markus.gerstel at diamond.ac.uk markus.gerstel at diamond.ac.uk
Tue Sep 29 08:29:38 PDT 2015


Hello cctbx folks,

We are having problems running, building and/or installing DIALS on CentOS7. Running the linux installer unceremoniously ends in a segfault.
The installer is even broken when it is build on the same system using bootstrapped cctbx/dials.

The broken installer can be traced to the DIALS extension loader, which uses python's pkgutil loader, which triggers some library scanning routine within python, which in turn falls over boost_python_meta_ext.so and dies with the strace shown in the email below.

The installer can be successfully "fixed" by deleting build/lib/boost_python_meta_ext.so during the installation.

So I see two issues here:

1.       What is broken with build/lib/boost_python_meta_ext.so that it causes segfaults on CentOS7?

2.       Why is this not covered by any test? Neither of the following caught anything:

a.       python bootstrap.py builder=dials tests

b.       libtbx.run_tests_parallel module=cctbx module=dials module=boost_adaptbx

[NB: The previously reported 3 broken tests on CentOS7 are red herrings and are all fixed by running 'yum update']

I don't know how to proceed from here.

-Markus


From: markus.gerstel at diamond.ac.uk [mailto:markus.gerstel at diamond.ac.uk]
Sent: 29 September 2015 13:29
To: dials-support at lists.sourceforge.net
Subject: Re: [Dials-support] CentOS7 bugs (cont.)

Hello everyone,
(..)

Sorry, I hit ctrl+enter by accident...

In addition to these failed tests the installer dies a boost-related death according to strace:
(..)

poll([{fd=13, events=POLLIN|POLLPRI}, {fd=15, events=POLLIN|POLLPRI}], 2, 4294967295) = 1 ([{fd=13, revents=POLLIN}]) read(13, "\tlinux-vdso.so.1 =>  (0x00007fff"..., 4096) = 486

--- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=7648, si_status=0, si_utime=0, si_stime=0} --- poll([{fd=13, events=POLLIN|POLLPRI}, {fd=15, events=POLLIN|POLLPRI}], 2, 4294967295) = 2 ([{fd=13, revents=POLLHUP}, {fd=15, revents=POLLHUP}])

close(13)                               = 0

close(15)                               = 0

wait4(7648, [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0, NULL) = 7648 stat("$prefix/dials-dev20150928/build/lib/boost_python_meta_ext", 0x7fff6421d3f0) = -1 ENOENT (No such file or directory) open("$prefix/dials-dev20150928/build/lib/boost_python_meta_ext.so", O_RDONLY) = 11 fstat(11, {st_mode=S_IFREG|0775, st_size=200624, ...}) = 0

--- SIGSEGV {si_signo=SIGSEGV, si_code=SEGV_MAPERR, si_addr=0x18a86} ---

+++ killed by SIGSEGV (core dumped) +++

Segmentation fault (core dumped)


Not sure if relevant, but CentOS7 compiler is
  GCC 4.8.3 20140911 (Red Hat 4.8.3-9)

compared to RHEL6
  GCC 4.4.7 20120313 (Red Hat 4.4.7-11)
Anybody have any ideas?
-Markus



--

This e-mail and any attachments may contain confidential, copyright and or privileged material, and are for the use of the intended addressee only. If you are not the intended addressee or an authorised recipient of the addressee please notify us of receipt by returning the e-mail and do not use, copy, retain, distribute or disclose the information in or attached to the e-mail.
Any opinions expressed within this e-mail are those of the individual and not necessarily of Diamond Light Source Ltd.
Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments are free from viruses and we cannot accept liability for any damage which you may sustain as a result of software viruses which may be transmitted in or with the message.
Diamond Light Source Limited (company no. 4375679). Registered in England and Wales with its registered office at Diamond House, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://phenix-online.org/pipermail/cctbxbb/attachments/20150929/5f2b6720/attachment.htm>


More information about the cctbxbb mailing list