I'm hoping to do the same thing (CPU multithreading via the OpenCL implementation in the AMD APP SDK) and am getting very similar errors. I have also noticed that the error message that appears when selecting the OpenCL-CPU platform (the fglrx kernel module failure in the output below) can be due to a known issue and, if so, would not interfere with the execution of the program (see the last bullet point here). So it seems the segfault that happens afterward could be unrelated.
(gdb) run testInstallation.py
Starting program: /global/home/users/cjryan/epd/epd-7.3-2-rh3-x86_64/bin/python testInstallation.py
[Thread debugging using libthread_db enabled]
There are 2 Platforms available:
1 Reference - Successfully computed forces
Detaching after fork from child process 56135.
Setting of real/effective user Id to 0/0 failed
FATAL: Module fglrx not found.
Error! Fail to load fglrx kernel module! Maybe you can switch to root user to load kernel module directly
[New Thread 0x2aaab2797700 (LWP 56136)]
[New Thread 0x2aaabc812700 (LWP 56137)]
[New Thread 0x2aaabd025700 (LWP 56138)]
[New Thread 0x2aaac8812700 (LWP 56139)]
[New Thread 0x2aaac9025700 (LWP 56140)]
[New Thread 0x2aaad4812700 (LWP 56141)]
[New Thread 0x2aaad5025700 (LWP 56142)]
[New Thread 0x2aaae0812700 (LWP 56143)]
[New Thread 0x2aaae1025700 (LWP 56144)]
[New Thread 0x2aaaec812700 (LWP 56145)]
[New Thread 0x2aaaed025700 (LWP 56146)]
[New Thread 0x2aaaf8812700 (LWP 56147)]
[New Thread 0x2aaaf9025700 (LWP 56148)]
[New Thread 0x2aab04812700 (LWP 56149)]
[New Thread 0x2aab05025700 (LWP 56150)]
[New Thread 0x2aab10812700 (LWP 56151)]
[New Thread 0x2aab11025700 (LWP 56152)]
[New Thread 0x2aab1c812700 (LWP 56153)]
[New Thread 0x2aab1d025700 (LWP 56154)]
[New Thread 0x2aab28812700 (LWP 56155)]
[New Thread 0x2aab29025700 (LWP 56156)]
[New Thread 0x2aab34812700 (LWP 56157)]
[New Thread 0x2aab35025700 (LWP 56158)]
[New Thread 0x2aab40812700 (LWP 56159)]
[New Thread 0x2aab41025700 (LWP 56160)]
[New Thread 0x2aab4c812700 (LWP 56161)]
[New Thread 0x2aab4d025700 (LWP 56162)]
[New Thread 0x2aab58812700 (LWP 56163)]
[New Thread 0x2aab59025700 (LWP 56164)]
[New Thread 0x2aab64812700 (LWP 56165)]
[New Thread 0x2aab65025700 (LWP 56166)]
[New Thread 0x2aab70812700 (LWP 56167)]
[New Thread 0x2aab71025700 (LWP 56168)]
[New Thread 0x2aab7c812700 (LWP 56169)]
[New Thread 0x2aab7d025700 (LWP 56170)]
[New Thread 0x2aab88812700 (LWP 56171)]
[New Thread 0x2aab89025700 (LWP 56172)]
[New Thread 0x2aab94812700 (LWP 56173)]
[New Thread 0x2aab95025700 (LWP 56174)]
[New Thread 0x2aaba0812700 (LWP 56175)]
[New Thread 0x2aaba1025700 (LWP 56176)]
[New Thread 0x2aabac812700 (LWP 56177)]
[New Thread 0x2aabad025700 (LWP 56178)]
[New Thread 0x2aabb8812700 (LWP 56179)]
[New Thread 0x2aabb9025700 (LWP 56180)]
[New Thread 0x2aabc4812700 (LWP 56181)]
[New Thread 0x2aabc5025700 (LWP 56182)]
[New Thread 0x2aabd0812700 (LWP 56183)]
[New Thread 0x2aabd1025700 (LWP 56184)]
[New Thread 0x2aabdc812700 (LWP 56185)]
[New Thread 0x2aabdd025700 (LWP 56186)]
[New Thread 0x2aabe8812700 (LWP 56187)]
[New Thread 0x2aabe9025700 (LWP 56188)]
[New Thread 0x2aabf4812700 (LWP 56189)]
[New Thread 0x2aabf5025700 (LWP 56190)]
[New Thread 0x2aac00812700 (LWP 56191)]
[New Thread 0x2aac01025700 (LWP 56192)]
[New Thread 0x2aac0c812700 (LWP 56193)]
[New Thread 0x2aac0d025700 (LWP 56194)]
[New Thread 0x2aac18812700 (LWP 56195)]
[New Thread 0x2aac19025700 (LWP 56196)]
[New Thread 0x2aac24812700 (LWP 56197)]
[New Thread 0x2aac25025700 (LWP 56198)]
[New Thread 0x2aac30812700 (LWP 56199)]
[New Thread 0x2aac31025700 (LWP 56200)]
Detaching after fork from child process 56201.
Detaching after fork from child process 56202.
[New Thread 0x2aac3c405700 (LWP 56203)]
Detaching after fork from child process 56204.
Detaching after fork from child process 56205.
Detaching after fork from child process 56206.
Detaching after fork from child process 56207.
Detaching after fork from child process 56208.
Detaching after fork from child process 56209.
Detaching after fork from child process 56210.
Detaching after fork from child process 56211.
Detaching after fork from child process 56212.
Detaching after fork from child process 56213.
Detaching after fork from child process 56214.
Detaching after fork from child process 56215.
Detaching after fork from child process 56216.
Detaching after fork from child process 56217.
Detaching after fork from child process 56218.
Detaching after fork from child process 56219.
Detaching after fork from child process 56220.
Detaching after fork from child process 56221.
Detaching after fork from child process 56222.
Detaching after fork from child process 56223.
Detaching after fork from child process 56224.
Detaching after fork from child process 56225.
Program received signal SIGSEGV, Segmentation fault.
0x00002aaab5ccc895 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
Missing separate debuginfos, use: debuginfo-install glibc-2.12-1.80.el6_3.5.x86_64 libX11-1.3-2.el6.x86_64 libXau-1.0.5-1.el6.x86_64 libXext-1.1-3.el6.x86_64 libgcc-4.4.6-4.el6.x86_64 libstdc++-4.4.6-4.el6.x86_64 libxcb-1.5-1.el6.x86_64
(gdb) bt
#0 0x00002aaab5ccc895 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#1 0x00002aaab5425f05 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#2 0x00002aaab5443b44 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#3 0x00002aaab5444ff0 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#4 0x00002aaab544797d in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#5 0x00002aaab54dd502 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#6 0x00002aaab54de9f5 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#7 0x00002aaab54e438d in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#8 0x00002aaab54e5395 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#9 0x00002aaab5cba789 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#10 0x00002aaab5cba8c5 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#11 0x00002aaab5cbaac6 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#12 0x00002aaab4e9d2c8 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#13 0x00002aaab4e9d519 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#14 0x00002aaab4ea28cf in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#15 0x00002aaab4ea4b88 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#16 0x00002aaab4e78a24 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#17 0x00002aaab48b0540 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#18 0x00002aaab48b0fe2 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#19 0x00002aaab48865f5 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#20 0x00002aaab48988a0 in ?? ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#21 0x00002aaab487e9d7 in clBuildProgram ()
from /global/home/users/cjryan/ksong/AMDAPP/lib/x86_64/libamdocl64.so
#22 0x00002aaab16447d2 in OpenMM::OpenCLContext::createProgram(std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::map<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::less<std::basic_string<char, std::char_traits<char>, std::allocator<char> > >, std::allocator<std::pair<std::basic_string<char, std::char_traits<char>, std::allocator<char> > const, std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > > const&, char const*) ()
from /global/home/users/cjryan/openmm/lib/plugins/libOpenMMOpenCL.so
#23 0x00002aaab15b54a1 in OpenMM::OpenCLNonbondedUtilities::initialize(OpenMM::System const&) ()
from /global/home/users/cjryan/openmm/lib/plugins/libOpenMMOpenCL.so
#24 0x00002aaab16432a6 in OpenMM::OpenCLContext::initialize() ()
from /global/home/users/cjryan/openmm/lib/plugins/libOpenMMOpenCL.so
#25 0x00002aaab1652810 in OpenMM::OpenCLPlatform::PlatformData::initializeContexts(OpenMM::System const&) ()
from /global/home/users/cjryan/openmm/lib/plugins/libOpenMMOpenCL.so
#26 0x00002aaab15d4b8c in OpenMM::OpenCLIntegrateLangevinStepKernel::initialize(OpenMM::System const&, OpenMM::LangevinIntegrator const&) ()
from /global/home/users/cjryan/openmm/lib/plugins/libOpenMMOpenCL.so
#27 0x00002aaaac4695c6 in OpenMM::LangevinIntegrator::initialize(OpenMM::ContextImpl&) () from /global/home/users/cjryan/openmm/lib/libOpenMM.so
#28 0x00002aaaac48c707 in OpenMM::ContextImpl::ContextImpl(OpenMM::Context&, OpenMM::System&, OpenMM::Integrator&, OpenMM::Platform*, std::map<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::less<std::basic_string<char, std::char_traits<char>, std::allocator<char> > >, std::allocator<std::pair<std::basic_string<char, std::char_traits<char>, std::allocator<char> > const, std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > > const&)
() from /global/home/users/cjryan/openmm/lib/libOpenMM.so
#29 0x00002aaaac4a0c99 in OpenMM::Context::Context(OpenMM::System&, OpenMM::Integrator&, OpenMM::Platform&) ()
from /global/home/users/cjryan/openmm/lib/libOpenMM.so
#30 0x00002aaaac11e4a6 in _wrap_new_Context__SWIG_1 (
self=<value optimized out>, args=<value optimized out>)
at src/swig_doxygen/OpenMMSwig.cxx:33875
#31 _wrap_new_Context (self=<value optimized out>, args=<value optimized out>)
at src/swig_doxygen/OpenMMSwig.cxx:35133
#32 0x00002aaaaada514b in PyEval_EvalFrameEx (f=0x1881540, throwflag=5247136)
at Python/ceval.c:4331
#33 0x00002aaaaada7324 in PyEval_EvalCodeEx (co=0x2aaaabfb9a30,
globals=0xfffffff8, locals=0x2aaab52cd760, args=0x36df220, argcount=4,
kws=0x0, kwcount=0, defs=0x0, defcount=0, closure=0x0)
at Python/ceval.c:3253
#34 0x00002aaaaad39c58 in function_call (func=0xb9c398, arg=0x36df208, kw=0x0)
at Objects/funcobject.c:526
#35 0x00002aaaaad15cfd in PyObject_Call (func=0xb9c398, arg=0x36df208, kw=0x0)
at Objects/abstract.c:2529
#36 0x00002aaaaad25094 in instancemethod_call (func=0xb9c398, arg=0x36df208,
kw=0x0) at Objects/classobject.c:2578
#37 0x00002aaaaad15cfd in PyObject_Call (func=0x16e1280, arg=0x3be0c80, kw=0x0)
at Objects/abstract.c:2529
#38 0x00002aaaaad69a97 in slot_tp_init (self=0x7fffffff8cd0, args=0x3be0c80,
kwds=0x0) at Objects/typeobject.c:5663
#39 0x00002aaaaad62a3b in type_call (type=0xd475e0, args=0x3be0c80, kwds=0x0)
at Objects/typeobject.c:735
#40 0x00002aaaaad15cfd in PyObject_Call (func=0xd475e0, arg=0x3be0c80, kw=0x0)
at Objects/abstract.c:2529
#41 0x00002aaaaada481e in PyEval_EvalFrameEx (f=0x1eb3f90, throwflag=32194880)
at Python/ceval.c:4239
#42 0x00002aaaaada7324 in PyEval_EvalCodeEx (co=0xd8fdb0, globals=0xfffffff8,
locals=0x2aaab52cd760, args=0x4, argcount=5, kws=0x0, kwcount=0,
defs=0xd9c9f8, defcount=2, closure=0x0) at Python/ceval.c:3253
#43 0x00002aaaaad39c58 in function_call (func=0xe62b90, arg=0xf29d70, kw=0x0)
at Objects/funcobject.c:526
#44 0x00002aaaaad15cfd in PyObject_Call (func=0xe62b90, arg=0xf29d70, kw=0x0)
at Objects/abstract.c:2529
#45 0x00002aaaaad25094 in instancemethod_call (func=0xe62b90, arg=0xf29d70,
kw=0x0) at Objects/classobject.c:2578
#46 0x00002aaaaad15cfd in PyObject_Call (func=0x16e1140, arg=0x3be8158, kw=0x0)
at Objects/abstract.c:2529
#47 0x00002aaaaad69a97 in slot_tp_init (self=0x7fffffff8cd0, args=0x3be8158,
kwds=0x0) at Objects/typeobject.c:5663
#48 0x00002aaaaad62a3b in type_call (type=0xe8d2b0, args=0x3be8158, kwds=0x0)
at Objects/typeobject.c:735
#49 0x00002aaaaad15cfd in PyObject_Call (func=0xe8d2b0, arg=0x3be8158, kw=0x0)
at Objects/abstract.c:2529
#50 0x00002aaaaada481e in PyEval_EvalFrameEx (f=0x5c97e0, throwflag=6068584)
at Python/ceval.c:4239
#51 0x00002aaaaada7324 in PyEval_EvalCodeEx (co=0x2aaaabcbaf30,
globals=0xfffffff8, locals=0x2aaab52cd760, args=0x0, argcount=0, kws=0x0,
kwcount=0, defs=0x0, defcount=0, closure=0x0) at Python/ceval.c:3253
#52 0x00002aaaaada7522 in PyEval_EvalCode (co=0x7fffffff8cd0,
globals=0xfffffff8, locals=0x2aaab52cd760) at Python/ceval.c:667
#53 0x00002aaaaadc029c in run_mod (mod=0x7fffffff8cd0,
filename=0xfffffff8 <Address 0xfffffff8 out of bounds>, globals=0x540280,
locals=0x540280, flags=0x2aaab52cd760, arena=0x0)
at Python/pythonrun.c:1353
#54 0x00002aaaaadc08fc in PyRun_FileExFlags (fp=0x5a6ed0,
filename=0x7fffffffe0ba "testInstallation.py", start=6149536,
globals=0x540280, locals=0x540280, closeit=1, flags=0x7fffffffda4c)
at Python/pythonrun.c:1339
#55 0x00002aaaaadc1811 in PyRun_SimpleFileExFlags (fp=0x5a6ed0,
filename=0x7fffffffe0ba "testInstallation.py", closeit=1,
flags=0x7fffffffda4c) at Python/pythonrun.c:943
#56 0x00002aaaaadd1f06 in Py_Main (argc=1, argv=0x7fffffffdcf8)
at Modules/main.c:729
#57 0x00002aaaab84dcdd in __libc_start_main () from /lib64/libc.so.6
#58 0x00000000004008ea in _start ()
I should also mention that, since the AMD APP SDK typically cannot be installed as a non-root user, a cluster support team member helped me rewrite AMD's installation script so that it installs inside my home directory. The SDK's example programs seem to run correctly, but OpenMM might not work well with such a non-standard configuration.
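For anyone skimming the backtrace above, here is a rough sketch of a helper that finds the innermost frame with a resolved symbol, which is where the crash leaves named code and enters the unsymbolized part of libamdocl64.so. The function name `first_named_frame` and the regular expression are my own and purely illustrative; the sample text is copied from frames #19 to #23 of the trace, with frame #22 left out because its signature is very long.

```python
import re

# Matches the start of a gdb frame line, e.g.
#   "#21 0x00002aaab487e9d7 in clBuildProgram ()"
# The address part is optional because some frames (such as #31 above) omit it.
FRAME_RE = re.compile(r"^#(\d+)\s+(?:0x[0-9a-fA-F]+\s+in\s+)?([^\s(]+)")


def first_named_frame(backtrace_text):
    """Return (frame_number, symbol) for the innermost frame whose symbol
    is resolved (i.e. not "??"), or None if there is no such frame."""
    for line in backtrace_text.splitlines():
        match = FRAME_RE.match(line.strip())
        if match and match.group(2) != "??":
            return int(match.group(1)), match.group(2)
    return None


# A few frames copied from the backtrace above.
sample = """\
#19 0x00002aaab48865f5 in ?? ()
#20 0x00002aaab48988a0 in ?? ()
#21 0x00002aaab487e9d7 in clBuildProgram ()
#23 0x00002aaab15b54a1 in OpenMM::OpenCLNonbondedUtilities::initialize(OpenMM::System const&) ()
"""

print(first_named_frame(sample))  # (21, 'clBuildProgram')
```

On this trace it points at clBuildProgram, called from OpenMM::OpenCLContext::createProgram, so the segfault happens while the AMD runtime is building an OpenCL program rather than at the point where the fglrx message is printed.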