I have seen a few threads about this issue, for example https://simtk.org/forums/viewtopic.php? ... vice#p7862, but I still have the problem. I am using OpenMM 4.1, so let me know if that is the cause.
I have a node with 2 GPUs and am trying out the OpenCL platform. Running nvidia-smi gives me:
siddharth@node086 ~ $ nvidia-smi
Thu Jul 4 18:58:29 2013
+------------------------------------------------------+
| NVIDIA-SMI 2.285.05 Driver Version: 285.05.33 |
|-------------------------------+----------------------+----------------------+
| Nb. Name | Bus Id Disp. | Volatile ECC SB / DB |
| Fan Temp Power Usage /Cap | Memory Usage | GPU Util. Compute M. |
|===============================+======================+======================|
| 0. Tesla T10 Processor | 0000:07:00.0 Off | N/A N/A |
| N/A 47 C P0 Off / Off | 7% 287MB / 4095MB | 100% E. Thread |
|-------------------------------+----------------------+----------------------|
| 1. Tesla T10 Processor | 0000:09:00.0 Off | N/A N/A |
| N/A 47 C P0 Off / Off | 1% 54MB / 4095MB | 0% E. Thread |
|-------------------------------+----------------------+----------------------|
| Compute processes: GPU Memory |
| GPU PID Process name Usage |
|=============================================================================|
| 0. 26897 ZymeCAD 329MB |
| 1. 26897 ZymeCAD 329MB |
+-----------------------------------------------------------------------------+
When I create a Context on the OpenCL platform, I get the following error:
2terminate called after throwing an instance of 'OpenMM::OpenMMException'
2what(): Error initializing context: clCreateContextFromType (-2)
2[node086:26932] *** Process received signal ***
2[node086:26932] Signal: Aborted (6)
2[node086:26932] Signal code: (-6)
2[node086:26932] [ 0] /lib64/libpthread.so.0() [0x316040f500]
2[node086:26932] [ 1] /lib64/libc.so.6(gsignal+0x35) [0x315fc328a5]
2[node086:26932] [ 2] /lib64/libc.so.6(abort+0x175) [0x315fc34085]
2[node086:26932] [ 3] /usr/lib64/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x12d) [0x3161cbea7d]
2[node086:26932] [ 4] /usr/lib64/libstdc++.so.6() [0x3161cbcc06]
2[node086:26932] [ 5] /usr/lib64/libstdc++.so.6() [0x3161cbcc33]
2[node086:26932] [ 6] /usr/lib64/libstdc++.so.6() [0x3161cbcd2e]
2[node086:26932] [ 7] /usr/local/openmm/lib/plugins/libOpenMMOpenCL.so(_ZN6OpenMM13OpenCLContextC1EiiiRNS_14OpenCLPlatform12PlatformDataE+0x3636) [0x2b1cbae23286]
2[node086:26932] [ 8] /usr/local/openmm/lib/plugins/libOpenMMOpenCL.so(_ZN6OpenMM14OpenCLPlatform12PlatformDataC1EiRKSsS3_+0x693) [0x2b1cbae31a73]
2[node086:26932] [ 9] /usr/local/openmm/lib/plugins/libOpenMMOpenCL.so(_ZNK6OpenMM14OpenCLPlatform14contextCreatedERNS_11ContextImplERKSt3mapISsSsSt4lessISsESaISt4pairIKSsSsEEE+0x2af) [0x2b1cbae320af]
2[node086:26932] [10] /usr/local/openmm/lib/libOpenMM.so(_ZN6OpenMM11ContextImplC1ERNS_7ContextERNS_6SystemERNS_10IntegratorEPNS_8PlatformERKSt3mapISsSsSt4lessISsESaISt4pairIKSsSsEEE+0x899) [0x2b1ca3f109d9]
2[node086:26932] [11] /usr/local/openmm/lib/libOpenMM.so(_ZN6OpenMM7ContextC1ERNS_6SystemERNS_10IntegratorERNS_8PlatformE+0x81) [0x2b1ca3f0bb91]
So far I have been using the CUDA platform, where this issue does not occur.
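For reference, the -2 in "clCreateContextFromType (-2)" is a raw OpenCL status code, and in the Khronos CL/cl.h header -2 is CL_DEVICE_NOT_AVAILABLE. Below is a minimal sketch of that lookup in Python. The helper name decode_opencl_error is hypothetical (it is not part of OpenMM), and only the first few codes from the header are listed.

```python
import re

# Subset of the status codes defined in the Khronos CL/cl.h header.
OPENCL_ERROR_NAMES = {
    0: "CL_SUCCESS",
    -1: "CL_DEVICE_NOT_FOUND",
    -2: "CL_DEVICE_NOT_AVAILABLE",
    -3: "CL_COMPILER_NOT_AVAILABLE",
    -4: "CL_MEM_OBJECT_ALLOCATION_FAILURE",
    -5: "CL_OUT_OF_RESOURCES",
    -6: "CL_OUT_OF_HOST_MEMORY",
}


def decode_opencl_error(message):
    """Hypothetical helper: name the '(code)' at the end of an error message.

    Returns None when the message does not end with a parenthesised integer.
    """
    match = re.search(r"\((-?\d+)\)\s*$", message)
    if match is None:
        return None
    code = int(match.group(1))
    return OPENCL_ERROR_NAMES.get(code, "unknown OpenCL error %d" % code)


if __name__ == "__main__":
    print(decode_opencl_error(
        "Error initializing context: clCreateContextFromType (-2)"))
```

Run on the message from the trace above, this prints CL_DEVICE_NOT_AVAILABLE. In other words, the OpenCL runtime reports that it found a device but could not make it available for the new context.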