OpenMM 7.0 OpenCL memory errors

The functionality of OpenMM will (eventually) include everything that one would need to run modern molecular simulation.
POST REPLY
User avatar
George Pantelopulos
Posts: 64
Joined: Mon Jun 01, 2015 2:15 pm

OpenMM 7.0 OpenCL memory errors

Post by George Pantelopulos » Wed Dec 12, 2018 12:17 pm

Dear all,

I am trying to run several systems in OpenMM 7.0 using the OpenCL platform on a GTX 1080 in ubuntu 16.0.4. The CUDA platform works perfectly well. This workstation has nvidia drivers version 396.54.

After successfully constructing a system and a simulation, the same error is always given by OpenCL when doing anything to the simulation, such as obtaining the system energy, setting the velocities, or minimization.

The error is lead with:
OpenCL internal error: CL_OUT_OF_RESOURCES error executing CL_COMMAND_READ_BUFFER on GeForce GTX 1080 (Device 0).
and ends with:
Exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
The systems have two somewhat unique featues:
They use the OpenMM Monte Carlo Barostat
They use a custom force to implement a Force-Switch on nonbonded interactions.

Any ideas about the cause?

Thank you!
George

User avatar
Peter Eastman
Posts: 2573
Joined: Thu Aug 09, 2007 1:25 pm

Re: OpenMM 7.0 OpenCL memory errors

Post by Peter Eastman » Wed Dec 12, 2018 5:10 pm

The first thing I'd suggest is upgrading to OpenMM 7.3. 7.0 is a fairly old version, and this might reflect a problem that has since been fixed.

CL_OUT_OF_RESOURCES indicates the GPU ran out of some resource, most often memory though it can also be other things. Does your system have an unusually large number of atoms? Or are there other things that could be taking a lot of memory, such as a very large PME grid or a large tabulated function?
They use a custom force to implement a Force-Switch on nonbonded interactions.
What kind of custom force is it?

User avatar
George Pantelopulos
Posts: 64
Joined: Mon Jun 01, 2015 2:15 pm

Re: OpenMM 7.0 OpenCL memory errors

Post by George Pantelopulos » Thu Dec 13, 2018 10:59 am

Hi Peter,

For these systems 7.3 works just fine with OpenCL. 7.2.2 also works with OpenCL.

The number of atoms is not apparently a problem (the result is the same both the 418,669-atom system I am trying to run and a 16,724-atom test system), suggesting the memory error is "unnatural". The error occurs both with- and without the MCbarostat and Force-switch custom force. The custom force is a CustomBondForce.

There might be an issue related to CHARMM36? I do not what changes that might have been made in the update to 7.2 that might effect interpreting the CHARMM files, or what may effect the OpenCL platform between 7.0 and 7.3.

Ultimately, here I want to get a large membrane system to run on FAH with the openmm_21 core. I believe it was built on OpenMM 7.0, but internal testing is failing presumably due to this OpenCL issue (after downloading and deserializing XML files). Presumably, the openmm_22 core is available now and was built on OpenMM 7.2.2. I'm going to try to move forward using that instead -- perhaps this memory error is not a roadblock.

Thank you for your generous assistance as always,
George

User avatar
Peter Eastman
Posts: 2573
Joined: Thu Aug 09, 2007 1:25 pm

Re: OpenMM 7.0 OpenCL memory errors

Post by Peter Eastman » Thu Dec 13, 2018 4:08 pm

Ok, good to know the issue has already been fixed. Core 22 is based on 7.2.2, so it should be ok.

POST REPLY