.. _lecture5: ===================================================================== Molecular dynamics ===================================================================== Molecular dynamics (MD) is the field where ML is most widely used and ML successes in MD stimulated the wider adoption of ML in chemistry. - :ref:`Slides ` - :ref:`What is MD and why is it important? ` - :ref:`Why is molecular dynamics expensive? ` - :ref:`How can ML accelerate MD? ` - :ref:`MD types ` - :ref:`Further reading ` .. _ml-md-slides: Slides ------ :download:`Slides <_static/mlatom/7-XACSW2024_20240705_Dral_wm.pdf>`: .. raw:: html .. _lecture5_md_intro: What is MD and why is it important? ------------------------------------ .. image:: _static/lecture5/ethanol_md.gif :width: 800 :align: center :alt: Ethanol MD [Credit: Fuchun Ge, ethanol MD] Atoms in molecules are always in constant motion and, hence, to properly simulate molecules, we need to take this into account. The most widely used approach for directly simulating atomic motions is molecular dynamics (MD). By performing MD we can get chemically intuitive view on the processes on the atomistic level and derive :ref:`spectral ` and thermochemical properties. We can also better understand how the :ref:`chemical reactions ` happen as they are rearrangement of atoms in space and time. But how can we simulate the atomic motion? In principle, we need to solve the time-dependent Schrodinger equation for both electrons and nuclei. This is, unfortunately, numerically not easy. In MD, we make a huge simplification by asssuming that the atoms move according to the classical Newton's law and then working our way from the Neston's laws of motion. It means that we need to solve the equations of motion starting with the basic equation: .. math:: \mathbf{F} = m(\mathbf{a}). Now, given the known initial conditions (positions of atoms and their velocities), we can calculate the position of atoms in a short time increment (time step) and repeat this procedure until we propagate the nuclear motion for as long as we need. The obtained sequence of geometries and associated properties is called MD *trajectory*. The velocity Verlet algorithm is a popular but not the only way to do this. To give a better idea how MD is is propagated, you can check the following flowchart: .. image:: _static/lecture5/Molecular_dynamics_algorithm.png :width: 800 :align: center :alt: MD algorithm [Credit: Knordlun, `CC BY-SA 3.0 `__, via Wikimedia Commons.] The key component in above MD algorithm is calculation of forces. They can be obtained from the molecular mechanics force fields (not covered in this course) and QM calculations (and then such MD is called Born--Oppenheimer MD because the forces are calculated for static nuclei within the Born--Oppenheimer approximation). The forces are just negative energy gradients, i.e., negative derivatives of potential energy with respect to nuclear coordinates: .. math:: \mathbf{F} = - \nabla E(\mathbf{R}). Let's see MD in action: .. include:: materials/lecture5/task5.1_dft_md/task5.1_dft_md.inc .. _lecture5_md_expensive: Why is molecular dynamics expensive? ------------------------------------ From the above algorithm it is obvious that MD is quite an expensive business. Even for H\ :sub:`2`, where DFT was very fast, it takes some time to propagate a relatively short trajectory and, often, we need to propagate many trajectories for a longer time! Think about the typical reaction rates and how long would you need to propagate MD to see reaction events... There is also no way to parallelize the calculation of the time steps -- they should be run one after another in the strict order, further limiting the speed of MD. In addition, to obtain better quality of MD you need smaller time steps -- but not too small as the calculation time will increase correspondingly and numerical errors will accumulate faster too. .. _lecture5_ml_md: How can ML accelerate MD? ------------------------------------ Given the high cost of MD, it is highly desirable to find a way to accelerate it with ML because of its high speed. How would you proceed to use ML for accelerating MD? Accelerating calculation of forces ++++++++++++++++++++++++++++++++++ The obvious way is to accelerate the slowest part of MD which is calculation of forces. We have to evaluate them at each time step, e.g., with expensive QM methods in case of Born--Oppenheimer MD. Indeed, using ML for accelerating the forces evaluation is one of the most widely used applications of MD in computational chemistry (and probably in chemistry in general). How can we calculate forces with ML? One way to do it is to directly learn the forces themselves and, indeed, some works doing exactly this. However, another, more established approach, is to use ML potentials which, as we know from :ref:`ML-PES ` lecture, are energy functions of nuclear coordinates and, hence, ML can be viewed as surrogate models to QM. ML models are also analytical functions and given automatic differentiation capabilities of many ML packages, it is straightforward to obtain the first-order derivatives and forces from such potentials. The calculations will be much faster as the result. Let's the use of an MLP in action. .. include:: materials/lecture5/task5.2_ani1ccx_md/task5.2_ani1ccx_md.inc Note the speed up for the hydrogen molecule is not so dramatic, but try something bigger! Now, if you analyzed the trajectory carefully enoough you would notice how ANI-1ccx produces completely unphysical trajectory, where the atoms 'tunnel' through each other. This is the result of a known problem with ANI-1ccx which has wrong potential energy curve for the hydrogen molecule. .. To understand why ANI-1ccx produced what it produced, let's perform the following analysis: .. not include:: materials/lecture5/task5.3._ani1ccx_h2_pec/task5.3._ani1ccx_h2_pec.inc ANI-1ccx has two minima on PES (which you should remember from the :ref:`previous homework analysis `) and the energy for small internuclear distance is not going exponentially up as it should due to the repulsion of nuclei! This is ML and anything can happen -- it is based on what it saw in the data and has no physical model restraining it from behaving unphysically. If you also noticed, at large distance the potential energy drops while it should not (it should plateou). This is unfortunately also a common behavior or MLP which has a consequence that many MDs with MLPs lead to `unphysical explosion of molecules `__. Luckily, you can use your own ANI models trained on the FCI data! Let's see how they perform: .. include:: materials/lecture5/task5.4_ani_h2_fci/task5.4_ani_h2_fci.inc Finally, you have obtained a fast dynamics which also looks reasonable. Directly learning dynamics ++++++++++++++++++++++++++ Why not use ML to directly learn the nuclear positions as a function of time? Indeed, `we were the first to suggest and demonstrate `__ the feasibility of such a novel of way of accelerating dynamics with ML. In this approach, we seek a function .. math:: \mathbf{R}(t) = f(\mathbf{R}_0, \mathbf{v}_0, t). Currently, we are working on a generalization of our previous ML model so that it can be used easier. .. include:: outdating.rst .. _lecture5_md_types: MD types ------------------------------------ Constant energy vs constant temperature ++++++++++++++++++++++++++++++++++++++++++++++++ MD can be propagated in many different modes. If we consider an isolated molecule, not interacting with an environment, then it should conserve total energy (some of the potential energy calculated with your QM or ML model and the kinetic energy of nuclei). You run such simulations in above tasks and they are said to be done in the NVE ensemble. However, often, we want to simulate the process of a molecule in its environment without including all the environment (as it would be quite slow). Interacting with environment often means that the temperature on average should be more or less constant. You can do it with MLatom using the so-called NVT ensemble. There are many possible algorithms developed for the NVT ensemble called *thermostats*, e.g., Nosé--Hoover thermostat and Andersen which are implemented in MLatom. In this case, the total energy will not be conserved. You can see it for yourself in the corresponding example: .. include:: materials/lecture5/homework5.1/homework5.1_nvt.inc The key observation is that in the NVT ensemble the total energy is no longer conserved while in the NVE ensemble is does not deviate much. There is still noticable deviation of the total energy in the NVE ensemble. You can think about the reasons and how you can adjust the settings of the MD simulations to improve the energy conservation in the NVE simulations. The temperature is, strictly speaking, not conserved in either of the simulation but in the NVT it varies narrower and the average should be closer to the target temperature. Very often, in the NVT dynamics, the temperature rises or drops from the initial temperature to some more stable value with the average around the target temperature. Then you usually need to cut the initial (called equilibration) period before it becomes stable and then use for analysis the later part of the MD. Energy conservation ^^^^^^^^^^^^^^^^^^^ In the NVE ensemble we want to conserve the total energy :math:`E_\text{tot}` as the MD must adhere to the laws of physics. The total energy is a sum of the potential :math:`E_\text{pot}` and kinetic :math:`E_\text{kin}` energies. The potential energy is evaluated at the QM level or with MLP surrogate model as usual :math:`E_\text{pot} = f(\mathbf{R})`. The kinetic energy is a sum of the kinetic energies over nuclei: .. math:: E_\text{kin} = \Sigma_i m_i v^2_i / 2. All is good but for a small time step, most of MLPs will always produce an energy conserving MD which, as we know, can be still completely unphysical! Hence, `we suggested `__ to evaluate the correctness of the trajectory by evaluating how much it deviates from the *true* energy conservation (i.e., for a dynamics with an exact but usually unknown potential) by evaluating the theoretical-best estimate of the total energy, where the potential energy is calculated for some snapshots from the trajectory at the highest affordable QM level, i.e.: .. math:: E^\text{TBE}_\text{tot} = E^\text{TBE}_\text{pot} + E_\text{kin}. This criterion gives a much better error metric for the quality of MD. .. _qct: Quasi-classical trajectory +++++++++++++++++++++++++++ Now think what will happen if you set the initial velocities to zero and start from the geometry with zero forces too? (What is the geometry with zero forces?). After you come to an answer, you can check it in the following exercise: .. include:: materials/lecture5/homework5.2/homework5.2_0K.inc If you have done everything right, the simulation should lead to the trajectory where the atoms basically do not move. This is obvious if you consider how the MD is propagated - each steps changes the geometry based on the velocity and acceleration (defined by forces). Hence, for 0 K, velocity is zero by definition (kinetic energy should be zero). The molecule in the energy minimum has by definition energy derivatives (negative forces) equal to zero too, i.e., accelerations are zero. Hence, each next position must be the same as previous. You can still see small numerical noise. The problem is that classical MD does not properly account for the fact that molecules are always in motion, i.e., they neglect zero-point vibrational energy (ZPE). In classical MD you can set the nuclear velocities so that their kinetic energy corresponds to the required initial instantenuous temperature. But this will definetely not reflect the ZPE. In an attempt to recover the ZPE effect, you can propagate so-called quasi-classical trajectories (QCT). They differ from the classical ones by how the initial conditions are generated. To recover the ZPE, the initial geometries are sampled from the vibrational normal modes (typically calculated within the rigid-rotor harmonic-oscillator approximation) and the velocities are assigned so that the total energy (on average) includes ZPE. Then the QCT are propagated with the classical MD algorithm (e.g., velocity Verlet in MLatom). There are several ways to generate initial conditions, with two popular ones implemented in MLatom: - Wigner sampling. Commonly used in :ref:`surface-hopping dynamics `. - sampling from the harmonic quantum Boltzmann distribution (called "thermal sampling" in the manual to its early implementation in the VENUS program). In contrast to Wigner sampling, it samples from higher vibrational levels. This sampling is commonly used for performing the :ref:`downhill QCT ` investigation of transition states. Now let's run QCT trajectories: .. include:: materials/lecture5/homework5.3/homework5.3_qct_h2.inc The obvious qualitative difference is that even at 0 K, the QCT still has the motion of atoms in contrast to the classical MD. This is how it should be as the zero-point energy is always present in molecules, even at 0 K. Other approaches you should know ++++++++++++++++++++++++++++++++++++++++++++++++ - Meta-dynamics. Even with all the ML acceleration, the time scales of typical chemical processes are much longer than what we can simulate. Hence, many approaches emerged which try to artificially accelerate the simulation so that at least the key dynamical features are recovered. - PIMD (path-integral molecular dynamics). QCT is one of the steps towards recovering at least ZPE but it is still not recovering quantum nuclear effects fully. A better approach is PIMD with its variants CMD and RPMD, which produces more robust and physically correct results but at a much higher cost. MLatom is now getting the PIMD implemented. .. _lecture5_further_reading: Further reading --------------- - A good introductory book about MD: D. Frenkel, B. Smit, J. Tobochnik, S. R. Mckay and W. Christian, Understanding Molecular Simulation, Elsevier, Bodmin, Cornwall, 1997. - Another book where you can read up about meta-dynamics and PIMD is X.-Z. Li, E. Wang, Computer Simulations of Molecules and Condensed Matter: From Electronic Structures to Molecular Dynamics. Peking University Press, 2014.