Manual

User Manual:

Open the PDF directly: View PDF .
Page Count: 128 [warning: Documents this large are best viewed by clicking the View PDF Link!]

Introduction
Installation
Running free-energy simulations
Collective variables
Postprocessing

PLUMED User’s Guide

A portable plugin for free-energy calculations

with molecular dynamics

Version 1.3.0 – Nov 2011

Contents

1 Introduction 5

1.1 What is PLUMED? ......................... 5

1.2 Supportedcodes.......................... 6

1.3 Features .............................. 7

1.4 Newinversion1.3 ........................ 8

1.5 Restrictions ............................ 9

1.6 The PLUMED package ....................... 9

1.7 Onlineresources.......................... 10

1.8 Credits............................... 11

1.9 Citing PLUMED ........................... 11

1.10License............................... 11

2 Installation 13

2.1 Compiling PLUMED ......................... 13

2.1.1 Compiling the ACEMD plugin with PLUMED ...... 17

2.2 Including reconnaissance metadynamics . . . . . . . . . . . . . 18

2.3 Testing the installation . . . . . . . . . . . . . . . . . . . . . . 19

2.4 Back to the original code . . . . . . . . . . . . . . . . . . . . . 21

2.5 The Python interface to PLUMED ................. 21

3 Running free-energy simulations 23

3.1 How to activate PLUMED ..................... 23

3.2 Theinputﬁle ........................... 25

3.3 Anoteonunits.......................... 27

3.4 Metadynamics........................... 27

3.4.1 Typical output . . . . . . . . . . . . . . . . . . . . . . 27

3.4.2 Bias potential . . . . . . . . . . . . . . . . . . . . . . . 28

3.4.3 Well-tempered metadynamics . . . . . . . . . . . . . . 29

3.4.4 Restarting a metadynamics run . . . . . . . . . . . . . 30

3.4.5 Using GRID ........................ 30

3.4.6 Multiple walkers . . . . . . . . . . . . . . . . . . . . . 35

3.4.7 Monitoring a collective variable without biasing it . . . 35

3.4.8 Deﬁning an interval . . . . . . . . . . . . . . . . . . . . 36

3.4.9 Inversion condition . . . . . . . . . . . . . . . . . . . . 38

3.5 Running in parallel . . . . . . . . . . . . . . . . . . . . . . . . 40

3.6 Replica exchange methods . . . . . . . . . . . . . . . . . . . . 40

3.6.1 Parallel tempering metadynamics . . . . . . . . . . . . 41

3.6.2 Bias exchange simulations . . . . . . . . . . . . . . . . 42

3.7 Umbrella sampling . . . . . . . . . . . . . . . . . . . . . . . . 45

3.8 SteeredMD ............................ 46

3.8.1 Steerplan ......................... 46

3.9 Adiabatic Bias MD . . . . . . . . . . . . . . . . . . . . . . . . 48

3.10 External potentials . . . . . . . . . . . . . . . . . . . . . . . . 50

3.10.1 Walls............................ 50

3.10.2 Tabulated potentials . . . . . . . . . . . . . . . . . . . 50

3.11 Commitment analysis . . . . . . . . . . . . . . . . . . . . . . . 53

3.12 Projection of gradients . . . . . . . . . . . . . . . . . . . . . . 53

3.13 Reconnaissance metadynamics . . . . . . . . . . . . . . . . . . 54

3.13.1 Typical output . . . . . . . . . . . . . . . . . . . . . . 54

3.13.2 Controlling the clustering . . . . . . . . . . . . . . . . 56

3.13.3 Controlling the bias . . . . . . . . . . . . . . . . . . . . 57

3.13.4 Restarting a simulation . . . . . . . . . . . . . . . . . . 58

3.13.5 Using a subset of the deﬁned cvs . . . . . . . . . . . . 59

3.14 Driven Adiabatic Free Energy Dynamics (d-AFED) . . . . . . 60

3.14.1 Input for d-AFED . . . . . . . . . . . . . . . . . . . . 61

3.14.2 Typical output for d-AFED . . . . . . . . . . . . . . . 62

4 Collective variables 64

4.1 Absolute position . . . . . . . . . . . . . . . . . . . . . . . . . 66

4.2 Distance.............................. 67

4.3 Minimum distance . . . . . . . . . . . . . . . . . . . . . . . . 68

4.4 Angles............................... 69

4.5 Torsion............................... 70

4.6 Coordination number . . . . . . . . . . . . . . . . . . . . . . . 70

4.7 Hydrogenbonds.......................... 71

4.8 Interfacialwater.......................... 73

4.9 Radius of gyration . . . . . . . . . . . . . . . . . . . . . . . . 74

4.9.1 Gyration tensor based CVs . . . . . . . . . . . . . . . . 75

4.10Dipole ............................... 76

4.11 Dihedral correlation . . . . . . . . . . . . . . . . . . . . . . . . 77

4.12 Alpha-beta similarity . . . . . . . . . . . . . . . . . . . . . . . 77

4.13Alpharmsd............................. 78

4.14Antibetarmsd ........................... 79

4.15Parabetarmsd........................... 80

4.16 Electrostatic potential . . . . . . . . . . . . . . . . . . . . . . 81

4.17 Puckering coordinates . . . . . . . . . . . . . . . . . . . . . . 82

4.18 Path collective variables . . . . . . . . . . . . . . . . . . . . . 83

4.18.1 Mean square deviation . . . . . . . . . . . . . . . . . . 85

4.18.2 Distance mean square deviation . . . . . . . . . . . . . 87

4.18.3 Contact map distances . . . . . . . . . . . . . . . . . . 87

4.18.4 Using path variables as MSD, DMSD and CMAP and

the TARGETED statement ................. 91

4.19ContactMap ........................... 92

4.20Energy............................... 93

4.21Helixloops............................. 93

4.22PCAprojection.......................... 94

4.23 SPRINT topological variables . . . . . . . . . . . . . . . . . . 96

4.24 Radial distribution function . . . . . . . . . . . . . . . . . . . 97

4.25 Angular distribution function . . . . . . . . . . . . . . . . . . 98

4.26 Polynomial combination of CVs . . . . . . . . . . . . . . . . . 99

4.27FunctionofCVs..........................100

4.28 A note on periodic boundary conditions . . . . . . . . . . . . . 101

5 Postprocessing 103

5.1 Estimating the free energy after a metadynamics run . . . . . 103

5.1.1 Installation instructions . . . . . . . . . . . . . . . . . 103

5.1.2 Usage ...........................103

5.2 Evaluating collective variables on MD trajectories . . . . . . . 106

5.2.1 Installation instructions . . . . . . . . . . . . . . . . . 106

5.2.2 Usage ...........................106

5.3 Processing COLVAR ﬁles......................108

5.4 PLUMED as a standalone program . . . . . . . . . . . . . . . . . 109

5.4.1 Installation instructions . . . . . . . . . . . . . . . . . 109

5.4.2 Usage ...........................110

5.5 Reweighting well-tempered metadynamics calculations . . . . 111

5.5.1 Installation instructions . . . . . . . . . . . . . . . . . 112

5.5.2 Usage ...........................113

5.6 Bias-exchange simulations via the linux shell . . . . . . . . . . 114

Chapter 1

Introduction

1.1 What is PLUMED?

PLUMED[1] is a plugin for free-energy calculations in molecular systems. It

works with some of the most popular classical molecular dynamics (MD)

codes, such as GROMACS [2], NAMD [3], DL POLY [4] , LAMMPS [5]

and the SANDER module in AMBER [6]. It also works with the very fast

CUDA/GPU MD code ACEMD [7] and, more recently, it has been extended

to work with ab initio MD codes, such as Quantum-ESPRESSO [8] and

CPMD.

Free-energy calculations can be performed as a function of many order

parameters, with a particular focus on biological problems, and using state-

of-the-art methods such as metadynamics [9], umbrella sampling [10, 11, 12]

and Jarzynski-equation based steered MD [13, 14].

Here is a brief outline of this guide:

•In this chapter we give an overview of the features and restrictions of

the current release of PLUMED.

•In the second chapter we describe the procedure for installing the plugin

and testing the correct installation.

•The third chapter explains how PLUMED can be used to perform free-

energy calculations, setting up the input ﬁle and analyzing the output.

•The fourth chapter contains a list of collective variables (CVs) which

are implemented and allow a wide variety of physical and chemical

problems to be addressed.

•The ﬁfth chapter is dedicated to postprocessing. It explains how to

reconstruct the free-energy proﬁle from the output of a metadynamics

run and how to extract the CV values from MD trajectories.

1.2 Supported codes

PLUMED works as an add-on to some of the most popular MD codes: NAMD,

GROMACS, SANDER, DL POLY, Quantum-ESPRESSO, ACEMD, LAMMPS

and CPMD. These codes are not distributed with the PLUMED package, but

they must be obtained separately.

NAMD 2.6/2.7/2.8

http://www.ks.uiuc.edu/Research/namd/

GROMACS 4.0/4.5.x

http://www.gromacs.org/

AMBER 10/11

http://ambermd.org/

DL POLY 2.20

http://www.cse.scitech.ac.uk/ccg/software/DL POLY/

Quantum-ESPRESSO 4.3.2

http://www.quantum-espresso.org/

ACEMD 1.2

http://multiscalelab.org/acemd

LAMMPS 27-Oct-2011

http://lammps.sandia.gov

CPMD 3.15.1

http://www.cpmd.org

Note that at the moment of this release of PLUMED, only those speciﬁc

versions of the listed codes have been tested. Moreover, some other codes

are available through the Python interface provided by Rosa Bulo.

Amsterdam Density Functional (ADF)

http://www.scm.com/

and all the codes supported by

Atomic Simulation Environment (ASE)

https://wiki.fysik.dtu.dk/ase/

Additionally there exist some porting to

FHI-AIMS

https://aimsclub.fhi-berlin.mpg.de/

which is not directly mantained by the PLUMED developer team. Addi-

tional information have to be retrieved by the developers of AIMS them-

selves.

1.3 Features

PLUMED can perform several diﬀerent types of calculation:

•Metadynamics with a large variety of CVs [9];

•Well-tempered metadynamics [15];

•Multiple walkers metadynamics [16];

•Combined parallel tempering and metadynamics [17];

•Bias-exchange metadynamics [18];

•Reconaissance metadynamics [19];

•Steered MD;

•Umbrella sampling;

•Adiabatic biased molecular dynamics [20];

•Commitment analysis.

1.4 New in version 1.3

PLUMED version 1.3 presents several new features, including new collective

variables and support to new codes. Among these:

•Reconaissance Metadynamics [19];

•Driven adiabatic free energy dynamics (d-AFED, contributed by Michel

Cuendet)

•Deﬁnition of CVs as Polynomial combination of CVs and Function on

CVs;

•New INTERVAL keyword to limit a region to be sampled using Meta-

dynamics (contributed by Alessandro Laio)

•Better scalability with Gromacs;

•New collective variables: Projection on PCA eigenvectors (contributed

by Ludovico Sutto), SPRINT topological variable and gyration tensor

based CVs;

•New tool: reweight well-tempered metadynamics simulations [21];

•Added support to ACEMD 1.2, Lammps (27-oct-2011), Quantum Espresso

4.3.2, CPMD 3.15.1, NAMD 2.8, Gromacs 4.5.x and Amber11;

•Python interface (contributed by Rosa Bulo);

•New tool to perform bias-exchange simulations via the linux shell with

any MD code;

•Bugs have been ﬁxed and the manual has been improved thanks to the

feedbacks of many users.

A full list including also the bug ﬁxed in the current release can be found

in the CHANGES ﬁle distributed with the package.

1.5 Restrictions

The current release of PLUMED has a few restrictions:

•Parallel tempering plus metadynamics and bias-exchange metadynam-

ics are available only in the GROMACS version (however a tool allows

bias-exchange simulations via linux shell with any MD engine);

•The patched version of GROMACS cannot perform hamiltonian (lambda)

replica exchange;

•The ENERGY collective variable is available only for GROMACS, AM-

BER and DL POLY and cannot be used with multiple time step algo-

rithm;

•Only orthorhombic cells are supported in ACEMD and DRIVER;

•Only orthorhombic and truncated octahedron cells are supported in

AMBER;

•Only Car-Parrinello and Born-Oppenheimer simulations are supported

in CPMD;

•Only NVT simulations are supported because the contribution to the

virial is not calculated.

1.6 The PLUMED package

The plugin package has the following directory structure:

•common_files. The directory containing all the basic routines.

•tests. A variety of examples of diﬀerent CVs and free-energy methods

provided with topology and input ﬁles for the supported codes. These

examples, combined with a script adapted from CP2K [22], work also

as a regtest for the plugin.

•patches. A collection of patches to interface PLUMED with diﬀerent

codes.

•utilities. A few small utilities:

–sum_hills is a post-processing program which reads the HILLS

ﬁle produced by the plugin in a metadynamics simulation and

returns the free energy by summing the Gaussians that have been

deposited.

–driver is a tool that calculates the value of selected CVs along a

MD trajectory. It requires a PDB ﬁle, a trajectory in DCD format

and a ﬁle with the same syntax of the PLUMED input ﬁle.

–standalone is a program to run PLUMED as a standalone code.

–plumedat.sh is a shell script to extract information from PLUMED

output ﬁles.

–reweight is a tool to calculate the unbiased probability distribu-

tion of variables other than the CVs in well-tempered metady-

namics simulations

–python_interface a set of tools that enables to use PLUMED with

Python (hey, a Python which is plumed with feathers is a really

weird animal!).

–bias-exchange a set of tools that allow to use PLUMED to perform

bias-exchange simulations via the linux shell using any MD engine.

•manual. This manual.

•ACEMD. It contains the ﬁles needed to compile the plugin for ACEMD.

•tutorial. It contains the resources of the PLUMED tutorial 2010 (http:

//sites.google.com/site/plumedtutorial2010/)

•recon_src. It contains the source ﬁles speciﬁc for the Reconaissance

Metadynamics.

1.7 Online resources

You can ﬁnd more information on the web:

http://www.plumed-code.org

For any questions, please subscribe to our mailing list:

plumed-users@googlegroups.com

1.8 Credits

PLUMED has been developed by Massimiliano Bonomi, Davide Branduardi,

Giovanni Bussi, Carlo Camilloni, Davide Provasi, Paolo Raiteri, Davide

Donadio, Fabrizio Marinelli, Fabio Pietrucci, Francesco Luigi Gervasio and

others. However, this work would not have been possible without the joint

eﬀort of many people. Among these, we would like to thank (in alphabetical

order): Alessandro Barducci, Anna Berteotti, Rosa Bulo, Matteo Ceccarelli,

Michele Ceriotti, Paolo Elvati, Antonio Fortea-Rodriguez, Alessandro Laio,

Matteo Masetti, Fawzi Mohamed, Ferenc Molnar, Gabriele Petraglio, Jim

Pfaendtner and Federica Trudu. Francesco Marini is kindly acknowledged

for his technical support and Joost VandeVondele for permission to use his

regtest script.

Some PLUMED users have also contributed to the implementation of new

features and the debugging of old bugs. Among these, we would like to thank:

Toni Giorgino, Marcello Sega, Emmanuel Autieri, Gareth Tribello, Andrea

Coletta, Ludovico Sutto, Layla Martin-Samos, Walter Rocchia, Alessio Lodola,

Luigi Capoferri, Michel Cuendet, Jiri Vymetal, Fahimeh Baftizadeh, and

Katsumasa Kamiya.

1.9 Citing PLUMED

You may wish to cite the following reference if you have utilized PLUMED in

your work:

M. Bonomi, D. Branduardi, G. Bussi, C. Camilloni, D. Provasi, P. Raiteri,

D. Donadio, F. Marinelli, F. Pietrucci, R.A. Broglia and M. Parrinello.

PLUMED: a portable plugin for free-energy calculations with molecular

dynamics, Comp. Phys. Comm. 2009 vol. 180 (10) pp. 1961-1972.

1.10 License

PLUMED is free software: you can redistribute it and/or modify it under the

terms of the GNU Lesser General Public License as published by the Free

Software Foundation, either version 3 of the License, or (at your option)

any later version. PLUMED is distributed in the hope that it will be useful,

but WITHOUT ANY WARRANTY; without even the implied warranty of

MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See

the GNU Lesser General Public License for more detail. You should have

received a copy of the GNU Lesser General Public License along with PLUMED.

If not, see http://www.gnu.org/licenses/.

Chapter 2

Installation

The plugin installation requires the molecular dynamics code to be recom-

piled after the calls to the plugin routines have been inserted at the appro-

priate places in the original program. For a list of the supported molecular

dynamics codes see Sec. 1.2.

All the basic plugin routines are contained in the common files folder of

the PLUMED distribution package. The insertions are automatically performed

by a series of scripts provided in the patches directory.

In the following we will brieﬂy describe the procedure for applying these

patches to the diﬀerent MD codes supported by PLUMED. We will refer to the

root directory of the distribution package as PLUMED root.

2.1 Compiling PLUMED

A similar procedure can be followed for all the supported codes except

ACEMD. When code-speciﬁc procedures are needed, we state it explicitely.

A diﬀerent proceedure is required for ACEMD (see below).

Diﬀerent patches are available for diﬀerent code versions. The name of

the suitable patch is plumedpatch CODENAME CODEVERSION.sh. If the patch

corresponding to the exact CODEVERSION that you are using is not avail-

able, choose the closest match. In the following, we shall indicate with

plumedpatch.sh the proper patch script.

•Extract the source code from its archive and then move into its root

directory.

•Conﬁgure the code as usual (not necessary for DL POLY)

•NAMD and LAMMPS only: In the plumedpatch script, modify the

myarch variable to match your architecture.

•Set the environmental variable plumedir to point to PLUMED root.

•Copy or link the plumedpatch.sh ﬁle from the patches folder to the

current directory.

•Execute the script: ”./plumedpatch.sh -patch ”.

•DL POLY-only: copy the proper Makeﬁle from build/ to srcmod/ (or

src/ in older versions)

•CPMD only: you may need to change variables cxx, cxxflag, extraflag

in the plumedpatch script depending on your computer architecture:

see the examples below.

•Compile the code as usual.

Further details can be found in the patches/README ﬁle.

Example.

This is the procedure for compiling the serial version of AMBER 10 with PLUMED using g95 in the Bourne

shell.

tar zxf AMBER10.tgz

cd amber10/

export plumedir="PLUMED root"

cp $plumedir/patches/plumedpatch sander 10.sh .

cd src/

./configure g95

cd ..

./plumedpatch sander 10.sh -patch

make

Example.

This is the procedure for compiling the serial version of GROMACS, using the GNU compilers.

tar zxf gromacs-4.0.5.tar.gz

cd gromacs-4.0.5

export plumedir="PLUMED root"

cp $plumedir/patches/plumedpatch gromacs 4.0.4.sh .

CC=gcc CXX=g++ ./configure

./plumedpatch gromacs 4.0.4.sh -patch

make

make install

Example.

This is the procedure for compiling NAMD on an Intel Mac using the GNU g++ compiler and the FFTW.

tar zxf NAMD 2.6 Source.tar.gz

cd NAMD 2.6 Source

export plumedir="PLUMED root"

cp $plumedir/patches/plumedpatch namd 2.6.sh .

./config fftw MacOSX-i686-g++

Edit ./plumedpatch namd 2.6.sh by setting the myarch variable to MacOSX-i686-g++.

./plumedpatch namd 2.6.sh -patch

cd MacOSX-i686-g++

make

Example.

This is a sample procedure for compiling the scalar version of DL POLY 2.20 with the gfortran compiler

in the Bourne Shell environment.

tar zxf dl poly 2.20.tar.gz

cd dl poly 2.20

export plumedir=PLUMED root

cp $plumedir/patches/plumedpatch dlpoly 2.20.sh .

./plumedpatch dlpoly 2.20.sh -patch

cp build/MakeSEQ srcmod/Makefile

cd srcmod

make gfortran

Example.

This is a sample procedure for compiling the openmpi version of LAMMPS with the mpic++ compiler in

the Bourne Shell environment. Please, note that the LAMMPS tarball comes with the download date but

unless major changes are done in the host code, the patching procedure will stay unchanged and the

latest available patching ﬁle has to be used.

tar xvf lammps-15Jan10.tar

cd lammps-15Jan10/

Edit ./src/MAKE/Makefile.openmpi to suit your machine.

export plumedir="PLUMED root"

cp $plumedir/patches/plumedpatch lammps 15-01-2010.sh ./

Edit ./plumedpatch lammps 15-01-2010.sh to set myarch to openmpi.

./plumedpatch lammps 15-01-2010.sh -patch

cd src

make openmpi

Example.

This is a sample procedure for compiling CPMD-3.15.1 under different computer architectures.

tar zxvf cpmd-v3 15 1.tgz

cd CPMD

Generate a valid Makeﬁle for your machine using the script ./mk config.sh, e.g.:

./mk config.sh PC-GFORTRAN > Makefile

export plumedir="PLUMED root"

cp $plumedir/patches/plumedpatch cpmd-3.15.1.sh .

If necessary, modify the variables cxx (a valid C++ compiler), cxxflag (the corresponding ﬂag to

avoid exception handling), and extraflag (see below) in the script plumedpatch cpmd-3.15.1.sh, e.g.

for GNU compilers:

cxx="mpiCC"

cxxflag="-fno-exceptions"

for IBM machines like AIX-power7:

cxx="xlc r -c"

cxxflag="-qnoeh"

for IBM BlueGene:

cxx="bgxlc++ r -c -O"

cxxflag="-qnoeh"

for PGI compilers:

cxx="pgCC"

cxxflag="--no exceptions"

To successfully compile on some IBM architectures you may also need to set one or both of the

following extra ﬂags:

extraflag="-DPLUMED CPMD NOUNDERSCORE -DPLUMED AIXXLF"

Finally, patch and compile:

./plumedpatch cpmd-3.15.1.sh -patch

make

2.1.1 Compiling the ACEMD plugin with PLUMED

PLUMED works with ACEMD 1.2. As the source code for ACEMD is not

available, PLUMED will work as a plug-in. For this reason the compilation

will be slightly diﬀerent from that of the other codes. Moreover, given the

unavailability of source code, this version of PLUMED will not be supported

by the PLUMED development team.

•Extract the source code from its archive.

•Set the environmental variable plumedir to point to PLUMED root.

•Use make to compile the plugin inside the plumedir/ACEMD folder.

Example.

This is a sample procedure for compiling the ACEMD plug-in in the Bourne Shell environment.

export plumedir=PLUMED root

cd $plumedir ACEMD

make

Once the plumed.so is compiled, copy it where you will run ACEMD. Add

the following lines to the ACEMD input ﬁle:

pluginload testplug ./plumed.so

pluginarg testplug input META_INP

pluginarg testplug boxx xx

pluginarg testplug boxy yy

pluginarg testplug boxz zz

pluginfreq 1

where META INP is the PLUMED input ﬁle and xx,yy,zz are the box di-

mensions. At this time the support of the ACEMD plugin will be provided

by the ACEMD developers. Please note that only orthorhombic cells are

available in ACEMD 1.2.

2.2 Including reconnaissance metadynamics

The reconnaissance metadynamics routines (see section 3.13 included in PLUMED

make use of the C++ standard library and lapack. On some machines these

libraries are not installed. Hence, PLUMED has been written so that by default

no attempt to compile reconnaissance metadynamics is made. Consequen-

tially, if you wish to perform reconnaissance metadynamics you must patch

PLUMED slightly diﬀerently.

To include reconnaissance metadynamics in your patched MD code is

simply a matter of, during the patching procedure, telling PLUMED the lo-

cation of the C standard library, the location of lapack and the location of

a c++ compiler. This is done by making some small modiﬁcations to the

plumedpatch ﬁle. If this ﬁle is opened using a text editor you will notice at

least one of the following two lines:

RECON_LIBS=

RECON_CPP=

If at least one of these lines is not present then reconnaissance metady-

namics is not implemented for the particular code you are patching. The

ﬁrst of these two lines tells PLUMED the locations of the lapack and c stan-

dard libraries. If the space after the equal sign is left blank PLUMED patches

in its default fashion and no attempt is made to compile the reconnaissance

metadynamics routines. By contrast if there is any character after the equals

sign PLUMED will endeavor to compile the reconnaissance metadynamics rou-

tines. For a correct compilation the RECON_LIBS}variable should indicate the

locations of the lapack (i.e. -llapack) and c standard libraries (i.e. -lstdc++).

The RECON_CPP= variable should be set equal to a c++ compiler (e.g. g++).

Example.

To include the reconnaissance metadynamics routines in the PLUMED compilation the use must change

the values of RECON LIBS and RECON CPP in the plumedpatch ﬁle:

RECON LIBS=”-llapack -lstdc++”

RECON CPP=”g++”

There are some diﬀerences between the procedures for patching PLUMED+

reconnaissance metadynamics in the diﬀerent MD codes between the diﬀerent

codes. This is because some of the codes already have lapack included or are

compiled with a c++ compiler. These diﬀerences are described in table 2.1:

2.3 Testing the installation

Once the installation process has been completed successfully, the user is

encouraged to test the chosen MD package for any problems. The tests

directory contains regtest scripts for the diﬀerent MD codes. These also

serve as a regularity test in case the user implements his own modiﬁcations

and as a basic illustration of the capabilities of the plugin. The user should

Code Reconnaissance C++ compiler Lapack lstc++

implemented required required required

NAMD yes no yes yes

GROMACS yes yes no yes

AMBER no - - -

DL POLY yes yes yes yes

Q-ESPRESSO yes yes no yes

ACEMD no - - -

LAMMPS no - - -

CPMD yes yes no yes

Table 2.1: Details on compiling PLUMED compilation with the various codes

edit the test script, setting up the path where the test suite is found and

giving the location of the executable.

For GROMACS users only. Please note that:

•The tests for GROMACS are designed for and should be executed with

the double-precision version of the code;

•Biasxmd, ptmetad, dd and pd are designed for the parallel version of

GROMACS. The user should specify in the test script the location of

the parallel executable and the version of GROMACS used. These tests

will fail if the parallel version of GROMACS has not been compiled.

Example.

In the case of NAMD, the regtest script is called do regtest namd.sh. Here, the user should modify the

dir base and the namd prefix:

dir base=/Programs/md meta/tests/namd

namd preﬁx=/chicco/bin/namd plugin/namd2.

Regtest scripts for the other programs require an identical procedure.

The script executes a list of tests and then compares the results with the

outcome of previously run simulations. Please note that when the script is

run for the ﬁrst time it produces the reference. Finally, the script produces

a summary of the results. The test result can be one of the following:

•’OK’ if the results match those of a previous run precisely. The execu-

tion time is also given.

•’NEW’ if they have not been executed previously. The reference result

is generated automatically in this run. Tests can also be ’NEW’ if they

have been reset, i.e. been newly added to the TEST FILES RESET

ﬁles.

•’RUNTIME FAILURE’ if they stopped unexpectedly (e.g. core dump,

or stop).

•’WRONG RESULT’ if they produce a result that deviates from an old

reference.

The last two options generally mean that a bug has been introduced,

which requires investigation. Since regtesting only yields information relative

to a previously known result, it is most useful to do a regtest before and after

you make changes.

2.4 Back to the original code

At any time the user may want to ”unpatch” the MD code and revert back

to the original distribution. To do so, the user should go to the directory

where the PLUMED patch has been copied and type ./plumedpatch -revert.

2.5 The Python interface to PLUMED

Starting from version 1.3 PLUMED has also a Python interface contributed by

Rosa Bulo. It is possible to ﬁnd it in the utilities/python_interface

directory. Inside it you might ﬁnd a README ﬁle and a Makefile . After

copying (or linking) all the *.c and *.h ﬁles from the common_files direc-

tory you can immediately do:

make gnu

if you want to compile with gnu compilers. Intel compilers are also avail-

able. This creates a ﬁle that is libplumed.so that will be loaded runtime

by Python. You need to add the directory where this ﬁle resides into the

LD LIBRARY PATH so that Python will be able to ﬁnd it and load it runtime.

The extension amber is only reﬂecting the fact that the interface is built by

exploiting the simple AMBER interface. Therefore the limitiations regarding

AMBER in the functionalities apply also here.

For MacOSX users the compiler ﬂags should be changed accordingly. For

gnu compilers it requires to change the linking ﬂags from

-shared -Wl,-soname,libplumed.so

-bundle -flat namespace -undefined suppress.

Next, in the directory

utilities/python_interface/pylib

you can ﬁnd the ﬁle plumedamberlib.py which is the actual Python mod-

ule that will deal of loading libplumed.so and create the suitable Python

interface. This provides the init metadyn and cv calculation methods

that one can practically use through Python.

A practical use of this shown by the Python script

/utilities/python interface/test/run water amber.py

where a water molecule is simply displaced and the collective coordinates are

updated at each step. Notably, this interface is embodied in the Python tools

provided with the Amsterdam Density Functional code (also called ADF, see

http://www.scm.com/).

Additionally Rosa provided an interface with the powerful Atomic Simulation

Environment ( see https://wiki.fysik.dtu.dk/ase/ ). This enable PLUMED

to be interfaced with plenty of DFT and ab-initio codes. Some examples

are present in the directory utilities/python interface/test and are

ASEmd plumed emt water.py and ASEnvt plumed emt water.py .

Many thanks to Rosa for the contribution!

Chapter 3

Running free-energy

simulations

In this chapter we describe how to activate PLUMED and how to create the

correct PLUMED input ﬁle for a speciﬁc type of free energy calculation. The

typical output of these calculations is also explained in detail.

3.1 How to activate PLUMED

PLUMED input is contained in one single ﬁle, named plumed.dat by default,

which deﬁnes the CVs, the type of run to be performed and the parameters

for the bias potential generation.

•The users of NAMD, SANDER and DL POLY can instruct the code

to parse the PLUMED input ﬁle by setting the PLUMED variable to on (or

1 for SANDER) in the MD input ﬁle. It is also possible to change

the default name for the PLUMED input ﬁle by setting the plumedfile

variable in the MD input;

•GROMACS users should activate PLUMED on the command line by spec-

ifying the ﬂag -plumed followed by the input ﬁle name. The extension

of such a ﬁle must be .dat;

•Quantum-ESPRESSO users should activate PLUMED on the command

line by specifying the ﬂag -plumed. The name of the PLUMED input ﬁle

is hardcoded as plumed.dat;

•In LAMMPS, PLUMED is activated using the ﬁx plumed. The input ﬁle

is speciﬁed by plumedfile, the output ﬁle by outfile;

•In CPMD, PLUMED is activated on the command line by specifying the

ﬂag -plumed after the CPMD input ﬁle and the pseudopotential direc-

tory. The PLUMED input ﬁle name is plumed.dat.

Example.

A typical SANDER input ﬁle for a metadynamics calculation:

METADYNAMICS TEST

&cntrl

imin=0, irest=0, ntx=1, ig=71278,

nstlim=1001, dt=0.0002,

ntc=1, ntf=1,

ntt=3, gamma ln=5,

tempi=300.0, temp0=300.0,

ntpr=200, ntwx=0,

ntb=0, igb=0,

cut=999.,

plumed=1 , plumedfile=’plumed.dat’

Example.

The NAMD and DL POLY input ﬁle should contain, in addition to the usual keywords deﬁning the run, the

following lines;

plumed on

plumedfile plumed.dat

Example.

To perform a free-energy calculation with GROMACS, PLUMED must be activated on the command line:

mdrun -plumed plumed.dat ...

Example.

To perform a free-energy calculation with Quantum-ESPRESSO, PLUMED must be activated on the

command line:

pw.x -plumed ...

Example.

The LAMMPS input ﬁle should contain, in addition to the usual keywords deﬁning the run, the following

line;

fix 3 all plumed plumedfile plumed.dat outfile metaout.dat

Example.

To perform a free-energy calculation with LAMMPS the ﬁle should contain, in addition to the usual

keywords deﬁning the run, the following command:

fix ID all plumed plumedfile plumed.inp outfile plumed.out

where ID is the user assigned ﬁx name.

Example.

To perform a free-energy calculation with CPMD, PLUMED must be activated on the command line:

cpmd.x input . -plumed

where input is the CPMD input ﬁle, ”.” is the current directory (if pseudopotentials are located

elsewhere, substitute with the right location), and the PLUMED input ﬁle is called plumed.dat.

3.2 The input ﬁle

In the following sections we describe the syntax used in the PLUMED input ﬁle.

The commands contained in this ﬁle can be divided in two groups according

to the functionality required. A ﬁrst group of commands deﬁnes the type of

simulation (metadynamics, steering and umbrella sampling, replica exchange

methods) and the parameters needed for the chosen algorithm. These are

described in this chapter. A second part deﬁnes the degrees of freedom on

which the algorithms operate, the so-called collective variables or CVs. The

details of such commands are described in the next chapter.

As a general rule, each setting is deﬁned by a principal keyword that

must be placed at the beginning of the line, in upper case. Additional input

pertaining to the setting can be speciﬁed on the same line, using additional

keywords that can be added in any order. A line can be continued to the

next one by adding a backslash or an ampersand as a last character in the

line. Three kinds of keyword may exist: the directives, which must be placed

at the beginning of a line and deﬁne the argument of the line, the keywords,

which specify the attributes of the diﬀerent ﬁelds in the line and ﬂags, which

simply turn a given option on or oﬀ.

Example.

The following is an example of a complete PLUMED input ﬁle. In this case the input deﬁnes a well-tempered

metadynamics run with two CVs; the ﬁrst is the distance between two atoms, and the second a dihedral

angle.

# general options

HILLS HEIGHT 0.1 W STRIDE 100

WELLTEMPERED SIMTEMP 300 BIASFACTOR 10

# print each 50 time units and add a time offset of 20.0 to COLVAR

PRINT W STRIDE 50 T OFFSET 20.0

# definition of CVs

NOTE distance between hydrogens

DISTANCE LIST 13 46 SIGMA 0.35

NOTE torsional angle

TORSION LIST 1 4 65 344 SIGMA 0.1

# wall on the CV

UWALL CV 1 LIMIT 15.0 KAPPA 100.0 EXP 4.0 EPS 1.0 OFF 0.0

ENDMETA

The ENDMETA directive is the last line read from the PLUMED input ﬁle and

any line added after this keyword will be ignored.

The symbol #allows the user to comment any line in the input ﬁle. The

directive NOTE allows the user to place comments which are copied to the

PLUMED log ﬁle.

The PRINT allows to dump a ﬁle called COLVAR (see more in 3.4) which is

written each W STRIDE timesteps and (optional) a time oﬀset can be speciﬁed

through the TOFFSET keyword. To append the colvar values to an existing

COLVAR ﬁle, the ﬂag APPEND can be used.

3.3 A note on units

The values in the PLUMED input ﬁle are read in the internal units for the MD

engine. For DL POLY the energy in input can be in diﬀerent user-speciﬁed

units; to be consistent, the values in the PLUMED input ﬁle must be in the

same units speciﬁed in the FIELD ﬁle.

3.4 Metadynamics

The metadynamics algorithm applies additional forces to a standard molec-

ular dynamics simulation [9, 23, 24]. In this case, the PLUMED input ﬁle must

contain the deﬁnition of at least one CV (see chapter 4 for the required

syntax), and the HILLS keyword which deﬁnes the details of the bias poten-

tial (see section 3.4.2). In this case the biasing potential will be calculated

and applied to the microscopic degrees of freedom during the run. Before

discussing the additional optional commands (section 3.4.2), we give a brief

overview of the ﬁle produced in a typical metadynamics run.

3.4.1 Typical output

Standard metadynamics will produce, in addition to the usual output ﬁles

generated by the MD engine, a ﬁle called COLVAR that contains the following

data:

•The ﬁrst column contains the time step;

•The following dcolumns contain the values of the CV(s);

•Two additional columns contain the value V(s, t) of the bias potential

at the given point and time, and the potential due to the walls (if

deﬁned).

Since PLUMED 1.2, a more ﬂexible format for COLVAR has been introduced.

Here, the ﬁrst line of the COLVAR ﬁle is a header. The line begins with #,

so as to be ignored by plotting programs such as gnuplot and xmgrace. A

small script plumedat.sh to interpret this ﬁle is provided in the utilities.

Another ﬁle, called HILLS, contains the information of the biasing poten-

tial needed to estimate the free-energy, and to restart metadynamics runs.

The bias potential is given by:

V(~s, t) = X

kτ <t

W(kτ) exp 

−

i=1

(si−s(0)

i(kτ))2

2σ2

i

.(3.1)

Speciﬁcally,

•The ﬁrst column contains the time step at which the contribution to

the bias potential was added, τ, 2τ, etc.;

•The following dcolumns contain the values ~s(0)(t) specifying the posi-

tion of the centroid of the Gaussian;

•The next dcolumns contain the values ~σ specifying the Gaussian width

along the diﬀerent directions in the CV space;

•The last column but one contains the value Wof the Gaussian height;

•The last column contains the bias factor deﬁned in well-tempered meta-

dynamics.

Please refer to section 5.1 for a description of how the potential in equation

3.1 can be computed from the HILLS ﬁle.

It is important to note that a metadynamics run should typically start

from an equilibrated system. The equilibration protocol can be applied with-

out resorting to the PLUMED input ﬁle. However, in order to gain important

information concerning the behavior of the run and the tuning of the pa-

rameters of the metadynamics biasing potential, it is useful to monitor the

behavior of the CVs during the equilibration run. To this aim, just skip the

HILLS directive: in this way, only the COLVAR ﬁle will be generated.

3.4.2 Bias potential

The HILLS directive is used to deﬁne the details of the biasing potential. It

must be followed by the deﬁnition of the weight factor W, preceded by the

keyword HEIGHT (see the note on units in section 3.3). Also the frequency at

which the Gaussians are deposited and written into the HILLS ﬁle must be set

using W STRIDE followed by the number of MD steps between two successive

depositions. The Gaussian width must be set in the proper CV line using

the keyword SIGMA.

Notice that the HILLS directive is not compatible with the COMMITMENT di-

rective.

Example.

The following line switches on metadynamics with Gaussian height of 0.1 (in energy unit of the MD code)

and deposition stride of 1000 MD steps.

HILLS HEIGHT 0.1 W STRIDE 1000

3.4.3 Well-tempered metadynamics

The WELLTEMPERED directive activates well-tempered metadynamics [15] which,

by rescaling the Gaussian weight factor, guarantees the theoretical conver-

gence of metadynamics. In the well-tempered algorithm, the rate at which

the bias potential is added is decreased during the simulation proportionally

to e−V(s,t)/∆T, where ∆Tis a characteristic energy:

V(s, t) =

t0<t

t0=0,τG,2τG,...

W e−V(s(q(t0),t0)/∆Texp −

i=1

(si(q)−si(q(t0))2

2σ2

i!,(3.2)

where W=τGωis the height of a single Gaussian.

For a given temperature of the system T(speciﬁed with the keyword SIMTEMP),

the CVs are sampled at a ﬁctitious higher temperature T+ ∆Tdetermined

by the bias factor (T+ ∆T)/T . The user must specify this bias factor using

the keyword BIASFACTOR.

Example.

The following commands deﬁne a well-tempered metadynamics run, in which the CVs are sampled at the

higher temperature of 3000 K. The initial Gaussian height is 0.1 (in energy unit of the MD code) and the

deposition stride is 1000 MD steps.

HILLS HEIGHT 0.1 W STRIDE 1000

WELLTEMPERED SIMTEMP 300 BIASFACTOR 10

It should be noted that, in the case of well-tempered metadynamics, in

the output printed on the HILLS ﬁle the Gaussian height is rescaled using

the bias factor. This is done in order to directly obtain the free energy (and

not the bias), when summing all the Gaussians deposited during the run.

3.4.4 Restarting a metadynamics run

In order to restart a metadynamics run, the ﬂag RESTART must be added on

the line of the directive HILLS. It allows a metadynamics simulation to be

restarted after an interruption or after a run has ﬁnished. The HILLS ﬁles

will be read at the beginning of the simulation and the bias potential applied

to the dynamics. Note that the presence of the RESTART ﬂag only aﬀects

the metadynamics part of the simulation, and thus the usual procedure for

restarting a MD run must be followed. This depends on the particular MD

engine used and can be found in the relative documentation.

Example.

The following is an example of input ﬁle for restarting a metadynamics simulation.

HILLS RESTART HEIGHT 0.1 W STRIDE 1000

In case of well-tempered metadynamics, the Gaussians height is rescaled

in input according to the bias factor. This is done assuming that the sum

of the Gaussians stored in the HILLS ﬁle is an estimate of the (negative)

free energy landscape. Since the estimate of the free energy is in principle

independent from the choice of the bias factor, it is correct to restart a

well-tempered simulation with a diﬀerent bias factor or even restart from a

non-well-tempered simulation.

3.4.5 Using GRID

Normally the additional forces of metadynamics are calculated every MD

step by summing the contribution coming from the Gaussians deposited up

to this point, following Eq. 3.2. As the simulation goes on, the computational

time spent in the evaluation of these forces becomes larger and larger and

eventually comparable with the time needed to calculate the contribution

of the force ﬁeld itself. This eﬀect is particularly visible when the system

simulated is small or when using a simpliﬁed coarse grained potential.

A possible solution is to store an array containing the current value of the

bias potential (and of the derivatives with respect to the CVs) on a grid. In

this way the computational cost of metadynamics becomes constant all over

the simulation and corresponds to the cost of evaluating a single Gaussian

function on the whole grid with a frequency given by the stride between

subsequent hills. This approach is similar to that proposed in Ref. [25],

but has the advantage that the grid spacing is independent on the Gaussian

width.

This operation can be demanding if the number of collective variable

and/or the number of grid bins is high. However, the cost of adding the

Gaussian on the grid can be substantially reduced taking into account that

this function is almost zero outside a characteristic range determined by

the Gaussian sigma. In PLUMED this interval is calculated once (or at every

modiﬁcation of sigma) and used to build a smaller sub-grid on which the

potential is updated.

In order to use the grid, the directive GRID must be added together with

the keyword CV to specify the collective variable, MIN and MAX to ﬁx the CV

interval, NBIN for the number of bins and the ﬂag PBC if the CV is periodic.

Example.

In this example we run metadynamics using only one CV, a distance between two atoms, and we put the

bias potential on a grid of 200 bins in the interval between 0.0 and 10.0.

HILLS HEIGHT 0.1 W STRIDE 1000

PRINT W STRIDE 50

DISTANCE LIST 13 46 SIGMA 0.35

GRID CV 1 MIN 0.0 MAX 10.0 NBIN 200

ENDMETA

Special labels can be used in the deﬁnition of the interval with MIN and

MAX, such as -pi, +pi, +2pi, -2pi, pi, 2pi. These labels may be par-

ticularly useful with the CVs ANGLE or TORSION.

Example.

In this example we run metadynamics using a dihedral angle and putting the bias potential on a periodic

grid of 200 bins in the interval between -pi and pi.

HILLS HEIGHT 0.1 W STRIDE 1000

PRINT W STRIDE 50

TORSION LIST 13 15 17 19 SIGMA 0.35

GRID CV 1 MIN -pi MAX +pi NBIN 200 PBC

ENDMETA

As in standard metadynamics, a HILLS ﬁle containing the list of Gaussians

deposited is produced. This ﬁle is needed for restarting a metadynamics

simulation also when using GRID.

The bias potential in a generic point is calculated by a polynomial inter-

polation which has the proper values of the function and of its derivatives

in 2dpoints, where dis the number of collective variables. The forces are

then obtained as the analytical derivatives of the bias. You can switch oﬀ

the use of splines by using the directive NOSPLINE. In this case, the bias and

forces are simply taken at a close grid point (this requires a much denser grid).

Please also note that:

•GRID must be activated (or switched oﬀ) on ALL the CVs;

•GRID can be used together with multiple walkers metadynamics, bias-

exchange and parallel tempering metadynamics;

•For a correct calculation of the potential and forces, the bin size must

be smaller than half the Gaussian sigma. If a larger size is used, the

code will stop.

•If the simulation goes out of the grid, the code will stop. Please increase

MIN or MAX and restart metadynamics.

Writing a GRID to ﬁle

The directive WRITE GRID allows to save on a ﬁle the grid on which the bias

potential is stored and the relative forces. The keyword W STRIDE can be

used to specify the writing stride and FILENAME the name of the ﬁle. Since

saving the entire grid on ﬁle may take some time, a reasonable writing stride

should be used.

Example.

The following command controls a metadynamics calculation using as CV the distance between two

atoms. The bias potential is stored on a grid and saved to the ﬁle bias.dat every 100000 steps.

HILLS HEIGHT 0.1 W STRIDE 1000

DISTANCE LIST 10 12 SIGMA 3.0

GRID CV 1 MIN 0.0 MAX 10.0 NBIN 10

WRITE GRID FILENAME bias.dat W STRIDE 100000

The ﬁle on which the grid is saved has a speciﬁc format. A header contains

information about the presence of force data, the number and type of CVs,

the grid dimension and boundaries and whether the CVs are periodic or not.

The rest of the ﬁle contains for each grid point the value of the bias potential

and forces (i.e. minus the derivative of the potential with respect to the

CVs).

Example.

The header states that force data are on ﬁle, only one CV is present (a distance, see Tab. 3.1 for a

legend). The grid is made of 10 bins ranging from 0 to 10 in unit of the CV. This CV is not periodic.

#! FORCE 1

#! NVAR 1

#! TYPE 1

#! BIN 10

#! MIN 0.000000

#! MAX 10.000000

#! PBC 0

0.000000 0.998838 -0.016081

1.000000 0.960195 0.091229

2.000000 0.825978 0.170252

3.000000 0.635806 0.201699

4.000000 0.437950 0.187594

5.000000 0.269942 0.145621

6.000000 0.148888 0.096861

7.000000 0.073484 0.055971

8.000000 0.032454 0.028326

9.000000 0.012826 0.012620

10.000000 0.004536 0.004967

This ﬁle can be used to plot the free energy resulting from a metadynamics

simulation instead of summing the Gaussians stored on the HILLS ﬁle with the

post-processing code sum_hills. When using well-tempered metadynamics,

please remember that the bias potential does not compensate exactly the

underlying free energy [15]. In this case, the potential written on ﬁle must

be rescaled accordingly.

Reading a GRID from ﬁle

The bias potential stored on ﬁle can be used to restart a metadynamics simu-

lation by specifying the directive READ GRID and the ﬁle name with FILENAME.

Example.

The following command controls a metadynamics calculation using as CV the distance between two

atoms. The initial bias potential is read from the ﬁle bias.dat.

HILLS HEIGHT 0.1 W STRIDE 1000

DISTANCE LIST 10 12 SIGMA 3.0

GRID CV 1 MIN 0.0 MAX 10.0 NBIN 10

READ GRID FILENAME bias.dat

Please also note that:

•The CV number and type in the header must be consistent with what

declared in the PLUMED input ﬁle;

•The number of bins and the boundaries in the header can be diﬀerent

from what declared in the PLUMED input ﬁle. In this case the parameters

of the PLUMED input ﬁle will be used and the grid present on ﬁle will be

interpolated to ﬁt the new dimensions;

•If force data are not present on ﬁle, they will be calculated from the

bias potential using ﬁnite diﬀerences;

•READ GRID is not compatible with the MULTIPLE WALKERS directive.

Restarting a metadynamics run from a grid written on ﬁle is fully com-

patible with the standard restart by reading a HILLS ﬁle created in a previous

run.

Example.

The following command controls a metadynamics calculation using as CV the distance between two

atoms. The initial bias potential is read from the ﬁle bias.dat. To this initial bias, the Gaussians de-

posited on the HILLS ﬁle are added.

HILLS RESTART HEIGHT 0.1 W STRIDE 1000

DISTANCE LIST 10 12 SIGMA 3.0

GRID CV 1 MIN 0.0 MAX 10.0 NBIN 10

READ GRID FILENAME bias.dat

3.4.6 Multiple walkers

The MULTIPLE WALKERS directive sets the multiple walkers [16] running mode.

All the processes must have the same CVs, in the same order. Each process

will write its own bias potential in the directory speciﬁed by the HILLS DIR

keyword and in a ﬁle named HILLS.0,HILLS.1, etc etc. Multiple pro-

cesses must be launched independently and must point to the same directory

HILLS DIR, so that each one will contribute to the total bias potential.

The keyword NWALKERS sets the total number of walkers. This can be set

to a value greater than the actual number of processes running and it can be

increased during the run. ID determines a unique id of the walker, starting

from 0. The R STRIDE keyword sets the stride (in time steps) at which each

individual bias potential is updated by reading all the HILLS ﬁles contained

in HILLS DIR.

Example.

The following command controls a multiple walkers calculation using a maximum of 10 walkers. Gaussians

are added every 1000 steps and updated every 5000 steps.

HILLS HEIGHT 0.1 W STRIDE 1000

MULTIPLE WALKERS HILLS DIR /scratch/HILLS R STRIDE 5000 NWALKERS 10 ID 0

3.4.7 Monitoring a collective variable without biasing

The directive NOHILLS can be used to monitor a CV during a metadynamics

run without applying a bias on it. The keyword CV must be speciﬁed to select

the CV to be monitored. This directive must be used for metadynamics

simulations with Bias-Exchange.

Example.

In this example we run metadynamics using only one CV, a distance between two atoms. During the

simulation we monitor the behavior of another CV, the dihedral deﬁned by a set of four atoms, without

putting a bias on it.

HILLS HEIGHT 0.1 W STRIDE 1000

PRINT W STRIDE 50

DISTANCE LIST 13 46 SIGMA 0.35

TORSION LIST 1 4 65 344 SIGMA 0.1

NOHILLS CV 2

ENDMETA

As an alternative you may also avoid the SIGMA value in the CV. This

will be understood as a directive that no hills must be put on this CV. The

previous example therefore becomes:

Example.

HILLS HEIGHT 0.1 W STRIDE 1000

PRINT W STRIDE 50

DISTANCE LIST 13 46 SIGMA 0.35

TORSION LIST 1 4 65 344

ENDMETA

where the NOHILLS keyword for CV 2 has been omitted as implicitly included

by the missing SIGMA parameter.

Avoiding systematic errors at the boundaries

Metadynamics is often used to reconstruct the free energy as a function of

intrinsically limited CVs (like e.g. COORD and ALPHABETA, see chapter 4) or

articially limited CVs (using for example an external potential, see section

3.10). In this case the ﬁnite-width Gaussians used in metadynamics can

induce systematics errors at the boundaries. These errors become more and

more severe with time, and are due to the fact that a sum of a ﬁnite number

of Gaussians can not reproduce a free-energy proﬁle with large or inﬁnite

derivative. However, PLUMED implements two diﬀerent procedures aimed at

avoiding the onset of these systematic errors, activated respectively by the

keyword INTERVAL and by the keyword INVERT.

3.4.8 Deﬁning an interval

With the keyword INTERVAL one changes the metadynamics algorithm set-

ting the bias force equal to zero beyond a boundary [26]. If, for example,

metadynamics is performed on a CV sand one is interested only to the free

energy for s > sw, the history dependent potential is still updated according

to Eq. 3.1, but the metadynamics force is set to zero for s < sw:

∂VG(s, t)

∂s = 0 ∀s < sw

Notice that Gaussians are added also if s < sw, as the tails of these Gaussians

inﬂuence VGin the relevant region s > sw. In this way, the force on the system

in the region s > swcomes from both metadynamics and the force ﬁeld, in the

region s < swonly from the latter. This approach allows obtaining a history-

dependent bias potential VGthat ﬂuctuates around a stable estimator, equal

to the negative of the free energy far enough from the boundaries [26].

•In order to obtain a parallel growing VG, one should place the interval

limit swin a region where the free energy derivative is not large. For

example, if one uses the keyword with a coordination number CV (see

Section 4.6), that is intrinsically limited to s > 0, the correct choice for

swis not 0, but a number of the order of the width of the Gaussians

(σin Eq. 3.1).

•This remedy has the advantage of being robust and parameter-free, but

works only for one-dimensional biases.

•If in the region s < swthe system has a free energy minimum, the

INTERVAL keyword should be used together with a soft wall at sw

(keyword LWALL, see section 3.10), namely a time-indipendent exter-

nal potential that prevents the system from going there and remaining

trapped in the minimum. An example is provided below.

Example.

The following input ﬁle contains the interval condition in presence of soft walls:

HILLS HEIGHT 0.1 W STRIDE 500

COORD LIST <g1> <g2> NN 8 MM 12 D 0 0 R 0 0.25 SIGMA 0.15

g1->

26 27

g1<-

g2->

29 30

g2<-

INTERVAL CV 1 LOWER LIMIT 0.2 UPPER LIMIT 2.8

UWALL CV 1 LIMIT 2.8 KAPPA 3000.0

LWALL CV 1 LIMIT 0.2 KAPPA 3000.0

ENDMETA

3.4.9 Inversion condition

The INVERT keyword activates the inversion condition near predeﬁned bound-

aries LIMIT1,LIMIT2[27]. To simplify the notation, we here assume the CVs

space is one-dimensional with the boundary at s= 0. The inversion condition

ensure that near s= 0 the bias potential satisfy the following relation:

V(−s, t)≈2V(0, t)−V(s, t) (3.3)

This property ensures that, at stationary conditions, the history-dependent

potential is approximately linear close to the boundary, but it does not im-

pose the value of its derivative, that is iteratively determined by the thermo-

dynamic bias. In practice this is achieved by adding extra Gaussians out of

the boundaries according to the following roles:

•An interval centered in 0 is chosen, whose width χ1is of the order of σ

(Gaussian width for the selected CV; SIGMA).

•If s < χ1another Gaussian centered in −sand with the same width

and height is added.

•If s > χ1another Gaussian centered in −sand with the same width is

added. In this case, the height of the extra Gaussians depends on V

and is given by:

w= [2V(0, t)−V(−s, t)−V(s, t)] y(s),(3.4)

where y(s) = 1/h1+(s/(χ2χ1))10iand χ2> χ1

The second factor in Eq. (3.4) is approximately one for |s|< χ2χ1and goes

to zero for |s|> χ2χ1. This ensures that Vgoes smoothly to zero out of the

boundaries. χ2χ1thus deﬁnes the width of the inversion interval and χ2is

a scaling factor regulated by the keyword INVERSION.χ1is regulated by the

keyword REFLECTION (χ1is in σunits). The keyword MAXHEIGHT deﬁnes the

largest |w|in gaussian height units and it regulates the speed of variation

of Vout of the boundaries. The INVERT keyword can be used in presence of

an external potential that limits the CVs space exploration (see section 3.10)

like in the example below;

Example.

The following input ﬁle contains the inversion condition in presence of soft walls:

HILLS HEIGHT 0.1 W STRIDE 500

COORD LIST <g1> <g2> NN 8 MM 12 D 0 0 R 0 0.25 SIGMA 0.15

g1->

26 27

g1<-

g2->

29 30

g2<-

INVERT CV 1 REFLECTION 1.6 INVERSION 6 MAXHEIGHT 4 LIMIT1 0.2 LIMIT2 2.8

UWALL CV 1 LIMIT 2.8 KAPPA 3000.0

LWALL CV 1 LIMIT 0.2 KAPPA 100000.0

ENDMETA

or if an intrinsically limited CV is used (see chapter 4):

Example.

The following input ﬁle contains the inversion condition in presence of an intrinsically limited CV, such as

ALPHABETA:

HILLS HEIGHT 0.1 W STRIDE 250

ALPHABETA NDIH 1 SIGMA 0.05

5 12 14 23 -3.14159

INVERT CV 1 REFLECTION 1.6 INVERSION 6 MAXHEIGHT 4 LIMIT1 0.0 LIMIT2 1.0

ENDMETA

If INVERT is used in presence of soft walls it is recommended to use

KAPPA larger than KT/σEXP, where EXP is the exponent deﬁned for the

wall (see section 3.10). Much smaller values for KAPPA in fact do not eﬃ-

ciently prevent the exploration of CVs regions that are located at more than

σbeyond the bounduaries and this may lead to systematics errors. It is worth

to note that LIMIT1 and LIMIT2 must not be in unphysical regions of the

CVs space, i.e. they must be visited during the simulation. The present im-

plementation does not remove systematics errors for multidimensional meta-

dynamics in regions of the CVs space that are near crossing boundaries (e.g.

if LIMIT1=0 for CV 1 and LIMIT1=0 for CV 2 there still can be systematic

error near (0,0) ). The values for the REFLECTION,INVERSION and MAXHEIGHT

keywords chosen in the examples are also their default values (if not speciﬁed

the program selects automatically their default values). They were chosen in

other to minimize the free energy error at the boundaries for several diﬀerent

test cases. Further information can be found in ref.[27].

3.5 Running in parallel

Simulations of large systems can often accelerated by using parallel machines.

The behavior of PLUMED in parallel simulations is dependent on the host code:

•NAMD: the present version of PLUMED is fully compatible with NAMD

for parallel simulations. However, since PLUMED runs on the ﬁrst pro-

cessor only, the computational eﬀort required to evaluate the collective

variables and their derivatives, as well as the history dependent poten-

tial in metadynamics simulations, should be kept lower than 1/Npof

the total computational eﬀort, where Npis the number of processors.

Thus, pay attention to heavy variables and metadynamics simulations

with many hills.

•GROMACS, DL POLY, AMBER and LAMMPS: the present version

of PLUMED is fully compatible with these codes for parallel simulations.

Consider the following notes:

–The coordinates of the particles involved in collective variables are

replicated over all nodes at every step. If you don’t want to slow

down your simulation, minimize the number of involved atoms.

–The computational eﬀort required to evaluate the collective vari-

ables and their derivatives is replicated on all processors, thus is

eﬀectively not scaling. Pay attention to heavy variables (such as

coordination numbers with long lists).

–The computational eﬀort required to evaluate the history depen-

dent potential in metadynamics simulations is spread over proces-

sors, thus should scale linearly.

–With GROMACS and domain decomposition, pay attention to

periodic boundary conditions (see Section. 4.28).

–With DL POLY, use the patched version of MakePAR to compile

PLUMED.

3.6 Replica exchange methods

When combined with GROMACS (both version 4.0 and 4.5), PLUMED can

perform replica–exchange simulations coupled with metadynamics in two

diﬀerent ways: parallel tempering metadynamics (PTMetaD) [17, 28] and

bias-exchange metadynamics (BE-META) [18].

3.6.1 Parallel tempering metadynamics

Parallel tempering metadynamics (currently implemented only for the GRO-

MACS engine) is selected using the PTMETAD directive.

To run parallel tempering simulations with metadynamics, one has to

follow the standard GROMACS procedure for parallel tempering (see GRO-

MACS manual). A binary topology .tpr ﬁle must be prepared for each

replica, while only one plugin input ﬁle is required.

Example.

The following input ﬁle deﬁnes a parallel-tempering metadynamics:

# switching on metadynamics and Gaussian parameters

HILLS HEIGHT 0.1 W STRIDE 500

# switching on parallel tempering

PTMETAD

# instruction for CVs printout

PRINT W STRIDE 50

# the CV: radius of gyration

RGYR LIST <CA> SIGMA 0.1

CA->

20 22 26 30 32

CA<-

# end of the input

ENDMETA

The PTMETAD directive switches on parallel tempering. All replicas have

the same CVs, in this particular case the radius of gyration deﬁned by the

group of atoms <CA> .

The Gaussian height set by the keyword HEIGHT is automatically rescaled

with temperature, following Wi=W0Ti

T0, where iis the index of a replica and

Tiits temperature.

Similarly, the simulation temperature needed to use the well-tempered

algorithm (which, for non-parallel simulations is set with the SIMTEMP key-

word) is here taken directly from the GROMACS input at each replica. As

a result, the value of ∆Tfor the well-tempered algorithm is rescaled across

the replicas.

The plugin will produce one COLVAR ﬁle and one HILLS ﬁle for each

replica.

3.6.2 Bias exchange simulations

Bias exchange simulations must be run using the BIASXMD directive. Only

the GROMACS engine currently has this feature implemented within a single

parallel run, whereas the bias-exchange tool in the directory utilities al-

lows to perform bias-exchange simulations with any MD engine via the linux

shell (see Section 5.6).

The procedure for running bias-exchange simulations is similar to the

one described earlier for parallel tempering (i.e., add -multi nrep -replex

nexch to the mdrun command line, where nrep is the number of replicas and

nexch is the number of steps between attempting exchanges). One plugin

input and one binary topology ﬁle must be provided for each replica. These

ﬁles must be named with the replica index (e.g., plumed0.dat,plumed1.dat,

..., md0.tpr,md1.tpr, ...). Each plugin input ﬁle must contain the directive

BIASXMD and the same list of CVs, in the same order. Each replica usually

has only one (or few) of the CVs active: use a NOHILLS directive for each

of the inactive CVs (or do not specify SIGMA for them). It is also possible

to have replicas with all CVs inactive: such unbiased replicas can exchange

with biased ones, jumping in this way beyond free-energy barriers, but since

they are not subject to the hills they tend to populate the free-energy basins

according to equilibrium statistics.

Example.

The following two input ﬁles deﬁne a bias-exchange metadynamics run:

# --- input file plumed0.dat ---

# switching on metadynamics and Gaussian parameters

HILLS HEIGHT 0.1 W STRIDE 500

# switching on bias-exchange

BIASXMD

# instruction for CVs printout

PRINT W STRIDE 50

# instruction for labeling of COLVAR and HILLS files

HILLS LABEL A

# the CVs: dihedral angles

TORSION LIST 1 5 11 13 SIGMA 0.314

TORSION LIST 11 13 15 21 SIGMA 0.314

# in this replica we bias only CV 1:

NOHILLS CV 2

# end of the input

ENDMETA

# --- input file plumed1.dat ---

# switching on metadynamics and Gaussian parameters

HILLS HEIGHT 0.1 W STRIDE 500

# switching on bias-exchange

BIASXMD

# instruction for CVs printout

PRINT W STRIDE 50

# instruction for labeling of COLVAR and HILLS files

HILLS LABEL B

# the CVs: dihedral angles

TORSION LIST 1 5 11 13 SIGMA 0.314

TORSION LIST 11 13 15 21 SIGMA 0.314

# in this replica we bias only CV 2:

NOHILLS CV 1

# end of the input

ENDMETA

In bias-exchange runs, the plugin will produce one COLVAR ﬁle and one

HILLS ﬁle for each replica. What is actually exchanged between replicas are

the atomic coordinates, not the bias, so that replica 0 (printing out COLVAR0)

will always be biased by HILLS0, replica 1 (printing out COLVAR1) will always

be biased by HILLS1,) and likewise for the other replicas.

In the example above, note the use of the keyword HILLS LABEL: this will

print a comment line ”#! ACTIVE 1 1 A” in ﬁles COLVAR0 and HILLS0 and

”#! ACTIVE 1 2 B” in ﬁles COLVAR1 and HILLS1, indicating the number of

active CVs, their index, and the chosen label. This facilitates the reconstruc-

tion of the multidimensional free-energy landscape from the bias-exchange

simulation according to the weighted-histogram algorithm in Ref. [29], e.g.

employing the METAGUI program [30] downloadable from

http://www.plumed-code.org/contributions

See also Section 5.6 for informations on bias-exchange simulations with

MD codes diﬀerent from GROMACS.

3.7 Umbrella sampling

The directive UMBRELLA allows umbrella sampling calculations to be per-

formed on the CV speciﬁed by the parameter keyword CV. The position s0

of the umbrella restraint is determined by the keyword AT, and the spring

constant k- in internal unit of the main code - by the keyword KAPPA. Op-

tionally, also a constant force mcan be speciﬁed by the keyword SLOPE. This

turns on a potential of the following functional form:

Vumb(s) = 1

2k(s−s0)2+m(s−s0).(3.5)

Example.

The following input ﬁle deﬁnes an umbrella potential acting on the ﬁrst CV and centered in s0=−1.0.

TORSION LIST 13 15 17 1

UMBRELLA CV 1 KAPPA 200 AT -1.0

PRINT W STRIDE 100

ENDMETA

In the case of umbrella sampling runs, the value of the CVs is printed on

the COLVAR ﬁle with a stride ﬁxed by the directive PRINT and the keyword

W STRIDE. The ﬁle contains the time, the CV values, the metadynamics po-

tential, the harmonic potential of umbrella sampling, the CV on which the

restraint acts and the position of the restraint. The ﬁnal calculation of the

free energy as a function of this CV can be done using the weighted histogram

analysis method, choosing one of the many possible implementations, for in-

stance the wham code by Alan Grossﬁeld [31]. The directive UMBRELLA can

be used multiple times in the case of multidimensional umbrella sampling

calculations (one directive for each CV).

The keyword RESTART can be used when restarting an umbrella sampling

calculation to append the value of the CVs on the COLVAR ﬁle.

3.8 Steered MD

PLUMED can be used to drag a system to a target value in CV space using

an harmonic potential moving at constant speed. If the process is reversible,

i.e. for velocities tending to zero, the work done in the dragging corresponds

to the free energy. In the case of ﬁnite velocity it is possible to obtain an

estimate of the free-energy from the work distribution using Jarzynski [13]

or Crooks [14] relations.

The directive STEER activates the steering on the collective variable spec-

iﬁed by the keyword CV. The target value is determined by the keyword TO,

the velocity, in the same unit as the corresponding CV every 1000 steps, by

VEL and the spring constant by KAPPA. An additional keyword FROM can be

used to specify the starting point of the dragging, otherwise the CVs values

at the ﬁrst step are taken as the starting point. The functional form of the

dragging potential is the same as the one of formula 3.5, with the reference

position s0moving at the speciﬁed velocity.

The keyword RESTART can be used when restarting a steered MD calcu-

lation to append the value of the CVs on the COLVAR ﬁle.

Example.

The following input ﬁle deﬁnes a steered MD on the angle CV to a target value of 3.0 rad.

ANGLE LIST 13 15 17

STEER CV 1 TO 3.0 VEL 0.5 KAPPA 500.0

PRINT W STRIDE 100

ENDMETA

3.8.1 Steerplan

PLUMED can be used to drag a system on a pathway that is composed of

successive steering runs on chosen degrees of freedom in a planned fashion.

While the directive STEER is rather easy and intuitive, STEERPLAN is more

ﬂexible and it allows to avoid a lot of scripting whenever you plan to simulate

complex transitions by means of out-of-equilibrium runs.

The directive STEERPLAN activates this option and reads the name of a

ﬁle that contains the plan as an input.

Example.

The following input ﬁle contains many collective variables and a steerplan directive.

PRINT W STRIDE 5

# these CVs are put here just to show that you may use more variable than the

# ones defined in the steerplan

TARGETED TYPE MSD FRAMESET restrained.pdb

TARGETED TYPE MSD FRAMESET waters.pdb

TARGETED TYPE MSD FRAMESET ref H.pdb

# difference of distances for proton transfers

DISTANCE LIST 53 703 DIFFDIST 703 702

DISTANCE LIST 702 913 DIFFDIST 913 912

# steer plan read from file myplan

STEERPLAN myplan

ENDMETA

The steerplan ﬁle myplan contains the control sequence of the steerplan

and it is composed as follows. The ﬁrst column contains the time (in timeu-

nits of the program) at which a particular action is planned. The following

columns are composed in blocks of ﬁve. Each block contains: the CV key-

word, the index of the CV to be used, the spring constant at a given time,

the value of the center of the spring potential and a keyword among CENTRAL

(the potential is a normal parabolic shape), POSITIVE (the potential with

parabolic shape is applied only when the CV value is higher than the center

of the spring) and NEGATIVE (the potential with parabolic shape is applied

only when the CV value is lower than the center of the spring). A Block

of four columns without the latter keyword is also accepted. In this case

CENTRAL is assumed. Wildcards (*) are also accepted for the position. Their

meaning is much clearer in the following example. Comments are allowed

(#) .

Example.

An example of steerplan ﬁle.

# Bring CV 4 from wherever it is (*) to -0.5 at 2 ps by increasing

# gradually the spring constant from 0 to 800. CV 5 does the same and goes to -0.64.

0.00 CV 4 000.0 * POSITIVE CV 5 000.0 * POSITIVE

2000.00 CV 4 800.0 -0.5 POSITIVE CV 5 800.0 -0.64 POSITIVE

# Now in 400 fs keep CV 4 at its value while releasing the other

# slowly to 0 spring constant.

2400.00 CV 4 800.0 -0.5 POSITIVE CV 5 0.0 -0.64 POSITIVE

# Now drag back only CV 4 to 1.4 with a value that acts only

# on the negative part. The potential on CV 5 is off.

8400.00 CV 4 800.0 1.4 NEGATIVE CV 5 0.0 -0.64 POSITIVE

Please note that the wildcards have particular meaning:

Example.

Here the dragging force starts from whatever value the system assumes at 0 fs and drags it to -0.5 at 2

ps. The spring constant is linearly increasing from 0.0 to 800.0. When the starting point is a wildcard this

takes the current value.

0.00 CV 4 000.0 * POSITIVE

2000.00 CV 4 800.0 -0.5 POSITIVE

Here the center of the harmonic potential follows the system at each step from 0 to 2 ps. When the

position of the ending point is a wildcard then the potential will follow the coordinates (this means that the

potential is not applied as the force is zero everywhere).

0.00 CV 4 800.0 * POSITIVE

2000.00 CV 4 800.0 * POSITIVE

This case is considered identical as the one before

0.00 CV 4 800.0 0.5 POSITIVE

2000.00 CV 4 800.0 * POSITIVE

3.9 Adiabatic Bias MD

PLUMED can be used to evolve a system towards a target value in CV space

using an harmonic potential moving with the thermal ﬂuctuations of the

CV[20, 32, 33].

The directive ABMD activates the biasing on the collective variable speciﬁed

by the keyword CV. The target value is determined by the keyword TO, the

spring constant by KAPPA. The biasing potential is implemented as

V(ρ(t)) = (α

2(ρ(t)−ρm(t))2, ρ(t)> ρm(t)

0, ρ(t)≤ρm(t),(3.6)

where

ρ(t) = (CV (t)−T O)2(3.7)

and

ρm(t) = min

0≤τ≤tρ(τ).(3.8)

The method is based on the introduction of a biasing potential which is

zero when the system is moving towards the desired arrival point and which

damps the ﬂuctuations when the system attempts at moving in the opposite

direction. As in the case of the ratchet and pawl system, propelled by thermal

motion of the solvent molecules, the biasing potential does not exert work

on the system.

The keyword RESTART can be used when restarting an adiabatic bias MD

calculation to append the value of the CVs on the COLVAR ﬁle and to set the

best value previously reached.

Example.

The following input ﬁle deﬁnes an adiabatic biased MD on the angle CV to a target value of 3.0 rad.

ANGLE LIST 13 15 17

ABMD CV 1 TO 3.0 KAPPA 500.0

PRINT W STRIDE 100

ENDMETA

The corresponding 

COLVAR will look like:

#! FIELDS time cv1 vbias vwall vext XX XX ABMD1

0.000 1.884703657 0.000000000 0.000000000 0.000000000 ABMD 1 1.243885933

0.200 2.353724409 0.000000000 0.007812004 0.000000000 ABMD 1 0.412082147

0.400 2.433164746 0.000000000 0.000000000 0.000000000 ABMD 1 0.321302206

0.600 2.608463835 0.000000000 0.004202352 0.000000000 ABMD 1 0.149200641

0.800 2.818935636 0.000000000 0.009463992 0.000000000 ABMD 1 0.026631583

1.000 2.831631093 0.000000000 0.147609095 0.000000000 ABMD 1 0.004049192

1.200 2.583559043 0.000000000 7.171877688 0.000000000 ABMD 1 0.004049192

1.400 2.841442463 0.000000000 0.121111568 0.000000000 ABMD 1 0.003130352

1.600 2.794514101 0.000000000 0.382087213 0.000000000 ABMD 1 0.003130352

1.800 2.634418597 0.000000000 4.258829096 0.000000000 ABMD 1 0.003130352

2.000 2.861303394 0.000000000 0.086113847 0.000000000 ABMD 1 0.000677239

where the last column is ρm(t).

3.10 External potentials

3.10.1 Walls

The UWALL and LWALL keywords deﬁne a wall for the value of the CV swhich

limits the region of the phase space accessible during the simulation. The

restraining potential starts acting on the system when the value of the CV is

greater (in the case of UWALL) or lower (in the case of LWALL) than a certain

limit LIMIT minus an oﬀset OFF.

The functional form of this potential is the following:

Vwall(s) = KAPPA s−LIMIT +OFF

EPS EXP

,(3.9)

where KAPPA is an energy constant in internal unit of the code, EPS a rescaling

factor and EXP the exponent determining the power law.

By default: EXP = 4,EPS = 1.0,OFF = 0.

Example.

To run a well-tempered metadynamics simulation using as CV the distance between one atom and the

center of mass of a group of atoms and limiting its value below 15 ˚

A, you have to use the following input

ﬁle:

HILLS HEIGHT 0.1 W STRIDE 100

WELLTEMPERED SIMTEMP 300 BIASFACTOR 10

PRINT W STRIDE 50

DISTANCE LIST 13 <g1> SIGMA 0.35

g1->

17 20 22 30

g1<-

UWALL CV 1 LIMIT 15.0 KAPPA 100.0

ENDMETA

3.10.2 Tabulated potentials

An external potential of generic form can be added to any collective variables

using the directive EXTERNAL. The user must specify the total number of

CVs on which the potential acts with the keyword NCV and the variables

with CV. The external potential must be provided in a tabulated form in the

ﬁle speciﬁed by FILENAME. The format used for the external potential is the

same as the one described in 3.4.5 for the case of metadynamics potential on

aGRID.

Example.

The following input ﬁle controls a simulation with an external potential acting on two collective variables.

The tabulated potential is provided in the ﬁle external.dat.

PRINT W STRIDE 100

DISTANCE LIST 13 20

TORSION LIST 5 8 10 12

EXTERNAL NCV 2 CV 1 2 FILENAME external.dat

Every collective variables implemented in PLUMED has a unique ID. This

number must be used to deﬁne the type of CV in the header of the external

potential ﬁle and must match the CV activated in the PLUMED input ﬁle. In

Tab. 3.1 we provide a legend.

Example.

The header of the external potential ﬁle of the previous example looks like this:

#! FORCE 1

#! NVAR 2

#! TYPE 1 5

#! BIN 100 100

#! MIN 0.0 -3.14159

#! MAX 10.0 3.14159

#! PBC 0 1

If force data are not present on ﬁle, they will be calculated from the

tabulated potential using ﬁnite diﬀerences.

ID CV

1 Distance

2 Minimum distance

3 Coordination number

4 Angle

5 Torsion

6 Alpha-beta similarity

7 Hydrogen bonds

8 Dipole

11 Radius of gyration

16 Dihedral correlation

20 Interfacial water

30 Path collective variable S

31 Path collective variable Z

32 Absolute position

33 Electrostatic potential

34 Puckering coordinates

35 Energy

36 Helix loops

37 Alpha helix rmsd

38 Antiparallel beta rmsd

39 Parallel beta rmsd

42 PCA projection

45 Contact Map

55 SPRINT

Table 3.1: ID of the collective variables implemented in PLUMED.

3.11 Commitment analysis

The COMMITMENT directive is used to run commitment analysis. The keyword

NCV determines the total number of CVs for the analysis, while CV must

be used to specify the variable id. Following this line, NCV lines must be

provided, each of which containing the upper and lower limits of the Aand

Bbasins for the i-th variable.

Example.

The following line deﬁnes a commitment analysis on the ﬁrst two collective variables s1and s2, while the

third is only monitored. The commitment basins are deﬁned as A={(s1, s2)|s1∈(0,1), s2∈(−1,1)},

and B={(s1, s2)|s1∈(1,2), s2∈(−1,1)}.

COMMITMENT NCV 2 CV 1 2

0.0 1.0 1.0 2.0

-1.0 1.0 -1.0 1.0

DISTANCE LIST 12 30

DISTANCE LIST 20 24

TORSION LIST 14 16 20 22

ENDMETA

3.12 Projection of gradients

The keyword PROJ GRAD is enable one to calculate the projection of the gra-

dient in this way:

Pij(R) =

NAT

n∇nsi(R)· ∇nsj(R) (3.10)

where si(R) and sj(R) are two collective coordinates deﬁned in input. The

output is stored into the COLVAR ﬁle in the form of upper diagonal matrix.

The initial line of COLVAR ﬁle describes the output in detail.

Example.

The following line instructs plumed to calculate the projection of two collective variables s1and s2.

DISTANCE LIST 1 2

TORSION LIST 1 2 3 4

TORSION LIST 2 3 4 8

PROJ GRAD CV <g1>

g1->

2 3

g1<-

ENDMETA

In the COLVAR ﬁle you ﬁnd a PROJ GRAD keyword followed by N(N+ 1)/2

elements that constitute the upper diagonal matrix (diagonal elements in-

cluded) of the projection. It is important to know that PROJ GRAD is NOT

supporting the LIST keyword therefore all the variables that one needs to

use must be included in a ”group” as reported in the example above.

3.13 Reconnaissance metadynamics

Reconnaissance metadynamics is a self-learning algorithm for accelerated dy-

namics that is able to work with a very large number of collective coordinates.

Acceleration of the dynamics is achieved by constructing a bias potential in

terms of a patchwork of one-dimensional, locally valid collective coordinates.

To understand the details of the methodology please read reference [19]. Fur-

thermore, as detailed in section 2.2, by default PLUMED compiles without re-

connaissance metadynamics and so some slight modiﬁcations in compilation

are required if you wish to run this type of simulation.

For a reconnaissance metadynamics simulation the PLUMED input must

contain the deﬁnition of at least one CV (see chapter 4 for the required

syntax) and the RECONNAISSANCE,BASINS,ONIONS and CLUSTER keywords.

These keywords provide the parameters for reconnaissance metadynamics

simulations and are described in sections 3.13.2 and 3.13.3.

3.13.1 Typical output

A reconnaissance metadynamics simulation produces at least four output

ﬁles. These ﬁles are called BASINS,ONIONS,PPCA DIAGNOSTICS and CLUSTER DATA.0.

The ﬁrst of these ﬁles contains the locations of the various basins found dur-

ing cluster analysis and their shapes. This data is formatted as follows:

•The ﬁrst column contains the basin number

•The second column contains the timestep at which the basin was cre-

ated

•The following dcolumns contain the values of the collective coordinates

at the basin center

•The next d2columns contain the covariance of the collective coordinates

inside the basin

•The ﬁnal column contains the value of the “diﬀusion constant” in this

particular basin.

The second ﬁle, ONIONS, gives the details of the bias. In reconnaissance

metadynamics the bias consists of Gaussian functions that are added to the

distance from basin centers. These Gaussian functions are only added when

one is within a certain distance of the basin center, the basin size. This

basin size changes as the simulation progresses and the ONIONS ﬁle stores the

details of all basin expansion events as well as the details of the hill addition

events. Each event is recorded on a single line of the ONIONS ﬁle, which is

formatted as follows:

•The ﬁrst column contains the time step at which the contribution to

the bias potential was added.

•The second column contains the number of the basin to which this

particular Gaussian was added

•The third column contains the distance from the basin center speciﬁed

in column 2 at which the centroid of the Gaussian is to be found

•The fourth column contains the width of the Gaussian

•The ﬁfth column contains the height of the Gaussian

•The ﬁnal column contains the current size of the basin speciﬁed in

column 2.

Basin expansion events can be distinguished from hill addition events as

in the former case the height of the “Gaussian” in column 4 is zero.

The PPCA DIAGNOSTICS ﬁle contains a information on how successful the

PPCA algorithm, that is used for clustering, has been. By default ﬁts of the

data are performed with 1 to 10 Gaussian functions. Bayesian information

criteria ( −log(L) + nplog(M), log(L), npand Mbeing the log likelyhood,

number of parameters and number of datapoints respectively) and fuzzy vol-

umes ( PN

n=1 q|Σn|) are reported for each ﬁt. The best ﬁt to the data is the

one with the minimum value for the Bayesian Criterion and it is from this

ﬁt that the bias is constructed. In our early applications of the methodology

we have found that the if a large number of Gaussian is regularly needed to

ﬁt the collected data the method will not be successful. However, a change

of cvs can often resolve this problem. After the PPCA DIAGNOSTICS ﬁle has

reported on the quality of the ﬁts with diﬀerent numbers of Gaussians the

weights of the various Gaussians in the best ﬁt to the data are reported along

with information on which Gaussians have been accepted into the biassing

strategy.

All the remaining ﬁles produced during a reconnaissance metadynamics

simulation are only required to restart simulations.

Unlike in normal metadynamics one cannot obtain the free energy surface

from an examination of the bias potential at the end of the simulation. How-

ever, a graphical tool that can be added to vmd is available from the PLUMED

website. With this tool one can visualize the results from from reconnaissance

metadynamics simulation.

3.13.2 Controlling the clustering

As discussed in reference [19] in reconnaissance metadynamics one periodi-

cally analyses the trajectory in order to obtain collective coordinates to bias.

The frequency of these analyses are controlled by the CLUSTER keyword. The

line containing this keyword should also contain an integer, which controls

the frequency at which the values of the cvs are stored for subsequent cluster

analysis and a integer than controls the frequency of clustering. The ﬁrst

of these quantities should be proceeded by STORE FREQ and the second by

RUN FREQ. One may cluster over multiple timescales separately by including

this line multiple times.

Example.

The following lines tell PLUMED to run cluster analyses every 10000 steps and every 100000 steps

RECONNAISSANCE

CLUSTER RUN FREQ 100000 STORE FREQ 100

CLUSTER RUN FREQ 1000000 STORE FREQ 1000

Once the clustering has ﬁnished PLUMED must make certain decisions on

how these basins should be incorporated into the biassing strategy. The

parameters controlling these decisions are given on the line starting with the

BASINS keyword. This line provides three parameters:

•BASIN TOL - the fraction of the trajectory that must be within a cluster

in order for it to be used for biassing.

•INITIAL SIZE - the initial size of the basin

•EXPAND PARAM - the parameter that controls the basin expansion

Guidance on how to set the value of BASIN TOL is provided in the sup-

plementary information of reference [19]. The supplementary information

also shows how, when the angular variables of a multivariate Gaussian are

integrated out, the resulting probability distribution as a function of r(the

distance from the center of the basin in the metric of the basin) is equal to a

Gaussian with standard deviation 1

√2centered at √d−1. With this in mind

the initial size of basins is set equal to √d−1 + the value of the initial size

parameter. Therefore a sensible value for this parameter is around 1.5 - i.e.

approximately two standard deviations.

The ﬁnal parameter described above EXPAND PARAM controls the way basins

change size as a function of time. As discussed in the supplementary informa-

tion of reference [19] we introduce a probabilistic criterion for basin expansion

that is derived using ideas from the theory of diﬀusion. Since writing that

paper we have slightly changed the way that this parameter functions. In

particular we realized that the diﬀusion constant in a given basin could be

also be ﬁt from the data using:

D=1

d∆t(M−1)

i=2 |si−si−1|2(3.11)

where Mis the number of vectors used for clustering, dis the dimension-

ality, ∆tis the STORE FREQ parameter and the squared diﬀerences between

vectors of CVs in the sum are calculated in the metric of the basin. The

EXPAND PARAM provided in the input should be a number between 0 and 1.

It is used to scale this estimate value and expresses the level of conﬁdence a

user has in the ﬁtting. In practice this parameter gives some control over the

rate of exploration of phase space. If it is large phase space will be explored

rapidly, while if it is small phase space will be explored more slowly.

3.13.3 Controlling the bias

The bias in reconnaissance metadynamics consists of a number of Gaussian

functions that are periodically added to the potential. These Gaussians are

diﬀerent from those added in normal metadynamics because they act added

on the distance from the center of a basin. Hence, the keyword that controls

the hill addition in reconnaissance metadynamics is ONIONS and NOT HILLS.

However, the syntax for ONIONS is the same as the syntax for HILLS. That is

to say one must provide:

•HEIGHT - The height of the Gaussian functions.

•W STRIDE - The frequency with which to attempt hill addition.

•WIDTH - The width of the Gaussian functions.

Once again guidelines as to how to set these parameters are provided in

the supplementary information of reference [19].

Bringing all of the above together a typical input ﬁle for a reconnaissance

metadynamics simulation should look like this:

Example.

Typical input for a reconnaissance metadynamics simulation

RECONNAISSANCE

ONIONS HEIGHT 1.0 W STRIDE 1000 WIDTH 1.5

BASINS BASIN TOL 0.2 EXPAND PARAM 0.3 INITIAL SIZE 3.0

CLUSTER RUN FREQ 500000 STORE FREQ 250

TORSION LIST 13 15 17 19

TORSION LIST 15 17 19 21

3.13.4 Restarting a simulation

To restart a reconnaissance metadynamics simulation, the ﬂag RESTART must

be added on the line containing the directive RECONNAISSANCE. This allows

one to restart the simulation after an interruption or after a run has termi-

nated. At the start of the simulation the BASINS,ONIONS and CLUSTER DATA.*

ﬁles are read in so the bias can be restarted. Data from the restarted simula-

tion will then be appended to the BASINS and ONIONS ﬁles. Please note that

the RESTART ﬂag is a directive to PLUMED to restart the reconnaissance meta-

dynamics simulation. It not an instruction to restart the MD run. Hence, to

restart the simulation one must also follow the usual procedure for restarting

the MD run. This will depend on the particular MD engine you are using -

details can be found in the documentation for the particular MD code you

are using.

Example.

A sample input ﬁle for restarting a reconnaissance metadynamics simulation

RECONNAISSANCE RESTART

ONIONS HEIGHT 1.0 W STRIDE 1000 WIDTH 1.5

BASINS BASIN TOL 0.2 EXPAND PARAM 0.3 INITIAL SIZE 3.0

CLUSTER RUN FREQ 500000 STORE FREQ 250

TORSION LIST 13 15 17 19

TORSION LIST 15 17 19 21

3.13.5 Using a subset of the deﬁned cvs

By default reconnaissance metadynamics is performed using all the collective

coordinates deﬁned in the PLUMED input ﬁle. However, there may be occasions

where it is desirable to include some constraints to prevent exploration of the

uninteresting parts of phase space. In these cases it would also be desirable to

not include these cvs in the reconnaissance metadynamics. Consequentially,

a keyword exists which allows one to explicitly specify which of the CVs are

to be used in the reconnaissance metadynamics. This keyword is CV LIST

and it should be added to the line containing the directive RECONNAISSANCE.

It should be followed by a tag which gives the name of a list of the cvs to

be used for reconnaissance. This list should be speciﬁed using the syntax

described in section 4.

Example.

A sample input ﬁle for a reconnaissance metadynamics simulation using only three of the four collective

coordinates speciﬁed in the PLUMED input ﬁle

RECONNAISSANCE CV LIST <cvlist>

ONIONS HEIGHT 1.0 W STRIDE 1000 WIDTH 1.5

BASINS BASIN TOL 0.2 EXPAND PARAM 0.3 INITIAL SIZE 3.0

CLUSTER RUN FREQ 500000 STORE FREQ 250

TORSION LIST 13 15 17 19

TORSION LIST 15 17 19 21

TORSION LIST 17 19 21 23

TORSION LIST 19 21 23 25

cvlist->

1 2 3

cvlist<-

3.14 Driven Adiabatic Free Energy Dynam-

ics (d-AFED)

The driven adiabatic free energy (d-AFED) algorithm [34] is a variant of

the earlier AFED method [35, 36], which required cumbersome coordinates

transformations. In the d-AFED method, an extra dynamical variable Sis

coupled to a collective variable s(r), where rrepresents the coordinates of

a number of atoms in the system. The coupling is mediated by a potential

energy function with harmonic constant κ,

V(S, s(r)) = 1

2κ(S−s(r))2.(3.12)

The dynamics of the Smeta-variable is adiabatically decoupled from the

dynamics of the underlying physical system by choosing a large mass mS

¯m, where ¯mis a typical mass of the physical system. Thanks to the adiabatic

separation, a temperature TS> T can be assigned to the Smeta-variable.

With this choice of msand Ts, the physical system will evolve fast at room

temperature Taround the instantaneous value of s(r) = S. On the other

hand, Swill evolve slowly, but have a temperature large enough to drive the

system over high free energy barriers.

In the limit of κ→ ∞, it can be shown that the free energy surface

at temperature Tcan be recovered from the density ρadb(S) sampled at

temperature TSduring the adiabatic d-AFED simulation using

∆G(S) = −kBTSlog ρadb(S).(3.13)

This result generalizes well to the case where more than one collective variable

is used and ∆G(S) is a multi-dimensional free energy surface. Note that

the d-AFED method is similar to the Temperature Accelerated Molecular

Dynamics (TAMD) method devised by other authors[37].

The d-AFED method requires very eﬃcient thermostatting of the meta-

variable S. Therefore, in the present implementation, Sis coupled to a

Generalized Gaussian Moment Thermostat (GGMT) [38]. The meta-variable

is coupled to two thermostatting variables pηand pζ, with associated masses

Qηand Qζ, respectively. Given a typical time scale τof the thermostated

system, optimal masses are Qη=kBTSτ2and Qζ=8

3(kBTS)3τ2. The order-2

GGMT dynamics for one degree of freedom is

˙pS=V(S, s(r)) −pη

Qη

pS−pζ

Qζ"kBTS+1

mS#pS,(3.14)

˙pη=p2

mS−kBTS,(3.15)

˙pζ=1

3"p2

mS#2

−(kBTS)2,(3.16)

S=pS

,˙η=pη

Qη

,˙

ζ=pζ

Qζ

.(3.17)

If multiple reaction coordinates are used, one separate GGMT thermostat

is associated to each of them. The implemented integrator for the dynamics

above is based on a Trotter decomposition of the corresponding Liouville

operator [34]. The quality of the integration can be monitored using the

quantity HS, which would be conserved if the dynamics of Swas decoupled

from the physical system,

HS(S, pS, η, pη, ζ, pζ) = p2

2mS

+V(S)+ p2

2Qη

+p2

2Qζ

+kBTS(η+ζ) (3.18)

The heat transfer WSfrom the meta-variable to the physical system can be

calculated,

WS=Zt

0dt0κ(S−s(r)) pS

(3.19)

The eﬀective adiabaticity of the coupling can thus be asserted. In addition,

for each collective variable, the quantity HS−WSshould be strictly conserved

and provides a quality check for the simulation.

3.14.1 Input for d-AFED

For each CV, a DAFED directive is used to deﬁne the parameters of the cor-

responding dynamics. On the same line, the number of the CV to which the

directive applies is speciﬁed after the keyword CV. The temperature TSin

K is given after keyword TEMPERATURE. The thermostat time constant τis

given in ps after keyword TAUTHERMO. The mass mSand harmonic constant

κ, are given after the keywords MASS and KAPPA, respectively. The units of κ

and mSdepend on the nature of the CV. They should always be such that

κS2and m˙

S2are both in units of energy (kJ/mol= amu nm2/ ps2), see the

example below.

In addition, tow optional keywords can be used with the DAFED directive.

First, for periodic CVs such as torsion angles, the Svariable should also

evolve on a periodic interval. This is speciﬁed by the keyword PERIODIC,

followed by two numbers for the lower and upper bounds. The numbers can

be replaced by MINUS PI,PLUS PI, or PLUS 2PI to specify −π, +π, or +2π,

respectively.

The optional keyword JACOBIAN FORCE causes a bias force F=−2kBT/S

to be applied to the dynamics of S. This is useful with distance CVs in order

to counterbalance the eﬀect of the Jacobian factor and sample a more uniform

distribution along the CV.

Example.

The following lines couple CV 1 (a distance in nm) to a meta-variable of mass 105amu with a harmonic

constant of 106kJ/mol/nm2and CV 2 (a unitless number) to a meta-variable with mass 103amu*nm2

with a harmonic constant of 104kJ/mol. For both CV, the d-AFED temperature is 600 K and the GGMT

thermostat time constant is 0.2 ps. See text for the optional keywords JACOBIAN FORCE and PERIODIC.

DISTANCE LIST 1 34

TORSION LIST 5 15 29 36

DAFED CV 1 TEMPERATURE 600 MASS 1e5 KAPPA 1e6 TAUTHERMO 0.2 JACOBIAN FORCE

DAFED CV 2 TEMPERATURE 600 MASS 1e3 KAPPA 1e4 TAUTHERMO 0.2 PERIODIC MINUS PI PLUS PI

DAFED CONTROL RESTART checkpoint file WRITE STATE -1 N RESPA 1

PRINT W STRIDE 100

ENDMETA

A separate DAFED CONTROL directive contains general controls for the d-

AFED simulation. The d-AFED dynamics, including all variables described

in Eqs. (3.14 - 3.17) can be restarted exactly from a previous run using a

checkpoint ﬁle. Following the WRITE STATE keyword appears the number of

steps after which a checkpoint ﬁle is saved. A value of −1 implies that a

checkpoint ﬁle is written only when GROMACS saves its own checkpoint

ﬁle, i.e. at regular wall clock time intervals. The checkpoint ﬁle is saved in

the current directory with default name DAFED STATE. The optional keyword

RESTART is used to specify the path to the checkpoint ﬁle from which to

restart.

3.14.2 Typical output for d-AFED

WIth the d-AFED method, the COLVAR ﬁle will contain the following data,

if dcollective variables are used:

•time step

•value of the collective variable s1(r)...sd(r)

Then for each of the Sj, j = 1...d, appears a set of 4 columns with :

•the meta-variable S

•the instantaneous temperature of Sin K

•the conserved quantity HS, see Eq. 3.18, in kJ/mol

•the work WSfrom Sto the physical system, see Eq. 3.19, in kJ/mol

These ﬁelds are labeled sj,T sj,E sj, and W sj, respectively, in the

COLVAR header line, j= 1...d. Additional collective variables can be moni-

tored during a d-AFED run, in which case more columns will appear before

the ﬁrst d-AFED keyword.

Note that for each extended variable Sj, the quantity HSj+WSjshould

be conserved, j= 1...d. Let Hbe the pseudo-energy of the physical system

including the associated thermostats and barostats. Then the total energy

of the simulation, H+Pd

j=1 HSj, should be conserved as well. In addition,

considering the physical system only, the quantity H − Pd

j=1 WSjshould be

conserved.

Chapter 4

Collective variables

PLUMED contains implementations of a large number of CVs so one can prop-

erly describe the processes involved in a wide variety of interesting problems.

In the following chapter we describe all the CVs, which are implemented in

PLUMED together with their analytic ﬁrst derivative.

In general to instruct PLUMED to use a particular CV a line starting with

the keyword indicating the CV type must be included in the input ﬁle. This

keyword should then be followed by the various pieces of CV-speciﬁc infor-

mation required to calculate the variable along with the keywords that tell

PLUMED how this particular CV is to be employed. (N.B. unless stated ex-

plicitly these pieces of data can be speciﬁed in any order)

For metadynamics the lines deﬁning the variables on which the user desires

there to be a biasing potential should contain the SIGMA keyword. This

keyword should then be followed by the width of the Gaussian hills (in the

units of the CV) on that particular CV. This keyword serves two functions;

namely, it instructs PLUMED to use metadynamics and tells it the widths of

the hills. Obviously the SIGMA keyword is not only required if you are running

metadynamics and is not required with other methods.

Specifying lists of atoms

Most collective variables require the user to specify one or several groups of

atoms in their deﬁnition. Whenever a set of atom groups is required, the LIST

keyword must be used. This keyword is then followed a set of tags which

specify the number and order of the groups of atoms to be employed. The

atoms involved in each of the groups invoked must be speciﬁed somewhere in

the input ﬁle. These group speciﬁcations must then start with the the name

of the group followed by the -> sign and ﬁnish with the same name followed

by the <- sign. Between these two delimiters the indices of the atoms which

comprise the group must then be listed, separated by spaces or line feeds.

Example.

The following syntax instructs PLUMED to use the distance between the center of mass of atoms 6 and 10

and the center of mass of atoms 8, 15 and 21 as a CV:

DISTANCE LIST <g1> <g2> SIGMA 1.0

g1->

6 10

g1<-

g2->

8 15 21

g2<-

Inside the group deﬁnition blocks, one can either specify the atom num-

bers explicitly, or one can use the LOOP keyword to deﬁne a regular sequence

of indices with a given starting number, end number and stride.

Example.

The following two commands are equivalent deﬁnitions of the group g1:

g1->

10 12 14 16 18 20

g1<-

g1->

LOOP 10 20 2

g1<-

For the special, and rather common, case of a group composed of a single

atom the user can specify the number of the atom of interest rather than the

corresponding <g> tag.

Example.

The following two commands are equivalent ways instruct PLUMED to use the distance between atom 5

and group <g1> as a CV:

DISTANCE LIST <g1> <g2> SIGMA 1.0

g1->

10 12 14 16 18 20

g1<-

g2->

g2<-

DISTANCE LIST <g1> 5 SIGMA 1.0

g1->

10 12 14 16 18 20

g1<-

4.1 Absolute position

The POSITION keyword instructs PLUMED to use the absolute position of an

atom or a group of atoms, speciﬁed by using the LIST keyword, as a CV.

This CV accepts several options that allow the user to restrict the bias to

a given direction, e.g. z, to bias the position of the particle as projected on

a selected line segment or, in analogy with the path CV, to bias the atoms’

distance from a line segment. The keyword DIR accepts as input X, Y or Z

and limits the restraint to the chosen direction.

Example.

The following line instructs PLUMED to use the ycoordinate of atom 13 as a CV.

POSITION LIST 13 SIGMA 0.35 DIR Y

The keyword LINE POS instructs PLUMED to use the projection of the

atoms position on a line as a CV, while the keyword LINE DIST instructs

PLUMED to use the distance from the line as a CV. In both these cases the

line is deﬁned by stating its start and end points. The line can then either

be in constrained to be in the XY, XZ or YZ planes (keywords (XY, XZ or

YZ respectively) or it can have an arbitrary orientation in space (keyword

XYZ). Obviously if the line is constrained to the XY, XZ or YZ planes then

the start and end points are two dimensional vectors whereas if it has an

arbitrary orientation these vectors have three components.

Example.

The following lines instruct PLUMED to use the projection and distance of the coordinates of atom 13 on

a generic line segment deﬁned by the start and end points (0,0,0) and (2,3,4) respectively as the two

collective coordinates.

POSITION LIST 13 SIGMA 0.35 LINE POS XYZ 0. 0. 0. 2. 3. 4.

POSITION LIST 13 SIGMA 0.35 LINE DIST XYZ 0. 0. 0. 2. 3. 4.

4.2 Distance

The DISTANCE keyword instructs PLUMED to use the distance between the

center of mass of two groups of atoms as a CV. Two groups must be deﬁned

using the LIST keyword and the syntax described in section 4.

Example.

The following lines instruct PLUMED to use the distance between atom number 13 and the center of mass

of the four atoms in list <g1> as a CV.

DISTANCE LIST 13 <g1> SIGMA 0.35

g1->

17 20 22 30

g1<-

The optional ﬂag NOPBC can be used to calculate the distance without

applying periodic boundary conditions. This should be done only if all the

atoms in the groups are part of the same molecule. See also Sec. 4.28.

The keyword DIR can be used to calculate the component of this distance

along the cartesian axes (X, Y or Z) and or the component in the planes

(XY, XZ or YZ).

Example.

The following line instruct PLUMED to use the X-component of the distance between atom number 20 and

25 as a CV.

DISTANCE LIST 20 25 DIR X SIGMA 0.35

Whenever one wants to use the diﬀerence between two distances one may

use the keyword DIFFDIST. This may turn to be useful in bond breaking/

bond formation.

Example.

The following line instruct PLUMED to use the difference between the distance between 20 and 25 and the

distance between 30 and 31 as CV.

DISTANCE LIST 20 25 DIFFDIST 30 31

Groups are also accepted as input instead of two atoms.

Other two useful variants are the following: the distance of a point from

an axis and the projection of the point on the axis. The ﬁrst is introduced by

the additional keyword POINT FROM AXIS followed by the atom or the group

deﬁning the point respect to which the distance has to be calculated. The

axis is deﬁned through the standard two groups appearing in the deﬁnition

of the distance collective variable.

Example.

The following line instruct PLUMED to use the distance between one atom, 26 and the axis deﬁned by the

two atoms 20 and 25 as CV.

DISTANCE LIST 20 25 POINT FROM AXIS 26

In a similar way it is possible to calculate the projection of this point on

an axis to be used as a CV by using the keyword PROJ ON AXIS.

Example.

The following line instruct PLUMED to use the projection of the coordinate if atom 26 on the axis deﬁned by

the two atoms 20 and 25 as CV.

DISTANCE LIST 20 25 PROJ ON AXIS 26

Similarly to all the other variables, these two keywords may accept groups

instead of atom indexes.

4.3 Minimum distance

The MINDIST keyword instructs PLUMED to use the minimum distance between

two groups of atoms as a CV. To ensure diﬀerentiability, this quantity is

implemented as:

s=β

log Pij exp(β/||rij||),

where by default β= 500. The value of βcan however be tuned if needed

by using the optional keyword BETA. Much like the distance variable when

calculating the minimum distance one must deﬁne two groups using the LIST

keyword and the syntax described in section 4.

Example.

The following lines instruct PLUMED to use the minimum distance between atom number 13 and the set of

atoms in list <g1> as a CV.

MINDIST LIST 13 <g1> SIGMA 0.35 BETA 500.

g1->

17 20 22 30

g1<-

The optional ﬂag NOPBC can be used to calculate the distance without

applying periodic boundary conditions. This should be only be done if all

the atoms in the groups are part of the same molecule. See also Sec. 4.28.

4.4 Angles

The ANGLE keyword instructs PLUMED to use the angle deﬁned by the centers

of mass of three groups of atoms as a CV. The compulsory LIST keyword

must be followed by three properly deﬁned groups (see Section 4).

Example.

The following lines instruct PLUMED to use the angle deﬁned by atom number 102 and the centers of mass

of the atoms in groups g1 and g2 as a CV.

ANGLE LIST <g1> <g2> 102 SIGMA 0.05

g1->

13 15

g1<-

g2->

LOOP 1000 3000 3

g2<-

It is also possible to use the sine or cosine of the angle as a collective

coordinate by including the SIN or COS keywords respectively similarly to

what is done for TORSION.

4.5 Torsion

The TORSION keyword instructs PLUMED to use a dihedral angle as the CV.

This angle can either be deﬁned by four atoms or, more generally, by the

positions of the centers of mass of four groups of atoms. The compulsory

LIST keyword must be followed by four properly deﬁned groups (see Section

4).

Example.

The following lines instruct PLUMED to use to the torsion angle about the centers of mass of the four groups

<g1>,<g2>,<g3>,<g4> as a CV.

TORSION LIST <g1> <g2> <g3> <g4> SIGMA 0.35

It is also possible to use the sine or cosine of the torsional angle as a

collective coordinate by including the SIN or COS keywords respectively.

Example.

The following line instructs PLUMED to use the cosine of the torsion angle about the centers of mass of the

four groups <g1>,<g2>,<g3>,<g4> as a CV.

TORSION LIST <g1> <g2> <g3> <g4> COS SIGMA 0.35

4.6 Coordination number

The COORD keyword instructs PLUMED to use the total number of contacts be-

tween the atoms in group G1and those in group G2- the coordination number

between these two groups. To ensure diﬀerentiability, this is implemented as

the sum:

s=X

i∈G1X

j∈G2

sij,

where this sum is extended to all pairs of atoms with i∈ G1and j∈ G2. The

individual contributions sij are deﬁned using a switching function, which, in

the present case, is given by:

sij =









1 for rij ≤0

1−(rij

r0)n

1−(rij

r0)mfor rij >0

where rij =|ri−rj|−d0. The user must supply the r0,d0,nand mparame-

ters, using the additional keywords R 0,D 0,NN and MM respectively and thus

has a great deal of control over the deﬁnition of the switching function. In

general a good ﬁrst guess for these parameters can be achieved by looking at

the pair distribution function and setting d0equal to the position of the ﬁrst

peak in the pair distribution function, r0as the full width at half maximum

of the peak and nand mto force sij '0 at the ﬁrst minimum of the pair

distribution function. However, oftentimes diﬀerent choices for these param-

eters will lead to better results because of certain speciﬁc properties of the

system of interest. An optional keyword PAIR treats the atoms in a pairwise

fashion so that instead of simply counting the number of bonds between two

groups of atoms one can deﬁne which precise bonds between the two groups

should be monitored. In this case the groups, <g1> and <g2> must have

the same number of atoms as the switching functions are on the distances

between the ith atom of group <g1> and the ith atom of group <g2>.

Example.

The following lines instruct PLUMED to use the coordination of the atoms in group g1 – 13 and 15 – with

the atoms in group solvent as the CV.

COORD LIST <g1> <solvent> NN 6 MM 12 D 0 2.5 R 0 0.5 SIGMA 0.35

g1->

13 15

g1<-

solvent->

LOOP 1000 3000 3

solvent<-

The optional ﬂag NOPBC can be used to calculate the distance without

applying periodic boundary conditions. This should only be done when all

the atoms are part of the same molecule. See also Sec. 4.28.

4.7 Hydrogen bonds

HBONDS is the keyword for a variable that counts the number of intra-molecular

hydrogen bonds between a group of hydrogen bond donors and a group of

hydrogen bond acceptors. This is deﬁned as:

s=X

1−(dij

r0)n

1−(dij

r0)m,

where i∈ D is the group of donors and j∈ A are the acceptors.

The two groups must be deﬁned using the compulsory LIST keyword fol-

lowed by two groups (see Section 4). PLUMED then assumes that there is only

one donor/acceptor per residue and, that within the list, the donor/acceptor

atoms on neighboring residues are consecutive. The values of r0,nand m

can be speciﬁed using the R 0,NN and MM keywords. If no value is given for

r0,nand mthe default values of r0= 2.5, n= 6 and m= 12 are assumed.

The TYPE keyword selects which residues to include in the count:

•With TYPE 0, all donor-acceptor pairs are included;

•If TYPE 1 is speciﬁed only those donor-acceptor pairs separated by an

odd number of residues greater than 4 are counted. This allows one to

monitor parallel β-sheet formations.

•If TYPE 2 is speciﬁed only those donor-acceptor pairs that are separated

by exactly 4 residues are included. This allows the formation of α-

helical conformations (αtype) to be monitored;

•If TYPE 3 is speciﬁed, only those donor-acceptor pairs that are sepa-

rated by an even number of residues greater than 4 are counted. This

allows anti-parallel β-sheet formations (β-even type) to be monitored.

•If TYPE 4 is speciﬁed, only the ﬁrst donor and the ﬁrst acceptor and so

on are counted. This allows to monitor a set of native hydrogen bonds.

•If TYPE 5 is speciﬁed, only hydrogen bonds between atoms belonging to

diﬀerent residues are counted. Moreover, pairs with an index diﬀerence

less than 5 are also discarded, so as to avoid counting H and a O that

are in the same peptide group (NH-C=O) (contributed by M. Cuendet).

This option requires the user to specify to which residue each atom is

belonging using the RESLIST keyword (see example below).

Example.

The following lines instruct PLUMED to use the count of the total number of hydrogen bonds between the

pairs in groups Hand Oas a CV. The default switching function with r0= 2.5,n= 6 and m= 12 is

implied.

HBONDS LIST <H> <O> TYPE 0 SIGMA 0.1

H->

6 10

H<-

O->

8 12

O<-

In the following example, a modiﬁed switching function is employed.

HBONDS LIST <H> <O> TYPE 0 SIGMA 0.1 NN 8 MM 20 R 0 2.5

In the following example, inter-residue hydrogen bond are counted

HBONDS LIST <H> <O> RESLIST <resH> <resO> TYPE 5

H-> 1 9 17 H<-

O-> 5 13 21 O<-

resH-> 1 2 3 resH<-

resO-> 1 2 3 resO<-

The optional ﬂag NOPBC can be used to calculate the distance without

applying periodic boundary conditions. This should be done only when the

atoms are part of the same molecule. See also Sec. 4.28.

4.8 Interfacial water

WATERBRIDGE is the keyword for a CV that counts the number of interfacial

contacts. This variable does this by calculating the number of atoms from

group G0that are simultaneously in contact with atoms from both groups

G1and G2. A typical application of this CV is to count the number of water

molecules at the interface of two surfaces. This is calculated using:

sWatBr =

i



1−(|ri−rj|

r0)n

1−(|ri−rj|

r0)m





1−(|ri−rj|

r0)n

1−(|ri−rj|

r0)m

.

The syntax of the command requires the user to specify three groups of atoms

after the keyword LIST starting with the G1and G2groups and closing with

the G0group. (see Section 4). The parameters of the switching function are

then deﬁned using the usual NN,MM and R 0 keywords.

Example.

The following command instructs PLUMED to use the number of atoms in the solvent group that are

simultaneously in contact with either atom 6 or 10 and either atom 8, 15 or 21 as a CV.

WATERBRIDGE LIST <type1> <type2> <solvent> NN 8 MM 12 R 0 4.0 SIGMA 0.1

type1->

6 10

type1<-

type2->

8 15 21

type2<-

solvent->

LOOP 100 1000 3

solvent<-

4.9 Radius of gyration

One can employ the radius of gyration of a group of atoms deﬁned with the

compulsory additional keyword LIST by using the RGYR directive. The LIST

keyword (see Section 4) must be followed by only one properly deﬁned group.

This CV is calculated using :

sGyr =Pn

i|ri−rCOM|2

imi1/2,

where the sums are over the natoms in group Gand the center of mass is

deﬁned using:

rCOM =Pn

irimi

imi

Example.

The following lines instruct PLUMED to use the radius of gyration of the group <g1> as a CV.

RGYR LIST <g1> SIGMA 0.35

N.B. The radius of gyration is calculated without applying periodic bound-

ary conditions so the atoms in group <g1> should all be part of the same

molecule. See also Sec. 4.28.

4.9.1 Gyration tensor based CVs

The RGYR directive can be used also to access to a number of CVs based on

gyration tensor [39]:

S=1

N



Px2

iPxiyiPxizi

PxiyiPy2

iPyizi

PxiziPyiziPz2





,(4.1)

which describes the spatial distribution of mass in a molecule or com-

plex of molecules. Alternatively, tensor of inertia may be used for the same

purpose:

I=



Pmi(y2

i+z2

i)P−mixiyiP−mixizi

P−mixiyiPmi(x2

i+z2

i)P−mixizi

P−mixiziP−miyiziPmi(x2

i+y2

i)



.(4.2)

The weighting of atomic contribution is controlled by keyword MASS-WEIGHTED

(default) and NO-WEIGHTED. When MASS-WEIGHTED directive is applied, the

individual atomic contribution to the gyration tensor is weighted by miN

Pmi

and the center of mass is used as origin of the system. If NO-WEIGHTED op-

tion is chosen, coordinates are related to the geometrical center of object and

no mass-weighting is performed in calculation of CV.

Diagonalizing of gyration tensor provides the principal moments of gyra-

tion - S1,S2,S3and three eigenvectors corresponding to the principal axes of

inertia. The individual gyration tensor based CVs are available by specifying

the directive RGYR and the keyword TYPE followed by one of the following:

•TRACE: trace of inertia tensor

•GTPC 1: the largest eigenvalue of gyration tensor. The square root

1=√S1of principal moment S1is used as CV. If we approximate

the shape and mass distribution of the object with an ellipsoid, S0

corresponds to the largest radius of this ellipsoid.

•GTPC 2: the middle principal moment of gyration tensor, S0

2=√S2.

•GTPC 3: the smallest principal moment of gyration tensor, S0

3=√S3.

•RGYR 1: the largest radius of gyration around principal axes of inertia,

rg1=√S1+S2.

•RGYR 2: the middle radius of gyration around principal axes of inertia,

rg2=√S1+S3.

•RGYR 3: the smallest radius of gyration around principal axes of inertia,

rg3=√S2+S3.

•ASPHERICITY: the deviation of mass distribution from spherical sym-

metry. The modiﬁed version of asphericity was implemented as a

CV: b0=√b, where original Suter’s [40] asphericity is given by b=

S1−1

2(S2+S3).

•ACYLINDRICITY: the deviation from cylindrical symmetry. Again the

modiﬁed version c0=√cis used as a CV rather then original form [40]

c=S2−S3.

•KAPPA2: the relative shape anisotropy[40] κ2= 1 −3S1S2+S1S3+S2S3

(S1+S2+S3)2re-

ﬂects both symmetry and dimensionality of an object. Limited between

values of 0 and 1, κ2reaches 1 for an ideal linear arrangement and drops

to zero in case of highly symmetric conﬁgurations (at least tetrahedral

symmetry).

Example.

The following lines instruct PLUMED to use the trace of the inertia tensor of the group <g1> as a CV.

RGYR TYPE INERTIA LIST <g1> SIGMA 0.35

Example.

The following lines instruct PLUMED to use the asphericity of the group <g1> as a CV.

RGYR TYPE ASPHERICITY LIST <g1> SIGMA 0.35

4.10 Dipole

DIPOLE instructs PLUMED to use the electrical dipole generated by a group of

atoms as a CV:

sdipole =|

riqi|.

Only the LIST keyword followed by one properly deﬁned group is required

to deﬁne this CV (see Section 4).

Example.

The following lines instruct PLUMED to use the dipole of the group <g1> as a CV.

DIPOLE LIST <g1> SIGMA 0.35

4.11 Dihedral correlation

DIHCOR is the keyword for a CV that measures the similarity between adjacent

dihedral angles:

sDC =

i=2

2(1 + cos (φi−φi−1)) .

The syntax for this CV requires the user to specify the number of dihedrals

NDand, in the subsequent NDlines, the indices of the four atoms deﬁning

each dihedral φi.

Example.

The following lines instruct PLUMED to use the dihedral correlation for the 3 dihedrals listed as a CV.

DIHCOR NDIH 3 SIGMA 0.1

168 170 172 188

170 172 188 190

172 188 190 197

4.12 Alpha-beta similarity

ALPHABETA is a keyword for a CV that measures the similarity of dihedral

angles to a reference value (see also ??). It is calculated using:

sαβ =1

i=1 1 + cos φi−φRef

i.

The syntax for this CV requires the user to specify the number of dihedrals

NDand, in the subsequent NDlines, the indices of the four atoms deﬁning

each dihedral followed by the value of the dihedral in the reference confor-

mation φRef

Example.

The following lines instruct PLUMED to use the Alpha-beta similarity of three dihedrals as a CV.

ALPHABETA NDIH 3 SIGMA 0.1

168 170 172 188 3.14

170 172 188 190 .56

188 190 192 230 3.14

4.13 Alpharmsd

ALPHARMSD is the keyword for a CV that counts the number of 6-residue

segments in the protein chain that resemble an ideal alpha helix (i.e. the

average experimental structure). This CV is calculated using:

sαrmsd =X

nhRMSD {Ri}i∈Ωα,nR0oi,

where nis the coordination (switching) function deﬁned in Sec. 4.6. The sum

over αin the above runs over all the 6-residue segments in the protein chain,

where each of the residues is deﬁned based on the positions of the backbone

atoms N, CA, C, O and CB, Ωα. The distance used in the switching function

is then the root mean square diﬀerence between the distance matrix of the

atoms in the set Ωαwith the corresponding distances between these atoms

in the ideal alpha helix {R0}:

RMSD {Ri}i∈Ωα,nR0o=v

Npairs X

i,j∈Ωαdij −d0

ij2.

See Ref.[41] for more details (although note that in PLUMED the RMSD be-

tween distance matrices is used rather than the cartesian RMSD. However,

these two measures are essentially equivalent).

To use this CV the syntax requires the user to specify the coordination

function parameters R0,D 0,NN and MM (see Sec. 4.6) (we suggest val-

ues of R 0=0.8 Angstrom, NN=8, MM=12, D 0=0), and a conversion factor

ANGSTROM SCALE which converts the ideal alpha positions from Angstrom to

the length units of the MD code (e.g. in Gromacs units are nanometers

therefore ANGSTROM SCALE is 0.1). In addition a list of list of atom indices for

the N, CA, C, O and CB (in this order) for each residue must be provided

for all the consecutive residues (in ascending order) which form the chain.

For those residues, such as glycine, which do not have a CB the atom index

for the corresponding hydrogen should be used.

Important note: for those MD codes which, like Gromacs4, do not keep

the protein whole but instead split it by the PBC, ALPHARMSD requires the

additional option NOPBC, together with the ALIGN ATOMS command for all the

atoms employed in the ALPHARMSD.

Example.

The following lines deﬁne an Alpharmsd CV for all 16 consecutive residues of a protein in the MD code

Gromacs4 (which uses nanometers as the length unit).

ALPHARMSD LIST <ncacocb> SIGMA 0.5 R 0 0.08 NN 8 MM 12 ANGSTROM SCALE 0.1 NOPBC

ncacocb->

1 5 23 24 7

25 27 42 43 29

44 54 56 57 51

58 68 70 71 65

72 74 77 78 76

79 81 101 102 83

103 105 116 117 107

118 120 138 139 122

140 142 162 163 144

164 166 179 180 168

181 183 190 191 185

192 194 214 215 196

216 218 225 226 220

227 229 236 237 231

238 240 243 244 242

245 247 267 268 249

ncacocb->

4.14 Antibetarmsd

ANTIBETARMSD counts the number of pairs of 3-residue segments in the pro-

tein chain which are similar to the ideal antiparallel beta (i.e. the average

experimental structure). See Ref.[41] for details (but remember that here the

RMSD among distance matrices is used instead of the cartesian RMSD as the

two are basically equivalent). The deﬁnition, input parameters and syntax

for this CV are the same as for the ALPHARMSD and again we suggest values

of R0=0.8 Angstrom, NN=8, MM=12, D 0=0 for the parameters. The only

diﬀerence in the implementation is the additional option STRANDS CUTOFF

which allows the user to specify a threshold distance beyond which pairs of

3-residue segments are considered far. This option considerably speeds up

the computation (often 1 nm is good choice for this quantity).

Important note: for those MD codes which, like Gromacs4, do not keep

the protein whole but instead split it by the PBC, ANTIBETARMSD requires

the additional option NOPBC, together with the ALIGN ATOMS command for

all the atoms employed in the ANTIBETARMSD.

Example.

The following lines deﬁne an Antibetarmsd CV for all 16 consecutive residues of a protein in the MD code

Gromacs4 (which uses nanometers as the length unit).

ANTIBETARMSD LIST <ncacocb> SIGMA 0.5 R 0 0.08 NN 8 MM 12 ANGSTROM SCALE 0.1

STRANDS CUTOFF 1. NOPBC

ncacocb->

1 5 23 24 7

25 27 42 43 29

44 54 56 57 51

58 68 70 71 65

72 74 77 78 76

79 81 101 102 83

103 105 116 117 107

118 120 138 139 122

140 142 162 163 144

164 166 179 180 168

181 183 190 191 185

192 194 214 215 196

216 218 225 226 220

227 229 236 237 231

238 240 243 244 242

245 247 267 268 249

ncacocb->

4.15 Parabetarmsd

PARABETARMSD counts the number of pairs of 3-residue segments in the protein

chain which are similar to the ideal parallel beta (i.e. the average experimen-

tal structure). See Ref.[41] for details (but remember that here the RMSD

among distance matrices is used instead of the cartesian RMSD as the two are

basically equivalent). The deﬁnition, input parameters, and syntax are the

same as for theALPHARMSD and again we suggest values of R0=0.8 Angstrom,

NN=8, MM=12, D 0=0 for the parameters. The only diﬀerence in the imple-

mentation is the additional option STRANDS CUTOFF which allows the user to

specify a threshold distance beyond which pairs of 3-residue segments are

considered far. This option considerably speeds up the computation (often

1 nm is good choice for this quantity).

Important note: for those MD codes which, like Gromacs4, do not keep

the protein whole but instead split it by the PBC, PARABETARMSD requires

the additional option NOPBC, together with the ALIGN ATOMS command for

all the atoms employed in the PARABETARMSD.

Example.

The following lines deﬁne a Parabetarmsd CV for all 16 consecutive residues of a protein in the MD code

Gromacs4 (which uses nanometers as the length unit).

PARABETARMSD LIST <ncacocb> SIGMA 0.5 R 0 0.08 NN 8 MM 12 ANGSTROM SCALE 0.1

STRANDS CUTOFF 1. NOPBC

ncacocb->

1 5 23 24 7

25 27 42 43 29

44 54 56 57 51

58 68 70 71 65

72 74 77 78 76

79 81 101 102 83

103 105 116 117 107

118 120 138 139 122

140 142 162 163 144

164 166 179 180 168

181 183 190 191 185

192 194 214 215 196

216 218 225 226 220

227 229 236 237 231

238 240 243 244 242

245 247 267 268 249

ncacocb->

4.16 Electrostatic potential

Using the ELSTPOT keyword one can instruct PLUMED to use the electrostatic

potential exerted by a group of atoms on the center of mass of a second group

of atoms (or single atom) as a CV. This CV is calculated using:

sELST =

|ri−rCOM|∗f(|ri−rCOM|, R0, CU T )

where

rCOM =PNB

irimi

PNB

imi

Here the sum in the ﬁrst equation above is over the NAatoms in the group

whose charges exert the electric potential, while in the second equation the

sum is over the NBatoms in the group that deﬁnes the point at which this

potential is felt. f(x) is a smoothing function deﬁned as:

f(x, R0, CUT ) = 









1.0x<R0

cos πx

2(CU T −R0)R0≤x≤CUT

0x > CUT

where R0is the onset and CUT is a cutoﬀ distance.

Example.

The following lines instruct PLUMED to use the electrostatic potential exerted by the atoms in group2 on the

center of mass of the atoms in group1 as a CV:

ELSTPOT LIST <group1> <group2> R 0 4.0 CUT 12.0 SIGMA 0.01

group1->

LOOP 1 8 1

group1<-

group2->

LOOP 9 16 1

group2<-

4.17 Puckering coordinates

PUCKERING refers to the set of collective coordinates for 6-membered rings in

polar coordinates [42]. Given the coordinates zj, that represent the displace-

ments of the j−th atom from the mean ring plane, three variables Q, θ, φ can

be obtained starting from the general deﬁnition for 6-membered rings

q2cos φ2=s1

j=1

zjcos 2π

3(j−1)

q2sin φ2=−s1

j=1

zjsin 2π

3(j−1)

q3=s1

j=1

(−1)j−1zj

Q=qq2

2+q2

3≥0

θ= arctan (q2/q3)∈[0, π]

φ=φ2∈[0,2π)

Each one can be selected as a TYPE of PUCKERING. The CV accept the

LIST and SIGMA keywords. As of now LIST accept only 6 atoms. The atoms

in the list have to be enumerated following the chemical sequence (although

the numbering scheme in the topology does not have to be sequential). In

order to fulﬁll IUPAC convention for sugar hexopyranose rings the ﬁrst atom

in the list has to be the ring oxygen, followed by the anomeric carbon. Qis

in general a fast degree of freedom.

Example.

The following lines deﬁne a PUCKERING CV:

PUCKERING LIST <group1> TYPE PHI SIGMA 0.1

group1->

1 2 3 4 5 6

group1<-

4.18 Path collective variables

One can instruct PLUMED to use path collective variables [43] using the S PATH

and Z PATH keywords. In this scheme one deﬁnes a path as a set of Nreference

conformations that deﬁne the path in conﬁguration space Xfrom some initial

state to some ﬁnal state. The svariable (deﬁned with S PATH) then measures

the position along the path, and is deﬁned as:

s=Z−1

i=1

ie−λd(Xi,X(t)),

where X(t) is the conﬁguration of the system at any given time, d:X ×X →

R+

0is a metric on X, and Z=PN

i=1 e−λd(Xi,X(t)) is a normalization factor in

which the prefactor, λ, should be chosen so as to have λd(Xi, Xi±1)'2.3 on

average.

By contrast the zvariable (deﬁned with Z PATH) measures the position

oﬀ the path, and is deﬁned as:

z=−λ−1log Z.

For both S PATH and Z PATH the following keywords must be used:

•TYPE, which deﬁnes the metric used to calculate distances in conﬁgu-

ration space. The following sections provide more details on this but,

suﬃce to say, currently one can use either MSD (mean square deviation

see section 4.18.1), DMSD (distance root mean squared deviation see sec-

tion 4.18.2) or CMAP (the distance between contact matrices see section

4.18.3) [44].

•NFRAMES which sets the number of reference structures, N, that are

used in the deﬁnition of the path.

•LAMBDA, the prefactor λin the exponential term of the equation that

deﬁnes both sand z.

•Optionally, you can also use NEIGHLIST which deﬁnes a neighbor list on

the closest frames to speed up the calculation followed by the number of

steps in which the list must be calculated and the number of elements

it must contain (e.g. NEIGHLIST 50 10 means that each 50 steps the

neighbour list must be calculated and it will contain the 10 closest

elements to the current molecular dynamic snapshots. All the other

are discarded up to the next neighbor list calculation.).

Other keywords are speciﬁc to the type of path variable being deﬁned

(i.e. mean square displacement in Cartesian coordinates, MSD in distances

or contact map distance).

An important change occurred from the version 1.2 to 1.3 : the conven-

tion for the names has changed the former RMSD has been replaced by MSD.

Similarly DRMS has been changed into DMSD. This was done since many users

noticed the inconsistencybetween the deﬁnitions and the related keywords.

N.B. before using this feature please ensure that the parameters in metadyn.h

(common ﬁles directory) are set properly. In particular look for the section

containing:

// path dimensions

#define MAXATOMS PATH 230

#define NMAX PATH 8

#define MAXFRAMES PATH 22

#define MAXATOMS RMSD 230

#define MAXCHARS PATH 40

// cmap

#define MAXDIM CMAP 3800

#define MAXNUM GROUP 10

#define MAXATOM GROUP 30

and ensure that MAXATOMS PATH is greater than or equal to the number

of atoms per frame involved in your path and MAXFRAMES PATH is greater

than or equal to the number of frames you are using to deﬁne your path. In

addition NMAX PATH maximum number of path variables that you can use

simultaneously. If you change any of these values you must subsequently

recompile all the MD codes in which you have implemented PLUMED in order

for your changes to take eﬀect.

4.18.1 Mean square deviation

As already brieﬂy mentioned, when using the path collective coordinates

S PATH and Z PATH, the command MSD after the TYPE keyword instructs

PLUMED to the the mean square deviation of a subset of the atoms in the

system (the displacement set B) calculated after the system has been aligned

to another subset of the atoms (the alignment set A) as the metric used in

the deﬁnition of the path. This quantity is calculated using

d(Xj, Xi) =

a=1

wT OT

(X(j)

a−Mij({z})X(i)

a)2,

where Mij({z}) is the roto-translation matrix calculated using the Kears-

ley [45] algorithm and wjis the number speciﬁed (called the displacement

parameter) for the beta column in the provided input PDB. If this is one

then wj=1 and wT OT =N(while diﬀerently wT OT =PN

iwi). The weights for

the alignment are actually taken into account when forming the alignment

matrix that compose the Mij({z}) and are denoted here with the set {z}

deﬁned by the beta column in the reference PDB.

To use path CVs the user must specify the coordinates of the atoms in

each of the reference frames of the path in a set of supplementary input ﬁles.

The user should specify the basename of these ﬁles (¡basename¿) after the

keyword FRAMESET and be aware that PLUMED then will expect to ﬁnd N

ﬁles named ¡basename¿.i.pdb, which contain the coordinates of the atoms

in both the displacement and alignment sets for the ith frame in the path.

The particular set/s each atom is involved in is speciﬁed using the the last

two numerical ﬁelds of the frameset ﬁle. Values of 1.0 and 0.0 indicate that

the atom is to be used in the alignment set only, while values of 0.0 and 1.0

indicates that the atom is to be used in the displacement set only. Values of

1.0 and 1.0 indicate that the atom is to be used in both the alignment and

displacement sets.

Version 1.1 and higher allow these alignment and displacement indicators to

be non-integer. This allows users to perform a weighted alignment in cases

where the alignment of one region of system is considered more important

and a weighted displacement evaluation in cases where the displacement of a

particular atom/s is though to be of particular importance. Be aware how-

ever that this feature can produce strange numbers that are not trivial to

interpret and is thus perhaps best left to experienced users.

N.B. PLUMED contains a hard coded limit on the number of atoms that can

be used in the alignment so, like in the previous section, users should ensure

that the value of this hard limit (MAXATOMS RMSD in metadyn.h) is suﬃciently

large for their needs prior to compilation.

The unit of distance in the PDB ﬁles is ˚

Angstrom. Engines like GROMACS,

whose internal units are diﬀerent, will perform appropriate conversions au-

tomatically.

Clearly, the atom indices in these PDB ﬁles must be the same as the indices

of the atoms they refer to in the system topology. It is therefore likely that

the atom indices in the frameset ﬁles may be non-consecutive.

Additional keywords are supported: NO ROT and NO CENTER. These two key-

words prevent the rotation and the center of mass alignment respectively

whenever one uses MSD.

Example.

The following command deﬁnes a path using MSD metrics. 2 frames are used to deﬁne the path, which

has λ= 9.0.

SPATH TYPE MSD FRAMESET frame NFRAMES 2 LAMBDA 9.0 SIGMA 0.1

Z PATH TYPE MSD FRAMESET frame NFRAMES 2 LAMBDA 9.0 SIGMA 0.1

Two PDB ﬁles must be provided: frame 1.pdb and frame 2.pdb. The last two columns of these ﬁles specify

which atoms are to be used for alignment and which are to be used to calculate the MSD.

ATOM 1 C ALA 2 -0.186 -1.490 -0.181 1.00 0.00

ATOM 2 O ALA 2 -0.926 -2.447 -0.497 1.00 1.00

ATOM 15 N ALA 2 0.756 0.780 -0.955 1.00 0.00

ATOM 17 CA ALA 2 0.634 -0.653 -1.283 1.00 1.00

ATOM 19 CB ALA 2 2.063 -1.233 -1.286 1.00 1.00

END

4.18.2 Distance mean square deviation

Instead of using the MSD in the deﬁnition of the metric for the path in the

S PATH and Z PATH CVs the command DMSD after the TYPE keyword allows

one to use a metric based on the mean square deviation of the distances

between a subset of the atoms in the system:

d(Xj, Xi) = 2

NA(NA−1)

NA−1

a=1

b=a+1

(r(j)

ab −r(i)

ab )2,

where r(j)

ab is the distance of atoms aand bin the j-th reference frame. For this

metric the coordinates of the atoms in the reference frames of the path are

speciﬁed using the keyword FRAMESET along with a set of pdb ﬁles containing

the atom coordinates. This is the same way that they are speciﬁed when the

root mean square deviation metric is used (see section 4.18.1).

4.18.3 Contact map distances

The ﬁnal choice one can employ to deﬁne the metric for the path in the

S PATH and Z PATH CVs is to use the command CMAP after the TYPE keyword.

This sets the metric for the path to be the distance between the contact

matrices for a given subset of the atoms in the system:

d(Xj, Xi) = ||D(j)

ab −D(i)

ab ||.

Given two sets of atoms a, b ∈ J, this contact matrix Dab is calculated using:

Dab(X) = θ(cab −rab)wab 1−(rab/r(0)

ab )n

ab

1−(rab/r(0)

ab )m

ab,

where θ(x) is a step function which vanishes if x < 0.

As with other variables, the parameters r(0)

ab ,nab and mab allow for a

great freedom in the deﬁnition of the switching function. What is more the

parameter cab allows one to set the values of the switching function to zero at

large separations. Finally, if the formation of particular contacts is deemed

to be of great importance the weights wab can be used to change their relative

importance.

Like the other metrics for the path collective variables to use path col-

lective variables with a contact map metric information must be provided in

supplementary ﬁles about the frames that make up the path. The syntax

requires the user to specify a ﬁle name for the indices of the atoms and the

parameters deﬁning the calculation of the contact matrix (after the keyword

INDEX) and another ﬁlename for the values of the reference matrices D(i)

ab ,

after the keyword MAP.

The index ﬁle, speciﬁed after the INDEX keyword, must contain one line for

each of the elements in the contact matrix Dab, which speciﬁed how that

particular contact should be calculated. Each of these lines should begin

with the CONTACT keyword followed by a numerical label for the contact.

The next two ﬁelds are the indices a, b of the two atoms that make up the

contact, which are followed by the values of r(0)

ab ,nab,mab,cab and wab in the

switching function.

The values ﬁle, speciﬁed after the keyword MAP, contains the values of the

reference matrices D(i)

ab used in the deﬁnition of the path. Each line in this ﬁle

corresponds to one element in the contact matrix. These lines are formatted

with the numerical label for the contact as the ﬁrst ﬁeld. This is followed by

the indices of the two atoms that make up the contact a, b and ﬁnally the

value of this particular switching function in the reference frame. Unlike the

MSD and DMSD metrics all the reference frames are placed in a single ﬁle

with each reference frame separated by the END keyword.

The optional ﬂag NOPBC can be used to calculate the distance without

applying periodic boundary conditions. This should be done only if all the

atoms in the groups are part of the same molecule. See also Sec. 4.28.

Example.

The following command deﬁnes a path using the contact map metric. 2 frames are used to deﬁne the path

with λset equal to 0.1.

SPATH TYPE CMAP NFRAMES 2 INDEX fr.ndx MAP fr.mps LAMBDA 0.1 SIGMA 1. NOPBC

The fr.ndx ﬁle should contain the details on how each of the switching functions in the contact matrix

should calculated:

CONTACT 1 1 2 3.0 6 10 100.0 1

CONTACT 2 1 15 3.0 6 10 100.0 1

CONTACT 3 17 2 3.0 6 10 100.0 1

CONTACT 4 15 19 3.0 6 10 100.0 1

The fr.mps ﬁle contains the values of the reference matrices:

1 1 2 0.99491645

2 1 15 0.76586085

3 17 2 0.79183088

4 15 19 0.81924184

END

1 1 2 0.99369661

2 1 15 0.76748693

3 17 2 0.76454272

4 15 19 0.72917217

END

One can also use contact matrices that involve contacts between the cen-

ters of mass of groups of atoms. However, in this case, a further additional

ﬁle containing the deﬁnition of the groups must be provided. The name of

this ﬁle is speciﬁed in the PLUMED input ﬁle, using the GROUP keyword. Then

within the speciﬁed ﬁle each group is deﬁned on a single line starting with

the GROUP keyword. This keyword is followed by a numerical label for the

group, the number of atoms in the group and then a list of the indices of

the various atoms that make up the group. To instruct PLUMED to use these

group contacts rather than atomic contacts one must use lines starting with

the keyword GROUP in the index ﬁle. The remainder of the format of the

lines in the index ﬁle and the format of the corresponding lines in the map

ﬁle are identical to the lines used to specify atomic contacts. However, the

indices that would have speciﬁed the indices of the atoms involved in the

atomic contact must be replaced with the indices from the group ﬁle of the

two groups that make up the contact.

Example.

The following command deﬁnes a path using the contact matrix metrics. 2 frames are used to deﬁne the

path, and a value of λ= 0.1is set.

S PATH TYPE CMAP NFRAMES 2 INDEX fr.ndx MAP fr.mps GROUP fr.grp LAMBDA 0.1 SIGMA 1.

The ﬁle fr.grp deﬁnes three groups of atoms:

GROUP 1 4 23 43 56 457

GROUP 2 5 76 47 97 322 695

GROUP 3 4 17 15 19 2

The fr.ndx ﬁle contains the parameters of the contacts between groups:

GROUP 1 1 2 3.0 6 10 100.0 1

GROUP 2 1 3 3.0 6 10 100.0 1

GROUP 3 2 3 3.0 6 10 100.0 1

Finally, the fr.mps has the values of the contact matrix in the reference positions:

1 1 2 0.9949

2 1 3 0.7658

3 2 3 0.7918

END

1 1 2 0.9936

2 1 3 0.6674

3 2 3 0.8645

END

It is possible to have maps in which there are both atomic and group

contacts. However, be aware that, in the index ﬁles in which the switching

functions are speciﬁed, the deﬁnitions of the group contacts MUST follow

the deﬁnitions of the atomic CONTACT functions.

Example.

An example of an index ﬁle containing both atom-atom contacts and group contacts:

CONTACT 1 123 545 7.0 6 10 100.0 0.50

CONTACT 2 224 244 8.5 6 10 100.0 0.50

GROUP 3 1 2 3.0 6 10 100.0 1.00

4.18.4 Using path variables as MSD, DMSD and CMAP

and the TARGETED statement

If one deﬁnes a zpath collective variable with a single frame it is clear from

the deﬁnition

z=−λ−1log Z=−λ−1log X

e−λdf

that this is equivalent to using to the squared distance of the current conﬁg-

uration from a reference structure in the chosen metric. This CV can thus

be used in simulations which employ the standard MSD, distance DMSD or

CMAP distance from a single frame as a collective coordinate. However, in

this case we recommend use of the alias TARGETED instead, which deﬁnes the

exact same CV but with a far simpler syntax.

Example.

The following command instructs PLUMED to do steered MD towards a target frame using MSD metrics.

PRINT W STRIDE 10

TARGETED TYPE MSD FRAMESET ref frame.pdb

STEER CV 1 TO 3.0 VEL 0.5 KAPPA 500.0

ENDMETA

A single PDB ﬁle must be provided: ref frame.pdb, in which the last two columns specify which of the

atoms are to be used for alignment and which are to be used to calculate the MSD/DMSD (see section

4.18.1).

ATOM 1 C ALA 2 -0.186 -1.490 -0.181 1.00 0.00

ATOM 2 O ALA 2 -0.926 -2.447 -0.497 1.00 1.00

ATOM 15 N ALA 2 0.756 0.780 -0.955 1.00 0.00

ATOM 17 CA ALA 2 0.634 -0.653 -1.283 1.00 1.00

ATOM 19 CB ALA 2 2.063 -1.233 -1.286 1.00 1.00

END

In the case of CMAP the input for PLUMED is

PRINT W STRIDE 10

TARGETED TYPE CMAP INDEX CMAPINDEX MAP CMAPVALUES

STEER CV 1 TO 3.0 VEL 0.5 KAPPA 500.0

ENDMETA

where the CMAPVALUES ﬁle contains only one map (for details on the format of the CMAPINDEX and

CMPAVALUES ﬁle see section 4.18.3).

Additionally one can specify another keyword SQRT that transform the

metrics in its square root, namely the mean square deviation would be

changed into the more commonly used root mean square deviation. Simi-

larly it applies to DMSD and CMAP. It should be stressed that the use of this

keyword is particularly dangerous especially for values close to zero since the

square root has a cusp there. The former example would look like:

Example.

PRINT W STRIDE 10

TARGETED TYPE MSD SQRT FRAMESET ref frame.pdb

STEER CV 1 TO 3.0 VEL 0.5 KAPPA 500.0

ENDMETA

4.19 Contact Map

The Contact Map is deﬁned as the sum of the contacts between a number

of atom pairs speciﬁed by the user. A contact is deﬁned in terms of the

switching functions introduced for the coordination number CV in section

4.6. Each contact can have its own set of parameters, which are deﬁned in

a ﬁle speciﬁed by the keyword INDEX. See the documentation in paragraph

4.18.3 for a detailed explanation of the index ﬁle syntax.

As for PCV in contact map space (see paragraph 4.18.3), one can deﬁne

contacts between the centers of mass of groups of atoms. The name of the

ﬁle in which groups are deﬁned is speciﬁed by using the GROUP keyword. See

the documentation in paragraph 4.18.3 for additional details.

The optional ﬂag NOPBC can be used to calculate distances without ap-

plying periodic boundary conditions.

Example.

The following command deﬁnes the CV Contact Map. Distances are calculated without periodic boundary

conditions.

CMAP INDEX fr.ndx SIGMA 1. NOPBC

The fr.ndx ﬁle should contain the details on how each of the switching functions in the contact matrix

should calculated:

CONTACT 1 1 2 3.0 6 10 100.0 1

CONTACT 2 1 15 3.0 6 10 100.0 1

CONTACT 3 17 2 3.0 6 10 100.0 1

CONTACT 4 15 19 3.0 6 10 100.0 1

4.20 Energy

The ENERGY keyword instructs PLUMED to use the total potential energy of

the system [46, 47, 48, 49] as a CV. Currently this CV is available only in

GROMACS4, AMBER and DL POLY.

Example.

The following lines instruct PLUMED to use the potential energy of the system as a CV.

ENERGY SIGMA 100.0

4.21 Helix loops

The HELIX keyword instructs PLUMED to use the number of α-helix loops as a

CV. A helix loop is formed when the pair of dihedral angles (Φ,Ψ) for three

consecutive residues along the chain all adopt a particular pair of reference

values (¯

Φ,¯

Ψ). Typically in an alpha helix ¯

Φ and ¯

Ψ have values of -1.200 and

-0.785 respectively. Having speciﬁed reference values for the dihedrals the

total number of loops is calculated using:

N−1

i=2

i+1

j=i−1

4hcos(Φj−¯

Φi)+1ihcos(Ψj−¯

Ψi)+1i,(4.3)

where Nis the total number of residues. This CV requires the user to specify

the number of loops with the NLOOP keyword. Then for each loop the user

should provide three sets of four atoms that deﬁne the dihedrals Φi−1,Φi,Φi+i

and a reference value for the dihedral ¯

Φialong with three sets of four atoms

that deﬁne the dihedrals Ψi−1,Ψi,Ψi+iand a reference value the dihedral ¯

Ψi.

Example.

The following lines instruct PLUMED to use the number of α-helix loops as a CV.

HELIX NLOOP 2 SIGMA 0.1

10 12 14 20 20 22 24 35 35 37 39 49 -1.200 12 14 20 22 22 24 35 37 37 39 49 51 -0.785

20 22 24 35 35 37 39 49 49 51 53 59 -1.200 22 24 35 37 37 39 49 51 51 53 59 61 -0.785

4.22 PCA projection

The PCA keyword instructs PLUMED to use as CV the projection of a set

of atoms on a previously calculated Principal Components Analysis (PCA)

eigenvector[50]. A typical application of this CV is to explore the system

along its principal directions of ﬂuctuations [51].

The current conformation Xis ﬁrst aligned to a reference structure Xref

(except if the optional NOALIGN keyword is used) and then projected on the

speciﬁed eigenvector e0. The set of atoms used for alignment must be the

same as the one the eigenvector refers to. The user should specify the ﬁlename

of the reference structure after the keyword FRAME and the ﬁlename of the

eigenvector after the keyword EIGENVEC.

If the optional DIFF keyword is used, the diﬀerence between the current

(aligned) conformation and the centered reference structure is projected on

the eigenvector.

The CV is calculated as :

sP CA(X, Xref , e0) =

i=1

<Rref (Xi−XCM ), e0

or, when using the DIFF keyword, as :

sP CA(X, Xref , e0) =

i=1

<Rref (Xi−XCM )−Xref

0, e0

where Nis the number of atoms used to align and project, Rref is the 3×3

rotation matrix, calculated using the Kearsley [45] algorithm, that optimally

overlaps the current centered set of atoms (Xi−XCM ) into the centered

reference set of atoms Xref

0,XCM is the current centroid of conformation X

(i.e. identical masses are assumed) and <, > denotes the usual inner product.

If the optional NOALIGN keyword is used, no alignment is performed (i.e.

the above rotation matrix Ris the identity). This latter keyword is supported

mainly for debug purposes.

Example.

The following command deﬁnes a CV as projection, along the eigenvector listed in egv0.dat, of the current

conformation aligned to the reference structure speciﬁed in ref.dat:

PCA FRAME ref.dat EIGENVEC egv0.dat SIGMA 0.1

where ref.dat contains the reference structure in the format ATOMID X Y Z:

# my reference structure

2 1.324 1.045 1.550

5 1.316 1.174 1.469

6 1.345 1.174 1.350

7 1.287 1.286 1.538

and egv0.dat contains the eigenvector in the same format:

# my first PCA eigenvector

2 0.12 3.28 0.19

5 0.53 1.37 1.10

6 -0.23 1.93 0.33

7 5.32 -2.56 1.44

More eigenvectors can be used with additional PCA keywords as additional

CVs, nevertheless, the set of atoms and the reference structure must be the

same for all of them.

Example.

The following command deﬁnes two CVs as projections, along two distinct eigenvectors listed in egv0.dat

and egv1.dat, of the difference between the current aligned conformation and the reference structure

speciﬁed in ref.dat:

PCA FRAME ref.dat EIGENVEC egv0.dat DIFF SIGMA 0.1

PCA FRAME ref.dat EIGENVEC egv1.dat DIFF SIGMA 0.1

Some ﬁnal notes :

•the atom and eigenvector coordinates should be in engine units (e.g. in

nm for GROMACS)

•the alignment is not mass-weighted (identical masses for all the involved

atoms are assumed), and since the routine used for the alignment is the

same as the one used to calculate the RMSD, the same cautions hold. In

particular, (i) users should ensure that the value of the hardcoded limit

(MAXATOMS RMSD in metadyn.h) is suﬃciently large for their needs prior

to compilation, (ii) the use of double precision code is recommended as

well as (iii) the use of the directive ALIGN ATOMS. See also 4.28.

•starting the simulation from the identical conformation used for align-

ment should be avoided since it could generate numerical instabilities

in the calculation of the rotation matrix, possibly leading to a crash of

the simulation.

•comments line beginning with ’#’ are allowed but blank lines must be

avoided from the input ﬁles.

4.23 SPRINT topological variables

The SPRINT keyword instructs PLUMED to use as CV the “Social PeRmutation

INvarianT” coordinates described in Ref. [52]:

Si=√N λmax vmax,sorted

where λmax and vmax

iare the largest eigenvalue and the corresponding

eigenvector of the N×Ncontact matrix among Natoms:

j=1

Cij vmax

j=λmax vmax

and Cij is the same coordination function deﬁned for the CV COORD in Sec-

tion 4.6, with the corresponding parameters speciﬁed by the keywords R 0,

D0(both in units of the host MD code, e.g. Bohr for CPMD), NN, and MM.

In Sithe eigenvector components are sorted from the smallest to the largest,

so that S1< S2< S3< ... < SN. Which Sihas to be used as CV is speciﬁed

by the keyword INDEX: e.g. INDEX 2 means that the 2nd one has to be used,

namely S2. Note that the sorting of the eigenvector is performed only within

sets of atoms of the same element (automatically identiﬁed in PLUMED by

the atomic mass): if the list of Natoms includes e.g. two carbon and four

hydrogen atoms, the SPRINT coordinates are sorted only within carbon atoms

and within hydrogen atoms, without mixing the two elements (since atoms of

diﬀerent elements are not indistinguishable). Due to this, after keyword R0

it must be speciﬁed the number of diﬀerent element pairs and the R 0-value

for each possible pair of elements. The same holds for D 0. See below for an

example.

Example.

The following command deﬁnes a SPRINT CV S2from a set of two carbon (1,2) and four hydrogen atoms

(3,4,5,6):

SPRINT LIST <all> NN 6 MM 12 R 0 3 5.0 4.2 4.2 INDEX 2 SIGMA 1.0

all->

1 2 3 4 5 6

all<-

In the example above R 0 3 5.0 4.2 4.2 refers to the three pairs of

elements C-C, C-H, and H-H.

Note that it is desirable to keep a tail of the coordination function long

enough so that the system appears formally as a single connected cluster of

atoms. If a part is disconnected from the rest, the Perron-Frobenius theo-

rem does not hold anymore and several Sicomponents will go to zero (see

Ref. [52]).

4.24 Radial distribution function

The keyword RDF instructs plumed to use the number of distances between

atoms in a given range as a collective variable. A recent paper [53] showed

how a number of such collective variables could be used to deﬁne the instan-

taneous radial distribution function and how this description of the structure

of small clusters could be used to enhance sampling in reconnaissance meta-

dynamics simulations.

The number of distances within a given range is calculated using:

s=X

i,j Zb

aw(r−rij)dr(4.4)

where rij is the distance between atoms iand jand wis a Gaussian win-

dow function with width σ. When using multiple such CVs calculating the set

of distances separately for each bead in the RDF would be computational

expensive and counterproductive. As such all RDF collective coordinates

that involve the same atoms are calculated at the same time. In order that

multiple RDF collective coordinates can be used in a single calculation RDF

beads are labeled using the RDF LABEL keyword. Obviously if all your

RDF CVs come from the same RDF the RDF LABEL should be 1 for all

your RDF coordinates. An example input for this type of CV is as follows:

Example.

The following command instructs plumed to calculate two collective coordinates the number of distances

between 3.0 and 3.5 and the number of distances between 3.5 and 4.0. The value of σin this calculation

is 0.25.

RDF RDF LABEL 1 LIST <all> <all> RANGE 3.0 3.5 WIDTH 0.25

RDF RDF LABEL 1 LIST <all> <all> RANGE 3.5 4.0 WIDTH 0.25

all->

1 2 3 4 ...

all<-

Example.

The following command instructs plumed to calculate two collective coordinates the number of distances

between atoms in group ¡all¿ between 3.0 and 3.5 and the number of distances in group ¡all2¿ between

3.5 and 4.0. The value of σin this calculation is 0.25.

RDF RDF LABEL 1 LIST <all> <all> RANGE 3.0 3.5 WIDTH 0.25

RDF RDF LABEL 2 LIST <all2> <all2> RANGE 3.5 4.0 WIDTH 0.25

all->

1 2 3 4 ...

all<-

all2->

5 6 7 8 ...

all2-<

4.25 Angular distribution function

Rather than using the radial distribution function as a collective variable one

can use the distribution of angles between central atoms and the atoms in

the ﬁrst hydration sphere [53]. Once again angular distribution functions of

this type have been combined employed successfully with the reconnaissance

metadynamics algorithm. The number of angles within a given range is

calculated using:

i=1

j=1

k=1

σ(rij)σ(rik)Zb

aw(θ−θjik)dθ(4.5)

where σ(r) = 1−r−d0

r0n

1−r−d0

r0m(4.6)

where wis once again a Gaussian window function with width σ. The

two switching functions σ(rij ) and σ(rik) are used to make sure that one only

takes angles in the ﬁrst hydration sphere. A detailed description of how to set

the parameters in these functions can be found in the section of this manual

on coordination numbers. Much as was described above for the RDF the

RDF LABEL keyword ensures that the code does not waste time calculating

all the angles involved in these ADF beads multiple times. If all your ADF

CVs are based on the positions of the same atoms then RDF LABEL should

be 1 for all your ADF coordinates. In other words this keyword should be

used to diﬀerentiate between the various angular distributions functions you

are calculating in input.

Example.

The following command instructs plumed to use the number of angles in the ﬁrst coordination sphere that

are between 0.5 and 1.0 radians as a CV. The value of σin this calculation is 0.25.

ADF RDF LABEL 1 LIST <all> <all> <all> RANGE 0.5 1.0 WIDTH 0.25 R 0 3.0 NN 6 MM 12 D 0

1.5

all->

1 2 3 4 ...

all<-

4.26 Polynomial combination of CVs

Polynomial combinations of collective variables, given by the functional form

i=1

ci(CVi−si)ni

can be deﬁned using the POLY directive, followed by the TERMS keyword that

speciﬁes the number kof terms to be included in the sum. In the next k

lines, the compulsory keyword CV deﬁnes the number CViof the collective

variable in the i-th term; optionally the values of ci,siand nican be speciﬁed

preceded respectively by the keywords COEFF,SHIFT and EXP. If not speci-

ﬁed they assume the default values of ci= 1 si= 0 ni= 1. Fractional and

negative values of the exponent are allowed, but singularities should be care-

fully avoided by imposing restraints on the regions explored by the collective

variables involved.

Warning! To combine more than 20 collective variables (k > 20), the hardset allocation for intpar

and vecpar in metadyn.h should be increased accrdingly and the code recompiled.

Example.

The following command deﬁnes a new CV as CV1−CV2

POLY TERMS 2 SIGMA 1.0

CV 1

CV 2 COEFF -1 EXP 2

More complicated functions might need multiple auxiliary CVs to be de-

ﬁned.

Example.

The following command deﬁnes a new CV as √3CV1+ CV2

POLY TERMS 2

CV 1 COEFF 3

CV 2

NOHILLS CV 3

POLY TERMS 1 SIGMA 1.0

CV 3 EXP 0.5

4.27 Function of CVs

General function of CVs with arbitrary form can be deﬁned using the FUNCTION

directive. You should deﬁne this function AFTER you deﬁned the variables

you are combining.

This function, still experimental, makes use of libmatheval library that

you should download from http://www.gnu.org/software/libmatheval/

and compile along with them. You may ﬁnd useful infos on the syntax in

http://www.gnu.org/software/libmatheval/manual/. It is still experi-

mental so if you want to compile it you should hack your Makeﬁle after

patching. I tried in NAMD2.7. You should change a couple of lines in the

Makeﬁle.

Example.

in the Makeﬁle (dots represent the arguments that you already ﬁnd in the Makeﬁle)

CXXBASEFLAGS = ... $(COPTI)/my/path/to/matheval/install dir x86 64/include

and later

namd2: -L/my/path/to/matheval/install dir x86 64/include/lib -lmatheval ...

100

while in SANDER (v10) I had to change

Example.

in $AMBERHOME/src/conﬁg amber.h I added the needed stuff:

AMBERBUILDFLAGS=-DAMBER -DHAVE MATHEVAL -L/my/path/to/matheval/install dir x86 64/lib

-lmatheval -I/my/path/to/matheval/install dir x86 64/include

It is crucial to put the function of CVs after all the cvs you use to deﬁne it!!!!

Example.

The following command deﬁnes a new CV as function of the two angles

PRINT W STRIDE 1

ANGLE LIST 7 9 15

ANGLE LIST 9 15 17

FUNCTION " 2*CV 1 + (CV 2+CV 1)0.2 "

Additional note is that the function is not periodic.

4.28 A note on periodic boundary conditions

PLUMED is designed so that for the majority of the CVs implemented the

periodic boundary conditions are treated in the same manner as they would

be treated in the host code. However, there are some exceptions; namely:

•Average coordinate of a group of atoms;

•RGYR;

•DISTANCE,MINDIST,COORD,HBONDS, and path collective variables S PATH

and Z PATH with contact map metrics (TYPE CMAP), if the NOPBC ﬂag is

used;

•Path Collective Variables S PATH and Z PATH with RMSD (TYPE RMSD);

101

•ALPHARMSD,ANTIBETARMSD,PARABETARMSD.

In all these cases, it is essential that the atoms involved in the deﬁnition

of the CV are all part of a single, unbreakable object such as a molecule.

Furthermore, it is essential that in the coordinates passed to PLUMED the

molecules are kept intact. We are aware of at least two cases where this

condition is not satisﬁed; namely, when using domain decomposition or the

option for periodic molecules within the host code GROMACS4.

In these cases one must use the additional directive ALIGN ATOMS, which

takes the LIST keyword. By using this command the user can deﬁne an or-

dered group of atoms, which are to be aligned such that the distance between

adjacent atoms in the list is minimized. In the majority of cases, the atoms

in this list should be in the same order as they appear in the pdb ﬁle as

usually these ﬁles are arranged in way that reﬂects how close together atoms

are in the molecular topology. As for the number of atoms that must be

speciﬁed in this list it is often suﬃcient to just specify those atoms involved

in the CVs. However, in cases where the atoms involved are separated by

a large distances along a chain alignment of intermediate atoms will also be

required.

Example.

Input for running a metadynamics simulation using the end-to-end distance in a protein as a CV with

GROMACS4 and domain decomposition:

PRINT W STRIDE 10

HILLS HEIGHT 2.0 W STRIDE 10

DISTANCE LIST 9 238 SIGMA 0.35 NOPBC

ALIGN ATOMS LIST <C-alpha>

C-alpha->

9 16 31 55 69 90 102 114 124 138 160 174 194 208 224 238

C-alpha<-

ENDMETA

102

Chapter 5

Postprocessing

5.1 Estimating the free energy after a meta-

dynamics run

The program sum hills.f90 is a tool for summing up the Gaussians laid

during the metadynamics trajectory and obtaining the free energy surface.

5.1.1 Installation instructions

As sum hills.f90 is a simple fortran 90 program, the installation is straight-

forward so long as you have a fortran compiler available on your machine. As

an example, with the gnu g95 compiler one would compile sum hills.f90

using the following command:

g95 -O3 sum hills.f90 serial.f90 -o sum hills.x

For post processing of large HILLS ﬁles we recommend that, if you have a

multicore machine available, you use the parallel version, which is compiled

thus:

mpif90 -O3 sum hills.f90 parallel.f90 -o sum hills mpi.x

5.1.2 Usage

The sum hills program takes its input parameters from the command line.

If run without options, this brief summary of options is printed out.

103

Example.

USAGE: sum hills.x -file HILLS -out fes.dat -ndim 3 -ndw 1 2 -kt 0.6 -ngrid 100 100 100

-ndim 3 number of collective variables NCV

-ndw 1 ... CVs for the free-energy surface

-ngrid 50 ... mesh dimension. DEFAULT :: 100

-dp ... size of the mesh of the output free energy

-fix 1.1 ... optional definition of the FES domain

-stride 10 how often the FES is written

-cutoff e 1.e-6 the hills are cut off at 1.e-6

-cutoff s 6.25 the hills are cut off at 6.25 std dev from the center

-2pi x [0; 2π]periodicity on the x CV,

if -fix is not used 2pi is used

-pi x [−π;π]periodicity on the x CV,

if -fix is not used 2pi is used

-kt 0.6 kT in the energy units

-grad apply periodicity using degrees

-bias <biasfact>writing output the bias for a well tempered mtd

-file HILLLS input file

-out fes.dat output file

-hills nhills number of Gaussians that are read

-aver naver time-average the bias profile over the last naver Gaussians

The program works in the following way.

Using the -file and -out ﬂags one tells the program the names of the

input ﬁle containing the Gaussian hills ﬁle and the name of the ﬁle in which

the free-energy will be outputted. In the absence of any instruction sum hills

assumes these ﬁles are to be called HILLS and fes.dat.

The number of CVs in the HILLS ﬁle is speciﬁed using -ndim whilst the

number of CVs the output free energy surface is to be plotted as a function

of is controlled by -ndw followed by the list of CVs in the desired order. Note

that after being read the CV are reordered in the code according to ndw. As

a direct result EVERY output uses this new order. If the number of CVs

requested using -ndw is less than the number of CVs in in HILLS ﬁle the CVs

not speciﬁed for output will be integrated out with the Boltzmann weight

KbTspeciﬁed by -kt (kT must be given in the energy units that were used

by in the code in which the simulation was performed)

The position, width and height of the Gaussians are read from the ﬁle

speciﬁed by the -file option (HILLS is the default) and the free-energy

surface is printed out on a grid in gnuplot format with a blank line added

after each block of data. Gnuplot is very handy for quick visualization of 3D

data (e.g. the FES as a function of 2 CVs).

For eﬃciency, the Gaussians are truncated at a certain distance from their

center before being placed on the grid. -cutoff sand -cutoff e allow one

104

to tune this cutoﬀ distance. With -cutoff s the value read speciﬁes this

truncation distance in terms of the number of standard deviations from the

center. For consistency this should be set equal to the DP2CUTOFF used

in the metadynamics simulation (the default value for this parameter is 6.25

both in PLUMED and in sum hills). Alternatively, the user may specify this

cutoﬀ distance as the distance at which the energy of the Gaussian hill falls to

less than some critical value speciﬁed using the -cutoff e ﬂag. The energy

speciﬁed after this ﬂag must be in the units used during the metadynamics

simulation. N.B. if the cutoﬀ used in post-processesing is diﬀerent to that

used during the simulation the calculated free energy diﬀers from the bias

which was actually applied during the metadynamics simulation.

The size of the output grid can be controlled either with -ngrid followed

by the number of grid points in each CV direction or by -dp followed by

the size of the voxel in each CV direction. If neither of these options are

speciﬁed, the grid size is assumed to be 100 in each direction and the voxel

size is calculated such that all the input Gaussians ﬁt into the grid. The

-fix option allow one to ﬁx the boundaries of the output free energy and

after this ﬂag. two real numbers are expected for each CV.

-stride allows one to print out the evolution of the free-energy as a

function of time, i.e. as a function of the index of the Gaussian. -stride

expects an integer which speciﬁes how many Gaussians are added to the

calculated free energy surface between print outs. The progressive free energy

is printed in ﬁles named fes.dat.XXX where XXX is an increasing counter.

The ﬁnal free-energy is printed in fes.dat. This is useful for creating movies

of the how the free-energy wells ﬁll as a function of time.

-hills is used to integrate only a part of the Gaussian ﬁles. It expects

an integer that speciﬁes the maximum number of Gaussians to be read from

input.

-naver is used to plot the time average of the bias proﬁle over the last

given number of hills (see Eq. 33 in Ref. [23])

When periodic CVs like angles and dihedrals are used as CVs, periodicity

options must be speciﬁed. The ﬂags -pi and -2pi, which are followed by

the CV index, specify that that the CV is periodic between [-pi;pi] or [0;2pi]

respectively. The -grad ﬂag speciﬁes that the CV values are being given

degrees rather than radians.

105

5.2 Evaluating collective variables on MD tra-

jectories

The program driver can be used to evaluate the value of any of the CVs

implemented in PLUMED for all the structures in a given trajectory ﬁle.

For GROMACS users: note that driver has some limitation in the choice

of the simulation box shape. With GROMACS, it could be convenient to use

the -rerun option of mdrun together with PLUMED.

5.2.1 Installation instructions

Before compiling driver, the ﬁles contained in the common files directory

must be linked to the utilities/driver directory (i.e. this can be done

using ./getlinks.sh).

To compile using g95/gcc, type:

make arch=g95

To compile with gfortran/gcc, type:

make arch=gfortran

To compile with gfortran/gcc on a 64 bit machine, type:

make arch=gfortan 64

To compile with ifort/icc Intel compilers, type:

make arch=intel

To use other compilers and/or compiler ﬂags you will need to modify the

Makeﬁle.

5.2.2 Usage

This program takes its input parameters from the command line. If run

without options, this brief summary of options is printed out.

106

Example.

Invoking driver without arguments prints a list of the available options:

USAGE :

driver -pdb PDB FILE -dcd DCD FILE -plumed PLUMED FILE -ncv

(-interval min1 max1 min2 max2 -out OUT FILE -nopbc -cell CELLX CELLY CELLZ)

-pdb pdb (one frame) connected to dcd file

-dcd trajectory file

-plumed PLUMED-like input file

-ncv number of collective variables

-out pdb output filename for clustering format (optional)

-interval extract frames with CV in this interval (optional)

-nopbc don’t apply pbc (optional)

-cell provide fixed box dimension in Angstrom

for orthorhombic PBC (optional)

The user must provide a structure ﬁle in PDB format (only one frame,

not a movie), within which the atoms’ masses and charges are inserted in

the occupancy ﬁeld and in the B-factor ﬁeld respectively. This data must be

provided as it is required for the calculation of certain collective variables,

such as the center of masses or dipoles.

The trajectory ﬁle must be a CHARMM format DCD ﬁle (also NAMD

2.1 and later). To convert other trajectory ﬁles into this format, the user can

download the utility CatDCD from:

http://www.ks.uiuc.edu/Development/MDTools/catdcd/.

Within the input ﬁle driver the user can specify a printing stride (in

number of DCD frames) using the keyword W STRIDE and the list CVs he/she

wishes to calculate. This ﬁle has the same syntax as the PLUMED input ﬁle.

The CVs value will be printed on the COLVAR ﬁle. For a complete list of the

CVs driver can calculate, please see section 4.

Example.

If you wish to evaluate the distance between atom 13 and 17 for every frame in your trajectory ﬁle, the

input ﬁle for driver would be as follows:

PRINT W STRIDE 1

DISTANCE LIST 13 17

ENDMETA

By default, driver uses orthorhombic periodic boundary conditions and

looks for cell information in the DCD ﬁle. If these are not present, you must

107

either provide the ﬁxed dimensions of the box in Angstrom (only orthorhom-

bic PBCs are allowed) using the keyword -cell CELLX CELLY CELLZ or

switch oﬀ the pbc using the -nopbc ﬂag.

Driver can also be used to extract frames in a speciﬁc window of the CVs

space and write these extracted frames to a PDB ﬁle. To use this functional-

ity use the option -out output.pdb and specify the interval with -interval

CVmin CVmax.

Example.

To extract those frames in which the distance between two speciﬁed atoms is between 10.0 and 12.0 ˚

while the angle deﬁned by three speciﬁed atoms is between 2.0 and 2.3 rad, the user should prepare the

following input ﬁle (named plumed.dat):

PRINT W STRIDE 1

DISTANCE LIST 13 17

ANGLE LIST 19 20 22

ENDMETA

and then invoke driver using the following command:

./driver -pdb dialanine.pdb -dcd dialanine.dcd -plumed plumed.dat -ncv 2 -out

window.pdb -interval 10.0 12.0 2.0 3.0

The output ﬁle window.pdb is written in a format appropriate for use with the GROMACS tool g cluster,

which can be used to perform a cluster analysis on your set of frames.

5.3 Processing COLVAR ﬁles

Since PLUMED 1.2, a more ﬂexible format has been adopted for COLVAR ﬁles.

These ﬁles can be parsed with the simple plumedat.sh tool. If run it without

options, this brief summary of options is printed out.

108

Example.

Invoking plumedat.sh without arguments prints a list of the available options:

syntax:

plumedat.sh f1 f2 ... < file

plumedat.sh -l < file

file is the name of the COLVAR file

f1, f2, ... are the names of the required fields

if a required field is not available in the COLVAR file, "NA" is written in the output

with -l, the available choices are listed

example:

plumedat.sh time temp < COLVAR

prints a two-column file, with the time in the first column and the temperature in the

second column

5.4 PLUMED as a standalone program

PLUMED can be run as a standalone program so that it can be easily integrated

in a script. This is particularly useful whenever the time between one PLUMED

call and the other is large (e.g. in ab initio programs) and one aims at mini-

mizing the time needed for implementation. plumed standalone consists in

a simple program, very much like the driver tool, that reads the conﬁgura-

tion and prints out the forces to be added and the other various output ﬁles

typical of PLUMED.

5.4.1 Installation instructions

Before compiling plumed standalone, the ﬁles contained in the common files

directory must be linked to the utilities/standalone directory (i.e. this

can be done using ./getlinks.sh).

To compile using g95/gcc, type:

make arch=g95

To compile with gfortran/gcc, type:

make arch=gfortran

To compile with gfortran/gcc on a 64 bit machine, type:

make arch=gfortan 64

To compile with ifort/icc Intel compilers, type:

109

make arch=intel

To use other compilers and/or compiler ﬂags you will need to modify the

Makeﬁle. In this way you may obtain an executable which is plumed standalone.

5.4.2 Usage

PLUMED standalone expects a standard PLUMED input, a conﬁguration ﬁle for

your system (much in the format of xyz) and gives a screen output. The

conﬁguration ﬁle has the following format:

Example.

Example of plumed standalone input conﬁguration

TIME 0.100000 100

AMPLI 1.000000

BOLTZ 0.001987

BOX 1000.000000 1000.000000 1000.000000

-13.523670417 0.140088464 4.263822219 131.207999 0.000000

-9.821878867 -0.695171589 5.042585958 101.119998 0.000000

-7.193416272 -2.392065561 2.830069705 163.190996 0.000000

-3.844793108 -3.737129154 4.056782255 129.197997 0.000000

The conﬁguration ﬁle must have the keyword TIME followed by the step

length (in the units of the program) and AMPLI which is the conversion factor

between ˚

A and the actual length unit in the conﬁguration ﬁle. This is needed

when a RMSD calculation with respect to a pdb ﬁle in ˚

A must be performed.

In this case, if the conﬁguration is in Bohr, the AMPLI factor must be set

equal to 1.889725989. BOLTZ is the Boltzmann factor in the chosen units and

BOX followed by three numbers speciﬁes the dimensions of the orthorombic

simulation box.

The following lines specify the conﬁguration. The ﬁrst three columns are

the x, y and z coordinates, the fourth is the mass in units of the program

(in the case above a coarse grained program takes the mass of all the single

residues) and in the last column you may put the charge (in the code above

it was not needed). PLUMED invocation may be done with command line:

110

Example.

Example of invocation of PLUMED standalone

./plumed standalone -coord x.xyz -plumed plumed input.cfg

plumed standalone returns back a ﬁle named plumed forces.dat which

contains the additional energy and forces from the bias potential calculated

by the PLUMED module. Eventually, these must be summed to the ones of the

original program.

Example.

Example of plumed standalone output ﬁle. In the ﬁrst line is the additional bias potential calculated by

PLUMED, followed by the x, y and z components of the force for each atom.

2.35680

3.335998704981455 -3.684076849212145 -16.098665997131253

-0.9064447402311846 2.913567594563722 -1.0502785066801152

-0.7948355287665761 0.9861425018822194 0.6558434539267759

-1.9437110159112139 -0.6087819098852943 2.4119663558915296

5.5 Reweighting well-tempered metadynam-

ics calculations

From a converged metadynamics run we can calculate directly the canonical

probability distribution of the collective variables at a given temperature.

On the contrary, the statistics of other degrees of freedom is somehow dis-

torted by the application, during the simulation, of a time-dependent exter-

nal potential on the CVs. Diﬀerent possible techniques have been proposed

to reconstruct the probability distribution of variables other than the CVs

[54, 29, 21].

In well-tempered metadynamics, the reconstruction of the distribution of

variables diﬀerent from the CVs is particularly simple since for long times

the amount of bias added decreases to zero and the system becomes closer

and closer to equilibrium. The algorithm described in Ref. [21] consists of

three diﬀerent steps:

111

1. Accumulate the histogram of the CVs plus the other variables of inter-

est between two updates of the bias potential;

2. When a new Gaussian is added, evolve the histogram following:

P(R, t + ∆t) = e−β(˙

V(S(R),t)−h ˙

V(S,t)i)∆tP(R, t),(5.1)

where P(R, t) is the biased probability distribution, ˙

V(S(R), t) the

time derivative of the bias potential and the average in the exponent is

calculated in the biased ensemble;

3. At the end of the simulation, the unbiased distribution PB(R) can be

recovered from the histogram collected by using a standard umbrella

sampling reweighting:

PB(R)∝eβV (S(R),t)·P(R, t).(5.2)

5.5.1 Installation instructions

As reweight.f90 is a simple fortran 90 program, the installation is straight-

forward so long as you have a fortran compiler available on your machine.

As an example, with the gnu g95 compiler one would compile reweight.f90

using the following command:

g95 -O3 reweight.f90 -o reweight

112

5.5.2 Usage

Example.

We performed a metadynamics run with 2 CVs and we are interested in reconstructing the distribution of

a third variable.

reweight -colvar COLVAR -hills HILLS -ncv 2 -nvar 3 -stride 1 -fes 3 -temp 300

-welltemp

-hills HILLS filename

-colvar COLVAR filename

-out FES filename

-ncv number of variables in HILLS

-nvar number of variables in COLVAR

-stride ratio between COLVAR and HILLS stride

-fes ID of the variables for FES in output

-temp temperature in Kelvin

-ngrid histogram grid dimension

-nreject discard initial steps

-timeout stride for FES printout

-pi ID of the variables with [−π;π]periodicity

-welltemp control for well-tempered metadynamics

-kjoule energy in kjoule/mol

The code needs two ﬁles with the same format of the PLUMED HILLS and

COLVAR ﬁles. In the latter, the metadynamics CVs should appear in the ﬁrst

dcolumn followed by the variables whose distribution one wants to reweight.

The ratio between the stride in COLVAR and HILLS must be constant and

greater than 1. The more data you have for the histogram, the better.

Some important things to keep in mind:

•For the choice of the bin size, please follow the suggestions described

in section 3.4.5;

•Eq. 5.1 is exact. However, at the beginning of the simulation the

average of ˙

V(S(R), t) can be calculated only approximately. Luckily,

a possible initial error is recovered for long times. Alternatively, one

could discard the ﬁrst part of the trajectory using the -nreject option.

Please always check that your results are robust to a discard of initial

parts of the trajectory;

•As for the calculation of the FES with sum hills, remember to control

the convergence by plotting the reconstructed distribution at diﬀerent

times by using -timeout;

113

5.6 Bias-exchange simulations via the linux

shell

Bias-exchange simulations [18] can be performed using every MD engine,

employing the tool in the directory utilities/bias-exchange. Each replica

(walker) runs independently in a diﬀerent directory for a duration τexch equal

to the time between two exchanges of the bias. When all runs are ﬁnished,

a script called bias-exchange.sh takes care of attempting exchanges of the

bias and re-launching the simulation for each replica.

This procedure via the linux shell can be ineﬃcient in terms of computer

time (due to the need to stop and restart the simulations to perform ex-

changes) if τexch is chosen very short, e.g. <100 timesteps. However in the

literature times larger than 10 ps are typically employed for classical simu-

lations of biomolecules, while for ab-initio simulations this should not be an

issue already for τexch ∼0.1 ps. Note that for GROMACS PLUMED provides

also bias-exchange within a single parallel run, which is computationally more

eﬃcient (see Section 3.6.2).

A simulation proceeds as follows (see also the directory example). First,

the Fortran code exchange-tool.f90 is compiled using the command make

(modify the Makeﬁle if necessary). The script bias-exchange.sh and the

program exchange-tool.x must be copied in a chosen directory BASE DIR,

and the following variables inside the script bias-exchange.sh must be mod-

iﬁed:

•BASE DIR

•NWALKER is the number of replicas

•KBT is the Boltzmann factor kBTin units of the MD engine

•NSIMULATIONS is the maximum number of runs between exchanges for

each walker

Then, inside BASE DIR the user must create NWALKER directories called walker1,

walker2, etc., and in each one the following ﬁles must be present:

•an input for restarting an MD simulation of length τexch

•aPLUMED input ﬁle called plumed.dat (see example below)

114

•a script run-walker.sh (see example below)

Finally, the bias-exchange simulation is started by launching the MD runs in

each walker1,walker2, etc. directories using the command ./run-walker.sh

&(the latter script must be executable: it can be made so e.g. using com-

mand chmod a+x run-walker.sh). This script must launch the MD run in

foreground (no &character at the end of the line) and it must end with the

following lines:

touch READY

../bias-exchange.sh &

After a time τexch the runs ﬁnish and control passes to the bias-exchange.sh

script (in background), which attempts to exchange the bias (i.e. ﬁles HILLS

and plumed.dat, not atomic coordinates) between pairs of randomly chosen

replicas. Detailed output is printed in the ﬁle bias-exchange.log.

It is required to employ the keyword HILLS LABEL in each plumed.dat

input ﬁle, with a diﬀerent label for each walker: in this way PLUMED includes

the information (necessary for the bias-exchange tool) about the active CVs

in the COLVAR and HILLS ﬁles.

115

Example.

Example of plumed.dat input ﬁle in directory walker1. Note the keyword HILLS LABEL and that only CV

1 is biased.

HILLS RESTART HEIGHT 0.001 W STRIDE 100

PRINT W STRIDE 10

HILLS LABEL A

COORD LIST 1 <g234> NN 6 MM 12 R 0 5.0 SIGMA 0.1

g234->

2 3 4

g234<-

COORD LIST 2 <g134> NN 6 MM 12 R 0 5.0 SIGMA 0.1

g134->

1 3 4

g134<-

COORD LIST 3 <g124> NN 6 MM 12 R 0 5.0 SIGMA 0.1

g124->

1 2 4

g124<-

COORD LIST 4 <g123> NN 6 MM 12 R 0 5.0 SIGMA 0.1

g123->

1 2 3

g123<-

NOHILLS CV 2

NOHILLS CV 3

NOHILLS CV 4

ENDMETA

The other walkers will have a similar plumed.dat input ﬁle, but with

diﬀerent label (e.g. HILLS LABEL B for walker2, etc.) and diﬀerent active

CV (e.g. the second one in walker2, etc.).

Example.

Example of run-walker.sh script ﬁle: the script must be executable.

#!/bin/bash

# note that CPMD is launched in foreground:

cpmd.x input . -plumed >> output

# this creates and empty file called READY,

# telling the bias-exchange tool that the MD run has finished:

touch READY

# note that bias-exchange.sh is launched in background:

../bias-exchange.sh &

116

At the end of the bias-exchange simulation, the multidimensional free-

energy landscape as a function of several CVs can be reconstructed accord-

ing to the weighted-histogram algorithm in Ref. [29], e.g. employing the

METAGUI program [30] downloadable from

http://www.plumed-code.org/contributions

The bias-exchange simulation can be continued, after NSIMULATIONS steps

are reached and all replicas have ﬁnished their run, in the following way:

•enlarge NSIMULATIONS in the script bias-exchange.sh

•in each walker-directory, delete the ﬁle READY

•in each walker-directory, restart the run using ./run-walker.sh &

117

Bibliography

[1] M. Bonomi, D. Branduardi, G. Bussi, C. Camilloni, D. Provasi, P. Rai-

teri, D. Donadio, F. Marinelli, F. Pietrucci, R. A. Broglia, M. Parrinello,

PLUMED: a portable plugin for free energy calculations with molecular

dynamics, Comp. Phys. Comm. 180 (2009) 1961.

[2] B. Hess, C. Kutzner, D. van der Spoel, E. Lindahl, GROMACS 4: Al-

gorithms for highly eﬃcient, load-balanced, and scalable molecular sim-

ulation, J. Chem. Theory Comput. 4 (3) (2008) 435–447.

[3] J. C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa,

C. Chipot, R. D. Skeel, L. Kal´e, K. Schulten, Scalable molecular dynam-

ics with NAMD, J. Comput. Chem. 26 (16) (2005) 1781–802.

[4] W. Smith, C. Yong, P. Rodger, Mol. Simulat. 28 (2002) 385–471.

[5] S. J. Plimpton, Fast parallel algorithms for short-range molecular dy-

namics, J. Comp. Phys. 117 (1995) 1–19.

[6] D. A. Case, T. A. Darden, T. E. C. III, C. L. Simmerling, J. Wang,

R. E. Duke, R. Luo, K. M. Merz, D. A. Pearlman, M. Crowley, R. C.

Walker, W. Zhang, B. Wang, S. Hayik, A. Roitberg, G. Seabra, K. F.

Wong, F. Paesani, X. Wu, S. Brozell, V. Tsui, H. Gohlke, L. Yang,

C. Tan, J. Mongan, V. Hornak, G. Cui, P. Beroza, D. H. Mathews,

C. Schafmeister, W. S. Ross, P. Kollman, AMBER 9, University of Cal-

ifornia, San Francisco.

[7] M. Harvey, G. Giupponi, G. D. Fabritiis, ACEMD: Accelerated molecu-

lar dynamics simulations in the microseconds timescale, J. Chem. The-

ory and Comput. 5 (2009) 1632.

118

[8] P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavaz-

zoni, D. Ceresoli, G. L. Chiarotti, M. Cococcioni, I. Dabo, A. D. Corso,

S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann,

C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari,

F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto,

C. Sbraccia, S. Scandolo, G. Sclauzero, A. P. Seitsonen, A. Smogunov,

P. Umari, R. M. Wentzcovitch, QUANTUM ESPRESSO: a modular

and open-source software project for quantum simulations of materials,

J. Phys.: Condens. Matter 21 (2009) 395502.

[9] A. Laio, M. Parrinello, Escaping free energy minima, Proc. Natl. Acad.

Sci. USA 99 (2002) 12562–12566.

[10] G. Torrie, J. Valleau, Nonphysical sampling distributions in monte carlo

free energy estimation: Umbrella sampling, J. Comput. Phys. 23 (1977)

187–199.

[11] S. Kumar, J. M. Rosenberg, D. Bouzida, R. H. Swendsen, P. A. Kollman,

Multidimensional free-energy calculations using the weighted histogram

analysis method, J. Comput. Chem. 16 (1995) 1339–1350.

[12] B. Roux, The calculation of the potential of mean force using computer-

simulations, Comput. Phys. Comm. 91 (1995) 275–282.

[13] C. Jarzynski, Nonequilibrium equality for free energy diﬀerences, Phys.

Rev. Lett. 78 (1997) 2690–2693.

[14] G. Crooks, Nonequilibrium measurements of free energy diﬀerences for

microscopically reversible markovian systems, J. Stat. Phys. 90 (5-6)

(1998) 1481–1487.

[15] A. Barducci, G. Bussi, M. Parrinello, Well-tempered metadynamics: A

smoothly converging and tunable free-energy method, Phys. Rev. Lett.

100 (2) (2008) 020603.

[16] P. Raiteri, A. Laio, F. Gervasio, C. Micheletti, M. Parrinello, Eﬃcient

reconstruction of complex free energy landscapes by multiple walkers

metadynamics, J. Phys. Chem. B 110 (2006) 3533–3539.

119

[17] G. Bussi, F. L. Gervasio, A. Laio, M. Parrinello, Free-energy landscape

for beta hairpin folding from combined parallel tempering and metady-

namics, J. Am. Chem. Soc. 128 (41) (2006) 13435–41.

[18] S. Piana, A. Laio, A bias-exchange approach to protein folding, J. Phys.

Chem. B 111 (17) (2007) 4553–9.

[19] G. A. Tribello, M. Ceriotti, M. Parrinello, A self-learning algorithm for

biased molecular dynamics, Proc. Natl. Acad. Sci. USA 107 (41) (2010)

17509–17514.

[20] M. Marchi, P. Ballone, Adiabatic bias molecular dynamics: A method

to navigate the conformational space of complex molecular systems, J.

Chem. Phys. 110 (8) (1999) 3697–3702.

[21] M. Bonomi, A. Barducci, M. Parrinello, Reconstructing the equilibrium

Boltzmann distribution from well-tempered metadynamics, J. Comput.

Chem. 30 (11) (2009) 1615–1621.

[22] J. VandeVondele, M. Krack, F. Mohamed, M. Parrinello, Quickstep:

Fast and accurate density functional calculations using a mixed gaussian

and plane waves . . . , Comp. Phys. Comm. 167 (2005) 103–128.

[23] A. Laio, F. L. Gervasio, Metadynamics: a method to simulate rare events

and reconstruct the free energy in biophysics, chemistry and material

science, Rep. Prog. Phys. 71 (2008) 126601.

[24] A. Barducci, M. Bonomi, M. Parrinello, Metadynamics, WIREs Com-

put. Mol. Sci. 1 (2011) 826.

[25] V. Babin, C. Roland, C. Sagui, Adaptively biased molecular dynamics

for free energy calculations, J. Chem. Phys. 128 (2008) 134101.

[26] F. Baftizadeh, P. Cossio, F. Pietrucci, A. Laio, Protein folding and

ligand-enzyme binding from bias-exchange metadynamics simulations,

Current Physical Chemistry (in press).

[27] Y. Crespo, F. Marinelli, F. Pietrucci, A. Laio, Metadynamics con-

vergence law in a multidimensional system, Phys. Rev. E 81 (2010)

055701(R).

120

[28] C. Camilloni, D. Provasi, G. Tiana, R. A. Broglia, Exploring the protein

G helix free-energy surface by solute tempering metadynamics, Proteins

71 (2008) 1647.

[29] F. Marinelli, F. Pietrucci, A. Laio, S. Piana, A kinetic model of trp-

cage folding from multiple biased molecular dynamics simulations, PLoS

Comput. Biol. 5(8) (2009) e100045.

[30] X. Biarnes, F. Pietrucci, F. Marinelli, A. Laio, Metagui. a vmd inter-

face for analyzing metadynamics and molecular dynamics simulations,

Comp. Phys. Comm. 183 (2011) 203.

[31] A. Grossﬁeld, Wham.http://membrane.urmc.rochester.edu/

Software/WHAM/WHAM.html.

[32] D. Provasi, M. Filizola, Putative active states of a prototypic g-protein-

coupled receptor from biased molecular dynamics, Biophys. J. 98 (2010)

2347–2355.

[33] C. Camilloni, R. A. Broglia, G. Tiana, Hierarchy of folding and unfolding

events of protein g, ci2, and acbp from explicit-solvent simulations, J.

Chem. Phys. 134 (2011) 045105.

[34] J. B. Abrams, M. E. Tuckerman, Eﬃcient and direct generation of mul-

tidimensional free energy surfaces via adiabatic dynamics without coor-

dinate transformations, J. Phys. Chem. B 112 (2008) 15742–15757.

[35] L. Rosso, M. E. Tuckerman, An adiabatic molecular dynamics method

for the calculation of free energy proﬁles, Mol. Sim. 28 (2002) 91–112.

[36] L. Rosso, P. Minary, Z. Zhu, M. E. Tuckerman, On the use of adiabatic

molecular dynamics to calculate free energy proﬁles, J. Chem. Phys. 116

(2002) 4389–4402.

[37] L. Maragliano, E. Vanden-Eijnden, A temperature accelerated method

for sampling free energy and determining reaction pathways in rare

events simulations, Chem. Phys. Lett. 426 (2006) 168–175.

[38] Y. Liu, M. E. Tuckerman, Generalized gaussian moment thermostatting:

A new continuous dynamical approach to the canonical ensemble, J.

Chem. Phys. 112 (2000) 1685.

121

[39] J. Vymetal, J. Vondrasek, Gyration- and inertia-tensor-based collective

coordinates for metadynamics. application on the conformational be-

havior of polyalanine peptides and trp-cage folding, J. Phys. Chem. A

115 (2011) 11455–11465.

[40] D. N. Theodorou, U. W. Suter, Shape of unperturbed linear polymers:

polypropylene, Macromolecules 18 (6) (1985) 1206–1214.

[41] F. Pietrucci, A. Laio, A collective variable for the eﬃcient exploration

of protein beta-structures with metadynamics: application to sh3 and

gb1, J. Chem. Theory Comput. 5 (9) (2009) 2197–2201.

[42] M. Sega, E. Autieri, F. Pederiva, On the calculation of puckering free

energy surfaces, J. Chem. Phys. 130 (22) (2009) 225102.

[43] D. Branduardi, F. L. Gervasio, M. Parrinello, From A to B in free energy

space, J. Chem. Phys. 126 (5) (2007) 054103.

[44] M. Bonomi, D. Branduardi, F. Gervasio, M. Parrinello, The unfolded

ensemble and folding mechanism of the C-terminal GB1 βhairpin, J.

Am. Chem. Soc. 130 (42) (2008) 13938–13944.

[45] S. K. Kearsley, On the orthogonal transformation used for structural

comparison, Acta Cryst. A 45 (1989) 208–210.

[46] H. Li, D. Min, Y. Liu, W. Yang, Essential energy space random walk via

energy space metadynamics method to accelerate molecular dynamics

simulations, J. Chem. Phys. 127 (9) (2007) 094101.

[47] C. Michel, A. Laio, A. Milet, Tracing the entropy along a reactive path-

way: The energy as a generalized reaction coordinate, J. Chem. Theory

Comput. 5 (9) (2009) 2193–2196.

[48] D. Donadio, P. Raiteri, M. Parrinello, Topological defects and bulk melt-

ing of hexagonal ice, J. Phys. Chem. B 109 (12) (2005) 5421–5424.

[49] M. Bonomi, M. Parrinello, Enhanced sampling in the well-tempered

ensemble, Phys. Rev. Lett. 104 (2010) 190601.

[50] A. Amadei, A. B. M. Linssen, H. J. C. Berendsen, Essential dynamics

of proteins., Proteins Struct. Funct. Genet. 17 (1993) 412–425.

122

[51] L. Sutto, M. D. Abramo, F. L. Gervasio, Comparing the eﬃciency of bi-

ased and unbiased molecular dynamics in reconstructing the free energy

landscape of met-enkephalin, J. Chem. Theory Comput. 6 (12) (2010)

3640–3646.

[52] F. Pietrucci, W. Andreoni, Graph theory meets ab initio molecular dy-

namics: atomic structures and transformations at the nanoscale, Phys.

Rev. Lett. 107 (2011) 085504.

[53] G. A. Tribello, J. Cuny, H. Eshet, M. Parrinello, Exploring the free

energy surfaces of clusters using reconnaissance metadynamics, J. Chem.

Phys 135 (11) (2011) 114109.

[54] G. Tiana, Estimation of microscopic averages from metadynamics, Eur.

Phys. J. B 63 (2) (2008) 235–238.

123

Index

Absolute position, see Collective vari-

ables, Absolute position

ADF, see Collective variables, ADF

Adiabatic bias molecular dynamics,

Alpha-beta similarity, see Collective

variables, Alpha-beta similar-

ity

Alpharmsd, see Collective variables,

Alpharmsd

Angle, see Collective variables, Angle

Antibetarmsd, see Collective variables,

Antibetarmsd

Atom lists, 64

Bias exchange simulations, 42

Bias-exchange simulations via the linux

shell, 114

Collective variables

ADF, 98

Absolute position, 66

Alpha-beta similarity, 77

Alpharmsd, 78

Angle, 69

Antibetarmsd, 79

Contact Matrix distance, 91

Coordination number, 70

DMSD, 91

Diﬀerence of distances, 67

Dihedral correlation, 77

Dipole, 76

Distance from an axis, 68

Distance, 67

Electrostatic potential, 81

FUNCTION, 100

Gyration tensor based CVs, 75

Hydrogen bonds, 71

Interfacial water, 73

MSD, 91

Minimal distance, 68

POLY, 99

Parabetarmsd, 80

Path variables, 83

Projection on an axis, 68

Puckering coordinates, 82

RDF, 97

Radius of Gyration, 74

SPRINT, 96

Targeted, 91

Torsion, 70

Contact Matrix distance, see Collec-

tive variables, Contact Matrix

distance

Coordination number, see Collective

variables, Coordination num-

ber

Diﬀerence of distances, see Collective

variables, Diﬀerence of distances

Dihedral correlation, see Collective vari-

ables, Dihedral correlation

124

Dipole, see Collective variables, Dipole

Distance, see Collective variables, Dis-

tance

Distance from an axis, see Collective

variables, Distance from an axis

DMSD, see Collective variables, DMSD

Electrostatic potential, see Collective

variables, Electrostatic poten-

tial

FUNCTION, see Collective variables,

FUNCTION

Grid for metadynamics, 30

Gyration tensor based CVs, see Col-

lective variables, Gyration ten-

sor based CVs

Hydrogen bonds, see Collective vari-

ables, Hydrogen bonds

Interfacial water, see Collective vari-

ables, Interfacial water

Keywords

R 0 , 97

ABMD, 48

ACYLINDRICITY, 76

ALIGN ATOMS, 95, 102

ALPHABETA, 36, 77

ALPHARMSD, 78, 102

ANGLE, 69

ANTIBETARMSD, 79, 102

ASPHERICITY, 76

BASINS, 54, 56

BASIN TOL, 56, 57

BETA, 69

BIASXMD, 42

CLUSTER, 54, 56

CMAP, 92

COEFF, 99

COLVAR, 53, 54, 62, 63

COMMITMENT, 53

COORD, 36, 96, 101

COS, 69, 70

CV LIST, 59

CV, 61, 99

DAFED CONTROL, 62

DAFED STATE, 62

DAFED, 61

DIFF, 94

DIHCOR, 77

DIPOLE, 76

DIR, 67

DISTANCE, 67, 101

DMSD, 84, 92

DRMS, 84

D 0, 96

EIGENVEC, 94

ELSTPOT, 81

ENDMETA, 26

ENERGY, 9, 93

EXPAND PARAM, 57

EXP, 99

EXTERNAL, 50

E sj, 63

FILENAME, 32, 33

FRAME, 94

FUNCTION, 100

GRID, 31, 32

GTPC 1, 75

GTPC 2, 75

GTPC 3, 75

HBONDS, 71, 101

HEIGHT, 58

HELIX, 93

HILLS LABEL, 43, 115

125

HILLS, 27, 28, 58

INDEX 2, 96

INDEX, 92, 96

INITIAL SIZE, 57

INTERVAL, 36, 37

INVERT, 36, 38, 39

JACOBIAN FORCE, 62

KAPPA2, 76

KAPPA, 61

LIST, 54, 64, 102

LWALL, 37, 50

MASS-WEIGHTED, 75

MASS, 61

MINDIST, 68, 101

MINUS PI, 62

MM, 96

MSD, 84

NLOOP, 93

NN, 96

NO-WEIGHTED, 75

NOALIGN, 94

NOHILLS, 35, 36, 42

NOPBC, 67, 69, 71, 73, 88, 92, 101

NOSPLINE, 32

NOTE, 26

NO CENTER, 86

NO ROT, 86

ONIONS, 54, 58

PARABETARMSD, 80, 102

PCA, 94, 95

PERIODIC, 62

PLUS 2PI, 62

PLUS PI, 62

POINT FROM AXIS, 68

POLY, 99

POSITION, 66

PRINT, 26

PROJ GRAD, 53, 54

PROJ ON AXIS, 68

PTMETAD, 41

PUCKERING, 82

RDF, 97

READ GRID, 33

RECONNAISSANCE, 54, 58, 59

RESTART, 58, 62

RGYR 1, 75

RGYR 2, 76

RGYR 3, 76

RGYR, 74, 75, 101

RMSD, 84

RUN FREQ, 56

R 0, 96

SHIFT, 99

SIGMA, 36, 38, 42, 64

SIMTEMP, 41

SIN, 69, 70

SPRINT, 96

SQRT, 91

STEERPLAN, 46

STEER, 46

STORE FREQ, 56, 57

S PATH, 83–85, 87, 101

TARGETED, 91

TAUTHERMO, 61

TEMPERATURE, 61

TERMS, 99

TORSION, 69, 70

TRACE, 75

TYPE, 75

T sj, 63

UMBRELLA, 45

UWALL, 50

WATERBRIDGE, 73

WIDTH, 58

WRITE GRID, 32

WRITE STATE, 62

126

W STRIDE, 32, 58

W sj, 63

Z PATH, 83–85, 87, 101

#, 26

sj, 63

Minimal distance, see Collective vari-

ables, Minimal distance

MSD, see Collective variables, MSD

Output ﬁles, Metadynamics, 27

Parabetarmsd, see Collective variables,

Parabetarmsd

Parallel tempering metadynamics, 41

Path variables, see Collective variables,

Path variables

Periodic boundary conditions, 101

plumedat.sh, 27, 108

POLY, see Collective variables, POLY

Projection on an axis, see Collective

variables, Projection on an axis

Puckering coordinates, see Collective

variables, Puckering coordinates

Radius of Gyration, see Collective vari-

ables, Radius of Gyration

RDF, see Collective variables, RDF

SPRINT, see Collective variables, SPRINT

Standalone, 109

Steered MD, 46

Steerplan, 46

Tabulated potentials, 50

Targeted, see Collective variables, Tar-

geted

Torsion, see Collective variables, Tor-

sion

Umbrella sampling, 45

Units, 27

Walls, 50

Well-tempered metadynamics Reweight,

111

127

Manual

Navigation menu

Versions of this User Manual:

Views

Navigation