TY - JOUR
T1 - Prediction of octanol-water partition coefficients for the SAMPL6- log P molecules using molecular dynamics simulations with OPLS-AA, AMBER and CHARMM force fields
AU - Fan, Shujie
AU - Iorga, Bogdan I.
AU - Beckstein, Oliver
N1 - Funding Information:
The authors would like to thank David L. Mobley and Teresa Danielle Bergazin for useful discussions and sharing unpublished data. Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Awards Number R01GM118772 and R01GM125081. BII was supported in part by Grants ANR-10-LABX-33 (LabEx LERMIT) and ANR-14-JAMR-0002-03 (JPIAMR) from the French National Research Agency (ANR), and by a Grant DIM MAL-INF from the Région Ile-de-France
Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020/5/1
Y1 - 2020/5/1
N2 - All-atom molecular dynamics simulations with stratified alchemical free energy calculations were used to predict the octanol-water partition coefficient log Pow of eleven small molecules as part of the SAMPL6-log P blind prediction challenge using four different force field parametrizations: standard OPLS-AA with transferable charges, OPLS-AA with non-transferable CM1A charges, AMBER/GAFF, and CHARMM/CGenFF. Octanol parameters for OPLS-AA, GAFF and CHARMM were validated by comparing the density as a function of temperature, the chemical potential, and the hydration free energy to experimental values. The partition coefficients were calculated from the solvation free energy for the compounds in water and pure (“dry”) octanol or “wet” octanol with 27 mol% water dissolved. Absolute solvation free energies were computed by thermodynamic integration (TI) and the multistate Bennett acceptance ratio with uncorrelated samples from data generated by an established protocol using 5-ns windowed alchemical free energy perturbation (FEP) calculations with the Gromacs molecular dynamics package. Equilibration of sets of FEP simulations was quantified by a new measure of convergence based on the analysis of forward and time-reversed trajectories. The accuracy of the log Pow predictions was assessed by descriptive statistical measures such as the root mean square error (RMSE) of the data set compared to the experimental values. Discarding the first 1 ns of each 5-ns window as an equilibration phase had a large effect on the GAFF data, where it improved the RMSE by up to 0.8 log units, while the effect for other data sets was smaller or marginally worsened the agreement. Overall, CGenFF gave the best prediction with RMSE 1.2 log units, although for only eight molecules because the current CGenFF workflow for Gromacs does not generate files for certain halogen-containing compounds. Over all eleven compounds, GAFF gave an RMSE of 1.5. The effect of using a mixed water/octanol solvent slightly decreased the accuracy for CGenFF and GAFF and slightly increased it for OPLS-AA. The GAFF and OPLS-AA results displayed a systematic error where molecules were too hydrophobic whereas CGenFF appeared to be more balanced, at least on this small data set.
AB - All-atom molecular dynamics simulations with stratified alchemical free energy calculations were used to predict the octanol-water partition coefficient log Pow of eleven small molecules as part of the SAMPL6-log P blind prediction challenge using four different force field parametrizations: standard OPLS-AA with transferable charges, OPLS-AA with non-transferable CM1A charges, AMBER/GAFF, and CHARMM/CGenFF. Octanol parameters for OPLS-AA, GAFF and CHARMM were validated by comparing the density as a function of temperature, the chemical potential, and the hydration free energy to experimental values. The partition coefficients were calculated from the solvation free energy for the compounds in water and pure (“dry”) octanol or “wet” octanol with 27 mol% water dissolved. Absolute solvation free energies were computed by thermodynamic integration (TI) and the multistate Bennett acceptance ratio with uncorrelated samples from data generated by an established protocol using 5-ns windowed alchemical free energy perturbation (FEP) calculations with the Gromacs molecular dynamics package. Equilibration of sets of FEP simulations was quantified by a new measure of convergence based on the analysis of forward and time-reversed trajectories. The accuracy of the log Pow predictions was assessed by descriptive statistical measures such as the root mean square error (RMSE) of the data set compared to the experimental values. Discarding the first 1 ns of each 5-ns window as an equilibration phase had a large effect on the GAFF data, where it improved the RMSE by up to 0.8 log units, while the effect for other data sets was smaller or marginally worsened the agreement. Overall, CGenFF gave the best prediction with RMSE 1.2 log units, although for only eight molecules because the current CGenFF workflow for Gromacs does not generate files for certain halogen-containing compounds. Over all eleven compounds, GAFF gave an RMSE of 1.5. The effect of using a mixed water/octanol solvent slightly decreased the accuracy for CGenFF and GAFF and slightly increased it for OPLS-AA. The GAFF and OPLS-AA results displayed a systematic error where molecules were too hydrophobic whereas CGenFF appeared to be more balanced, at least on this small data set.
KW - AMBER force field
KW - CHARMM force field
KW - Free energy perturbation
KW - Ligand parametrization
KW - Molecular dynamics
KW - OPLS-AA force field
KW - Octanol-water partition coefficient
KW - Solvation free energy
UR - http://www.scopus.com/inward/record.url?scp=85078245327&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078245327&partnerID=8YFLogxK
U2 - 10.1007/s10822-019-00267-z
DO - 10.1007/s10822-019-00267-z
M3 - Article
C2 - 31960254
AN - SCOPUS:85078245327
SN - 0920-654X
VL - 34
SP - 543
EP - 560
JO - Journal of Computer-Aided Molecular Design
JF - Journal of Computer-Aided Molecular Design
IS - 5
ER -