Article (Scientific journals)
Many-Body Descriptors for Predicting Molecular Properties with Machine Learning: Analysis of Pairwise and Three-Body Interactions in Molecules
Pronobis, Wiktor; Tkatchenko, Alexandre; Müller, Klaus-Robert
2018In Journal of Chemical Theory and Computation, 14, p. 13
Peer Reviewed verified by ORBi
 

Files


Full Text
126-many-body-descriptors-ML-JCTC-2018.pdf
Publisher postprint (4.62 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] Machine learning (ML) based prediction of molecular properties across chemical compound space is an important and alternative approach to efficiently estimate the solutions of highly complex many-electron problems in chemistry and physics. Statistical methods represent molecules as descriptors that should encode molecular symmetries and interactions between atoms. Many such descriptors have been proposed; all of them have advantages and limitations. Here, we propose a set of general two-body and three-body interaction descriptors which are invariant to translation, rotation, and atomic indexing. By adapting the successfully used kernel ridge regression methods of machine learning, we evaluate our descriptors on predicting several properties of small organic molecules calculated using density-functional theory. We use two data sets. The GDB-7 set contains 6868 molecules with up to 7 heavy atoms of type CNO. The GDB-9 set is composed of 131722 molecules with up to 9 heavy atoms containing CNO. When trained on 5000 random molecules, our best model achieves an accuracy of 0.8 kcal/mol (on the remaining 1868 molecules of GDB-7) and 1.5 kcal/mol (on the remaining 126722 molecules of GDB-9) respectively. Applying a linear regression model on our novel many-body descriptors performs almost equal to a nonlinear kernelized model. Linear models are readily interpretable: a feature importance ranking measure helps to obtain qualitative and quantitative insights on the importance of two- and three-body molecular interactions for predicting molecular properties computed with quantum-mechanical methods.
Disciplines :
Physics
Author, co-author :
Pronobis, Wiktor ;  Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
Tkatchenko, Alexandre  ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Physics and Materials Science Research Unit
Müller, Klaus-Robert
 These authors have contributed equally to this work.
External co-authors :
yes
Language :
English
Title :
Many-Body Descriptors for Predicting Molecular Properties with Machine Learning: Analysis of Pairwise and Three-Body Interactions in Molecules
Publication date :
11 May 2018
Journal title :
Journal of Chemical Theory and Computation
ISSN :
1549-9618
eISSN :
1549-9626
Publisher :
American Chemical Society, United States - District of Columbia
Volume :
14
Pages :
13
Peer reviewed :
Peer Reviewed verified by ORBi
Focus Area :
Physics and Materials Science
European Projects :
H2020 - 725291 - BeStMo - Beyond Static Molecules: Modeling Quantum Fluctuations in Complex Molecular Environments
Funders :
CE - Commission Européenne [BE]
Available on ORBilu :
since 01 January 2019

Statistics


Number of views
131 (15 by Unilu)
Number of downloads
851 (15 by Unilu)

Scopus citations®
 
61
Scopus citations®
without self-citations
44
OpenCitations
 
54
WoS citations
 
58

Bibliography


Similar publications



Contact ORBilu