HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

Vaibhav Bihani; Utkarsh Pratiush; Sajid Mannan; Tao Du; Zhimin Chen; Santiago Miret; Matthieu Micoulaut; Morten M Smedskjaer; Sayan Ranu; N M Anoop Krishnan

EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

Abstract

Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model atomic interactions. However, thorough evaluations of these deploying EGraFFs for the downstream task of real-world atomistic simulations, is lacking. To this end, here we perform a systematic benchmarking of 6 EGraFF algorithms (NequIP, Allegro, BOTNet, MACE, Equiformer, TorchMDNet), with the aim of understanding their capabilities and limitations for realistic atomistic simulations. In addition to our thorough evaluation and analysis on eight existing datasets based on the benchmarking literature, we release two new benchmark datasets, propose four new metrics, and three challenging tasks. The new datasets and tasks evaluate the performance of EGraFF to out-of-distribution data, in terms of different crystal structures, temperatures, and new molecules. Interestingly, evaluation of the EGraFF models based on dynamic simulations reveals that having a lower error on energy or force does not guarantee stable or reliable simulation or faithful replication of the atomic structures. Moreover, we find that no model clearly outperforms other models on all datasets and tasks. Importantly, we show that the performance of all the models on out-of-distribution datasets is unreliable, pointing to the need for the development of a foundation model for force fields that can be used in real-world simulations. In summary, this work establishes a rigorous framework for evaluating machine learning force fields in the context of atomic simulations and points to open research challenges within this domain.

Benchmarks

BenchmarkMethodologyMetrics
formation-energy-on-3bpaBOTNet
MAE: 5
formation-energy-on-3bpaAllegro
MAE: 4.13
formation-energy-on-3bpaNequIP
MAE: 3.15
formation-energy-on-3bpaMACE
MAE: 4
formation-energy-on-acetylacetoneAllegro
MAE: 0.92
formation-energy-on-acetylacetoneNequIP
MAE: 1.38
formation-energy-on-acetylacetoneMACE
MAE: 2
formation-energy-on-acetylacetoneBOTNet
MAE: 2
formation-energy-on-aspirinBOTNet
MAE: 12.63
formation-energy-on-aspirinAllegro
MAE: 14.36
formation-energy-on-aspirinMACE
MAE: 13.79
formation-energy-on-aspirinNequIP
MAE: 9.27
formation-energy-on-ethanolBOTNet
MAE: 203.83
formation-energy-on-ethanolMACE
MAE: 209.96
formation-energy-on-ethanolAllegro
MAE: 6.94
formation-energy-on-ethanolNequIP
MAE: 4.99
formation-energy-on-geteMACE
MAE: 2670
formation-energy-on-geteAllegro
MAE: 1009.4
formation-energy-on-geteBOTNet
MAE: 3034
formation-energy-on-geteNequIP
MAE: 1780.951
formation-energy-on-lipsMACE
MAE: 30
formation-energy-on-lipsNequIP
MAE: 165.43
formation-energy-on-lipsBOTNet
MAE: 28
formation-energy-on-lipsAllegro
MAE: 31.75
formation-energy-on-lips20NequIP
MAE: 26.8
formation-energy-on-lips20Allegro
MAE: 33.17
formation-energy-on-lips20BOTNet
MAE: 24.59
formation-energy-on-lips20MACE
MAE: 14.05
formation-energy-on-naphthaleneMACE
MAE: 161.74
formation-energy-on-naphthaleneNequIP
MAE: 2.66
formation-energy-on-naphthaleneAllegro
MAE: 5.82
formation-energy-on-naphthaleneBOTNet
MAE: 182.55
formation-energy-on-salicylic-acidNequIP
MAE: 6.29
formation-energy-on-salicylic-acidMACE
MAE: 165.29
formation-energy-on-salicylic-acidBOTNet
MAE: 153.06
formation-energy-on-salicylic-acidAllegro
MAE: 8.59

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp