HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Text-Guided Molecule Generation with Diffusion Language Model

Haisong Gong; Qiang Liu; Shu Wu; Liang Wang

Text-Guided Molecule Generation with Diffusion Language Model

Abstract

Text-guided molecule generation is a task where molecules are generated to match specific textual descriptions. Recently, most existing SMILES-based molecule generation methods rely on an autoregressive architecture. In this work, we propose the Text-Guided Molecule Generation with Diffusion Language Model (TGM-DLM), a novel approach that leverages diffusion models to address the limitations of autoregressive methods. TGM-DLM updates token embeddings within the SMILES string collectively and iteratively, using a two-phase diffusion generation process. The first phase optimizes embeddings from random noise, guided by the text description, while the second phase corrects invalid SMILES strings to form valid molecular representations. We demonstrate that TGM-DLM outperforms MolT5-Base, an autoregressive model, without the need for additional data resources. Our findings underscore the remarkable effectiveness of TGM-DLM in generating coherent and precise molecules with specific properties, opening new avenues in drug discovery and related scientific domains. Code will be released at: https://github.com/Deno-V/tgm-dlm.

Code Repositories

deno-v/tgm-dlm
Official
jax
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
text-based-de-novo-molecule-generation-onTGM-DLM
BLEU: 82.6
Exact Match: 24.2
Frechet ChemNet Distance (FCD): 0.77
Levenshtein: 17.003
MACCS FTS: 85.4
Morgan FTS: 68.8
Parameter Count: 180000000
RDK FTS: 73.9
Text2Mol: 58.1
Validity: 87.1
text-based-de-novo-molecule-generation-onTGM-DLM w/o corr
BLEU: 82.8
Exact Match: 24.2
Frechet ChemNet Distance (FCD): 0.89
Levenshtein: 16.897
MACCS FTS: 87.4
Morgan FTS: 72.2
Parameter Count: 180000000
RDK FTS: 77.1
Text2Mol: 58.9
Validity: 78.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp