Training a Scientific Reasoning Model for Chemistry
Abstract
A reasoning model trained with reinforcement learning on chemistry problems outperforms general-purpose models and specialized models in molecular design tasks, requiring less data.
Reasoning models are large language models that emit a long chain-of-thought before answering, providing both higher accuracy and explicit reasoning for their response. A major question has been whether language model reasoning generalizes beyond mathematics, programming, and logic, where most previous work has focused. We demonstrate that reasoning models can be post-trained for chemistry without additional domain pretraining, and require substantially less data compared to contemporary domain-specific models. We report ether0, a 24B parameter LLM (based on Mistral-Small-24B) that can reason in natural language and respond with chemical structures. This reasoning model was trained with reinforcement learning on 640,730 experimentally-grounded chemistry problems across 375 tasks ranging from synthesizability, to blood-brain barrier permeability, to human receptor activity, to scent. Our model exceeds general-purpose chemistry models, frontier models, and human experts on molecular design tasks. It is also more data efficient relative to specialized models. We anticipate that this method can be applied to train data-efficient language models specialized for tasks across a wide variety of scientific domains.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- ChemVLR: Prioritizing Reasoning in Perception for Chemical Vision-Language Understanding (2026)
- Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design (2026)
- Grounded Chess Reasoning in Language Models via Master Distillation (2026)
- A Multi-task Large Reasoning Model for Molecular Science (2026)
- Reward Design for Physical Reasoning in Vision-Language Models (2026)
- Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries (2026)
- Learning Adaptive Reasoning Paths for Efficient Visual Reasoning (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2506.17238 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash