20260512.0001v1MethodReleased: March 13, 20252 Views

Siamese Foundation Models for Crystal Structure Prediction

Liming Wu|Wenbing Huang|Rui Jiao|Jianxing Huang|Liwei Liu|Yipeng Zhou|Hao Sun|Yang Liu|Fuchun Sun|Yuxiang Ren|Jirong Wen

Abstract

Predicting crystal structures from chemical compositions is a fundamental challenge in materials discovery, complicated by complex 3D geometries that distinguish it from fields like protein folding. Here, we present Diffusion-based crystAl Omni (DAO), a pretrain-finetune framework for crystal structure prediction integrating two Siamese foundation models: a structure generator and an energy predictor. The generator is pretrained via a two-stage pipeline on a vast dataset of stable and unstable structures, leveraging the predictor to relax unstable configurations and guide the generative sampling. Across two well-known benchmarks, pretraining significantly enhances performance across multiple backbone architectures. Ablation studies confirm that the synergy between the generator and predictor mutually benefits both components. We further validate DAO on three real-world superconductors (Cr6Os2, Zr16Rh8O4, and Zr16Pd8O4) typically inaccessible to conventional computation. For Cr6Os2, DAO achieves a 100% match rate with experimental references and an atomic-position error of 0.0012 under 20-shot generation, performing over 2000 × faster per iteration than DFT-based structure predictors. These compelling results collectively highlight the potential of our approach for advancing materials science research.

Keywords

crystal structure predictiondiffusion modelsfoundation modelsSiamese networksCrysformerenergy-guided samplingsuperconductorsmaterials discoverypolymorphismcrystal generation

External Source

This is an externally sourced paper. It was originally published independently.