首页 正文

M3-20M: A large-scale multi-modal molecule dataset for AI-driven drug design and discovery

{{output}}
This paper introduces M3-20M, a large-scale Multi-Modal Molecule dataset that contains over 20 million molecules, with the data mainly being integrated from existing databases and partially generated by large language models. Designed to support AI-driven drug... ...