
Date: 8 – 11 July 2025
Venue: NUS University Town
Daily Programme Schedule
Morning Sessions
All morning sessions are to be held at Auditorium 1
8:00am | Registration | |||
8:50am | Opening Remarks |
Time | Speaker | Title | Abstract | Topic |
---|---|---|---|---|
9:00–9:40am | Gábor Csányi | Machine learning force fields shows extreme generalisation |
I will introduce the general problem of first principles force fields: creating surrogate models for quantum mechanics that yield the energy of a configuration of atoms in 3D space, as we would find them in materials or molecules. Over the last decade significant advances were made in the attainable accuracy, and today we can model materials and molecules with a per-atom energy accuracy of up to 1 part in 10,000 with a speedup of over a million or more compared to the explicit quantum mechanical calculation, enabling molecular dynamics on large length and time scales. The most surprising aspect of the best model is its extreme generalisation: fitted only on small periodic crystals, it shows stable trajectories on arbitrary chemical systems, from water to nanoparticles and proteins. I will show some of the technical details behind the success of our models: equivariant many-body graph polynomials with very few and weak nonlinearities. The relationship between the architectural elements and the extreme generalisation is still largely a mystery. The locality of the graph structure is key to its success, as well as high body order and message passing. The force fields get significantly better with more data, yet model size and complexity can remain largely the same. Integrating explicit long range electrostatics with such general "foundational" force fields is a challenge, as well as combining large datasets for materials and organic molecules, due to the incompatibility of leading DFT approximations.
|
AI for Chemistry |
9:40–10:00am | Rafael-Gomez Bombarelli | ML Gradients for Computational Chemistry |
In the physical sciences, ML has found great synergy with physics-based simulations. On the one hand, simulations can provide abundant, reproducible and scalable training data. Machine learning models can act as surrogates of expensive simulators, allowing improvements in speed and accessible length- and time-scales. In this talk, we describe recent work in the use of differentiable programming and deep learning models to fuse optimization and learning in molecular simulations, including the use of differentiable uncertainty to power active learning, alchemical ML potentials that capture atomic disorder in solids, learning collective variables or transport operators for accelerated simulations, and using discrete generative models and reinforcement learning for surface reconstruction. Lastly, we encourage a discussion around the need to relate (ML-accelerated) simulations to tangible impact in chemicals and materials.
|
AI for Chemistry |
10:00–10:20am | Ekin Doǧuş Çubuk | TBC |
N/A
|
AI for Materials Science |
Tea Break | ||||
10:40–11:20am | Weinan E | Building AI-Powered Infrastructure for Scientific Research |
I will discuss the progress we have made towards building a new generation of AI-powered infrastructure for scientific research.
|
AI for Science |
11:20–11:40am | Marin Soljačić | Novel computing paradigms |
Certain novel schemes for computing that use photons (instead of electrons).
|
Unconventional Computing |
11:40am–12:00pm | Richard Parker | TBC |
N/A
|
AI for Science |
Break | ||||
12:20–1:00pm | Gerbrand Ceder | AI and autonomous laboratories for materials synthesis |
Computational materials science has seen tremendous progress since the early days of Density Functional Theories. Stable algorithms enabled high-throughput computing which in turn enabled machine-learned potentials (MLP). Though far from perfect at this point, MLPs hold tremendous promise for accelerating materials simulation and discovery. Such progress is not parallelled on the experimental side, making it the gating factor in materials development. In response we built A-lab, an autonomous facility for the closed-loop synthesis of inorganic materials from powder precursors. All synthesis and characterization actions in A-lab, including powder mixing and grinding, firing, characterization by XRD and SEM, and all sample transfers between them are fully automated, leading to a lab that can synthesize and structurally characterize compounds within 10–20 hours of initiation. The A-lab leverages ab-initio computations through an API with the Materials Project, historical data sets that are text-mined from the literature, machine learning for optimization of synthesis routes and interpretation of characterization data, and active learning to plan and interpret the outcomes of experiments performed using robotics. The automation of synthesis and analysis can be further integrated into scientific workflows similar to computational workflows.
|
Self-Driving Labs |
1:00–1:40pm | Kristin Persson | Fueling The Era of Data-Driven Materials Design and Synthesis |
Fueled by increased availability of materials data, machine learning is poised to revolutionize materials science by enabling accelerated discovery, design, and optimization of materials. As one of the first and most visible of materials data providers, the Materials Project (www.materialsproject.org) uses supercomputing and an industry-standard software infrastructure together with state-of-the-art quantum mechanical theory to compute the properties of all known inorganic materials and beyond. The data, currently covering over 160,000 materials and millions of properties, is offered for free to the community together with online analysis and design algorithms. Serving a rapidly expanding community of more than 600,000 registered users, the Materials Project delivers millions of data records daily through its API, fostering data-rich research across materials science. This wealth of data is inspiring the development of machine learning algorithms aimed at predicting material properties, characteristics, and synthesizability. However, we note that truly accelerating materials innovation also requires rapid synthesis, testing and feedback, seamlessly coupled to existing data-driven predictions and computations. The ability to devise data-driven methodologies to guide synthesis efforts is needed as well as rapid interrogation and recording of results – including ‘non-successful’ ones. This talk will outline the rise of data-driven materials design, predictive synthesis and showcase successes as well as comment on current pitfalls and future directions.
|
AI for Chemistry |
Lunch |
Afternoon Sessions
(Parallel Sessions)
Morning Sessions
All morning sessions are to be held at Auditorium 1
Time | Speaker | Title | Abstract | Topic |
---|---|---|---|---|
8:50am | Session Chair / Announcements | |||
9:00–9:40am | Klaus Robert Müller | AI for the sciences -- toward understanding |
In recent years, machine learning (ML) and artificial intelligence (AI) methods have begun to play a more and more enabling role in the sciences and in industry. In particular, the advent of large and/or complex data corpora has given rise to new technological challenges and possibilities. In his talk, Müller will touch upon the topic of ML applications in the sciences, in particular in medicine, physics, chemistry. He will focus on techniques from explainable AI and their use for extracting information from machine learning models in order to further our understanding by explaining nonlinear ML models. Finally, Müller will briefly discuss perspectives and limitations.
|
AI for Science |
9:40–10:00am | Jack Wells | TBC |
TBC
|
TBC |
10:00am-10:20am | Yu Xie | Scalable emulation of protein equilibrium ensembles with generative deep learning |
Following the sequence and structure revolutions, predicting the dynamical mechanisms of proteins that implement biological function remains an outstanding scientific challenge. Several experimental techniques and molecular dynamics (MD) simulations can, in principle, determine conformational states, binding configurations and their probabilities, but suffer from low throughput. Here we develop a Biomolecular Emulator (BioEmu), a generative deep learning system that can generate thousands of statistically independent samples from the protein structure ensemble per hour on a single graphical processing unit. By leveraging novel training methods and vast data of protein structures, over 200 milliseconds of MD simulation, and experimental protein stabilities, BioEmu’s protein ensembles represent equilibrium in a range of challenging and practically relevant metrics. Qualitatively, BioEmu samples many functionally relevant conformational changes, ranging from formation of cryptic pockets, over unfolding of specific protein regions, to large-scale domain rearrangements. Quantitatively, BioEmu samples protein conformations with relative free energy errors around 1 kcal/mol, as validated against millisecond-timescale MD simulation and experimentally-measured protein stabilities. By simultaneously emulating structural ensembles and thermodynamic properties, BioEmu reveals mechanistic insights, such as the causes for fold destabilization of mutants, and can efficiently provide experimentally-testable hypotheses.
|
AI for Biology |
Tea Break | ||||
10:40–11:20am | Alex Aliper | From Algorithm to Human Clinical Trials: Accelerating Drug Discovery and Development With Generative AI and Robotics |
In this talk we will cover the application of AI to disease modeling, target discovery, indication prioritization, indication expansion, and small molecule drug design. We will explore key case studies and cover the current state of the industry, highlighting its limitations, bottlenecks, and opportunities for advancing drug discovery. We will also discuss the applications of generative AI to development of foundational models for chemistry and multi-species multi-omics life models for aging and fundamental biological research.
|
AI for Biology |
11:20–11:40am | Wei Lu | TBC |
TBC
|
Unconventional Computing |
12:00–12:20pm | Yujie Huang | KronosAI |
KronosAI has achieved state-of-the-art for silicon photonics. We are building foundation model-based physics solvers that are faster, more accurate and more generalizable than traditional numerical solvers. Yujie will discuss insights gained from KronosAI's training, and how our work is paving the way for sustainable, eco-friendly engineering by integrating inverse design and design space exploration into everyday practice.
|
AI for Physics |
12:20–12:20pm | Hsin-Yuan Huang | The vast world of quantum advantage |
While quantum devices promise extraordinary capabilities, discerning genuine advantages from mere illusions remains a formidable challenge. In this endeavor, quantum theorists are like prophets, striving to foretell a future where quantum technologies will transform our world. Most people understand quantum advantage primarily as offering faster computation. This talk explores the vast world of quantum advantages that extends far beyond computation to include fundamental advantages in learning, sensing, cryptography, and memory storage. I will also demonstrate how some quantum advantages are inherently unpredictable using classical resources alone, suggesting a landscape far richer than currently envisioned. As quantum technologies proliferate, these unpredictable advantages may enable transformative applications beyond the reach of our classical imagination.
|
Unconventional Computing |
Break | ||||
Poster Session & Lunch |
Afternoon Sessions
(Parallel Sessions)
Morning Sessions
All morning sessions are to be held at Auditorium 1
Time | Speaker | Title | Abstract | Topic |
---|---|---|---|---|
8:50am | Session Chair: Terence O'Kane | |||
9:00–9:40am | Stan Posey | Directions in Energy Efficient AI for Driving Earth Digital Twins |
AI is becoming a critical component of Earth system science workflows that are experiencing rapid growth in data from model output of increasing resolution in weather and climate models, and Earth observation systems that produce orders of magnitude more data than their previous generations. Efforts are underway in the weather and climate modeling community towards refining the horizontal resolution of atmosphere GCMs towards km-scale to explicitly resolve certain small-scale convective cloud processes and provide more realistic local information on climate change. At the same time, exascale HPC systems have arrived and in most cases are designed with GPU accelerator technology that offers opportunities in reasonable simulation turn-around times balanced with efficiency in energy consumption. Ultimately, output from global storm-resolving models at km-scale will become the essential driver behind the deployment of Earth digital twins for programs like the EC Destination Earth and NVIDIA Earth-2. For model emulation of the Earth system, AI models become increasingly accurate as they train on more data, yet computational and storage requirements in data-distributed computing environments with energy-efficiency considerations are the current challenges for the HPC vendor community. This talk will describe advances in HPC for GPU-accelerated numerical models, AI software and system features for large-scale data handling, and ML model training and inference that when combined, provide the critical components towards the vision of Earth system digital twins.
|
AI for Climate and Weather |
9:40-10:00am | Wessel Bruinsma | A Foundation Model for the Earth System: Air Pollution and Ocean Waves |
Aurora is a foundation model for the Earth system pretrained on a large and diverse collection of geophysical data. The key ability of Aurora is that the model can be fine-tuned to produce forecasts for a wide variety of environmental forecasting applications, often matching or even outperforming state-of-the-art traditional approaches at a fraction of the computational cost. In this first part of a two-part talk on Aurora, I will discuss the concept of a foundation model for the Earth system and show how Aurora can be fine-tuned to produce state-of-the-art operational forecasts for air pollution and ocean waves.
|
AI for Climate and Weather |
10:00-10:20am | TBC | TBC | N/A |
TBC |
Tea Break | ||||
10:40-11:00am | Dale Barker | Preparations for Next-Generation Weather and Climate Modelling at the Centre for Climate Research Singapore (CCRS) |
The tropical urban nature of the weather and climate in the Singapore region places particular requirements on future observation, models and IT infrastructure. Significant progress has been made in recent years to meet these requirements through added value km-scale Numerical Weather Prediction (NWP) and regional climate projections based largely on physical climate modelling. However, recent advances in AI4Weather/Climate create exciting opportunities for further added value. This talk will provide an overview of CCRS’ plans to move from the current generation physical climate/weather modelling system (based on the Unified Model - UM) to next-generation weather and climate modelling approaches employing a combination of physical, hybrid AI and fully-data driven models suitable for applications in the Singapore region.
|
AI for Climate and Weather |
11:00-11:40am | Sam Altman (TBC) | TBC | TBC |
AI Agents and LLMs for Science |
11:40-12:00pm | Boris Kozinsky | TBC | TBC |
AI for Chemistry |
12:00-12:20pm | Antonio Helio Castro Neto | TBC | TBC |
AI for Materials Science |
Break | ||||
Poster Session & Lunch |
Afternoon Sessions
(Parallel Sessions)
2:40pm to 3:00pm | |||||
Venue | Auditorium 1 | Lecture Theatre 50 | Lecture Theatre 51 | Lecture Theatre 52 | Lecture Theatre 53 |
Topic | AI for Materials Science | AI for Physics | ML Algorithmic Advances | AI for Biology | AI for Medicine and Healthcare |
Speakers | Stephen Roche | David Rousseau | Invited Talk | Invited Talk | Invited Talk |
Titles | Artificial Intelligence enabling disruptive Innovative Advanced Materials and Devices design & engineering | The impact of Artificial Intelligence on precision Higgs Boson physics | TBC | TBC | TBC |
Abstract | I will discuss the emerging need to massively use and deploy Artificial Intelligence (AI) techniques in materials and devices design and engineering, by overviewing our journey into the research of innovative advanced materials using AI in the field of interconnects technologies. I will point towards the revolution of fully describing as-grown materials with AI-driven reconstruction of digital models/twins with atomistic modelling, hence bringing the accuracy of first principles methods to unprecedented multimillions atoms models. This opens a tremendous capability to correlate in-depth atomistic-scale features of as-grown materials with resulting local and global chemical, physical and devices properties, offering reverse engineering and AI-driven co-piloting for industrial innovation strategies. |
Particle physics has a rich history of developing advanced simulators, ranging from modelling proton collisions to creating virtual detectors. We are now integrating Artificial Intelligence through "Neural Simulation Based Inference," which enables the analysis of complex phenomena that cannot be computed analytically from first principles. |
N/A |
N/A |
N/A |
4:40 to 5:00pm | |||||
Speakers | Kostya Novoselov | Gianmarco Mengaldo | Andrey Ustyuzhanin | Sergei V. Kalinin | Valentine P. Ananikov |
Title | TBC | TBC | TBC | Rewards are all we need: building autonomous materials s | Solving the Problem of Sleeping and Lost Data in Chemistry with Artificial Intelligence |
Abstract |
TBC
|
TBC
|
TBC
|
The trajectory of scientific research worldwide is guided by long-term goals, spanning the spectrum from curiosity and fundamental discoveries in physics to the applied challenges of enhancing materials and devices for a wide array of applications. However, the execution and assessment of daily research efforts typically hinges on multiobjective reward functions, which can be evaluated either during or at the conclusion of an experimental campaign. Although this concept is tacitly acknowledged within the scientific community, the implementation of autonomous experimental workflows in automated laboratories necessitates the formulation of robust reward functions and their seamless integration across various domains. Should these reward functions be universally established, the entirety of experimental efforts could be conceptualized as optimization and decision making problems. Here, I will present our latest advancements in the development of autonomous research systems based on electron and scanning probe microscopy, as well as for automated materials synthesis based on reward driven workflows and reward integration across domains. We identify several categories of reward functions that are discernible during the experimental process, including imaging optimization, fundamental physical discoveries, the elucidation of correlative structure-property relationships, and the optimization of microstructures. The operationalization of these rewards function on autonomous microscopes is demonstrated, as well as strategies for human in the loop intervention. Utilizing these classifications, we construct a framework that facilitates the integration of multiple optimization workflows, demonstrated through the synchronous orchestration of diverse characterization tools across a shared chemical space, and the concurrent navigation of costly experiments and models that adjust for epistemic uncertainties between them.
|
Rapid accumulation of experimental data, coupled with the slow speed of its analysis, presents a significant challenge in modern chemistry and materials science. A substantial portion of experimental results remains underutilized, categorized as "sleeping" or "lost" data, which hinders innovation and slows down progress. In several data-intensive research projects in chemistry, only 10% of the data is analyzed manually, while around 90% of the data remains not analyzed. Artificial intelligence (AI) offers an efficient solution by enabling the systematic retrieval, organization, and analysis of such data, leading to improved decision-making and accelerated discovery.
|
Morning Sessions
All morning sessions are to be held at Auditorium 1
8:50am | Session Chair: Ryo Yoshida |
Time | Speaker | Title | Abstract | Topic |
---|---|---|---|---|
9:00–9:40am | Aaron Chatterji (TBC) | TBC |
N/A
|
AI Agents and LLMs for Science |
9:40–10:00am | Alexandre Tkatchenko | Realizing Schrödinger's Dream with AI-Enabled Molecular Simulations |
The convergence between accurate quantum-mechanical (QM) models (and codes) with efficient machine learning (ML) methods seem to promise a paradigm shift in molecular simulations. Many challenging applications are now being tackled by increasingly powerful QM/ML methodologies. These include modeling covalent materials, molecules, molecular crystals, surfaces, and even whole proteins in explicit water. In this talk, I will attempt to provide a reality check on these recent advances and on the developments required to enable fully quantum dynamics of complex functional (bio)molecular systems. Multiple challenges are highlighted that should keep theorists in business for the foreseeable future.
|
AI for Physics |
10:00–10:20am | Benjamin Sanchez-Lengeling | TBC |
N/A
|
AI for Materials Science |
Tea Break | ||||
10:20–10:40am | Xujie Si | The Science and Engineering of Autoformalizing Mathematics: A Case Study in Euclidean Geometry |
Formalizing mathematics into machine-checkable logic is essential for advancing scientific rigor and enabling powerful AI reasoning. However, the process of translating informal mathematical text into formal languages remains a major bottleneck. This talk explores the challenge of autoformalization—the automated conversion of natural mathematical language into formal logic—through the lens of Euclidean geometry, one of the oldest and most foundational domains in mathematics. I will present insights from our recent work on LeanEuclid and PyEuclid, which demonstrate how modern Large Language Models (LLMs), combined with formal methods, can help bridge the gap between informal and formal mathematical reasoning.
|
N/A |
10:40–11:00am | Yang-Hui He | The AI Mathematician |
We argue how AI can assist mathematics in three ways: theorem-proving, conjecture formulation, and language processing. Inspired by initial experiments in geometry and string theory in 2017, we summarize how this emerging field has grown over the past years, and show how various machine-learning algorithms can help with pattern detection across disciplines ranging from algebraic geometry to representation theory, to combinatorics, and to number theory. At the heart of the programme is the question how does AI help with theoretical discovery, and the implications for the future of mathematics.
|
AI for Mathematics |
11:00–11:20am | Sergei Gukov | What kind of game is mathematics? |
It comes as no surprise that solving challenging research-level math problems drives progress in mathematics. What is more surprising, though, is that solving such long-standing open problems also contributes to an entirely different field: the development of the next generation AI systems. We live in an exciting time where mathematics and AI can greatly benefit each other, and the goal of the talk is to explain how and why, drawing on specific examples from knot theory and combinatorial group theory. Based on recent work with Ali Shehper, Anibal Medina-Mardones, Lucas Fagan, Bartłomiej Lewandowski, Angus Gruen, Yang Qiu, Piotr Kucharski, and Zhenghan Wang.
|
AI for Mathematics |
11:20–11:40am | Nicola Marzari | The electronic-structure genome of inorganic materials |
The structure and properties of inorganic materials have been extensively explored in the last decade with machine learning models built on computational databases. Typically, the descriptors are based on atomic positions, and the properties are thermodynamic quantities such as energies, forces, stresses. This extremely successful paradigm has even given rise to foundational - i.e., universal - machine learning models. Here, we switch our attention to electronic-structure properties, and to electronic-structure descriptors. For this, we have built robust and reliable protocols able to map automatically the electronic-structure of a material (typically calculated at the level of density-functional theory) into the exact but also minimal set of maximally localized Wannier functions. These latter provide the most compact representation of any desired manifold of electronic structure bands. We constructed more than 1.3M Wannier functions for 20,000+ inorganic, stoichiometric, and experimentally known materials, drawn from the Materials Cloud MC3D database. For these, we explore materials or materials combinations that could deliver optimal performance as thermoelectrics, as nonlinear Hall materials, and as heterojunctions for solar cells.
|
AI for Materials Science |
11:40am–12:00pm | Xavier Bresson | Graph Transformers for Molecular Science -- Overcoming Limitations in Graph Representation Learning |
Graph Neural Networks (GNNs) have shown great potential in graph representation learning but are limited by over-squashing and poor long-range dependency capture. In this work, we introduce Graph ViT, a novel approach that leverages Visual Transformers (ViT). This new architecture addresses the standard challenges by effectively capturing long-range dependencies, improving memory and computational efficiency, and offering high expressive power in graph isomorphism. These advantages enable Graph ViT to outperform traditional message-passing GNNs, especially in molecular science applications.
|
ML Algorithmic Advances |
Break | ||||
12:20–12:40pm | Sunny Lu | TBC |
N/A
|
AI for Finance |
12:40–1:00pm | TBC | TBC |
N/A
|
TBC |
1:00–1:20pm | TBC | TBC |
N/A
|
TBC |
1:20–1:40pm | TBC | TBC |
N/A
|
N/A |
Lunch |
Afternoon Sessions
(Parallel Sessions)