AI4X Programme Schedule
AI4X Logo

Date: 8 – 11 July 2025

Venue: NUS University Town

Daily Programme Schedule

Morning Sessions
All morning sessions are to be held at Auditorium 1
8:00amRegistration
8:50amOpening Remarks
Time Speaker Title Abstract Topic
9:00–9:40am Gábor Csányi Machine learning force fields shows extreme generalisation
I will introduce the general problem of first principles force fields: creating surrogate models for quantum mechanics that yield the energy of a configuration of atoms in 3D space, as we would find them in materials or molecules. Over the last decade significant advances were made in the attainable accuracy, and today we can model materials and molecules with a per-atom energy accuracy of up to 1 part in 10,000 with a speedup of over a million or more compared to the explicit quantum mechanical calculation, enabling molecular dynamics on large length and time scales. The most surprising aspect of the best model is its extreme generalisation: fitted only on small periodic crystals, it shows stable trajectories on arbitrary chemical systems, from water to nanoparticles and proteins. I will show some of the technical details behind the success of our models: equivariant many-body graph polynomials with very few and weak nonlinearities. The relationship between the architectural elements and the extreme generalisation is still largely a mystery. The locality of the graph structure is key to its success, as well as high body order and message passing. The force fields get significantly better with more data, yet model size and complexity can remain largely the same. Integrating explicit long range electrostatics with such general "foundational" force fields is a challenge, as well as combining large datasets for materials and organic molecules, due to the incompatibility of leading DFT approximations.
AI for Chemistry
9:40–10:00am Rafael-Gomez Bombarelli ML Gradients for Computational Chemistry
In the physical sciences, ML has found great synergy with physics-based simulations. On the one hand, simulations can provide abundant, reproducible and scalable training data. Machine learning models can act as surrogates of expensive simulators, allowing improvements in speed and accessible length- and time-scales. In this talk, we describe recent work in the use of differentiable programming and deep learning models to fuse optimization and learning in molecular simulations, including the use of differentiable uncertainty to power active learning, alchemical ML potentials that capture atomic disorder in solids, learning collective variables or transport operators for accelerated simulations, and using discrete generative models and reinforcement learning for surface reconstruction. Lastly, we encourage a discussion around the need to relate (ML-accelerated) simulations to tangible impact in chemicals and materials.
AI for Chemistry
10:00–10:20am Ekin Doǧuş Çubuk TBC
N/A
AI for Materials Science
Tea Break
10:40–11:20am Weinan E Building AI-Powered Infrastructure for Scientific Research
I will discuss the progress we have made towards building a new generation of AI-powered infrastructure for scientific research.
AI for Science
11:20–11:40am Marin Soljačić Novel computing paradigms
Certain novel schemes for computing that use photons (instead of electrons).
Unconventional Computing
11:40am–12:00pm Richard Parker TBC
N/A
AI for Science
Break
12:20–1:00pm Gerbrand Ceder AI and autonomous laboratories for materials synthesis
Computational materials science has seen tremendous progress since the early days of Density Functional Theories. Stable algorithms enabled high-throughput computing which in turn enabled machine-learned potentials (MLP). Though far from perfect at this point, MLPs hold tremendous promise for accelerating materials simulation and discovery. Such progress is not parallelled on the experimental side, making it the gating factor in materials development. In response we built A-lab, an autonomous facility for the closed-loop synthesis of inorganic materials from powder precursors. All synthesis and characterization actions in A-lab, including powder mixing and grinding, firing, characterization by XRD and SEM, and all sample transfers between them are fully automated, leading to a lab that can synthesize and structurally characterize compounds within 10–20 hours of initiation. The A-lab leverages ab-initio computations through an API with the Materials Project, historical data sets that are text-mined from the literature, machine learning for optimization of synthesis routes and interpretation of characterization data, and active learning to plan and interpret the outcomes of experiments performed using robotics. The automation of synthesis and analysis can be further integrated into scientific workflows similar to computational workflows.
Self-Driving Labs
1:00–1:40pm Kristin Persson Fueling The Era of Data-Driven Materials Design and Synthesis
Fueled by increased availability of materials data, machine learning is poised to revolutionize materials science by enabling accelerated discovery, design, and optimization of materials. As one of the first and most visible of materials data providers, the Materials Project (www.materialsproject.org) uses supercomputing and an industry-standard software infrastructure together with state-of-the-art quantum mechanical theory to compute the properties of all known inorganic materials and beyond. The data, currently covering over 160,000 materials and millions of properties, is offered for free to the community together with online analysis and design algorithms. Serving a rapidly expanding community of more than 600,000 registered users, the Materials Project delivers millions of data records daily through its API, fostering data-rich research across materials science. This wealth of data is inspiring the development of machine learning algorithms aimed at predicting material properties, characteristics, and synthesizability. However, we note that truly accelerating materials innovation also requires rapid synthesis, testing and feedback, seamlessly coupled to existing data-driven predictions and computations. The ability to devise data-driven methodologies to guide synthesis efforts is needed as well as rapid interrogation and recording of results – including ‘non-successful’ ones. This talk will outline the rise of data-driven materials design, predictive synthesis and showcase successes as well as comment on current pitfalls and future directions.
AI for Chemistry
Lunch
Afternoon Sessions
(Parallel Sessions)
2:40pm to 3:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics AI Agents and LLMs for Science Urban AI & Sustainability AI for Medicine and Healthcare
Speakers Kedar Hippalgaonkar Andrey Ustyuzhanin Invited Talk Invited Talk Invited Talk
Title Generative Design of Inorganic Materials TBC TBC TBC TBC
Abstract
Materials discovery is fundamental to advance next-generation technologies as well as for sustainable and circular economy. Beyond computational screening, generative models are efficient at finding materials with desired properties. Here, we introduce a Materials Generative Design and Testing (Mat-GDT) framework. Mat-GDT integrates property-directed generative design with accelerated validation through high-throughput simulations and experiments. The central idea of the framework is constructed around a foundation AI model for inorganic materials, which enables Mat-GDT to solve key challenges in sustainable design of functional materials. We demonstrate that domain-specific implementations of Mat-GDT will solve the outstanding challenge of data driven inverse design of inorganic functional materials.
N/A
Rapid accumulation of experimental data, coupled with the slow speed of its analysis, presents a significant challenge in modern chemistry and materials science. A substantial portion of experimental results remains underutilized, categorized as "sleeping" or "lost" data, which hinders innovation and slows down progress. In several data-intensive research projects in chemistry, only 10% of the data is analyzed manually, while around 90% of the data remains not analyzed. Artificial intelligence (AI) offers an efficient solution by enabling the systematic retrieval, organization, and analysis of such data, leading to improved decision-making and accelerated discovery.
N/A
N/A
3:00pm to 3:15pm
Speakers TBC Apivich Hemachandra Tong Xie, Zonglin Yang Tej S Choksi Xia Dong
Title Distance weighted self-attention for nonlocal density functional approximation by artificial neural network (TBC) PIED: Physics-Informed Experimental Design for Inverse Problems MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses MapReader: a framework for learning a visual language model for map analysis BrainOOD: Out-of-distribution Generalizable Brain Network Analysis
Abstract
3:15pm to 3:30pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics AI Agents and LLMs for Science Urban AI & Sustainability AI for Medicine and Healthcare
Speakers Artem Mishchenko Daniil Sherki Tong Xie Nitish Govindarajan Li Fang
Title Elf autoencoder for unsupervised exploration of flat-band materials using electronic band structure fingerprints Combining Flow Matching and Transformers for Efficient Solution of Bayesian Inverse Problems Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model DesCartes: Hybrid Artificial Intelligence for optimal planning and decision making in urban systems Decoding Brain Waves: EEG Attention Detection Across Performance Levels
Abstract
3:30pm to 3:45pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics AI Agents and LLMs for Science Urban AI & Sustainability AI for Medicine and Healthcare
Speakers Anupam Bhattacharya Caroline Chaux Su Jian Subrat Prasad Panda TBC
Title Triviality metric for high-throughput detection of topological flat band materials Algorithm unrolling for solving inverse problems in signal and image processing Hybrid and Generative Models for Material Science Event Extraction Programmatic Reinforcement Learning for Trustworthy Microgrid Management Classification of Brain Conditions (TBC)
Abstract
3:45pm to 4:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics AI Agents and LLMs for Science Urban AI & Sustainability AI for Medicine and Healthcare
Speakers Ruiming Zhu Alex Massucco Zhang Zhouran Hydar Saharudin Onno Kampman
Title Dis-CSP: Disordered Crystal Structure Predictions Finite-difference least square methods for solving Hamilton-Jacobi equations using neural networks MatPilot: an LLM-enabled AI Materials Scientist under the Framework of Human-Machine Collaboration Mapping Intelligibility: Hybrid AI, Sustainability, and “Wind Maps” of Singapore Conversational Self-Play for Discovering and Understanding Psychotherapy Approaches
Abstract
4:00pm to 4:20pm – Tea Break
4:20pm to 4:40pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Chemistry
Speakers Tian Xie Mario Lanza Martinez Limsoon Wong Mimi Hii Wen Jie Ong
Title Accelerating materials design with AI emulators and generators Memristor-based neuromorphic computation: status, challenges, and potential solutions AI as a partner in scientific exploration Building Lab of the Future Accelerating AI-Driven Chemical and Materials Discovery with NVIDIA ALCHEMI
Abstract
The design of novel materials has been a cornerstone of technological progress, driving transformative innovations such as the adoption of electric vehicles, the development of highly efficient solar cells, and the widespread use of superconductors in magnetic resonance imaging (MRI) systems. At Microsoft Research, we aim to advance foundational artificial intelligence (AI) capabilities to accelerate the materials discovery process and deliver real-world impact across diverse domains. We have created two state-of-the-art AI models to accelerate materials design. MatterGen is a generative AI model that improves the efficiency of exploring the vast chemical space. It directly generates novel material structures given a “prompt” specifying material properties for the target application. MatterSim is an AI emulator that predicts which of those proposed materials are viable and if they really have the properties we requested. It efficiently simulates a broad range of properties of many different materials, including metals, oxides, sulfides, halides, and their various states such as crystals, amorphous solids, and liquids. Working together, MatterGen and MatterSim function as a flywheel to accelerate the design of novel materials by thousands of times and deliver real-world impact in broad domains.
In this talk, I am going to present the status of memristor-based neuromorphic computation. I will present the state-the-art performance and the current challenges to overcome in the next five years. In this talk I will discuss two different strategies on how to overcome such problems. The first one consists of integrating two-dimensional materials at the back-end-of-line of silicon microchips, and we will present the performance of on-chip memristors made of multilayer hexagonal boron nitride. The second strategy will consist of unconventional biasing of established fully CMOS-compatible electronic devices to produce outstanding neuromorphic responses. I will also discuss the main technical problems to be faced in the next years when following each strategy, and I will provide some recommendations on how to solve them.
AI is often seen as a data analysis tool, but its real strength may lie in reading, synthesizing knowledge, and suggesting connections—like an average but somewhat reflective scientist with exceptional recall. This talk explores how AI aids discovery not by generating insights but by managing literature reviews, background research, and pattern recognition, allowing scientists to focus on critical thinking. Through some examples from my experience, I will show how AI helps navigate scientific information, synthesize ideas, and uncover overlooked connections.
Data scarcity is a major impediment for the development of effective AI models. The lack of unbiased ‘real world’ data is especially acute in Chemistry. There has been a significant effort in recent years to address this issue. In this presentation, I will share my personal journey of setting up the Centre for Rapid Online Analysis of Reactions to support AI-enabled research, and highlight some challenges in generating meaningful data to improve AI predictions.
The discovery of novel chemicals and materials will revolutionize various industries such as energy storage, environmental remediation, and manufacturing. Traditional discovery methods have been time-consuming and costly, often taking years or even decades to move from hypothesis to production. However, with the advent of AI, this paradigm is rapidly changing. AI technologies have the potential to revolutionize the way chemicals and materials are discovered and developed, making the process faster, more efficient, and more cost-effective. NVIDIA ALCHEMI (AI Lab for Chemistry and Materials Innovation) is at the forefront of this revolution, enabling AI to accelerate chemical and materials discovery by accelerating the components needed to deploy these AI methods in real-world workloads with maximum efficiency and usability.
4:40pm to 5:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Chemistry
Speakers Nikita Kazeev Wilfred G. van der Wiel Paul Smolensky Alexei Lapkin Xiaonan Wang
Title Generation of Novel Stable Beautiful Materials Reconfigurable Nonlinear Computing in Silicon Mechanisms of Symbol Processing in Transformers A knowledge foundation for symbolic AI in engineering AI-driven discovery and design of new materials and catalysts for sustainability
Abstract
Abstract
N/A
A major scientific goal is creating matter that can learn, where its behavior depends on both its present and history. This matter would have long-term memory, enabling autonomous interaction with its environment and self-regulation of actions. We may call such matter ‘intelligent’. Here we introduce a number of experiments towards ‘designless’ nanomaterial systems, where we make use of ‘material learning’ to realize functionality. By exploiting the nonlinearity and tunability of disordered silicon-based nanoelectronic devices – reconfigurable nonlinear computing units, RNPUs – we can significantly facilitate handwritten digit classification. An alternative material-learning approach is followed by mapping the RNPU on a deep-neural-network model, which allows applying standard machine-learning techniques to find functionality. We also introduced a gradient descent approach in materia, using homodyne gradient extraction. Recently, we showed that our devices are not only suitable for solving static problems but also highly efficient in real-time processing of temporal signals like speech recognition at room temperature.
Large Language Models (LLMs) have demonstrated impressive abilities in symbol processing through in-context learning (ICL). This success flies in the face of decades of predictions that artificial neural networks cannot master abstract symbol manipulation. We seek to understand the mechanisms that enable robust symbol processing in transformer networks, illuminating both the unanticipated success and the limitations of transformers in this domain. Drawing from symbolic AI and Production System architectures, we develop PSL—a high-level symbolic language—and create compilers that generate 100% mechanistically interpretable transformer models. PSL is shown to be Turing Universal, offering insights into transformer ICL and guiding the development of improved symbol-processing architectures.
Direct application of AI in chemistry or chemical engineering is frequently ineffective due to sparse data. To address this, a consortium funded by PIPS developed a knowledge graph-based framework to host model and process ontologies, creating a digital twin environment. This framework allows automated model construction, calibration, and identification against experimental data. It supports generating synthetic data and merging it with experimental datasets for AI applications. The system also enables symbolic AI reasoning, demonstrated through process model assembly with agents and reinforcement learning within the same knowledge base.
New materials and catalysts are vital to sustainable development but traditional experimentation is slow and resource-intensive. This talk highlights how artificial intelligence (AI) can accelerate chemical innovation by integrating with traditional methodologies. Key advances include data-driven active learning for complex systems, intelligent synthesis platforms, and a multi-scale data ecosystem. These enable AI-powered discovery and optimization, particularly for decarbonization in the chemical industry, offering scalable solutions through foundation models and enhanced knowledge integration across various scales.
5:00pm to 5:15pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Biology
Speakers Theo Jaffrelot Inizan Pol Benítez Colominas Peng Yongqian Jacob Lynge Elholm Tej S Choksi
Title Generative model for enhancing reticular material discovery Thermal Effects on Optoelectronics: A Graph Neural Network Approach Information fusion strategy based on language model and contrastive learning for knowledge retrieval of metallic materials Accelerating the characterisation of molecular photoswitches for solar thermal energy storage Harnessing Physics-inspired Machine Learning to Design Nanocluster Catalysts for Dehydrogenating Liquid Organic Hydrogen Carriers (TBC)
Abstract
5:15pm to 5:30pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Chemistry
Speakers Valentyn Volkov Wu Tianqi Qian Hangwei Bogdan Protsenko Nitish Govindarajan
Title AI-empowered discovery of novel materials for smart electronic devices Prediction of the Future State of Photobioreactors Using Time Series Prediction Algorithm CogCommon: Enhancing Cross-Domain Knowledge Extraction with LLM-Assisted Commonality Discovery Multispectral diagnostics of catalytic reactions in microfluidic systems Comprehensive sampling of coverage effects in catalysis by leveraging generalization in neural network models
Abstract
5:30pm to 5:45pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Chemistry
Speakers Peichen Zhong Jiong Lu Leo McKee-Reid Runze Zhang Mikhail V. Polynski
Title Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials AI for carbon-based quantum materials Towards Data-Driven Scientific Discovery AutoEIS: Automated Bayesian Model Selection and Analysis for Electrochemical Impedance Spectroscopy Machine Learning-Powered Exploration of Catalytic Reaction Networks for CO₂ Conversion into Value-Added Chemicals
Abstract
5:45pm to 6:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI Agents and LLMs for Science Self-Driving Labs AI for Chemistry
Speakers Yulin Cai Yue Li, Zhao Shi Pablo Sanchez Martin Anton Paar TBC
Title Apply DFT to DFT: Symmetry-preserved Generation of Crystalline Electron Densities In-memory Subnet Computation for Area and Energy Efficient AI Literature-based Hypothesis Generation: Predicting the evolution of scientific literature to support scientists Rheological Characterization of Polymers and Automated Solutions for Improved Efficiency and Insight Redefining Catalysis Predictions Through Physics-Based Gaussian Model and Data-Driven Benchmarks: AuPd Alloy in Oxygen Reduction Reaction Catalysis for Fuel Cell Applications (TBC)
Abstract
End of Day 1
Morning Sessions
All morning sessions are to be held at Auditorium 1
Time Speaker Title Abstract Topic
8:50amSession Chair / Announcements
9:00–9:40am Klaus Robert Müller AI for the sciences -- toward understanding
In recent years, machine learning (ML) and artificial intelligence (AI) methods have begun to play a more and more enabling role in the sciences and in industry. In particular, the advent of large and/or complex data corpora has given rise to new technological challenges and possibilities. In his talk, Müller will touch upon the topic of ML applications in the sciences, in particular in medicine, physics, chemistry. He will focus on techniques from explainable AI and their use for extracting information from machine learning models in order to further our understanding by explaining nonlinear ML models. Finally, Müller will briefly discuss perspectives and limitations.
AI for Science
9:40–10:00am Jack Wells TBC
TBC
TBC
10:00am-10:20am Yu Xie Scalable emulation of protein equilibrium ensembles with generative deep learning
Following the sequence and structure revolutions, predicting the dynamical mechanisms of proteins that implement biological function remains an outstanding scientific challenge. Several experimental techniques and molecular dynamics (MD) simulations can, in principle, determine conformational states, binding configurations and their probabilities, but suffer from low throughput. Here we develop a Biomolecular Emulator (BioEmu), a generative deep learning system that can generate thousands of statistically independent samples from the protein structure ensemble per hour on a single graphical processing unit. By leveraging novel training methods and vast data of protein structures, over 200 milliseconds of MD simulation, and experimental protein stabilities, BioEmu’s protein ensembles represent equilibrium in a range of challenging and practically relevant metrics. Qualitatively, BioEmu samples many functionally relevant conformational changes, ranging from formation of cryptic pockets, over unfolding of specific protein regions, to large-scale domain rearrangements. Quantitatively, BioEmu samples protein conformations with relative free energy errors around 1 kcal/mol, as validated against millisecond-timescale MD simulation and experimentally-measured protein stabilities. By simultaneously emulating structural ensembles and thermodynamic properties, BioEmu reveals mechanistic insights, such as the causes for fold destabilization of mutants, and can efficiently provide experimentally-testable hypotheses.
AI for Biology
Tea Break
10:40–11:20am Alex Aliper From Algorithm to Human Clinical Trials: Accelerating Drug Discovery and Development With Generative AI and Robotics
In this talk we will cover the application of AI to disease modeling, target discovery, indication prioritization, indication expansion, and small molecule drug design. We will explore key case studies and cover the current state of the industry, highlighting its limitations, bottlenecks, and opportunities for advancing drug discovery. We will also discuss the applications of generative AI to development of foundational models for chemistry and multi-species multi-omics life models for aging and fundamental biological research.
AI for Biology
11:20–11:40am Wei Lu TBC
TBC
Unconventional Computing
12:00–12:20pm Yujie Huang KronosAI
KronosAI has achieved state-of-the-art for silicon photonics. We are building foundation model-based physics solvers that are faster, more accurate and more generalizable than traditional numerical solvers. Yujie will discuss insights gained from KronosAI's training, and how our work is paving the way for sustainable, eco-friendly engineering by integrating inverse design and design space exploration into everyday practice.
AI for Physics
12:20–12:20pm Hsin-Yuan Huang The vast world of quantum advantage
While quantum devices promise extraordinary capabilities, discerning genuine advantages from mere illusions remains a formidable challenge. In this endeavor, quantum theorists are like prophets, striving to foretell a future where quantum technologies will transform our world. Most people understand quantum advantage primarily as offering faster computation. This talk explores the vast world of quantum advantages that extends far beyond computation to include fundamental advantages in learning, sensing, cryptography, and memory storage. I will also demonstrate how some quantum advantages are inherently unpredictable using classical resources alone, suggesting a landscape far richer than currently envisioned. As quantum technologies proliferate, these unpredictable advantages may enable transformative applications beyond the reach of our classical imagination.
Unconventional Computing
Break
Poster Session & Lunch
Afternoon Sessions
(Parallel Sessions)
2:40pm to 3:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics AI for Chemistry AI for Education & Policy AI for Medicine and Healthcare
Speakers Abhishek Singh Invited Talk Invited Talk Invited Talk Invited Talk
Title AI-based Hierarchical Representations of Materials for Structure–Property Prediction TBC TBC TBC TBC
Abstract
Materials representation across multiple length scales is essential for enabling AI models to solve structure-to-property prediction problems in complex systems such as superalloys. Properties like Vickers hardness are primarily governed by microstructural features and formation energy correlates to atomic arrangements and, making it crucial to capture relevant information from different structural hierarchies. Accurate and interpretable representations across these scales allow machine learning approaches to accelerate materials discovery and design. At the microstructural level, three frameworks are used to represent image-based information: (1) statistical representations using 2-point spatial correlations to capture phase distribution patterns, (2) geometry-driven image processing techniques that extract morphological descriptors such as area, perimeter, and shape of precipitates, and (3) deep learning models like convolutional neural networks that automatically learn hierarchical features directly from raw SEM images. These image-derived features are combined with metadata such as composition and processing history to predict mechanical properties like Vickers hardness. At the atomic level, graph-based representations like the CLEAR (Chemistry and Local Environment Adaptive Representation) descriptor-based model represents crystal structures by combining elemental properties with interatomic distances through Voronoi-based neighbours. By applying pooling operations, these graph features are transformed into fixed-size vectors that enable predictive modelling of formation energy and phase stability.
N/A
N/A
N/A
N/A
3:00 to 3:15pm
Speakers Roman Eremin Wang Benquan Liwei Yu Kenneth Y T Lim Xu Ting
Title Data-driven assessment of thermodynamic stability and search for competing phases: Application to the 2D Material Defect dataset Seeing the Invisible: Breaking the Diffraction Limit with Geometry-aware Deep Learning No-Free-Lunch Theories for Tensor-Network Machine Learning Models Exploratory investigation of electrodermal activity in learning from a large language model versus from curated texts RetiRAG: A Retrieval-Augmented Generation Framework for Specialized Ophthalmology Applications
Abstract
3:15pm to 3:30pm
Speakers Jiadong Dan Mihir Rajendra Athavale Alastair Price Ng Sook Mun Viktor Schlegel
Title Bridging Contextual Information in Deep Learning for Structural Defect Classification Diffusion Model-Driven Optimization of High-Efficiency Wavelength-Scale Microring Lasers Adapting hybrid density functionals with machine learning Simulating Professional Workplaces: A Pedagogical Framework for Generative AI-Powered Role-Play for Competency-Based Education MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues
Abstract
3:30pm to 3:45pm
Speakers Alexander Shapeev, Timofei Miryashkin Wang Benquan Paul Fuchs Amelie Favreau Zhou Jun
Title Fusion of quantum-mechanical and experimental data for phase diagram calculation Topology-Preserving Deep Learning for Structural Integrity in Optical Semiconductor Characterization at Deeply Subwavelength Resolution chemtrain: Learning deep potential models via automatic differentiation and statistical physics AI regulations under the EU AI Act in critical urban systems Effectiveness of Graph Neural Network Operators in Virulence Classification of Mouse-Influenza A Protein Interactions
Abstract
3:45pm to 4:00pm
Speakers Savyasanchi Aggarwal Mihir Rajendra Athavale Vir Karan Jie Gao Nigel Foo Hon Wei
Title Predicting defect formation energies in semiconductors using machine learning Simultaneous Optimization of Yield, Threshold, and Wavelength in Microring Lasers Using Bayesian Optimization Decoding kinetic selectivity in diffusion-limited solid state synthesis reactions through machine-learning accelerated molecular dynamics AI and Evidence-Based Policymaking: Who Stays in the Human-AI Loop and Why? Personalized dose selection platform for patients with solid tumors in the PRECISE CURATE.AI feasibility trial
Abstract
4:00pm to 4:20pm – Tea Break
4:20pm to 4:40pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science
AI for Physics
AI for World Domination ML Algorithmic Advances
AI for Biology
Speakers Isao Tanaka Yong Xu Invited Talk Kharen Musaelian Zhang Yang
Title Recommender system for discovery of new inorganic compoundsa Deep learning density functional theory and beyond TBC TBC Deep learning-based protein structure prediction with D-I-TASSER
Abstract
We have adopted methods to establish recommender systems useful for efficient discovery of currently unknown chemically relevant compositions (CRCs) of inorganic ionic compounds from vast candidates. Using information of compounds registered in inorganic crystal structure database (ICSD) and tensor-based recommender system, we systematically calculated recommendation scores for chemical compositions of up to 23 billion five-element ionic systems. The high success rate, despite not using any prior knowledge or first principles theoretical results, proves the powerfulness of current recommendation systems. Synthesis experiments were made at the chemical compositions with high recommendation scores. Novel oxides and nitrides were efficiently discovered accordingly. Recommender system was also used to predict successful processing conditions for new compounds based on our parallel experiment-dataset.
First-principles methods based on density functional theory (DFT) have become indispensable tools in physics, chemistry, materials science, etc., but are bottlenecked by the efficiency-accuracy dilemma. The integration of first-principles methods with deep learning offers a transformative opportunity to overcome these limitations. In this talk, I will explore the emerging interdisciplinary field of deep-learning DFT, which employs advanced deep learning techniques to address key limitations in DFT computations. Specifically, I will present our recent work on developing a deep neural network framework, DeepH, that learns the relationship between the DFT Hamiltonian and atomic structures [1-3]. Trained on DFT data for small structures, these neural network models can generalize to predict properties of unseen large material structures without invoking time-consuming DFT self-consistent field iterations, making efficient and accurate study of large-scale materials feasible. Combined with recent methodological developments, these innovations pave the way for deep-learning electronic structure calculations [4-12]. This paradigm shift promises to transform the landscape of first-principles computations, significantly accelerating future materials discovery and design.
N/A
N/A
Recent breakthroughs in AI-driven methods such as AlphaFold have reshaped the landscape of protein structure prediction, raising questions about the continued relevance of traditional physics-based simulations. To bridge the gap between these two paradigms, we developed D-I-TASSER, a hybrid approach that integrates deep learning-derived inter-residue potentials with iterative Monte Carlo fragment assembly simulations, enhanced by a novel domain-splitting strategy tailored for large, multi-domain proteins. Benchmark tests and CASP assessments demonstrate that D-I-TASSER significantly surpasses both AlphaFold2 and AlphaFold3 in atomic-level accuracy, particularly for complex multi-domain targets. Large-scale folding experiments reveal that D-I-TASSER achieves correct folding in 81% of domains and 73% of full-length proteins across the human proteome. These results not only bring physics-based folding simulations back into the mainstream of protein structure prediction, but also underscore the compelling power of integrating cutting-edge deep learning techniques with physics-based modeling for high-accuracy, genome-scale prediction of protein structure and function.
4:40pm to 5:00pm
Speakers Stephen Gregory Dale Anatole von Lilienfeld Invited Talk Emily Lines Ron Dror
Title Identifying and embedding transferability in data-driven representations of chemical space Quantum machine learning in chemical space TBC Challenges for reliable forest monitoring with high resolution remote sensing and AI AI for Structure-Based Drug Design
Abstract
Many of the most relevant observables of matter depend explicitly on atomistic and electronic structure, rendering physics-based approaches to chemistry and materials necessary. Unfortunately, due to the combinatorial scaling of the number of chemicals and potential reaction settings, gaining a holistic and rigorous understanding through exhaustive quantum and statistical mechanics-based sampling is prohibitive --- even when using high-performance computers. Accounting for explicit and implicit dependencies and correlations, however, will not only deepen our fundamental understanding but also benefit exploration campaigns (computational and experimental). I will discuss recently gained insights from my lab elucidating such relationships thanks to alchemical perturbation density functional theory and supervised machine learning.
N/A
In recent years there have been concurrent rapid rises in high resolution remote sensing forest data and deep learning methods to exploit these for large-scale forest monitoring. Such methods have shown to be very powerful in rapidly retrieving properties of interest of large areas of forest with little effort, at least compared to standard field techniques. However, many deep learning approaches have been developed using data collected at small spatio-temporal scales, in a single or small number of forest types, and without ground validation data. Further, this approach often involves intensive and time-consuming data collection and processing, but generates results restricted to specific ecosystems and sensor types. There is a lack of widespread acknowledgement of how the types and structures of data used affects performance and accuracy of analysis algorithms, or the need for high quality ground truthing, leading to unrealiable validation data and inappropriate application of tools for monitoring.
Recent years have seen dramatic advances in both computational prediction and experimental determination of protein structures. These structures hold great promise for the discovery of highly effective drugs with minimal side effects, but using structures to design such drugs remains challenging. I will describe recent progress toward this goal, with a focus on machine learning approaches.
5:00pm to 5:15pm
Speakers TBC Brandon Sutherland, Chen Hao Xia Qian Wang Grigoriy Shutov Kevin Michalewicz
Title Probing out-of-distribution generalization in machine learning for materials (TBC) Learning the Hamiltonian of Large, Disordered Atomic Systems CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading Wells and context aware diffusion model for geologic cross section generation ANTIPASTI: interpretable prediction of antibody binding affinity exploiting Normal Modes and Deep Learning
Abstract
5:15pm to 5:30pm
Speakers Sherif Abdulkader Tawfik Abbas Igor Poltavskyi Dinil Mon Divakaran Jiaxi Zhao Karis Kungsamutr
Title Embedding material graphs using the electron-ion potential: application to material fracture Crash testing machine learning force fields for molecules, materials, and interfaces: molecular dynamics in the TEA challenge 2023 Large Language Models for Cybersecurity: New Opportunities Generative subgrid-scale modeling Unsupervised Learning of Transient Structural Motifs of ssDNA Using Translationally and Rotationally Invariant Features
Abstract
5:30pm to 5:45pm
Speakers TBC Adil Kabylda TBC Benjamin Y J Wong Deepan Balakrishnan
Title Bridging the AI4Materials Innovation Gap — A Startup’s Blueprint for Industrial Impact (TBC) Molecular Simulations with a Pretrained Neural Network and Universal Pairwise Force Fields Contributed Talk Embedding Theoretical Baselines For Satellite Force Estimations Physics-Informed Automatic Differentiation for Single-Shot Nanoscale 3D Imaging in In Situ Transmission Electron Microscopy
Abstract
5:45pm to 6:00pm
Speakers TBC Ivan S. Novikov TBC Ekaterina V. Skorb Ivona Martinovic
Title Finding Perovskite Composites With Preferable Features: Simple ML algorithms (TBC) Magnetic moment tensor potentials for investigating magnetic materials Contributed Talk Deep Topological Denoising: Autoencoder-Driven Noise Suppression for Atomic Force Microscopy data with Topological Data Analysis Validation Generalization and Scoring in RNA 3D Structure Prediction: A Benchmarking Study
Abstract
End of Day 2
Morning Sessions
All morning sessions are to be held at Auditorium 1
Time Speaker Title Abstract Topic
8:50amSession Chair: Terence O'Kane
9:00–9:40am Stan Posey Directions in Energy Efficient AI for Driving Earth Digital Twins
AI is becoming a critical component of Earth system science workflows that are experiencing rapid growth in data from model output of increasing resolution in weather and climate models, and Earth observation systems that produce orders of magnitude more data than their previous generations. Efforts are underway in the weather and climate modeling community towards refining the horizontal resolution of atmosphere GCMs towards km-scale to explicitly resolve certain small-scale convective cloud processes and provide more realistic local information on climate change. At the same time, exascale HPC systems have arrived and in most cases are designed with GPU accelerator technology that offers opportunities in reasonable simulation turn-around times balanced with efficiency in energy consumption. Ultimately, output from global storm-resolving models at km-scale will become the essential driver behind the deployment of Earth digital twins for programs like the EC Destination Earth and NVIDIA Earth-2. For model emulation of the Earth system, AI models become increasingly accurate as they train on more data, yet computational and storage requirements in data-distributed computing environments with energy-efficiency considerations are the current challenges for the HPC vendor community. This talk will describe advances in HPC for GPU-accelerated numerical models, AI software and system features for large-scale data handling, and ML model training and inference that when combined, provide the critical components towards the vision of Earth system digital twins.
AI for Climate and Weather
9:40-10:00am Wessel Bruinsma A Foundation Model for the Earth System: Air Pollution and Ocean Waves
Aurora is a foundation model for the Earth system pretrained on a large and diverse collection of geophysical data. The key ability of Aurora is that the model can be fine-tuned to produce forecasts for a wide variety of environmental forecasting applications, often matching or even outperforming state-of-the-art traditional approaches at a fraction of the computational cost. In this first part of a two-part talk on Aurora, I will discuss the concept of a foundation model for the Earth system and show how Aurora can be fine-tuned to produce state-of-the-art operational forecasts for air pollution and ocean waves.
AI for Climate and Weather
10:00-10:20am TBC TBC
N/A
TBC
Tea Break
10:40-11:00am Dale Barker Preparations for Next-Generation Weather and Climate Modelling at the Centre for Climate Research Singapore (CCRS)
The tropical urban nature of the weather and climate in the Singapore region places particular requirements on future observation, models and IT infrastructure. Significant progress has been made in recent years to meet these requirements through added value km-scale Numerical Weather Prediction (NWP) and regional climate projections based largely on physical climate modelling. However, recent advances in AI4Weather/Climate create exciting opportunities for further added value. This talk will provide an overview of CCRS’ plans to move from the current generation physical climate/weather modelling system (based on the Unified Model - UM) to next-generation weather and climate modelling approaches employing a combination of physical, hybrid AI and fully-data driven models suitable for applications in the Singapore region.
AI for Climate and Weather
11:00-11:40am Sam Altman (TBC) TBC
TBC
AI Agents and LLMs for Science
11:40-12:00pm Boris Kozinsky TBC
TBC
AI for Chemistry
12:00-12:20pm Antonio Helio Castro Neto TBC
TBC
AI for Materials Science
Break
Poster Session & Lunch
Afternoon Sessions
(Parallel Sessions)
2:40pm to 3:00pm
Venue Auditorium 1 Lecture Theatre 50 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics ML Algorithmic Advances AI for Biology AI for Medicine and Healthcare
Speakers Stephen Roche David Rousseau Invited Talk Invited Talk Invited Talk
Titles Artificial Intelligence enabling disruptive Innovative Advanced Materials and Devices design & engineering The impact of Artificial Intelligence on precision Higgs Boson physics TBC TBC TBC
Abstract
I will discuss the emerging need to massively use and deploy Artificial Intelligence (AI) techniques in materials and devices design and engineering, by overviewing our journey into the research of innovative advanced materials using AI in the field of interconnects technologies. I will point towards the revolution of fully describing as-grown materials with AI-driven reconstruction of digital models/twins with atomistic modelling, hence bringing the accuracy of first principles methods to unprecedented multimillions atoms models. This opens a tremendous capability to correlate in-depth atomistic-scale features of as-grown materials with resulting local and global chemical, physical and devices properties, offering reverse engineering and AI-driven co-piloting for industrial innovation strategies.
Particle physics has a rich history of developing advanced simulators, ranging from modelling proton collisions to creating virtual detectors. We are now integrating Artificial Intelligence through "Neural Simulation Based Inference," which enables the analysis of complex phenomena that cannot be computed analytically from first principles.
N/A
N/A
N/A
3:00 to 3:15pm
Speakers Salanne Mathieu TBC Chen Mengyi Anoop Patil Kerem Delikoyun
Titles Machine learning of electronic charge densities in the simulation of electrochemical interfaces Deep Learning Based Event 4-Dimensional Track Reconstruction in LArTPC Detector Learning Macroscopic Dynamics from Partial Microscopic Observations Variational Autoencoders Capture Plant Biomolecular Stress Information OAH-Net: a deep neural network for efficient and robust hologram reconstruction for off-axis digital holographic microscopy
Abstract
3:15 to 3:30pm
Speakers Andy Paul Chen Pinaki Sengupta Shiqi Wu Jie Zhang Meng Wang
Title Virp: neural network-accelerated prediction of physical properties in site-disordered materials Spin-1/2 kagome Heisenberg antiferromagnet: Machine learning discovery of the spinon pair-density-wave ground state Learning Dynamics of Nonlinear Field-Circuit Coupled Problems with a Physics-Data Combined Model Sparse Autoencoders Reveal Interpretable Structure in Small Gene Language Models Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model
Abstract
3:30 to 3:45pm
Speakers Luo Ran Ron Ziv Dmitry Sorokin Kwoh Chee Keong Nicholas Ng
Title Machine Learning-Driven Insights for Phase-Stable FAxCs1–xPb(IyBr1–y)3 Perovskites in Tandem Solar Cells Utilizing Machine Learning for Identifying Quantum-Many-Body Phase Transitions Magnetic control of tokamak plasmas through deep reinforcement learning with privileged information Effectiveness of Graph Neural Network Operators in Virulence Classification of Mouse-Influenza A Protein Interactions Parallelized, Low-cost, and Real-time Machine Learning Model for Immune Cell Morphological Profiling
Abstract
3:45 to 4:00pm
Speakers Yeo Zhen Yuan Yixiao Yang Beatrice Soh Peter Wang Igor Balashov
Title Unsupervised Machine Learning for Phase Identification and Characterization of High-Resolution STEM EELS in Novel Battery Materials Physics-informed neural network for single-shot phase retrieval in cryo-EM Learning Non-Equilibrium Dynamics of Polymer Chains: A Data-Driven Approach Flash optimization of drug combinations for Acinetobacter baumannii with IDentif.AI-AMR Expanding AI-Based Optical Urinalysis with Synthetic Data: A Large-Scale 849-Patient Study
Abstract
4:20 to 4:40pm
Speakers Artem Maevskiy Li Qianxiao Jürgen Bajorath Leonard Ng Wei Tat Chenru Duan
Title Discovering solid electrolytes through the analysis of machine-learned potential energy surfaces Constructing macroscopic dynamics using deep learning Chemical language models and their learning characteristics Beyond Automation: Self-Driving Laboratories as Engines for Next-Generation Photovoltaic Innovation and Scale-Up Generative Modeling on Sampling Rare Events in Chemistry Reactions
Abstract
We discuss some recent work on constructing stable and interpretable macroscopic dynamics from trajectory data using deep learning. We adopt a modelling approach: instead of generic neural networks as functional approximators, we use a model-based ansatz for the dynamics following a suitable generalisation of the classical Onsager principle for non-equilibrium systems. This allows the construction of macroscopic dynamics that are physically motivated and can be readily used for subsequent analysis and control. We discuss applications in the analysis of polymer stretching in elongational flow. Moreover, we will also discuss some algorithmic challenges associated with learning (macroscopic) dynamics for scientific applications.
In the life sciences and drug discovery, a variety of generative machine learning models are utilized for different applications. Among these are chemical language models (CLMs) that are based on deep learning architectures adopted from natural language processing. CLMs learn textual representations of molecular structure and probability distributions to predict new chemical matter and are often conditioned by context-dependent rules such a specific property constraints. Transformers have become preferred CLM architectures. Hallmarks of transformer CLMs include the self-attention mechanism and ability to learn a variety of mappings of molecular representations and associated property measures. The ensuing versatility of CLMs in addressing different machine translation tasks provides new opportunities for generative molecular design. Transformer CLMs often deliver promising results in off-the-beaten-path prediction tasks. However, rationalizing predictions of these models is challenging and a topical issue in explainable artificial intelligence (XAI). So far, transformer predictions have mostly been analyzed by determining attention weight distributions and attention flow, but other approaches are beginning to emerge. For instance, depending on the application, careful control calculations often help to unveil model-specific learning characteristics. This is often crucial to avoid over-interpretation of predictions or confusion caused by “Clever Hans” effects.
The transition from laboratory discovery to commercial manufacturing remains a critical challenge in photovoltaic technology development. This talk presents our comprehensive vision for self-driving laboratories (SDLs) that not only accelerate development but fundamentally reshape how we approach photovoltaic innovation. We begin by demonstrating our recent achievements in roll-to-roll manufacturing of perovskite solar cells, where our SDL platform enabled the fabrication of modules with 11% efficiency under ambient conditions, processing over 11,800 devices in 24 hours. This unprecedented throughput, combined with our cost projections of ~0.7 USD/W, establishes a new benchmark for scalable photovoltaic manufacturing. However, the true potential of SDLs extends far beyond automation. We will discuss our perspective on the evolution of SDL paradigms - from current focused optimization approaches to future systems capable of autonomous hypothesis generation and testing. This includes our ongoing work towards hybrid perovskite solar cells (HPSCs) through cross-domain optimization, where SDLs simultaneously consider materials properties, device architectures, and manufacturing constraints. Our latest innovations in integrating biomass-derived materials and developing novel device structures demonstrate how SDLs can unlock previously unexplored pathways in photovoltaic development. We propose a roadmap for future SDL development that emphasizes the democratization of these technologies, integration of manufacturing concepts from the outset, and the potential for autonomous agents to accelerate discovery.
Transition state (TS) search is key in chemistry for elucidating reaction mechanisms and exploring reaction networks. The search for accurate 3D TS structures, however, requires numerous computationally intensive quantum chemistry calculations due to the complexity of potential energy surfaces. We developed an object-aware SE(3) equivariant diffusion model (OA-ReactDiff) that satisfies all physical symmetries and constraints for generating sets of structures – reactant, TS, and product – in an elementary reaction. By learning the joint distribution of reactant, TS, and product, OA-ReactDiff can sample novel reactions that go beyond empirical expectation of chemists. With an initial guess adopting prior knowledge in chemical reactions and optimal transport framework, we developed React-OT, a model tailored for TS generation. Provided reactant and product, React-OT generates a TS structure in sub-seconds instead of hours, which is typically required when performing quantum chemistry-based optimizations. The generated TS structures achieve a median of 0.05 Å root mean square deviation compared to the true TS. We envision the proposed approach useful in constructing large reaction networks with unknown mechanisms.
4:40 to 5:00pm
Speakers Kostya Novoselov Gianmarco Mengaldo Andrey Ustyuzhanin Sergei V. Kalinin Valentine P. Ananikov
Title TBC TBC TBC Rewards are all we need: building autonomous materials s Solving the Problem of Sleeping and Lost Data in Chemistry with Artificial Intelligence
Abstract
TBC
TBC
TBC
The trajectory of scientific research worldwide is guided by long-term goals, spanning the spectrum from curiosity and fundamental discoveries in physics to the applied challenges of enhancing materials and devices for a wide array of applications. However, the execution and assessment of daily research efforts typically hinges on multiobjective reward functions, which can be evaluated either during or at the conclusion of an experimental campaign. Although this concept is tacitly acknowledged within the scientific community, the implementation of autonomous experimental workflows in automated laboratories necessitates the formulation of robust reward functions and their seamless integration across various domains. Should these reward functions be universally established, the entirety of experimental efforts could be conceptualized as optimization and decision making problems. Here, I will present our latest advancements in the development of autonomous research systems based on electron and scanning probe microscopy, as well as for automated materials synthesis based on reward driven workflows and reward integration across domains. We identify several categories of reward functions that are discernible during the experimental process, including imaging optimization, fundamental physical discoveries, the elucidation of correlative structure-property relationships, and the optimization of microstructures. The operationalization of these rewards function on autonomous microscopes is demonstrated, as well as strategies for human in the loop intervention. Utilizing these classifications, we construct a framework that facilitates the integration of multiple optimization workflows, demonstrated through the synchronous orchestration of diverse characterization tools across a shared chemical space, and the concurrent navigation of costly experiments and models that adjust for epistemic uncertainties between them.
Rapid accumulation of experimental data, coupled with the slow speed of its analysis, presents a significant challenge in modern chemistry and materials science. A substantial portion of experimental results remains underutilized, categorized as "sleeping" or "lost" data, which hinders innovation and slows down progress. In several data-intensive research projects in chemistry, only 10% of the data is analyzed manually, while around 90% of the data remains not analyzed. Artificial intelligence (AI) offers an efficient solution by enabling the systematic retrieval, organization, and analysis of such data, leading to improved decision-making and accelerated discovery.
5:00 to 5:15pm
Speakers Artem Dembitskiy Feng Ling Aravindan Kamatchi Sundaram Le Duy Dung Tianze Zheng
Title Datasets for Benchmarking Machine Learning Models for Accelerated Search of Fast Ionic Conductors Multiple Descents in Deep Learning as a Sequence of Order-Chaos Transitions Development and Analysis of the Largest, Accurate, and Comprehensive Multicomponent (High Entropy) Alloy Database Using Large Language Models Finding Environmentally-Friendly Chemical Synthesis with AI and High-Throughput Robotics Data-Driven Parametrization of Molecular Mechanics Force Fields for Expansive Chemical Space Coverage
Abstract
5:15 to 5:30pm
Speakers Riko I Made Yuxuan Yang Ekaterina V. Skorb, Mariia Ashikhmina Simon Rihm Zhiyuan Liu
Title Health Diagnosis and Recuperation of Aged Li-ion Batteries Using Data Analytics Dynamical Systems' Predictability Using Machine Learning Large Language Model for Automating the Analysis of Cryoprotectants Digital Exploration of Metal-Organic Polyhedra in The World Avatar NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
Abstract
5:30 to 5:45pm
Speakers Bryant Li Zhou Fang Bram Hoex Yue Li Zarif Ikram
Title Machine Learning-Aided Atomistic Modeling of Solid-State Electrolyte Interphase Formation for Li/Li₇P₃S₁₁ Dynamical Error Metrics for Machine Learning Forecasting DARWIN 1.5: Large Language Models as Materials Science-Adapted Learners Transforming the Synthesis of Carbon Nanotubes with Machine Learning Models and Automation Gradient-Guided Discrete Walk-Jump Sampling for Biological Sequence Generation
Abstract
5:45 to 6:00pm
Speakers Alexandre Duval Anshuman Pasupalak Ben Leong Flore Mekki-Berrada Yoon Ji Wei
Title From Generative Models to Real-World Materials: Entalpic’s AI-Driven Discovery Loop Weighted Confidence Ensemble for Uncertainty Quantification in Physics-Informed Neural Networks Automated Benchmarking of Large Language Models: Applying Regression to Estimate LLM Accuracy Self-Driving Labs: How to Tell the Optimizer Not to Sample in an Undesired Region? Leveraging Implicit Representations of Atomistic Foundation Models for Generation and Optimization of Molecules
Abstract
End of Day 3
Morning Sessions
All morning sessions are to be held at Auditorium 1
8:50amSession Chair: Ryo Yoshida
Time Speaker Title Abstract Topic
9:00–9:40am Aaron Chatterji (TBC) TBC
N/A
AI Agents and LLMs for Science
9:40–10:00am Alexandre Tkatchenko Realizing Schrödinger's Dream with AI-Enabled Molecular Simulations
The convergence between accurate quantum-mechanical (QM) models (and codes) with efficient machine learning (ML) methods seem to promise a paradigm shift in molecular simulations. Many challenging applications are now being tackled by increasingly powerful QM/ML methodologies. These include modeling covalent materials, molecules, molecular crystals, surfaces, and even whole proteins in explicit water. In this talk, I will attempt to provide a reality check on these recent advances and on the developments required to enable fully quantum dynamics of complex functional (bio)molecular systems. Multiple challenges are highlighted that should keep theorists in business for the foreseeable future.
AI for Physics
10:00–10:20am Benjamin Sanchez-Lengeling TBC
N/A
AI for Materials Science
Tea Break
10:20–10:40am Xujie Si The Science and Engineering of Autoformalizing Mathematics: A Case Study in Euclidean Geometry
Formalizing mathematics into machine-checkable logic is essential for advancing scientific rigor and enabling powerful AI reasoning. However, the process of translating informal mathematical text into formal languages remains a major bottleneck. This talk explores the challenge of autoformalization—the automated conversion of natural mathematical language into formal logic—through the lens of Euclidean geometry, one of the oldest and most foundational domains in mathematics. I will present insights from our recent work on LeanEuclid and PyEuclid, which demonstrate how modern Large Language Models (LLMs), combined with formal methods, can help bridge the gap between informal and formal mathematical reasoning.
N/A
10:40–11:00am Yang-Hui He The AI Mathematician
We argue how AI can assist mathematics in three ways: theorem-proving, conjecture formulation, and language processing. Inspired by initial experiments in geometry and string theory in 2017, we summarize how this emerging field has grown over the past years, and show how various machine-learning algorithms can help with pattern detection across disciplines ranging from algebraic geometry to representation theory, to combinatorics, and to number theory. At the heart of the programme is the question how does AI help with theoretical discovery, and the implications for the future of mathematics.
AI for Mathematics
11:00–11:20am Sergei Gukov What kind of game is mathematics?
It comes as no surprise that solving challenging research-level math problems drives progress in mathematics. What is more surprising, though, is that solving such long-standing open problems also contributes to an entirely different field: the development of the next generation AI systems. We live in an exciting time where mathematics and AI can greatly benefit each other, and the goal of the talk is to explain how and why, drawing on specific examples from knot theory and combinatorial group theory. Based on recent work with Ali Shehper, Anibal Medina-Mardones, Lucas Fagan, Bartłomiej Lewandowski, Angus Gruen, Yang Qiu, Piotr Kucharski, and Zhenghan Wang.
AI for Mathematics
11:20–11:40am Nicola Marzari The electronic-structure genome of inorganic materials
The structure and properties of inorganic materials have been extensively explored in the last decade with machine learning models built on computational databases. Typically, the descriptors are based on atomic positions, and the properties are thermodynamic quantities such as energies, forces, stresses. This extremely successful paradigm has even given rise to foundational - i.e., universal - machine learning models. Here, we switch our attention to electronic-structure properties, and to electronic-structure descriptors. For this, we have built robust and reliable protocols able to map automatically the electronic-structure of a material (typically calculated at the level of density-functional theory) into the exact but also minimal set of maximally localized Wannier functions. These latter provide the most compact representation of any desired manifold of electronic structure bands. We constructed more than 1.3M Wannier functions for 20,000+ inorganic, stoichiometric, and experimentally known materials, drawn from the Materials Cloud MC3D database. For these, we explore materials or materials combinations that could deliver optimal performance as thermoelectrics, as nonlinear Hall materials, and as heterojunctions for solar cells.
AI for Materials Science
11:40am–12:00pm Xavier Bresson Graph Transformers for Molecular Science -- Overcoming Limitations in Graph Representation Learning
Graph Neural Networks (GNNs) have shown great potential in graph representation learning but are limited by over-squashing and poor long-range dependency capture. In this work, we introduce Graph ViT, a novel approach that leverages Visual Transformers (ViT). This new architecture addresses the standard challenges by effectively capturing long-range dependencies, improving memory and computational efficiency, and offering high expressive power in graph isomorphism. These advantages enable Graph ViT to outperform traditional message-passing GNNs, especially in molecular science applications.
ML Algorithmic Advances
Break
12:20–12:40pm Sunny Lu TBC
N/A
AI for Finance
12:40–1:00pm TBC TBC
N/A
TBC
1:00–1:20pm TBC TBC
N/A
TBC
1:20–1:40pm TBC TBC
N/A
N/A
Lunch
Afternoon Sessions
(Parallel Sessions)
2:40pm to 3:00pm
Venue Auditorium 1 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science AI for Physics ML Algorithmic Advances AI for Medicine and Healthcare
Speakers Teck Leong Tan Khoo Jun Yong Kharen Musaelian Invited Talk
Title Advances in Multi-component Alloy Design: From Interpretable Machine-Learning to Generative AI TBC TBC TBC
Speakers Wang Hanmo Dimitar Georgiev Liu Fusheng Yilin Ning
Title Integrating graph neural networks with physics-informed loss function for mechanical response prediction of hollow concrete structures with morphed honeycomb configurations Hyperspectral unmixing for Raman spectroscopy via physics-constrained autoencoders Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models Generative artificial intelligence and ethical considerations in health care: a scoping review and ethics checklist
Abstract
3:15pm to 3:30pm
Speakers Ng Chee Koon Ying Zhang Sohei Arisaka Siqi Li
Title Composition driven machine learning for unearthing high-strength lightweight multi-principal element alloys A Symmetry-Aware Multimodal Transformer for Spin Hall Conductivity Prediction Learning the deflated conjugate gradient method using gradient-based meta-solving Federated Learning in Healthcare: A Benchmark Comparison of Engineering and Statistical Approaches for Structured Data Analysis
Abstract
3:30pm to 3:45pm
Speakers Jiayu Peng Xiaogang Liu Abrari Noor Hasmi Mingxuan Liu
Title Predicting the Chemical (Dis)order in Multicomponent Materials with High-Throughput Simulations and Representation Learning Non-structured Solution-Processed Scintillators for Improved X-ray Imaging Phase Space Visualization and Neural Networks: Learning Hamiltonian Dynamics in the Duffing System FAIM: Fairness-aware interpretable modeling for trustworthy machine learning in healthcare
Abstract
3:45pm to 4:00pm
Speakers Claudiane Ouellet-Plamondon Jin-Kyu So Vladislav Trifonov Uchenna Akujuobi
Title Minimizing the carbon footprint of 3D printing concrete: Leveraging parametric LCA and neural networks through multiobjective optimization Super-resolution optical metrology and imaging of 3D nano-scale objects (TBC) Efficient preconditioning for iterative methods with graph neural networks Gastro-Health Project: Revolutionizing Personalized Nutrition and Health Forecasting Through Integrated AI Technologies
Abstract
4:00pm to 4:20pm – Tea Break
4:20pm to 4:40pm
Venue Auditorium 1 Lecture Theatre 51 Lecture Theatre 52 Lecture Theatre 53
Topic AI for Materials Science Unconventional Computing AI for Mathematics AI for Climate and Weather
Speakers Ryo Yoshida Mario Lanza Martinez Sebastian Ullrich Terence O'Kane
Title Foundational Computational Polymer Properties Database: Bridging to Real-World AI Applications Memristor-based neuromorphic computation: status, challenges, and potential solutions Trustless AI with the Lean Theorem Prover On neural ODEs, normalizing flows, and Bayesian inference in application to climate teleconnections
Abstract
The first milestone of AI for Science is to create a comprehensive data platform. However, the development of foundational database in polymer material science has significantly lagged behind. To overcome this barrier, we have constructed a massive computational database encompassing a vast chemical space of polymers, using RadonPy, a fully automated pipeline tool for all-atom classical molecular dynamics and first-principles-based computational experiments. The foundation models pre-trained on this computational database have demonstrated scalable Sim2Real transfer in a wide range of real-world predictive tasks through fine-tuning. This presentation will discuss several aspects of scaling laws for Sim2Real transfer learning and their practical applications for the discovery of new polymers.
In this talk, I am going to present the status of memristor-based neuromorphic computation. I will present the state-the-art performance and the current challenges to overcome in the next five years. In this talk I will discuss two different strategies on how to overcome such problems. The first one consists of integrating two-dimensional materials at the back-end-of-line of silicon microchips, and we will present the performance of on-chip memristors made of multilayer hexagonal boron nitride. The second strategy will consist of unconventional biasing of established fully CMOS-compatible electronic devices to produce outstanding neuromorphic responses. I will also discuss the main technical problems to be faced in the next years when following each strategy, and I will provide some recommendations on how to solve them.
The explosive growth of AI reasoning capabilities brings ambitious goals such as solving unproven mathematical conjectures or generating complex verified software into the realm of possibility. But when such synthesized proofs may span hundreds of pages and thus stretch or transcend human comprehension, who will verify and establish trust in them? The Lean theorem prover is an open-source tool for authoring and checking machine-readable proofs of arbitrary logical statements, reducing the trust bottleneck to the correctness of the tool and the statement -- a reduction by orders of magnitudes. In this talk, I will describe the fundamentals of this approach and how AI is benefiting from Lean and vice versa already today.
Since the publication of the classic paper by Chen et al (2018 NeurIPS), there has been an explosion of applications of Neural network and operator methods for solving ODEs (NODEs) and more recently PDEs. In particular, neural operator methods are becoming the preferred architectures for application to the development of neural Atmospheric General Circulation Models for weather and climate prediction. In this presentation I will discuss the close relationship between neural methods, dynamical systems, and Bayesian inference in application to diagnosing the dynamics and causal relationships of observed and simulated climate teleconnections. More specifically, I will discuss the so-called "loss of hyperbolicity" linked with the alignment of dynamical vectors characterizing the emergent dynamics of persistent synoptic weather events due to flow instabilities. I then show how dynamical systems approaches to identifying the hyperbolic splitting of the tangent space defined by the leading physical modes, closely mirrors mapping methods employed in NODEs. Lastly, I show how further generalization to normalizing flows connects to Bayesian structure learning methods in application to inferring Granger causal relationships from climate data.
Speakers Markus J. Buehler Fabio Sciarrino Xujie Si Megan Stanley
Title AI for scientific discovery: Physics-aware models that connects scales, disciplines, and modalities Quantum machine learning via photonics The Science and Engineering of Autoformalizing Mathematics: A Case Study in Euclidean Geometry A Foundation Model for the Earth System: Modelling, Scaling, and High-Resolution Weather Forecasting
Abstract
5:00pm to 5:15pm
Speakers Artem Oganov Jun Zhao Wenda Li Xin Wang
Title AI Discovers New Materials And Phenomena – And Natural Intelligence Explains Them Quantum Networks in 6G Communications: Technologies, Challenges, and Applications Will mathematical theorem proving be solved by scaling laws? CondensNet: A Physically-Constrained Hybrid Deep Learning Model for Stable Long-Term Climate Simulations
Abstract
5:15pm to 5:30pm
Speakers Balamurugan Ramalingam Remmy Augusta Menzata Zen Abhik Roychoudhury Jiawen Wei
Title Multi-objective synthesis optimization and kinetics of a sustainable terpolymer Discovering Fault-Tolerant Quantum Circuits and Quantum Error Correction Codes via Reinforcement Learning AI for Program Verification XAI4Extremes: An explainable AI framework for understanding extreme-weather precursors
Abstract
5:30pm to 5:45pm
Speakers Ervin Chia Partha Pratim Kundu Michael Shalyt TBC
Title Classifying Petabytes of varying structural motifs in supercooled water AI Guided Multi-Objective Heterogeneous Chiplet Placement for Advanced Do LLMs Understand Calculus? Evaluating Symbolic Math Generalization with ASyMOB OMG-HD: A High-Resolution AI Weather Model for End-to-End Forecasts from Observations (TBC)
Abstract
5:45pm to 6:00pm
Speakers Dai Haiwen Shuangyue Geng TBC TBC
Title Metastable Polymorph Stabilization with Physics-based Descriptor Engineering and Machine Learning The analysis and interpretation of quantum adversarial examples TBC WeatherReal: A Benchmark Based on In-Situ Observations for Evaluating Weather Models (TBC)
Abstract
End of Day 4