A new Columbia University AI program observed physical phenomena and uncovered relevant variables—a necessary precursor to any physics theory. But the variables it discovered were unexpected.
Energy, Mass, Velocity. These three variables make up Einstein’s iconic equation E=MC2. But how did Albert Einstein know about these concepts in the first place? Before understanding physics you need to identify relevant variables. Not even Einstein could discover relativity without the concepts of energy, mass, and velocity. But can variables like these be discovered automatically? Doing so would greatly accelerate scientific discovery.
This is the question that Columbia Engineering researchers posed to a new artificial intelligence program. The AI program was designed to observe physical phenomena through a video camera and then try to search for the minimal set of fundamental variables that fully describe the observed dynamics. The study was published in the journal Nature Computational Science on July 25.
The image shows a chaotic swing stick dynamical system in motion. Our work aims at identifying and extracting the minimum number of state variables needed to describe such system from high-dimensional video footage directly. Credit: Yinuo Qin/Columbia Engineering
The scientists started by feeding the system raw video footage of physics phenomena for which they already knew the solution. For example, they fed a video of a swinging double-pendulum known to have exactly four “state variables”—the angle and angular velocity of each of the two arms. After several hours of analysis, the AI outputted its answer: 4.7.
“We thought this answer was close enough,” said Hod Lipson, director of the Creative Machines Lab in the Department of Mechanical Engineering, where the work was primarily done. “Especially since all the AI had access to was raw video footage, without any knowledge of physics or geometry. But we wanted to know what the variables actually were, not just their number.”
Next, the researchers proceeded to visualize the actual variables that the program identified. Extracting the variables themselves was difficult because the program cannot describe them in any intuitive way that would be understandable to humans. After some investigation, it appeared that two of the variables the program chose loosely corresponded to the angles of the arms, but the other two remain a mystery.
“We tried correlating the other variables with anything and everything we could think of: angular and linear velocities, kinetic and potential energy, and various combinations of known quantities,” explained Boyuan Chen PhD ’22, now an assistant professor at Duke University, who led the work. “But nothing seemed to match perfectly.” The team was confident that the AI had found a valid set of four variables, since it was making good predictions, “but we don’t yet understand the mathematical language it is speaking,” he explained.
Boyuan Chen explains how a new AI program observed physical phenomena and uncovered relevant variables—a necessary precursor to any physics theory. Credit: Boyuan Chen/Columbia Engineering
After validating a number of other physical systems with known solutions, the scientists inputted videos of systems for which they did not know the explicit answer. One of these videos featured an “air dancer” undulating in front of a local used car lot. After several hours of analysis, the program returned 8 variables. Likewise, a video of a Lava lamp also produced 8 eight variables. When they provided a video clip of flames from a holiday fireplace loop, the program returned 24 variables.
A particularly interesting question was whether the set of variables was unique for every system, or whether a different set was produced each time the program was restarted. “I always wondered, if we ever met an intelligent alien race, would they have discovered the same physics laws as we have, or might they describe the universe in a different way?” said Lipson. “Perhaps some phenomena seem enigmatically complex because we are trying to understand them using the wrong set of variables.”
In the experiments, the number of variables was the same each time the AI restarted, but the specific variables were different each time. So yes, there are indeed alternative ways to describe the universe and it is quite possible that our choices aren’t perfect.
According to the researchers, this sort of AI can help scientists uncover complex phenomena for which theoretical understanding is not keeping pace with the deluge of data—areas ranging from biology to cosmology. “While we used video data in this work, any kind of array data source could be used—radar arrays, or DNA arrays, for example,” explained Kuang Huang PhD ’22, who coauthored the paper.
The work is part of Lipson and Fu Foundation Professor of Mathematics Qiang Du’s decades-long interest in creating algorithms that can distill data into scientific laws. Past software systems, such as Lipson and Michael Schmidt’s Eureqa software, could distill freeform physical laws from experimental data, but only if the variables were identified in advance. But what if the variables are yet unknown?
Hod Lipson explains how the AI program was able to discover new physical variables. Credit: Hod Lipson/Columbia Engineering
Lipson, who is also the James and Sally Scapa Professor of Innovation, argues that scientists may be misinterpreting or failing to understand many phenomena simply because they don’t have a good set of variables to describe the phenomena. “For millennia, people knew about objects moving quickly or slowly, but it was only when the notion of velocity and acceleration was formally quantified that Newton could discover his famous law of motion F=MA,” Lipson noted. Variables describing temperature and pressure needed to be identified before laws of thermodynamics could be formalized, and so on for every corner of the scientific world. The variables are a precursor to any theory. “What other laws are we missing simply because we don’t have the variables?” asked Du, who co-led the work.
The paper was also co-authored by Sunand Raghupathi and Ishaan Chandratreya, who helped collect the data for the experiments. Since July 1, 2022, Boyuan Chen has been an assistant professor at Duke University. The work is part of a joint University of Washington, Columbia, and Harvard NSF AI institute for dynamical systems, aimed to accelerate scientific discovery using AI.
Reference: “Automated discovery of fundamental variables hidden in experimental data” by Boyuan Chen, Kuang Huang, Sunand Raghupathi, Ishaan Chandratreya, Qiang Du and Hod Lipson, 25 July 2022, Nature Computational Science.