28–30 nov. 2022
Laboratoire de physique nucléaire et des hautes énergies (LPNHE)
Fuseau horaire Europe/Paris

Multiview Symbolic Regression : learning equations behind examples

28 nov. 2022, 17:30
20m
Amphithéâtre Charpak (Laboratoire de physique nucléaire et des hautes énergies (LPNHE))

Amphithéâtre Charpak

Laboratoire de physique nucléaire et des hautes énergies (LPNHE)

4 Place Jussieu, Tour 22, 1er étage, 75005 Paris

Orateur

Etienne Russeil

Description

Symbolic Regression is a data-driven method that searches the space of mathematical equations with the goal of finding the best analytical representation of a given dataset. It is a very powerful tool, which enables the emergence of underlying behavior governing the data generation process. Furthermore, in the case of physical equations, obtaining an analytical form adds a layer of interpretability to the answer which might highlight interesting physical properties.
However equations built with traditional symbolic regression approaches are limited to describing one particular event at a time. That is, if a given parametric equation was at the origin of two datasets produced using two sets of parameters, the method would output two particular solutions, with specific parameter values for each event, instead of finding a common parametric equation. In fact there are many real world applications – in particular astrophysics – where we want to propose a formula for a family of events which may share the same functional shape, but with different numerical parameters
In this work we propose an adaptation of the Symbolic Regression method that is capable of recovering a common parametric equation hidden behind multiple examples generated using different parameter values. We call this approach Multiview Symbolic Regression and we demonstrate how it can reconstruct well known physical equations. Additionally we explore possible applications in the domain of astronomy for light curves modeling. Building equations to describe astrophysical object behaviors can lead to better flux prediction as well as new feature extraction for future machine learning applications.

Auteur principal

Co-auteurs

Dr Emille Ishida (CNRS/LPC-Clermont) Emmanuel Gangler (LPC) Dr Fabricio Olivetti de França (CMCC Federal University of ABC) Konstantin Malanchev (University of Illinois at Urbana-Champaign)

Documents de présentation