27 novembre 2023 à 1 décembre 2023
Fuseau horaire Europe/Paris

Efficient Sampling from Bayesian Network Posteriors for Optimal Uncertainties

1 déc. 2023, 10:15
25m
Architectures (Adversarial, Bayesian, ... ) Architectures

Orateur

Sebastian Bieringer (Hamburg University, Institute for experimental physics)

Description

Bayesian neural networks are a key technique when including uncertainty predictions into neural network analysis, be it in classification, regression or generation. Although being an essential building block for classical Bayesian techniques, Markov Chain Monte Carlo methods are seldomly used to sample Bayesian neural network weight posteriors due to slow convergence rates in high dimensional parameter spaces. Metropolis-Hastings corrected chains exhibit two major issues: using a stochastic Metropolis-Hastings term and bad acceptance rates. We present solutions to both problems in form of a correction term to the loss objective and novel proposal distributions based on the Adam-optimizer. The combined algorithm shows fast convergence and good uncertainty estimation for physics use cases without dramatically increasing the cost of computation over gradient descent based optimization.

Auteurs principaux

Prof. Gregor Kasieczka (Hamburg University) Prof. Mathias Trabs (Karlsruhe Institute of Technology) Sebastian Bieringer (Hamburg University, Institute for experimental physics)

Documents de présentation