6–11 Jul 2025
PALAIS DU PHARO, Marseille, France
Europe/Paris timezone

QCD in Language Models: What they really know about QCD?

Not scheduled
20m
PALAIS DU PHARO, Marseille, France

PALAIS DU PHARO, Marseille, France

Parallel T16 - AI for HEP (special topic 2025) T16

Description

This study presents an analysis of modern open-source large language models (LLMs)—including Llama, Qwen, and Gemma—to evaluate their encoded knowledge of Quantum Chromodynamics (QCD). Through reverse engineering of these models' representations, we uncover the naturally idiosyncratic patterns in how foundational QCD concepts are embedded within their parameter spaces. Our methodology combines targeted probing techniques and knowledge extraction protocols to assess the models' understanding of critical QCD principles like color confinement, asymptotic freedom, and the running coupling constant. We further demonstrate how these latent representations can be leveraged for "connecting dots" between important QCD concepts, potentially enabling novel insights since LLMs allow us to understand the problem holistically limited only by memory. This work provides a tool for utilizing LLMs as an assistant in theoretical physics research, while also highlighting current limitations in their representation of advanced quantum field theory concepts that future model development should address.

Author

Presentation materials

There are no materials yet.