Matteo Merler

Researcher @ FBK NLP

me.jpg

Via Sommarive, 18

Trento, Italy

I am currently a pre-doctoral researcher at the FBK NLP in Trento, Italy. Previously, I obtained my MSc degree in Machine Learning, Data Science and Artificial Intelligence from Aalto University in Helsinki, Finland.

My research interests revolve around bridging current large-scale models (such as LLMs and VLMs) with other learning methods, such as Reinforcement Learning (RL) and Symbolic Planning (SP). Specifically, I believe that a key strength of RL is the ability to learn from experience while interacting with the world, which is something current LLMs lack. At the same time, pure RL requires the agent to learn about the environment from scratch in every new situation, and hardly generalizes to new tasks.

I’m also fascinated by model-based RL and the concept of world models as a whole. I believe this to be a key ability for general agents that current LLMs can imitate but ultimately lack. I want to investigate the extent to which current large-scale models possess so-called “internal” world models and develop techniques to specifically teach these models to plan ahead and imagine the future state of the world.

Therefore, I am interested in developing AI agents that are able to understand natural language instructions and leverage the large amount of real-world knowledge encoded in LLMs, but at the same time adapt to situations that require more complex reasoning and long-horizon planning, going beyond memorization and pattern matching, but rather understanding the world from direct experience and interaction with the real world.

news

Mar 05, 2025 My Master’s Thesis was awarded as one of the three best at the Aalto University School of Science in 2024.
Sep 25, 2024 Our paper “Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search” has been accepted at NeurIPS 2024.
Jul 08, 2024 Our paper “In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery” has been accepted at the ACL 2024 Student Research Workshop.

selected publications

  1. Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
    In Advances in Neural Information Processing Systems, 2024