me.jpg
Matteo Merler
Researcher @ FBK NLP

Via Sommarive, 18

Trento, Italy

about

I am a researcher at the FBK NLP in Trento, Italy. This Fall, I will start my PhD at the Bethge Lab in Tübingen, Germany, as an ELLIS PhD Student and as part of the International Max Planck Research School for Intelligent Systems (IMPRS-IS). I also work with Tom Silver’s group for robotics and planning in Princeton, US. Previously, I obtained my MSc degree in Machine Learning, Data Science and Artificial Intelligence from Aalto University in Helsinki, Finland.

I am interested in developing autonomous AI agents that learn from experience and adapt dynamically to new situations. For this, my research is at the intersection of large-scale vision and language models, reinforcement learning, and planning. In particular, I believe world modeling to be a key component for building intelligent agents that can reason about the world and plan their actions accordingly. I work with both simulated games and real-world robotics as environments for agents to learn and adapt in. I am also curious about the connections between AI and cognitive science, and how insights from human cognition can inform the development of more intelligent agents.

news

Jul 02, 2026 We released a new preprint: QVal: Cheaply Evaluating Dense Supervision Signals for Long-Horizon LLM Agents, led by Sergio Hernández.
Jun 13, 2026 We released a new preprint: DecSelfMask: Leveraging Unlabeled Text via Self-Relevance-Guided Masking for Decoder-Only Classification, led by Pietro Ferrazzi.
Nov 25, 2025 A preliminary version of our latest work, Guiding Reinforcement Learning with Selective Vision-Language Model Supervision, has been published in the ECAI 2025 CAIPI workshop. We are currently extending this work into a full-length paper for a conference submission.
May 20, 2025 We released a new preprint: ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models.
Mar 05, 2025 My Master’s Thesis was awarded as one of the three best at the Aalto University School of Science in 2024.

selected publications

  1. ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models
    May 2025
  2. Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
    Nicola Dainese*Matteo Merler*Minttu Alakuijala, and Pekka Marttinen
    In Advances in Neural Information Processing Systems, May 2024