me.jpg
Matteo Merler
Researcher @ FBK NLP

Via Sommarive, 18

Trento, Italy

about

I am a researcher at the FBK NLP in Trento, Italy. This Fall, I will start my PhD at the Bethge Lab in Tübingen, Germany, as an ELLIS PhD Student and as part of the International Max Planck Research School for Intelligent Systems (IMPRS-IS). I also work with Tom Silver’s group for robotics and planning in Princeton, US. Previously, I obtained my MSc degree in Machine Learning, Data Science and Artificial Intelligence from Aalto University in Helsinki, Finland.

I am interested in developing autonomous AI agents that learn from experience and adapt dynamically to new situations. For this, my research is at the intersection of large-scale vision and language models, reinforcement learning, and planning. In particular, I believe world modeling to be a key component for building intelligent agents that can reason about the world and plan their actions accordingly. I work with both simulated games and real-world robotics as environments for agents to learn and adapt in. I am also curious about the connections between AI and cognitive science, and how insights from human cognition can inform the development of more intelligent agents.

news

Jun 13, 2026 We released a new preprint: DecSelfMask: Leveraging Unlabeled Text via Self-Relevance-Guided Masking for Decoder-Only Classification, led by Pietro Ferrazzi.
Nov 25, 2025 A preliminary version of our latest work, Guiding Reinforcement Learning with Selective Vision-Language Model Supervision, has been published in the ECAI 2025 CAIPI workshop. We are currently extending this work into a full-length paper for a conference submission.
May 20, 2025 We released a new preprint: ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models.
Mar 05, 2025 My Master’s Thesis was awarded as one of the three best at the Aalto University School of Science in 2024.
Sep 25, 2024 Our paper “Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search” has been accepted at NeurIPS 2024.

selected publications

  1. ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models
    May 2025
  2. Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
    Nicola Dainese*Matteo Merler*Minttu Alakuijala, and Pekka Marttinen
    In Advances in Neural Information Processing Systems, May 2024