<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>R-PlanGPT: Neuro-Symbolic Plan Generation via Transformer-based Language Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Massimiliano Tummolo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mattia Chiari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luca Putelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicholas Rossetti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ivan Serina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alfonso Emilio Gerevini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Brescia</institution>
          ,
          <addr-line>Brescia, Italy, via Branze 38</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>R-PlanGPT is a neuro-symbolic architecture designed to generate solution plans for classical planning problems by learning from examples. It combines a generative model (PlanGPT), a symbolic validator (VAL) and a classical planner (LPG). PlanGPT learns to solve new instances within the same domain as a general policy, treating planning as a generative task and producing action sequences given the initial state and the goal of a problem. In order to guarantee the correctness of the neural model output, VAL is called to validate every plan produced by PlanGPT. If the solution is not valid, it is repaired by LPG. We demonstrate the capabilities of R-PlanGPT on standard planning benchmarks, highlighting its ability to generate valid, high-quality plans.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Recent advancements in Large Language Models (LLMs)
have shown remarkable performance across various natural
language processing tasks [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. However, a broader use of
these technologies includes mathematical inference tasks
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and code writing [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In terms of reasoning abilities,
although there is a basic understanding that these model
are capable or common-sense reasoning [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], an important
benchmark for these abilities is automated planning [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and,
in particular, solving planning problems [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
        ].
      </p>
      <p>
        In this demo, we present R-PlanGPT, a neuro-symbolic
architecture that addresses plan generation as a sequence
modeling task using a GPT-based architecture. As it can
be seen in Figure 1, the system is composed by three main
modules: PlanGPT, a GPT-model trained from scratch on
classical planning problems (expressed in PDDL [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]) which
learns to generate action sequences that solve planning
instances in a given domain. Since there is no
theoretical guarantee that the neural model provides the correct
solution of the problem, R-PlanGPT includes a validator
[
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ] to ensure plan soundness. If the solution is not valid,
R-PlanGPT invokes a classical planner [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] to repair the
output of the neural model and to provide a valid plan [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
We test R-PlanGPT on several benchmark domains from the
International Planning Competition (IPC) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Our results
show that although the GPT-model reaches good results
by itself, the inclusion of the symbolic components further
increase the performance, making R-PlanGPT capable of
generating valid high-quality plans.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>
        Automated Planning is a branch of Artificial Intelligence
focused on generating a sequence of actions (a plan) that
an agent can perform to transition from an initial state to a
goal state, given a formal model of the domain [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. While
classical planners focus on solving individual problem
instances, Generalized Planning (GP) instead seeks to derive
general policies that solve several problems within a domain.
A general policy [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] is a mapping from states (or
observations) to actions, enabling an agent to solve previously
unseen problems without having to compute each solution
from scratch. For instance, in the well-known blocksworld
domain, a general policy might instruct the agent to “clear
all blocks and then stack them in goal order,” regardless
of the number of blocks involved. Recent learning-based
approaches have explored extracting general policies from
solved examples, often using neural networks such as CNNs
or GNNs [
        <xref ref-type="bibr" rid="ref19 ref20 ref21">19, 20, 21</xref>
        ]. However, these typically require
domain-specific encoding and provide limited
expressivity or scalability. On the other hand, Transformer-based
language models have demonstrated strong capabilities in
sequence modelling tasks and show potential for learning
policies from data without handcrafted features [
        <xref ref-type="bibr" rid="ref10 ref22 ref8">8, 22, 10</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>
        This section details the pipeline for training and using
RPlanGPT for classical planning tasks. As shown in Figure
1, the system is composed of a transformer-based model
(PlanGPT [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]) a symbolic validator (VAL [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]), and the
LPG planner [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>First, we create a dataset composed of solved planning
instances from classical domains written in PDDL to train
the neural component; these are generated using standard
domain-specific generators, following the IPC conventions
to ensure a range of complexity. Each problem is solved
using the LPG planner, and we collect up to four plans
per instance to provide diversity. To avoid overfitting to
naming conventions, object names in problems and plans
are randomized.</p>
      <p>
        Next, we train from scratch the neural component of the
system, PlanGPT, which is based on a GPT architecture
and receives as input a textual prompt encoding the initial
state and goal of the planning problem. The model is trained
with standard cross-entropy loss to predict the next token
in a plan sequence. To prevent overfitting, we use a custom
early stopping criterion called Coverage Early Stopping [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ],
which terminates training when the percentage of valid
plans generated on the validation set stabilizes.
      </p>
      <p>At inference time, after the training, PlanGPT
autoregressively generates grounded action sequences. However,
PlanGPT can generate actions with unmet preconditions
or fails to complete all the goals, thus generating an invalid
plan. To check if a plan produced by PlanGPT is a valid
solution, we incorporate a symbolic validator (VAL) to assess
the correctness of each generated action and to verify that
each goal is satisfied at the end of the generation. To prevent
the model from generating non-applicable actions, we
further introduce the Validated Multi-Beam Search (VAL-MB)
strategy, which integrates VAL into the decoding process by
validating candidate actions during generation and pruning
invalid beams on the fly. Finally, if the generated output
is invalid or incomplete, we apply a plan repair strategy
using LPG. In this setup, a valid plan prefix is extracted and
provided as a seed to LPG, which continues the search and
produces a complete solution, combining the strengths of
neural generation with symbolic reasoning.</p>
    </sec>
    <sec id="sec-4">
      <title>4. System Demonstration</title>
      <p>As shown in Figure 2, the demo presents an interactive
system designed to showcase how to generate valid plans
using R-PlanGPT. The platform allows users to interact
with the whole pipeline, from problem specification to plan
generation and validation.</p>
      <p>The user begins by selecting a classical planning domain
from a predefined set (e.g., blocksworld, logistics) and
uploads a corresponding PDDL problem file, which includes
the initial state and goal. The system automatically parses
and verifies the syntactic correctness of the uploaded file
against the domain definition to ensure compatibility. This
procedure includes two key steps: (i) a conversion
mechanism that remaps object names not present in the model’s
vocabulary to randomly selected placeholder names from
the vocabulary, ensuring compatibility with the model’s
tokenization; and (ii) a check on the number of objects present
in the problem. Since PlanGPT is trained with a fixed
vocabulary size and limited object capacity, the system ensures
that the number of objects in the problem does not exceed
the maximum supported.If either check fails, the user
receives an error message and is asked to revise the input.</p>
      <p>Once validated, the user can trigger the plan generation
step using PlanGPT. The system supports various
generation strategies: greedy decoding, multi-beam search,
sampling, or the Validated Multi-Beam approach. In VAL-MB,
the symbolic validator is invoked at each generation step to
discard invalid actions on the fly, enforcing plan soundness
during decoding. For other decoding strategies, validation
is performed after the complete plan has been generated.
If the generated plan is deemed valid by the VAL tool, it is
directly displayed to the user as a viable solution.</p>
      <p>Otherwise, the validator identifies the first precondition
violation, and the system extracts the longest valid plan
prefix. At this point, the user is ofered the option to invoke
LPGto repair or complete the plan. LPG uses the prefix as
Model
Selection 
IPnrpoubtlem 
PSroolvbelem </p>
      <p>Load
1 (define (problem BLOCKS-4-0)
2 (:domain BLOCKS)
3 (:objects D B A C - block)
4 (:INIT (CLEAR C) (CLEAR A) (CLEAR B) (CLEAR D) (ONTABLE C) (ONTABLE A)
5 (ONTABLE B) (ONTABLE D) (HANDEMPTY))
76 )(:goal (AND (ON D C) (ON C B) (ON B A)))
8</p>
      <p>Submit
a seed to guide its search process, producing a valid plan
more eficiently than starting from scratch.</p>
      <p>Once the solution is obtained, the output is displayed,
and the user is shown the generated plan, a summary of the
selected domain and problem, and the validation results if
requested.</p>
    </sec>
    <sec id="sec-5">
      <title>5. System Evaluation</title>
      <p>We evaluate R-PlanGPT on a suite of classical planning
benchmarks, demonstrating its efectiveness. The
IPCScoreQuality (IPCQ) metric evaluates the quality of solutions of a
planning system by comparing each plan’s cost to the
bestknown plan for the same problem. Higher scores indicate
plans that are closer to optimal, while unsolved problems
score zero. As shown in Table 1, R-PlanGPT outperforms
all other approaches in terms of IPCQ, achieving the
highest overall score of 175.00 on the IPC domains. This
result demonstrates that combining PlanGPT with LPG as a
post-repair step leads to plans of higher quality than either
PlanGPT or symbolic planners alone. Despite the good
performance of the neural model by itself, the integration of
a symbolic planner improves it in domains where PlanGPT
alone struggled, such as logistics, satellite, and depots.
Moreover, it reaches the performance of LPG and LAMA in
most domains.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>
        In this demo, we have presented R-PlanGPT, a system that
integrates language models with symbolic tools for solving
classical planning problems. Future developments will
focus on supporting nondeterministic planning, incorporating
macro-actions and temporal logic, enhancing
interpretability, or other forms of neuro-symbolic integration [
        <xref ref-type="bibr" rid="ref24 ref25 ref26">24, 25, 26</xref>
        ].
      </p>
    </sec>
    <sec id="sec-7">
      <title>7. Acknowledgements</title>
      <p>This work has been supported by: MUR (Italian Ministry
of University and Research) PRIN-2020 project RIPER (n.
20203FFYLK); PNRR MUR project PE0000013-FAIR, cascade
funding call, ResilientPlans; AI4WATER project, part of
the PRIMA Programme supported by the European Union
and by MUR; and by and by Regione Lombardia through
the initiative "Programma degli interventi per la ripresa
economica: sviluppo di nuovi accordi di collaborazione con
le università per la ricerca, l’innovazione e il trasferimento
tecnologico" - DGR n. XI/4445/2021.
The author(s) have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Radford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Narasimhan</surname>
          </string-name>
          ,
          <article-title>Improving language understanding by generative pre-training</article-title>
          , in: preprint,
          <year>2018</year>
          . api.semanticscholar.org/CorpusID:49313245.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kaiser</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          ,
          <source>in: NIPS</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>5998</fpage>
          -
          <lpage>6008</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schuurmans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bosma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ichter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. H.</given-names>
            <surname>Chi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>Chain-of-thought prompting elicits reasoning in large language models</article-title>
          ,
          <source>in: NeurIPS</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Joty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. C. H.</given-names>
            <surname>Hoi</surname>
          </string-name>
          , Codet5:
          <article-title>Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation</article-title>
          ,
          <source>in: EMNLP (1)</source>
          ,
          <source>Association for Computational Linguistics</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>8696</fpage>
          -
          <lpage>8708</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Geva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Khashabi</surname>
          </string-name>
          , E. Segal,
          <string-name>
            <given-names>T.</given-names>
            <surname>Khot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Roth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Berant</surname>
          </string-name>
          ,
          <article-title>Did Aristotle use a laptop? A question answering benchmark with implicit reasoning strategies</article-title>
          ,
          <source>Trans. Assoc. Comput. Linguistics</source>
          <volume>9</volume>
          (
          <year>2021</year>
          )
          <fpage>346</fpage>
          -
          <lpage>361</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Pallagani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. C.</given-names>
            <surname>Muppasani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Roy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fabiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Loreggia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Murugesan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Horesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Sheth</surname>
          </string-name>
          ,
          <article-title>On the prospects of incorporating large language models (llms) in automated planning and scheduling (APS)</article-title>
          , in: ICAPS, AAAI Press,
          <year>2024</year>
          , pp.
          <fpage>432</fpage>
          -
          <lpage>444</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Rossetti</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <article-title>On planning through llms</article-title>
          , in: ICAPS, AAAI Press,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Pallagani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Muppasani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Horesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Murugesan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Loreggia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fabiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Joseph</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kethepalli</surname>
          </string-name>
          ,
          <article-title>Plansformer tool: Demonstrating generation of symbolic plans using transformers, in: IJCAI, IJCAI Org</article-title>
          .,
          <year>2023</year>
          , pp.
          <fpage>7158</fpage>
          -
          <lpage>7162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Serina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serina</surname>
          </string-name>
          ,
          <article-title>A preliminary study on BERT applied to automated planning</article-title>
          , in: IPS/RiCeRcA/SPIRIT@AI*IA, volume
          <volume>3345</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Valmeekam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Hernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sreedharan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kambhampati</surname>
          </string-name>
          ,
          <string-name>
            <surname>Planbench:</surname>
          </string-name>
          <article-title>An extensible benchmark for evaluating large language models on planning and reasoning about change</article-title>
          ,
          <source>in: NeurIPS</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D.</given-names>
            <surname>McDermott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ghallab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Howe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ram</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Veloso</surname>
            ,
            <given-names>D. S.</given-names>
          </string-name>
          <string-name>
            <surname>Weld</surname>
            ,
            <given-names>D. E.</given-names>
          </string-name>
          <string-name>
            <surname>Wilkins</surname>
          </string-name>
          ,
          <article-title>Pddlthe planning domain definition language</article-title>
          ,
          <year>1998</year>
          . URL: https://api.semanticscholar.org/CorpusID:59656859.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Howey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Long</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Fox, VAL: automatic plan validation, continuous efects and mixed initiative planning using PDDL, in: ICTAI</article-title>
          , IEEE Computer Society,
          <year>2004</year>
          , pp.
          <fpage>294</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Rossetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tummolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          , I. Serina,
          <string-name>
            <given-names>M.</given-names>
            <surname>Olivato</surname>
          </string-name>
          ,
          <article-title>Enhancing gpt-based planning policies by model-based plan validation</article-title>
          ,
          <source>Proceedings of the 18th International Conference on Neural-Symbolic Learning and Reasoning</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serina</surname>
          </string-name>
          ,
          <article-title>LPG: A planner based on local search for planning graphs with action costs</article-title>
          , in: AIPS, AAAI Press,
          <year>2002</year>
          , pp.
          <fpage>13</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tummolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Rossetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          , I. Serina,
          <string-name>
            <given-names>M.</given-names>
            <surname>Olivato</surname>
          </string-name>
          ,
          <article-title>Integrating classical planners with gpt-based planning policies</article-title>
          ,
          <source>in: AI*IA, Lecture Notes in Computer Science</source>
          , Springer,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>A.</given-names>
            <surname>Taitler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Alford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Espasa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Behnke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gimelfarb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pommerening</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sanner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Scala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schreiber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Segovia-Aguas</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Seipp,</surname>
          </string-name>
          <article-title>The 2023 international planning competition</article-title>
          ,
          <source>AI Mag</source>
          .
          <volume>45</volume>
          (
          <year>2024</year>
          )
          <fpage>280</fpage>
          -
          <lpage>296</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ghallab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Nau</surname>
          </string-name>
          , P. Traverso,
          <source>Automated planning - theory and practice</source>
          , Elsevier,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hu</surname>
          </string-name>
          , G. De Giacomo,
          <article-title>Generalized planning: Synthesizing plans that work for multiple environments</article-title>
          , in: IJCAI,
          <string-name>
            <given-names>IJCAI</given-names>
            <surname>Org</surname>
          </string-name>
          .,
          <year>2011</year>
          , pp.
          <fpage>918</fpage>
          -
          <lpage>923</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>E.</given-names>
            <surname>Groshev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Goldstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tamar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Abbeel</surname>
          </string-name>
          ,
          <article-title>Learning generalized reactive policies using deep neural networks</article-title>
          ,
          <source>in: ICAPS</source>
          , AAAI Press,
          <year>2018</year>
          , pp.
          <fpage>408</fpage>
          -
          <lpage>416</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ståhlberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bonet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gefner</surname>
          </string-name>
          ,
          <article-title>Learning general optimal policies with graph neural networks: Expressive power, transparency, and limits</article-title>
          , in: ICAPS, AAAI Press,
          <year>2022</year>
          , pp.
          <fpage>629</fpage>
          -
          <lpage>637</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Toyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Thiébaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. W.</given-names>
            <surname>Trevizan</surname>
          </string-name>
          , L. Xie, Asnets:
          <article-title>Deep learning for generalised planning</article-title>
          ,
          <source>J. Artif. Intell. Res</source>
          .
          <volume>68</volume>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>68</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>T.</given-names>
            <surname>Silver</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Srinivas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. B.</given-names>
            <surname>Tenenbaum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. P.</given-names>
            <surname>Kaelbling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Katz</surname>
          </string-name>
          ,
          <article-title>Generalized planning in PDDL domains with pretrained large language models</article-title>
          , in: AAAI, AAAI Press,
          <year>2024</year>
          , pp.
          <fpage>20256</fpage>
          -
          <lpage>20264</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>N.</given-names>
            <surname>Rossetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tummolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          , I. Serina,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Olivato</surname>
          </string-name>
          ,
          <article-title>Learning general policies for planning through gpt models</article-title>
          ,
          <source>Proceedings of the International Conference on Automated Planning and Scheduling</source>
          <volume>34</volume>
          (
          <year>2024</year>
          )
          <fpage>500</fpage>
          -
          <lpage>508</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Percassi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          , I. Serina,
          <string-name>
            <given-names>M.</given-names>
            <surname>Olivato</surname>
          </string-name>
          ,
          <article-title>Goal recognition as a deep learning task: The grnet approach</article-title>
          , in: ICAPS, AAAI Press,
          <year>2023</year>
          , pp.
          <fpage>560</fpage>
          -
          <lpage>568</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Loreggia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serina</surname>
          </string-name>
          ,
          <article-title>Fast and slow goal recognition</article-title>
          , in: M.
          <string-name>
            <surname>Dastani</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          <string-name>
            <surname>Sichman</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Alechina</surname>
          </string-name>
          , V. Dignum (Eds.),
          <source>Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS</source>
          <year>2024</year>
          , Auckland, New Zealand, May 6-
          <issue>10</issue>
          ,
          <year>2024</year>
          , International Foundation for Autonomous
          <source>Agents and Multiagent Systems / ACM</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>354</fpage>
          -
          <lpage>362</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>L.</given-names>
            <surname>Serina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Gerevini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Putelli</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Serina</surname>
          </string-name>
          ,
          <article-title>Towards eficient online goal recognition through deep learning</article-title>
          , in: AAMAS,
          <source>International Foundation for Autonomous Agents and Multiagent Systems / ACM</source>
          ,
          <year>2025</year>
          , pp.
          <fpage>1895</fpage>
          -
          <lpage>1903</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>