<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Generalizable Neuro-Symbolic Question Answering Systems for Commonsense</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alessandro Oltramari</string-name>
          <email>alessandro.oltramari@us.bosch.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jonathan Francis</string-name>
          <email>jon.francis@us.bosch.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Filip Ilievski</string-name>
          <email>ilievski@isi.edu</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kaixin Ma</string-name>
          <email>kaixinm@andrew.cmu.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roshanak Mirzaee</string-name>
          <email>rk.mirzaee.m@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bosch Research and Technology Center</institution>
          ,
          <addr-line>Pittsburgh</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Carnegie Mellon University</institution>
          ,
          <addr-line>Pittsburgh</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Computer Science and Engineering, Michigan State University</institution>
          ,
          <addr-line>East Lansing</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Marina del Rey</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Italian National Research Council, Institute of Cognitive Science and Technology</institution>
          ,
          <addr-line>Trento</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>1. Extended abstract In this contribution [1], we analyze different methods for integrating large language models (LLMs) and knowledge graphs (KGs), presenting a quantitative and qualitative evaluation of these neuro-symbolic strategies on a broad spectrum of question answering benchmark datasets focused on common sense reasoning. We illustrate how the combination of data-driven algorithms and symbolic knowledge is necessary to enable domain generalizability and robustness in downstream tasks. Our work, despite being prior to the disruptive innovation triggered by ChatGPT [2], remains actual, as it stems from a comprehensive analysis of the intrinsic problems that LLMs have in dealing with high-level forms of inferences requiring spatial, temporal, analogical, causal reasoning. We argue that such problems are dependent on the inner limitations of the neural architectures behind those models and cannot simply be solved by increasing the size of the training data, or by further scaling the number of parameters. In our original paper we claim that two orthogonal extensions are required for LLMs: first, augmenting them with symbolic knowledge, which is available in large amounts thanks to opensource commonsense KGs; second, extending the resulting neuro-symbolic language models with components that can dynamically perform reasoning. We refer to the former direction as horizontal augmentation, as it focuses on expanding the coverage of the structured knowledge available to LLMs methods, and to the latter as vertical augmentation, as it explores in depth how specific forms of reasoning can be enabled. Our paper mostly focuses on horizontal augmentation, but it also lays the foundations of vertical augmentation. In regards of the former, we study techniques like attentionbased knowledge injection, and fine-tuning LLMs with synthetic question-answer pairs generated from KGs: in both techniques, symbolic knowledge is typically translated into latent vectorial representations. However, if sub-symbolic expressions can augment training signals with features transformed from explicit semantic content, such knowledge infusion process doesn't mandate how inferences on the learned knowledge should be performed. Such mechanisms are typically executed using logic-based reasoning, e.g., Region-Connection-Calculus for spatial knowledge [3], Allen's axioms for temporal (or spatial) knowledge [4], etc. (relevant work exists showing how deep learning models can replicate logical reasoning, e.g., [5, 6], but it doesn't follow that any form of reasoning should be reduced to sub-symbolic learning - or at least this is an assumption only for some closely paired neuro-symbolic systems). Implementing inference mechanisms1, and integrating them with large-scale knowledge infusion methods, is what we advocate for in the future of neuro-symbolic AI research and development.</p>
      </abstract>
    </article-meta>
  </front>
  <body />
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Oltramari</surname>
          </string-name>
          , Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee:
          <article-title>Generalizable Neuro-symbolic Systems for Commonsense Question Answering</article-title>
          . In Pascal Hitzler, Md Kamruzzaman Sarker (eds.),
          <source>Neuro-Symbolic Artificial Intelligence: The State of the Art. Frontiers in Artificial Intelligence and Applications</source>
          Vol.
          <volume>342</volume>
          , IOS Press, Amsterdam,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2] “ChatGPT - Release Notes”.
          <source>Archived from the original on May 4</source>
          ,
          <year>2023</year>
          . Retrieved May 4,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Cohn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bennett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gooday</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Gotts</surname>
          </string-name>
          ,
          <article-title>Qualitative spatial representation and reasoning with the region connection calculus, geoinformatica 1 (</article-title>
          <year>1997</year>
          )
          <fpage>275</fpage>
          -
          <lpage>316</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J. F.</given-names>
            <surname>Allen</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Ferguson, Actions and events in interval temporal logic</article-title>
          ,
          <source>Journal of logic and computation 4</source>
          (
          <year>1994</year>
          )
          <fpage>531</fpage>
          -
          <lpage>579</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Eberhart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hitzler</surname>
          </string-name>
          ,
          <article-title>On the capabilities of pointer networks for deep deductive reasoning</article-title>
          ,
          <source>arXiv preprint arXiv:2106.09225</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A. d.</given-names>
            <surname>Garcez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Bowman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Lamb</surname>
          </string-name>
          , L. de Penning,
          <string-name>
            <given-names>B.</given-names>
            <surname>Illuminoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Poon</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Gerson Zaverucha, Neural-symbolic learning and reasoning: A survey and interpretation</article-title>
          ,
          <source>Neuro-Symbolic Artificial Intelligence: The State of the Art</source>
          <volume>342</volume>
          (
          <year>2022</year>
          )
          <article-title>1</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>