<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Explainable Mobility Prediction in Urban Transit Zones</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adrian M.P. Braşoveanu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lyndon J.B. Nixon</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arno Scharl</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Günther Charwat</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Egon Prünster</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Modul Technology GmbH</institution>
          ,
          <addr-line>Am Kahlenberg 1, 1190, Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Modul University Vienna</institution>
          ,
          <addr-line>Am Kahlenberg 1, 1190, Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>ummadum Service GmbH</institution>
          ,
          <addr-line>Prinz-Eugen-Straße 2/5, 1040 Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>webLyzard technology GmbH</institution>
          ,
          <addr-line>Liechtensteinstrasse 41/26, 1090 Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sustainable mobility behavior remains a challenging objective, as individuals rarely alter their transportation habits solely for environmental benefits. Creating efective incentives can prompt change, but first, it requires a comprehensive understanding of mobility patterns. This paper examines the prediction of personal travel activities (including next-trip forecasting and activity classification) utilizing recent journey data collected through a mobility application. A graph-based fusion ensemble, comprising a graph convolutional network and a statistical user pattern, is designed for structured prediction with multiple outputs, including origin, destination, transportation mode, and time. An explainable prediction pipeline is built on top of this ensemble. The results are then converted into a knowledge graph, enabling us to conduct sophisticated analyses and improve our weekly workflows.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;sustainable transportation</kwd>
        <kwd>mobility prediction</kwd>
        <kwd>graph convolutional network</kwd>
        <kwd>explainable mobility graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Achieving sustainable transportation patterns is challenging since individuals rarely modify
established routines solely for environmental benefits. Key considerations encompass personal convenience
(overcrowded transit deterring usage), network development (separate cycling corridors), capability
requirements, motivation systems (e.g., notifications or rewards, and inclusive design features. If
individuals are to change their behavior, the proposed benefits of the motivation systems must exceed
the perceived advantages of continuing current practices, including financial savings, comfort zones,
and ingrained patterns.</p>
      <p>The AI-CENTIVE project’s primary goal is to use neural network architectures to understand
transportation behaviors in Austrian urban transit zones and identify the most efective reward structures to
encourage a shift towards eco-friendly travel options, such as using bicycles or public transit instead
of private vehicles for urban commuting. The research involves developing ensembles that combine
statistical patterns and graph convolutional networks (GCNs). Our experiments quantify environmental
impacts by combining baseline predictions with simulated incentives across mobility networks.</p>
      <p>The main contribution presented in this paper is the semantic conversion workflow developed to
support and analyze explainable AI predictions. The output of the ensembles is converted into a mobility
knowledge graph, and the best predictions are analyzed using SPARQL queries.</p>
      <p>The paper is organized as follows: Section 2 ofers a brief overview of related work, Section 3 describes
the general method, while Section 4 presents the explainability pipeline and the associated analysis
that can be performed with it. The paper concludes with a brief overview and future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>This brief section covers only graph mobility network surveys and related work on explainability.</p>
      <p>
        Examining graph-based computational methods, Jiang’s survey [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] catalogs trafic applications, and
model variations, including standard graph networks, convolutional adaptations, and unsupervised
graph encoders. Expanding this scope significantly, Wang et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] provides detailed analysis
covering broader transportation applications: vehicle storage systems, safety enhancement, autonomous
navigation, and metropolitan design optimization.
      </p>
      <p>
        Beyond academic surveys identifying research directions, practical implementation insights from the
Google Maps arrival prediction system [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] deserve particular attention through detailed experimental
variations and qualitative performance evaluation on actual transportation data. Notable mention
regarding Google concerns their adaptive approach toward evolving privacy regulations shown by
shifting personal movement histories onto individual devices.
      </p>
      <p>
        Understanding model decision processes remains crucial for discovering inherent biases often
stemming from unbalanced training samples. Schwalbe and Finzel [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] deliver extensive examination
synthesizing over fifty specialized surveys addressing interpretability challenges across computational
domains. Wang et. al [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] focus on knowledge graph related research in smart city domain which was
the starting point of our mobility graph idea for fast analysis idea presented in this paper.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>A core challenge the AI-CENTIVE project addresses is the observation that individuals are often
unwilling to change their mobility habits solely for environmental reasons. To overcome this, the
project’s overarching mission is to develop AI-based incentivization techniques to influence citizens’
mobility choices, utilizing multimodal models of mobility activity and data analytics. The ultimate
vision is to enable and incentivize Austrian citizens to adopt more sustainable mobility choices.</p>
      <p>The work presented here builds upon the Ummadum platform which allows users to log their
sustainable mobility activities via a mobile app. This process reports CO2 emissions reductions to
raise users’ sustainability awareness and rewards sustainable behavior to incentivize environmentally
friendly choices. The AI-CENTIVE community within Ummadum serves to engage participants, assess
the efectiveness of incentives, conduct pilot tests for integrating AI-driven solutions, and plan rollout
strategies across Austria. All trip data is collected with user consent, strictly adhering to GDPR best
practices by anonymizing user identities and locations for analysis.</p>
      <p>The project’s dataset for examining Vienna mobility patterns was provided by Ummadum. This
commuter dataset comprises approximately 450,000 user trips collected between January 2024 and
April 2025 from users of the Ummadum mobile app. The dataset focuses on activities such as biking,
walking, public transport, and car sharing (as a rider or driver), also including data about activity status
(e.g., cancelled, finished). To maintain GDPR compliance and route anonymity, user data and trip data
are anonymized, with segments added or removed from trips. Locations for origins and destinations
are expressed through zip codes, and information about location type (e.g., ofice, public venue) is
also collected. The dataset also includes details on the incentives provided, such as points, rewards,
carsharing, community types administering rewards, or activity challenges where users earn additional
rewards based on their activity levels.</p>
      <p>The project’s approach towards incentivization primarily involves rewards and notifications. Rewards
include classic elements like: i) challenges; ii) lotteries, or iii) collection of points for sustainable activities.
Notifications are displayed directly to the users based on their past history. Three types of notifications
were implemented: i) success notifications to motivate participants based on their previous sustainable
mobility actions; ii) AI recommendation notifications to suggest future sustainable mobility options
using contextual information such as behavior, mobility preferences, location, and time; and iii) weather
notifications for notable weather events. Explanations for the AI notifications are included, based
on past user history. These notifications are part of a simple workflow that runs automatically each
Monday morning, scheduling notifications after checking eligibility criteria (e.g., high confidence, no
more than one notification per user per day).</p>
      <p>Initial insights from the first 2025 pilot program indicated a total CO2 savings of 331 tons. These
insights were used to refine a second pilot in 2025. This second pilot incorporates improvements such as
explainable AI notifications (justifying notifications based on the user’s history, including similar trips
or partial routes), weather alerts, and enhanced challenges within the rewards system. The overall goal
of the second pilot was to test and refine AI-driven incentivization techniques to encourage sustainable
mobility choices among Austrian citizens using the Ummadum platform.</p>
      <p>
        We have examined several models designed for a structured prediction task that simultaneously
forecasts multiple outputs, including trip origin and destination locations, hour and day of the trip,
activity type, distance, and duration. These models, including Transformer and GCN ensembles that
include statistical models (e.g., ARIMA, XGBoost) are presented in a previous publication [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The best
model is an ensemble that combines statistical user patters (e.g., temporal and spatial patterns related
to the past trips, including same origin or destination, or time intervals) and a GCN architecture.
      </p>
      <p>To enhance transparency and regulatory compliance of our Graph Neural Network mobility
predictions, we developed a semantic conversion pipeline that transforms prediction outputs into structured
RDF knowledge graphs. This approach addresses the requirements of the European AI Act for
explainable AI systems in transportation applications.</p>
      <p>
        Our system employs a multilayered semantic framework combining established ontologies with
custom ontologies 1. The core structure integrates PROV-O (W3C Provenance Ontology) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for
comprehensive provenance tracking, allowing full traceability from input data through model inference
to final predictions. Temporal relationships are modeled using the W3C TIME [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] ontology, providing a
standardized representation of trip scheduling and duration predictions.
      </p>
      <p>For mobility-specific concepts, we defined custom RDF vocabularies in the modultech.eu/aicentive
namespace, establishing classes such as TripPrediction, PredictionExplanation, and PatternEvidence.
These vocabularies maintain compatibility with existing transport standards like GTFS (General Transit
Feed Specification) while extending semantic representation for explainable AI applications. The
explainability vocabulary captures structured reasoning through properties that include routeFrequency,
timeFrequency, and activityFrequency, allowing systematic analysis of prediction confidence factors.
Each prediction entity links to explanation instances that contain machine-readable pattern evidence
and human-readable justification text.</p>
      <p>The conversion pipeline automatically processes GNN outputs into RDF using rdflib, preserving all
prediction metadata including confidence scores, historical pattern evidence, and explanatory reasoning.
The resulting knowledge graph enables direct SPARQL querying without database infrastructure,
1The ontologies were designed for ofline use and are publicly available through
https://github.com/modultechnology/aicentiveontologies
GitHub
supporting complex analytical queries for model evaluation and regulatory auditing. Semantic queries
can systematically identify low-confidence predictions, analyze temporal patterns in model performance,
and extract compliance reports for regulatory review. This queryable format facilitates automated
detection of prediction biases and supports continuous model improvement through pattern analysis.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>This semantic conversion enables sophisticated analytical capabilities through SPARQL querying that
would be dificult or impossible with traditional CSV formats. Researchers can execute complex queries
to identify high-confidence predictions for regulatory audit trails, systematically analyze low-confidence
routes that require model improvement, and investigate user behavior patterns by correlating confidence
scores with historical pattern evidence (see Table 1).</p>
      <p>The semantic structure supports automated quality assurance through queries that filter predictions
by confidence thresholds, group routes by performance metrics, and extract explanation metadata for
transparency reporting. Beyond basic filtering, the pipeline enables advanced analytical patterns such as
identifying users with consistently high prediction confidence, discovering temporal patterns in model
performance, and analyzing the relationship between route familiarity and prediction accuracy. The
explainability metadata becomes queryable, allowing researchers to systematically study which types
of historical evidence correlate with prediction success and identify biases in model performance across
diferent user groups or activity types. This semantic approach transforms static prediction output into a
dynamic knowledge graph that supports continuous model evaluation, regulatory compliance reporting,
and data-driven insights for model improvement. The standardized format also enables integration
with other transportation data sets and supports collaborative research through interoperable semantic
web technologies. Most importantly, queryable explanations provide the foundation for automated
compliance checking required by emerging AI regulations, while simultaneously supporting research
into the relationship between explainability quality and prediction accuracy.</p>
      <p>One particular advantage of this approach is speed. It is generally 3x-5x times faster than the pure
CSV approach. This is mainly due to the SPARQL engines being optimized for triple pattern matching
than traditional SQL or pandas operations on CSV data. The processing eficiency is higher also due to
SPARQL’s lazy evaluation mechanism (e.g., only process the data needed to answer specific queries)
and selective loading (e.g., retrieving only data needed for each analysis). RDFlib itself is also optimized
to handle larger datasets more eficiently through streaming.</p>
      <p>Table 1 showcases several SPARQL queries that are routinely used for explainable mobility analysis.
Table 2 presents the kind of automated analysis that can be done automatically on top of the evaluation
results. All the results presented in this table refer to the last week of evaluation from our second pilot.</p>
      <p>The semantic approach directly addresses EU AI Act mandates for transparency and auditability
in AI applications. The structured explainability format enables automated compliance monitoring
and systematic verification of AI decision-making processes. Beyond regulatory compliance, the
standardized RDF format enhances research reproducibility and enables integration with broader
smart city transportation planning systems, positioning our work at the intersection of responsible AI
development and practical mobility applications.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The semantic conversation pipeline and the rapid analysis it enables represents one of the highlights of
our projects. Due to examining the results through such tools, we were able to quickly improve our
best model, and also deliver high quality notifications to our community members.</p>
      <p>Future investigations will incorporate emissions calculations from the carbon assessment tool
developed by project partner BOKU to evaluate environmental benefits across diferent reward mechanisms.
This enables concrete quantification regarding pollution reduction achieved through altered journey
selections. Additionally, data about weather conditions will enhance the ensemble accuracy.</p>
      <p>Building widespread participation in eco-friendly journey choices remains equally crucial. Monitoring
increased adoption rates of sustainable options helps evaluate progress in behavioral transformation.
Furthermore, we intend to target problematic urban locations lacking vehicle storage facilities or
convenient transit access within a reasonable walking range. Quantifying improvements within these
zones demonstrates enhanced accessibility plus livability improvements. Such enhanced measurements
strengthen our predictive capabilities while supporting inclusive, evidence-based approaches toward
sustainable transportation development. Through our project initiatives, we aim to empower community
members and organizational stakeholders in creating more responsive, equitable, and environmentally
responsible urban transit zones.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgments</title>
      <p>The AI-CENTIVE project (FFG G.A. 892238) was funded within the RTI Initiative of the ICT of the
Future “IKT der Zukunft” – an initiative of the Federal Ministry for Climate Protection, Environment,
Energy, Mobility, Innovation and Technology (BMK).
6.1. Declaration on Generative AI
During the preparation of this work, the authors have used Grammarly for grammar and spelling
checks.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <article-title>Graph Neural Network for Trafic Forecasting: A Survey, Expert Syst</article-title>
          .
          <source>Appl</source>
          .
          <volume>207</volume>
          (
          <year>2022</year>
          )
          <article-title>117921</article-title>
          . doi:
          <volume>10</volume>
          .1016/J.ESWA.
          <year>2022</year>
          .
          <volume>117921</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>A survey on knowledge graph related research in smart city domain</article-title>
          ,
          <source>ACM Transactions on Knowledge Discovery from Data</source>
          <volume>18</volume>
          (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          . doi:
          <volume>10</volume>
          .1145/3672615.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Derrow-Pinion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>She</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nunkesser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wiltshire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. W.</given-names>
            <surname>Battaglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sanchez-Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Velickovic</surname>
          </string-name>
          ,
          <article-title>ETA Prediction with Graph Neural Networks in Google Maps</article-title>
          , in: G. Demartini, G. Zuccon,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Culpepper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          , H. Tong (Eds.),
          <source>CIKM '21: The 30th ACM International Conference on Information and Knowledge Management</source>
          , Virtual Event, Queensland, Australia, November 1 -
          <issue>5</issue>
          ,
          <year>2021</year>
          , ACM,
          <year>2021</year>
          , pp.
          <fpage>3767</fpage>
          -
          <lpage>3776</lpage>
          . doi:
          <volume>10</volume>
          .1145/3459637.3481916.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Schwalbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Finzel</surname>
          </string-name>
          ,
          <source>A Comprehensive Taxonomy for Explainable Artificial Intelligence: A Systematic Survey of Surveys on Methods and Concepts</source>
          ,
          <source>Data Min. Knowl. Discov</source>
          .
          <volume>38</volume>
          (
          <year>2024</year>
          )
          <fpage>3043</fpage>
          -
          <lpage>3101</lpage>
          . doi:
          <volume>10</volume>
          .1007/S10618-022-00867-8.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Bra</surname>
          </string-name>
          ¸soveanu,
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Nixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Scharl</surname>
          </string-name>
          ,
          <article-title>Hybrid ai models for structured mobility prediction in metropolitan areas</article-title>
          ,
          <source>in: IWANN2025</source>
          ,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Missier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Belhajjame</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Cheney,</surname>
          </string-name>
          <article-title>The w3c prov family of specifications for modelling provenance metadata</article-title>
          ,
          <source>in: Proceedings of the 16th international conference on extending database technology</source>
          ,
          <year>2013</year>
          , pp.
          <fpage>773</fpage>
          -
          <lpage>776</lpage>
          . doi:
          <volume>10</volume>
          .1145/2452376.2452478.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Hobbs</surname>
          </string-name>
          , Time ontology in owl,
          <source>W3C working draft, W3C</source>
          <volume>1</volume>
          (
          <year>2006</year>
          )
          <article-title>1</article-title>
          . URL: https: //www.w3.org/TR/2022/CRD-owl-time-
          <volume>20221115</volume>
          /.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>