<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Shared Micro-mobility Demand Forecasting using Gradient Boosting methods</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Antonios</forename><surname>Tziorvas</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Informatics</orgName>
								<orgName type="institution">University of Piraeus</orgName>
								<address>
									<settlement>Piraeus</settlement>
									<country key="GR">Greece</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">George</forename><forename type="middle">S</forename><surname>Theodoropoulos</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Informatics</orgName>
								<orgName type="institution">University of Piraeus</orgName>
								<address>
									<settlement>Piraeus</settlement>
									<country key="GR">Greece</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yannis</forename><surname>Theodoridis</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Informatics</orgName>
								<orgName type="institution">University of Piraeus</orgName>
								<address>
									<settlement>Piraeus</settlement>
									<country key="GR">Greece</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Shared Micro-mobility Demand Forecasting using Gradient Boosting methods</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">1EB111BA902DD76AD5E0BC35267142E8</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T18:37+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Gradient Boosting</term>
					<term>Demand Forecasting</term>
					<term>Spatial/Temporal Features</term>
					<term>Open shared mobility data</term>
					<term>E-scooters/E-bikes</term>
					<term>Urban micro-mobility management</term>
					<term>Intelligent Transportation Systems</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Urban demand forecasting plays a critical role in optimizing routing, dispatching, and congestion management within Intelligent Transportation Systems. By leveraging data fusion and analytics techniques, traffic density estimation serves as a key intermediate measure for identifying and predicting emerging demand patterns. In this paper, we propose two gradient boosting model variations, one for classification and one for regression, both capable of generating demand forecasts at various temporal horizons, from 5 minutes up to one hour. Our approach effectively integrates spatial and temporal features, enabling accurate predictions that are essential for improving the efficiency of shared (micro-)mobility services. To evaluate the effectiveness of our approach, we utilize open shared mobility data derived from e-scooters and e-bikes networks in two Dutch metropolitan areas. These real-world datasets enable us to validate our approach and demonstrate its effectiveness in capturing the complexities of modern urban mobility. Ultimately, our methodology offers novel insights on urban micro-mobility management, helping to tackle the challenges arising from rapid urbanization and thus, contributing to more sustainable, efficient, and liveable cities.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Urban shared mobility, also known as Mobility-as-a-Service (MaaS) <ref type="bibr" target="#b0">[1]</ref>, integrates transportation modes, such as public transit, micromobility services (e.g., bike-and scootersharing), and commute-based models (e.g., carpooling). The need for accurate mobility pattern forecasting is growing rapidly as real-time data analytics in Intelligent Transportation Systems (ITS) help alleviate congestion, reduce travel times, and enhance road safety in increasingly complex urban environments. In shared mobility services, including ride-hailing and bike-sharing, predicting demand across spatial and temporal dimensions is essential for efficient resource allocation, reduced waiting times, and optimized fleet deployment. This insight supports smart city initiatives by informing human-centric urban infrastructure design and facilitating sustainable development through data-driven decisions in areas, such as energy consumption, public transit planning, and emergency services.</p><p>Several Machine Learning (ML) approaches have been proposed for detecting and forecasting spatio-temporal patterns in timeseries, including Long Short-Term Memory (LSTM) networks <ref type="bibr" target="#b1">[2]</ref>, Graph Neural Networks (GNNs) <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, and Diffusion-based Models <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref>. However, these models often suffer from high computational complexity due to the intricate nature of spatio-temporal data, which combines spatial correlations with temporal dependencies <ref type="bibr" target="#b6">[7]</ref>.</p><p>A common representation for such data is the Spatio-Temporal Graph (STG), where nodes denote spatial locations and edges encode relationships over time. Capturing both spatial and temporal dependencies requires advanced architectures, such as Spatio-Temporal Graph Neural Networks (STGNNs) <ref type="bibr" target="#b7">[8]</ref> or Spatio-Temporal Graph Convolutional Networks (STGCNs) <ref type="bibr" target="#b8">[9]</ref>. However, these methods impose sig-nificant computational demands, limiting their real-world applicability. <ref type="bibr" target="#b9">[10]</ref>. While optimization techniques such as compression and pruning can reduce model size and inference time, they often come at the cost of decreased predictive accuracy.</p><p>This trade-off between complexity, accuracy, and efficiency underscores the need for alternative methodologies. In this work, we propose a novel approach based on gradientboosted trees, a class of models known for their superior performance on structured tabular data <ref type="bibr" target="#b10">[11]</ref>. Our method effectively forecasts micro-mobility demand across spatial and temporal dimensions while maintaining computational efficiency, making it suitable for large-scale deployment.</p><p>In our approach, we provide a robust feature extraction pipeline that can capture both spatial and temporal dependencies and integrate them into our model. We also present a gradient boosting ML algorithm capable of predicting the demand in a given area, either in the form of levels (e.g., 'Low', 'Medium', 'High') through a classifier, or in absolute demand values through a regressor. The performance of our model is evaluated using two real-world micro-mobility datasets and the results turn out to be quite promising since the our approach can effectively adapt to the uniqueness of each area and adequately model its intricacies.</p><p>The rest of this paper is structured as follows: Section 2 reviews related work, Section 3 introduces our forecasting methodology, including the feature extraction process and training configuration. Section 4 describes the experimental setup and presents the results of both predictive models, offering detailed performance metrics. Finally, Section 5 concludes the paper with a summary of the key contributions and directions for future research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>A wide variety of modeling approaches have been proposed to tackle the challenges related to spatio-temporal forecasting. In this section, we will identify and review previous approaches in the context of shared mobility demand forecasting by categorizing them in three groups: (traditional or advanced) ML approaches, deep learning models, and ensemble and hybrid methods, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">ML Approaches</head><p>Traditional machine learning (ML) methods provide a stable analytical foundation for structured predictive problems. Feng et al. <ref type="bibr" target="#b11">[12]</ref> developed a predictive model for bike-sharing demand in Chicago using Poisson regression, incorporating time, weather, population density, and activity density as features. Additionally, a Random Forest model was employed to enhance predictive accuracy through ensemble learning, aiding in demand-supply optimization and the identification of potential new station locations. Similarly, Xiao et al. <ref type="bibr" target="#b12">[13]</ref> applied clustering techniques to forecast short-term car demand, utilizing an improved k-nearest neighbor (kNN) model for online car-hailing services in Hefei City, China. Their results highlight the effectiveness of clustering methodologies in enhancing demand prediction accuracy.</p><p>A notable advancement in bike-share traffic prediction is the two-step pattern recognition model proposed by <ref type="bibr">Sohrabi &amp; Ermagun [14]</ref>. This approach first generates traffic profiles, where bike station traffic is represented as timeseries profiles with 𝑡-minute intervals and an overlap length 𝑂. The kNN algorithm is then applied to identify spatiotemporal similarities, incorporating historical traffic data, temporal factors (e.g., weather, weekdays), and spatial characteristics (e.g., socio-demographics, land use, infrastructure). By leveraging weighted Euclidean distance metrics, this method enables accurate short-and long-term traffic forecasting at both station and system-wide levels, demonstrating the efficacy of integrating spatiotemporal features with ML techniques.</p><p>Smith et al. <ref type="bibr" target="#b14">[15]</ref> employ ML to predict bicycle and pedestrian traffic patterns across 20 U.S. metropolitan statistical areas (MSAs). Building on previous efforts by Le et al. <ref type="bibr" target="#b15">[16]</ref>, where stepwise linear regression models were developed based on Census-derived, neighborhood-level covariates, this study incorporates traffic counts aggregated from 4,145 locations alongside novel street-level data, including Google Street View imagery and Point of Interest (PoI) data. The authors evaluate several ML regressors, including linear, ridge, lasso, bagging, gradient boosting, and random forest, reporting significant improvements in predictive accuracy over traditional linear regression, particularly when using augmented datasets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Deep Learning Models</head><p>Ge et al. <ref type="bibr" target="#b16">[17]</ref> introduced a Self-Attention ConvLSTM (SA-ConvLSTM) model to enhance the accuracy of ConvLSTM for online car-hailing demand forecasting. By converting car-hailing trajectories into grid-based images, the model incorporates a self-attention module to capture long-range spatiotemporal dependencies, leveraging pairwise similarity scores across input and memory positions.</p><p>Luo et al. <ref type="bibr" target="#b17">[18]</ref> proposed a Spatial-Temporal Diffusion Convolutional Network (ST-DCN) to address limitations in modeling dynamic spatiotemporal dependencies for taxi demand forecasting. Their approach integrates a two-phase graph diffusion convolutional network with an attention mechanism to model spatial dependencies, while a temporal convolution module captures long-term trends, including recent, daily, and weekly patterns. The use of stacked convolution layers further enhances the model's ability to process extended timeseries sequences.</p><p>Li et al. <ref type="bibr" target="#b18">[19]</ref> proposed a hierarchical framework for proactive bike redistribution in bike-sharing systems. A bipartite clustering algorithm groups stations, followed by citywide rental prediction using a Gradient Boosting Regression Tree. Rental proportions and inter-cluster transitions are then estimated via a multi-similarity-based model to predict station-level rentals and returns. Experiments on real-world datasets from New York City and Washington D.C. indicate significant performance gains over baseline models, particularly during periods of atypical demand.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Ensemble and Hybrid Models</head><p>The concept of ensemble learning, which involves combining the predictions of multiple simple models to form a new, more robust model, has been extensively explored in the literature and has been shown to produce better results compared to its individual constituent models. Building on this idea, Yuming et al. <ref type="bibr" target="#b19">[20]</ref> developed a stacking ensemble learning framework that integrates the predictions of three distinct base learners: Random Forest, LightGBM, and Long Short-Term Memory (LSTM) networks. These base learners were trained in parallel using a common set of input features, including temporal, spatial, and weather-related variables. The predictions generated by each base learner were then aggregated by a Support Vector Regression (SVR) meta-learner that produced the final demand forecast values.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Positioning of Our Work with Respect to State-of-the-Art</head><p>Our approach diverges from these approaches in multiple ways. Firstly, our methodology employs a unified modeling workflow that can tackle the problem in two alternative ways, by either regression or classification, allowing us to adjust to specific needs. Additionally, our feature engineering pipeline integrates both spatial and temporal features which are fed to a tuned gradient boosting model, enabling us to better identify the intricacies of urban micro-mobility. Lastly, by utilizing two real-world datasets for evaluation, our method can be adequately validated under different scenarios, ensuring it is effective and robust. Nevertheless, as part of our future research, we plan to align related work with our approach for a fair comparison under the same settings over the same real datasets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Our Approach for Efficient Shared Micro-Mobility Demand Forecasting</head><p>In this section, we define the shared mobility demand forecasting problem and present our proposed approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Problem Definition</head><p>Generally speaking, a spatio-temporal timeseries is a sequence of data points, where each one is associated with a time index and a spatial location. For the purposes of our work, an area of interest (a city, a region, etc.) is partitioned into a set of geographical districts, where each district 𝐴 is represented as a polygon. We denote 𝐷 𝐴 𝑡 as the percentile-encoded demand of micro-mobility vehicles within district 𝐴 at time 𝑡. Given 𝜏 past observations for district 𝐴, we construct its spatio-temporal timeseries as</p><formula xml:id="formula_0">𝑋 𝐴 𝑡 = {𝐷 𝐴 𝑡−𝜏 , 𝐷 𝐴 𝑡−𝜏 +1 , . . . , 𝐷 𝐴 𝑡−1 , 𝐷 𝐴 𝑡 }.</formula><p>The problem addressed in this paper is to predict the micro-mobility demand value 𝐷 𝐴 𝑡+𝑚 for district 𝐴, 𝑚 time steps ahead, given 𝑋 𝐴 𝑡 i.e., the current along with the past 𝜏 observations. We will exemplify the above definition through a real case extracted from our real-world dataset (dataset details are presented in Section 4). As illustrated in Figure <ref type="figure" target="#fig_0">1</ref>, the problem at hand is to predict (as accurately as possible) the anticipated demand values of each district's demand timeseries (in orange) given the current and some past demand values (in blue), taking into account relevant (temporal, spatial, context) features. To efficiently achieve this goal, seasonal etc. patterns hidden in these timeseries should be disclosed. For instance, looking at the timeseries of two major districts in Rotterdam that are displayed in Figure <ref type="figure" target="#fig_0">1</ref>, we identify seasonal commuting patterns: the highest and most spread peak occurs on Friday, due to a combination of leisure and work time. This is made more evident in Figure <ref type="figure" target="#fig_1">2</ref>, which showcases the combination of the hour-of-day and day-of-week features in Rotterdam Centrum district. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Methodology</head><p>We propose the so-called Gradient Boosting Demand Predictor, in two variations, differentiated by their output type:</p><p>• A 𝑁 -class classifier (per horizon per district) • A regressor (per horizon per district)</p><p>The purpose of the classifier is to classify the future demand of a district into predefined levels (e.g., 'Low', 'Medium', 'High'), whereas the regressor aims to predict the actual demand values for the respective horizon. In other words, for an area of interest partitioned into 𝑀 districts with the goal of 𝑇 individual prediction horizons, both architectures require training 𝑇 × 𝑀 models, where each model predicts the output for a specific district-horizon pair. The reasoning behind training individual models per district is to allow each model to specialize in its respective spatial region, focusing exclusively on its unique spatial and temporal patterns. Nevertheless, in our future work we plan to assess the efficiency of single global model for all districts (where the district ID will be given as input to the model).</p><p>As illustrated in Figure <ref type="figure" target="#fig_2">3</ref>, the initial step of our methodology is the partitioning of the area of interest into districts,  modeled as an adjacency graph that allows us to extract the neighbors of each district. Two districts are defined as neighbors if they share a common boundary <ref type="foot" target="#foot_0">1</ref> . The demands of the target as well as of the neighboring districts are represented by the respective spatiotemporal timeseries (step 2), as defined in Section 3.1, which are fed into a feature extraction pipeline (step 3), to be detailed in Section 3.3. Finally, all these features are combined into a single, augmented dataset (step 4) to be used for the purposes of training our two models, the classifier and the regressor (step 5).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Feature Extraction</head><p>Given a timeseries 𝑋 𝐴 𝑡 of a target district 𝐴, feature extraction is performed to enhance the spatio-temporal modeling capabilities of the proposed method. The extracted features are categorized into four primary groups: time-related, lagged, rolling, and exponentially weighted features. These features are derived not only from the target district but also from its neighboring districts, incorporating both spatial and temporal dependencies.</p><p>Time-related capture temporal patterns influencing the target variable. Key attributes include the hour of the day (0-23) to identify daily patterns like peak periods, and the day of the week (0-6, with 0 as Monday) to capture weekly trends. Monthly variations are represented by the month (0-11, with 0 as January), accounting for seasonal behaviors. Additionally, the minute component (0-59) highlights finer time dependencies and intra-hour variations.</p><p>The second category of features, denoted to as lagged features, is designed to capture the temporal dependencies between past observations of the timeseries and its current state. For each district, a set of lagged features is generated by systematically shifting the original timeseries observations backward in time. Formally, given a timeseries 𝑋 𝐴 𝑡 representing the observed values of a target district A at time 𝑡, the lagged feature at a temporal offset 𝜏 is defined as 𝑋 𝐴 𝑡−𝜏 . A range of temporal offsets is selected to ensure that both short-term and long-term dependencies are captured. Specifically, the lagged features are computed at intervals of 1, 5, 10, and 15 minutes, capturing short-term temporal dependencies across multiple time horizons.   This shifting operation effectively creates additional feature representations of the past states of the system, allowing the model to learn historical patterns that influence future predictions. By incorporating lagged features into the predictive framework, the approach enhances its ability to capture recurrent patterns, trends, and autocorrelations inherent in spatio-temporal mobility data.</p><p>The rolling features provide a smoothed representation of the data over various intervals, capturing localized fluctuations that are critical for accurate forecasting. These features are computed using sliding windows, where statistical measures are aggregated over a sequence of recent time steps to encapsulate short-term trends and variability in mobility patterns. For each district, a rolling window of varying sizes-ranging from 5 to 15 minutes in 5-minute increments-is applied to the timeseries data. Within each window, the mean is computed to capture central tendency and, over larger windows, the degree of short-term variability.</p><p>Finally, the objective of the exponentially weighted features is to prioritize recent observations while retaining information from historical data. This is achieved through the application of exponentially weighted techniques, which assign progressively smaller weights to older data points, ensuring that the model places greater emphasis on more recent trends. Among these methods, the Exponential Weighted Moving Average (EWMA) is particularly effective, as it assigns exponentially decreasing weights to past observations, thereby capturing temporal dependencies in a manner that balances adaptability and historical context. For a given timeseries 𝑋𝑡, the EWMA at time 𝑡 is computed recursively as 𝐸𝑊 𝑀 𝐴𝑡 = 𝛼𝑋𝑡 + (1 − 𝛼)𝐸𝑊 𝑀 𝐴𝑡−1 where 𝛼 represents the smoothing factor, typically defined as 𝛼 =<ref type="foot" target="#foot_1">2</ref> 𝑁 +1 for a given window size 𝑁 . This formulation allows the predictive framework to swiftly adapt to evolving patterns, including sudden variations induced by external factors such as weather conditions, road incidents, or public events, while maintaining a stable representation of long-term trends.</p><p>The features used in our methodology are summarized in Table <ref type="table" target="#tab_2">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Experimental Study</head><p>This section describes the data and preprocessing steps used in our experimental study (Section 4.1) as well as our findings (Section 4.2). The experiments were conducted on a server with an AMD Epyc 64-Core CPU and an Nvidia A100 GPU with 40GB of memory. Our source code is publicly available for reproducibility purposes. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Experimental Setup</head><p>In our study, we utilize shared micro-mobility data provided by Deelfiets Nederland, a popular micro-mobility service provider in Netherlands, through an open API <ref type="foot" target="#foot_2">3</ref> . Data consists of real-time GPS positions for vehicles in the Netherlands, which we query every 60 seconds, creating a dataset of sufficient granularity for our use-case. The dataset includes the latitude, longitude, vehicle type and company name for each vehicle. For the purposes of this study, we utilize mobility data from two major metropolitan areas, Amsterdam and Rotterdam, focusing specifically on the ten most densely populated districts in each city. These districts exhibit diverse urban characteristics and distinct spatiotemporal patterns, making them well-suited for evaluating the model's adaptability to localized mobility trends. The temporal coverage of our data is about 2 months (November 11, 2024 -January 15, 2025), and 5 months (August 06, 2024 -January 15, 2025), respectively. Since the original data source is a stream instead of a static dataset, we manually append the temporal indicator in each subsequent request.</p><p>To derive demand estimates at the district level, a spatial filtering and aggregation procedure is applied. First, a point-polygon intersection is performed to associate each mobility trace with its corresponding district boundaries, discarding traces that fall outside the predefined study areas. Next, the filtered traces are aggregated based on their unique spatio-temporal identifiers (i.e., timestamp and district ID), summing the occurrences within each region to obtain demand estimates. This transformation preserves the spatial and temporal integrity of the data while ensuring compatibility with the forecasting models. In its final form, the dataset consists of 89,022 observations across 97 districts in Amsterdam and 218,325 observations across 22 districts in Rotterdam.</p><p>To validate our models, we use a 70-30 train-test temporal split validation method. To ensure effective training of the classifier, additional preprocessing steps were taken apart from the feature extraction process discovered above. To this extent, a quantile-based discretization function was employed to transform continuous density values into 3 discrete demand levels (Low, Medium, High, as they will be defined in the following paragraph). This transformation enhances the classifier's ability to model class distinctions and mitigates the challenges associated with imbalanced class distributions.</p><p>We define 'Medium' demand as the range where demand levels are centered around 50% of the value of the day with the highest demand, which serves as the baseline value. Specifically, for a given threshold 𝑑 ∈ (0, 50), the range [50 − 𝑑, 50 + 𝑑) is categorized as normal demand. Demand levels below (above) this range are classified as 'Low' ('High', respectively) demand. In our experiments, we set 𝑑 = 17, in order for the demand levels to correspond to three equally sized ranges. This threshold provides a balanced distribution of data across the defined demand categories, facilitating a robust analysis of varying demand patterns.</p><p>The optimization framework employed in this study incorporates Bayesian Optimization due to its ability to balance exploration and exploitation effectively. This approach facilitates the exploration of the search space for better solutions while simultaneously exploiting regions with high potential, thereby reducing the number of iterations required for optimization. Its suitability is particularly evident when dealing with extensive experimental evaluations, such as the computationally intensive training of ML models, as it leverages prior knowledge to guide sampling and optimize resources efficiently. Additionally, the probabilistic framework of Bayesian Optimization enables the automatic adjustment of the exploration-exploitation trade-off through its acquisition functions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Experimental Results</head><p>In this section, we assess the overall performance of our models across multiple districts and timestamps. Table <ref type="table">2</ref> summarizes the training and inference times of our models across different prediction horizons in the two cities of interest. Training time corresponds to the overall time required to train the respective model, whereas inference time corresponds to the average time taken to generate a specific prediction for a given time horizon. It turns out that training of our models is extremely fast since it takes 1-2 seconds or even less to be performed, while the inference time is impressively in the order of 1 microsecond. This analysis showcases the computational efficiency of the models, providing insights into their scalability and suitability for real-world spatio-temporal forecasting tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 2</head><p>Training and inference times of the two models across different horizons. Tables 3-6 present the predictive performance of the proposed spatio-temporal forecasting models evaluated at four prediction horizons: 5, 15, 30, and 60 minutes. The quality metrics we used include 𝑅 2 , RMSE, sMAPE and MASE for the regressor and F1-score, Accuracy, Recall and Precision for the classifier. These metrics collectively assess the model's predictive accuracy, robustness, and relative performance across districts with varying population sizes and temporal dynamics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>City</head><p>Aggregated metrics are added to summarize the overall performance across all districts, offering insights into the model's general effectiveness and consistency across varying prediction horizons. It is evident that both architectures perform really well at both short-term (5-15 min.) and longterm (30-60 min.) forecasting. From the summary statistics we can deduce that both models consistently perform well for the vast majority of the districts. Indicatively, the regressor's 𝑅 2 ranges on the average from 0.94 down to 0.84 as we move from lower to higher prediction horizons. As for the classifier, 𝐹 1-score ranges on the average from 0.89 to 0.80, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">A Note on Feature Importance</head><p>In order to assess the effect of the underlying features in the quality of prediction of either the classifier or the regressor, we present the average feature importance across all prediction horizons for both models. The feature importance values were computed using XGBoost's internal feature importance computation algorithm. Given the minimal contributions of some features (importance values below 10 −4 ), the y-axis of the corresponding plots has been log-scaled to improve visualization. The results of this analysis are grouped by city and illustrated in Figures <ref type="figure">4 and 5</ref>.</p><p>In both figures we can see that the average contribution of the features related to the target is higher than those of the neighbors, as expected. Overall, in decreasing order, the most important features appear to be the target_ewm_*, the target_rolling_mean_*, and the target_lag_*, denoting the EWMA, the rolling mean, and the lagged value, respectively, </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>This study investigated spatio-temporal forecasting methods for density prediction in urban environments. By integrating spatial dependencies with temporal trends, the pro-  posed methodology effectively captured localized patterns, generating accurate forecasts across multiple horizons. Key contributions include a flexible feature engineering pipeline that incorporates both intra-and inter-district interactions, alongside the application of efficient gradient boosting architectures. This approach enhances predictive accuracy while minimizing computational overhead, rendering it suitable for large-scale forecasting tasks. Experimental results demonstrated the efficacy of gradient-boosted density predictors for both regression and classification, exhibiting competitive performance across varying prediction horizons. Although accuracy slightly declines for longer horizons, the model remains robust, underscoring its adaptability and practical applicability in real-world mobility forecasting.</p><p>This research also unveils opportunities for future exploration. Specifically, the integration of feature interactions and their contributions to the overall model quality warrant further investigation. Incorporating external factors, such as weather conditions or public events, could potentially enhance predictive performance. Additionally, applying the proposed methodology to other geographical areas and diverse mobility scenarios would help validate its generalization capabilities and adaptability to varying contexts. Finally, a comprehensive experimental comparison with related work under specific settings is planned to facilitate fair benchmarking.</p><p>In conclusion, this study showcases the ability of datadriven approaches to effectively tackle spatio-temporal fore-casting challenges. By leveraging the inherent spatial segmentation of cities into districts, the methodology enables the extraction of localized temporal patterns, facilitating more informed decision-making and contributing to smarter, more efficient urban planning from the perspective of mobility data science <ref type="bibr" target="#b20">[21]</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Two indicative timeseries for two regions of Rotterdam: Delfshaven (top) and Rotterdam Centrum (bottom). The x-axis denotes time, while the y-axis indicates the measured demand values.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Average hourly demand across the week in Rotterdam Centrum. The coloring indicates the relative demand intensity and, along with the peaks, aid in highlighting daily and weekly usage patterns.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Illustration of our Gradient Boosting Density Predictor proposed methodology</figDesc><graphic coords="4,72.00,65.61,451.26,100.32" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>the current district at the present time step neighbor_* Feature variable of each of the target's neighbors in neighboring districts Time-Related hour-of-day Hour of the day of the timestamp (0-23) minute-of-hour Minute of the hour of the timestamp (0-59) day-of-week Day of the week of the timestamp (0=Monday, 6=Sunday) month-of-year Month of the year of the timestamp (0=January, 11=December) Lagged neighbor_lag_{1, 5, 10, 15} Lagged value ({1, 5, 10, 15}-step(s) back) of the target variable for neighboring districts target_lag_{1, 5, 10, 15} Lagged value ({1, 5, 10, 15}-step(s) back) of the target variable for the current district Rolling neighbor_rolling_mean_{5, 10, 15} Rolling mean of neighbors over the last {5, 10, 15} observations target_rolling_mean_{5, 10, 15} Rolling mean of the target variable over the last {5, 10, 15} observations EWMA neighbor_ewm_{5, 10, 15} Exponentially weighted moving average of neighbors with a span of {5, 10, 15} target_ewm_{5, 10, 15} Exponentially weighted moving average of the target variable with a span of {5, 10, 15}</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>2</head><label>2</label><figDesc></figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 4 :Figure 5 :</head><label>45</label><figDesc>Figure 4: Average feature importance across all 4 prediction horizons of the regressor in Rotterdam (left) and Amsterdam (right)</figDesc><graphic coords="7,72.00,65.61,451.28,225.64" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 1</head><label>1</label><figDesc>Summary table of the extracted features used in our approach</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 3</head><label>3</label><figDesc>Predictive performance of the regressor in Rotterdam across different prediction horizons.</figDesc><table><row><cell>District</cell><cell>Avg Pop</cell><cell>R 2</cell><cell cols="3">5 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">15 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">30 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">60 min RMSE MAE sMAPE</cell></row><row><cell>Kralingen-Crooswijk</cell><cell>507</cell><cell>0.97</cell><cell>9.70</cell><cell>6.94</cell><cell>0.01</cell><cell>0.96</cell><cell>11.00</cell><cell>7.86</cell><cell>0.02</cell><cell>0.95</cell><cell>13.17</cell><cell>9.81</cell><cell>0.02</cell><cell>0.90</cell><cell>18.11</cell><cell>13.45</cell><cell>0.03</cell></row><row><cell>Rotterdam Centrum</cell><cell>502</cell><cell>0.99</cell><cell>15.25</cell><cell>11.18</cell><cell>0.02</cell><cell>0.98</cell><cell>20.33</cell><cell>14.77</cell><cell>0.02</cell><cell>0.97</cell><cell>22.95</cell><cell>17.37</cell><cell>0.03</cell><cell>0.93</cell><cell>34.66</cell><cell>26.62</cell><cell>0.04</cell></row><row><cell>Hillegersberg-Schiebroek</cell><cell>476</cell><cell>0.80</cell><cell>17.36</cell><cell>12.67</cell><cell>0.02</cell><cell>0.76</cell><cell>18.82</cell><cell>14.06</cell><cell>0.03</cell><cell>0.73</cell><cell>20.03</cell><cell>15.20</cell><cell>0.03</cell><cell>0.68</cell><cell>21.79</cell><cell>16.78</cell><cell>0.03</cell></row><row><cell>Delfshaven</cell><cell>347</cell><cell>0.98</cell><cell>6.43</cell><cell>4.70</cell><cell>0.01</cell><cell>0.97</cell><cell>8.03</cell><cell>5.75</cell><cell>0.02</cell><cell>0.96</cell><cell>10.13</cell><cell>7.27</cell><cell>0.02</cell><cell>0.92</cell><cell>13.78</cell><cell>9.97</cell><cell>0.03</cell></row><row><cell>Feijenoord</cell><cell>347</cell><cell>0.97</cell><cell>6.31</cell><cell>4.61</cell><cell>0.01</cell><cell>0.92</cell><cell>10.09</cell><cell>7.72</cell><cell>0.02</cell><cell>0.94</cell><cell>9.29</cell><cell>7.00</cell><cell>0.02</cell><cell>0.92</cell><cell>10.31</cell><cell>7.63</cell><cell>0.02</cell></row><row><cell>Noord</cell><cell>307</cell><cell>0.88</cell><cell>15.69</cell><cell>10.11</cell><cell>0.03</cell><cell>0.85</cell><cell>17.49</cell><cell>11.61</cell><cell>0.03</cell><cell>0.81</cell><cell>19.76</cell><cell>13.61</cell><cell>0.04</cell><cell>0.73</cell><cell>23.96</cell><cell>16.87</cell><cell>0.04</cell></row><row><cell>Prins Alexander</cell><cell>303</cell><cell>0.99</cell><cell>6.04</cell><cell>4.37</cell><cell>0.01</cell><cell>0.99</cell><cell>5.91</cell><cell>4.26</cell><cell>0.01</cell><cell>0.98</cell><cell>6.76</cell><cell>4.92</cell><cell>0.01</cell><cell>0.98</cell><cell>7.99</cell><cell>5.87</cell><cell>0.02</cell></row><row><cell>Charlois</cell><cell>242</cell><cell>0.97</cell><cell>5.24</cell><cell>3.84</cell><cell>0.02</cell><cell>0.97</cell><cell>5.93</cell><cell>4.42</cell><cell>0.02</cell><cell>0.96</cell><cell>6.89</cell><cell>5.20</cell><cell>0.02</cell><cell>0.94</cell><cell>7.84</cell><cell>5.92</cell><cell>0.03</cell></row><row><cell>Ĳsselmonde</cell><cell>184</cell><cell>0.94</cell><cell>5.25</cell><cell>3.25</cell><cell>0.02</cell><cell>0.90</cell><cell>6.58</cell><cell>4.18</cell><cell>0.03</cell><cell>0.89</cell><cell>7.11</cell><cell>4.50</cell><cell>0.03</cell><cell>0.84</cell><cell>8.37</cell><cell>5.57</cell><cell>0.04</cell></row><row><cell>Spaanse Polder</cell><cell>36</cell><cell>0.90</cell><cell>2.18</cell><cell>1.36</cell><cell>0.03</cell><cell>0.69</cell><cell>3.88</cell><cell>2.18</cell><cell>0.05</cell><cell>0.87</cell><cell>2.53</cell><cell>1.76</cell><cell>0.04</cell><cell>0.54</cell><cell>4.75</cell><cell>3.43</cell><cell>0.07</cell></row><row><cell>Average (𝜇 ± 𝜎)</cell><cell></cell><cell>0.94 ± 0.06</cell><cell>8.94 ± 5.29</cell><cell>6.30 ± 3.77</cell><cell>0.02 ± 0.01</cell><cell>0.90 ± 0.10</cell><cell>10.80 ± 5.98</cell><cell>7.68 ± 4.41</cell><cell>0.02 ± 0.01</cell><cell>0.90 ± 0.08</cell><cell>11.86 ± 6.86</cell><cell>8.66 ± 5.17</cell><cell>0.03 ± 0.01</cell><cell>0.84 ± 0.14</cell><cell>15.16 ± 9.40</cell><cell>11.21 ± 7.22</cell><cell>0.03 ± 0.02</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 4</head><label>4</label><figDesc>Predictive performance of the classifier in Rotterdam across different prediction horizons.</figDesc><table><row><cell>District</cell><cell>Avg Pop</cell><cell></cell><cell cols="2">5 min</cell><cell></cell><cell></cell><cell cols="2">15 min</cell><cell></cell><cell></cell><cell cols="2">30 min</cell><cell></cell><cell></cell><cell cols="2">60 min</cell><cell></cell></row><row><cell></cell><cell></cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell></row><row><cell>Kralingen-Crooswijk</cell><cell>507</cell><cell>0.92</cell><cell>0.92</cell><cell>0.92</cell><cell>0.92</cell><cell>0.90</cell><cell>0.90</cell><cell>0.90</cell><cell>0.90</cell><cell>0.89</cell><cell>0.88</cell><cell>0.88</cell><cell>0.88</cell><cell>0.85</cell><cell>0.85</cell><cell>0.85</cell><cell>0.85</cell></row><row><cell>Rotterdam Centrum</cell><cell>502</cell><cell>0.91</cell><cell>0.99</cell><cell>0.96</cell><cell>0.87</cell><cell>0.87</cell><cell>0.98</cell><cell>0.94</cell><cell>0.80</cell><cell>0.83</cell><cell>0.98</cell><cell>0.92</cell><cell>0.78</cell><cell>0.75</cell><cell>0.97</cell><cell>0.84</cell><cell>0.70</cell></row><row><cell>Hillegersberg-Schiebroek</cell><cell>476</cell><cell>0.75</cell><cell>0.95</cell><cell>0.82</cell><cell>0.71</cell><cell>0.73</cell><cell>0.94</cell><cell>0.79</cell><cell>0.70</cell><cell>0.71</cell><cell>0.94</cell><cell>0.77</cell><cell>0.67</cell><cell>0.67</cell><cell>0.93</cell><cell>0.76</cell><cell>0.63</cell></row><row><cell>Delfshaven</cell><cell>347</cell><cell>0.91</cell><cell>0.96</cell><cell>0.91</cell><cell>0.91</cell><cell>0.89</cell><cell>0.95</cell><cell>0.90</cell><cell>0.89</cell><cell>0.86</cell><cell>0.93</cell><cell>0.86</cell><cell>0.87</cell><cell>0.80</cell><cell>0.90</cell><cell>0.81</cell><cell>0.80</cell></row><row><cell>Feijenoord</cell><cell>347</cell><cell>0.88</cell><cell>0.95</cell><cell>0.90</cell><cell>0.87</cell><cell>0.87</cell><cell>0.94</cell><cell>0.89</cell><cell>0.85</cell><cell>0.84</cell><cell>0.93</cell><cell>0.88</cell><cell>0.81</cell><cell>0.74</cell><cell>0.90</cell><cell>0.82</cell><cell>0.71</cell></row><row><cell>Noord</cell><cell>307</cell><cell>0.82</cell><cell>0.99</cell><cell>0.85</cell><cell>0.80</cell><cell>0.71</cell><cell>0.98</cell><cell>0.74</cell><cell>0.69</cell><cell>0.55</cell><cell>0.91</cell><cell>0.55</cell><cell>0.63</cell><cell>0.59</cell><cell>0.97</cell><cell>0.62</cell><cell>0.58</cell></row><row><cell>Prins Alexander</cell><cell>303</cell><cell>0.97</cell><cell>0.98</cell><cell>0.97</cell><cell>0.97</cell><cell>0.97</cell><cell>0.98</cell><cell>0.97</cell><cell>0.97</cell><cell>0.96</cell><cell>0.98</cell><cell>0.96</cell><cell>0.96</cell><cell>0.96</cell><cell>0.97</cell><cell>0.96</cell><cell>0.95</cell></row><row><cell>Charlois</cell><cell>242</cell><cell>0.91</cell><cell>0.91</cell><cell>0.92</cell><cell>0.91</cell><cell>0.90</cell><cell>0.90</cell><cell>0.90</cell><cell>0.90</cell><cell>0.89</cell><cell>0.89</cell><cell>0.89</cell><cell>0.89</cell><cell>0.87</cell><cell>0.87</cell><cell>0.87</cell><cell>0.87</cell></row><row><cell>IJsselmonde</cell><cell>184</cell><cell>0.92</cell><cell>0.97</cell><cell>0.92</cell><cell>0.92</cell><cell>0.91</cell><cell>0.97</cell><cell>0.91</cell><cell>0.91</cell><cell>0.90</cell><cell>0.97</cell><cell>0.90</cell><cell>0.91</cell><cell>0.89</cell><cell>0.97</cell><cell>0.89</cell><cell>0.90</cell></row><row><cell>Spaanse Polder</cell><cell>36</cell><cell>0.95</cell><cell>0.99</cell><cell>0.98</cell><cell>0.92</cell><cell>0.90</cell><cell>0.98</cell><cell>0.94</cell><cell>0.87</cell><cell>0.93</cell><cell>0.98</cell><cell>0.96</cell><cell>0.90</cell><cell>0.91</cell><cell>0.98</cell><cell>0.95</cell><cell>0.88</cell></row><row><cell>Average (𝜇 ± 𝜎)</cell><cell></cell><cell>0.89 ± 0.06</cell><cell>0.96 ± 0.03</cell><cell>0.91 ± 0.05</cell><cell>0.88 ± 0.07</cell><cell>0.87 ± 0.08</cell><cell>0.95 ± 0.03</cell><cell>0.89 ± 0.07</cell><cell>0.85 ± 0.09</cell><cell>0.84 ± 0.12</cell><cell>0.94 ± 0.04</cell><cell>0.86 ± 0.12</cell><cell>0.83 ± 0.11</cell><cell>0.80 ± 0.11</cell><cell>0.93 ± 0.05</cell><cell>0.84 ± 0.10</cell><cell>0.79 ± 0.12</cell></row></table><note>of the target variable. Focusing on the time-related features only, hour-of-day and day-of-week show the relatively highest importance in both cases, underlying the presence of seasonal patterns.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 5</head><label>5</label><figDesc>Predictive performance of the regressor in Amsterdam across different prediction horizons.</figDesc><table><row><cell>District</cell><cell>Avg. Pop</cell><cell>R 2</cell><cell cols="3">5 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">15 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">30 min RMSE MAE sMAPE</cell><cell>R 2</cell><cell cols="3">60 min RMSE MAE sMAPE</cell></row><row><cell>Oostelijk Havengebied</cell><cell>110</cell><cell>0.97</cell><cell>2.34</cell><cell>1.83</cell><cell>0.02</cell><cell>0.96</cell><cell>2.75</cell><cell>2.16</cell><cell>0.02</cell><cell>0.95</cell><cell>3.24</cell><cell>2.53</cell><cell>0.02</cell><cell>0.92</cell><cell>4.16</cell><cell>3.24</cell><cell>0.03</cell></row><row><cell>Buitenveldert-West</cell><cell>87</cell><cell>0.97</cell><cell>2.12</cell><cell>1.57</cell><cell>0.02</cell><cell>0.95</cell><cell>2.66</cell><cell>1.96</cell><cell>0.02</cell><cell>0.93</cell><cell>3.28</cell><cell>2.40</cell><cell>0.02</cell><cell>0.87</cell><cell>4.37</cell><cell>3.17</cell><cell>0.03</cell></row><row><cell>Burgwallen-Nieuwe Zijde</cell><cell>72</cell><cell>0.98</cell><cell>1.91</cell><cell>1.41</cell><cell>0.02</cell><cell>0.96</cell><cell>2.55</cell><cell>1.91</cell><cell>0.03</cell><cell>0.93</cell><cell>3.38</cell><cell>2.58</cell><cell>0.03</cell><cell>0.89</cell><cell>4.35</cell><cell>3.35</cell><cell>0.04</cell></row><row><cell>Jordaan</cell><cell>71</cell><cell>0.98</cell><cell>2.31</cell><cell>1.78</cell><cell>0.02</cell><cell>0.94</cell><cell>4.04</cell><cell>2.75</cell><cell>0.03</cell><cell>0.91</cell><cell>4.67</cell><cell>3.42</cell><cell>0.04</cell><cell>0.85</cell><cell>6.14</cell><cell>4.66</cell><cell>0.05</cell></row><row><cell>Scheldebuurt</cell><cell>71</cell><cell>0.98</cell><cell>1.56</cell><cell>1.19</cell><cell>0.01</cell><cell>0.96</cell><cell>2.06</cell><cell>1.57</cell><cell>0.02</cell><cell>0.94</cell><cell>2.66</cell><cell>2.02</cell><cell>0.02</cell><cell>0.89</cell><cell>3.50</cell><cell>2.64</cell><cell>0.03</cell></row><row><cell>Middenmeer</cell><cell>66</cell><cell>0.96</cell><cell>1.94</cell><cell>1.46</cell><cell>0.02</cell><cell>0.94</cell><cell>2.28</cell><cell>1.73</cell><cell>0.02</cell><cell>0.90</cell><cell>2.89</cell><cell>2.20</cell><cell>0.03</cell><cell>0.86</cell><cell>3.55</cell><cell>2.71</cell><cell>0.03</cell></row><row><cell>IJburg West</cell><cell>63</cell><cell>0.93</cell><cell>1.62</cell><cell>1.28</cell><cell>0.02</cell><cell>0.91</cell><cell>1.83</cell><cell>1.45</cell><cell>0.02</cell><cell>0.87</cell><cell>2.15</cell><cell>1.70</cell><cell>0.02</cell><cell>0.79</cell><cell>2.76</cell><cell>2.20</cell><cell>0.03</cell></row><row><cell>Westelijk Havengebied</cell><cell>63</cell><cell>0.88</cell><cell>3.72</cell><cell>1.60</cell><cell>0.03</cell><cell>0.85</cell><cell>4.05</cell><cell>1.75</cell><cell>0.03</cell><cell>0.83</cell><cell>4.28</cell><cell>1.87</cell><cell>0.04</cell><cell>0.78</cell><cell>4.74</cell><cell>2.20</cell><cell>0.04</cell></row><row><cell>Landlust</cell><cell>59</cell><cell>0.99</cell><cell>1.42</cell><cell>1.07</cell><cell>0.02</cell><cell>0.97</cell><cell>1.96</cell><cell>1.51</cell><cell>0.03</cell><cell>0.96</cell><cell>2.60</cell><cell>2.01</cell><cell>0.04</cell><cell>0.92</cell><cell>3.49</cell><cell>2.70</cell><cell>0.05</cell></row><row><cell>Nieuwmarkt/Lastage</cell><cell>58</cell><cell>0.92</cell><cell>2.82</cell><cell>1.94</cell><cell>0.02</cell><cell>0.88</cell><cell>3.34</cell><cell>2.37</cell><cell>0.03</cell><cell>0.76</cell><cell>4.71</cell><cell>3.35</cell><cell>0.04</cell><cell>0.70</cell><cell>5.29</cell><cell>3.88</cell><cell>0.05</cell></row><row><cell>Average (𝜇 ± 𝜎)</cell><cell></cell><cell>0.95 ± 0.04</cell><cell>2.18 ± 0.69</cell><cell>1.51 ± 0.28</cell><cell>0.02 ± 0.01</cell><cell>0.93 ± 0.04</cell><cell>2.75 ± 0.81</cell><cell>1.92 ± 0.41</cell><cell>0.02 ± 0.01</cell><cell>0.90 ± 0.06</cell><cell>3.38 ± 0.89</cell><cell>2.41 ± 0.59</cell><cell>0.03 ± 0.01</cell><cell>0.85 ± 0.07</cell><cell>4.24 ± 0.99</cell><cell>3.08 ± 0.77</cell><cell>0.04 ± 0.01</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 6</head><label>6</label><figDesc>Predictive performance of the classifier in Amsterdam across different prediction horizons.</figDesc><table><row><cell>District</cell><cell>Avg. Pop</cell><cell></cell><cell cols="2">5 min</cell><cell></cell><cell></cell><cell cols="2">15 min</cell><cell></cell><cell></cell><cell cols="2">30 min</cell><cell></cell><cell></cell><cell cols="2">60 min</cell><cell></cell></row><row><cell></cell><cell></cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell><cell>F1</cell><cell>Acc.</cell><cell cols="2">Prec. Rec.</cell></row><row><cell>Oostelijk Havengebied</cell><cell>110</cell><cell>0.90</cell><cell>0.92</cell><cell>0.93</cell><cell>0.89</cell><cell>0.89</cell><cell>0.90</cell><cell>0.89</cell><cell>0.88</cell><cell>0.86</cell><cell>0.88</cell><cell>0.87</cell><cell>0.85</cell><cell>0.76</cell><cell>0.78</cell><cell>0.75</cell><cell>0.79</cell></row><row><cell>Buitenveldert-West</cell><cell>87</cell><cell>0.92</cell><cell>0.93</cell><cell>0.94</cell><cell>0.91</cell><cell>0.90</cell><cell>0.91</cell><cell>0.92</cell><cell>0.89</cell><cell>0.88</cell><cell>0.89</cell><cell>0.89</cell><cell>0.86</cell><cell>0.83</cell><cell>0.86</cell><cell>0.85</cell><cell>0.82</cell></row><row><cell>Burgwallen-Nieuwe Zijde</cell><cell>72</cell><cell>0.94</cell><cell>0.94</cell><cell>0.95</cell><cell>0.94</cell><cell>0.92</cell><cell>0.92</cell><cell>0.92</cell><cell>0.91</cell><cell>0.89</cell><cell>0.89</cell><cell>0.89</cell><cell>0.89</cell><cell>0.85</cell><cell>0.84</cell><cell>0.85</cell><cell>0.84</cell></row><row><cell>Jordaan</cell><cell>71</cell><cell>0.91</cell><cell>0.94</cell><cell>0.94</cell><cell>0.89</cell><cell>0.87</cell><cell>0.92</cell><cell>0.89</cell><cell>0.85</cell><cell>0.84</cell><cell>0.90</cell><cell>0.87</cell><cell>0.81</cell><cell>0.82</cell><cell>0.89</cell><cell>0.82</cell><cell>0.82</cell></row><row><cell>Scheldebuurt</cell><cell>71</cell><cell>0.91</cell><cell>0.93</cell><cell>0.93</cell><cell>0.90</cell><cell>0.88</cell><cell>0.90</cell><cell>0.90</cell><cell>0.87</cell><cell>0.85</cell><cell>0.88</cell><cell>0.88</cell><cell>0.83</cell><cell>0.79</cell><cell>0.83</cell><cell>0.83</cell><cell>0.77</cell></row><row><cell>Middenmeer</cell><cell>66</cell><cell>0.85</cell><cell>0.94</cell><cell>0.90</cell><cell>0.81</cell><cell>0.79</cell><cell>0.91</cell><cell>0.81</cell><cell>0.77</cell><cell>0.73</cell><cell>0.90</cell><cell>0.80</cell><cell>0.70</cell><cell>0.68</cell><cell>0.83</cell><cell>0.69</cell><cell>0.69</cell></row><row><cell>IJburg West</cell><cell>63</cell><cell>0.73</cell><cell>0.96</cell><cell>0.81</cell><cell>0.68</cell><cell>0.63</cell><cell>0.95</cell><cell>0.66</cell><cell>0.61</cell><cell>0.57</cell><cell>0.79</cell><cell>0.54</cell><cell>0.73</cell><cell>0.61</cell><cell>0.92</cell><cell>0.60</cell><cell>0.62</cell></row><row><cell>Westelijk Havengebied</cell><cell>63</cell><cell>0.91</cell><cell>0.89</cell><cell>0.94</cell><cell>0.90</cell><cell>0.89</cell><cell>0.87</cell><cell>0.93</cell><cell>0.88</cell><cell>0.90</cell><cell>0.89</cell><cell>0.92</cell><cell>0.90</cell><cell>0.87</cell><cell>0.85</cell><cell>0.89</cell><cell>0.86</cell></row><row><cell>Landlust</cell><cell>59</cell><cell>0.94</cell><cell>0.94</cell><cell>0.93</cell><cell>0.94</cell><cell>0.93</cell><cell>0.93</cell><cell>0.92</cell><cell>0.93</cell><cell>0.92</cell><cell>0.92</cell><cell>0.91</cell><cell>0.92</cell><cell>0.87</cell><cell>0.87</cell><cell>0.87</cell><cell>0.87</cell></row><row><cell>Nieuwmarkt/Lastage</cell><cell>58</cell><cell>0.79</cell><cell>0.94</cell><cell>0.88</cell><cell>0.73</cell><cell>0.69</cell><cell>0.92</cell><cell>0.78</cell><cell>0.65</cell><cell>0.62</cell><cell>0.91</cell><cell>0.65</cell><cell>0.60</cell><cell>0.55</cell><cell>0.88</cell><cell>0.57</cell><cell>0.53</cell></row><row><cell>Average (𝜇 ± 𝜎)</cell><cell></cell><cell>0.88 ± 0.07</cell><cell>0.93 ± 0.02</cell><cell>0.91 ± 0.04</cell><cell>0.86 ± 0.09</cell><cell>0.84 ± 0.10</cell><cell>0.91 ± 0.02</cell><cell>0.86 ± 0.09</cell><cell>0.83 ± 0.11</cell><cell>0.81 ± 0.12</cell><cell>0.88 ± 0.03</cell><cell>0.82 ± 0.13</cell><cell>0.81 ± 0.10</cell><cell>0.76 ± 0.11</cell><cell>0.86 ± 0.04</cell><cell>0.77 ± 0.11</cell><cell>0.76 ± 0.11</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The inclusion of neighboring districts is justified by the limited spatial range and temporal duration of trips usually made by e-bikes and escooters. Due to their physical constraints, micro-mobility vehicles are unlikely to reach distant districts within short timeframes, making the consideration of neighboring districts only being a sufficient choice for effective spatio-temporal modeling.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://github.com/DataStories-UniPi/Shared-Mobility.git</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://api.deelfietsdashboard.nl/dashboard-api/public/vehicles_in_ public_space</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work was supported in part by the Horizon Framework Programme of the European Union under grant agreement No. 101093051 (EMERALDS; https://www.emeraldshorizon.eu/).</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Declaration on Generative AI</head><p>During the preparation of this work, the authors used Large Language Models in order to paraphrase and reword. After using this tool/service, the authors reviewed and edited the content as needed and take full responsibility for the publication's content.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Mobility as a Service</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">N</forename><surname>Mladenović</surname></persName>
		</author>
		<idno type="DOI">10.1016/B978-0-08-102671-7.10607-4</idno>
	</analytic>
	<monogr>
		<title level="m">International Encyclopedia of Transportation</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Vickerman</surname></persName>
		</editor>
		<imprint>
			<publisher>Elsevier</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="12" to="18" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A spatio-temporal LSTM model to forecast across multiple temporal and spatial scales</title>
		<author>
			<persName><forename type="first">F</forename><surname>O'donncha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Hu</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.ecoinf.2022.101687</idno>
	</analytic>
	<monogr>
		<title level="j">Ecological Informatics</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="page">101687</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting</title>
		<author>
			<persName><forename type="first">B</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Yin</surname></persName>
		</author>
		<idno type="DOI">10.24963/ijcai.2018/505</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th International Joint Conference on Artificial Intelligence</title>
				<meeting>the 27th International Joint Conference on Artificial Intelligence</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="3634" to="3640" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">GMAN: A Graph Multi-Attention Network for Traffic Prediction</title>
		<author>
			<persName><forename type="first">C</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Fan</surname></persName>
		</author>
		<idno type="DOI">10.1609/aaai.v34i01.5477</idno>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the AAAI Conference on Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="page" from="1234" to="1241" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">A Survey on Diffusion Models for Time Series and Spatio-Temporal Data</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jin</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2404.18886</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Yu</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1707.01926</idno>
		<idno type="arXiv">arXiv:1707.01926</idno>
		<title level="m">Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Pelekis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Theodoridis</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-1-4939-0392-4</idno>
		<title level="m">Mobility Data Management and Exploration</title>
				<meeting><address><addrLine>New York, NY</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Taxi demand forecasting based on the temporal multimodal information fusion graph neural network</title>
		<author>
			<persName><forename type="first">W</forename><surname>Liao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zeng</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10489-021-03128-1</idno>
	</analytic>
	<monogr>
		<title level="j">Applied Intelligence</title>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="page" from="12077" to="12090" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Spatiotemporal Multi-Graph Convolution Network for Ride-Hailing Demand Forecasting</title>
		<author>
			<persName><forename type="first">X</forename><surname>Geng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<idno type="DOI">10.1609/aaai.v33i01.33013656</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Artificial Intelligence</title>
				<meeting>the AAAI Conference on Artificial Intelligence</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="3656" to="3663" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A Comprehensive Survey on Graph Neural Networks</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pan</surname></persName>
		</author>
		<idno type="DOI">10.1109/TNNLS.2020.2978386</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Neural Networks and Learning Systems</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="page" from="4" to="24" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Why do tree-based models still outperform deep learning on typical tabular data?</title>
		<author>
			<persName><forename type="first">L</forename><surname>Grinsztajn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Oyallon</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2207.08815</idno>
		<idno>.2207.08815</idno>
		<ptr target="https://doi.org/10.48550/arXiv" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 36th International Conference on Neural Information Processing Systems</title>
				<meeting>the 36th International Conference on Neural Information Processing Systems</meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="507" to="520" />
		</imprint>
	</monogr>
	<note>NIPS &apos;22)</note>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Predicting the Dynamic Demand of Bike-Sharing System in Chicago with Divvy Operation Data: A Data-Driven approach for bike-sharing demand forecasting</title>
		<author>
			<persName><forename type="first">H</forename><surname>Feng</surname></persName>
		</author>
		<idno type="DOI">10.1145/3466029.3466035</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th International Conference on E-Commerce, E-Business and E-Government</title>
				<meeting>the 5th International Conference on E-Commerce, E-Business and E-Government</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="30" to="34" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Short-Term Demand Forecasting of Urban Online Car-Hailing Based on the K-Nearest Neighbor Model</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Xiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Kong</surname></persName>
		</author>
		<idno type="DOI">10.3390/s22239456</idno>
	</analytic>
	<monogr>
		<title level="j">Sensors</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Dynamic bike sharing traffic prediction using spatiotemporal pattern detection</title>
		<author>
			<persName><forename type="first">S</forename><surname>Sohrabi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ermagun</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.trd.2020.102647</idno>
	</analytic>
	<monogr>
		<title level="j">Transportation Research Part D: Transport and Environment</title>
		<imprint>
			<biblScope unit="volume">90</biblScope>
			<biblScope unit="page">102647</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Predicting bicycling and walking traffic using street view imagery and destination data</title>
		<author>
			<persName><forename type="first">S</forename><surname>Hankey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhang</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.trd.2020.102651</idno>
	</analytic>
	<monogr>
		<title level="j">Transportation Research Part D-transport and Environment</title>
		<imprint>
			<biblScope unit="volume">90</biblScope>
			<biblScope unit="page">102651</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Correlates of the Built Environment and Active Travel: Evidence from 20 US Metropolitan Areas</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">T K</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Buehler</surname></persName>
		</author>
		<idno type="DOI">10.1289/EHP3389</idno>
	</analytic>
	<monogr>
		<title level="j">Environmental Health Perspectives</title>
		<imprint>
			<biblScope unit="volume">126</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Self-Attention ConvLSTM for Spatiotemporal Forecasting of Short-Term Online Car-Hailing Demand</title>
		<author>
			<persName><forename type="first">H</forename><surname>Ge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Li</surname></persName>
		</author>
		<idno type="DOI">10.3390/su14127371</idno>
	</analytic>
	<monogr>
		<title level="j">Sustainability</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Spatial-Temporal Diffusion Convolutional Network: A Novel Framework for Taxi Demand Forecasting</title>
		<author>
			<persName><forename type="first">A</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Shangguan</surname></persName>
		</author>
		<idno type="DOI">10.3390/ijgi11030193</idno>
	</analytic>
	<monogr>
		<title level="j">ISPRS Int. J. Geo Inf</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">193</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Traffic prediction in a bikesharing system</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zheng</surname></persName>
		</author>
		<idno type="DOI">10.1145/2820783.2820837</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems</title>
				<meeting>the 23rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">15</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Demand Forecasting of Online Car-Hailing With Stacking Ensemble Learning Approach and Large-Scale Datasets</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ye</surname></persName>
		</author>
		<idno type="DOI">10.1109/ACCESS.2020.3034355</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="199513" to="199522" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Mobility Data Science: Perspectives and Challenges</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mokbel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sakr</surname></persName>
		</author>
		<idno type="DOI">10.1145/3652158</idno>
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Spatial Algorithms Syst</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
