<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ilya Ilyankou</string-name>
          <email>ilya.ilyankou.23@ucl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aldo Lipani</string-name>
          <email>aldo.lipani@ucl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefano Cavazzi</string-name>
          <email>stefano.cavazzi@os.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaowei Gao</string-name>
          <email>xiaowei.gao.20@ucl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James Haworth</string-name>
          <email>j.haworth@ucl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ordnance Survey</institution>
          ,
          <addr-line>Southampton</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SpaceTimeLab, University College London</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sentence transformers [1] are language models designed to perform semantic search. This study investigates the capacity of sentence transformers, fine-tuned on general question-answering datasets for asymmetric semantic search, to associate descriptions of human-generated routes across Great Britain with queries often used to describe hiking experiences. We find that sentence transformers have some zero-shot capabilities to understand quasi-geospatial concepts, such as route types and dificulty, suggesting their potential utility for routing recommendation systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic search</kwd>
        <kwd>sentence transformers</kwd>
        <kwd>language models</kwd>
        <kwd>hiking</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        Semantic search is diferent from traditional, keyword search in that it tries to capture the intent
of the searcher [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. For a good semantic search system, the sentence ‘The fox sat on the mat’
should be similar to ‘An animal rested upon the rug’, but diferent from
‘Fox News on Sat’.
      </p>
      <p>
        SentenceTransformers [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a Python library that contains a collection of predominantly
BERT-family [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] transformer-based neural network models that are fine-tuned for semantic
search. This study investigates the extent to which asymmetric semantic search models, designed
for shorter queries and longer documents [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], fine-tuned on general (non-geospatial)
questionanswering datasets, can understand vague, subjective, and complex quasi-geospatial concepts.
For example, do such models consistently associate a query ‘a walk with a variety of landscapes’
with documents describing longer walks going through various terrains over the documents
describing shorter, solely urban or rural walks?
      </p>
      <p>
        This topic is important because when searching for (hiking) activities, people tend to describe
their physical abilities or desired experiences [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ] over using precise geospatial terms.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. Methodology</title>
      <p>
        In our experiment, we take existing user-generated routes across Great Britain, add geospatial
context to generate their textual descriptions, use Sentence Transformers to create vector
embeddings for each description, and compare these vectors with embeddings of user queries.
2.1. Data
We use user-generated hiking routes from the Ordnance Survey’s OS Maps app—specifically, a
subset of 501,294 routes classified by Ballatore et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. We remove very short (under 1 km)
and very long (over 50 km) routes to focus on those that would typically be of interest to leisure
hikers, and can be completed in a day. After further removing routes with obvious GPS signal
issues, we have 496,723 routes whose average length is 11,289 metres. We then assign a set of
attributes to each route using geopandas [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and several Ordnance Survey datasets. A full list
of attributes is shown in Table 1 in Appendix.
      </p>
      <sec id="sec-3-1">
        <title>2.2. Generating descriptions</title>
        <p>
          We use simple template language to generate 3-4 sentence descriptions for each route based
on route attributes. Descriptions mention length, shape, start and end points, total elevation
gain and steepness. We explicitly state the walk is ‘predominantly uphill’ or ‘predominantly
downhill’ where the total elevation gain exceeds (or is less than) the total elevation loss by at
least 100 metres; this categorisation applies to approximately 8% of all routes. The descriptions
also include the types of areas the routes go near or through, including coast, surface water,
woodland, green space, urban areas, and national parks. For each area type, we mention for
how long in percentages, swapping numbers (e.g., ‘60 percent’) for words (e.g., ‘sixty percent’ or
‘most’) roughly half the time. We do this because language models are known to under-perform
when required to work with numbers [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>The resulting descriptions are between 112 and 589 characters long, with mean and median
of 299 and 296 characters respectively. The longest description reads:
‘This is a twenty-five km walk that begins in Yelverton, West Devon, Devon and
ends in Plymouth, City of Plymouth. Total elevation gain is seven hundred and
thirty-nine metres, and elevation grade is 2.9. The walk is predominantly downhill.
About 25 percent of the walk is within a national park, about thirty percent of
the walk is in a wooded area, about thirty-three percent of the walk goes through
an urban area, about 12 percent of the walk is within green space, about twenty
percent of the walk is along the coast, about forty-three percent of the walk is
alongside a body of water.’</p>
        <p>Other examples of descriptions of various lengths are shown in Table 2 in Appendix.</p>
      </sec>
      <sec id="sec-3-2">
        <title>2.3. Matching queries with descriptions</title>
        <p>
          We calculate vector representations, or embeddings, of all textual descriptions and
user queries using msmarco-{MiniLM-L6|distilbert}-cos-v5 and
multi-qa-{MiniLML6|distilbert|mpnet-base}-cos-v1 models. These are based on MiniLM [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], DistilBERT
[12], and MPNet [13] architectures, and fine-tuned on MS MARCO [ 14] (about 500k records)
and/or a compilation of question-answering datasets which we refer to here as Multi-QA [15]
(about 215M records). Neither collection is specific to the geospatial domain. Models tuned
on MS MARCO support input sequences of up to 384 tokens (word pieces), and those tuned
on Multi-QA support up to 512 tokens; our inputs fit comfortably within both limits, with the
longest description of 589 characters represented by 129 tokens.
        </p>
        <p>
          We use 20 queries (see Table 3 in Appendix) that resemble questions (e.g., ‘what is a walk
for an expert hiker’), and calculate cosine similarity between all queries and route descriptions
to rank the relevance of each description for each query. The queries are inspired by various
research papers that studied hiking experiences [
          <xref ref-type="bibr" rid="ref5 ref6 ref7 ref8">5, 7, 6, 16, 8, 17</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>2.4. Visualising results</title>
        <p>Our experiment is dificult to assess using standard information retrieval quality metrics such
as mean reciprocal rank (MRR) or mean average precision (MAP) [18] given we cannot easily
label an individual route description (document) as relevant to the query or not. Instead, we are
interested in the overall patterns of how the documents are ranked.</p>
        <p>We decided to plot cumulative means of relevant route attributes for ranked documents,
sorted from best to worst match (one can think of it as plotting average@k of relevant route
attribute means for all k between 1 and 496,723). As such, the y-value of the left-most point of
each line chart represents the attribute value of the top-matching document, while the y-value
of the right-most point represents the mean attribute value for the whole dataset. An increasing
cumulative mean (i.e., a line going up) signifies higher-ranking documents (on the left of the
x-axis) typically having lower values than lower-ranking documents (to the right), and vice versa.
We utilise a logarithmic scale for the x-axis to highlight the cumulative means of top-ranking
documents, while also presenting the overall trend for all documents.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Results</title>
      <p>The results are mixed. Unsurprisingly, sentence transformers do better when user queries have
similar terminology to the documents. All five models are able to associate a ‘seaside walk’
with routes described as having longer stretches along the coast. Four models clearly associate
‘a walk for someone who enjoys town walks’ with routes going through urban areas.</p>
      <p>Figure 1 shows cumulative mean length, grade, and elevation gain for more complex queries
targeting easier walks produced by multi-qa-mpnet-base-cos-v1. The queries mentioning a
‘beginner hiker’ and a ‘person with limited mobility’ are indeed associated with shorter and
lfatter routes; the results are less clear for the query mentioning an ‘elderly person’.</p>
      <p>Conversely, Figure 2 shows results produced by the same model for queries aimed at more
challenging hiking experiences. While the top-10 or so results for an ‘expert hiker’ are indeed
longer walks, both the slope and total elevation gain patterns are not convincing. A ‘sporty
person’ will receive similarly disappointing suggestions. But ‘someone who likes climbing
uphill’ will be pleasantly surprised, given both the grade and total elevation gain are much
higher for best matches.</p>
      <p>Peculiarly, even when fine-tuned on the same Multi-QA dataset, MiniLM, DistilBERT, and
MPNet models are in total disagreement over the walks that can be completed in under an hour
(Figure 3). We generally expect these to be routes of under 5 km [19]. While MiniLM shows
a logical pattern of top-matching routes being shorter, DistilBERT ranks the results in a near
reverse order; cumulative mean length of routes ranked by MPNet seems to hover around the
dataset mean.</p>
      <p>None of the models are good at associating ‘long’ and ‘very long’ walks with higher kilometre
values. Regrettably, most models associate ‘someone seeking greater challenges’ with
descriptions of shorter and flatter walks, and queries for people ‘preferring wilderness to man-made’
barely relate to walks going through national parks. Full results are available in the Appendix.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusion</title>
      <p>Sentence transformers, fine-tuned on a corpus of general question-answer pairs for asymmetric
semantic search, demonstrate some zero-shot ability to associate short and subjective queries
looking for particular hiking experiences with synthetically composed route descriptions. One
tested model was able to relate ‘beginner hikers’ and those with ‘limited mobility’ to shorter
and flatter walks; another was able to associate ‘walks that can be completed in under an hour’
with shorter walks. Models fine-tuned on the same dataset sometimes showed very diferent
results, signalling that architectures and pre-training matter.</p>
      <p>In future work, a more systematic approach to evaluate sentence transformers and other
language models for geospatial understanding should be introduced. We suggest focusing on
four aspects: model architecture, datasets for fine-tuning, geospatial descriptions, and evaluation.
Firstly, a wider array of sentence transformers should be tested to identify which architectures,
BERT-based and beyond, achieve better results. Secondly, existing general question-answering
datasets used for fine-tuning should be evaluated both independently and in combination to
see how dataset size, theme, and quality afect learning. Although the user queries we tested
are not ‘traditionally’ geospatial, will using smaller datasets of primarily geographic questions
make sentence transformers better understand hiking and other active living experiences?
Thirdly, we recognise that using a generic template to describe routes is just one of many
ways to represent geospatial data as text. As such, we suggest exploring more sophisticated
approaches of generating descriptions for routes and other geospatial objects (e.g., specific
locations represented as points, and general areas represented as polygons), given that various
sources (people, websites) describe such objects diferently. And lastly, a more formal way of
evaluating ranked results should be explored, accounting for the fact that user queries tested
here are more subjective and incomplete than those used in typical information retrieval tasks.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was supported by the Ordnance Survey, and the Engineering and Physical Sciences
Research Council [grant no. EP/Y528651/1].
[12] V. Sanh, L. Debut, J. Chaumond, T. Wolf, DistilBERT, a distilled version of BERT: smaller,
faster, cheaper and lighter, 2020. URL: http://arxiv.org/abs/1910.01108. doi:10.48550/
arXiv.1910.01108, arXiv:1910.01108 [cs].
[13] K. Song, X. Tan, T. Qin, J. Lu, T.-Y. Liu, MPNet: Masked and Permuted Pre-training for</p>
      <p>Language Understanding (2020).
[14] P. Bajaj, D. Campos, N. Craswell, L. Deng, J. Gao, X. Liu, R. Majumder, A. McNamara,
B. Mitra, T. Nguyen, M. Rosenberg, X. Song, A. Stoica, S. Tiwary, T. Wang, MS MARCO:
A Human Generated MAchine Reading COmprehension Dataset, 2018. URL: http://arxiv.
org/abs/1611.09268. doi:10.48550/arXiv.1611.09268, arXiv:1611.09268 [cs].
[15] sentence-transformers/multi-qa-MiniLM-L6-cos-v1 · Hugging Face, 2024. URL: https://
huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1.
[16] L. T. Sarjakoski, P. Kettunen, H.-M. Flink, M. Laakso, M. Rönneberg, T. Sarjakoski,
Analysis of verbal route descriptions and landmarks for hiking, Personal and
Ubiquitous Computing 16 (2012) 1001–1011. URL: https://doi.org/10.1007/s00779-011-0460-7.
doi:10.1007/s00779-011-0460-7.
[17] J.-P. Calbimonte, S. Martin, D. Calvaresi, N. Zappelaz, A. Cotting, Semantic Data Models
for Hiking Trail Dificulty Assessment, in: J. Neidhardt, W. Wörndl (Eds.), Information and
Communication Technologies in Tourism 2020, Springer International Publishing, Cham,
2020, pp. 295–306. doi:10.1007/978-3-030-36737-4_24.
[18] A. Bellogín, P. Castells, I. Cantador, Statistical biases in Information Retrieval metrics
for recommender systems, Information Retrieval Journal 20 (2017) 606–634. URL: https:
//doi.org/10.1007/s10791-017-9312-z. doi:10.1007/s10791-017-9312-z.
[19] S. M. Club, Scottish Mountaineering Club Journal, Scottish Mountaineering Club., 1893.</p>
    </sec>
    <sec id="sec-7">
      <title>A. Tables</title>
      <p>Route length in metres
Total elevation gain (ascent)
Total elevation loss (descent)
Hiking grade, calculated as (total_gain ÷ length_m × 100)
True if route start and end points are within 500m of each other
True if route is circular, and its 25m bufer has a high overlap with itself
Name of the nearest place to the route’s start point, limited to 1km
Name of the nearest place to the route’s end point, limited to 1 km
Percent of route length that falls within 50m of a body of water
Percent of route length that lies within 150m of GB boundary
True if at least 50% of the walk length is along the coast
Percent of route length that falls within a national park
Percent of route length that falls within a green space boundary
Percent of route length that falls within a woodland boundary
Percent of route length that falls within an urban area boundary
This is a circular, ten km walk that begins and ends in Priddy, Somerset. Total elevation gain is 130 metres,
and elevation grade is 1.3.</p>
      <p>This is a circular, 12 km walk that begins and ends in Tarrant Gunville, Dorset. Total elevation gain is one
hundred and ninety-one metres, and elevation grade is 1.6.</p>
      <p>This is a 2 km coastal walk that begins in Glenuig Bay, Highland and ends in Smirisary, Highland. Total
elevation gain is 142 metres, and elevation grade is 4.9. About seven percent of the walk is in a wooded
area, about 62 percent of the walk is along the coast.</p>
      <p>This is a circular, twenty-five km walk that begins and ends in Milton, Stirling. Total elevation gain is 1846
metres, and elevation grade is 7.4. The walk is entirely within a national park, about twenty percent of the
walk is in a wooded area, about ten percent of the walk is alongside a body of water.
This is a 6 km walk that begins in Pebbly Hill, Cotswold, Gloucestershire and ends in Stow-on-the-Wold,
Cotswold, Gloucestershire. Total elevation gain is 215 metres, and elevation grade is 3.5. The walk is
predominantly uphill. About seven percent of the walk is in a wooded area, about 8 percent of the walk
goes through an urban area.</p>
      <p>This is a 22 km walk that begins in Rampart Head, Cumberland and ends in Little Caldew, Cumberland.
Total elevation gain is 222 metres, and elevation grade is 1.0. About 6 percent of the walk is in a wooded
area, about 9 percent of the walk goes through an urban area, about 7 percent of the walk is within green
space, about 18 percent of the walk is along the coast, about twenty-seven percent of the walk is alongside
a body of water.</p>
      <p>This is a nineteen km walk that begins in Millbrook, Caerfili - Caerphilly and ends in Ynysfro Reservoirs,
Casnewydd - Newport. Total elevation gain is seven hundred and twenty-four metres, and elevation grade
is 3.7. The walk is predominantly downhill. About seventeen percent of the walk is in a wooded area,
about forty-five percent of the walk goes through an urban area, about eight percent of the walk is within
green space, about 17 percent of the walk is alongside a body of water.</p>
      <p>This is a 7 km coastal walk that begins in Rhoose Cardif International Airport, Rhoose / Y Rhws, Bro
Morgannwg - the Vale of Glamorgan and ends in Storehouse Point, Bro Morgannwg - the Vale of Glamorgan.
Total elevation gain is 194 metres, and elevation grade is 2.7. About thirteen percent of the walk is in a
wooded area, about 19 percent of the walk goes through an urban area, about 17 percent of the walk is
within green space, 77 percent of the walk is along the coast, about twelve percent of the walk is alongside
a body of water.</p>
      <p>Length
138
168
263
305
337
437
490
539</p>
    </sec>
    <sec id="sec-8">
      <title>B. Full results</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          , Sentence-BERT:
          <article-title>Sentence Embeddings using Siamese BERTNetworks,</article-title>
          <year>2019</year>
          . URL: http://arxiv.org/abs/
          <year>1908</year>
          .10084, arXiv:
          <year>1908</year>
          .10084 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>W.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. M.</given-names>
            <surname>Barnaghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bargiela</surname>
          </string-name>
          ,
          <article-title>Search with Meanings: An Overview of Semantic Search Systems (</article-title>
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of Deep Bidirectional Transformers for Language Understanding</article-title>
          ,
          <year>2019</year>
          . URL: http://arxiv.org/abs/
          <year>1810</year>
          .04805, arXiv:
          <year>1810</year>
          .04805 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , W. Wu,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>Question Retrieval with High Quality Answers in Community Question Answering</article-title>
          ,
          <source>in: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management</source>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , Shanghai China,
          <year>2014</year>
          , pp.
          <fpage>371</fpage>
          -
          <lpage>380</lpage>
          . URL: https://dl.acm.org/doi/10.1145/2661829.2661908. doi:
          <volume>10</volume>
          . 1145/2661829.2661908.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Davies</surname>
          </string-name>
          ,
          <article-title>Who walks, where and why? Practitioners' observations and perspectives on recreational walkers at UK tourist destinations</article-title>
          ,
          <source>Annals of Leisure Research</source>
          <volume>21</volume>
          (
          <year>2018</year>
          )
          <fpage>553</fpage>
          -
          <lpage>574</lpage>
          . URL: https://doi.org/10.1080/11745398.
          <year>2016</year>
          .
          <volume>1250648</volume>
          . doi:
          <volume>10</volume>
          .1080/11745398.
          <year>2016</year>
          .
          <volume>1250648</volume>
          , publisher: Routledge _eprint: https://- doi.org/10.1080/11745398.
          <year>2016</year>
          .
          <volume>1250648</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Molokáč</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hlaváčová</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tometzová</surname>
          </string-name>
          , E. Liptáková,
          <source>The Preference Analysis for Hikers' Choice of Hiking Trail, Sustainability</source>
          <volume>14</volume>
          (
          <year>2022</year>
          )
          <article-title>6795</article-title>
          . URL: https://www.mdpi.com/ 2071-1050/14/11/6795. doi:
          <volume>10</volume>
          .3390/su14116795, number: 11 Publisher: Multidisciplinary Digital Publishing Institute.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R. B.</given-names>
            <surname>Hull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. P.</given-names>
            <surname>Stewart</surname>
          </string-name>
          ,
          <source>The Landscape Encountered and Experienced While Hiking, Environment and Behavior</source>
          <volume>27</volume>
          (
          <year>1995</year>
          )
          <fpage>404</fpage>
          -
          <lpage>426</lpage>
          . URL: https://doi.org/10.1177/0013916595273007. doi:
          <volume>10</volume>
          .1177/0013916595273007, publisher: SAGE Publications Inc.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ballatore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cavazzi</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Morley,</surname>
          </string-name>
          <article-title>The context of outdoor walking: A classification of user-generated routes</article-title>
          ,
          <source>The Geographical Journal</source>
          <volume>189</volume>
          (
          <year>2023</year>
          )
          <fpage>485</fpage>
          -
          <lpage>500</lpage>
          . URL: https: //onlinelibrary.wiley.com/doi/abs/10.1111/geoj.12511. doi:
          <volume>10</volume>
          .1111/geoj.12511, _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/geoj.12511.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>[9] GeoPandas 0</article-title>
          .dev+untagged - GeoPandas 0+untagged.
          <volume>50</volume>
          .g5558c35.
          <source>dirty documentation</source>
          ,
          <year>2024</year>
          . URL: https://geopandas.org/en/stable/index.html.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Petrak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. S.</given-names>
            <surname>Moosavi</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          ,
          <source>Arithmetic-Based Pretraining - Improving Numeracy of Pretrained Language Models</source>
          ,
          <year>2023</year>
          . URL: http://arxiv.org/abs/2205.06733, arXiv:
          <fpage>2205</fpage>
          .06733 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Zhou, MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers</article-title>
          ,
          <year>2020</year>
          . URL: http: //arxiv.org/abs/
          <year>2002</year>
          .10957. doi:
          <volume>10</volume>
          .48550/arXiv.
          <year>2002</year>
          .
          <volume>10957</volume>
          , arXiv:
          <year>2002</year>
          .10957 [cs].
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>