<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Using Graph Databases for Historical Language Data: Challenges and Opportunities</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Barbara</forename><surname>Mcgillivray</surname></persName>
							<email>barbara.mcgillivray@kcl.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="institution">King&apos;s College London</orgName>
								<address>
									<addrLine>Strand Campus, Strand</addrLine>
									<postCode>WC2R 2LS</postCode>
									<settlement>London</settlement>
									<country key="GB">United Kingdom</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pierluigi</forename><surname>Cassotti</surname></persName>
							<email>pierluigi.cassotti@uniba.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<addrLine>Via E. Orabona 4</addrLine>
									<postCode>70125</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pierpaolo</forename><surname>Basile</surname></persName>
							<email>pierpaolo.basile@uniba.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<addrLine>Via E. Orabona 4</addrLine>
									<postCode>70125</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Davide</forename><forename type="middle">Di</forename><surname>Pierro</surname></persName>
							<email>davide.dipierro@uniba.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<addrLine>Via E. Orabona 4</addrLine>
									<postCode>70125</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stefano</forename><surname>Ferilli</surname></persName>
							<email>stefano.ferilli@uniba.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<addrLine>Via E. Orabona 4</addrLine>
									<postCode>70125</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">IRCDL (The Conference on Information and Research science Connecting to Digital and Library science)</orgName>
								<orgName type="laboratory">19th</orgName>
								<address>
									<addrLine>February 23-24</addrLine>
									<postCode>2023</postCode>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Using Graph Databases for Historical Language Data: Challenges and Opportunities</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">52FE1F53506B8C846A195E854070FA8A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-04-29T06:30+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Knowledge Graphs, Latin Corpora, Semantic Change Detection, Graph Data Model (S. Ferilli) 0000-0003-3426-8200 (B. McGillivray)</term>
					<term>0000-0001-7824-7167 (P. Cassotti)</term>
					<term>0000-0002-0545-1105 (P. Basile)</term>
					<term>0000-0002-8081-3292 (D. Di Pierro)</term>
					<term>0000-0003-1118-0601 (S. Ferilli)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The integration of semantic information into language resources has the potential to open up new avenues of enquiry into the mechanisms of language change. We present the first experiments in integrating data from Latin textual corpora and language resources into a graph database via the GraphBRAIN Schema and show the potential of this model for research into the mechanisms of semantic change in Latin.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Research in Historical Linguistics often requires the analysis and support of heterogeneous data and tools, such as lexical resources, encyclopaedias, and large corpora. Nevertheless, these resources are often siloed. Graph Databases present an ideal opportunity to combine the advantages of DataBase Management Systems (DBMSs) for handling individuals (scalability, storage optimization, efficient handling, mining and browsing of the data, etc.) with the highlevel functionalities available in Knowledge Bases (KBs). Graph DBMS are intrinsically designed to store schemaless data. Differently from traditional DBMSs like the relational <ref type="bibr" target="#b0">[1]</ref> or objectoriented <ref type="bibr" target="#b1">[2]</ref> ones, they lack predefined structures. Following this approach, Neo4j 1 , one of the most common graph DBMSs, does not provide support for introducing ontology definitions based on labels and/or arcs. The absence of a schema may lead to ambiguity when reading and managing data in downstream applications due to the inherent ambiguity of the words used for expressing concepts. Hence, the semantics becomes blurred.</p><p>To address these issues, we propose the use of GraphBRAIN <ref type="bibr" target="#b2">[3]</ref> as a solution. GraphBRAIN consists of a graph database which follows the Labelled Property Graph (LPG) <ref type="bibr" target="#b3">[4]</ref> structure. This structure stores nodes with specific labels, arcs which represent relationships among nodes and properties on both nodes and arcs. Properties are stored in the format of key/value pairs. GraphBRAIN requires KB designers to define a data schema which operates also as an ontology. GraphBRAIN provides a mapping mechanism for exporting schemes into an SWcompliant language, the Web Ontology Language (OWL). These schemes guide access through all the CRUD operations on the database but also ensure interpretability and interoperability among different applications. Following the schemes, applications become compliant with each other. Neo4j, in its Enterprise Edition, does not provide any constraint definition process. In other versions, it supports a few constraints like unique node property constraints, node property existence constraints, relationship property existence constraints and node key constraints. Evidently, these tools are not as expressive as ontology definitions.</p><p>In <ref type="bibr" target="#b4">[5]</ref>, we adopted GraphBRAIN technology to model time-sensitive linguistic knowledge in a graph database, describing a time-sensitive model of linguistic knowledge that can be used for graph databases. In this paper, we show an application of this model to the lexical semantic analysis of Latin data, i.e. the analysis of the meanings of Latin words. Differently from previous approaches, such as Basile et al. <ref type="bibr" target="#b5">[6]</ref>, Hamilton et al. <ref type="bibr" target="#b6">[7]</ref>, and Carlo et al. <ref type="bibr" target="#b7">[8]</ref>, we exploit graph database potentialities to detect semantic changes in specific concepts.</p><p>Latin is in a particularly favourable position among historical languages for the large-scale analysis of semantic change processes, thanks to a number of factors. First, Latin researchers now enjoy unprecedented access to digital data covering over two thousand years of history. Thanks to the ERC-funded LiLa project<ref type="foot" target="#foot_0">2</ref> , seven Latin language resources and six corpora have been linked at the level of word lemmas so far, making Latin a unique case among historical languages. Second, we have access to extensive computational language resources for Latin, Latin WordNet <ref type="bibr" target="#b8">[9]</ref>, and digitised dictionaries of Latin, which provide rich information about words' semantics and examples of usage. Finally, focussing on Latin allows us to investigate semantic change processes over long time spans. Latin has one of the longest recorded histories of any human language, making it naturally suitable for quantitative studies <ref type="bibr" target="#b9">[10]</ref>. The first inscriptional records date from the sixth century BCE, and Latin continues to be used to the current day by the Catholic Church and some academic and legal institutions around the world. Written Latin diverged from the spoken vernaculars in the second half of the first millennium of the Christian era, but it remained in use as one of the principal channels of communication across most of Europe for the next thousand years. The humanists' conscious effort to reproduce Classical Latin led to a range of interesting developments, particularly affecting the neo-Latin lexicon to enable the expression of new concepts <ref type="bibr" target="#b10">[11]</ref>. This extensive chronological span has raised the question of the extent to which Latin is seen as a dead or fossilised language (e.g. Herman <ref type="bibr" target="#b11">[12]</ref>, Butterfield <ref type="bibr" target="#b12">[13]</ref>). However, it remains an open question to what extent this fossilisation affected the semantics of words, as we know that the Latin lexicon, in this respect, has remained dynamic (over 4,500 words have acquired new meanings since the Renaissance; Demo 2022). The extent to which post-classical Latin can really be considered as a "fixed" language <ref type="bibr">(Leonhardt [14]</ref>, Roelli <ref type="bibr" target="#b14">[15]</ref>, Langslow <ref type="bibr" target="#b15">[16]</ref>) from the point of view of its ability to generate new meanings of words is still largely unknown beyond anecdotal evidence.</p><p>In Section 2 we present the Linguistic Knowledge Graph, in Section 3 we describe the Latin data that we worked on, and in Section 4 we show how we loaded the Latin data into the Linguistic Knowledge Graph. Finally, in Section 5 we draw some conclusions and outline future directions of work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The Linguistic Knowledge Graph</head><p>The Linguistic Knowledge Graph (LKG) aims to capture different aspects of lexical resources, such as relations between words and concepts, morphological, and syntactical information. Moreover, LKG covers diachronic aspects of language, such as the date of publication of a document, and the birth and death of an author. The schema we designed takes inspiration from the ontological lexicon model LeMON <ref type="bibr" target="#b16">[17]</ref>. For space constraints, we report in Table <ref type="table" target="#tab_0">1</ref> node types and in Table <ref type="table" target="#tab_1">2</ref> the relationships adopted for diachronic analysis. The lexical unit is represented as node of type InflectedWord or Lemma, which are subclass of Word, i.e. Lemma IS_A Word and InflectedWord IS_A Word. The Lemma can be a multi-word expression (mwe), in this case, the flag mwe is set to True. The respective lemma of an InflectedWord can be retrieved exploiting the relationship HAS_LEMMA between InflectedWord and Lemma. The LexiconConcept is used to represent the word's meanings, and each instance of LexiconConcept represents a different meaning. For example, the LexiconConcept can represent the senses reported on a sense inventory, e.g. synsets in WordNet <ref type="bibr" target="#b17">[18]</ref>. The relationship between a word and its meaning is expressed using the relationship HAS_CONCEPT among instances of Word and instance of LexiconConcept. Multiple relationships can be defined over couples of LexiconConcept using the reflexive relationship SEM_RELATION. At the same time, reflexive relationships over the Word instances can be described by the LEX_RELATION relationship. The document structure from which words are extracted can be represented at different levels of granularity: Sentence,Text, Document, and Corpus. In particular, each excerpt can be represented as Text or Sentence, which is a subclass of Text. A Text may belong to (BELONG_TO) a Document and a Document can be part of (BELONG_TO) a Corpus. The occurrences of a word in a particular Text are traced by the relationship HAS_OCCURRENCE among Word and Text. In the case of sense-annotated corpora, such as SemCor, is possible to specify the occurrences of senses using the relationship HAS_EXAMPLE among LexiconConcept and Text. Currently, the LKG takes into account two types of metadata: author and language. The relationship HAS_AUTHOR among nodes of type Text and nodes of type Person determines the author of a Text. The relationship HAS_LANGUAGE among nodes of type Text, Document, Corpus, and Word to nodes of type Language specifies the respective language.</p><p>The time is modelled using two classes of nodes: TimeInterval, and TimePoint, both subclasses of TemporalSpecification. The TimeInterval type is used when the date is not precisely stated, while the TimePoint is used in cases where the date is fixed. The start and end extremes of the TimeInterval nodes can be specified using the respective relationships startTime and endTime.</p><p>In the current version of the LKG, time specification is supported for Person and Text. More specifically, the date of birth and death of authors is specified using the relationship BORN and DIED between Person and TemporalSpecification. The publishing date of a text is specified by the relationship PUBLISHED_IN among Text nodes and TemporalSpecification nodes. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Latin data</head><p>The data we loaded into the graph consists of a portion of the LatinISE corpus <ref type="bibr" target="#b18">[19]</ref> annotated at the level of dictionary senses. LatinISE is a Latin corpus covering the period from the fifth century BCE to the twenty-first century and contains 10 million word tokens, semi-automatically lemmatised and part-of-speech tagged. The metadata fields in LatinISE indicate text identifier, author, title, dates, century, genre, url of source, and optionally book title/number and character names (for plays). The annotated dataset was produced as part of the SemEval shared task on Unsupervised Lexical Semantic Change Detection <ref type="bibr" target="#b19">[20]</ref>. 40 Latin lemmas ("target words") are selected, of which 20 are known to have changed their meaning with the advent of Christianity (for example, beatus, which shifted its meaning from 'fortunate' to 'blessed') and 20 are known to not have changed their meaning between the BCE era and the CE era. For each of the 40 lemmas, 60 sentences are randomly extracted from LatinISE, 30 of them are from texts dated in the BCE era, and 30 from texts dated in the CE era. Each sentence was annotated by at least one expert annotator, according to the DuReL framework <ref type="bibr" target="#b20">[21]</ref>. The annotators were asked to judge the semantic relatedness of an instance of usage of a target word with respect to the list of its dictionary definitions using a four-point scale (Unrelated, Distantly Related, Closely Related, and Identical). The definitions were taken from the Latin portion of the Logeion online dictionary (https://logeion.uchicago.edu/) containing Lewis and Short's Latin-English Lexicon (1879) <ref type="bibr" target="#b21">[22]</ref>, Lewis' Elementary Latin Dictionary (1890) <ref type="bibr" target="#b22">[23]</ref>, and Du Fresne Du Cange et al. <ref type="bibr" target="#b23">[24]</ref>. See McGillivray et al. <ref type="bibr" target="#b24">[25]</ref> for further details about the dataset and its annotation framework.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Loading the Latin data in the Linguistic Knowledge Graph</head><p>For each instance of the target words in the Latin corpus we encode:</p><p>• the author as Person,</p><p>• the manuscript as Document,</p><p>• the year as TimePoint if the date is certain, TimeInterval otherwise,</p><p>• the sentence (left context, target word and right context) as Text,</p><p>• the definitions of the Lewis and Short Dictionary as LexiconConcept,</p><p>• the word lemma as Lemma,</p><p>• the inflected forms of the target words as InflectedWord,</p><p>• the scores associated with each LexiconConcept as properties of the HAS_EXAMPLE and HAS_OCCURRENCE relationships.</p><p>In order to simplify and make the visualisation more effective, we created the HAS_EXAMPLE relationship only in cases where the annotation reported a score of 4. In addition, to make more evident the distribution of senses with respect to centuries, we associate each date of publication of the texts with the reference century. We do this via the query given in Listing 1. In case a Text is not associated with a specific TimePoint, it will be linked with the century having the greatest overlap with the TimeInterval of the text itself. On the other hand, for texts for which a precise date is specified, the query associates the Text with the respective century of its year. The centuries are represented as TimeInterval, and the description attribute is validated with "century". A new relationship, called CLUSTER, is so created among nodes of type Text and nodes of type TimeInterval to indicate the century.</p><p>A subgraph for the word beatus is shown in Figure <ref type="figure" target="#fig_0">1</ref>. The graph shows the nodes representing the texts from which the word beatus is extracted, the centuries and the senses given in the Lewis and Short Dictionary. The relationships among these nodes are CLUSTER and HAS_EXAMPLE. The former connects nodes of type TimeInterval and nodes of type Text, see 1. The latter links LexiconConcepts and Texts. Most occurrences of the word beatus in the reference corpus are dated 1st century BCE and 11th century CE. One can immediately notice a difference in the distribution of the senses: "happy" and "fortunate" on the one hand are associated with the time period BCE (see the cluster of nodes on the left of Figure <ref type="figure" target="#fig_0">1</ref>), and "blessed", on the other hand, is associated with the time period CE (see the cluster of nodes on the right of Figure <ref type="figure" target="#fig_0">1</ref>). In fact, only one sentence in the dataset displays the sense "blessed" in the first century BCE. Similarly, only two sentences dated CE contain the word beatus with the meaning of "fortunate", the latter, on the other hand, is dated 1079-1142 CE and is an excerpt from the Sermones of Petrus Abaelardus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>In this work, we introduced an application of LKG for Latin data. It appears to be an interesting and novel approach to tackling the analysis of diachronic corpora. Furthermore, differently from previous approaches, it gives rise to explainable results since we take advantage of explicit relationships modelled as graphs. The LKG seems to lead to promising results, and it is ready forfurther investigations into Lexical Semantic Change Detection (LSCD). Future developments include a better visualization of resources, machine-learning-based techniques for automatic LSCD and an interface for querying and analysing the LKG data.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Graph for the Latin word beatus.</figDesc><graphic coords="6,89.29,84.19,416.68,251.99" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>LKG classes with their respective superclasses and attributes.</figDesc><table><row><cell>Class</cell><cell>Superclass</cell><cell>Attributes</cell></row><row><cell>Word</cell><cell></cell><cell>value:String</cell></row><row><cell></cell><cell></cell><cell>value:String</cell></row><row><cell>Lemma</cell><cell>Word</cell><cell>posTag:String</cell></row><row><cell></cell><cell></cell><cell>mwe:Boolean</cell></row><row><cell>InflectedWord</cell><cell>Word</cell><cell>value:String</cell></row><row><cell>Stem</cell><cell></cell><cell>value:String</cell></row><row><cell>LexiconConcept</cell><cell>Concept</cell><cell>id:String resource:String</cell></row><row><cell>Text</cell><cell></cell><cell>value:String</cell></row><row><cell>Sentence</cell><cell>Text</cell><cell></cell></row><row><cell>Document</cell><cell></cell><cell>title:String</cell></row><row><cell>Corpus</cell><cell></cell><cell>name:String</cell></row><row><cell>TemporalSpecification</cell><cell></cell><cell>name:String description:String</cell></row><row><cell></cell><cell></cell><cell>Year:Integer</cell></row><row><cell>TimePoint</cell><cell>TemporalSpecification</cell><cell>Month:Integer</cell></row><row><cell></cell><cell></cell><cell>day:Integer</cell></row><row><cell>TimeInterval</cell><cell>TemporalSpecification</cell><cell></cell></row><row><cell>Person</cell><cell></cell><cell>name:String lastname:String</cell></row><row><cell></cell><cell></cell><cell>iso639-1:String</cell></row><row><cell>Language</cell><cell></cell><cell>iso639-2:String</cell></row><row><cell></cell><cell></cell><cell>enName:String</cell></row><row><cell>Category</cell><cell></cell><cell>id:String</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>LKG relationships with their respective subject, object and attributes.</figDesc><table><row><cell>Relationship</cell><cell>Subject</cell><cell>Object</cell><cell>Attributes</cell></row><row><cell>IS_A</cell><cell>Sentence Lemma ∪𝐼𝑛𝑓 𝑙𝑒𝑐𝑡𝑒𝑑𝑊 𝑜𝑟𝑑</cell><cell>Text Word</cell><cell>id:Integer id:Integer</cell></row><row><cell></cell><cell>Text</cell><cell>Document</cell><cell>id:Integer</cell></row><row><cell>BELONG_TO</cell><cell>Document</cell><cell>Corpus</cell><cell>id:Integer</cell></row><row><cell></cell><cell>Text</cell><cell>Category</cell><cell></cell></row><row><cell cols="2">HAS_OCCURRENCE Word</cell><cell>Text</cell><cell>begin:Integer end:Integer</cell></row><row><cell>{LEX_RELATION}</cell><cell>Word</cell><cell>Word</cell><cell></cell></row><row><cell>HAS_LEMMA</cell><cell>Word</cell><cell>Lemma</cell><cell></cell></row><row><cell>HAS_CONCEPT</cell><cell>Word</cell><cell>LexiconConcept</cell><cell>grade:Float</cell></row><row><cell>HAS_EXAMPLE</cell><cell>LexiconConcept</cell><cell>Text</cell><cell></cell></row><row><cell>HAS_DEFINITION</cell><cell>LexiconConcept</cell><cell>Text</cell><cell></cell></row><row><cell>REFER_TO</cell><cell>LexiconConcept</cell><cell>Concept</cell><cell></cell></row><row><cell>{SEM_RELATION}</cell><cell>LexiconConcept</cell><cell>LexiconConcept</cell><cell></cell></row><row><cell>PUBLISHED_IN</cell><cell>Text ∪𝐷𝑜𝑐𝑢𝑚𝑒𝑛𝑡 ∪ 𝐶𝑜𝑟𝑝𝑢𝑠</cell><cell>TemporalSpecification</cell><cell></cell></row><row><cell>HAS_AUTHOR</cell><cell>Text ∪𝐷𝑜𝑐𝑢𝑚𝑒𝑛𝑡 ∪ 𝐶𝑜𝑟𝑝𝑢𝑠</cell><cell>Person</cell><cell></cell></row><row><cell>BORN</cell><cell>Person</cell><cell>TemporalSpecification</cell><cell></cell></row><row><cell>DIED</cell><cell>Person</cell><cell>TemporalSpecification</cell><cell></cell></row><row><cell>startTime</cell><cell>TimeInterval</cell><cell>TimePoint</cell><cell></cell></row><row><cell>endTime</cell><cell>TimeInterval</cell><cell>TimePoint</cell><cell></cell></row><row><cell>HAS_LANGUAGE</cell><cell cols="2">Text ∪𝐷𝑜𝑐𝑢𝑚𝑒𝑛𝑡 ∪ 𝐶𝑜𝑟𝑝𝑢𝑠 ∪ 𝑊 𝑜𝑟𝑑 Language</cell><cell></cell></row><row><cell>WITH *,</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">CASE WHEN minEnd-maxStart+1 &gt; 0 THEN minEnd-maxStart+1 ELSE 0 END as time_overlap</cell></row><row><cell>ORDER BY time_overlap DESC</cell><cell></cell><cell></cell><cell></cell></row><row><cell>WITH text,</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">collect({century:centuryNode})[0] AS max</cell><cell></cell><cell></cell></row><row><cell>WITH *,</cell><cell></cell><cell></cell><cell></cell></row><row><cell>max .century as century</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">CREATE (text)-[r:CLUSTER]-&gt;(century)</cell><cell></cell><cell></cell></row><row><cell>RETURN text,century</cell><cell></cell><cell></cell><cell></cell></row><row><cell>UNION ALL</cell><cell></cell><cell></cell><cell></cell></row><row><cell>MATCH</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">(centuryNode:TimeInterval)-[:startTime]-&gt;(startCentury:TimePoint),</cell></row><row><cell cols="3">(centuryNode:TimeInterval)-[:endTime]-&gt;(endCentury:TimePoint),</cell><cell></cell></row><row><cell cols="2">(text:Text)-[:PUBLISHED_IN]-&gt;(point:TimePoint)</cell><cell></cell><cell></cell></row><row><cell>WHERE</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">centuryNode.description="century" and</cell><cell></cell><cell></cell></row><row><cell cols="2">point.Year&gt;=startCentury.Year and</cell><cell></cell><cell></cell></row><row><cell>point.Year&lt;=endCentury.Year</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">WITH text, centuryNode as century</cell><cell></cell><cell></cell></row><row><cell cols="2">CREATE (text)-[r:CLUSTER]-&gt;(century)</cell><cell></cell><cell></cell></row><row><cell>RETURN text, century;</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="4">Listing 1: Clustering publishing date by centuries</cell></row></table><note>MATCH(centuryNode:TimeInterval)-[:startTime]-&gt;(startCentury:TimePoint), (centuryNode:TimeInterval)-[:endTime]-&gt;(endCentury:TimePoint), (pubNode:TimeInterval)-[:startTime]-&gt;(startPub:TimePoint), (pubNode:TimeInterval)-[:endTime]-&gt;(endPub:TimePoint), (text:Text)-[:PUBLISHED_IN]-&gt;(pubNode) WHERE centuryNode.description="century" WITH text, centuryNode, CASE WHEN endPub.Year &gt; endCentury.Year THEN endCentury.Year ELSE endPub.Year END as minEnd, CASE WHEN startPub.Year &gt; startCentury.Year THEN startPub.Year ELSE startCentury.Year END as maxStart</note></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://lila-erc.eu/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgement</head><p>This work fulfils the research objectives of the PNRR project FAIR -Future AI Research, spoke 6 -Symbiotic AI, CUP H97G22000210007, as well as the CHANGES -Cultural Heritage Active Innovation for Next-Gen Sustainable Society, CUP H53C22000860006.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The paradigm of relational indexing: A survey</title>
		<author>
			<persName><forename type="first">H.-P</forename><surname>Kriegel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pfeifle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pötke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Seidl</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">BTW 2003-Datenbanksysteme für Business, Technologie und Web, Tagungsband der 10</title>
				<imprint>
			<publisher>Gesellschaft für Informatik eV</publisher>
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Object-oriented database management systems: concepts and issues</title>
		<author>
			<persName><forename type="first">E</forename><surname>Bertino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Martino</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="33" to="47" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Integration strategy and tool between formal ontology and graph database technology</title>
		<author>
			<persName><forename type="first">S</forename><surname>Ferilli</surname></persName>
		</author>
		<idno type="DOI">10.3390/electronics10212616</idno>
		<ptr target="https://www.mdpi.com/2079-9292/10/21/2616.doi:10.3390/electronics10212616" />
	</analytic>
	<monogr>
		<title level="j">Electronics</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A schema-first formalism for labeled property graph databases: Enabling structured data loading and analytics</title>
		<author>
			<persName><forename type="first">C</forename><surname>Sharma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sinha</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th ieee/acm international conference on big data computing, applications and technologies</title>
				<meeting>the 6th ieee/acm international conference on big data computing, applications and technologies</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="71" to="80" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A New Time-sensitive Model of Linguistic Knowledge for Graph Databases</title>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cassotti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ferilli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CEUR Workshop Proceedings</title>
				<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page">69</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Temporal random indexing: a tool for analysing word meaning variations in news</title>
		<author>
			<persName><forename type="first">P</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Caputo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Semeraro</surname></persName>
		</author>
		<idno>WS.org</idno>
		<ptr target="http://ceur-ws.org/Vol-1568/paper7.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (ECIR 2016)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">M</forename><surname>Martinez-Alvarez</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">U</forename><surname>Kruschwitz</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Kazai</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Hopfgartner</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><forename type="middle">P A</forename><surname>Corney</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Campos</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Albakour</surname></persName>
		</editor>
		<meeting>the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (ECIR 2016)</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">1568</biblScope>
			<biblScope unit="page" from="39" to="41" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Diachronic word embeddings reveal statistical laws of semantic change</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">L</forename><surname>Hamilton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Leskovec</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/p16-1141</idno>
		<ptr target="https://doi.org/10.18653/v1/p16-1141.doi:10.18653/v1/p16-1141" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016</title>
		<title level="s">Long Papers</title>
		<meeting>the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016</meeting>
		<imprint>
			<publisher>The Association for Computer Linguistics</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Training temporal word embeddings with a compass</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">D</forename><surname>Carlo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Bianchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Palmonari</surname></persName>
		</author>
		<idno type="DOI">10.1609/aaai.v33i01.33016326</idno>
		<ptr target="https://doi.org/10.1609/aaai.v33i01.33016326.doi:10.1609/aaai.v33i01.33016326" />
	</analytic>
	<monogr>
		<title level="m">The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence</title>
				<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="6326" to="6334" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Latin WordNet, una rete di conoscenza semantica per il latino e alcune ipotesi di utilizzo nel campo dell&apos;Information Retrieval, Strumenti digitali e collaborativi per</title>
		<author>
			<persName><forename type="first">S</forename><surname>Minozzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">le Scienze dell&apos;Antichita</title>
		<imprint>
			<biblScope unit="page" from="123" to="134" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Pinkster</surname></persName>
		</author>
		<title level="m">Sintassi e semantica latina</title>
				<imprint>
			<publisher>Rosenberg &amp; Sellier</publisher>
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Latin and the early modern world: linguistic identity and the polity from petrarch to the habsburg novelists</title>
		<author>
			<persName><forename type="first">J</forename><surname>Ramminger</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Vulgar Latin</title>
		<author>
			<persName><forename type="first">J</forename><surname>Herman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2000">2000</date>
			<publisher>Roger Wright</publisher>
		</imprint>
		<respStmt>
			<orgName>The Pennsylvania State University</orgName>
		</respStmt>
	</monogr>
	<note>Translated by</note>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">A companion to the latin language</title>
		<author>
			<persName><forename type="first">D</forename><surname>Butterfield</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Latin: Story of a World Language</title>
		<author>
			<persName><forename type="first">J</forename><surname>Leonhardt</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2013">2013</date>
			<publisher>The Belknap Press of Harvard University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Latin as the Language of Science and Learning</title>
		<author>
			<persName><forename type="first">P</forename><surname>Roelli</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>De Gruyter</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Langslow</surname></persName>
		</author>
		<title level="m">Bilingualism in ancient society</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Lemon: An ontology-lexicon model for the multilingual semantic web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Buitelaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Wunner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mccrae</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Montiel-Ponsoda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Aguado De Cea</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">WORDNET: a lexical database for english</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Miller</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/H92-1116/" />
	</analytic>
	<monogr>
		<title level="m">Speech and Natural Language: Proceedings of a Workshop Held at Harriman</title>
				<meeting><address><addrLine>New York, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="1992">February 23-26, 1992. 1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Tools for historical corpus research, and a corpus of Latin</title>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kilgarriff</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">New Methods in Historical Corpus Linguistics</title>
				<editor>
			<persName><forename type="first">P</forename><surname>Bennett</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Durrell</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Scheible</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Whitt</surname></persName>
		</editor>
		<meeting><address><addrLine>Narr, Tübingen</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="247" to="257" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Semeval-2020 task 1: Unsupervised lexical semantic change detection</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hengchen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Dubossarsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tahmasebi</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.semeval-1.1</idno>
		<ptr target="https://doi.org/10.18653/v1/2020.semeval-1.1.doi:10.18653/v1/2020.semeval-1.1" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fourteenth Workshop on Semantic Evaluation, SemEval@COLING 2020</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Herbelot</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">X</forename><surname>Zhu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Palmer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Schneider</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>May</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Shutova</surname></persName>
		</editor>
		<meeting>the Fourteenth Workshop on Semantic Evaluation, SemEval@COLING 2020<address><addrLine>Barcelona (online</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">December 12-13, 2020. 2020</date>
			<biblScope unit="page" from="1" to="23" />
		</imprint>
	</monogr>
	<note>International Committee for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Diachronic Usage Relatedness (DURel): A framework for the annotation of lexical semantic change</title>
		<author>
			<persName><forename type="first">D</forename><surname>Schlechtweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Schulte Im Walde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Eckmann</surname></persName>
		</author>
		<ptr target="https://www.aclweb.org/anthology/N18-2027/" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
				<meeting>the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>New Orleans, Louisiana</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="169" to="174" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">T</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Short</surname></persName>
		</author>
		<title level="m">A Latin Dictionary, Founded on Andrews&apos; edition of Freund&apos;s Latin dictionary revised, enlarged, and in great part rewritten</title>
				<editor>
			<persName><forename type="first">Charlton</forename><forename type="middle">T</forename><surname>Lewis</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Ph</forename><forename type="middle">D</forename></persName>
		</editor>
		<editor>
			<persName><forename type="first">Charles</forename><surname>Short</surname></persName>
		</editor>
		<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Clarendon Press</publisher>
			<date type="published" when="1879">1879</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">An Elementary Latin Dictionary</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">T</forename><surname>Lewis</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1890">1890</date>
			<publisher>American Book Company</publisher>
			<pubPlace>New York, Cincinnati, and Chicago</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Du Fresne Du Cange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A L</forename><surname>Henschel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Carpentier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Adelung</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Favre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">;</forename><surname>Favre</surname></persName>
		</author>
		<title level="m">Glossarium mediaeet infimaelatinitatis, L</title>
				<meeting><address><addrLine>Niort</addrLine></address></meeting>
		<imprint>
			<biblScope unit="page" from="1883" to="1887" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">A new corpus annotation framework for latin diachronic lexical semantics</title>
		<author>
			<persName><forename type="first">B</forename><surname>Mcgillivray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kondakova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Burman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Dell'oro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bermúdez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Sabel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Márquez</forename><surname>Marongiu</surname></persName>
		</author>
		<author>
			<persName><surname>Cruz</surname></persName>
		</author>
		<idno type="DOI">10.1515/joll-2022-2007</idno>
		<idno>doi:</idno>
		<ptr target="https://doi.org/10.1515/joll-2022-2007" />
	</analytic>
	<monogr>
		<title level="j">Journal of Latin Linguistics</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="47" to="105" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
