<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">The OBDA-based &quot;Observatory of Research and Innovation&quot; of the Tuscany Region</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Alessandro</forename><surname>Mosca</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">SIRIS Lab</orgName>
								<orgName type="institution">Research division of SIRIS Academic</orgName>
								<address>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Bernardo</forename><surname>Rondelli</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">SIRIS Lab</orgName>
								<orgName type="institution">Research division of SIRIS Academic</orgName>
								<address>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Guillem</forename><surname>Rull</surname></persName>
							<affiliation key="aff0">
								<orgName type="laboratory">SIRIS Lab</orgName>
								<orgName type="institution">Research division of SIRIS Academic</orgName>
								<address>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">The OBDA-based &quot;Observatory of Research and Innovation&quot; of the Tuscany Region</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">D7074CC255D82409E8D94A83E217E9F4</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T08:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>OBDA</term>
					<term>Higher Education &amp; Research</term>
					<term>Data-driven policies</term>
					<term>Interactive Data Visualisation</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The portal of Tuscany's Observatory of Research and Innovation is an instrument for promoting more transparent and inclusive governance in the region. We present its interactive dashboard and underlying SPARQL endpoint, powered by SIRIS Academic's UNiCS platform, which integrates Open Data on the Higher Education &amp; Research field following the Ontology-Based Data Access approach.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In line with the orientation adopted by the EU in its Europe 2020 strategy, the Italian region of Tuscany has defined a set of policies aimed at supporting the Higher Education and Research (HE&amp;R) system and promoting innovation in the Tuscan territory. Tuscany has established the Regional Research and Innovation Observatory as a tool to support the implementation of the Regional Development Program (PRS 2016/20<ref type="foot" target="#foot_0">1</ref>). The PRS is the cornerstone of regional policies: "[A] tool that expresses a vision for the future of Tuscany and proposes constructive dialogue with the actors of the territory" (E. Rossi, Region's President, PRS introductory speech). The Observatory aims to communicate and enhance the strengths of the research system, and to host information on research, innovation and higher education.</p><p>Within the Observatory, and in support of the PRS, the Tuscany Region has decided to set up an information dashboard capable of integrating HE&amp;R data, keeping them up to date, and supporting policy makers in designing their policies. This paper presents the Observatory's interactive dashboard, currently located at toscanaopenresearch.it, and the underlying SPARQL endpoint, which is powered by SIRIS Academic's UNiCS platform <ref type="bibr" target="#b3">[5]</ref>, a system that integrates Open Data on the HE&amp;R field and makes them accessible to users through a unified domain ontology, following the so-called Ontology-Based Data Access (OBDA) approach <ref type="bibr" target="#b4">[6,</ref><ref type="bibr" target="#b2">3]</ref>. The paper is structured as follows: Section 2 introduces the underlying platform that provides data to the Observatory, and Section 3 describes the Observatory's interactive dashboard.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The UNiCS Platform</head><p>University Analytics (UNiCS) integrates open data repositories about HE&amp;R in Europe and makes them available via a dedicated SPARQL endpoint <ref type="bibr" target="#b7">[9,</ref><ref type="bibr" target="#b6">8]</ref>. Queries are posed in terms of a domain ontology that provides a homogeneous view of the otherwise disparate integrated datasets. While the data are originally stored in relational databases (DBs), UNiCS users see them as RDF data, the standard data model in Linked Data<ref type="foot" target="#foot_1">3</ref>. This is made possible by -ontop- <ref type="bibr" target="#b0">[1]</ref>, an OBDA system that allows relational DBs to be queried as virtual RDF graphs using SPARQL. Given the domain ontology and an R2RML <ref type="bibr" target="#b5">[7]</ref> mapping description that connects the ontology with the underlying DBs, -ontop- translates the users' SPARQL queries into SQL queries, which are then run on the federated DB.</p><p>In the context of Tuscany's Observatory, the core architecture of UNiCS includes a relational DB into which the different Italian and European open data repositories have been integrated, following the Data Exchange <ref type="bibr" target="#b1">[2]</ref> approach. These datasets have been copied into a single relational DB because they are not available as proper, queryable DBs but only as downloadable CSV files, so they must be moved into more suitable storage. The datasets include official Italian student and researcher data<ref type="foot" target="#foot_2">4</ref> coming from the MIUR (Ministero dell'Istruzione, dell'Università e della Ricerca), and European data on FP7 and H2020 research projects <ref type="bibr">[4]</ref>. An extension to this architecture is planned to incorporate internal data managed by the Tuscany Region, which will be federated with the UNiCS DB and mapped into the UNiCS domain ontology. The architecture is depicted in Figure <ref type="figure" target="#fig_0">1</ref>. (The ability to integrate not just relational sources but also datasets that sit behind SPARQL endpoints is an -ontop- extension we are currently working on in collaboration with the KRDB Research Centre<ref type="foot" target="#foot_3">5</ref>.)</p></div>
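<div xmlns="http://www.tei-c.org/ns/1.0"><p>The two steps described in this section, copying downloadable CSV files into a relational DB and exposing the rows as RDF through a mapping, can be illustrated with a minimal Python sketch. It is a toy illustration only: the table, the column names, the sample rows and the http://example.org/unics/ namespace are all hypothetical, and the sketch materialises triples directly, whereas -ontop- keeps the RDF graph virtual and rewrites SPARQL queries into SQL.</p>
<p><![CDATA[
```python
import csv
import io
import sqlite3

# Hypothetical CSV extract; column names and rows are invented for illustration.
CSV_TEXT = """id,name,city
u1,Example University A,City A
u2,Example University B,City B
"""

# Hypothetical namespace; this is NOT the actual UNiCS ontology.
BASE = "http://example.org/unics/"


def load_csv(conn, text):
    """Copy CSV rows into a relational table (the data-exchange step)."""
    conn.execute(
        "CREATE TABLE university (id TEXT PRIMARY KEY, name TEXT, city TEXT)"
    )
    rows = [
        (r["id"], r["name"], r["city"])
        for r in csv.DictReader(io.StringIO(text))
    ]
    conn.executemany("INSERT INTO university VALUES (?, ?, ?)", rows)
    return len(rows)


def rows_as_triples(conn):
    """Expose each row as RDF-style triples, in the spirit of an R2RML mapping."""
    triples = []
    query = "SELECT id, name, city FROM university ORDER BY id"
    for uid, name, city in conn.execute(query):
        subject = f"{BASE}university/{uid}"
        triples.append((subject, "rdf:type", f"{BASE}University"))
        triples.append((subject, f"{BASE}name", name))
        triples.append((subject, f"{BASE}locatedIn", city))
    return triples


conn = sqlite3.connect(":memory:")
loaded = load_csv(conn, CSV_TEXT)
triples = rows_as_triples(conn)
```
]]></p>
<p>Each relational row yields one typing triple and one triple per mapped column, which is roughly what a user of the SPARQL endpoint would see for that row under such a mapping.</p></div>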
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Data visualisation</head><p>Currently, the Observatory consists mainly of an interactive dashboard hosting data visualisations (co-designed with the relevant stakeholders) that are fed by the underlying UNiCS SPARQL endpoint. The visualisations are generated in real time by JavaScript code running in the user's browser, which queries the endpoint for the necessary data. Visualisations are interactive: the user can click on different components to drill down into the results being displayed, as well as apply filters to focus on a particular data subset. Once the user selects a specific item, a pop-up window displays additional information that the visual representation of the data does not itself provide. As an example, Figure <ref type="figure" target="#fig_1">2</ref> shows the distribution of graduates per Italian bachelor faculty, with the different colours representing the overall number of years they spent at university. When the user hovers over the graphic for a given faculty, a window appears with a summarised view of the underlying data, showing numbers that are not visible in the original visualisation. Users can either download the data behind each visualisation or copy and paste the queries that generate those data (see Figure <ref type="figure" target="#fig_2">3</ref>) and execute them, possibly modified according to new specific needs.</p><p>The dashboard is currently divided into four main sections: Teachers &amp; researchers (showing distributions per gender, age, and disciplinary sector), Teaching (on student data, with provenance and success rates per bachelor and master degree), Research at universities (on EU-funded projects and relative/absolute budgets, targeted at public organisations), and Research at private companies (on EU- and regionally funded projects, targeted at private organisations).</p><p>The Observatory's portal also includes a dedicated SPARQL endpoint and the LODE-powered documentation of the corresponding domain ontology<ref type="foot" target="#foot_4">6</ref>. The endpoint includes a library of pre-defined queries that either refer to the dashboard visualisations or have been specified in collaboration with the managers of the Region to satisfy specific needs and strategic demands. Users who are not familiar with SPARQL can thus benefit from the library by modifying existing queries and executing them. Policy makers can use the portal's visualisations and the SPARQL endpoint to gain a better understanding of the current situation, to monitor the effectiveness of recent policies, and to design new policies based on evidence rather than intuition.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. UNiCS platform architecture tailored for the Tuscany's Observatory of Research and Innovation</figDesc><graphic coords="2,333.21,269.50,81.43,81.43" type="bitmap" /></figure>
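<div xmlns="http://www.tei-c.org/ns/1.0"><p>How a client might interrogate a SPARQL endpoint such as the one behind the dashboard can be sketched as follows. This is a hedged illustration in Python rather than the portal's own JavaScript code: the endpoint URL, the ex: vocabulary, the query and the sample response are all hypothetical. The sketch relies only on the SPARQL 1.1 Protocol convention of sending the query text in a "query" parameter and on the standard SPARQL JSON results layout, and it performs no network access.</p>
<p><![CDATA[
```python
import json
from urllib.parse import urlencode

# Placeholder endpoint; not the Observatory's actual address.
ENDPOINT = "http://example.org/sparql"

# Hypothetical query over an invented vocabulary.
QUERY = """
PREFIX ex: <http://example.org/unics/>
SELECT ?faculty (COUNT(?student) AS ?graduates)
WHERE { ?student ex:graduatedFrom ?faculty }
GROUP BY ?faculty
"""


def build_query_url(endpoint, query):
    """Build a GET URL carrying the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


def parse_bindings(results_json):
    """Flatten a SPARQL JSON results document into dicts of variable -> value."""
    doc = json.loads(results_json)
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in doc["results"]["bindings"]
    ]


# Invented response in the SPARQL JSON results format, used instead of a live call.
SAMPLE_RESPONSE = json.dumps({
    "head": {"vars": ["faculty", "graduates"]},
    "results": {"bindings": [
        {
            "faculty": {"type": "uri",
                        "value": "http://example.org/unics/faculty/f1"},
            "graduates": {"type": "literal", "value": "120"},
        }
    ]},
})

url = build_query_url(ENDPOINT, QUERY)
rows = parse_bindings(SAMPLE_RESPONSE)
```
]]></p>
<p>In a real client, the URL would be fetched with an Accept header requesting application/sparql-results+json, and the flattened rows would then be handed to the visualisation code or written out as CSV.</p></div>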
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 .</head><label>2</label><figDesc>Figure 2. Interactive visualisation that shows a pop-up window with additional information as the user mouses over.</figDesc><graphic coords="3,174.90,150.96,245.49,153.11" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 .</head><label>3</label><figDesc>Figure 3. The dashboard allows users to see the SPARQL queries behind each visualisation, and also gives them the option to download the combined result of the queries in CSV format.</figDesc><graphic coords="4,122.30,150.96,350.70,217.59" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://goo.gl/6VY4Co</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">https://www.w3.org/DesignIssues/LinkedData.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">Ministero dell'Istruzione, dell'Università e della Ricerca: (i) Anagrafe nazionale studenti: anagrafe.miur.it; (ii) Cerca università: cercauniversita.cineca.it</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">http://www.inf.unibz.it/krdb/, Free University of Bozen-Bolzano, Italy.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_4">http://34.250.237.252/toscana/sparql/docs/index.html</note>
		</body>
		<back>


			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Ontop: Answering SPARQL queries over relational databases</title>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Cogrel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Komla-Ebri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rezk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Rodriguez-Muro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Xiao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="471" to="487" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Data exchange: semantics and query answering</title>
		<author>
			<persName><forename type="first">R</forename><surname>Fagin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">G</forename><surname>Kolaitis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Miller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Popa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Theor. Comput. Sci</title>
		<imprint>
			<biblScope unit="volume">336</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="89" to="124" />
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The combined approach to query answering in DL-Lite</title>
		<author>
			<persName><forename type="first">R</forename><surname>Kontchakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Lutz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Toman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wolter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zakharyaschev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">KR&apos;10</title>
				<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="247" to="257" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="http://university-analytics.com/" />
		<title level="m">SIRIS Academic</title>
				<imprint>
			<date type="published" when="2017-09-12">12 Sept 2017</date>
		</imprint>
		<respStmt>
			<orgName>UNiCS</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Poggi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lembo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>De Giacomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lenzerini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rosati</surname></persName>
		</author>
		<title level="m">Linking Data to Ontologies</title>
				<meeting><address><addrLine>Berlin, Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="133" to="173" />
		</imprint>
	</monogr>
	<note>Journal on data semantics</note>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="https://www.w3.org/TR/r2rml/" />
		<title level="m">W3C: R2RML: RDB to RDF Mapping Language</title>
				<imprint>
			<date type="published" when="2017-09-12">12 Sept 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<ptr target="https://www.w3.org/TR/sparql11-protocol/" />
		<title level="m">SPARQL 1.1 Protocol</title>
				<imprint>
			<date type="published" when="2017-09-12">12 Sept 2017</date>
		</imprint>
	</monogr>
	<note>W3C</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<ptr target="https://www.w3.org/TR/sparql11-query/" />
		<title level="m">SPARQL 1.1 Query Language</title>
				<imprint>
			<date type="published" when="2017-09-12">12 Sept 2017</date>
		</imprint>
	</monogr>
	<note>W3C</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
