=Paper=
{{Paper
|id=Vol-1654/article-09
|storemode=property
|title=None
|pdfUrl=https://ceur-ws.org/Vol-1654/article-09.pdf
|volume=Vol-1654
|dblpUrl=https://dblp.org/rec/conf/semweb/ScannapiecoG16
}}
==None==
Classification Explorer: Navigational Querying of Statistical Classifications Taeke Gjaltema (UNECE) and Monica Scannapieco (Istat) (editors)i Abstract. In this paper, we describe an application that, starting from statistical classifications represented as Linked Data artefacts, allows for a rich user experience based on cross-classifications navigation. 1 Introduction Linked Data and Semantic Web standards are being progressively adopted by statistical organizations. Clas- sifications are an indispensable instrument to disseminate statistical data: without classifications no statistics. An integrated system of activity and product classifications allows the comparability of statistics produced in different statistical domains and countries. The High-Level Group for the Modernisation of Official Statistics (HLG-MOS)ii established a projectiii in 2016 with the use of Linked Data for modeling statistical data as one of its principal aims. Within this project we developed an application called the Classification Explorer. Starting from statistical classifications represented as Linked data artefacts, it allows for a rich user experience by enabling cross-classifi- cation navigation. Several of these classifications were made available for the SemStats challenge. This paper describes how the Classification Explorer application accesses and navigates these classifications. 2 Classification Explorer The Classification Explorer is an application that provides users with a visual experience to interact with the classifications that were made available by the SemStats challenge (http://semstats.org/2016/challenge/challenge- data). We first describe the main functionalities of the Classification Explorer (Section 2.1) and later the detail of the technological setting of the application (Section 2.1). 2.1 Main Functionalities The main functionalities of the Classification Explorer are detailed and shown below. Browsing classifications. For some of the uploaded classifications (so far 14), a browsing functionality is available that permits to: Presenting the overall list of available classifications (see Figure 1). Selecting one classification and displaying the details at all the levels (see Figure 2 for different levels’ details). Cross-navigating classifications. For some classifications, correspondence tables are available, thus permitting to pass from one classification to another. In particular, as shown in Figure 3, by selecting a specific activity of NACE iv classification, through the correspondence table with CPAv, it is it is possible to access all the products in CPA associated to the selected activity. The correspondence tables are defined at all levels of the classifications. Search among classifications. The Classification Explorer also has a search function. Given a code of a classification item or a text to search, the functionality returns all the matching classification items for all the classification that are in the database. The results are grouped by each classification they belong to (see Figure 4). Figure 1: Overall list of available classifications 2 Figure 2: First and second level detail of NACE Rev 2 Classification Export to CSV. In addition to the browsing and navigating functionalities illustrated above, the Classification Explorer also has an exporting functionality available. This functionality permits to export and download the displayed list of items in CSV format. 3 Figure 3: Cross-classification navigation through the correspondence table between CPA and NACE Figure 4: Searching results with searching text “horse” 4 2.2 Technological Setting The Classification Explorer is a single-page browser app that communicates with an RDF triple store. The client is a JavaScript application written using NodeJS (https://nodejs.org/en/), ReactJS (https://github.com/reactjs) for building user interfaces and Webpack (https://github.com/webpack/webpack), a tool to package, deploy and redeploy applications. The client is based on the React-Redux pattern (http://redux.js.org/) that manages the state of the application. As the application is single-page, the navigation is entirely done on the browser. React-router (https://github.com/reactjs/react-router) takes care of updating the URL so that the user can refresh, resume later or share the page she/he is on. It fetches the data from a remote server, detailed in the next section, using SPARQL 1.1 queries over HTTP. Data are then stored locally and so they are accessed only once. The server-side is an RDF triple store implemented with Stardog (http://stardog.com/). The data made availa- ble within the SemStats challenge have been all uploaded to the triple store. These are RDF data, modelled ac- cording to several vocabularies, the principal of which is XKOS. In particular, the classifications are organized in a single database with a named graph for each specific classification version. Figure 5 shows the result of a query for retrieving all the graphs. Figure 5: List of the named graph in Stardog database 3 Conclusions and Final Remarks The design and implementation of the Classification Explorer was driven by the idea that it is useful to have an integrated access to the various official classifications. Moreover, by exploiting the available correspondence tables, the Classification Explorer has the important advantage of enriching the user experience when accessing classifications by allowing a cross-navigation among them. The application code is available at: https://github.com/UNECE/Classification-Explorer. 5 When implementing the Classification Explorer, there were some issues related to the lack of harmonized design choices for RDF triples generation. For instance, in the European classifications (NACE and CPA, two versions of each), we had to fix the fact that they used xkos:hasLevels instead of xkos:levels. The Classification Explorer can be enriched in several ways, namely: The layout of the HTML pages can be improved. Indeed, so far only a basic layout is adopted. The navigation features can be improved by using URIs for concepts representing items in the corre- spondence tables instead of strings that prevent from a full navigation. Enriching the export and download functionality by having bulk downloads in different formats of the available classifications. i The application and the paper were a joint product of the participants of the Sprint Workshop of the UNECE HLG Linked Statistical Metadata project: Raffaella Aracri, Mauro Bruno, Franck Cotton, Eric Debonnel, Taeke Gjaltema, Dennis Grofils, Hans van Hoof, Olivier Levitt, Enrico Orsini, Andrea Pagano, Jean-Baptiste Rudant, Monica Scannapieco, Romain Tailhurat, Laura Tosco and Luca Valentino. ii High-Level Group for the Modernisation of Official Statistics (HLG-MOS) was established in 2010 to oversee and co- ordinate international work relating to statistical modernisation. It promotes standards-based modernisation of official statis- tics. The aim is to improve the efficiency of statistical production, and the ability to produce outputs that better meet user needs. One of the ways to collaborate and to make progress is through HLG projects in specific areas. iii Implementing ModernStats Standards Project: http://www1.unece.org/stat/platform/display/hlgbas/Implement- ing+Modernstats+Standards iv The Statistical classification of economic activities in the European Community, abbreviated as NACE, is the classifi- cation of economic activities in the European Union (EU); the term NACE is derived from the French Nomenclature statis- tique des activités économiques dans la Communauté européenneNomenclature statistique des activités économiques dans la Communauté européenne v The Statistical classification of products by activity, abbreviated as CPA, is the classification of products (goods as well as services) at the level of the European Union (EU). 6