<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Leveraging Semantic Web Technologies for Analysis of Crime in Social Science</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Luca Pulina</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonietta Mazzette</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Pandolfo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elena Piga</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Laura Ruiu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Camillo Tidore</string-name>
          <email>tidoreg@uniss.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DIBRIS, Universita di Genova</institution>
          ,
          <addr-line>Via Opera Pia, 13</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>POLCOMING, Universita di Sassari</institution>
          ,
          <addr-line>Viale Mancini n. 5</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we present the conceptual level of an ontologybased application aimed to support social scientists in their sociological analysis related on crime. Starting from several concrete issues posed by the research team of the Social Observatory on Crime of the University of Sassari, our goal is to build a Semantic Web based tool to collect, organize, and analyze data on crime, as well as for the exploitation of research results by institutions and civil society.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Generally speaking, crime analysis is the activity aimed at nding trends in
crimes in order to devise both policies and solutions to crime-related issues. From
a sociological point of view, the analysis of crime is addressed to understand how
criminal phenomena might in uence social assets in the context of urban and
rural areas. Moreover, it aims to identify prevention measures and their potential
e ects on social con gurations. In order to do that, data analysis plays a role
of paramount importance. Nowadays, information on urban crime is pro tably
used by a growing number of local governments to identify major risks and make
decisions about the safety of their community.</p>
      <p>
        Computer-assisted qualitative data analysis software (CAQDAS) were
successfully adopted by the sociological research community in order to both
organize and analyze such kind of data { see, e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. A number of bene ts of
CAQDAS have been recognized by sociological literature. On the other hand,
several issues are still open in relation to theoretical, methodological issues {
see, e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] { and practical aspects, such as heterogeneity of data sources, data
integration, and the exploitation of implicit knowledge related to the collected
data. It is well-established that the usage of Semantic Web (SW) technologies
can provide a valuable support in order to overcome the practical limits listed
above. Moreover, the usage of such technologies in crime and public safety elds
is not new { see, e.g., [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this paper we present the conceptual level of an ontology-based
application aimed at supporting social scientists in their sociological analysis related to
crime. It represents the rst step towards the development of a tool aimed at
organize and manage both quantitative and qualitative data related to this
application domain. In particular, the need of such a tool has emerged from several
concrete issues posed by the research team of the Social Observatory on Crime
(OSC)3 of the University of Sassari. On the one hand, the creation of a SW-based
crime information platform might allow stakeholders { institutions and civil
society { to easily access to data on crime, for instance by mapping crimes and
crime-related issues, and identifying where and how they are occurring, where
they are concentrated and why. On the other hand, it can facilitate researchers'
work in organizing and managing data collected from di erent sources, such as
statistical data and qualitative information from newspapers.</p>
      <p>The remainder of the paper is organized as follows. In Section 2 we describe
the OSC, while in Section 3 we report the whole process of design and
implementation of the presented ontology. We conclude the paper in Section 4 with
some nal remarks and discussing future works.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The Social Observatory on Crime</title>
      <p>The OSC originated in 2012 thanks to an interdisciplinary team (in particular
social scientists such as sociologists, psychologists, economists, jurists) which
involves researchers from the University of Sassari. It originated from the Urban
Study Center (CSU)4, that focuses on urban and territory evolution and
transformations; coordinates empirical study and promotes the culture of legality
by involving students in actively doing research. The CSU contributes to
promoting the adoption of governance approaches by involving private and public
bodies as both producers and bene ciaries of research outcomes. It also aims to
disseminate activities' results through seminars, conferences, educative courses
and scienti c publications. Since 2004, the CSU has started to focus on crime
and insecurity in order to observe their impacts on the Sardinian social context
and territory. The analysis and monitoring activities are based on data collected
from documents provided by justice o cers, newspapers and national statistical
reports (e.g., reports of the Italian National Institute of Statistics, ISTAT).</p>
      <p>
        The OSC was built aimed at promoting and generating governance
approaches by trying to involve di erent kinds of stakeholder, such as private
and public bodies. In fact, the inclusion of these actors was supposed to be
relevant both to collect basic information and data, and to de ne concerted
strategies for reducing criminal behaviors and attitudes. For a long time, both
literature and policies have focused on situational crime prevention strategies by
creating \defensible spaces", and less focused on the contrast of the motivations
that encourage deviance. During the last twenty years governance approaches
to crime have been promoted, through collaboration among a multiplicity of
actors (also \external" to the control/protection functions). Following this
approach, the CSU concentrated its e orts on identifying processes for increasing
3 Osservatorio Sociale sulla Criminalita, http://polcoming.uniss.it/node/1133
4 Centro Studi Urbani, http://www.centrostudiurbani.it
degree of key-stakeholders' participation and networking rather than de ning
speci c \interventions" and \architectonic fences". Recently [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], OSC identi ed
social indicators in order to create an \Informative System for data collection
and analysis". This tool was thought to support policy planning and
decisionmaking oriented at ghting and reducing criminal and illegal activities.
      </p>
      <p>The main goal of OSC is to develop an exhaustive database which includes
quantitative data (primary and secondary data from di erent sources, e.g.,
ISTAT and regional prosecutors) and qualitative information obtained, e.g., by
analyzing local newspapers (in particular La Nuova Sardegna5 and L'Unione
Sarda6). The newspaper consultation has allowed the collection a number of
detailed information (e.g., description of places where murders happened,
description of authors way of life and their past experiences, connections with other
types of crime, etc.) that otherwise would have been di cult to gain and record.
However, the consultation refers to those relevant crimes which get newspapers
attention such as murders, robberies, attacks, threats, and cannabis cultivation.</p>
      <p>A further objective of OSC is to analyze connections between widespread
insecurity and crime. In fact, individual and collective behaviors, decision-making
and economic activities are often strongly related to the types and the intensity
of these connections. Moreover, criminal phenomena should be analyzed in
relation to the process of modernization (and its consequences) that has involved
the targeted territories by shaping social con guration of urban and rural areas.
3</p>
    </sec>
    <sec id="sec-3">
      <title>The OCRA Ontology</title>
      <p>ocra aims at being the conceptual layer of a Semantic Web based tool focused
on the improvement of the processes related to the collection, organization,
management, and analysis of data on criminal phenomena in Sardinia by OSC. In
the following, we describe design and implementation of ocra (Ontology for
CRime Analysis). We can summarize as follows the main steps of this process:</p>
      <sec id="sec-3-1">
        <title>1. De nition of the domain. 2. Identi cation of the key concepts of the domain to be described. 3. Identi cation of the proper language and Tbox implementation. 4. Ontology population, i.e., lling the Abox with known facts.</title>
        <p>Firstly, we reviewed the semi-structured dataset collected by OSC during a 11
years-long research on criminality in Sardinia. Data on criminal phenomena were
collected through speci c forms to be lled with information obtained, e.g., by
local newspapers such as La Nuova Sardegna and L'Unione Sarda. The criminal
phenomena recorded are murder, attacks, robbery and cultivation of cannabis.
Data collection forms were mainly composed of the following information:
{ Data concerning the newspaper, e.g., name of the newspaper, date, and title
of the related article.
5 http://lanuovasardegna.it
6 http://unionesarda.it
{ Data concerning the crime, e.g., type, place, date, and motive.
{ Data concerning the authors of the crime and victims, e.g., name, job, age,
and records of criminal o enses.</p>
        <p>Regarding the second point, we analyzed collected data in order to
highlight common terminology, redundancies, and relationships between di erent
elements, as suggested by the domain experts of the OSC. The results of this
process enabled us to compute a taxonomy { depicted in Figure 1 { related to
di erent crimes.</p>
        <p>
          Considering the third point, we proceeded with the choice of the modeling
language analyzing the di erent alternatives o ered by OWL 2. To retain most
of the practical advantages of OWL 2, but to improve on its applicability, in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
has been introduced OWL 2 pro les, i.e., a sub-language of OWL 2 featuring
limitations on the available language constructs and their usage.
        </p>
        <p>Considering the available pro les, we excluded OWL 2 EL because it does not
support inverse object properties, while we discarded both OWL 2 QL and OWL
2 RL because they do not support, e.g., existential quanti cation to individuals.
Thus, ocra has been developed in OWL 2 DL, and its Tbox is composed of
81 classes, 36 object properties, and 53 data properties. In the following, we
describe main classes of the ocra ontology7:
ArticoloGiornale (NewspaperArticle) represents the class containing the
newspapers information about the speci c crime. Every instance of this class has
data properties such as Titolo (Title) and DataArticolo (ArticleDate).
Luogo (Place) includes the place where the crime has occurred. Individuals of
Luogo are also the places in which the victims and o enders were born or
live.
7 The full documentation is available at http://visionlab.uniss.it/OCRA.</p>
        <p>Movente (Motive) aims to model the motive of the crime. It has di erent
subclasses, each of which is a speci c motive, such as economical, political,
revenge, etc.</p>
        <p>PersonaFisica (Person) models people related to a speci c crime. It has two
sub-classes, namely Vittima (Victim) and Autore (O ender). Every
individual belonging to those classes has data properties such as Eta (Age), Sesso
(Gender), StatoCivile (MaritalStatus), Precedenti
(RecordsOfCriminalOffenses).</p>
        <p>Reato (Crime) is one of the central classes of ocra. It has two sub-classes
related to the main types of o enses taken into account: crimes involving
people or things and crimes related to drug { see below. Reato has di erent data
properties such as DataReato(CrimeDate), NumeroVittime(NumberOfVictims),
NumeroAutori (NumberOfO enders).</p>
        <p>ReatoAPersoneECose (CrimeToPersonsAndThings) In this class are included
individuals related to crimes that caused material damage or injure
people. ReatoAPersoneECose has two sub-classes: Omicidio(Murder), which
includes crimes with homicide, and NonOmicidio(NotMurder). The latter
covers a large series of crimes which have not led to murder. NonOmicidio
has three sub-classes, each one modeling di erent category o enses, namely
Attentato(Threat), Minaccia(Attack), Rapina(Robbery).</p>
        <p>ReatoCollegatoAllaDroga (CrimeRelatedToDrug) is the other principal
subclass of Reato and is related to all the drug o ences. In particular, we
modeled the following two sub-classes of drug o ences: Coltivazione(Plantation)
and Detenzione(Possession). Some of the most relevant data properties of
ReatoCollegatoAllaDroga are connected to the type and the number of
drugs con scated by the authorities, such as Semi (Seeds) and Piante(Plants).
Strumenti (Weapons) represents the class containing the instruments used by
an o ender to commit the crime. It has various sub-classes, such as
ArmiDaFuoco (FireArms), Esplosivi (Explosives) and Veicoli (Vehicles).</p>
        <p>Concerning object properties, we brie y describe the ones related to
Omicidio, because they enable domain experts to involve in their analysis important
data regarding places in which the crime has occurred, o enders, and victims.
Noticeable object properties are:
{ commessoDa(CommittedBy): connects Omicidio to Autore.
{ haCoinvolto(hasInvolvedIn): relationship between Omicidio and Vittima.
{ avvenutoA(takesPlaceIn): allows the identi cation of murder's place (Luogo).
{ commessoCon (hasWeapon): relationship with the murder weapon.
{ haMovente (hasMotive): it connects the o ense with the motive (Movente).</p>
      </sec>
      <sec id="sec-3-2">
        <title>In Figure 2 we show a graphical example of these relationships. Finally, the ocra Abox has been populated using data provided by the OSC. Actually, the Abox contains more than 15000 individuals, with their related properties, while the whole ontology is composed of about 365000 triples.</title>
        <p>4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In this paper we described design and development of the ocra ontology, the
conceptual level of the ontology-based application aimed to support people of
OSC in their sociological analysis related to crime.</p>
      <p>Currently, we are developing a data integration layer in order to exploit
information coming from relevant external sources, e.g., open data provided by
ISTAT and DBpedia. We are also designing a Graphical User Interface to
support the ontology population stage, in order to make this process of knowledge
acquisition more interactive and dynamic. More, concerning the ontology
population, we are studying automated solutions for data collection and insertion.</p>
      <p>Finally, we are planning to perform more detailed experimental analysis on
the ocra ontologies. Some preliminary experiments have shown us that ocra
could be a challenging benchmark for OWL 2 DL reasoners.</p>
      <p>Acknowledgments The authors wish to thank the anonymous reviewers for their
valuable suggestions, which were helpful in improving the nal version of the
paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mangabeira</surname>
          </string-name>
          , W.C.
          <article-title>: Caqdas and its di usion across four countries: National specicities and common themes</article-title>
          .
          <source>Current Sociology</source>
          <volume>44</volume>
          (
          <issue>3</issue>
          ) (
          <year>1996</year>
          )
          <volume>191</volume>
          {
          <fpage>205</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>John</surname>
          </string-name>
          , W.S., Johnson, P.:
          <article-title>The pros and cons of data analysis software for qualitative research</article-title>
          .
          <source>Journal of Nursing Scholarship</source>
          <volume>32</volume>
          (
          <issue>4</issue>
          ) (
          <year>2000</year>
          )
          <volume>393</volume>
          {
          <fpage>397</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Asaro</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Biasiotti</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guidotti</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sagri</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tiscornia</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>A domain ontology: Italian crime ontology</article-title>
          .
          <source>In: Proceedings of the ICAIL 2003 Workshop on Legal Ontologies &amp; Web based legal information management.</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mazzette</surname>
          </string-name>
          , A., ed.:
          <article-title>La criminalita in Sardegna, Quarto rapporto di ricerca</article-title>
          . EDES,
          <string-name>
            <surname>Sassari</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bock</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fokoue</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haase</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoekstra</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          , et al.:
          <article-title>OWL 2 Web Ontology Language: Structural Speci cation and Functional-Style Syntax</article-title>
          .
          <source>W3C Recommendation</source>
          <volume>27</volume>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>